Publication
Improving Hierarchical Models Using Historical Data with Applications in High-Throughput Genomics Data Analysis
Downloadable Content
- Persistent URL
- Last modified
- 05/21/2025
- Type of Material
- Authors
-
-
Ben Li, Emory UniversityYunxiao Li, Emory UniversityZhaohui Qin, Emory University
- Language
- English
- Date
- 2017-06-01
- Publisher
- Springer Verlag (Germany)
- Publication Version
- Copyright Statement
- © 2016, International Chinese Statistical Association.
- Final Published Version (URL)
- Title of Journal or Parent Work
- ISSN
- 1867-1764
- Volume
- 9
- Issue
- 1
- Start Page
- 73
- End Page
- 90
- Grant/Funding Information
- This work was partially supported by National Institutes of Health grant P01GM085354.
- Abstract
- Modern high-throughput biotechnologies such as microarray and next-generation sequencing produce a massive amount of information for each sample assayed. However, in a typical high-throughput experiment, only limited amount of data are observed for each individual feature, thus the classical “large p, small n” problem. Bayesian hierarchical model, capable of borrowing strength across features within the same dataset, has been recognized as an effective tool in analyzing such data. However, the shrinkage effect, the most prominent feature of hierarchical features, can lead to undesirable over-correction for some features. In this work, we discuss possible causes of the over-correction problem and propose several alternative solutions. Our strategy is rooted in the fact that in the Big Data era, large amount of historical data are available which should be taken advantage of. Our strategy presents a new framework to enhance the Bayesian hierarchical model. Through simulation and real data analysis, we demonstrated superior performance of the proposed strategy. Our new strategy also enables borrowing information across different platforms which could be extremely useful with emergence of new technologies and accumulation of data from different platforms in the Big Data era. Our method has been implemented in R package “adaptiveHM,” which is freely available from https://github.com/benliemory/adaptiveHM.
- Author Notes
- Keywords
- Research Categories
- Engineering, Biomedical
- Biology, Biostatistics
Tools
- Download Item
- Contact Us
-
Citation Management Tools
Relations
- In Collection:
Items
| Thumbnail | Title | File Description | Date Uploaded | Visibility | Actions |
|---|---|---|---|---|---|
|
|
Publication File - sqd41.pdf | Primary Content | 2025-03-15 | Public | Download |