TL;DR
This paper introduces a modular Empirical Bayes approach combined with factor analysis to improve false discovery rate estimation in large-scale genomics, effectively handling unwanted variation and sparse effects.
Contribution
It presents a novel, integrated method that enhances EB techniques with factor analysis, providing better power and calibration in genomics data analysis.
Findings
Significant power gains over existing methods in simulations
Improved calibration of false discovery rates
Real data analysis shows varied results across methods
Abstract
We combine two important ideas in the analysis of large-scale genomics experiments (e.g. experiments that aim to identify genes that are differentially expressed between two conditions). The first is use of Empirical Bayes (EB) methods to handle the large number of potentially-sparse effects, and estimate false discovery rates and related quantities. The second is use of factor analysis methods to deal with sources of unwanted variation such as batch effects and unmeasured confounders. We describe a simple modular fitting procedure that combines key ideas from both these lines of research. This yields new, powerful EB methods for analyzing genomics experiments that account for both sparse effects and unwanted variation. In realistic simulations, these new methods provide significant gains in power and calibration over competing methods. In real data analysis we find that different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
