Building A Theoretical Foundation for Combining Negative Controls and Replicates
Jiming Jiang, Johann A. Gagnon-Bartsch, Terence P. Speed

TL;DR
This paper develops a theoretical framework explaining how negative controls and replicates can be combined to effectively remove unwanted variation in gene expression studies, supported by empirical and simulation results.
Contribution
It introduces asymptotic theory for combining negative controls and replicates to estimate and remove batch effects in high-throughput assays.
Findings
The asymptotic theory explains why combining negative controls with replicates is effective at estimating and removing unwanted variation.
Empirical results validate the theoretical predictions.
Pseudo-replicates can be used when actual replicates are unavailable.
Abstract
Studies using assays to quantify the expression of thousands of genes on tens to thousands of cell samples have been carried out for over 20 years. Such assays are based on microarrays, DNA sequencing or other molecular technologies. All such studies involve unwanted variation, often called batch effects, associated with the cell samples and the assay process. Removing this unwanted variation is essential before the measurements can be used to address the questions that motivated the studies. Combining the results of replicate assays with measurements on negative control genes to estimate the unwanted variation and remove it has proved to be effective at this task. The main goal of this paper is to present asymptotic theory that explains this effectiveness. The approach can be widened by using pseudo-replicate sets of pseudo-samples, for use with studies having no replicate assays.…
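The abstract describes the estimator only in words, so the following is a minimal sketch of one way replicates and negative control genes can be combined, written in the style of the RUV-III procedure. It is an assumption that this matches the estimator analysed in the paper. The function name, the linear model Y = MB + Wα + noise, and all simulation settings are hypothetical and chosen only for illustration.

```python
import numpy as np

def remove_unwanted_variation(Y, M, ctl, k):
    """Hypothetical RUV-III-style sketch.

    Y   : (m, n) array of log-expression, samples x genes
    M   : (m, p) 0/1 matrix, M[i, j] = 1 if sample i is in replicate set j
    ctl : (n,) boolean mask marking negative control genes
    k   : number of unwanted factors to estimate (k <= m - p)
    """
    m = Y.shape[0]
    Yc = Y - Y.mean(axis=0)                      # centre each gene

    # Residual operator: projects out everything shared within a replicate set,
    # so only differences between replicates (unwanted variation) remain.
    R = np.eye(m) - M @ np.linalg.pinv(M)
    Y0 = R @ Yc

    # Leading left singular vectors of the replicate residuals give a basis
    # for the unwanted factors; projecting Yc onto it estimates alpha.
    U, _, _ = np.linalg.svd(Y0, full_matrices=False)
    alpha_hat = U[:, :k].T @ Yc                  # (k, n)

    # Negative controls carry no biology of interest, so regressing them on
    # their columns of alpha_hat estimates the per-sample factors W.
    ac = alpha_hat[:, ctl]                       # (k, n_ctl)
    W_hat = Yc[:, ctl] @ ac.T @ np.linalg.inv(ac @ ac.T)   # (m, k)

    return Yc - W_hat @ alpha_hat, W_hat, alpha_hat

# Small simulation with made-up settings.
rng = np.random.default_rng(0)
p, reps, n, n_ctl, k = 10, 2, 200, 50, 2
m = p * reps
M = np.kron(np.eye(p), np.ones((reps, 1)))       # each set has 2 replicates
B = rng.normal(size=(p, n))                      # biology per replicate set
B[:, :n_ctl] = 0.0                               # controls have no biology
W = rng.normal(size=(m, k))                      # unwanted factors
alpha = rng.normal(size=(k, n))                  # gene-level loadings
Y = M @ B + W @ alpha + 0.1 * rng.normal(size=(m, n))
ctl = np.zeros(n, dtype=bool)
ctl[:n_ctl] = True

Y_adj, W_hat, alpha_hat = remove_unwanted_variation(Y, M, ctl, k)

truth = M @ B - (M @ B).mean(axis=0)             # centred biological signal
err_before = np.linalg.norm((Y - Y.mean(axis=0)) - truth)
err_after = np.linalg.norm(Y_adj - truth)
print(f"error before: {err_before:.2f}, after: {err_after:.2f}")
```

In this sketch the replicates identify the directions of unwanted variation across samples, while the negative controls tie those directions back to each individual sample. Neither ingredient suffices on its own here, which is the intuition the abstract attributes to combining them.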
Taxonomy
Topics: Statistics Education and Methodologies · Data Analysis with R · Advanced Clustering Algorithms Research
