Segmented correspondence curve regression model for quantifying reproducibility of high-throughput experiments
Feipeng Zhang, Frank Shen, Tao Yang, Qunhua Li

TL;DR
This paper introduces a segmented regression model to quantify how operational factors affect reproducibility in high-throughput experiments, accounting for heterogeneity across significance levels, and demonstrates its effectiveness through simulations and real data analysis.
Contribution
The paper develops a novel segmented regression model based on rank concordance to analyze operational factors' effects on reproducibility at different significance levels.
Findings
The method accurately detects change points in operational effects.
It outperforms existing methods in model fitting and error control.
Application reveals sequencing depth impacts reproducibility.
Abstract
The reliability of a high-throughput biological experiment relies highly on the settings of the operational factors in its experimental and data-analytic procedures. Understanding how operational factors influence the reproducibility of the experimental outcome is critical for constructing robust workflows and obtaining reliable results. One challenge in this area is that candidates at different levels of significance may respond to the operational factors differently. To model this heterogeneity, we develop a novel segmented regression model, based on the rank concordance between candidates from different replicate samples, to characterize the varying effects of operational factors for candidates at different levels of significance. A grid search method is developed to identify the change point in response to the operational factors and estimate the covariate effects accounting for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Machine Learning in Materials Science
