Evaluation of statistical approaches for association testing in noisy drug screening data
Petr Smirnov, Ian Smith, Zhaleh Safikhani, Wail Ba-alawi, Farnoosh, Khodakarami, Eva Lin, Yihong Yu, Scott Martin, Janosch Ortmann, Tero, Aittokallio, Marc Hafner, Benjamin Haibe-Kains

TL;DR
This paper introduces two semi-parametric variants of the concordance index, the rCI and kCI, designed to improve association testing in noisy drug screening data, with better power demonstrated in simulations and real datasets.
Contribution
The paper presents novel semi-parametric modifications of the concordance index that incorporate noise distribution information for improved association testing.
Findings
rCI and kCI outperform the concordance index in simulation power.
Pearson correlation is most robust to measurement noise.
Proposed methods show some improvement on real pharmacogenomic data.
Abstract
dentifying associations among biological variables is a major challenge in modern quantitative biological research, particularly given the systemic and statistical noise endemic to biological systems. Drug sensitivity data has proven to be a particularly challenging field for identifying associations to inform patient treatment. To address this, we introduce two semi-parametric variations on the commonly used concordance index: the robust concordance index and the kernelized concordance index (rCI, kCI), which incorporate measurements about the noise distribution from the data. We demonstrate that common statistical tests applied to the concordance index and its variations fail to control for false positives, and introduce efficient implementations to compute p-values using adaptive permutation testing. We then evaluate the statistical power of these coefficients under simulation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Gene expression and cancer classification · Statistical Methods and Inference
