StaRQR-K: False Discovery Rate Controlled Regional Quantile Regression
Sang Kyu Lee, Tongwu Zhang, Hyokyoung G. Hong, Haolei Weng

TL;DR
StaRQR-K is a novel statistical framework that identifies genomic regions associated with specific parts of an outcome distribution, controlling false discoveries in ultrahigh-dimensional settings, demonstrated through simulations and cancer data analysis.
Contribution
It introduces a stabilized regional quantile regression method with FDR control using model-X knockoffs, tailored for high-dimensional genomic data analysis.
Findings
Achieves valid FDR control in simulations
Detects region-specific effects in cancer data
Improves out-of-sample prediction accuracy
Abstract
Quantifying how genomic features influence different parts of an outcome distribution requires statistical tools that go beyond mean regression, especially in ultrahigh-dimensional settings. Motivated by the study of LINE-1 activity in cancer, we propose StaRQR-K, a stabilized regional quantile regression framework with model-X knockoffs for false discovery rate control. StaRQR-K identifies CpG sites whose methylation levels are associated with specific quantile regions of an outcome, allowing detection of heterogeneous and tail-sensitive effects. The method combines an efficient regional quantile sure independence screening procedure with a winsorizing-based model-X knockoff filter, providing false discovery rate (FDR) control for regional quantile regression. Simulation studies show that StaRQR-K achieves valid FDR control and substantially higher power than existing approaches. In an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Gene expression and cancer classification · Statistical Methods and Inference
