A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies
Verena Zuber, A. Pedro Duarte Silva, Korbinian Strimmer

TL;DR
This paper introduces a new efficient multivariate algorithm based on CAR scores for simultaneous SNP selection in high-dimensional genome-wide association studies, outperforming existing methods in identifying causal variants.
Contribution
The paper presents a novel computationally efficient procedure for shrinkage estimation of CAR scores, improving SNP selection accuracy in high-dimensional GWAS.
Findings
CAR score-based algorithm outperforms competing methods in causal SNP detection
The approach demonstrates higher accuracy in SNP ranking and recovery
An R package implementation is provided for practical use
Abstract
Background: Identification of causal SNPs in most genome wide association studies relies on approaches that consider each SNP individually. However, there is a strong correlation structure among SNPs that need to be taken into account. Hence, increasingly modern computationally expensive regression methods are employed for SNP selection that consider all markers simultaneously and thus incorporate dependencies among SNPs. Results: We develop a novel multivariate algorithm for large scale SNP selection using CAR score regression, a promising new approach for prioritizing biomarkers. Specifically, we propose a computationally efficient procedure for shrinkage estimation of CAR scores from high-dimensional data. Subsequently, we conduct a comprehensive comparison study including five advanced regression approaches (boosting, lasso, NEG, MCP, and CAR score) and a univariate approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
