Computationally Efficient Whole-Genome Signal Region Detection for Quantitative and Binary Traits
Wei Zhang, Fan Wang, Fang Yao

TL;DR
This paper introduces a distributed, covariate-adjusted version of the BiRS algorithm, called dBiRS, which efficiently detects genetic signal regions in whole-genome data for both binary and continuous traits, with theoretical error control.
Contribution
The paper presents dBiRS, a scalable, parallelizable algorithm that extends BiRS to handle covariates and various outcome types, improving computational efficiency and statistical power.
Findings
Validated dBiRS on UK Biobank data for cognitive traits.
Identified novel rare variants near new genes.
Confirmed previous genetic associations.
Abstract
The identification of genetic signal regions in the human genome is critical for understanding the genetic architecture of complex traits and diseases. Numerous methods based on scan algorithms (i.e. QSCAN, SCANG, SCANG-STARR) have been developed to allow dynamic window sizes in whole-genome association studies. Beyond scan algorithms, we have recently developed the binary and re-search (BiRS) algorithm, which is more computationally efficient than scan-based methods and exhibits superior statistical power. However, the BiRS algorithm is based on two-sample mean test for binary traits, not accounting for multidimensional covariates or handling test statistics for non-binary outcomes. In this work, we present a distributed version of the BiRS algorithm (dBiRS) that incorporate a new infinity-norm test statistic based on summary statistics computed from a generalized linear model. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Gene expression and cancer classification · Machine Learning in Bioinformatics
