An Efficient Sufficient Dimension Reduction Method for Identifying Genetic Variants of Clinical Significance
Momiao Xiong, Long Ma

TL;DR
This paper introduces a novel sparse sufficient dimension reduction method tailored for identifying clinically significant genetic variants from high-dimensional genomic data, enhancing efficiency and interpretability in genetic analysis.
Contribution
It develops a sparse SDR framework with algorithms capable of handling millions of predictors, incorporating parallel computation for efficient genome-wide variant discovery.
Findings
Successfully applied to simulation data demonstrating effectiveness.
Analyzed NHLBI Exome Sequencing Project data with promising results.
Provides a scalable approach for genetic variant selection.
Abstract
Fast and cheaper next generation sequencing technologies will generate unprecedentedly massive and highly-dimensional genomic and epigenomic variation data. In the near future, a routine part of medical record will include the sequenced genomes. A fundamental question is how to efficiently extract genomic and epigenomic variants of clinical utility which will provide information for optimal wellness and interference strategies. Traditional paradigm for identifying variants of clinical validity is to test association of the variants. However, significantly associated genetic variants may or may not be usefulness for diagnosis and prognosis of diseases. Alternative to association studies for finding genetic variants of predictive utility is to systematically search variants that contain sufficient information for phenotype prediction. To achieve this, we introduce concepts of sufficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genomics and Rare Diseases · Gene expression and cancer classification
