Wavelet-based genetic association analysis of functional phenotypes arising from high-throughput sequencing assays
Heejung Shim, Matthew Stephens

TL;DR
This paper introduces wavelet-based statistical methods for high-resolution genetic association analysis of functional phenotypes from sequencing data, improving detection of associated variants over traditional window-based approaches.
Contribution
It develops a novel wavelet-based approach to analyze functional genomic data, enhancing the identification of genetic variants linked to cellular phenotypes.
Findings
Identified 772 novel dsQTLs not found by previous methods.
More associations detected compared to traditional window-based analysis.
Method leverages high-resolution sequencing data effectively.
Abstract
Understanding how genetic variants influence cellular-level processes is an important step towards understanding how they influence important organismal-level traits, or "phenotypes", including human disease susceptibility. To this end scientists are undertaking large-scale genetic association studies that aim to identify genetic variants associated with molecular and cellular phenotypes, such as gene expression, transcription factor binding, or chromatin accessibility. These studies use high-throughput sequencing assays (e.g. RNA-seq, ChIP-seq, DNase-seq) to obtain high-resolution data on how the traits vary along the genome in each sample. However, typical association analyses fail to exploit these high-resolution measurements, instead aggregating the data at coarser resolutions, such as genes, or windows of fixed length. Here we develop and apply statistical methods that better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
