Powerful extreme phenotype sampling designs and score tests for genetic association studies
Thea Bj{\o}rnland, Anja Bye, Einar Ryeng, Ulrik Wisl{\o}ff, Mette, Langaas

TL;DR
This paper introduces personalized extreme phenotype sampling designs and score tests for genetic association studies, demonstrating improved power over classical methods through simulations and data analysis.
Contribution
It proposes a novel $z$-extreme sampling design with tailored thresholds and derives score tests for analyzing extreme phenotypes, enhancing power in genetic association studies.
Findings
$z$-extreme sampling is as powerful as all case analysis.
Classical extreme sampling can be less powerful than random sampling.
All case analysis generally outperforms complete case analysis.
Abstract
We consider cross-sectional genetic association studies (common and rare variants) where non-genetic information is available, or feasible to obtain for individuals, but where it is infeasible to genotype all individuals. We consider continuously measurable Gaussian traits (phenotypes). Genotyping extreme phenotype individuals can yield better power to detect phenotype-genotype associations, as compared to randomly selecting individuals. We define a person as having an extreme phenotype if the observed phenotype is above a specified threshold or below a specified thresholds. We consider a model where these thresholds can be tailored to each individual. The classical extreme sampling design is to set equal thresholds for all individuals. We introduce a design (-extreme sampling) where personalized thresholds are defined based on the residuals of a regression model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
