A powerful and efficient set test for genetic markers that handles confounders
Jennifer Listgarten, Christoph Lippert, Eun Yong Kang, Jing Xiang,, Carl M. Kadie, David Heckerman

TL;DR
This paper introduces a new set test for genetic markers that effectively handles confounders like population structure and relatedness, improving power and accuracy in large genetic datasets.
Contribution
The authors develop a linear mixed model-based set test with a computational speedup, capable of correcting for confounders in large cohorts, which was not addressed by previous methods.
Findings
Successfully corrects for population structure and relatedness
Recovers genes missed by univariate analysis in Crohn's disease data
More powerful than existing methods while controlling type I error
Abstract
Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants, and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger data sets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects-one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two-random-effects models that makes this approach feasible even for extremely large cohorts. Using this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
