Variance Components Genetic Association Test for Zero-inflated Count Outcomes
Matthew Goodman, Lori Chibnik, Tianxi Cai

TL;DR
This paper introduces a new variance components score test for detecting genetic associations with zero-inflated count outcomes, common in biomedical studies, demonstrating improved power over existing methods especially with correlated markers.
Contribution
The paper presents a novel statistical test tailored for zero-inflated count data, extending SNP-set testing frameworks like SKAT to handle mixture distributions with structural zeros.
Findings
The method shows superior power compared to existing tests.
Application to Alzheimer's data reveals significant associations with APOE.
The test effectively detects associations with both zero-inflation and count parameters.
Abstract
Commonly in biomedical research, studies collect data in which an outcome measure contains informative excess zeros; for example when observing the burden of neuritic plaques in brain pathology studies, those who show none contribute to our understanding of neurodegenerative disease. The outcome may be characterized by a mixture distribution with one component being the `structural zero' and the other component being a Poisson distribution. We propose a novel variance components score test of genetic association between a set of genetic markers and a zero-inflated count outcome from a mixture distribution. This test shares advantageous properties with SNP-set tests which have been previously devised for standard continuous or binary outcomes, such as the Sequence Kernel Association Test (SKAT). In particular, our method has superior statistical power compared to competing methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Gene expression and cancer classification · Bioinformatics and Genomic Networks
