Population Structure and Cryptic Relatedness in Genetic Association Studies
William Astle, David J. Balding

TL;DR
This paper reviews how population structure and cryptic relatedness confound genetic association studies, proposing a unified kinship-based framework and discussing methods like linear mixed models to address these issues.
Contribution
It introduces a unified approach to understanding and estimating kinship, integrating various solutions to control confounding in genetic association studies.
Findings
Linear mixed models effectively control for population structure.
Kinship estimation methods improve association test accuracy.
Recent computational tools enhance analysis of human genetic data.
Abstract
We review the problem of confounding in genetic association studies, which arises principally because of population structure and cryptic relatedness. Many treatments of the problem consider only a simple ``island'' model of population structure. We take a broader approach, which views population structure and cryptic relatedness as different aspects of a single confounder: the unobserved pedigree defining the (often distant) relationships among the study subjects. Kinship is therefore a central concept, and we review methods of defining and estimating kinship coefficients, both pedigree-based and marker-based. In this unified framework we review solutions to the problem of population structure, including family-based study designs, genomic control, structured association, regression control, principal components adjustment and linear mixed models. The last solution makes the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
