Low-pass Genomewide Sequencing and Variant Imputation Using Identity-by-descent in an Isolated Human Population
A Gusev, MJ Shah, EE Kenny, A Ramachandran, JK Lowe, J Salit, CC Lee,, EC Levandowsky, TN Weaver, QC Doan, HE Peckham, SF McLaughlin, MR Lyons, VN, Sheth, M Stoffel, FM De La Vega, JM Friedman, JL Breslow, I Pe'er

TL;DR
This study demonstrates that low-pass genome sequencing combined with identity-by-descent analysis enables high-accuracy imputation of both known and novel variants in an isolated human population, reducing the need for extensive sequencing.
Contribution
It introduces a novel approach leveraging shared haplotypes for variant imputation in isolated populations, achieving high accuracy with minimal sequencing.
Findings
Sequencing 40 genomes allows imputation of 60% of the cohort.
Identified over 5.7 million variants, including 1.2 million novel.
Achieved 99.6% accuracy for known variants and 97.3% for novel variants.
Abstract
Whole-genome sequencing in an isolated population with few founders directly ascertains variants from the population bottleneck that may be rare elsewhere. In such populations, shared haplotypes allow imputation of variants in unsequenced samples without resorting to statistical methods, as in studies of outbred cohorts. We focus on an isolated population cohort from the Pacific Island of Kosrae, Micronesia, where we previously collected SNP array and rich phenotype data for the majority of the population. We report identification of long regions with haplotypes co-inherited between pairs of individuals and methodology to leverage such shared genetic content for imputation. Our estimates show that sequencing as few as 40 personal genomes allows for imputation in up to 60% of the 3,000-person cohort at the average locus. We ascertained a pilot data-set of whole-genome sequences from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic and Genetic Research · Genetic Associations and Epidemiology · Genomics and Rare Diseases
