Detecting the structure of haplotypes, local ancestry and excessive local European ancestry in Mexicans
Yongtao Guan

TL;DR
This paper introduces a two-layer hidden Markov model for detecting haplotype structure and inferring local ancestry in admixed individuals. It outperforms existing methods, especially for short ancestral segments, and reveals regions of excessive European ancestry in Mexicans, one of which harbors genes associated with autism and schizophrenia.
Contribution
The paper presents a novel two-layer hidden Markov model that improves local ancestry inference by modeling two scales of linkage disequilibrium, specifically tailored for admixed populations.
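The idea of two scales of linkage disequilibrium can be made concrete with a toy sketch. It is not the paper's model: the parameterization is an assumption chosen for illustration. Hidden states are pairs (ancestry, cluster); a rare "upper" jump (hypothetical rate `a`) redraws both ancestry and cluster, while a more frequent "lower" jump (hypothetical rate `b`) redraws only the cluster within the current ancestry. The names `transition`, `ancestry_posterior`, and `theta` (per-cluster allele frequencies) are likewise made up here.

```python
from itertools import product

def transition(S, K, a, b):
    """Toy two-scale transition probabilities over (ancestry, cluster) states.

    a: assumed probability of an upper-layer jump (ancestry and cluster redrawn uniformly)
    b: assumed probability of a lower-layer jump (cluster redrawn within the same ancestry)
    """
    states = list(product(range(S), range(K)))
    T = {}
    for (s, k) in states:
        for (s2, k2) in states:
            p = a / (S * K)                      # upper-layer jump
            if s2 == s:
                p += (1 - a) * b / K             # lower-layer jump, same ancestry
                if k2 == k:
                    p += (1 - a) * (1 - b)       # no jump at all
            T[(s, k), (s2, k2)] = p
    return states, T

def ancestry_posterior(hap, theta, a, b):
    """Posterior ancestry per marker for one haplotype via scaled forward-backward.

    hap:   list of 0/1 alleles
    theta: theta[s][k][m] = assumed frequency of allele 1 in cluster k of ancestry s
    """
    S, K, M = len(theta), len(theta[0]), len(hap)
    states, T = transition(S, K, a, b)

    def emit(state, m):
        f = theta[state[0]][state[1]][m]
        return f if hap[m] == 1 else 1.0 - f

    def normalize(d):
        z = sum(d.values())
        return {st: v / z for st, v in d.items()}

    # Forward pass with a uniform prior over states.
    fwd = [normalize({st: emit(st, 0) for st in states})]
    for m in range(1, M):
        prev = fwd[-1]
        fwd.append(normalize({
            st: emit(st, m) * sum(prev[u] * T[u, st] for u in states)
            for st in states
        }))

    # Backward pass.
    bwd = [None] * M
    bwd[M - 1] = {st: 1.0 for st in states}
    for m in range(M - 2, -1, -1):
        nxt = bwd[m + 1]
        bwd[m] = normalize({
            st: sum(T[st, u] * emit(u, m + 1) * nxt[u] for u in states)
            for st in states
        })

    # Marginalize the cluster layer to obtain ancestry posteriors.
    post = []
    for m in range(M):
        w = normalize({st: fwd[m][st] * bwd[m][st] for st in states})
        post.append([sum(w[(s, k)] for k in range(K)) for s in range(S)])
    return post

# Toy data: ancestry 0 clusters carry allele 1 often, ancestry 1 clusters rarely.
theta = [[[0.9] * 6, [0.8] * 6], [[0.1] * 6, [0.2] * 6]]
hap = [1, 1, 1, 0, 0, 0]
post = ancestry_posterior(hap, theta, a=0.05, b=0.2)
```

In this toy setup the posterior should favor ancestry 0 at the first markers and ancestry 1 at the last ones. Summing over the cluster layer is what turns haplotype-level structure into a local ancestry call.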
Findings
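Identified five coding regions with excessive European ancestry (average dosage > 1.6) in Mexican samples from HapMap3.
Discovered a 1.1 Mb region on chromosome 2p23 (average dosage 1.95) that harbors PXDN and MYT1L, two genes associated with autism and schizophrenia.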
Validated findings using independent Mexican samples from the 1000 Genomes Project.
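The "average dosage" threshold can be illustrated with a short sketch. It assumes dosage means the number of European-ancestry copies an individual carries at a marker (between 0 and 2), averaged over individuals. The function names and toy numbers below are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

def mean_dosage(dosages: List[List[float]]) -> List[float]:
    """Average per-marker dosage across individuals (rows are individuals)."""
    n = len(dosages)
    return [sum(col) / n for col in zip(*dosages)]

def excess_regions(positions: List[int], mean: List[float],
                   threshold: float = 1.6) -> List[Tuple[int, int]]:
    """Return (start, end) positions of each maximal run of markers whose mean dosage exceeds threshold."""
    regions = []
    start = None
    for i, m in enumerate(mean):
        if m > threshold:
            if start is None:
                start = i                      # open a new run
        elif start is not None:
            regions.append((positions[start], positions[i - 1]))
            start = None                       # close the current run
    if start is not None:                      # run extends to the last marker
        regions.append((positions[start], positions[-1]))
    return regions

# Toy data: three individuals, five markers (positions are arbitrary).
positions = [100, 200, 300, 400, 500]
dosages = [
    [1.0, 2.0, 2.0, 1.0, 0.0],
    [1.0, 2.0, 1.5, 1.0, 1.0],
    [0.5, 1.5, 2.0, 1.0, 1.0],
]
avg = mean_dosage(dosages)
regions = excess_regions(positions, avg)
```

Here only the second and third markers have a mean dosage above 1.6, so a single region spanning positions 200 to 300 is reported.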
Abstract
We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows modeling two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local ancestry for admixed individuals. Our method outperforms competing state-of-the-art methods, particularly for regions of small ancestral track lengths. Applying our method to Mexican samples in HapMap3, we found five coding regions, with lengths on the megabase (Mb) scale, that exhibit excessive European ancestry (average dosage > 1.6). A particularly interesting region of 1.1 Mb (with average dosage 1.95) is located on chromosome 2p23 and harbors two genes, PXDN and MYT1L, both of which are associated with autism and schizophrenia. In light of the low prevalence of autism in Hispanics, this region…
Taxonomy
Topics: Genetic Associations and Epidemiology · Genetic Mapping and Diversity in Plants and Animals · Genomic variations and chromosomal abnormalities
