Convergence of Y chromosome STR haplotypes from different SNP haplogroups compromises accuracy of haplogroup prediction
Chuan-Chao Wang, Ling-Xiang Wang, Rukesh Shrestha, Shaoqing Wen,, Manfei Zhang, Xinzhu Tong, Li Jin, Hui Li

TL;DR
This study investigates how converging Y chromosome STR haplotypes from different SNP-defined haplogroups can reduce the accuracy of haplogroup prediction in forensic and population genetics.
Contribution
It identifies specific haplotype similarities across haplogroups that compromise the reliability of STR-based haplogroup prediction methods.
Findings
Identified multiple haplotype similarities among different haplogroups.
Convergence of haplotypes reduces prediction accuracy.
Highlights limitations of using STRs alone for haplogroup inference.
Abstract
Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) are two kinds of commonly used markers in Y chromosome studies of forensic and population genetics. There has been increasing interest in the cost saving strategy by using the STR haplotypes to predict SNP haplogroups. However, the convergence of Y chromosome STR haplotypes from different haplogroups might compromise the accuracy of haplogroup prediction. Here, we compared the worldwide Y chromosome lineages at both haplogroup level and haplotype level to search for the possible haplotype similarities among haplogroups. The similar haplotypes between haplogroups B and I2, C1 and E1b1b1, C2 and E1b1a1, H1 and J, L and O3a2c1, O1a and N, O3a1c and O3a2b, and M1 and O3a2 have been found, and those similarities reduce the accuracy of prediction.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic and Genetic Research · Genetic diversity and population structure · Genetic Associations and Epidemiology
