Phased-assembly-driven pangenome graphs for structural variant genotyping and complex trait mapping in dairy cattle
Liu Yang, Yahui Gao, Kristen L. Kuhn, Nayan Bhowmik, Wenli Li, Geoffrey Zanton, Lingzhao Fang, John B. Cole, Congjun Li, Ransom L. Baldwin, VI, Curtis P. Van Tassell, Benjamin D. Rosen, Li Ma, Timothy P. L. Smith, George E. Liu

TL;DR
This study creates a detailed genetic map for Holstein cattle to better detect structural variants and link them to important traits.
Contribution
The novel contribution is a phased, breed-specific pangenome graph that improves structural variant detection and genotyping in dairy cattle.
Findings
The H20D pangenome graph identifies over 10,000 additional structural variants per sample compared to other methods.
Phased pangenome graphs significantly improve variant detection in complex genomic regions.
Structural variants show greater genome-wide significance in GWAS than SNPs, suggesting their role in economically important traits.
Abstract
Structural variants are an underexplored source of genetic diversity. As part of the FarmGTEx Project, here we report a Holstein breed-specific pangenome graph (H20D) using Minigraph-Cactus and 40 phased haploid assemblies from 20 cows. H20D outperforms both assembly- and read-based long-read callers, and far exceeds short-read approaches, identifying over 10,000 additional structural variants per sample. It also significantly improves structural variant detection and genotyping relative to graphs built across breeds or from fewer/unphased assemblies, with particular advantages in complex regions. Using H20D, we genotype variants in 173 cattle and performed a GWAS, where a larger fraction of structural variants than SNPs reach genome-wide significance, implicating them as potential causal variants. Together, these results demonstrate the power of phased, within-breed pangenome graphs…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals · Genomics and Phylogenetic Studies
