Species Tree Branch Length Estimation despite Incomplete Lineage Sorting, Duplication, and Loss
Yasamin Tabatabaee, Chao Zhang, Shayesteh Arasti, Siavash Mirarab

TL;DR
This paper introduces a new algorithm called CASTLES-Pro to estimate species tree branch lengths while accounting for gene duplication, loss, and incomplete lineage sorting.
Contribution
CASTLES-Pro is the first method to accurately estimate species tree branch lengths from multi-copy gene families while addressing gene duplication, loss, and incomplete lineage sorting.
Findings
CASTLES-Pro improves accuracy for single-copy gene trees and extends to multi-copy gene families.
CASTLES-Pro reduces systematic bias in terminal branch length estimation compared to concatenation methods.
CASTLES-Pro is robust to random horizontal gene transfer but accuracy decreases at high transfer levels.
Abstract
Phylogenetic branch lengths are essential for many analyses, such as estimating divergence times, analyzing rate changes, and studying adaptation. However, true gene tree heterogeneity due to incomplete lineage sorting, gene duplication and loss, and horizontal gene transfer can complicate the estimation of species tree branch lengths. While several tools exist for estimating the topology of a species tree addressing various causes of gene tree discordance, much less attention has been paid to branch length estimation on multi-locus datasets. For single-copy gene trees, some methods are available that summarize gene tree branch lengths onto a species tree, including coalescent-based methods that account for heterogeneity due to incomplete lineage sorting. However, no such branch length estimation method exists for multi-copy gene family trees that have evolved with gene duplication and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Chromosomal and Genetic Variations · Genetic diversity and population structure
