Identifying the Rooted Species Tree from the Distribution of Unrooted Gene Trees under the Coalescent
Elizabeth S. Allman, James H. Degnan, John A. Rhodes

TL;DR
This paper demonstrates that the distribution of unrooted gene trees under the multispecies coalescent model can identify species tree topology and branch lengths, especially with five or more species, aiding phylogenetic inference.
Contribution
It shows that unrooted gene tree distributions can determine species tree topology and branch lengths, extending phylogenetic methods to unrooted gene trees.
Findings
For 4 species, unrooted gene tree distribution identifies the unrooted species tree topology.
For 5 or more species, it identifies the rooted species tree topology and all internal branch lengths.
Pendent branch lengths are identifiable when multiple genes are sampled from a species.
Abstract
Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals -- each with many genes -- splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are 4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic diversity and population structure · Bioinformatics and Genomic Networks
