Split scores: a tool to quantify phylogenetic signal in genome-scale   data

Elizabeth S. Allman; Laura S. Kubatko; John A. Rhodes

arXiv:1608.00942·q-bio.PE·January 3, 2017

Split scores: a tool to quantify phylogenetic signal in genome-scale data

Elizabeth S. Allman, Laura S. Kubatko, John A. Rhodes

PDF

TL;DR

The paper introduces the split score, a rapid and algebraic method to quantify phylogenetic support and detect evolutionary process variation along genomes using genome-scale data, applicable to large datasets.

Contribution

It presents the split score as a novel, efficient tool for assessing phylogenetic support and detecting evolutionary shifts in genome-scale data, outperforming traditional likelihood-based methods.

Findings

01

Split score effectively detects true phylogenetic splits.

02

It can identify changes in evolutionary processes across genomes.

03

The method is computationally efficient for large datasets.

Abstract

Detecting variation in the evolutionary process along chromosomes is increasingly important as whole-genome data becomes more widely available. For example, factors such as incomplete lineage sorting, horizontal gene transfer, and chromosomal inversion are expected to result in changes in the underlying gene trees along a chromosome, while changes in selective pressure and mutational rates for different genomic regions may lead to shifts in the underlying mutational process. We propose the split score as a general method for quantifying support for a particular phylogenetic relationship within a genomic data set. Because the split score is based on algebraic properties of a matrix of site pattern frequencies, it can be rapidly computed, even for data sets that are large in the number of taxa and/or in the length of the alignment, providing an advantage over other methods (e.g., maximum…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.