Statistical learning with phylogenetic network invariants

Travis Barton; Elizabeth Gross; Colby Long; Joseph Rusinko

arXiv:2211.11919·q-bio.PE·November 23, 2022·1 cites

Statistical learning with phylogenetic network invariants

Travis Barton, Elizabeth Gross, Colby Long, Joseph Rusinko

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method combining phylogenetic network invariants and support vector machines to accurately infer 4-leaf phylogenetic networks, addressing the challenge of residual deviations in real data.

Contribution

It proposes a new approach that uses invariant residuals and machine learning to classify phylogenetic networks from sequence data.

Findings

01

Effective classification of 4-leaf networks demonstrated on simulated data

02

Method successfully applied to primate genetic data

03

Improves inference accuracy over traditional invariant-based methods

Abstract

Phylogenetic networks provide a means of describing the evolutionary history of sets of species believed to have undergone hybridization or gene flow during their evolution. The mutation process for a set of such species can be modeled as a Markov process on a phylogenetic network. Previous work has shown that a site-pattern probability distributions from a Jukes-Cantor phylogenetic network model must satisfy certain algebraic invariants. As a corollary, aspects of the phylogenetic network are theoretically identifiable from site-pattern frequencies. In practice, because of the probabilistic nature of sequence evolution, the phylogenetic network invariants will rarely be satisfied, even for data generated under the model. Thus, using network invariants for inferring phylogenetic networks requires some means of interpreting the residuals, or deviations from zero, when observed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lizgross/inferring-phylogenetic-networks-with-qnr-svm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolution and Paleontology Studies · Genomics and Phylogenetic Studies · Bayesian Methods and Mixture Models