Distance to the stochastic part of phylogenetic varieties

Marta Casanellas; Jes\'us Fern\'andez-S\'anchez; Marina; Garrote-L\'opez

arXiv:1912.02138·q-bio.PE·October 12, 2020·J. Symb. Comput.

Distance to the stochastic part of phylogenetic varieties

Marta Casanellas, Jes\'us Fern\'andez-S\'anchez, Marina, Garrote-L\'opez

PDF

TL;DR

This paper investigates how the distance from observed data to algebraic varieties, especially their stochastic parts, can enhance phylogenetic tree reconstruction by integrating algebraic geometry and optimization techniques.

Contribution

It introduces methods to compute distances to algebraic varieties and their stochastic parts, improving phylogenetic inference accuracy.

Findings

01

Analytical distances computed for specific models

02

Use of nonlinear programming for distance optimization

03

Application of numerical algebraic geometry techniques

Abstract

Modelling the substitution of nucleotides along a phylogenetic tree is usually done by a hidden Markov process. This allows to define a distribution of characters at the leaves of the trees and one might be able to obtain polynomial relationships among the probabilities of different characters. The study of these polynomials and the geometry of the algebraic varieties defined by them can be used to reconstruct phylogenetic trees. However, not all points in these algebraic varieties have biological sense. In this paper, we explore the extent to which adding semi-algebraic conditions arising from the restriction to parameters with statistical meaning can improve existing methods of phylogenetic reconstruction. To this end, our aim is to compute the distance of data points to algebraic varieties and to the stochastic part of these varieties. Computing these distances involves optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.