A Stochastic Local Search algorithm for distance-based phylogeny reconstruction
F. Tria, E. Caglioti, V. Loreto, A. Pagnani

TL;DR
This paper introduces a novel stochastic local search algorithm for phylogeny reconstruction that effectively handles high mutation rates and horizontal transfer, outperforming existing methods in challenging scenarios.
Contribution
A new unified framework combining the four-points condition and Pauplin's formula, leading to a stochastic local search algorithm that improves phylogeny reconstruction accuracy.
Findings
Outperforms state-of-the-art algorithms on artificial data with high mutation rates.
Shows significant improvement in cases of horizontal transfer.
Effectively reduces the impact of non-additive distances.
Abstract
In many interesting cases the reconstruction of a correct phylogeny is blurred by high mutation rates and/or horizontal transfer events. As a consequence a divergence arises between the true evolutionary distances and the differences between pairs of taxa as inferred from available data, making the phylogenetic reconstruction a challenging problem. Mathematically this divergence translates in a loss of additivity of the actual distances between taxa. In distance-based reconstruction methods, two properties of additive distances were extensively exploited as antagonist criteria to drive phylogeny reconstruction: on the one hand a local property of quartets, i.e., sets of four taxa in a tree, the four-points condition; on the other hand a recently proposed formula that allows to write the tree length as a function of the distances between taxa, the Pauplin's formula. Here we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genomics and Rare Diseases
