TL;DR
This paper advances the computation of unrooted SPR distances in evolutionary trees by introducing the replug distance and a new search algorithm, enabling faster calculations of larger distances than previously possible.
Contribution
It introduces the replug distance as a new lower bound and develops a progressive A* search algorithm for exact unrooted SPR distance computation.
Findings
Nearly two orders of magnitude faster than previous methods on small trees
Allows computation of unrooted SPR distances up to 14 on 50-leaf trees
Identifies properties indicating no MAF formulation exists for unrooted SPR
Abstract
The subtree prune-and-regraft (SPR) distance metric is a fundamental way of comparing evolutionary trees. It has wide-ranging applications, such as to study lateral genetic transfer, viral recombination, and Markov chain Monte Carlo phylogenetic inference. Although the rooted version of SPR distance can be computed relatively efficiently between rooted trees using fixed-parameter-tractable maximum agreement forest (MAF) algorithms, no MAF formulation is known for the unrooted case. Correspondingly, previous algorithms are unable to compute unrooted SPR distances larger than 7. In this paper, we substantially advance understanding of and computational algorithms for the unrooted SPR distance. First we identify four properties of optimal SPR paths, each of which suggests that no MAF formulation exists in the unrooted case. Then we introduce the replug distance, a new lower bound on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
