Generalized Buneman pruning for inferring the most parsimonious multi-state phylogeny
Navodit Misra, Guy Blelloch, R. Ravi, Russell Schwartz

TL;DR
This paper introduces a new exact method for multi-state phylogeny reconstruction that generalizes the Buneman graph, enabling provably optimal solutions for larger and more complex datasets than previously possible.
Contribution
It presents a generalized Buneman graph and an ILP-based approach for exact multi-state maximum parsimony phylogeny inference, improving scalability and optimality.
Findings
Able to solve previously intractable datasets
Runs in times comparable to heuristics
First practical exact method for multi-state data
Abstract
Accurate reconstruction of phylogenies remains a key challenge in evolutionary biology. Most biologically plausible formulations of the problem are formally NP-hard, with no known efficient solution. The standard in practice are fast heuristic methods that are empirically known to work very well in general, but can yield results arbitrarily far from optimal. Practical exact methods, which yield exponential worst-case running times but generally much better times in practice, provide an important alternative. We report progress in this direction by introducing a provably optimal method for the weighted multi-state maximum parsimony phylogeny problem. The method is based on generalizing the notion of the Buneman graph, a construction key to efficient exact methods for binary sequences, so as to apply to sequences with arbitrary finite numbers of states with arbitrary state transition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
