Agreement forests of caterpillar trees: complexity, kernelization and branching
Steven Kelk, Ruben Meuwese

TL;DR
This paper investigates the computational complexity of finding agreement forests in caterpillar trees, showing NP-hardness, kernelization bounds, and an improved fixed-parameter tractable algorithm.
Contribution
It proves NP-hardness and APX-hardness for caterpillars, establishes a tight kernel size, and develops a faster FPT algorithm for agreement forests in caterpillar trees.
Findings
NP-hardness and APX-hardness for caterpillars
Kernel size reduced to at most 7k
FPT algorithm with time complexity O*(2.49^k)
Abstract
Given a set of species, a phylogenetic tree is an unrooted binary tree whose leaves are bijectively labelled by . Such trees can be used to show the way species evolve over time. One way of understanding how topologically different two phylogenetic trees are, is to construct a minimum-size agreement forest: a partition of into the smallest number of blocks, such that the blocks induce homeomorphic, non-overlapping subtrees in both trees. This comparison yields insight into commonalities and differences in the evolution of across the two trees. Computing a smallest agreement forest is NP-hard (Hein, Jiang, Wang and Zhang, Discrete Applied Mathematics 71(1-3), 1996). In this work we study the problem on caterpillars, which are path-like phylogenetic trees. We will demonstrate that, even if we restrict the input to this highly restricted subclass, the problem remains NP-hard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Constraint Satisfaction and Optimization · semigroups and automata theory
