The maximum agreement subtree problem
Daniel M. Martin, Bhalchandra D. Thatte

TL;DR
This paper studies the maximum size of common subtrees between two binary phylogenetic trees, establishing new lower bounds and structural results that improve understanding of their shared substructure.
Contribution
The paper proves a new lower bound of 1(\u221a{ log n}) for the largest common subtree, improving previous bounds, and introduces a Ramsey-type result for binary trees.
Findings
Any two binary phylogenetic trees have a common subtree on 1({ log n}) leaves.
When one tree is balanced or a caterpillar, the largest common subtree has 1( log n) leaves.
Balanced trees share a common subtree on a constant fraction of leaves, n^, for some > 0.
Abstract
In this paper we investigate an extremal problem on binary phylogenetic trees. Given two such trees and , both with leaf-set , we are interested in the size of the largest subset of leaves in a common subtree of and . We show that any two binary phylogenetic trees have a common subtree on leaves, thus improving on the previously known bound of due to M. Steel and L. Szekely. To achieve this improved bound, we first consider two special cases of the problem: when one of the trees is balanced or a caterpillar, we show that the largest common subtree has leaves. We then handle the general case by proving and applying a Ramsey-type result: that every binary tree contains either a large balanced subtree or a large caterpillar. We also show that there are constants $c,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · Data Management and Algorithms · Topological and Geometric Data Analysis
