Defining a phylogenetic tree with the minimum number of small-state characters
Yangjing Long, Tong Wang

TL;DR
This paper determines the minimal number of small-state characters needed to uniquely define phylogenetic trees, establishing exact thresholds for 3- and 5-state cases through rigorous mathematical proofs.
Contribution
It provides the first exact thresholds for the minimal number of characters needed in 3- and 5-state cases to define phylogenetic trees, advancing theoretical understanding.
Findings
Established that n_3 = 8 for 3-state characters.
Constructed a counterexample for 15 5-state characters.
Proved that for n ≥ 16, ⌈(n-3)/4⌉ 5-state characters suffice.
Abstract
Phylogenetic trees represent evolutionary relationships and can be uniquely defined by sets of finite-state biological characteristics. Despite prior work showing that sufficiently large trees can be determined by -state character sets, the minimal leaf thresholds remain largely unknown. In this work, we establish the 3-state case as , providing a concrete base for higher-state analyses. We then resolve the 5-state problem by constructing a counterexample for and proving that for , 5-state characters suffice to uniquely define any tree. Our approach relies on rigorous mathematical induction with complete verification of base cases and logically consistent inductive steps, offering new insights into the minimal conditions for character-based tree identification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Plant Diversity and Evolution · Genome Rearrangement Algorithms
