Capturing a phylogenetic tree when the number of character states varies with the number of leaves
Mike Steel

TL;DR
This paper proves that for large enough phylogenetic trees, it is possible to select a small set of characters with limited states that uniquely determine the tree, using probabilistic methods.
Contribution
It establishes a new combinatorial result showing the existence of small character sets with bounded states that capture any large binary phylogenetic tree.
Findings
Existence of character sets with size proportional to n^α
Characters can have at most n^β states
Unique tree capture guaranteed for large n
Abstract
We show that for any two values for which then there is a value so that for all the following holds. For any binary phylogenetic tree on leaves there is a set of characters that capture , and for which each character takes at most distinct states. Here `capture' means that is the unique perfect phylogeny for these characters. Our short proof of this combinatorial result is based on the probabilistic method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genome Rearrangement Algorithms · Chromosomal and Genetic Variations
