Predicting the ancestral character changes in a tree is typically easier than predicting the root state
Olivier Gascuel, Mike Steel

TL;DR
Predicting ancestral character changes along a phylogenetic tree is generally easier than predicting the root state, with methods showing robustness even at high mutation rates and complex scenarios.
Contribution
This study compares the difficulty of reconstructing ancestral changes versus root states, revealing that internal node predictions are often more accurate and robust.
Findings
Reconstructing ancestral changes is easier than root state prediction.
Internal node reconstruction remains feasible at high substitution rates.
Methods are robust to sampling bias and model mis-specification.
Abstract
Predicting the ancestral sequences of a group of homologous sequences related by a phylogenetic tree has been the subject of many studies, and numerous methods have been proposed to this purpose. Theoretical results are available that show that when the mutation rate become too large, reconstructing the ancestral state at the tree root is no longer feasible. Here, we also study the reconstruction of the ancestral changes that occurred along the tree edges. We show that, depending on the tree and branch length distribution, reconstructing these changes (i.e. reconstructing the ancestral state of all internal nodes in the tree) may be easier or harder than reconstructing the ancestral root state. However, results from information theory indicate that for the standard Yule tree, the task of reconstructing internal node states remains feasible, even for very high substitution rates.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic diversity and population structure · Fractal and DNA sequence analysis
