Probabilities on cladograms: introduction to the alpha model
Daniel J. Ford

TL;DR
The paper introduces the alpha model, a flexible probabilistic framework for cladograms that encompasses several classical models, providing explicit formulas and analyzing properties like leaf depth and cherry distribution.
Contribution
It presents the alpha model as a new, parametrized family of probabilities on cladograms with explicit formulas and analysis of its properties and implications.
Findings
Expected leaf depth scales as O(n^α)
Number of cherries is asymptotically normal
Model includes Yule, Uniform, and Comb models
Abstract
The alpha model, a parametrized family of probabilities on cladograms (rooted binary leaf labeled trees), is introduced. This model is Markovian self-similar, deletion-stable (sampling consistent), and passes through the Yule, Uniform and Comb models. An explicit formula is given to calculate the probability of any cladogram or tree shape under the alpha model. Sackin's and Colless' index are shown to be with asymptotic covariance equal to 1. Thus the expected depth of a random leaf with leaves is . The number of cherries on a random alpha tree is shown to be asymptotically normal with known mean and variance. Finally the shape of published phylogenies is examined, using trees from Treebase.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Bayesian Methods and Mixture Models · Ecology and Vegetation Dynamics Studies
