Distributions of cherries and pitchforks for the Ford model
Gursharn Kaur, Kwok Pui Choi, Taoyang Wu

TL;DR
This paper analyzes the distribution of cherries and pitchforks in Ford's alpha model, deriving laws, formulas, and correlations for these subtree counts in phylogenetic trees.
Contribution
It introduces a nonuniform Pólya urn approach to derive laws, formulas, and correlation behaviors for subtree counts in Ford's alpha model.
Findings
Strong law of large numbers for subtree counts
Exact recursive formulas for joint distributions
Critical alpha value for correlation sign change
Abstract
We study two fringe subtree counting statistics, the number of cherries and that of pitchforks for Ford's model, a one-parameter family of random phylogenetic tree models that includes the uniform and the Yule models, two tree models commonly used in phylogenetics. Based on a nonuniform version of the extended P\'olya urn models in which negative entries are permitted for their replacement matrices, we obtain the strong law of large numbers and the central limit theorem for the joint distribution of these two count statistics for the Ford model. Furthermore, we derive a recursive formula for computing the exact joint distribution of these two statistics. This leads to exact formulas for their means and higher order asymptotic expansions of their second moments, which allows us to identify a critical parameter value for the correlation between these two statistics. That is, when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis · Evolution and Paleontology Studies · Bayesian Methods and Mixture Models
