The allelic partition for coalescent point processes
Amaury Lambert (PMA)

TL;DR
This paper analyzes the mutation patterns and frequency spectra in populations modeled by coalescent point processes, providing explicit formulas for mutation and haplotype frequencies under various growth conditions.
Contribution
It introduces explicit formulas for mutation and haplotype frequencies in coalescent point processes, including cases with infinite branch length expectations.
Findings
Mutation and haplotype frequencies grow linearly with sample size.
For infinite branch lengths, mutation counts grow as n log(n).
Harmonic series and Fisher logarithmic series describe frequencies in critical birth-death processes.
Abstract
Assume that individuals alive at time in some population can be ranked in such a way that the coalescence times between consecutive individuals are i.i.d. The ranked sequence of these branches is called a coalescent point process. We have shown in a previous work that splitting trees are important instances of such populations. Here, individuals are given DNA sequences, and for a sample of DNA sequences belonging to distinct individuals, we consider the number of polymorphic sites (sites at which at least two sequences differ), and the number of distinct haplotypes (sequences differing at one site at least). It is standard to assume that mutations arrive at constant rate (on germ lines), and never hit the same site on the DNA sequence. We study the mutation pattern associated to coalescent point processes under this assumption. Here, and grow linearly as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Bayesian Methods and Mixture Models · Diffusion and Search Dynamics
