Models for Genetic Diversity Generated by Negative Binomial Point Processes
Yuguang F. Ipsen, Soudabeh Shemehsavar, Ross A. Maller

TL;DR
This paper introduces a flexible new model based on negative binomial point processes for analyzing genetic diversity, demonstrated on quoll microsatellite data, with implications for conservation efforts.
Contribution
It generalizes the Poisson-Dirichlet model using a negative binomial process, providing enhanced flexibility and a new sampling formula for genetic diversity analysis.
Findings
The new model fits quoll data better than standard models.
Parameter r improves interpretability of genetic diversity.
Generalized Ewens' sampling formula derived.
Abstract
We develop a model based on a generalised Poisson-Dirichlet distribution for the analysis of genetic diversity, and illustrate its use on microsatellite data for the genus Dasyurus (the quoll, a marsupial carnivore listed as near-threatened in Australia). Our class of distributions, termed , is constructed from a negative binomial point process, generalizing the usual one-parameter model, which is constructed from a Poisson point process. Both models have at their heart a Stable process, but in , an extra parameter adds flexibility, analogous to the way the negative binomial distribution allows for "overdispersion" in the analysis of count data. A key result obtained is a generalised version of Ewens' sampling formula for . We outline the theoretical basis for the model, and, for the quolls data, estimate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Stochastic processes and statistical mechanics · Genetic and phenotypic traits in livestock
