Ordered and size-biased frequencies in GEM and Gibbs models for species sampling
Jim Pitman, Yuri Yakubovich

TL;DR
This paper analyzes the distribution of ordered and size-biased frequencies in GEM and Gibbs models for species sampling, extending known identities and providing new constructions for sampling and frequency ordering.
Contribution
It generalizes a known distribution identity for GEM models to include the two-parameter case and extends the description to Gibbs frequencies with new sampling constructions.
Findings
Describes the distribution of ordered frequencies in GEM and Gibbs models.
Extends known identities to the two-parameter GEM case.
Provides new sampling and frequency ordering constructions.
Abstract
We describe the distribution of frequencies ordered by sample values in a random sample of size from the two parameter GEM random discrete distribution on the positive integers. These frequencies are a size-biased random permutation of the sample frequencies in either ranked order, or in the order of appearance of values in the sampling process. This generalizes a well known identity in distribution due to Donnelly and Tavar\'e (1986) for to the case . This description extends to sampling from Gibbs frequencies obtained by suitable conditioning of the GEM model, and yields a value-ordered version of the Chinese Restaurant construction of GEM and Gibbs frequencies in the more usual size-biased order of their appearance. The proofs are based on a general construction of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Sensory Analysis and Statistical Methods
