A Bayesian View of the Poisson-Dirichlet Process
Wray Buntine, Marcus Hutter

TL;DR
This paper reviews the theory and Bayesian properties of the two-parameter Poisson-Dirichlet Process, highlighting its conjugacy, clustering, and hierarchical capabilities for modeling discrete data.
Contribution
It provides a comprehensive Bayesian interpretation of the PDP, including its properties, conjugacy, and computational techniques for discrete distributions.
Findings
PDP allows hierarchical and layered partition modeling.
The process is partially conjugate to itself, enabling hierarchical structures.
Provides methods for computing Stirling numbers in posteriors.
Abstract
The two parameter Poisson-Dirichlet Process (PDP), a generalisation of the Dirichlet Process, is increasingly being used for probabilistic modelling in discrete areas such as language technology, bioinformatics, and image analysis. There is a rich literature about the PDP and its derivative distributions such as the Chinese Restaurant Process (CRP). This article reviews some of the basic theory and then the major results needed for Bayesian modelling of discrete problems including details of priors, posteriors and computation. The PDP allows one to build distributions over countable partitions. The PDP has two other remarkable properties: first it is partially conjugate to itself, which allows one to build hierarchies of PDPs, and second using a marginalised relative the CRP, one gets fragmentation and clustering properties that lets one layer partitions to build trees. This article…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Genome Rearrangement Algorithms
