Distance Dependent Chinese Restaurant Processes
David M. Blei, Peter I. Frazier

TL;DR
This paper introduces the distance dependent Chinese restaurant process, a flexible non-exchangeable distribution for modeling dependencies in infinite clustering, improving fit for sequential data and enabling faster Gibbs sampling.
Contribution
It presents a novel distance dependent CRP that relaxes exchangeability, connects to Bayesian mixture models, and offers improved inference algorithms.
Findings
Better fit to sequential data with distance dependent CRP
Faster Gibbs sampling compared to traditional CRP
Effective modeling of dependencies across time or space
Abstract
We develop the distance dependent Chinese restaurant process (CRP), a flexible class of distributions over partitions that allows for non-exchangeability. This class can be used to model many kinds of dependencies between data in infinite clustering models, including dependencies across time or space. We examine the properties of the distance dependent CRP, discuss its connections to Bayesian nonparametric mixture models, and derive a Gibbs sampler for both observed and mixture settings. We study its performance with three text corpora. We show that relaxing the assumption of exchangeability with distance dependent CRPs can provide a better fit to sequential data. We also show its alternative formulation of the traditional CRP leads to a faster-mixing Gibbs sampling algorithm than the one based on the original formulation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Time Series Analysis and Forecasting · Data Management and Algorithms
