Non-exchangeable random partition models for microclustering
Giuseppe Di Benedetto, Fran\c{c}ois Caron, Yee Whye Teh

TL;DR
This paper introduces a new class of non-exchangeable random partition models that allow for sublinear cluster size growth, providing more flexibility than traditional exchangeable models, with applications in clustering, graph generation, and real data analysis.
Contribution
The authors develop a flexible non-exchangeable partition model with controllable sublinear cluster growth, extending the applicability beyond exchangeable models like the Chinese restaurant process.
Findings
Model generates partitions with sublinear cluster size growth.
The model exhibits power-law behavior in cluster sizes.
Experiments show improved performance over traditional models.
Abstract
Many popular random partition models, such as the Chinese restaurant process and its two-parameter extension, fall in the class of exchangeable random partitions, and have found wide applicability in model-based clustering, population genetics, ecology or network analysis. While the exchangeability assumption is sensible in many cases, it has some strong implications. In particular, Kingman's representation theorem implies that the size of the clusters necessarily grows linearly with the sample size; this feature may be undesirable for some applications, as recently pointed out by Miller et al. (2015). We present here a flexible class of non-exchangeable random partition models which are able to generate partitions whose cluster sizes grow sublinearly with the sample size, and where the growth rate is controlled by one parameter. Along with this result, we provide the asymptotic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
