Vertical Consensus Inference for High-Dimensional Random Partition
Khai Nguyen, Yang Ni, Peter Mueller

TL;DR
This paper introduces Vertical Consensus Inference (VCI), a novel high-dimensional Bayesian clustering framework that partitions data vertically to mitigate the curse of dimensionality and improve inference accuracy.
Contribution
The paper proposes VCI, a new vertical data partitioning approach using Wasserstein barycenters, enhancing Bayesian clustering in high-dimensional settings.
Findings
VCI closely approximates full-data clustering in low dimensions.
In high dimensions, VCI provides a principled inference framework for noninformative data.
VCI connects to variational inference and generalized Bayes, broadening methodological applications.
Abstract
We review recently proposed Bayesian approaches for clustering high-dimensional data. After identifying the main limitations of available approaches, we introduce an alternative framework based on vertical consensus inference (VCI) to mitigate the curse of dimensionality in high-dimensional Bayesian clustering. VCI builds on the idea of consensus Monte Carlo by dividing the data into multiple shards (smaller subsets of variables), performing posterior inference on each shard, and then combining the shard-level posteriors to obtain a consensus posterior. The key distinction is that VCI splits the data vertically, producing vertical shards that retain the same number of observations but have lower dimensionality. We use an entropic regularized Wasserstein barycenter to define a consensus posterior. The shard-specific barycenter weights are constructed to favor shards that provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
