Model-based clustering in very high dimensions via adaptive projections
Bernd Taschler, Frank Dondelinger, Sach Mukherjee

TL;DR
This paper introduces MCAP, a novel model-based clustering method that uses adaptive low-dimensional projections to effectively identify mean and covariance signals in extremely high-dimensional data, overcoming existing limitations.
Contribution
MCAP automatically determines projection dimensions for high-dimensional clustering, enabling detection of mean and covariance differences with improved accuracy and computational efficiency.
Findings
Successfully detects covariance signals in problems with p ~ 10^4 or more.
Performs well even when mean signals are absent.
Remains stable and efficient up to p = 10^6.
Abstract
Mixture models are a standard approach to dealing with heterogeneous data with non-i.i.d. structure. However, when the dimension is large relative to sample size and where either or both of means and covariances/graphical models may differ between the latent groups, mixture models face statistical and computational difficulties and currently available methods cannot realistically go beyond or so. We propose an approach called Model-based Clustering via Adaptive Projections (MCAP). Instead of estimating mixtures in the original space, we work with a low-dimensional representation obtained by linear projection. The projection dimension itself plays an important role and governs a type of bias-variance tradeoff with respect to recovery of the relevant signals. MCAP sets the projection dimension automatically in a data-adaptive manner, using a proxy for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Algorithms and Data Compression
