CAKE: Confidence in Assignments via K-partition Ensembles
Aggelos Semoglou, John Pavlopoulos

TL;DR
CAKE is a framework that quantifies confidence in individual clustering assignments by combining ensemble stability and geometric consistency, aiding in identifying reliable data points.
Contribution
Introduces CAKE, a novel method that provides interpretable confidence scores for clustering assignments using ensemble-based stability and geometric support.
Findings
CAKE effectively highlights ambiguous and stable points in datasets.
Theoretical analysis confirms CAKE's robustness under noise.
Experiments show CAKE improves downstream clustering reliability.
Abstract
Clustering is widely used for unsupervised structure discovery, yet it offers limited insight into how reliable each individual assignment is. Diagnostics, such as convergence behavior or objective values, may reflect global quality, but they do not indicate whether particular instances are assigned confidently, especially for initialization-sensitive algorithms like k-means. This assignment-level instability can undermine both accuracy and robustness. Ensemble approaches improve global consistency by aggregating multiple runs, but they typically lack tools for quantifying pointwise confidence in a way that combines cross-run agreement with geometric support from the learned cluster structure. This work introduces CAKE (Confidence in Assignments via K-partition Ensembles), a framework that evaluates each point using two complementary statistics computed over a clustering ensemble:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
