Parameter-Free Clustering via Self-Supervised Consensus Maximization (Extended Version)
Lijun Zhang, Suyuan Liu, Siwei Wang, Shengju Yu, Xueling Zhu, Miaomiao Li, Xinwang Liu

TL;DR
This paper introduces SCMax, a fully parameter-free clustering method that combines hierarchical agglomerative clustering with self-supervised learning and consensus scoring to determine the optimal number of clusters without hyperparameters.
Contribution
The paper presents a novel parameter-free clustering framework that integrates self-supervised learning and consensus maximization within hierarchical clustering, eliminating the need for hyperparameter tuning.
Findings
Outperforms existing methods on multiple datasets.
Effectively determines the optimal number of clusters.
Does not require hyperparameter tuning.
Abstract
Clustering is a fundamental task in unsupervised learning, but most existing methods heavily rely on hyperparameters such as the number of clusters or other sensitive settings, limiting their applicability in real-world scenarios. To address this long-standing challenge, we propose a novel and fully parameter-free clustering framework via Self-supervised Consensus Maximization, named SCMax. Our framework performs hierarchical agglomerative clustering and cluster evaluation in a single, integrated process. At each step of agglomeration, it creates a new, structure-aware data representation through a self-supervised learning task guided by the current clustering structure. We then introduce a nearest neighbor consensus score, which measures the agreement between the nearest neighbor-based merge decisions suggested by the original representation and the self-supervised one. The moment at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Domain Adaptation and Few-Shot Learning
