DeepDPM: Deep Clustering With an Unknown Number of Clusters
Meitar Ronen, Shahaf E. Finder, Oren Freifeld

TL;DR
DeepDPM introduces a deep clustering method that dynamically infers the number of clusters during training, eliminating the need for predefined K and outperforming existing methods, including on large-scale datasets like ImageNet.
Contribution
It proposes a novel deep clustering approach that infers the number of clusters during training using a split/merge framework and a new loss, addressing a key limitation of existing methods.
Findings
Outperforms existing nonparametric deep clustering methods.
First to evaluate deep nonparametric clustering on ImageNet.
Demonstrates the importance of inferring K for imbalanced datasets.
Abstract
Deep Learning (DL) has shown great promise in the unsupervised task of clustering. That said, while in classical (i.e., non-deep) clustering the benefits of the nonparametric approach are well known, most deep-clustering methods are parametric: namely, they require a predefined and fixed number of clusters, denoted by K. When K is unknown, however, using model-selection criteria to choose its optimal value might become computationally expensive, especially in DL as the training process would have to be repeated numerous times. In this work, we bridge this gap by introducing an effective deep-clustering method that does not require knowing the value of K as it infers it during the learning. Using a split/merge framework, a dynamic architecture that adapts to the changing K, and a novel loss, our proposed method outperforms existing nonparametric methods (both classical and deep ones).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications
