Active Clustering with Model-Based Uncertainty Reduction
Caiming Xiong, David Johnson, Jason J. Corso

TL;DR
This paper introduces an active semi-supervised spectral clustering method that intelligently selects pairwise constraints to improve clustering quality efficiently, reducing human effort and outperforming existing techniques.
Contribution
The paper presents a novel online framework for active clustering that uses uncertainty reduction to select the most impactful human queries during the clustering process.
Findings
Outperforms state-of-the-art clustering methods across multiple datasets.
Robust to noise and unknown number of clusters.
Effectively reduces human labeling effort.
Abstract
Semi-supervised clustering seeks to augment traditional clustering methods by incorporating side information provided via human expertise in order to increase the semantic meaningfulness of the resulting clusters. However, most current methods are \emph{passive} in the sense that the side information is provided beforehand and selected randomly. This may require a large number of constraints, some of which could be redundant, unnecessary, or even detrimental to the clustering results. Thus in order to scale such semi-supervised algorithms to larger problems it is desirable to pursue an \emph{active} clustering method---i.e. an algorithm that maximizes the effectiveness of the available human labor by only requesting human input where it will have the greatest impact. Here, we propose a novel online framework for active semi-supervised spectral clustering that selects pairwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
MethodsSpectral Clustering
