TL;DR
This paper introduces Twin Contrastive Learning (TCL), a novel online clustering method that simultaneously learns instance and cluster representations through contrastive learning, improving clustering accuracy on various benchmarks.
Contribution
The paper presents a new twin contrastive learning framework that operates at both instance and cluster levels for online clustering, with a confidence-based pseudo-label refinement mechanism.
Findings
TCL outperforms existing methods on six image and text benchmarks.
The method effectively handles false-negative pairs and improves cluster assignment accuracy.
TCL can independently predict cluster assignments, suitable for online applications.
Abstract
This paper proposes to perform online clustering by conducting twin contrastive learning (TCL) at the instance and cluster level. Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively. Based on the observation, for a given dataset, the proposed TCL first constructs positive and negative pairs through data augmentations. Thereafter, in the row and column space of the feature matrix, instance- and cluster-level contrastive learning are respectively conducted by pulling together positive pairs while pushing apart the negatives. To alleviate the influence of intrinsic false-negative pairs and rectify cluster assignments, we adopt a confidence-based criterion to select pseudo-labels for boosting both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
