Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery
Zhimao Peng, Enguang Wang, Fei Yang, Xialei Liu, and Ming-Ming Cheng

TL;DR
This paper introduces a novel approach for generalized category discovery that improves pseudo-label quality and clustering accuracy by reducing overfitting and dynamically selecting representative samples, achieving state-of-the-art results.
Contribution
The proposed method combines Loss Sharpness Penalty and Dynamic Anchor Selection to enhance robustness and adaptively select samples, addressing noise and confidence issues in GCD.
Findings
Achieves state-of-the-art results on multiple GCD benchmarks.
Effectively mitigates pseudo-label noise and overfitting.
Improves clustering accuracy for unknown classes.
Abstract
Generalized category discovery (GCD) is an important and challenging task in open-world learning. Specifically, given some labeled data of known classes, GCD aims to cluster unlabeled data that contain both known and unknown classes. Current GCD methods based on parametric classification adopt the DINO-like pseudo-labeling strategy, where the sharpened probability output of one view is used as supervision information for the other view. However, large pre-trained models have a preference for some specific visual patterns, resulting in encoding spurious correlation for unlabeled data and generating noisy pseudo-labels. To address this issue, we propose a novel method, which contains two modules: Loss Sharpness Penalty (LSP) and Dynamic Anchor Selection (DAS). LSP enhances the robustness of model parameters to small perturbations by minimizing the worst-case loss sharpness of the model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Machine Learning and Data Classification
