ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery

Kailin Lyu; Jianwei He; Long Xiao; Jianing Zeng; Liang Fan; Lin Shu; Jie Hao

arXiv:2511.22892·cs.CV·December 1, 2025

ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery

Kailin Lyu, Jianwei He, Long Xiao, Jianing Zeng, Liang Fan, Lin Shu, Jie Hao

PDF

Open Access

TL;DR

ClearGCD introduces a novel framework that reduces shortcut learning in generalized category discovery, improving robustness and accuracy in identifying both known and novel categories in open-world data.

Contribution

It proposes two new mechanisms, Semantic View Alignment and Shortcut Suppression Regularization, to mitigate shortcut learning and enhance GCD performance.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Effectively reduces prototype confusion caused by shortcut learning.

03

Enhances generalization to novel categories.

Abstract

In open-world scenarios, Generalized Category Discovery (GCD) requires identifying both known and novel categories within unlabeled data. However, existing methods often suffer from prototype confusion caused by shortcut learning, which undermines generalization and leads to forgetting of known classes. We propose ClearGCD, a framework designed to mitigate reliance on non-semantic cues through two complementary mechanisms. First, Semantic View Alignment (SVA) generates strong augmentations via cross-class patch replacement and enforces semantic consistency using weak augmentations. Second, Shortcut Suppression Regularization (SSR) maintains an adaptive prototype bank that aligns known classes while encouraging separation of potential novel ones. ClearGCD can be seamlessly integrated into parametric GCD approaches and consistently outperforms state-of-the-art methods across multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Text and Document Classification Technologies