Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
Yu Liu, Yaqi Cai, Qi Jia, Binglin Qiu, Weimin Wang, Nan Pu

TL;DR
This paper introduces a novel task called Ultra-Fine-Grained Novel Class Discovery (UFG-NCD) for ultra-fine-grained visual categorization, proposing a Region-Aligned Proxy Learning framework that leverages partially labeled data to identify new categories with improved accuracy.
Contribution
The paper presents a new task UFG-NCD and a Region-Aligned Proxy Learning framework with CRA and SemiPL modules, advancing ultra-fine-grained visual categorization without extensive human annotation.
Findings
RAPL outperforms baseline methods on multiple datasets.
CRA effectively captures discriminative local features.
SemiPL enhances knowledge transfer and representation learning.
Abstract
Ultra-fine-grained visual categorization (Ultra-FGVC) aims at distinguishing highly similar sub-categories within fine-grained objects, such as different soybean cultivars. Compared to traditional fine-grained visual categorization, Ultra-FGVC encounters more hurdles due to the small inter-class and large intra-class variation. Given these challenges, relying on human annotation for Ultra-FGVC is impractical. To this end, our work introduces a novel task termed Ultra-Fine-Grained Novel Class Discovery (UFG-NCD), which leverages partially annotated data to identify new categories of unlabeled images for Ultra-FGVC. To tackle this problem, we devise a Region-Aligned Proxy Learning (RAPL) framework, which comprises a Channel-wise Region Alignment (CRA) module and a Semi-Supervised Proxy Learning (SemiPL) strategy. The CRA module is designed to extract and utilize discriminative features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Face and Expression Recognition · Image Retrieval and Classification Techniques
