DNA: Denoised Neighborhood Aggregation for Fine-grained Category Discovery
Wenbin An, Feng Tian, Wenkai Shi, Yan Chen, Qinghua Zheng, QianYing, Wang, Ping Chen

TL;DR
This paper introduces Denoised Neighborhood Aggregation (DNA), a self-supervised method that improves fine-grained category discovery by leveraging semantic neighbor information and filtering noise to produce more compact and accurate clusters.
Contribution
The paper proposes a novel self-supervised framework that encodes semantic structures into embeddings and filters noisy neighbors, with theoretical justification linking it to clustering loss.
Findings
21.31% accuracy improvement in neighbor retrieval
Outperforms state-of-the-art models by 9.96% on three metrics
Effective in discovering fine-grained categories from coarse labels
Abstract
Discovering fine-grained categories from coarsely labeled data is a practical and challenging task, which can bridge the gap between the demand for fine-grained analysis and the high annotation cost. Previous works mainly focus on instance-level discrimination to learn low-level features, but ignore semantic similarities between data, which may prevent these models learning compact cluster representations. In this paper, we propose Denoised Neighborhood Aggregation (DNA), a self-supervised framework that encodes semantic structures of data into the embedding space. Specifically, we retrieve k-nearest neighbors of a query as its positive keys to capture semantic similarities between data and then aggregate information from the neighbors to learn compact cluster representations, which can make fine-grained categories more separatable. However, the retrieved neighbors can be noisy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · AI in cancer detection · COVID-19 diagnosis using AI
MethodsFocus
