Labeled Data Selection for Category Discovery
Bingchen Zhao, Nico Lang, Serge Belongie, Oisin Mac Aodha

TL;DR
This paper investigates how the selection of labeled data influences category discovery in unlabeled visual datasets, proposing new methods to optimize labeled data choice for improved discovery performance.
Contribution
It introduces two novel approaches for automatic labeled data selection based on data similarity, demonstrating their effectiveness in enhancing category discovery results.
Findings
Changing labeled data significantly affects discovery performance
Proposed methods outperform existing approaches on benchmark datasets
Optimal labeled data is neither too similar nor too dissimilar to unlabeled categories
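The last finding can be illustrated with a small sketch. This is not the paper's method, which the summary here does not spell out. It is a hypothetical heuristic that assumes each candidate labeled set and the unlabeled set are already represented as feature matrices, scores each candidate by mean cosine similarity to the unlabeled features, and picks the candidate whose score is closest to a mid-range target. The names `mean_cosine_similarity`, `pick_labeled_set`, and the `target` value are assumptions made for illustration.

```python
import numpy as np


def mean_cosine_similarity(a, b):
    """Average pairwise cosine similarity between rows of a and rows of b."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return float((a @ b.T).mean())


def pick_labeled_set(candidates, unlabeled, target=0.5):
    """Pick the candidate whose similarity to the unlabeled data is nearest `target`.

    `target` is a hypothetical mid-range value standing in for
    "neither too similar nor too dissimilar"; it is not taken from the paper.
    """
    scores = {
        name: mean_cosine_similarity(feats, unlabeled)
        for name, feats in candidates.items()
    }
    best = min(scores, key=lambda name: abs(scores[name] - target))
    return best, scores


# Toy 2-D features: one unlabeled direction and three candidate labeled sets.
unlabeled = np.array([[1.0, 0.0]])
candidates = {
    "near": np.array([[1.0, 0.0]]),  # identical direction, similarity 1
    "mid": np.array([[1.0, 1.0]]),   # 45 degrees away
    "far": np.array([[0.0, 1.0]]),   # orthogonal, similarity 0
}
best, scores = pick_labeled_set(candidates, unlabeled)
print(best, scores)
```

In this toy setup the moderately similar candidate is selected. In practice the features would come from a pretrained image encoder, and the target would have to be tuned or replaced by whatever criterion the paper actually uses.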
Abstract
Category discovery methods aim to find novel categories in unlabeled visual data. At training time, a set of labeled and unlabeled images are provided, where the labels correspond to the categories present in the images. The labeled data provides guidance during training by indicating what types of visual properties and features are relevant for performing discovery in the unlabeled data. As a result, changing the categories present in the labeled set can have a large impact on what is ultimately discovered in the unlabeled set. Despite its importance, the impact of labeled data selection has not been explored in the category discovery literature to date. We show that changing the labeled data can significantly impact discovery performance. Motivated by this, we propose two new approaches for automatically selecting the most suitable labeled data based on the similarity between the…
Taxonomy
Topics: Data Mining Algorithms and Applications · Rough Sets and Fuzzy Logic
Methods: Sparse Evolutionary Training
