In Defense of Core-set: A Density-aware Core-set Selection for Active Learning
Yeachan Kim, Bonggun Shin

TL;DR
This paper introduces DACS, a density-aware core-set method for active learning that selects diverse samples from sparse regions, improving performance over traditional uncertainty-based approaches.
Contribution
The work proposes a novel density-aware core-set approach for active learning, incorporating density estimation via locality-sensitive hashing to enhance sample diversity selection.
Findings
DACS outperforms existing methods in classification and regression tasks.
DACS achieves state-of-the-art performance in practical active learning scenarios.
Combining DACS with other methods yields further improvements.
Abstract
Active learning enables the efficient construction of a labeled dataset by labeling informative samples from an unlabeled dataset. In a real-world active learning scenario, considering the diversity of the selected samples is crucial because many redundant or highly similar samples exist. Core-set approach is the promising diversity-based method selecting diverse samples based on the distance between samples. However, the approach poorly performs compared to the uncertainty-based approaches that select the most difficult samples where neural models reveal low confidence. In this work, we analyze the feature space through the lens of the density and, interestingly, observe that locally sparse regions tend to have more informative samples than dense regions. Motivated by our analysis, we empower the core-set approach with the density-awareness and propose a density-aware core-set (DACS).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
