Importance of negative sampling in weak label learning

Ankit Shah; Fuyu Tang; Zelin Ye; Rita Singh; Bhiksha Raj

arXiv:2309.13227·cs.LG·September 26, 2023

Importance of negative sampling in weak label learning

Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj

PDF

Open Access

TL;DR

This paper investigates the importance of negative sampling strategies in weak-label learning, demonstrating that selecting informative negative instances improves classification accuracy and reduces computational costs.

Contribution

It introduces and evaluates negative sampling strategies tailored for weak-label learning, addressing the open problem of negative instance selection.

Findings

01

Improved classification performance on CIFAR-10 and AudioSet datasets.

02

Reduced computational cost compared to random sampling.

03

Negative instance selection benefits weak-label learning.

Abstract

Weak-label learning is a challenging task that requires learning from data "bags" containing positive and negative instances, but only the bag labels are known. The pool of negative instances is usually larger than positive instances, thus making selecting the most informative negative instance critical for performance. Such a selection strategy for negative instances from each bag is an open problem that has not been well studied for weak-label learning. In this paper, we study several sampling strategies that can measure the usefulness of negative instances for weak-label learning and select them accordingly. We test our method on CIFAR-10 and AudioSet datasets and show that it improves the weak-label classification performance and reduces the computational cost compared to random sampling methods. Our work reveals that negative instances are not all equally irrelevant, and selecting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning and Data Classification