On the reusability of samples in active learning

Gijs van Tulder; Marco Loog

arXiv:2206.06276·cs.LG·June 14, 2022

On the reusability of samples in active learning

Gijs van Tulder, Marco Loog

PDF

Open Access 1 Repo

TL;DR

This paper investigates the limits of sample reusability in active learning, demonstrating that universal reusability is impossible due to inherent undersampling, but identifying conditions where reusability can occur.

Contribution

It provides a theoretical and empirical analysis of sample reusability in active learning, highlighting its limitations and potential conditions for reusability between classifiers.

Findings

01

Universal reusability in active learning does not exist.

02

Reusability depends on dataset and classifier pairs.

03

Importance-weighted active learning impacts reusability.

Abstract

An interesting but not extensively studied question in active learning is that of sample reusability: to what extent can samples selected for one learner be reused by another? This paper explains why sample reusability is of practical interest, why reusability can be a problem, how reusability could be improved by importance-weighted active learning, and which obstacles to universal reusability remain. With theoretical arguments and practical demonstrations, this paper argues that universal reusability is impossible. Because every active learning strategy must undersample some areas of the sample space, learners that depend on the samples in those areas will learn more from a random sample selection. This paper describes several experiments with importance-weighted active learning that show the impact of the reusability problem in practice. The experiments confirmed that universal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JohnLangford/vowpal_wabbit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Mineral Processing and Grinding · Statistics Education and Methodologies