Targeting Optimal Active Learning via Example Quality
Lewis P. G. Evans, Niall M. Adams, Christoforos Anagnostopoulos

TL;DR
This paper introduces a theoretical framework for active learning based on expected loss reduction, providing insights and algorithms that outperform existing heuristics in selecting optimal examples for labeling.
Contribution
It presents a novel theoretical approach to active learning using example quality and develops algorithms to estimate this quality directly.
Findings
Algorithms based on example quality are competitive with standard AL methods.
Optimal selection based on expected loss reduction can outperform heuristic methods.
Theoretical insights clarify limitations of existing AL heuristics.
Abstract
In many classification problems unlabelled data is abundant and a subset can be chosen for labelling. This defines the context of active learning (AL), where methods systematically select that subset, to improve a classifier by retraining. Given a classification problem, and a classifier trained on a small number of labelled examples, consider the selection of a single further example. This example will be labelled by the oracle and then used to retrain the classifier. This example selection raises a central question: given a fully specified stochastic description of the classification problem, which example is the optimal selection? If optimality is defined in terms of loss, this definition directly produces expected loss reduction (ELR), a central quantity whose maximum yields the optimal example selection. This work presents a new theoretical approach to AL, example quality, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Optimization and Search Problems
