Targeting Optimal Active Learning via Example Quality

Lewis P. G. Evans; Niall M. Adams; Christoforos Anagnostopoulos

arXiv:1407.8042·stat.ML·July 31, 2014

Targeting Optimal Active Learning via Example Quality

Lewis P. G. Evans, Niall M. Adams, Christoforos Anagnostopoulos

PDF

Open Access

TL;DR

This paper introduces a theoretical framework for active learning based on expected loss reduction, providing insights and algorithms that outperform existing heuristics in selecting optimal examples for labeling.

Contribution

It presents a novel theoretical approach to active learning using example quality and develops algorithms to estimate this quality directly.

Findings

01

Algorithms based on example quality are competitive with standard AL methods.

02

Optimal selection based on expected loss reduction can outperform heuristic methods.

03

Theoretical insights clarify limitations of existing AL heuristics.

Abstract

In many classification problems unlabelled data is abundant and a subset can be chosen for labelling. This defines the context of active learning (AL), where methods systematically select that subset, to improve a classifier by retraining. Given a classification problem, and a classifier trained on a small number of labelled examples, consider the selection of a single further example. This example will be labelled by the oracle and then used to retrain the classifier. This example selection raises a central question: given a fully specified stochastic description of the classification problem, which example is the optimal selection? If optimality is defined in terms of loss, this definition directly produces expected loss reduction (ELR), a central quantity whose maximum yields the optimal example selection. This work presents a new theoretical approach to AL, example quality, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Optimization and Search Problems