Batch Active Learning via Coordinated Matching
Javad Azimi (Oregon State University), Alan Fern (Oregon State, University), Xiaoli Zhang-Fern (Oregon State University), Glencora Borradaile, (Oregon State University), Brent Heeringa (Williams College)

TL;DR
This paper introduces a novel batch active learning method that approximates sequential policies using Monte-Carlo simulation and coordinated matching, enabling efficient batch selection for labeling.
Contribution
It proposes a new algorithm for batch active learning that matches the distribution of sequential policies, with an efficient greedy solution and theoretical approximation guarantees.
Findings
Effective on eight benchmark datasets
Outperforms existing batch active learning methods
Provides a scalable approximation to sequential policies
Abstract
Most prior work on active learning of classifiers has focused on sequentially selecting one unlabeled example at a time to be labeled in order to reduce the overall labeling effort. In many scenarios, however, it is desirable to label an entire batch of examples at once, for example, when labels can be acquired in parallel. This motivates us to study batch active learning, which iteratively selects batches of examples to be labeled. We propose a novel batch active learning method that leverages the availability of high-quality and efficient sequential active-learning policies by attempting to approximate their behavior when applied for steps. Specifically, our algorithm first uses Monte-Carlo simulation to estimate the distribution of unlabeled examples selected by a sequential policy over step executions. The algorithm then attempts to select a set of examples that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Optimization and Search Problems
