Batch Active Learning Using Determinantal Point Processes

Erdem B{\i}y{\i}k; Kenneth Wang; Nima Anari; Dorsa Sadigh

arXiv:1906.07975·cs.LG·June 20, 2019·22 cites

Batch Active Learning Using Determinantal Point Processes

Erdem B{\i}y{\i}k, Kenneth Wang, Nima Anari, Dorsa Sadigh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel batch active learning approach using Determinantal Point Processes to efficiently select diverse data samples, reducing redundancy and computational costs in data labeling.

Contribution

It proposes a principled DPP-based method for batch active learning with theoretical guarantees and demonstrates competitive performance against existing methods.

Findings

01

DPP-based method effectively selects diverse batches

02

The algorithms have provable approximation guarantees

03

Experimental results outperform state-of-the-art baselines

Abstract

Data collection and labeling is one of the main challenges in employing machine learning algorithms in a variety of real-world applications with limited data. While active learning methods attempt to tackle this issue by labeling only the data samples that give high information, they generally suffer from large computational costs and are impractical in settings where data can be collected in parallel. Batch active learning methods attempt to overcome this computational burden by querying batches of samples at a time. To avoid redundancy between samples, previous works rely on some ad hoc combination of sample quality and diversity. In this paper, we present a new principled batch active learning method using Determinantal Point Processes, a repulsive point process that enables generating diverse batches of samples. We develop tractable algorithms to approximate the mode of a DPP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Stanford-ILIAD/DPP-Batch-Active-Learning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Optimization and Search Problems · Complexity and Algorithms in Graphs