Deep Active Learning over the Long Tail

Yonatan Geifman; Ran El-Yaniv

arXiv:1711.00941·cs.LG·November 6, 2017·86 cites

Deep Active Learning over the Long Tail

Yonatan Geifman, Ran El-Yaniv

PDF

Open Access

TL;DR

This paper introduces a novel deep active learning algorithm that leverages farthest-first traversal in neural activation space, significantly reducing sample complexity and outperforming traditional uncertainty sampling on multiple datasets.

Contribution

The paper presents a new active learning method based on farthest-first traversal in neural activation space, improving sample efficiency over passive and uncertainty sampling methods.

Findings

01

Significant reduction in sample complexity on MNIST, CIFAR-10, and CIFAR-100.

02

Outperforms traditional uncertainty sampling techniques.

03

Identifies limitations of uncertainty sampling in certain cases.

Abstract

This paper is concerned with pool-based active learning for deep neural networks. Motivated by coreset dataset compression ideas, we present a novel active learning algorithm that queries consecutive points from the pool using farthest-first traversals in the space of neural activation over a representation layer. We show consistent and overwhelming improvement in sample complexity over passive learning (random sampling) for three datasets: MNIST, CIFAR-10, and CIFAR-100. In addition, our algorithm outperforms the traditional uncertainty sampling technique (obtained using softmax activations), and we identify cases where uncertainty sampling is only slightly better than random sampling.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Machine Learning and Data Classification

MethodsSoftmax