A Structured Perspective of Volumes on Active Learning

Xiaofeng Cao; Ivor W. Tsang; Guandong Xu

arXiv:1807.08904·cs.LG·September 29, 2020

A Structured Perspective of Volumes on Active Learning

Xiaofeng Cao, Ivor W. Tsang, Guandong Xu

PDF

Open Access

TL;DR

This paper introduces a structured volume perspective on active learning, proposing a new theoretical framework and algorithm that optimize hypothesis space volumes to improve label efficiency.

Contribution

It develops a volume-based theoretical framework for active learning, introducing the Volume-based Model and VAL algorithm to optimize hypothesis space coverage.

Findings

01

Provides provable guarantees for volume-based active learning strategies.

02

Introduces the Volume-based Model to improve sampling targets.

03

Proposes the VAL algorithm for practical implementation.

Abstract

Active Learning (AL) is a learning task that requires learners interactively query the labels of the sampled unlabeled instances to minimize the training outputs with human supervisions. In theoretical study, learners approximate the version space which covers all possible classification hypothesis into a bounded convex body and try to shrink the volume of it into a half-space by a given cut size. However, only the hypersphere with finite VC dimensions has obtained formal approximation guarantees that hold when the classes of Euclidean space are separable with a margin. In this paper, we approximate the version space to a structured {hypersphere} that covers most of the hypotheses, and then divide the available AL sampling approaches into two kinds of strategies: Outer Volume Sampling and Inner Volume Sampling. After providing provable guarantees for the performance of AL in version…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression