A Structured Perspective of Volumes on Active Learning
Xiaofeng Cao, Ivor W. Tsang, Guandong Xu

TL;DR
This paper introduces a structured volume perspective on active learning, proposing a new theoretical framework and algorithm that optimize hypothesis space volumes to improve label efficiency.
Contribution
It develops a volume-based theoretical framework for active learning, introducing the Volume-based Model and VAL algorithm to optimize hypothesis space coverage.
Findings
Provides provable guarantees for volume-based active learning strategies.
Introduces the Volume-based Model to improve sampling targets.
Proposes the VAL algorithm for practical implementation.
Abstract
Active Learning (AL) is a learning task that requires learners interactively query the labels of the sampled unlabeled instances to minimize the training outputs with human supervisions. In theoretical study, learners approximate the version space which covers all possible classification hypothesis into a bounded convex body and try to shrink the volume of it into a half-space by a given cut size. However, only the hypersphere with finite VC dimensions has obtained formal approximation guarantees that hold when the classes of Euclidean space are separable with a margin. In this paper, we approximate the version space to a structured {hypersphere} that covers most of the hypotheses, and then divide the available AL sampling approaches into two kinds of strategies: Outer Volume Sampling and Inner Volume Sampling. After providing provable guarantees for the performance of AL in version…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression
