Nuclear Discrepancy for Active Learning

Tom J. Viering; Jesse H. Krijthe; Marco Loog

arXiv:1706.02645 · cs.LG · June 9, 2017 · 1 citation


Open Access

TL;DR

This paper introduces the Nuclear Discrepancy bound for active learning and shows that an active learner minimizing this looser, probabilistically motivated bound can outperform learners that minimize tighter bounds.

Contribution

The paper proposes the Nuclear Discrepancy bound for active learning, relates it to the MMD and Discrepancy bounds, and compares the resulting active learners experimentally.

Findings

01. Nuclear Discrepancy bound outperforms tighter bounds in practice

02. Looser bounds focusing on realistic scenarios can lead to better active learning

03. Empirical results confirm the probabilistic motivation behind the new bound

Abstract

Active learning algorithms propose which unlabeled objects should be queried for their labels to improve a predictive model the most. We study active learners that minimize generalization bounds and uncover relationships between these bounds that lead to an improved approach to active learning. In particular, we show the relation between the bound of the state-of-the-art Maximum Mean Discrepancy (MMD) active learner, the bound of the Discrepancy, and a new and looser bound that we refer to as the Nuclear Discrepancy bound. We motivate this bound by a probabilistic argument: we show it considers situations which are more likely to occur. Our experiments indicate that active learning using the tightest Discrepancy bound performs the worst in terms of the squared loss. Overall, our proposed loosest Nuclear Discrepancy generalization bound performs the best. We confirm our probabilistic…
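To make the idea of an active learner that minimizes a distribution-mismatch quantity more concrete, the following is a minimal sketch of a greedy selector based on the standard biased empirical estimate of squared MMD. It is not the algorithm from the paper. The Gaussian kernel, the `bandwidth` value, the greedy one-at-a-time strategy, the choice to compare the whole pool with the labeled subset, and all function names are assumptions made for illustration only.

```python
import math


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two points given as tuples of floats."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd_squared(sample_p, sample_q, bandwidth=1.0):
    """Biased empirical estimate of the squared MMD between two samples."""
    return (
        mean_kernel(sample_p, sample_p, bandwidth)
        + mean_kernel(sample_q, sample_q, bandwidth)
        - 2.0 * mean_kernel(sample_p, sample_q, bandwidth)
    )


def greedy_query(pool, labeled_idx, bandwidth=1.0):
    """Pick the unlabeled pool index whose addition to the labeled set
    gives the smallest squared MMD between the pool and the labeled set.
    (Hypothetical helper: a simplified stand-in for an MMD-style learner.)"""
    best_idx, best_score = None, math.inf
    for i in range(len(pool)):
        if i in labeled_idx:
            continue
        candidate = [pool[j] for j in labeled_idx] + [pool[i]]
        score = mmd_squared(pool, candidate, bandwidth)
        if score < best_score:
            best_idx, best_score = i, score
    return best_idx, best_score


# Toy pool with two well-separated clusters; one point is already labeled.
pool = [(0.0,), (0.1,), (5.0,), (5.1,)]
chosen, score = greedy_query(pool, labeled_idx=[0])
print(chosen, score)
```

In this toy pool the selector prefers a point from the cluster that has no labeled representative yet, since that brings the labeled sample closer to the pool in the MMD sense. The Discrepancy and Nuclear Discrepancy learners mentioned in the abstract would replace `mmd_squared` with a different quantity, which this sketch does not attempt to reproduce.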

Taxonomy

Topics: Machine Learning and Algorithms