Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae; Junhyug Noh; Danica J. Sutherland

arXiv:2407.12212·cs.LG·July 26, 2024

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

PDF

Open Access

TL;DR

This paper introduces MaxHerding, a robust and efficient generalized coverage method for low-budget active learning that outperforms existing approaches in image classification tasks.

Contribution

It proposes a generalized coverage framework and a new greedy algorithm, MaxHerding, which improves robustness and performance over ProbCover in low-budget active learning.

Findings

01

MaxHerding outperforms existing active learning methods on multiple benchmarks.

02

The method is more robust to hyper-parameter choices than ProbCover.

03

MaxHerding requires less computational cost than most competitive methods.

Abstract

The ProbCover method of Yehuda et al. is a well-motivated algorithm for active learning in low-budget regimes, which attempts to "cover" the data distribution with balls of a given radius at selected data points. We demonstrate, however, that the performance of this algorithm is extremely sensitive to the choice of this radius hyper-parameter, and that tuning it is quite difficult, with the original heuristic frequently failing. We thus introduce (and theoretically motivate) a generalized notion of "coverage," including ProbCover's objective as a special case, but also allowing smoother notions that are far more robust to hyper-parameter choice. We propose an efficient greedy method to optimize this coverage, generalizing ProbCover's algorithm; due to its close connection to kernel herding, we call it "MaxHerding." The objective can also be optimized non-greedily through a variant of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms