Improved Algorithms for Efficient Active Learning Halfspaces with   Massart and Tsybakov noise

Chicheng Zhang; Yinan Li

arXiv:2102.05312·cs.LG·August 12, 2021

Improved Algorithms for Efficient Active Learning Halfspaces with Massart and Tsybakov noise

Chicheng Zhang, Yinan Li

PDF

Open Access

TL;DR

This paper introduces a computationally efficient active learning algorithm for halfspaces that tolerates Massart and Tsybakov noise, achieving near-optimal label complexity under various data distributions.

Contribution

The paper presents the first efficient active learning algorithms for halfspaces under Massart and Tsybakov noise with provably improved label complexity bounds.

Findings

01

Achieves near-optimal label complexity in Massart noise setting.

02

Provides lower label complexity guarantees than passive learning under Tsybakov noise.

03

Works under a broad class of structured data distributions.

Abstract

We give a computationally-efficient PAC active learning algorithm for $d$ -dimensional homogeneous halfspaces that can tolerate Massart noise (Massart and N\'ed\'elec, 2006) and Tsybakov noise (Tsybakov, 2004). Specialized to the $η$ -Massart noise setting, our algorithm achieves an information-theoretically near-optimal label complexity of $\tilde{O} (\frac{d}{( 1 - 2 η ) ^{2}} polylog (\frac{1}{ϵ}))$ under a wide range of unlabeled data distributions (specifically, the family of "structured distributions" defined in Diakonikolas et al. (2020)). Under the more challenging Tsybakov noise condition, we identify two subfamilies of noise conditions, under which our efficient algorithm provides label complexity guarantees strictly lower than passive learning algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Advanced Bandit Algorithms Research