Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler
Yifang Chen, Karthik Sankararaman, Alessandro Lazaric, Matteo Pirotta,, Dmytro Karamshuk, Qifan Wang, Karishma Mandyam, Sinong Wang, Han Fang

TL;DR
This paper introduces WL-AC, a novel active learning algorithm that effectively leverages weak labelers without assuming perfect accuracy, reducing label queries while maintaining high accuracy in large-scale settings.
Contribution
The paper proposes WL-AC, a robust, realizability-free active learning framework that adapts to various weak labelers and achieves optimal query complexity without prior knowledge of labeler accuracy.
Findings
Reduces label queries significantly on corrupted-MNIST dataset.
Maintains accuracy comparable to passive learning with fewer labels.
Applicable to large-scale models like deep neural networks.
Abstract
Active learning with strong and weak labelers considers a practical setting where we have access to both costly but accurate strong labelers and inaccurate but cheap predictions provided by weak labelers. We study this problem in the streaming setting, where decisions must be taken \textit{online}. We design a novel algorithmic template, Weak Labeler Active Cover (WL-AC), that is able to robustly leverage the lower quality weak labelers to reduce the query complexity while retaining the desired level of accuracy. Prior active learning algorithms with access to weak labelers learn a difference classifier which predicts where the weak labels differ from strong labelers; this requires the strong assumption of realizability of the difference classifier (Zhang and Chaudhuri,2015). WL-AC bypasses this \textit{realizability} assumption and thus is applicable to many real-world scenarios such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Machine Learning and Data Classification
