Statistical Active Learning Algorithms for Noise Tolerance and Differential Privacy
Maria Florina Balcan, Vitaly Feldman

TL;DR
This paper introduces a framework for designing active learning algorithms that are robust to noise and preserve privacy, achieving optimal noise tolerance and exponential label savings over passive methods.
Contribution
It presents a generic method to convert statistical active learning algorithms into noise-tolerant and differentially-private versions, with optimal complexity and broad applicability.
Findings
Efficient algorithms for learning thresholds, rectangles, and linear separators under noise.
Optimal quadratic dependence on noise rate in complexity.
Exponential label savings in private active learning.
Abstract
We describe a framework for designing efficient active learning algorithms that are tolerant to random classification noise and are differentially-private. The framework is based on active learning algorithms that are statistical in the sense that they rely on estimates of expectations of functions of filtered random examples. It builds on the powerful statistical query framework of Kearns (1993). We show that any efficient active statistical learning algorithm can be automatically converted to an efficient active learning algorithm which is tolerant to random classification noise as well as other forms of "uncorrelated" noise. The complexity of the resulting algorithms has information-theoretically optimal quadratic dependence on , where is the noise rate. We show that commonly studied concept classes including thresholds, rectangles, and linear separators can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
