Active Tolerant Testing
Avrim Blum, Lunjia Hu

TL;DR
This paper introduces the first algorithms for tolerant testing of certain classes in the active model, enabling error estimation with minimal label queries and independent of VC-dimension, with applications to performance estimation and hyperparameter tuning.
Contribution
It provides novel algorithms for tolerant testing in the active model for classes like unions of intervals, extending prior non-tolerant work and enabling efficient error and performance estimation.
Findings
Label query complexity is independent of VC-dimension.
Algorithms accurately estimate error rates with few labels.
Applications include hyperparameter tuning and performance estimation.
Abstract
In this work, we give the first algorithms for tolerant testing of nontrivial classes in the active model: estimating the distance of a target function to a hypothesis class C with respect to some arbitrary distribution D, using only a small number of label queries to a polynomial-sized pool of unlabeled examples drawn from D. Specifically, we show that for the class D of unions of d intervals on the line, we can estimate the error rate of the best hypothesis in the class to an additive error epsilon from only label queries to an unlabeled pool of size . The key point here is the number of labels needed is independent of the VC-dimension of the class. This extends the work of Balcan et al. [2012] who solved the non-tolerant testing problem for this class (distinguishing the zero-error case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Optimization and Search Problems
