Agnostic Learning by Refuting
Pravesh K. Kothari, Roi Livni

TL;DR
This paper introduces refutation complexity as a computational measure that exactly characterizes the sample complexity of efficient agnostic learning for Boolean functions, linking it to the difficulty of distinguishing structured data from noise.
Contribution
It defines refutation complexity and proves its equivalence to the sample complexity of efficient agnostic learning, connecting learning theory with refutation of random constraint satisfaction problems.
Findings
Refutation complexity characterizes efficient agnostic learning.
The relationship between refutation and learning is made explicit.
Connections to previous work on PAC learning and CSP refutation.
Abstract
The sample complexity of learning a Boolean-valued function class is precisely characterized by its Rademacher complexity. This has little bearing, however, on the sample complexity of \emph{efficient} agnostic learning. We introduce \emph{refutation complexity}, a natural computational analog of Rademacher complexity of a Boolean concept class and show that it exactly characterizes the sample complexity of \emph{efficient} agnostic learning. Informally, refutation complexity of a class is the minimum number of example-label pairs required to efficiently distinguish between the case that the labels correlate with the evaluation of some member of (\emph{structure}) and the case where the labels are i.i.d. Rademacher random variables (\emph{noise}). The easy direction of this relationship was implicitly used in the recent framework for improper PAC learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Complexity and Algorithms in Graphs
