Error Exponent in Agnostic PAC Learning

Adi Hendel; Meir Feder

arXiv:2405.00792·cs.LG·May 3, 2024

Error Exponent in Agnostic PAC Learning

Adi Hendel, Meir Feder

PDF

Open Access

TL;DR

This paper introduces an analysis of PAC learning using the error exponent from Information Theory, providing improved bounds on the exponential decay of error probability in agnostic binary classification.

Contribution

It establishes a new distribution-dependent error exponent for agnostic PAC learning, showing it can match realizable learning under certain stability conditions.

Findings

01

Improved error exponent bounds under stability assumptions

02

Agnostic learning can have the same error exponent as realizable learning

03

Application of error exponent analysis to knowledge distillation

Abstract

Statistical learning theory and the Probably Approximately Correct (PAC) criterion are the common approach to mathematical learning theory. PAC is widely used to analyze learning problems and algorithms, and have been studied thoroughly. Uniform worst case bounds on the convergence rate have been well established using, e.g., VC theory or Radamacher complexity. However, in a typical scenario the performance could be much better. In this paper, we consider PAC learning using a somewhat different tradeoff, the error exponent - a well established analysis method in Information Theory - which describes the exponential behavior of the probability that the risk will exceed a certain threshold as function of the sample size. We focus on binary classification and find, under some stability assumptions, an improved distribution dependent error exponent for a wide range of problems, establishing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems

MethodsFocus