Sharper lower bounds on the performance of the empirical risk minimization algorithm
Guillaume Lecu\'e, Shahar Mendelson

TL;DR
This paper establishes sharper lower bounds on the excess risk of the empirical risk minimization algorithm by leveraging advanced probabilistic theorems and geometric assumptions, highlighting fundamental limitations in learning performance.
Contribution
It introduces new lower bounds on ERM performance based on Gaussian process theory and geometric conditions, improving understanding of fundamental learning limits.
Findings
Lower bounds depend on Gaussian process supremum and oscillation parameters.
Bounds are sharper under specific geometric assumptions.
Provides theoretical limits on ERM excess risk performance.
Abstract
We present an argument based on the multidimensional and the uniform central limit theorems, proving that, under some geometrical assumptions between the target function and the learning class , the excess risk of the empirical risk minimization algorithm is lower bounded by \[\frac{\mathbb{E}\sup_{q\in Q}G_q}{\sqrt{n}}\delta,\] where is a canonical Gaussian process associated with (a well chosen subset of ) and is a parameter governing the oscillations of the empirical excess risk function over a small ball in .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
