Sharper bounds for uniformly stable algorithms
Olivier Bousquet, Yegor Klochkov, Nikita Zhivotovskiy

TL;DR
This paper improves the theoretical understanding of the generalization bounds for uniformly stable algorithms by providing sharper upper bounds, proving their optimality, and introducing a new concentration inequality for weakly correlated variables.
Contribution
It offers a new, stronger generalization bound for stable algorithms, proves its near-optimality, and introduces a novel concentration inequality for weakly correlated random variables.
Findings
New generalization bound surpasses previous results
Proved the bound is sharp up to a logarithmic factor
Introduced a new concentration inequality for weakly correlated variables
Abstract
Deriving generalization bounds for stable algorithms is a classical question in learning theory taking its roots in the early works by Vapnik and Chervonenkis (1974) and Rogers and Wagner (1978). In a series of recent breakthrough papers by Feldman and Vondrak (2018, 2019), it was shown that the best known high probability upper bounds for uniformly stable learning algorithms due to Bousquet and Elisseef (2002) are sub-optimal in some natural regimes. To do so, they proved two generalization bounds that significantly outperform the simple generalization bound of Bousquet and Elisseef (2002). Feldman and Vondrak also asked if it is possible to provide sharper bounds and prove corresponding high probability lower bounds. This paper is devoted to these questions: firstly, inspired by the original arguments of Feldman and Vondrak (2019), we provide a short proof of the moment bound that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Domain Adaptation and Few-Shot Learning
