Boosting the Confidence of Generalization for $L_2$-Stable Randomized Learning Algorithms
Xiao-Tong Yuan, Ping Li

TL;DR
This paper improves confidence in the generalization ability of $L_2$-stable randomized learning algorithms by establishing near-tight exponential bounds, especially for stochastic gradient descent in convex and non-convex settings.
Contribution
It introduces a novel confidence-boosting framework that relaxes stability assumptions, leading to near-tight exponential generalization bounds for $L_2$-stable algorithms including SGD.
Findings
Established in-expectation first moment generalization error bounds for $L_2$-stability.
Designed subbagging process yields near-tight exponential bounds.
Derived improved high-probability bounds for SGD in convex/non-convex optimization.
Abstract
Exponential generalization bounds with near-tight rates have recently been established for uniformly stable learning algorithms. The notion of uniform stability, however, is stringent in the sense that it is invariant to the data-generating distribution. Under the weaker and distribution dependent notions of stability such as hypothesis stability and -stability, the literature suggests that only polynomial generalization bounds are possible in general cases. The present paper addresses this long standing tension between these two regimes of results and makes progress towards relaxing it inside a classic framework of confidence-boosting. To this end, we first establish an in-expectation first moment generalization error bound for potentially randomized learning algorithms with -stability, based on which we then show that a properly designed subbagging process leads to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
