On Stability and Generalization of Bilevel Optimization Problem
Meng Ding, Mingxi Lei, Yunwen Lei, Di Wang, Jinhui Xu

TL;DR
This paper analyzes the generalization behavior of gradient-based bilevel optimization methods, providing new stability bounds and improved theoretical guarantees applicable across various convex and nonconvex settings, with empirical validation.
Contribution
It establishes a fundamental link between stability and generalization in bilevel optimization, offers the first stability bounds for both inner and outer parameters, and improves generalization bounds from o(n) to o(n).
Findings
Improved generalization bound from o(n) to o(n).
First stability bounds for both inner and outer parameters.
Experimental validation on meta-learning and hyper-parameter optimization.
Abstract
(Stochastic) bilevel optimization is a frequently encountered problem in machine learning with a wide range of applications such as meta-learning, hyper-parameter optimization, and reinforcement learning. Most of the existing studies on this problem only focused on analyzing the convergence or improving the convergence rate, while little effort has been devoted to understanding its generalization behaviors. In this paper, we conduct a thorough analysis on the generalization of first-order (gradient-based) methods for the bilevel optimization problem. We first establish a fundamental connection between algorithmic stability and generalization error in different forms and give a high probability generalization bound which improves the previous best one from to , where is the sample size. We then provide the first stability bounds for the general case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Bone and Joint Diseases
