On the Stability and Generalization of First-order Bilevel Minimax Optimization

Xuelin Zhang; Peipei Yuan

arXiv:2604.20115·cs.LG·April 23, 2026

On the Stability and Generalization of First-order Bilevel Minimax Optimization

Xuelin Zhang, Peipei Yuan

PDF

TL;DR

This paper provides the first theoretical analysis of how well first-order bilevel minimax optimization algorithms generalize, using stability arguments and empirical validation.

Contribution

It introduces a systematic generalization analysis for gradient-based bilevel minimax algorithms, filling a key theoretical gap in the field.

Findings

01

Derived fine-grained generalization bounds for three algorithms

02

Revealed a trade-off among stability, generalization, and practical settings

03

Empirical results support theoretical insights

Abstract

Bilevel optimization and bilevel minimax optimization have recently emerged as unifying frameworks for a range of machine-learning tasks, including hyperparameter optimization and reinforcement learning. The existing literature focuses on empirical efficiency and convergence guarantees, leaving a critical theoretical gap in understanding how well these algorithms generalize. To bridge this gap, we provide the first systematic generalization analysis for first-order gradient-based bilevel minimax solvers with lower-level minimax problems. Specifically, by leveraging algorithmic stability arguments, we derive fine-grained generalization bounds for three representative algorithms, including single-timescale stochastic gradient descent-ascent, and two variants of two-timescale stochastic gradient descent-ascent. Our results reveal a precise trade-off among algorithmic stability,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.