Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax Optimization
Siqi Zhang, Yifan Hu, Liang Zhang, Niao He

TL;DR
This paper systematically investigates the generalization bounds of algorithms for nonconvex-(strongly)-concave stochastic minimax optimization, providing both algorithm-agnostic and algorithm-dependent bounds with novel stability concepts.
Contribution
It introduces a comprehensive analysis of generalization bounds for nonconvex stochastic minimax problems, including new stability notions and bounds for SGDA and related algorithms.
Findings
Sample complexity for NC-SC is (d\u00b7\u03ba^2\u00b7\u03b5^{-2})
Sample complexity for NC-C is (d\u00b7\u03b5^{-4})
Established stability-based generalization bounds for SGDA
Abstract
This paper takes an initial step to systematically investigate the generalization bounds of algorithms for solving nonconvex-(strongly)-concave (NC-SC/NC-C) stochastic minimax optimization measured by the stationarity of primal functions. We first establish algorithm-agnostic generalization bounds via uniform convergence between the empirical minimax problem and the population minimax problem. The sample complexities for achieving -generalization are and for NC-SC and NC-C settings, respectively, where is the dimension and is the condition number. We further study the algorithm-dependent generalization bounds via stability arguments of algorithms. In particular, we introduce a novel stability notion for minimax problems and build a connection between generalization bounds and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods
