Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax   Optimization

Siqi Zhang; Yifan Hu; Liang Zhang; Niao He

arXiv:2205.14278·math.OC·February 8, 2023·1 cites

Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax Optimization

Siqi Zhang, Yifan Hu, Liang Zhang, Niao He

PDF

Open Access

TL;DR

This paper systematically investigates the generalization bounds of algorithms for nonconvex-(strongly)-concave stochastic minimax optimization, providing both algorithm-agnostic and algorithm-dependent bounds with novel stability concepts.

Contribution

It introduces a comprehensive analysis of generalization bounds for nonconvex stochastic minimax problems, including new stability notions and bounds for SGDA and related algorithms.

Findings

01

Sample complexity for NC-SC is (d\u00b7\u03ba^2\u00b7\u03b5^{-2})

02

Sample complexity for NC-C is (d\u00b7\u03b5^{-4})

03

Established stability-based generalization bounds for SGDA

Abstract

This paper takes an initial step to systematically investigate the generalization bounds of algorithms for solving nonconvex-(strongly)-concave (NC-SC/NC-C) stochastic minimax optimization measured by the stationarity of primal functions. We first establish algorithm-agnostic generalization bounds via uniform convergence between the empirical minimax problem and the population minimax problem. The sample complexities for achieving $ϵ$ -generalization are $\tilde{O} (d κ^{2} ϵ^{- 2})$ and $\tilde{O} (d ϵ^{- 4})$ for NC-SC and NC-C settings, respectively, where $d$ is the dimension and $κ$ is the condition number. We further study the algorithm-dependent generalization bounds via stability arguments of algorithms. In particular, we introduce a novel stability notion for minimax problems and build a connection between generalization bounds and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods