Towards Sharper Risk Bounds for Minimax Problems
Bowei Zhu, Shaojie Li, Yong Liu

TL;DR
This paper derives sharper generalization bounds for minimax problems in machine learning, improving the theoretical understanding of excess risk and analyzing popular algorithms under various conditions.
Contribution
It introduces a new high probability generalization error bound for nonconvex-strongly-concave minimax problems and provides dimension-independent results under PL condition.
Findings
Sharper high probability generalization error bounds for NC-SC problems
Dimension-independent results under Polyak-Lojasiewicz condition
Excess primal risk bounds are n times faster than existing results
Abstract
Minimax problems have achieved success in machine learning such as adversarial training, robust optimization, reinforcement learning. For theoretical analysis, current optimal excess risk bounds, which are composed by generalization error and optimization error, present 1/n-rates in strongly-convex-strongly-concave (SC-SC) settings. Existing studies mainly focus on minimax problems with specific algorithms for optimization error, with only a few studies on generalization performance, which limit better excess risk bounds. In this paper, we study the generalization bounds measured by the gradients of primal functions using uniform localized convergence. We obtain a sharper high probability generalization error bound for nonconvex-strongly-concave (NC-SC) stochastic minimax problems. Furthermore, we provide dimension-independent results under Polyak-Lojasiewicz condition for the outer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
