Towards Sharper Risk Bounds for Minimax Problems

Bowei Zhu; Shaojie Li; Yong Liu

arXiv:2410.08497·cs.LG·October 14, 2024

Towards Sharper Risk Bounds for Minimax Problems

Bowei Zhu, Shaojie Li, Yong Liu

PDF

TL;DR

This paper derives sharper generalization bounds for minimax problems in machine learning, improving the theoretical understanding of excess risk and analyzing popular algorithms under various conditions.

Contribution

It introduces a new high probability generalization error bound for nonconvex-strongly-concave minimax problems and provides dimension-independent results under PL condition.

Findings

01

Sharper high probability generalization error bounds for NC-SC problems

02

Dimension-independent results under Polyak-Lojasiewicz condition

03

Excess primal risk bounds are n times faster than existing results

Abstract

Minimax problems have achieved success in machine learning such as adversarial training, robust optimization, reinforcement learning. For theoretical analysis, current optimal excess risk bounds, which are composed by generalization error and optimization error, present 1/n-rates in strongly-convex-strongly-concave (SC-SC) settings. Existing studies mainly focus on minimax problems with specific algorithms for optimization error, with only a few studies on generalization performance, which limit better excess risk bounds. In this paper, we study the generalization bounds measured by the gradients of primal functions using uniform localized convergence. We obtain a sharper high probability generalization error bound for nonconvex-strongly-concave (NC-SC) stochastic minimax problems. Furthermore, we provide dimension-independent results under Polyak-Lojasiewicz condition for the outer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus