Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization
Yan Yan, Yi Xu, Qihang Lin, Wei Liu, Tianbao Yang

TL;DR
This paper introduces an epoch-wise stochastic gradient descent ascent method (Epoch-GDA) that achieves the optimal $O(1/T)$ convergence rate for the duality gap in strongly convex strongly concave min-max problems without additional assumptions.
Contribution
It provides the first sharp analysis of Epoch-GDA for SCSC min-max problems, achieving optimal convergence without extra smoothness or structural assumptions.
Findings
Epoch-GDA attains $O(1/T)$ duality gap convergence rate.
The analysis applies to general SCSC problems without extra assumptions.
Key lemma extends to weakly-convex strongly-concave problems.
Abstract
Epoch gradient descent method (a.k.a. Epoch-GD) proposed by Hazan and Kale (2011) was deemed a breakthrough for stochastic strongly convex minimization, which achieves the optimal convergence rate of with iterative updates for the {\it objective gap}. However, its extension to solving stochastic min-max problems with strong convexity and strong concavity still remains open, and it is still unclear whether a fast rate of for the {\it duality gap} is achievable for stochastic min-max optimization under strong convexity and strong concavity. Although some recent studies have proposed stochastic algorithms with fast convergence rates for min-max problems, they require additional assumptions about the problem, e.g., smoothness, bi-linear structure, etc. In this paper, we bridge this gap by providing a sharp analysis of epoch-wise stochastic gradient descent ascent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs
