Stochastic Variance Reduction Methods for Saddle-Point Problems
P Balamurugan (SIERRA, LIENS), Francis Bach (SIERRA, LIENS)

TL;DR
This paper introduces stochastic variance reduction algorithms with linear convergence for large-scale convex-concave saddle-point problems, extending existing methods and addressing challenges in convergence analysis and sampling strategies.
Contribution
It extends stochastic variance reduction methods to saddle-point problems, using monotone operator theory and non-uniform sampling for improved convergence and efficiency.
Findings
Algorithms achieve linear convergence in large-scale settings
Non-uniform sampling enhances efficiency both theoretically and practically
Accelerated algorithms outperform traditional batch methods
Abstract
We consider convex-concave saddle-point problems where the objective functions may be split in many components, and extend recent stochastic variance reduction methods (such as SVRG or SAGA) to provide the first large-scale linearly convergent algorithms for this class of problems which is common in machine learning. While the algorithmic extension is straightforward, it comes with challenges and opportunities: (a) the convex minimization analysis does not apply and we use the notion of monotone operators to prove convergence, showing in particular that the same algorithm applies to a larger class of problems, such as variational inequalities, (b) there are two notions of splits, in terms of functions, or in terms of partial derivatives, (c) the split does need to be done with convex-concave terms, (d) non-uniform sampling is key to an efficient algorithm, both in theory and practice,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
