Stochastic Variance Reduction Methods for Saddle-Point Problems

P Balamurugan (SIERRA; LIENS); Francis Bach (SIERRA; LIENS)

arXiv:1605.06398·cs.LG·November 4, 2016·22 cites

Stochastic Variance Reduction Methods for Saddle-Point Problems

P Balamurugan (SIERRA, LIENS), Francis Bach (SIERRA, LIENS)

PDF

Open Access

TL;DR

This paper introduces stochastic variance reduction algorithms with linear convergence for large-scale convex-concave saddle-point problems, extending existing methods and addressing challenges in convergence analysis and sampling strategies.

Contribution

It extends stochastic variance reduction methods to saddle-point problems, using monotone operator theory and non-uniform sampling for improved convergence and efficiency.

Findings

01

Algorithms achieve linear convergence in large-scale settings

02

Non-uniform sampling enhances efficiency both theoretically and practically

03

Accelerated algorithms outperform traditional batch methods

Abstract

We consider convex-concave saddle-point problems where the objective functions may be split in many components, and extend recent stochastic variance reduction methods (such as SVRG or SAGA) to provide the first large-scale linearly convergent algorithms for this class of problems which is common in machine learning. While the algorithmic extension is straightforward, it comes with challenges and opportunities: (a) the convex minimization analysis does not apply and we use the notion of monotone operators to prove convergence, showing in particular that the same algorithm applies to a larger class of problems, such as variational inequalities, (b) there are two notions of splits, in terms of functions, or in terms of partial derivatives, (c) the split does need to be done with convex-concave terms, (d) non-uniform sampling is key to an efficient algorithm, both in theory and practice,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference