Switch and Conquer: Efficient Algorithms By Switching Stochastic Gradient Oracles For Decentralized Saddle Point Problems
Chhavi Sharma, Vishnu Narayanan, P. Balamurugan

TL;DR
This paper introduces a novel decentralized saddle point optimization algorithm that switches between stochastic gradient oracles to improve convergence speed and accuracy, especially in the initial phases.
Contribution
The paper proposes a switching stochastic gradient method for decentralized saddle point problems, combining GSG and SVRG oracles for faster convergence.
Findings
C-DPSSG converges linearly to an $$-accurate saddle point.
Switching between GSG and SVRG oracles accelerates initial progress and improves low/medium accuracy solutions.
Numerical experiments demonstrate competitive performance on machine learning benchmarks.
Abstract
We consider a class of non-smooth strongly convex-strongly concave saddle point problems in a decentralized setting without a central server. To solve a consensus formulation of problems in this class, we develop an inexact primal dual hybrid gradient (inexact PDHG) procedure that allows generic gradient computation oracles to update the primal and dual variables. We first investigate the performance of inexact PDHG with stochastic variance reduction gradient (SVRG) oracle. Our numerical study uncovers a significant phenomenon of initial conservative progress of iterates of IPDHG with SVRG oracle. To tackle this, we develop a simple and effective switching idea, where a generalized stochastic gradient (GSG) computation oracle is employed to hasten the iterates' progress to a saddle point solution during the initial phase of updates, followed by a switch to the SVRG oracle at an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Markov Chains and Monte Carlo Methods
