DSA: Decentralized Double Stochastic Averaging Gradient Algorithm
Aryan Mokhtari, Alejandro Ribeiro

TL;DR
The paper introduces DSA, a decentralized stochastic optimization algorithm that achieves linear convergence for large-scale machine learning problems, outperforming existing methods in convergence speed and efficiency.
Contribution
It proposes the DSA algorithm utilizing local stochastic averaging gradients with proven linear convergence under strong convexity and Lipschitz conditions.
Findings
DSA achieves linear convergence in expectation.
Numerical experiments show faster convergence than existing methods.
Reduces the number of feature vectors processed until convergence.
Abstract
This paper considers convex optimization problems where nodes of a network have access to summands of a global objective. Each of these local objectives is further assumed to be an average of a finite set of functions. The motivation for this setup is to solve large scale machine learning problems where elements of the training set are distributed to multiple computational elements. The decentralized double stochastic averaging gradient (DSA) algorithm is proposed as a solution alternative that relies on: (i) The use of local stochastic averaging gradients. (ii) Determination of descent steps as differences of consecutive stochastic averaging gradients. Strong convexity of local functions and Lipschitz continuity of local gradients is shown to guarantee linear convergence of the sequence generated by DSA in expectation. Local iterates are further shown to approach the optimal argument…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Sparse and Compressive Sensing Techniques
