Accelerating Distributed SGD for Linear Regression using Iterative   Pre-Conditioning

Kushal Chakrabarti; Nirupam Gupta; Nikhil Chopra

arXiv:2011.07595·math.OC·December 1, 2020

Accelerating Distributed SGD for Linear Regression using Iterative Pre-Conditioning

Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

PDF

Open Access

TL;DR

This paper introduces a stochastic version of the IPG algorithm, called IPSG, which accelerates distributed linear regression by using iterative pre-conditioning with single data points, achieving faster convergence.

Contribution

It extends the IPG method to stochastic settings, enabling efficient distributed linear regression with improved convergence rates using minimal data per iteration.

Findings

01

IPSG converges linearly in expectation to a neighborhood of the solution.

02

Empirical results show IPSG outperforms other stochastic algorithms in convergence speed.

03

The method is effective in server-based distributed linear regression scenarios.

Abstract

This paper considers the multi-agent distributed linear least-squares problem. The system comprises multiple agents, each agent with a locally observed set of data points, and a common server with whom the agents can interact. The agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. In the server-based distributed settings, the server cannot access the data points held by the agents. The recently proposed Iteratively Pre-conditioned Gradient-descent (IPG) method has been shown to converge faster than other existing distributed algorithms that solve this problem. In the IPG algorithm, the server and the agents perform numerous iterative computations. Each of these iterations relies on the entire batch of data points observed by the agents for updating the current estimate of the solution. Here, we extend the idea of iterative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Random Matrices and Applications