Asynchronous Distributed Semi-Stochastic Gradient Optimization

Ruiliang Zhang; Shuai Zheng; James T. Kwok

arXiv:1508.01633·cs.LG·December 7, 2015·2 cites

Asynchronous Distributed Semi-Stochastic Gradient Optimization

Ruiliang Zhang, Shuai Zheng, James T. Kwok

PDF

Open Access

TL;DR

This paper introduces a distributed asynchronous SGD algorithm with variance reduction that converges linearly and outperforms existing methods in speed and solution quality on large-scale problems.

Contribution

It proposes a novel asynchronous distributed SGD algorithm that combines variance reduction with linear convergence guarantees, improving efficiency and solution quality.

Findings

01

Outperforms state-of-the-art algorithms in wall clock time

02

Achieves linear convergence with a constant learning rate

03

Demonstrates effectiveness on Google Cloud Platform

Abstract

With the recent proliferation of large-scale learning problems,there have been a lot of interest on distributed machine learning algorithms, particularly those that are based on stochastic gradient descent (SGD) and its variants. However, existing algorithms either suffer from slow convergence due to the inherent variance of stochastic gradients, or have a fast linear convergence rate but at the expense of poorer solution quality. In this paper, we combine their merits by proposing a fast distributed asynchronous SGD-based algorithm with variance reduction. A constant learning rate can be used, and it is also guaranteed to converge linearly to the optimal solution. Experiments on the Google Cloud Computing Platform demonstrate that the proposed algorithm outperforms state-of-the-art distributed asynchronous algorithms in terms of both wall clock time and solution quality.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Privacy-Preserving Technologies in Data