Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

Christopher De Sa; Ce Zhang; Kunle Olukotun; Christopher R\'e

arXiv:1506.06438·cs.LG·October 6, 2015·76 cites

Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

Christopher De Sa, Ce Zhang, Kunle Olukotun, Christopher R\'e

PDF

Open Access

TL;DR

This paper presents a unified martingale-based analysis of Hogwild!-style asynchronous SGD algorithms, including non-convex and low-precision variants, demonstrating their convergence and efficiency on modern hardware.

Contribution

It introduces a new analysis framework for asynchronous SGD, deriving convergence rates under relaxed assumptions and designing a low-precision asynchronous algorithm called Buckwild!

Findings

01

Convergence rates for convex Hogwild! with relaxed sparsity assumptions

02

Analysis of asynchronous SGD for non-convex matrix problems

03

Efficient implementation of Buckwild! with low-precision arithmetic

Abstract

Stochastic gradient descent (SGD) is a ubiquitous algorithm for a variety of machine learning problems. Researchers and industry have developed several techniques to optimize SGD's runtime performance, including asynchronous execution and reduced precision. Our main result is a martingale-based analysis that enables us to capture the rich noise models that may arise from such techniques. Specifically, we use our new analysis in three ways: (1) we derive convergence rates for the convex case (Hogwild!) with relaxed assumptions on the sparsity of the problem; (2) we analyze asynchronous SGD algorithms for non-convex matrix problems including matrix completion; and (3) we design and analyze an asynchronous SGD algorithm, called Buckwild!, that uses lower-precision arithmetic. We show experimentally that our algorithms run efficiently for a variety of problems on modern hardware.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Artificial Intelligence in Games · Constraint Satisfaction and Optimization

MethodsStochastic Gradient Descent