Lock-Free Optimization for Non-Convex Problems

Shen-Yi Zhao; Gong-Duo Zhang; Wu-Jun Li

arXiv:1612.03441·stat.ML·December 19, 2016

Lock-Free Optimization for Non-Convex Problems

Shen-Yi Zhao, Gong-Duo Zhang, Wu-Jun Li

PDF

Open Access

TL;DR

This paper proves the convergence of lock-free parallel stochastic gradient descent methods, Hogwild! and AsySVRG, for non-convex optimization problems, supported by empirical evidence.

Contribution

It provides the first theoretical convergence proofs for LF-PSGD methods on non-convex problems, extending their applicability.

Findings

01

Hogwild! converges on non-convex problems

02

AsySVRG converges on non-convex problems

03

Empirical results confirm theoretical convergence

Abstract

Stochastic gradient descent~(SGD) and its variants have attracted much attention in machine learning due to their efficiency and effectiveness for optimization. To handle large-scale problems, researchers have recently proposed several lock-free strategy based parallel SGD~(LF-PSGD) methods for multi-core systems. However, existing works have only proved the convergence of these LF-PSGD methods for convex problems. To the best of our knowledge, no work has proved the convergence of the LF-PSGD methods for non-convex problems. In this paper, we provide the theoretical proof about the convergence of two representative LF-PSGD methods, Hogwild! and AsySVRG, for non-convex problems. Empirical results also show that both Hogwild! and AsySVRG are convergent on non-convex problems, which successfully verifies our theoretical results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM