Lock-Free Optimization for Non-Convex Problems
Shen-Yi Zhao, Gong-Duo Zhang, Wu-Jun Li

TL;DR
This paper proves the convergence of lock-free parallel stochastic gradient descent methods, Hogwild! and AsySVRG, for non-convex optimization problems, supported by empirical evidence.
Contribution
It provides the first theoretical convergence proofs for LF-PSGD methods on non-convex problems, extending their applicability.
Findings
Hogwild! converges on non-convex problems
AsySVRG converges on non-convex problems
Empirical results confirm theoretical convergence
Abstract
Stochastic gradient descent~(SGD) and its variants have attracted much attention in machine learning due to their efficiency and effectiveness for optimization. To handle large-scale problems, researchers have recently proposed several lock-free strategy based parallel SGD~(LF-PSGD) methods for multi-core systems. However, existing works have only proved the convergence of these LF-PSGD methods for convex problems. To the best of our knowledge, no work has proved the convergence of the LF-PSGD methods for non-convex problems. In this paper, we provide the theoretical proof about the convergence of two representative LF-PSGD methods, Hogwild! and AsySVRG, for non-convex problems. Empirical results also show that both Hogwild! and AsySVRG are convergent on non-convex problems, which successfully verifies our theoretical results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
