The Entrapment Problem in Random Walk Decentralized Learning
Zonghong Liu, Salim El Rouayheb, Matthew Dwyer

TL;DR
This paper addresses the entrapment problem in random walk-based decentralized learning by proposing the MHLJ algorithm, which introduces jumps to improve convergence speed and overcome node entrapment.
Contribution
We introduce the MHLJ algorithm that enhances decentralized SGD by incorporating Lévy jumps, overcoming entrapment issues caused by the Metropolis-Hastings method.
Findings
MHLJ accelerates convergence compared to traditional MH-based methods.
Theoretical analysis confirms the convergence rate and error bounds of MHLJ.
Numerical experiments demonstrate improved performance and robustness of MHLJ.
Abstract
This paper explores decentralized learning in a graph-based setting, where data is distributed across nodes. We investigate a decentralized SGD algorithm that utilizes a random walk to update a global model based on local data. Our focus is on designing the transition probability matrix to speed up convergence. While importance sampling can enhance centralized learning, its decentralized counterpart, using the Metropolis-Hastings (MH) algorithm, can lead to the entrapment problem, where the random walk becomes stuck at certain nodes, slowing convergence. To address this, we propose the Metropolis-Hastings with L\'evy Jumps (MHLJ) algorithm, which incorporates random perturbations (jumps) to overcome entrapment. We theoretically establish the convergence rate and error gap of MHLJ and validate our findings through numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Machine Learning and Algorithms · Complexity and Algorithms in Graphs
MethodsStochastic Gradient Descent · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus
