Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks
Yiyue Chen, Abolfazl Hashemi, Haris Vikalo

TL;DR
This paper introduces a distributed stochastic optimization algorithm for non-convex problems over time-varying directed networks, with an oracle complexity of $\mathcal{O}(1/\epsilon^{1.5})$ and superior experimental performance on MNIST, CIFAR-10, and IMDB.
Contribution
The paper proposes a new algorithm combining stochastic gradient descent with momentum and gradient tracking for non-convex optimization over time-varying directed networks.
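To make the three ingredients concrete, here is a minimal sketch of momentum plus gradient tracking over a time-varying directed network. It is not the authors' algorithm: it assumes a push-sum style correction with column-stochastic mixing matrices, a heavy-ball momentum buffer, and a toy scalar quadratic local objective $f_i(x) = \frac{1}{2}(x - b_i)^2$ (convex, chosen only so the answer is easy to check). All names (`run`, `column_stochastic`, `alpha`, `beta`, `noise_std`) and the two alternating graphs are hypothetical.

```python
import random


def column_stochastic(n, out_edges):
    """Mixing matrix where node j splits its mass equally among itself and its out-neighbors."""
    C = [[0.0] * n for _ in range(n)]
    for j in range(n):
        targets = set(out_edges.get(j, ())) | {j}  # self-loop always included
        for i in targets:
            C[i][j] = 1.0 / len(targets)
    return C


def matvec(C, x):
    return [sum(c * xj for c, xj in zip(row, x)) for row in C]


def run(b, steps=2000, alpha=0.02, beta=0.5, noise_std=0.0, seed=0):
    """Toy run: node i holds f_i(x) = 0.5 * (x - b[i])**2 and sees noisy gradients."""
    rng = random.Random(seed)
    n = len(b)
    # Two directed graphs that alternate over time (an assumption for illustration).
    ring = {j: [(j + 1) % n] for j in range(n)}
    ring_plus = {j: [(j + 1) % n] for j in range(n)}
    ring_plus[0] = [1 % n, 2 % n]  # extra edge makes the matrix non-doubly-stochastic
    graphs = [column_stochastic(n, ring), column_stochastic(n, ring_plus)]

    def grad(i, x):
        # Stochastic first-order oracle: true gradient plus Gaussian noise.
        return (x - b[i]) + rng.gauss(0.0, noise_std)

    x = [0.0] * n            # local model estimates
    u = x[:]                 # push-sum numerators
    v = [1.0] * n            # push-sum weights that correct for directed-graph imbalance
    g = [grad(i, x[i]) for i in range(n)]
    y = g[:]                 # gradient trackers, initialized to the local gradients
    m = [0.0] * n            # momentum buffers

    for t in range(steps):
        C = graphs[t % 2]
        m = [beta * mi + yi for mi, yi in zip(m, y)]                # heavy-ball momentum
        u = matvec(C, [ui - alpha * mi for ui, mi in zip(u, m)])    # descend, then mix
        v = matvec(C, v)
        x = [ui / vi for ui, vi in zip(u, v)]                       # de-biased estimates
        g_new = [grad(i, x[i]) for i in range(n)]
        # Tracking update: mix trackers, then add the change in local gradients.
        y = [cy + gn - go for cy, gn, go in zip(matvec(C, y), g_new, g)]
        g = g_new
    return x


if __name__ == "__main__":
    print(run([1.0, 2.0, 3.0, 4.0, 10.0]))
```

With `noise_std=0.0` every node should approach the minimizer of the average objective, which for this toy problem is the mean of `b`. Setting `noise_std` above zero shows how oracle noise perturbs the iterates.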
Findings
Oracle complexity of $\mathcal{O}(1/\epsilon^{1.5})$
Linear convergence under Polyak-Łojasiewicz condition
Superior performance in experiments on MNIST, CIFAR-10, and IMDB datasets
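
For reference, the Polyak-Łojasiewicz (PL) condition named in the findings is usually stated as below. The symbols are the standard ones and are assumed here, since this summary does not give the paper's notation: $f$ is the global objective, $f^*$ its minimum value, and $\mu > 0$ a constant.

$$\frac{1}{2}\,\|\nabla f(x)\|^2 \;\ge\; \mu\,\bigl(f(x) - f^*\bigr) \quad \text{for all } x.$$

The condition does not require convexity. It implies that every stationary point is a global minimizer, which is what makes a linear rate attainable for a non-convex objective.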
Abstract
Distributed stochastic non-convex optimization problems have recently received attention due to the growing interest of signal processing, computer vision, and natural language processing communities in applications deployed over distributed learning systems (e.g., federated learning). We study the setting where the data is distributed across the nodes of a time-varying directed network, a topology suitable for modeling dynamic networks experiencing communication delays and straggler effects. The network nodes, which can access only their local objectives and query a stochastic first-order oracle to obtain gradient estimates, collaborate to minimize a global objective function by exchanging messages with their neighbors. We propose an algorithm, novel to this setting, that leverages stochastic gradient descent with momentum and gradient tracking to solve distributed non-convex…
Taxonomy
Topics: Energy Efficient Wireless Sensor Networks · Distributed Control Multi-Agent Systems · Advanced Wireless Network Optimization
Methods: Softmax · Attention Is All You Need · Logistic Regression
