Near-Optimal Decentralized Stochastic Nonconvex Optimization with Heavy-Tailed Noise

Menglian Wang; Zhuanghua Liu; Luo Luo

arXiv:2601.11435·math.OC·January 19, 2026

Near-Optimal Decentralized Stochastic Nonconvex Optimization with Heavy-Tailed Noise

Menglian Wang, Zhuanghua Liu, Luo Luo

PDF

Open Access

TL;DR

This paper introduces a decentralized normalized stochastic gradient descent method that effectively handles heavy-tailed noise in nonconvex optimization, achieving near-optimal sample and communication complexities.

Contribution

It proposes a novel decentralized optimization algorithm robust to heavy-tailed noise, with theoretical guarantees and empirical validation.

Findings

01

Achieves approximate stationary points with optimal sample complexity.

02

Attains near-optimal communication complexity.

03

Demonstrates practical superiority through empirical studies.

Abstract

This paper studies decentralized stochastic nonconvex optimization problem over row-stochastic networks. We consider the heavy-tailed gradient noise which is empirically observed in many popular real-world applications. Specifically, we propose a decentralized normalized stochastic gradient descent with Pull-Diag gradient tracking, which achieves approximate stationary points with the optimal sample complexity and the near-optimal communication complexity. We further follow our framework to study the setting of undirected networks, also achieving the nearly tight upper complexity bounds. Moreover, we conduct empirical studies to show the practical superiority of the proposed methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Privacy-Preserving Technologies in Data