Provably Convergent Decentralized Optimization over Directed Graphs under Generalized Smoothness
Yanan Bo, Yongqiang Wang

TL;DR
This paper introduces a decentralized optimization method that works under generalized smoothness conditions, allowing for rapidly changing gradients, and proves convergence over directed graphs without requiring bounded gradient dissimilarity.
Contribution
It develops a novel decentralized optimization algorithm under generalized smoothness that converges over directed graphs without bounded gradient dissimilarity assumptions.
Findings
Demonstrates superior stability and faster convergence in experiments.
Validates effectiveness on benchmark datasets like LIBSVM and CIFAR-10.
Handles unbounded gradient dissimilarity, broadening applicability.
Abstract
Decentralized optimization has become a fundamental tool for large-scale learning systems; however, most existing methods rely on the classical Lipschitz smoothness assumption, which is often violated in problems with rapidly varying gradients. Motivated by this limitation, we study decentralized optimization under the generalized -smoothness framework, in which the Hessian norm is allowed to grow linearly with the gradient norm, thereby accommodating rapidly varying gradients beyond classical Lipschitz smoothness. We integrate gradient-tracking techniques with gradient clipping and carefully design the clipping threshold to ensure accurate convergence over directed communication graphs under generalized smoothness. In contrast to existing distributed optimization results under generalized smoothness that require a bounded gradient dissimilarity assumption, our results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques
