On Convergence Analysis of Network-GIANT: An approximate Hessian-based fully distributed optimization algorithm
Souvik Das, Luca Schenato, and Subhrakanti Dey

TL;DR
This paper provides a detailed convergence analysis of Network-GIANT, a distributed optimization algorithm that uses approximate Hessian information, demonstrating its faster linear convergence compared to first-order methods under certain conditions.
Contribution
It explicitly characterizes the global linear convergence rate of Network-GIANT and derives bounds on step size and optimality gap, explaining its faster convergence.
Findings
Network-GIANT achieves faster linear convergence than first-order methods.
Explicit spectral radius-based convergence rate depends on problem and graph parameters.
Numerical experiments confirm theoretical convergence properties.
Abstract
In this paper, we present a detailed convergence analysis of a recently developed approximate Newton-type fully distributed optimization method for smooth, strongly convex local loss functions, called Network-GIANT, which has been empirically illustrated to show faster linear convergence properties while having the same communication complexity (per iteration) as its first order distributed counterparts. By using consensus based parameter updates, and a local Hessian based descent direction at the individual nodes with gradient tracking, we first explicitly characterize a global linear convergence rate for Network-GIANT, which can be computed as the spectral radius of a matrix dependent on the Lipschitz continuity () and strong convexity () parameters of the objective functions, and the spectral norm () of the underlying undirected graph represented by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Sparse and Compressive Sensing Techniques
