How is Distributed ADMM Affected by Network Topology?
Guilherme Fran\c{c}a, Jos\'e Bento

TL;DR
This paper analyzes how the convergence rate of distributed ADMM for consensus problems depends on network topology, providing explicit formulas and proving a conjecture that ADMM can outperform gradient descent by a square root factor.
Contribution
It offers a full characterization of distributed ADMM convergence in relation to graph topology and confirms the conjecture that ADMM is faster than GD by a square root factor.
Findings
Explicit formulas for optimal parameters based on graph eigenvalues.
Proof that ADMM outperforms GD by a square root factor for any graph.
Analysis applicable even when Markov chain lifting does not accelerate mixing.
Abstract
When solving consensus optimization problems over a graph, there is often an explicit characterization of the convergence rate of Gradient Descent (GD) using the spectrum of the graph Laplacian. The same type of problems under the Alternating Direction Method of Multipliers (ADMM) are, however, poorly understood. For instance, simple but important non-strongly-convex consensus problems have not yet being analyzed, especially concerning the dependency of the convergence rate on the graph topology. Recently, for a non-strongly-convex consensus problem, a connection between distributed ADMM and lifted Markov chains was proposed, followed by a conjecture that ADMM is faster than GD by a square root factor in its convergence time, in close analogy to the mixing speedup achieved by lifting several Markov chains. Nevertheless, a proof of such a claim is is still lacking. Here we provide a full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Neural Networks Stability and Synchronization · Stochastic Gradient Optimization Techniques
MethodsAlternating Direction Method of Multipliers
