Distributed Differential Dynamic Programming Architectures for Large-Scale Multi-Agent Control
Augustinos D. Saravanos, Yuichiro Aoyama, Hongchang Zhu, Evangelos, A. Theodorou

TL;DR
This paper introduces two scalable decentralized control architectures, ND-DDP and MD-DDP, for large-scale multi-agent systems, combining DDP and ADMM to improve efficiency and robustness in robotics applications.
Contribution
The paper presents two novel decentralized optimization frameworks, ND-DDP and MD-DDP, integrating DDP and ADMM for scalable multi-agent control.
Findings
Both frameworks are fully decentralized and parallelizable.
Simulation results demonstrate scalability to thousands of agents.
Hardware experiments confirm real-world applicability.
Abstract
In this paper, we propose two novel decentralized optimization frameworks for multi-agent nonlinear optimal control problems in robotics. The aim of this work is to suggest architectures that inherit the computational efficiency and scalability of Differential Dynamic Programming (DDP) and the distributed nature of the Alternating Direction Method of Multipliers (ADMM). In this direction, two frameworks are introduced. The first one called Nested Distributed DDP (ND-DDP), is a three-level architecture which employs ADMM for enforcing a consensus between all agents, an augmented Lagrangian layer for satisfying local constraints and DDP as each agent's optimizer. In the second approach, both consensus and local constraints are handled with ADMM, yielding a two-level architecture called Merged Distributed DDP (MD-DDP), which further reduces computational complexity. Both frameworks are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Smart Parking Systems Research · Reinforcement Learning in Robotics
MethodsAlternating Direction Method of Multipliers
