Decentralized Safe Multi-agent Stochastic Optimal Control using Deep   FBSDEs and ADMM

Marcus A. Pereira; Augustinos D. Saravanos; Oswin So; Evangelos A.; Theodorou

arXiv:2202.10658·cs.MA·June 9, 2022

Decentralized Safe Multi-agent Stochastic Optimal Control using Deep FBSDEs and ADMM

Marcus A. Pereira, Augustinos D. Saravanos, Oswin So, Evangelos A., Theodorou

PDF

Open Access

TL;DR

This paper introduces a decentralized, safe, and scalable multi-agent control method using deep FBSDEs and ADMM, ensuring safety through stochastic control barrier functions and consensus-based optimization.

Contribution

It develops a novel end-to-end differentiable framework combining deep FBSDEs with ADMM for safe, decentralized stochastic control in multi-agent systems.

Findings

01

Ensures safe multi-agent operation during training with collision avoidance.

02

Demonstrates superior scalability and computational efficiency over centralized methods.

03

Validates effectiveness on complex multi-robot simulation tasks.

Abstract

In this work, we propose a novel safe and scalable decentralized solution for multi-agent control in the presence of stochastic disturbances. Safety is mathematically encoded using stochastic control barrier functions and safe controls are computed by solving quadratic programs. Decentralization is achieved by augmenting to each agent's optimization variables, copy variables, for its neighbors. This allows us to decouple the centralized multi-agent optimization problem. However, to ensure safety, neighboring agents must agree on "what is safe for both of us" and this creates a need for consensus. To enable safe consensus solutions, we incorporate an ADMM-based approach. Specifically, we propose a Merged CADMM-OSQP implicit neural network layer, that solves a mini-batch of both, local quadratic programs as well as the overall consensus problem, as a single optimization problem. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlockchain Technology Applications and Security · Risk and Portfolio Optimization · Reinforcement Learning in Robotics