Decomposability and Parallel Computation of Multi-Agent LQR

Gangshan Jing; He Bai; Jemin George; Aranya Chakrabortty

arXiv:2010.08615·eess.SY·March 9, 2021·1 cites

Decomposability and Parallel Computation of Multi-Agent LQR

Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty

PDF

Open Access

TL;DR

This paper introduces a parallel reinforcement learning scheme for multi-agent linear quadratic regulator design, leveraging structural properties to decompose the problem into smaller, decoupled subproblems, significantly speeding up learning.

Contribution

It proposes a novel decomposition method exploiting graph structures in LQR for multi-agent systems, enabling parallel RL and computational efficiency.

Findings

01

Significant speed-up in learning process.

02

Maintains optimality in homogeneous systems.

03

Robustness when applied to non-homogeneous systems.

Abstract

Individual agents in a multi-agent system (MAS) may have decoupled open-loop dynamics, but a cooperative control objective usually results in coupled closed-loop dynamics thereby making the control design computationally expensive. The computation time becomes even higher when a learning strategy such as reinforcement learning (RL) needs to be applied to deal with the situation when the agents dynamics are not known. To resolve this problem, we propose a parallel RL scheme for a linear quadratic regulator (LQR) design in a continuous-time linear MAS. The idea is to exploit the structural properties of two graphs embedded in the $Q$ and $R$ weighting matrices in the LQR objective to define an orthogonal transformation that can convert the original LQR design to multiple decoupled smaller-sized LQR designs. We show that if the MAS is homogeneous then this decomposition retains closed-loop…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems