Data-Based Optimal Control of Multi-Agent Systems: A Reinforcement   Learning Design Approach

Jilie Zhang; Zhanshan Wang; Hongwei Zhang

arXiv:1711.11422·math.OC·May 21, 2019·IEEE Trans. Cybern.

Data-Based Optimal Control of Multi-Agent Systems: A Reinforcement Learning Design Approach

Jilie Zhang, Zhanshan Wang, Hongwei Zhang

PDF

TL;DR

This paper presents a data-driven reinforcement learning approach for optimal consensus control in heterogeneous multi-agent systems, eliminating the need for system models by using I/O data and value iteration.

Contribution

It introduces a novel data-based error estimator and an I/O Q-learning algorithm to solve the optimal consensus problem without relying on system models.

Findings

01

The proposed method effectively achieves optimal consensus control.

02

The approach does not require explicit system models.

03

Numerical results demonstrate the algorithm's effectiveness.

Abstract

This paper studies optimal consensus tracking problem of heterogeneous linear multi-agent systems. By introducing tracking error dynamics, the optimal tracking problem is reformulated as finding a Nash-equilibrium solution of a multi-player games, which can be done by solving associated coupled Hamilton-Jacobi (HJ) equations. A data-based error estimator is designed to obtain the data-based control for the multi-agent systems. Using the quadratic functional to approximate the every agent's value function, we can obtain the optimal cooperative control by input-output (I/O) $Q$ -learning algorithm with value iteration technique in the least-square sense. The control law solves the optimal consensus problem for multi-agent systems with measured input-output information, and does not rely on the model of multi-agent systems. A numerical example is provided to illustrate the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.