Distributed Q-Learning for Dynamically Decoupled Systems

Siavash Alemzadeh; Mehran Mesbahi

arXiv:1809.08745·math.OC·March 21, 2019·ACC

TL;DR

This paper introduces a distributed Q-learning algorithm for large-scale networked systems with decoupled dynamics. The controller is designed directly from observed data, without a detailed model, and converges to the optimal LQR controller.

Contribution

The paper presents a distributed Q-learning method tailored to dynamically decoupled systems. The method converges to the optimal controllers using only observed data.

Findings

01. The algorithm converges to the optimal LQR controller for each subsystem.

02. The method effectively handles systems with complex interaction structures.

03. Verification through an example demonstrates practical applicability.

Abstract

Control of large-scale networked systems often requires complex models of the interactions among the agents. In many applications, however, building accurate models of the agents or of their interactions may be infeasible or computationally prohibitive, owing to the curse of dimensionality or the complexity of these interactions. Data-guided control methods, by contrast, can circumvent model complexity by synthesizing the controller directly from the observed data. In this paper, we propose a distributed Q-learning algorithm to design a feedback mechanism based on a given underlying graph structure parameterizing the agents' interaction network. We assume that the distributed nature of the system arises from the cost function of the corresponding control problem and show that for the specific case of identical dynamically decoupled systems, the learned…
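To make the idea of synthesizing an LQR controller directly from data concrete, the sketch below shows the standard single-system, least-squares Q-learning policy iteration for LQR. It is not the distributed algorithm proposed in the paper, which this page does not spell out. The matrices A, B, Qc, Rc, the helper names, and the zero initial gain are illustrative assumptions. The learner only sees sampled states, inputs, and costs; the model-based Riccati gain is computed purely as a reference for comparison.

```python
import numpy as np

def quad_features(z):
    """Features phi(z) such that phi(z) @ theta == z @ H @ z for symmetric H."""
    rows, cols = np.triu_indices(len(z))
    scale = np.where(rows == cols, 1.0, 2.0)  # off-diagonal terms appear twice
    return np.outer(z, z)[rows, cols] * scale

def theta_to_matrix(theta, n):
    """Rebuild the symmetric matrix H from its upper-triangular entries."""
    H = np.zeros((n, n))
    H[np.triu_indices(n)] = theta
    return H + H.T - np.diag(np.diag(H))

def q_learning_lqr(step, cost, K0, nx, nu, n_iters=10, n_samples=200, seed=0):
    """Policy iteration on the quadratic Q-function Q(x, u) = z' H z, z = [x; u].

    `step` and `cost` are black boxes that return the next state and the
    stage cost; the learner never reads the system matrices.
    K0 must be a stabilizing gain for the control law u = -K0 x.
    """
    rng = np.random.default_rng(seed)
    K = np.array(K0, dtype=float)
    for _ in range(n_iters):
        Phi, y = [], []
        x = rng.standard_normal(nx)
        for _ in range(n_samples):
            u = -K @ x + rng.standard_normal(nu)  # exploration noise
            x_next = step(x, u)
            z = np.concatenate([x, u])
            z_next = np.concatenate([x_next, -K @ x_next])  # on-policy next input
            # Bellman equation for the current policy: Q(z) - Q(z_next) = cost
            Phi.append(quad_features(z) - quad_features(z_next))
            y.append(cost(x, u))
            x = x_next
        theta, *_ = np.linalg.lstsq(np.array(Phi), np.array(y), rcond=None)
        H = theta_to_matrix(theta, nx + nu)
        # Policy improvement: minimize z' H z over u, giving u = -K x
        K = np.linalg.solve(H[nx:, nx:], H[nx:, :nx])
    return K

def riccati_gain(A, B, Qc, Rc, iters=2000):
    """Model-based reference gain from fixed-point iteration of the Riccati equation."""
    P = Qc.copy()
    for _ in range(iters):
        G = np.linalg.solve(Rc + B.T @ P @ B, B.T @ P @ A)
        P = Qc + A.T @ P @ A - A.T @ P @ B @ G
    return np.linalg.solve(Rc + B.T @ P @ B, B.T @ P @ A)

# Hypothetical stable, controllable system used only for illustration.
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Qc, Rc = np.eye(2), np.eye(1)

K_learned = q_learning_lqr(
    step=lambda x, u: A @ x + B @ u,
    cost=lambda x, u: x @ Qc @ x + u @ Rc @ u,
    K0=np.zeros((1, 2)), nx=2, nu=1,
)
K_model = riccati_gain(A, B, Qc, Rc)
```

Because the example system is open-loop stable, a zero initial gain is a valid starting policy. For an unstable system, a stabilizing initial gain would have to be supplied instead.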
