Distributed Q-Learning with State Tracking for Multi-agent Networked   Control

Hang Wang; Sen Lin; Hamid Jafarkhani; Junshan Zhang

arXiv:2012.12383·cs.MA·December 24, 2020·1 cites

Distributed Q-Learning with State Tracking for Multi-agent Networked Control

Hang Wang, Sen Lin, Hamid Jafarkhani, Junshan Zhang

PDF

Open Access

TL;DR

This paper introduces a distributed Q-learning algorithm with state tracking for multi-agent LQR control, enabling agents to learn optimal policies without global state observation or central coordination.

Contribution

It proposes a novel state tracking based Q-learning method that ensures convergence in multi-agent systems with unknown models and limited communication.

Findings

01

Convergence of local state estimates to true global state.

02

Distributed algorithm achieves performance comparable to centralized methods.

03

Theoretical proof of convergence under decaying excitation noise.

Abstract

This paper studies distributed Q-learning for Linear Quadratic Regulator (LQR) in a multi-agent network. The existing results often assume that agents can observe the global system state, which may be infeasible in large-scale systems due to privacy concerns or communication constraints. In this work, we consider a setting with unknown system models and no centralized coordinator. We devise a state tracking (ST) based Q-learning algorithm to design optimal controllers for agents. Specifically, we assume that agents maintain local estimates of the global state based on their local information and communications with neighbors. At each step, every agent updates its local global state estimation, based on which it solves an approximate Q-factor locally through policy iteration. Assuming decaying injected excitation noise during the policy evaluation, we prove that the local estimation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems · Frequency Control in Power Systems

MethodsQ-Learning