Distributed Q-Learning with State Tracking for Multi-agent Networked Control
Hang Wang, Sen Lin, Hamid Jafarkhani, Junshan Zhang

TL;DR
This paper introduces a distributed Q-learning algorithm with state tracking for multi-agent LQR control, enabling agents to learn optimal policies without global state observation or central coordination.
Contribution
It proposes a novel state tracking based Q-learning method that ensures convergence in multi-agent systems with unknown models and limited communication.
Findings
Convergence of local state estimates to true global state.
Distributed algorithm achieves performance comparable to centralized methods.
Theoretical proof of convergence under decaying excitation noise.
Abstract
This paper studies distributed Q-learning for Linear Quadratic Regulator (LQR) in a multi-agent network. The existing results often assume that agents can observe the global system state, which may be infeasible in large-scale systems due to privacy concerns or communication constraints. In this work, we consider a setting with unknown system models and no centralized coordinator. We devise a state tracking (ST) based Q-learning algorithm to design optimal controllers for agents. Specifically, we assume that agents maintain local estimates of the global state based on their local information and communications with neighbors. At each step, every agent updates its local global state estimation, based on which it solves an approximate Q-factor locally through policy iteration. Assuming decaying injected excitation noise during the policy evaluation, we prove that the local estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems · Frequency Control in Power Systems
MethodsQ-Learning
