An Encoded Corrective Double Deep Q-Networks for Multi-Agent Control Systems

Mohammadreza Barzegaran, Kemeng Han, and Hamid Jafarkhani

arXiv:2605.14121 · eess.SP · May 15, 2026

TL;DR

This paper introduces a distributed encoded corrective double actor-critic framework for multi-agent control, explicitly modeling communication delays and noise to improve policy synthesis.

Contribution

It presents a novel message-passing mechanism that refines global state information over time, enhancing multi-agent control under communication uncertainties.

Findings

01 Effective in multiple test cases

02 Outperforms various baseline methods

03 Numerical regret analysis supports effectiveness

Abstract

This paper studies the synthesis of control policies for heterogeneous and interconnected multi-agent systems that collaborate through data exchange over a communication network to minimize a collective cost. We propose a distributed encoded corrective double actor-critic framework that integrates a novel message-passing mechanism. Existing methods assume noise-free and delay-free access to the global or partial states and overlook the fact that the global states, though noisy and delayed, can be progressively reconstructed and refined over time. In contrast, this work explicitly models communication sampling asynchrony, delay, and link noise based on the network configuration. The proposed message-passing mechanism characterizes timing and information flow to refine and time shift global state information, which is then used to incrementally correct the Q-networks. The double Q-network…
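For readers unfamiliar with the "double" Q-network idea named in the title, the following is a minimal sketch of the standard Double DQN target from the general reinforcement-learning literature. It is not the paper's encoded corrective variant, and it does not model delay, noise, or message passing. The function name `double_q_target`, its parameters, and the toy values are all hypothetical and chosen only for illustration.

```python
def double_q_target(reward, gamma, q_online_next, q_target_next, done=False):
    """Standard Double DQN target for a single transition (illustrative only).

    q_online_next / q_target_next are hypothetical lists of action values at
    the next state, as produced by an online and a target network.
    """
    if done:
        # Terminal transition: no bootstrapped future value.
        return reward
    # The online network selects the greedy next action ...
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # ... and the target network evaluates it, which reduces overestimation.
    return reward + gamma * q_target_next[best_action]


# Toy values, made up for this example.
y = double_q_target(
    reward=1.0,
    gamma=0.5,
    q_online_next=[0.2, 0.9, 0.4],
    q_target_next=[1.0, 2.0, 3.0],
)
print(y)  # 1.0 + 0.5 * 2.0 = 2.0
```

Note that the online network picks action 1 even though the target network rates action 2 highest; separating selection from evaluation is the point of the double estimator.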
