Delay-Aware Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control with Model-based Stability Enhancement
Jiaqi Liu, Ziran Wang, Peng Hang, and Jian Sun

TL;DR
This paper introduces a delay-aware multi-agent reinforcement learning framework for cooperative adaptive cruise control, incorporating stability enhancement techniques to improve safety and performance in vehicular platoons under real-world delays.
Contribution
It proposes a novel delay-aware MARL framework with a specialized MADA-MDP model, attention-based policy network, and velocity optimization filter for stable CACC control.
Findings
Outperforms baseline methods in safety and stability
Effective under various delay conditions
Scalable to different platoon sizes
Abstract
Cooperative Adaptive Cruise Control (CACC) represents a quintessential control strategy for orchestrating vehicular platoon movement within Connected and Automated Vehicle (CAV) systems, significantly enhancing traffic efficiency and reducing energy consumption. In recent years, the data-driven methods, such as reinforcement learning (RL), have been employed to address this task due to their significant advantages in terms of efficiency and flexibility. However, the delay issue, which often arises in real-world CACC systems, is rarely taken into account by current RL-based approaches. To tackle this problem, we propose a Delay-Aware Multi-Agent Reinforcement Learning (DAMARL) framework aimed at achieving safe and stable control for CACC. We model the entire decision-making process using a Multi-Agent Delay-Aware Markov Decision Process (MADA-MDP) and develop a centralized training with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Traffic Prediction and Management Techniques
