Multi-Agent Deep Reinforcement Learning for Distributed and Autonomous Platoon Coordination via Speed-regulation over Large-scale Transportation Networks
Dixiao Wei (1), Peng Yi (1, 2), Jinlong Lei (1, 2), Xingyi, Zhu (3) ((1) Shanghai Research Institute for Intelligent Autonomous Systems,, Tongji University, China, (2) Department of Control Science, Engineering,, Tongji University, China, (3) Key Laboratory of Road

TL;DR
This paper introduces a multi-agent deep reinforcement learning framework for autonomous truck platoon coordination in large-scale networks, optimizing fuel efficiency and traffic flow through decentralized decision-making.
Contribution
The paper proposes TA-QMIX, a novel multi-agent DRL method with attention mechanisms for large-scale, distributed platoon coordination, enabling autonomous and cooperative decision-making.
Findings
Achieves 19.17% fuel savings in large-scale simulations
Decentralized policy operates with decision time of 0.001 seconds
Effective cooperation among trucks improves traffic efficiency
Abstract
Truck platooning technology enables a group of trucks to travel closely together, with which the platoon can save fuel, improve traffic flow efficiency, and improve safety. In this paper, we consider the platoon coordination problem in a large-scale transportation network, to promote cooperation among trucks and optimize the overall efficiency. Involving the regulation of both speed and departure times at hubs, we formulate the coordination problem as a complicated dynamic stochastic integer programming under network and information constraints. To get an autonomous, distributed, and robust platoon coordination policy, we formulate the problem into a model of the Decentralized-Partial Observable Markov Decision Process. Then, we propose a Multi-Agent Deep Reinforcement Learning framework named Trcuk Attention-QMIX (TA-QMIX) to train an efficient online decision policy. TA-QMIX utilizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Transportation Planning and Optimization
MethodsSoftmax · Attention Is All You Need · Emirates Airlines Office in Dubai · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
