Multi-Agent Deep Reinforcement Learning for Distributed and Autonomous Platoon Coordination via Speed-regulation over Large-scale Transportation Networks

Dixiao Wei (1), Peng Yi (1, 2), Jinlong Lei (1, 2), Xingyi Zhu (3)
(1) Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, China
(2) Department of Control Science and Engineering, Tongji University, China
(3) Key Laboratory of Road and Traffic Engineering of the Ministry of Education, Tongji University, China

arXiv:2412.01075 · cs.LG · December 3, 2024

TL;DR

This paper introduces a multi-agent deep reinforcement learning framework for autonomous truck platoon coordination in large-scale networks, optimizing fuel efficiency and traffic flow through decentralized decision-making.

Contribution

The paper proposes TA-QMIX, a novel multi-agent DRL method with attention mechanisms for large-scale, distributed platoon coordination, enabling autonomous and cooperative decision-making.

Findings

01. Achieves 19.17% fuel savings in large-scale simulations
02. Decentralized policy operates with a decision time of 0.001 seconds
03. Effective cooperation among trucks improves traffic efficiency

Abstract

Truck platooning technology enables a group of trucks to travel closely together, which saves fuel, improves traffic flow efficiency, and improves safety. In this paper, we consider the platoon coordination problem in a large-scale transportation network, with the aim of promoting cooperation among trucks and optimizing overall efficiency. Because coordination involves regulating both speed and departure times at hubs, we formulate the problem as a complex dynamic stochastic integer program under network and information constraints. To obtain an autonomous, distributed, and robust platoon coordination policy, we recast the problem as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). We then propose a Multi-Agent Deep Reinforcement Learning framework named Truck Attention-QMIX (TA-QMIX) to train an efficient online decision policy. TA-QMIX utilizes…
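The abstract is cut off before it describes TA-QMIX itself, so the following is only a minimal sketch of the monotonic value mixing used in standard QMIX, the method the name builds on. It is not the authors' implementation and contains no attention module. The class name `MonotonicMixer`, the layer sizes, the random initialization, and the single-layer hypernetworks are all illustrative assumptions. The idea it shows: per-agent Q-values are combined into a joint value Q_tot using weights generated from the global state and forced to be non-negative, so Q_tot never decreases when any single agent's Q-value increases. That property is what lets each agent act greedily on its own Q-value at execution time.

```python
import math
import random


def matvec(matrix, x):
    """Multiply a matrix (list of rows) by a vector, using plain Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in matrix]


def elu(v):
    """ELU activation; monotonically increasing, as the mixer requires."""
    return v if v > 0 else math.exp(v) - 1.0


class MonotonicMixer:
    """Sketch of a QMIX-style mixer (hypothetical sizes, untrained weights)."""

    def __init__(self, n_agents, state_dim, hidden=4, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

        self.n_agents = n_agents
        self.hidden = hidden
        # Hypernetworks: linear maps from the global state to mixing parameters.
        # (Simplification: a single linear layer each, chosen for brevity.)
        self.hyper_w1 = init(n_agents * hidden, state_dim)
        self.hyper_b1 = init(hidden, state_dim)
        self.hyper_w2 = init(hidden, state_dim)
        self.hyper_b2 = init(1, state_dim)

    def q_tot(self, agent_qs, state):
        """Mix per-agent Q-values into one joint value conditioned on the state."""
        # abs() keeps mixing weights non-negative, which enforces monotonicity.
        w1 = [abs(v) for v in matvec(self.hyper_w1, state)]
        b1 = matvec(self.hyper_b1, state)
        hidden = [
            elu(sum(agent_qs[a] * w1[a * self.hidden + h] for a in range(self.n_agents)) + b1[h])
            for h in range(self.hidden)
        ]
        w2 = [abs(v) for v in matvec(self.hyper_w2, state)]
        b2 = matvec(self.hyper_b2, state)[0]
        return sum(h * w for h, w in zip(hidden, w2)) + b2


mixer = MonotonicMixer(n_agents=3, state_dim=2)
state = [0.5, -1.0]
base = mixer.q_tot([1.0, 2.0, 3.0], state)
raised = mixer.q_tot([1.0, 3.0, 3.0], state)  # raise only agent 1's Q-value
print(raised >= base)
```

In a trained system the hypernetwork weights would be learned jointly with the per-agent networks; here they are random and serve only to demonstrate the monotonicity constraint.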

Taxonomy

Topics: Traffic control and management · Transportation Planning and Optimization

Methods: Softmax · Attention Is All You Need