Reinforcement Learning with Distributed MPC for Fuel-Efficient Platoon Control with Discrete Gear Transitions
Samuel Mallick, Gianpietro Battocletti, Dimitris Boskos, Azita Dabiri, Bart De Schutter

TL;DR
This paper introduces a scalable reinforcement learning-based distributed MPC method for fuel-efficient autonomous vehicle platoons, reducing computational complexity while maintaining performance.
Contribution
It proposes a novel RL-based distributed MPC approach with decoupled single-agent learning and RNN gear policies for scalable, real-time platoon control.
Findings
Lower computational burden compared to traditional MPC
Achieves comparable fuel efficiency in simulations
Scalable to large platoons with complex gear schedules
Abstract
Cooperative control of groups of autonomous vehicles (AVs), i.e., platoons, is a promising direction to improving the efficiency of autonomous transportation systems. In this context, distributed co-optimization of both vehicle speed and gear position can offer benefits for fuel-efficient driving. To this end, model predictive control (MPC) is a popular approach, optimizing the speed and gear-shift schedule while explicitly considering the vehicles' dynamics over a prediction window. However, optimization over both the vehicles' continuous dynamics and discrete gear positions is computationally intensive, and may require overly long sample times or high-end hardware for real-time implementation. This work proposes a reinforcement learning (RL)-based distributed MPC approach to address this issue. For each vehicle in the platoon, a policy is trained to select and fix the gear positions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Electric and Hybrid Vehicle Technologies · Vehicle Dynamics and Control Systems
