A Hysteretic Q-learning Coordination Framework for Emerging Mobility   Systems in Smart Cities

Behdad Chalaki; Andreas A. Malikopoulos

arXiv:2011.03137·math.OC·March 11, 2022

A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities

Behdad Chalaki, Andreas A. Malikopoulos

PDF

TL;DR

This paper proposes a decentralized Q-learning based coordination framework for connected autonomous vehicles at intersections, aiming to reduce travel time and fuel consumption in smart city traffic management.

Contribution

It introduces a novel hysteretic Q-learning coordination mechanism combined with FIFO policy for signal-free intersection management in smart cities.

Findings

01

The approach reduces travel time compared to classical methods.

02

It improves fuel efficiency in simulated scenarios.

03

Demonstrates effectiveness through simulation comparisons.

Abstract

Connected and automated vehicles (CAVs) can alleviate traffic congestion, air pollution, and improve safety. In this paper, we provide a decentralized coordination framework for CAVs at a signal-free intersection to minimize travel time and improve fuel efficiency. We employ a simple yet powerful reinforcement learning approach, an off-policy temporal difference learning called Q-learning, enhanced with a coordination mechanism to address this problem. Then, we integrate a first-in-first-out queuing policy to improve the performance of our system. We demonstrate the efficacy of our proposed approach through simulation and comparison with the classical optimal control method based on Pontryagin's minimum principle.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEmirates Airlines Office in Dubai