MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures

Ahmed Fares; Walid Gomaa; Mohamed A. Khamis

arXiv:1808.09806·cs.SY·August 30, 2018

MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures

Ahmed Fares, Walid Gomaa, Mohamed A. Khamis

PDF

Open Access

TL;DR

This paper introduces MARL-FWC, a multi-agent reinforcement learning system for optimizing freeway traffic flow through coordinated ramp metering and dynamic speed limits, demonstrating significant improvements in travel time and speed.

Contribution

It proposes a novel collaborative multi-agent reinforcement learning framework with a microscopic network-level model and various control architectures for freeway traffic management.

Findings

01

Significant reduction in total travel time.

02

Increase in average traffic speed.

03

Effective coordination of control measures in simulations.

Abstract

The objective of this article is to optimize the overall traffic flow on freeways using multiple ramp metering controls plus its complementary Dynamic Speed Limits (DSLs). An optimal freeway operation can be reached when minimizing the difference between the freeway density and the critical ratio for maximum traffic flow. In this article, a Multi-Agent Reinforcement Learning for Freeways Control (MARL-FWC) system for ramps metering and DSLs is proposed. MARL-FWC introduces a new microscopic framework at the network level based on collaborative Markov Decision Process modeling (Markov game) and an associated cooperative Q-learning algorithm. The technique incorporates payoff propagation (Max-Plus algorithm) under the coordination graphs framework, particularly suited for optimal control purposes. MARL-FWC provides three control designs: fully independent, fully distributed, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Transportation Planning and Optimization · Traffic Prediction and Management Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Q-Learning