MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures
Ahmed Fares, Walid Gomaa, Mohamed A. Khamis

TL;DR
This paper introduces MARL-FWC, a multi-agent reinforcement learning system for optimizing freeway traffic flow through coordinated ramp metering and dynamic speed limits, demonstrating significant improvements in travel time and speed.
Contribution
It proposes a novel collaborative multi-agent reinforcement learning framework with a microscopic network-level model and various control architectures for freeway traffic management.
Findings
Significant reduction in total travel time.
Increase in average traffic speed.
Effective coordination of control measures in simulations.
Abstract
The objective of this article is to optimize the overall traffic flow on freeways using multiple ramp metering controls plus its complementary Dynamic Speed Limits (DSLs). An optimal freeway operation can be reached when minimizing the difference between the freeway density and the critical ratio for maximum traffic flow. In this article, a Multi-Agent Reinforcement Learning for Freeways Control (MARL-FWC) system for ramps metering and DSLs is proposed. MARL-FWC introduces a new microscopic framework at the network level based on collaborative Markov Decision Process modeling (Markov game) and an associated cooperative Q-learning algorithm. The technique incorporates payoff propagation (Max-Plus algorithm) under the coordination graphs framework, particularly suited for optimal control purposes. MARL-FWC provides three control designs: fully independent, fully distributed, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Transportation Planning and Optimization · Traffic Prediction and Management Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Q-Learning
