Proximal Policy Optimization Learning based Control of Congested Freeway Traffic
Shurong Mo, Nailong Wu, Jie Qi, Anqi Pan, Zhiguang Feng, Huaicheng, Yan, Yueying Wang

TL;DR
This paper introduces a reinforcement learning-based control method using proximal policy optimization to stabilize congested freeway traffic, demonstrating improved performance and robustness over traditional control methods through numerical simulations.
Contribution
It develops a PPO-based delay-compensated feedback controller for traffic flow stabilization that learns control gains without explicit system dynamics knowledge.
Findings
PPO control converges faster with less effort in delay-free systems.
PPO performs comparably to backstepping control under input delay conditions.
PPO exhibits robustness to parameter perturbations, unlike backstepping control.
Abstract
This study proposes a delay-compensated feedback controller based on proximal policy optimization (PPO) reinforcement learning to stabilize traffic flow in the congested regime by manipulating the time-gap of adaptive cruise control-equipped (ACC-equipped) vehicles.The traffic dynamics on a freeway segment are governed by an Aw-Rascle-Zhang (ARZ) model, consisting of nonlinear first-order partial differential equations (PDEs).Inspired by the backstepping delay compensator [18] but different from whose complex segmented control scheme, the PPO control is composed of three feedbacks, namely the current traffic flow velocity, the current traffic flow density and previous one step control input. The control gains for the three feedbacks are learned from the interaction between the PPO and the numerical simulator of the traffic system without knowing the system dynamics.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques
MethodsEntropy Regularization · Proximal Policy Optimization
