Proximal Policy Optimization Learning based Control of Congested Freeway   Traffic

Shurong Mo; Nailong Wu; Jie Qi; Anqi Pan; Zhiguang Feng; Huaicheng; Yan; Yueying Wang

arXiv:2204.05627·cs.AI·January 18, 2023

Proximal Policy Optimization Learning based Control of Congested Freeway Traffic

Shurong Mo, Nailong Wu, Jie Qi, Anqi Pan, Zhiguang Feng, Huaicheng, Yan, Yueying Wang

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based control method using proximal policy optimization to stabilize congested freeway traffic, demonstrating improved performance and robustness over traditional control methods through numerical simulations.

Contribution

It develops a PPO-based delay-compensated feedback controller for traffic flow stabilization that learns control gains without explicit system dynamics knowledge.

Findings

01

PPO control converges faster with less effort in delay-free systems.

02

PPO performs comparably to backstepping control under input delay conditions.

03

PPO exhibits robustness to parameter perturbations, unlike backstepping control.

Abstract

This study proposes a delay-compensated feedback controller based on proximal policy optimization (PPO) reinforcement learning to stabilize traffic flow in the congested regime by manipulating the time-gap of adaptive cruise control-equipped (ACC-equipped) vehicles.The traffic dynamics on a freeway segment are governed by an Aw-Rascle-Zhang (ARZ) model, consisting of $2 \times 2$ nonlinear first-order partial differential equations (PDEs).Inspired by the backstepping delay compensator [18] but different from whose complex segmented control scheme, the PPO control is composed of three feedbacks, namely the current traffic flow velocity, the current traffic flow density and previous one step control input. The control gains for the three feedbacks are learned from the interaction between the PPO and the numerical simulator of the traffic system without knowing the system dynamics.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Traffic Prediction and Management Techniques

MethodsEntropy Regularization · Proximal Policy Optimization