Data efficient reinforcement learning and adaptive optimal perimeter   control of network traffic dynamics

C. Chen; Y. P. Huang; W. H. K. Lam; T. L. Pan; S. C. Hsu; A. Sumalee,; R. X. Zhong

arXiv:2209.05726·eess.SY·September 14, 2022

Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics

C. Chen, Y. P. Huang, W. H. K. Lam, T. L. Pan, S. C. Hsu, A. Sumalee,, R. X. Zhong

PDF

Open Access

TL;DR

This paper introduces an integral reinforcement learning approach for adaptive, data-efficient, and robust perimeter traffic control that learns system dynamics without requiring precise models, improving real-time traffic management.

Contribution

The work develops a model-free IRL-based control method with experience replay, continuous-time control, and neural network approximation, enhancing robustness and efficiency in traffic regulation.

Findings

01

The proposed IRL method converges and stabilizes traffic dynamics.

02

Experience replay improves data efficiency and learning speed.

03

Numerical simulations demonstrate effectiveness in real-world scenarios.

Abstract

Existing data-driven and feedback traffic control strategies do not consider the heterogeneity of real-time data measurements. Besides, traditional reinforcement learning (RL) methods for traffic control usually converge slowly for lacking data efficiency. Moreover, conventional optimal perimeter control schemes require exact knowledge of the system dynamics and thus would be fragile to endogenous uncertainties. To handle these challenges, this work proposes an integral reinforcement learning (IRL) based approach to learning the macroscopic traffic dynamics for adaptive optimal perimeter control. This work makes the following primary contributions to the transportation literature: (a) A continuous-time control is developed with discrete gain updates to adapt to the discrete-time sensor data. (b) To reduce the sampling complexity and use the available data more efficiently, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Traffic Prediction and Management Techniques

MethodsExperience Replay