Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems
Xingchen Zou, Yuhao Yang, Zheng Chen, Xixuan Hao, Yiqi Chen, Chao Huang, Yuxuan Liang

TL;DR
Traffic-R1 is a 3B-parameter foundation model that enables human-like reasoning in traffic signal control, offering zero-shot generalization, real-time edge deployment, and explainability, significantly improving traffic management efficiency.
Contribution
This paper introduces Traffic-R1, a novel foundation model for traffic signal control that combines reinforcement learning and large language model techniques for the first time.
Findings
Outperforms traditional RL and recent LLM-based methods in benchmarks.
Reduces average queue lengths by over 5% in real-world deployment.
Halves operator workload in traffic management.
Abstract
We introduce Traffic-R1, a 3B-parameter foundation model with human-like reasoning for Traffic signal control (TSC), developed via self-exploration and iterative reinforcement of LLM with expert guidance in a simulated traffic environment. Compared with traditional reinforcement learning and recent LLM-based methods, Traffic-R1 offers three main advantages: zero-shot generalization, transferring unchanged to new road networks and out-of-distribution incidents by leveraging internal traffic-control policies and reasoning; a compact 3B-parameter design that supports real-time inference on mobile-class chips for edge deployment; and an explainable TSC process that enables multi-intersection coordination through communication and an asynchronous communication network. Extensive benchmarks show Traffic-R1 outperforms strong baselines and training-intensive RL controllers. In production, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Vehicular Ad Hoc Networks (VANETs) · Traffic Prediction and Management Techniques
