Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems

Xingchen Zou; Yuhao Yang; Zheng Chen; Xixuan Hao; Yiqi Chen; Chao Huang; Yuxuan Liang

arXiv:2508.02344·cs.AI·October 23, 2025

Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems

Xingchen Zou, Yuhao Yang, Zheng Chen, Xixuan Hao, Yiqi Chen, Chao Huang, Yuxuan Liang

PDF

Open Access 1 Models

TL;DR

Traffic-R1 is a 3B-parameter foundation model that enables human-like reasoning in traffic signal control, offering zero-shot generalization, real-time edge deployment, and explainability, significantly improving traffic management efficiency.

Contribution

This paper introduces Traffic-R1, a novel foundation model for traffic signal control that combines reinforcement learning and large language model techniques for the first time.

Findings

01

Outperforms traditional RL and recent LLM-based methods in benchmarks.

02

Reduces average queue lengths by over 5% in real-world deployment.

03

Halves operator workload in traffic management.

Abstract

We introduce Traffic-R1, a 3B-parameter foundation model with human-like reasoning for Traffic signal control (TSC), developed via self-exploration and iterative reinforcement of LLM with expert guidance in a simulated traffic environment. Compared with traditional reinforcement learning and recent LLM-based methods, Traffic-R1 offers three main advantages: zero-shot generalization, transferring unchanged to new road networks and out-of-distribution incidents by leveraging internal traffic-control policies and reasoning; a compact 3B-parameter design that supports real-time inference on mobile-class chips for edge deployment; and an explainable TSC process that enables multi-intersection coordination through communication and an asynchronous communication network. Extensive benchmarks show Traffic-R1 outperforms strong baselines and training-intensive RL controllers. In production, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Season998/Traffic-R1
model· ♡ 6
♡ 6

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Vehicular Ad Hoc Networks (VANETs) · Traffic Prediction and Management Techniques