VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture

Maonan Wang; Yirong Chen; Aoyu Pang; Yuxin Cai; Chung Shue Chen; Yuheng Kan; Man-On Pun

arXiv:2505.19486·eess.SY·December 12, 2025

VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture

Maonan Wang, Yirong Chen, Aoyu Pang, Yuxin Cai, Chung Shue Chen, Yuheng Kan, Man-On Pun

PDF

Open Access 1 Video

TL;DR

VLMLight is a novel traffic signal control framework that combines vision-language meta-control with dual-branch reasoning, improving safety and efficiency in complex urban scenarios by leveraging multi-view perception and large language models.

Contribution

It introduces the first image-based traffic simulator and a hybrid control system using LLMs for safety-critical decision-making in traffic signal control.

Findings

01

Reduces emergency vehicle waiting times by up to 65%.

02

Maintains real-time performance with less than 1% degradation.

03

Provides a scalable, interpretable, safety-aware traffic control solution.

Abstract

Traffic signal control (TSC) is a core challenge in urban mobility, where real-time decisions must balance efficiency and safety. Existing methods - ranging from rule-based heuristics to reinforcement learning (RL) - often struggle to generalize to complex, dynamic, and safety-critical scenarios. We introduce VLMLight, a novel TSC framework that integrates vision-language meta-control with dual-branch reasoning. At the core of VLMLight is the first image-based traffic simulator that enables multi-view visual perception at intersections, allowing policies to reason over rich cues such as vehicle type, motion, and spatial density. A large language model (LLM) serves as a safety-prioritized meta-controller, selecting between a fast RL policy for routine traffic and a structured reasoning branch for critical cases. In the latter, multiple LLM agents collaborate to assess traffic phases,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture· slideslive

Taxonomy

TopicsSemantic Web and Ontologies · Business Process Modeling and Analysis · Logic, Reasoning, and Knowledge