Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach
Xingshuai Huang, Di Wu, and Benoit Boulet

TL;DR
This paper introduces DTLight, a lightweight, offline-trained transformer-based traffic signal control method that outperforms existing online RL approaches and can be efficiently fine-tuned for real-world deployment.
Contribution
The paper presents DTLight, a novel lightweight transformer-based TSC method utilizing knowledge distillation and adapters for efficient offline training and online adaptation.
Findings
DTLight outperforms state-of-the-art online RL methods in synthetic and real-world scenarios.
Online fine-tuning of DTLight improves performance by up to 42.6%.
The approach reduces computational costs and enhances practicality for real-world traffic control.
Abstract
Efficient traffic signal control is critical for reducing traffic congestion and improving overall transportation efficiency. The dynamic nature of traffic flow has prompted researchers to explore Reinforcement Learning (RL) for traffic signal control (TSC). Compared with traditional methods, RL-based solutions have shown preferable performance. However, the application of RL-based traffic signal controllers in the real world is limited by the low sample efficiency and high computational requirements of these solutions. In this work, we propose DTLight, a simple yet powerful lightweight Decision Transformer-based TSC method that can learn policy from easily accessible offline datasets. DTLight novelly leverages knowledge distillation to learn a lightweight controller from a well-trained larger teacher model to reduce implementation computation. Additionally, it integrates adapter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques
MethodsKnowledge Distillation · Adapter
