Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Zhejun Zhang, Peter Karkus, Maximilian Igl, Wenhao Ding, Yuxiao Chen,, Boris Ivanovic, Marco Pavone

TL;DR
This paper introduces CAT-K, a closed-loop fine-tuning method for tokenized traffic models that improves simulation fidelity without reinforcement learning, outperforming larger models in traffic simulation benchmarks.
Contribution
The paper proposes a novel closed-loop fine-tuning strategy called CAT-K that enhances tokenized traffic models using only existing trajectory data, avoiding complex reinforcement learning.
Findings
CAT-K outperforms larger models in traffic simulation tasks.
A small 7M-parameter model surpasses a 102M-parameter model.
Achieved top results on the Waymo Sim Agent Challenge leaderboard.
Abstract
Traffic simulation aims to learn a policy for traffic agents that, when unrolled in closed-loop, faithfully recovers the joint distribution of trajectories observed in the real world. Inspired by large language models, tokenized multi-agent policies have recently become the state-of-the-art in traffic simulation. However, they are typically trained through open-loop behavior cloning, and thus suffer from covariate shift when executed in closed-loop during simulation. In this work, we present Closest Among Top-K (CAT-K) rollouts, a simple yet effective closed-loop fine-tuning strategy to mitigate covariate shift. CAT-K fine-tuning only requires existing trajectory data, without reinforcement learning or generative adversarial imitation. Concretely, CAT-K fine-tuning enables a small 7M-parameter tokenized traffic simulation policy to outperform a 102M-parameter model from the same model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Real-time simulation and control systems · Traffic control and management
