ForSim: Stepwise Forward Simulation for Traffic Policy Fine-Tuning
Keyu Chen, Wenchao Sun, Hao Cheng, Zheng Fu, Sifa Zheng

TL;DR
ForSim introduces a stepwise closed-loop traffic simulation method that enhances realism and safety in autonomous driving policy fine-tuning by modeling multimodal interactions more accurately.
Contribution
It proposes a novel stepwise forward simulation paradigm that improves traffic simulation fidelity by better modeling multimodal behaviors and interactions within a closed-loop framework.
Findings
Improves safety in traffic policy fine-tuning.
Maintains realism, efficiency, and comfort in simulations.
Enhances the fidelity and reliability of traffic simulation.
Abstract
As the foundation of closed-loop training and evaluation in autonomous driving, traffic simulation still faces two fundamental challenges: covariate shift introduced by open-loop imitation learning and limited capacity to reflect the multimodal behaviors observed in real-world traffic. Although recent frameworks such as RIFT have partially addressed these issues through group-relative optimization, their forward simulation procedures remain largely non-reactive, leading to unrealistic agent interactions within the virtual domain and ultimately limiting simulation fidelity. To address these issues, we propose ForSim, a stepwise closed-loop forward simulation paradigm. At each virtual timestep, the traffic agent propagates the virtual candidate trajectory that best spatiotemporally matches the reference trajectory through physically grounded motion dynamics, thereby preserving multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Reinforcement Learning in Robotics
