Enactor: From Traffic Simulators to Surrogate World Models
Yash Ranjan, Rahul Sengupta, Anand Rangarajan, Sanjay Ranka

TL;DR
This paper introduces Enactor, a transformer-based generative model that captures realistic traffic actor interactions and generates physically consistent long-term trajectories, improving traffic simulation accuracy and efficiency.
Contribution
Enactor is the first actor-centric model that explicitly captures complex interactions and intersection dynamics using transformers, outperforming traditional methods with fewer training samples.
Findings
Effectively captures complex actor-actor interactions.
Generates physically consistent long-horizon trajectories.
Outperforms baseline by over 10x in KL-Divergence.
Abstract
Traffic microsimulators are widely used to evaluate road network performance under various ``what-if" conditions. However, the behavior models controlling the actions of the actors are overly simplistic and fails to capture realistic actor-actor interactions. Deep learning-based methods have been applied to model vehicles and pedestrians as ``agents" responding to their surrounding ``environment" (including lanes, signals, and neighboring agents). Although effective in learning actor-actor interaction, these approaches fail to generate physically consistent trajectories over long time periods, and they do not explicitly address the complex dynamics that arise at traffic intersections which is a critical location in urban networks. Inspired by the World Model paradigm, we have developed an actor centric generative model using transformer-based architecture that is able to capture the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Evacuation and Crowd Dynamics
