TL;DR
AutoWorld introduces a scalable traffic simulation framework that leverages unlabeled LiDAR data and self-supervised learning to improve realism and diversity in multi-agent autonomous driving scenarios.
Contribution
It presents a novel world model trained on unlabeled data, combined with a cascaded DPP sampling and motion-aware supervision, advancing traffic simulation without extra labeling.
Findings
AutoWorld ranks first on the WOSAC benchmark leaderboard.
Simulation quality improves with more unlabeled LiDAR data.
Component ablations confirm the effectiveness of each method part.
Abstract
Multi-agent traffic simulation is central to developing and testing autonomous driving systems. Recent data-driven simulators have achieved promising results, but rely heavily on supervised learning from labeled trajectories or semantic annotations, making it costly to scale their performance. Meanwhile, large amounts of unlabeled sensor data can be collected at scale but remain largely unused by existing traffic simulation frameworks. This raises a key question: How can a method harness unlabeled data to improve traffic simulation performance? In this work, we propose AutoWorld, a traffic simulation framework that employs a world model learned from unlabeled occupancy representations of LiDAR data. Given world model samples, AutoWorld constructs a coarse-to-fine predictive scene context as input to a multi-agent motion generation model. To promote sample diversity, AutoWorld uses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
