RealDriveSim: A Realistic Multi-Modal Multi-Task Synthetic Dataset for Autonomous Driving

Arpit Jadon; Haoran Wang; Phillip Thomas; Michael Stanley; S. Nathaniel Cibik; Rachel Laurat; Omar Maher; Lukas Hoyer; Ozan Unal; Dengxin Dai

arXiv:2506.16319·cs.CV·June 23, 2025

Open Access

TL;DR

RealDriveSim is a comprehensive synthetic dataset for autonomous driving that offers multi-modal data and detailed annotations, enabling improved model training across various perception tasks with reduced annotation costs.

Contribution

It introduces a realistic, multi-modal synthetic dataset supporting multiple tasks and classes, filling gaps in existing datasets for autonomous driving research.

Findings

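01. Achieves state-of-the-art results on multiple perception tasks compared to existing synthetic benchmarks

02. Supports both 2D computer vision and LiDAR applications, with fine-grained annotations for up to 64 classes

03. Demonstrates broad applicability across domains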

Abstract

As perception models continue to develop, the need for large-scale datasets increases. However, data annotation remains far too expensive to effectively scale and meet the demand. Synthetic datasets provide a solution to boost model performance with substantially reduced costs. However, current synthetic datasets remain limited in scope and realism, and are designed for specific tasks and applications. In this work, we present RealDriveSim, a realistic multi-modal synthetic dataset for autonomous driving that not only supports popular 2D computer vision applications but also their LiDAR counterparts, providing fine-grained annotations for up to 64 classes. We extensively evaluate our dataset for a wide range of applications and domains, demonstrating state-of-the-art results compared to existing synthetic benchmarks. The dataset is publicly available at…

Taxonomy

Topics: Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications