Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future
Hongyang Li, Yang Li, Huijie Wang, Jia Zeng, Huilin Xu and, Pinlong Cai, Li Chen, Junchi Yan, Feng Xu, Lu Xiong, Jingdong, Wang, Futang Zhu, Chunjing Xu, Tiancai Wang, Fei Xia, Beipeng, Mu, Zhihui Peng, Dahua Lin, Yu Qiao

TL;DR
This paper systematically reviews open-source autonomous driving datasets, highlighting their evolution from perception-focused to multi-task datasets, and discusses future directions, challenges, and the role of data generation technologies in advancing autonomous driving.
Contribution
It provides a comprehensive assessment of over seventy datasets, analyzes their characteristics, and explores future dataset development and technical challenges in autonomous driving.
Findings
First-generation datasets are perception-focused with limited sensor modalities.
Second-generation datasets feature increased complexity and task diversity.
Future datasets should incorporate higher data scales and multi-modal sensors.
Abstract
With the continuous maturation and application of autonomous driving technology, a systematic examination of open-source autonomous driving datasets becomes instrumental in fostering the robust evolution of the industry ecosystem. Current autonomous driving datasets can broadly be categorized into two generations. The first-generation autonomous driving datasets are characterized by relatively simpler sensor modalities, smaller data scale, and is limited to perception-level tasks. KITTI, introduced in 2012, serves as a prominent representative of this initial wave. In contrast, the second-generation datasets exhibit heightened complexity in sensor modalities, greater data scale and diversity, and an expansion of tasks from perception to encompass prediction and control. Leading examples of the second generation include nuScenes and Waymo, introduced around 2019. This comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · IoT and Edge/Fog Computing
