SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset   for Autonomous Driving

Jianhua Han; Xiwen Liang; Hang Xu; Kai Chen; Lanqing Hong; Jiageng; Mao; Chaoqiang Ye; Wei Zhang; Zhenguo Li; Xiaodan Liang; Chunjing Xu

arXiv:2106.11118·cs.CV·November 9, 2021·33 cites

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving

Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Jiageng, Mao, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu

PDF

Open Access

TL;DR

SODA10M is the largest-scale dataset for autonomous driving, enabling self/semi-supervised learning with 10 million unlabeled images and 20K labeled images across diverse conditions, facilitating robust detection models.

Contribution

This paper introduces SODA10M, the first large-scale autonomous driving dataset for self/semi-supervised learning, with extensive diversity and detailed analysis of existing methods.

Findings

01

SODA10M improves pre-training for autonomous driving detection models.

02

Self/semi-supervised methods benefit significantly from SODA10M pre-training.

03

The dataset enables better generalization across diverse driving scenarios.

Abstract

Aiming at facilitating a real-world, ever-evolving and scalable autonomous driving system, we present a large-scale dataset for standardizing the evaluation of different self-supervised and semi-supervised approaches by learning from raw data, which is the first and largest dataset to date. Existing autonomous driving systems heavily rely on `perfect' visual perception models (i.e., detection) trained using extensive annotated data to ensure safety. However, it is unrealistic to elaborately label instances of all scenarios and circumstances (i.e., night, extreme weather, cities) when deploying a robust autonomous driving system. Motivated by recent advances of self-supervised and semi-supervised learning, a promising direction is to learn a robust detection model by collaboratively exploiting large-scale unlabeled data and few labeled data. Existing datasets either provide only a small…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods