SelfD: Self-Learning Large-Scale Driving Policies From the Web
Jimuyang Zhang, Ruizhao Zhu, Eshed Ohn-Bar

TL;DR
SelfD is a novel self-learning framework that leverages large-scale unlabeled web videos to train robust, scalable driving policies capable of generalizing across diverse scenarios without additional data collection.
Contribution
The paper introduces SelfD, a semi-supervised, image-based planning model trained on unlabeled web videos, enabling scalable and generalized autonomous driving without extra annotation.
Findings
SelfD improves driving performance by up to 24% on multiple benchmarks.
It effectively utilizes unlabeled web videos for training.
The approach enhances robustness across diverse navigation scenarios.
Abstract
Effectively utilizing the vast amounts of ego-centric navigation data that is freely available on the internet can advance generalized intelligent systems, i.e., to robustly scale across perspectives, platforms, environmental conditions, scenarios, and geographical locations. However, it is difficult to directly leverage such large amounts of unlabeled and highly diverse data for complex 3D reasoning and planning tasks. Consequently, researchers have primarily focused on its use for various auxiliary pixel- and image-level computer vision tasks that do not consider an ultimate navigational objective. In this work, we introduce SelfD, a framework for learning scalable driving by utilizing large amounts of online monocular images. Our key idea is to leverage iterative semi-supervised training when learning imitative agents from unlabeled data. To handle unconstrained viewpoints, scenes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
