SelfD: Self-Learning Large-Scale Driving Policies From the Web

Jimuyang Zhang; Ruizhao Zhu; Eshed Ohn-Bar

arXiv:2204.10320·cs.CV·April 22, 2022·1 cites

SelfD: Self-Learning Large-Scale Driving Policies From the Web

Jimuyang Zhang, Ruizhao Zhu, Eshed Ohn-Bar

PDF

Open Access

TL;DR

SelfD is a novel self-learning framework that leverages large-scale unlabeled web videos to train robust, scalable driving policies capable of generalizing across diverse scenarios without additional data collection.

Contribution

The paper introduces SelfD, a semi-supervised, image-based planning model trained on unlabeled web videos, enabling scalable and generalized autonomous driving without extra annotation.

Findings

01

SelfD improves driving performance by up to 24% on multiple benchmarks.

02

It effectively utilizes unlabeled web videos for training.

03

The approach enhances robustness across diverse navigation scenarios.

Abstract

Effectively utilizing the vast amounts of ego-centric navigation data that is freely available on the internet can advance generalized intelligent systems, i.e., to robustly scale across perspectives, platforms, environmental conditions, scenarios, and geographical locations. However, it is difficult to directly leverage such large amounts of unlabeled and highly diverse data for complex 3D reasoning and planning tasks. Consequently, researchers have primarily focused on its use for various auxiliary pixel- and image-level computer vision tasks that do not consider an ultimate navigational objective. In this work, we introduce SelfD, a framework for learning scalable driving by utilizing large amounts of online monocular images. Our key idea is to leverage iterative semi-supervised training when learning imitative agents from unlabeled data. To handle unconstrained viewpoints, scenes,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator