Learning to drive from a world on rails

Dian Chen; Vladlen Koltun; Philipp Kr\"ahenb\"uhl

arXiv:2105.00636·cs.RO·October 5, 2021

Learning to drive from a world on rails

Dian Chen, Vladlen Koltun, Philipp Kr\"ahenb\"uhl

PDF

1 Repo

TL;DR

This paper introduces a model-based, vision-driven driving policy learned from pre-recorded logs under a world-on-rails assumption, achieving high performance and sample efficiency in simulation benchmarks.

Contribution

It proposes a novel approach that simplifies learning by assuming a nonreactive environment, enabling efficient training of a vision-based driving policy from pre-recorded data.

Findings

01

Ranks first on the CARLA leaderboard with 25% higher score

02

Uses 40 times less data than previous methods

03

Outperforms state-of-the-art model-free RL in ProcGen benchmarks

Abstract

We learn an interactive vision-based driving policy from pre-recorded driving logs via a model-based approach. A forward model of the world supervises a driving policy that predicts the outcome of any potential driving trajectory. To support learning from pre-recorded logs, we assume that the world is on rails, meaning neither the agent nor its actions influence the environment. This assumption greatly simplifies the learning problem, factorizing the dynamics into a nonreactive world model and a low-dimensional and compact forward model of the ego-vehicle. Our approach computes action-values for each training trajectory using a tabular dynamic-programming evaluation of the Bellman equations; these action-values in turn supervise the final vision-based driving policy. Despite the world-on-rails assumption, the final driving policy acts well in a dynamic and reactive world. At the time of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dotchen/WorldOnRails
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator