Model-Based Imitation Learning for Urban Driving
Anthony Hu, Gianluca Corrado, Nicolas Griffiths, Zak Murez, and Corina Gurau, Hudson Yeo, Alex Kendall, Roberto Cipolla and, Jamie Shotton

TL;DR
MILE is a novel model-based imitation learning approach that uses 3D geometry and high-resolution videos to learn a compact environment model and policy for urban autonomous driving, improving generalization and interpretability.
Contribution
It introduces the first camera-only method that jointly models static and dynamic scenes along with ego-behavior for urban driving, trained entirely offline.
Findings
Achieves 31% higher driving score on CARLA in new environments.
Predicts diverse, plausible states and actions with interpretability.
Executes complex maneuvers from imagined plans.
Abstract
An accurate model of the environment and the dynamic agents acting in it offers great potential for improving motion planning. We present MILE: a Model-based Imitation LEarning approach to jointly learn a model of the world and a policy for autonomous driving. Our method leverages 3D geometry as an inductive bias and learns a highly compact latent space directly from high-resolution videos of expert demonstrations. Our model is trained on an offline corpus of urban driving data, without any online interaction with the environment. MILE improves upon prior state-of-the-art by 31% in driving score on the CARLA simulator when deployed in a completely new town and new weather conditions. Our model can predict diverse and plausible states and actions, that can be interpretably decoded to bird's-eye view semantic segmentation. Further, we demonstrate that it can execute complex driving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Autonomous Vehicle Technology and Safety · Human Motion and Animation
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
