Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by   Implicitly Unprojecting to 3D

Jonah Philion; Sanja Fidler

arXiv:2008.05711·cs.CV·August 14, 2020

Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D

Jonah Philion, Sanja Fidler

PDF

1 Repo

TL;DR

This paper introduces a novel end-to-end method for generating bird's-eye-view scene representations from images captured by arbitrary camera rigs, improving perception accuracy and robustness for autonomous vehicle navigation.

Contribution

The authors propose a new architecture that lifts images into 3D frustums and splats them into a unified bird's-eye-view, enabling robust multi-camera fusion and end-to-end motion planning.

Findings

01

Outperforms all baselines on object and map segmentation tasks

02

Robust to calibration errors in camera rigs

03

Enables interpretable motion planning using bird's-eye-view representations

Abstract

The goal of perception for autonomous vehicles is to extract semantic representations from multiple sensors and fuse these representations into a single "bird's-eye-view" coordinate frame for consumption by motion planning. We propose a new end-to-end architecture that directly extracts a bird's-eye-view representation of a scene given image data from an arbitrary number of cameras. The core idea behind our approach is to "lift" each image individually into a frustum of features for each camera, then "splat" all frustums into a rasterized bird's-eye-view grid. By training on the entire camera rig, we provide evidence that our model is able to learn not only how to represent images but how to fuse predictions from all cameras into a single cohesive representation of the scene while being robust to calibration error. On standard bird's-eye-view tasks such as object segmentation and map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nv-tlabs/lift-splat-shoot
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.