Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion

Vitor Guizilini; Rares Ambrus; Wolfram Burgard; Adrien Gaidon

arXiv:2103.16690·cs.CV·April 1, 2021

TL;DR

This paper introduces Sparse Auxiliary Networks (SANs), a module that lets a single monocular depth network perform depth prediction from RGB images alone, or depth completion when sparse depth measurements are also available, while achieving state-of-the-art depth prediction results.

Contribution

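We propose SANs, a new module that allows a single network to perform either depth prediction or depth completion, depending on whether sparse depth is available at inference time. SANs encode the sparse depth map with sparse convolutions and inject the resulting features into the skip connections of the depth prediction network.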

Findings

01. SANs achieve state-of-the-art depth prediction accuracy.

02. A single network with SANs handles both depth prediction and depth completion.

03. Experiments on one indoor benchmark (NYUv2) and two outdoor benchmarks (KITTI and DDAD) support these findings.

Abstract

Estimating scene geometry from data obtained with cost-effective sensors is key for robots and self-driving cars. In this paper, we study the problem of predicting dense depth from a single RGB image (monodepth) with optional sparse measurements from low-cost active depth sensors. We introduce Sparse Auxiliary Networks (SANs), a new module enabling monodepth networks to perform both the tasks of depth prediction and completion, depending on whether only RGB images or also sparse point clouds are available at inference time. First, we decouple the image and depth map encoding stages using sparse convolutions to process only the valid depth map pixels. Second, we inject this information, when available, into the skip connections of the depth prediction network, augmenting its features. Through extensive experimental analysis on one indoor (NYUv2) and two outdoor (KITTI and DDAD)…
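The two steps named in the abstract (encode only the valid depth pixels, then inject the result into a skip connection when it is available) can be sketched in a few lines. This is a minimal illustration in plain Python, not the authors' implementation: `masked_conv3x3` is a dense, mask-normalized stand-in for a true sparse convolution, and `inject` assumes simple additive fusion with a scalar `weight`. Both function names, the uniform kernel, and the fusion rule are assumptions made for this example.

```python
from typing import List, Optional

Grid = List[List[float]]


def masked_conv3x3(depth: Grid, kernel: Grid) -> Grid:
    """3x3 convolution that reads only valid (non-zero) depth pixels.

    Assumption: each output is the kernel-weighted average of the valid
    pixels in its window, so missing pixels do not drag the value to zero.
    Windows with no valid pixel produce 0. Kernel weights must be positive.
    """
    h, w = len(depth), len(depth[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc, norm = 0.0, 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    inside = 0 <= yy < h and 0 <= xx < w
                    if inside and depth[yy][xx] > 0:
                        k = kernel[dy + 1][dx + 1]
                        acc += k * depth[yy][xx]
                        norm += k
            if norm > 0:
                out[y][x] = acc / norm
    return out


def inject(skip: Grid, depth_feat: Optional[Grid], weight: float = 1.0) -> Grid:
    """Augment a skip-connection feature map with depth features.

    Assumption: additive fusion. With depth_feat=None (RGB only) the skip
    features pass through unchanged, which is the depth prediction mode.
    """
    if depth_feat is None:
        return [row[:] for row in skip]
    return [
        [s + weight * d for s, d in zip(s_row, d_row)]
        for s_row, d_row in zip(skip, depth_feat)
    ]


# Toy 4x4 example: one valid depth measurement at row 1, column 1.
skip = [[0.5] * 4 for _ in range(4)]
sparse_depth = [[0.0] * 4 for _ in range(4)]
sparse_depth[1][1] = 2.0
uniform = [[1.0] * 3 for _ in range(3)]

depth_feat = masked_conv3x3(sparse_depth, uniform)
prediction_skip = inject(skip, None)        # RGB only: depth prediction
completion_skip = inject(skip, depth_feat)  # RGB + sparse depth: completion
```

The same `inject` call serves both modes, which is the point of the design: the image branch never needs to know whether sparse depth was supplied. A real implementation would operate on multi-channel feature maps at several encoder scales rather than on a single 2D grid.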

Code & Models

Repositories

TRI-ML/packnet-sfm (PyTorch, official)

Taxonomy

Methods: Sparse Convolutions