MSDPN: Monocular Depth Prediction with Partial Laser Observation using Multi-stage Neural Networks

Hyungtae Lim; Hyeonjae Gil; Hyun Myung

arXiv:2008.01405·cs.CV·August 5, 2020

TL;DR

This paper introduces MSDPN, a multi-stage neural network that predicts a dense depth map from a monocular image and 2D LiDAR data. Its multi-stage encoder-decoder architecture and Cross Stage Feature Aggregation (CSFA) address the partial observation problem inherent to a 2D LiDAR.

Contribution

The paper presents a multi-stage encoder-decoder network with Cross Stage Feature Aggregation (CSFA) for depth prediction from physically collected 2D LiDAR data, trained and evaluated on a dataset the authors acquired themselves.

Findings

01. MSDPN outperforms state-of-the-art methods in depth prediction accuracy.

02. The proposed architecture effectively mitigates partial observation problems.

03. Reference depth maps are robust even in untrained scenarios.

Abstract

In this study, a deep-learning-based multi-stage network architecture called Multi-Stage Depth Prediction Network (MSDPN) is proposed to predict a dense depth map using a 2D LiDAR and a monocular camera. Our proposed network consists of a multi-stage encoder-decoder architecture and Cross Stage Feature Aggregation (CSFA). The proposed multi-stage encoder-decoder architecture alleviates the partial observation problem caused by the characteristics of a 2D LiDAR, and CSFA prevents the multi-stage network from diluting the features and allows the network to learn the inter-spatial relationship between features better. Previous works use sub-sampled data from the ground truth as an input rather than actual 2D LiDAR data. In contrast, our approach trains the model and conducts experiments with a physically-collected 2D LiDAR dataset. To this end, we acquired our own dataset called KAIST…
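
The difference between the two kinds of input mentioned in the abstract can be illustrated with a small sketch. It is not the authors' code: it assumes an idealized 2D LiDAR whose scan plane projects onto a single image row, and the helper names (`random_subsample`, `scanline_observation`, `row_coverage`) and the synthetic depth map are hypothetical. It contrasts sparse depth sub-sampled from ground truth, which is scattered over the whole image, with a scan-line observation, which leaves every other row unobserved.

```python
import numpy as np


def random_subsample(depth, n_samples, rng):
    """Keep n_samples randomly chosen pixels of a dense depth map; zero elsewhere."""
    sparse = np.zeros_like(depth)
    flat_idx = rng.choice(depth.size, size=n_samples, replace=False)
    rows, cols = np.unravel_index(flat_idx, depth.shape)
    sparse[rows, cols] = depth[rows, cols]
    return sparse


def scanline_observation(depth, row):
    """Keep one image row only (assumption: an idealized 2D LiDAR scan plane)."""
    sparse = np.zeros_like(depth)
    sparse[row, :] = depth[row, :]
    return sparse


def row_coverage(sparse):
    """Fraction of image rows containing at least one valid (non-zero) depth."""
    return float(np.mean((sparse > 0).any(axis=1)))


rng = np.random.default_rng(0)
dense = rng.uniform(0.5, 10.0, size=(48, 64))  # synthetic depth map, metres

sub = random_subsample(dense, n_samples=64, rng=rng)  # ground-truth sub-sampling
scan = scanline_observation(dense, row=24)            # idealized 2D LiDAR input

print("row coverage, sub-sampled:", row_coverage(sub))
print("row coverage, scan line:  ", row_coverage(scan))
```

Both inputs contain the same number of valid depth values, but the scan-line input covers only one row, which is one way to picture the partial observation problem the multi-stage architecture is meant to alleviate.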
