Deep Depth Completion of a Single RGB-D Image
Yinda Zhang, Thomas Funkhouser

TL;DR
This paper introduces a deep learning approach to complete missing depth information in RGB-D images by predicting surface normals and boundaries, improving depth accuracy especially on challenging surfaces.
Contribution
The authors propose a novel deep network that combines predicted surface normals and occlusion boundaries with raw depth data for improved depth completion, validated on a new benchmark dataset.
Findings
Outperforms existing depth completion methods
Effective on shiny, transparent, and distant surfaces
Validated on a new multiview RGB-D benchmark dataset
Abstract
The goal of our work is to complete the depth channel of an RGB-D image. Commodity-grade depth cameras often fail to sense depth for shiny, bright, transparent, and distant surfaces. To address this problem, we train a deep network that takes an RGB image as input and predicts dense surface normals and occlusion boundaries. Those predictions are then combined with raw depth observations provided by the RGB-D camera to solve for depths for all pixels, including those missing in the original observation. This method was chosen over others (e.g., inpainting depths directly) as the result of extensive experiments with a new depth completion benchmark dataset, where holes are filled in training data through the rendering of surface reconstructions created from multiview RGB-D scans. Experiments with different network inputs, depth representations, loss functions, optimization methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical measurement and interference techniques
