Improving Self-Supervised Single View Depth Estimation by Masking Occlusion
Maarten Schellevis

TL;DR
This paper proposes a novel occlusion mask based on predicted depth to improve self-supervised single view depth estimation by ignoring occluded regions during training, leading to better performance on benchmarks.
Contribution
Introduction of an occlusion mask derived from depth predictions and two new loss formulations that enhance depth estimation accuracy in self-supervised learning.
Findings
Improved depth estimation performance on KITTI benchmark.
Loss functions that ignore occluded regions reduce errors caused by object motion.
Occlusion mask effectively isolates non-reconstructible regions during training.
Abstract
Single view depth estimation models can be trained from video footage using a self-supervised end-to-end approach with view synthesis as the supervisory signal. This is achieved with a framework that predicts depth and camera motion, with a loss based on reconstructing a target video frame from temporally adjacent frames. In this context, occlusion relates to parts of a scene that can be observed in the target frame but not in a frame used for image reconstruction. Since the image reconstruction is based on sampling from the adjacent frame, and occluded areas by definition cannot be sampled, reconstructed occluded areas corrupt to the supervisory signal. In previous work arXiv:1806.01260 occlusion is handled based on reconstruction error; at each pixel location, only the reconstruction with the lowest error is included in the loss. The current study aims to determine whether performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical measurement and interference techniques
