SAFENet: Self-Supervised Monocular Depth Estimation with Semantic-Aware Feature Extraction
Jaehoon Choi, Dongki Jung, Donghwan Lee, Changick Kim

TL;DR
SAFENet improves self-supervised monocular depth estimation by integrating semantic-aware features, leading to more accurate, robust, and generalizable depth predictions without groundtruth depth maps.
Contribution
The paper introduces a novel multi-task learning approach that incorporates semantic information into depth features for self-supervised monocular depth estimation.
Findings
Outperforms state-of-the-art methods on KITTI dataset
Demonstrates better generalization across different datasets
Shows robustness under low-light and adverse weather conditions
Abstract
Self-supervised monocular depth estimation has emerged as a promising method because it does not require groundtruth depth maps during training. As an alternative for the groundtruth depth map, the photometric loss enables to provide self-supervision on depth prediction by matching the input image frames. However, the photometric loss causes various problems, resulting in less accurate depth values compared with supervised approaches. In this paper, we propose SAFENet that is designed to leverage semantic information to overcome the limitations of the photometric loss. Our key idea is to exploit semantic-aware depth features that integrate the semantic and geometric knowledge. Therefore, we introduce multi-task learning schemes to incorporate semantic-awareness into the representation of depth features. Experiments on KITTI dataset demonstrate that our methods compete or even outperform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
