Domain Adaptive Monocular Depth Estimation With Semantic Information
Fei Lu, Hyeonwoo Yu, Jean Oh

TL;DR
This paper introduces a semantic-aware adversarial training approach for monocular depth estimation that improves domain adaptation, achieving state-of-the-art results on datasets like KITTI and Cityscapes.
Contribution
It proposes a novel adversarial model that incorporates semantic information to better align source and target domains in monocular depth estimation.
Findings
Achieves state-of-the-art performance on KITTI and Cityscapes datasets.
Improves boundary and far-distance object depth estimation.
Uses a compact model with competitive accuracy.
Abstract
The advent of deep learning has brought an impressive advance to monocular depth estimation, e.g., supervised monocular depth estimation has been thoroughly investigated. However, the large amount of the RGB-to-depth dataset may not be always available since collecting accurate depth ground truth according to the RGB image is a time-consuming and expensive task. Although the network can be trained on an alternative dataset to overcome the dataset scale problem, the trained model is hard to generalize to the target domain due to the domain discrepancy. Adversarial domain alignment has demonstrated its efficacy to mitigate the domain shift on simple image classification tasks in previous works. However, traditional approaches hardly handle the conditional alignment as they solely consider the feature map of the network. In this paper, we propose an adversarial training model that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
