Self-Supervised Learning of Domain Invariant Features for Depth Estimation
Hiroyasu Akada, Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka

TL;DR
This paper introduces a novel self-supervised training strategy for depth estimation that learns domain-invariant features by combining style transfer and Siamese networks, improving real-world generalization.
Contribution
It extends self-supervised learning to domain-invariant representation learning using image translation and Siamese networks for depth estimation.
Findings
Outperforms state-of-the-art on KITTI and Make3D datasets.
Achieves 14.7% improvement on Sq Rel metric on KITTI.
Enhances generalization to real-world data.
Abstract
We tackle the problem of unsupervised synthetic-to-real domain adaptation for single image depth estimation. An essential building block of single image depth estimation is an encoder-decoder task network that takes RGB images as input and produces depth maps as output. In this paper, we propose a novel training strategy to force the task network to learn domain invariant representations in a selfsupervised manner. Specifically, we extend self-supervised learning from traditional representation learning, which works on images from a single domain, to domain invariant representation learning, which works on images from two different domains by utilizing an image-to-image translation network. Firstly, we use an image-to-image translation network to transfer domain-specific styles between synthetic and real domains. This style transfer operation allows us to obtain similar images from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Self-Supervised Learning of Domain Invariant Features for Depth Estimation· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
MethodsSiamese Network
