Learning Depth from Monocular Videos Using Synthetic Data: A Temporally-Consistent Domain Adaptation Approach
Yipeng Mou, Mingming Gong, Huan Fu, Kayhan Batmanghelich, Kun Zhang,, Dacheng Tao

TL;DR
This paper introduces a novel domain adaptation approach that leverages synthetic videos with ground-truth depth labels to improve monocular depth estimation from real videos, addressing the style gap and temporal consistency issues.
Contribution
It proposes a temporally-consistent domain adaptation method that uses synthetic data with ground-truth labels and temporal constraints to enhance depth prediction in real videos.
Findings
Achieves comparable performance to state-of-the-art methods.
Effectively filters moving regions using learned moving masks.
Utilizes synthetic data to reduce reliance on expensive real-world labels.
Abstract
Majority of state-of-the-art monocular depth estimation methods are supervised learning approaches. The success of such approaches heavily depends on the high-quality depth labels which are expensive to obtain. Some recent methods try to learn depth networks by leveraging unsupervised cues from monocular videos which are easier to acquire but less reliable. In this paper, we propose to resolve this dilemma by transferring knowledge from synthetic videos with easily obtainable ground-truth depth labels. Due to the stylish difference between synthetic and real images, we propose a temporally-consistent domain adaptation (TCDA) approach that simultaneously explores labels in the synthetic domain and temporal constraints in the videos to improve style transfer and depth prediction. Furthermore, we make use of the ground-truth optical flow and pose information in the synthetic data to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
