Two-in-One Depth: Bridging the Gap Between Monocular and Binocular Self-supervised Depth Estimation
Zhengming Zhou, Qiulei Dong

TL;DR
This paper introduces TiO-Depth, a unified self-supervised network capable of performing both monocular and binocular depth estimation, improving accuracy and versatility over separate models.
Contribution
The authors propose a novel Siamese architecture with a Monocular Feature Matching module and a multi-stage training strategy for joint monocular and binocular depth estimation.
Findings
TiO-Depth outperforms state-of-the-art monocular and binocular methods on KITTI, Cityscapes, and DDAD datasets.
The model effectively handles both tasks with a single network architecture.
Joint training enhances depth prediction accuracy in both monocular and binocular scenarios.
Abstract
Monocular and binocular self-supervised depth estimations are two important and related tasks in computer vision, which aim to predict scene depths from single images and stereo image pairs respectively. In literature, the two tasks are usually tackled separately by two different kinds of models, and binocular models generally fail to predict depth from single images, while the prediction accuracy of monocular models is generally inferior to binocular models. In this paper, we propose a Two-in-One self-supervised depth estimation network, called TiO-Depth, which could not only compatibly handle the two tasks, but also improve the prediction accuracy. TiO-Depth employs a Siamese architecture and each sub-network of it could be used as a monocular depth estimation model. For binocular depth estimation, a Monocular Feature Matching module is proposed for incorporating the stereo knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical Coherence Tomography Applications
Methodsfail
