Two-in-One Depth: Bridging the Gap Between Monocular and Binocular   Self-supervised Depth Estimation

Zhengming Zhou; Qiulei Dong

arXiv:2309.00933·cs.CV·September 6, 2023·1 cites

Two-in-One Depth: Bridging the Gap Between Monocular and Binocular Self-supervised Depth Estimation

Zhengming Zhou, Qiulei Dong

PDF

Open Access 1 Repo

TL;DR

This paper introduces TiO-Depth, a unified self-supervised network capable of performing both monocular and binocular depth estimation, improving accuracy and versatility over separate models.

Contribution

The authors propose a novel Siamese architecture with a Monocular Feature Matching module and a multi-stage training strategy for joint monocular and binocular depth estimation.

Findings

01

TiO-Depth outperforms state-of-the-art monocular and binocular methods on KITTI, Cityscapes, and DDAD datasets.

02

The model effectively handles both tasks with a single network architecture.

03

Joint training enhances depth prediction accuracy in both monocular and binocular scenarios.

Abstract

Monocular and binocular self-supervised depth estimations are two important and related tasks in computer vision, which aim to predict scene depths from single images and stereo image pairs respectively. In literature, the two tasks are usually tackled separately by two different kinds of models, and binocular models generally fail to predict depth from single images, while the prediction accuracy of monocular models is generally inferior to binocular models. In this paper, we propose a Two-in-One self-supervised depth estimation network, called TiO-Depth, which could not only compatibly handle the two tasks, but also improve the prediction accuracy. TiO-Depth employs a Siamese architecture and each sub-network of it could be used as a monocular depth estimation model. For binocular depth estimation, a Monocular Feature Matching module is proposed for incorporating the stereo knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zm-zhou/tio-depth_pytorch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical Coherence Tomography Applications

Methodsfail