Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation

Ning-Hsu Wang; Yu-Lun Liu

arXiv:2406.12849 · cs.CV · October 31, 2024 · 1 citation

TL;DR

This paper introduces a novel semi-supervised framework for 360-degree monocular depth estimation that leverages unlabeled data and perspective models to improve accuracy across diverse datasets and camera projections.

Contribution

It proposes a perspective distillation method using pseudo labels generated via cube projection, enabling effective training on unlabeled 360 images.
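The cube projection step can be illustrated with a minimal sketch that samples one 90-degree field-of-view face from an equirectangular image, which is the kind of input a perspective depth model could then label. The function name `cube_face_from_equirect`, the face names, the axis convention (x right, y up, z forward) and the nearest-neighbour sampling are all assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def cube_face_from_equirect(equi, face, size):
    """Sample one 90-degree-FOV cube face from an equirectangular image.

    Hypothetical helper: nearest-neighbour lookup, x right / y up / z forward.
    """
    h, w = equi.shape[:2]
    # Pixel-centre coordinates on the face plane, in [-1, 1].
    t = (np.arange(size) + 0.5) / size * 2.0 - 1.0
    u, v = np.meshgrid(t, t)  # u grows to the right, v grows downward
    one = np.ones_like(u)
    # Ray direction through every face pixel, per face (assumed layout).
    dirs = {
        "front": (u, -v, one),
        "back": (-u, -v, -one),
        "right": (one, -v, -u),
        "left": (-one, -v, u),
        "up": (u, one, v),
        "down": (u, -one, -v),
    }
    x, y, z = dirs[face]
    lon = np.arctan2(x, z)                # longitude, 0 at the image centre
    lat = np.arctan2(y, np.hypot(x, z))   # latitude, positive is up
    # Map spherical angles to equirectangular pixel indices.
    col = ((lon / (2 * np.pi) + 0.5) * w).astype(int) % w
    row = np.clip(((0.5 - lat / np.pi) * h).astype(int), 0, h - 1)
    return equi[row, col]
```

Calling this once per face name yields six perspective-like views of the panorama. A real pipeline would more likely use bilinear interpolation and would also need the inverse mapping to bring per-face depth back to the equirectangular grid.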

Findings

01. Significant accuracy improvements on Matterport3D and Stanford2D3D datasets.

02. Effective zero-shot depth estimation performance.

03. Versatile training pipeline adaptable to various 360 depth estimators.

Abstract

Accurately estimating depth in 360-degree imagery is crucial for virtual reality, autonomous navigation, and immersive media applications. Existing depth estimation methods designed for perspective-view imagery fail when applied to 360-degree images because of the different camera projections and distortions, whereas 360-degree methods perform worse because labeled data pairs are scarce. We propose a new depth estimation framework that utilizes unlabeled 360-degree data effectively. Our approach uses state-of-the-art perspective depth estimation models as teacher models to generate pseudo labels through a six-face cube projection technique, enabling efficient labeling of depth in 360-degree images. This method leverages the increasing availability of large datasets. Our approach includes two main stages: offline mask generation for invalid regions and an online semi-supervised joint training…
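The abstract excerpt stops before the training details, so the following is only an illustrative sketch of how a validity mask might gate a pseudo-label term alongside a supervised term. The L1 form, the `weight` parameter, and the function names `masked_l1` and `joint_loss` are assumptions for illustration; they are not the loss stated in the paper.

```python
import numpy as np

def masked_l1(pred, target, valid):
    """Mean absolute error over pixels where `valid` is True; 0.0 if none are."""
    valid = np.asarray(valid).astype(bool)
    if not valid.any():
        return 0.0
    return float(np.abs(np.asarray(pred) - np.asarray(target))[valid].mean())

def joint_loss(pred_lab, gt, gt_valid, pred_unlab, pseudo, pseudo_valid, weight=1.0):
    """Supervised term plus a weighted pseudo-label term.

    Hypothetical combination: `weight` balances labeled and unlabeled data,
    and `pseudo_valid` stands in for an offline mask of invalid regions.
    """
    supervised = masked_l1(pred_lab, gt, gt_valid)
    distilled = masked_l1(pred_unlab, pseudo, pseudo_valid)
    return supervised + weight * distilled
```

Masking before averaging keeps invalid regions from contributing to the pseudo-label term at all, rather than merely down-weighting them.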

Videos

Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation · SlidesLive

Taxonomy

Topics: Advanced Vision and Imaging · Industrial Vision Systems and Defect Detection · Image Processing Techniques and Applications