Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation
Ning-Hsu Wang, Yu-Lun Liu

TL;DR
This paper introduces a novel semi-supervised framework for 360-degree monocular depth estimation that leverages unlabeled data and perspective models to improve accuracy across diverse datasets and camera projections.
Contribution
It proposes a perspective distillation method using pseudo labels generated via cube projection, enabling effective training on unlabeled 360 images.
Findings
Significant accuracy improvements on Matterport3D and Stanford2D3D datasets.
Effective zero-shot depth estimation performance.
Versatile training pipeline adaptable to various 360 depth estimators.
Abstract
Accurately estimating depth in 360-degree imagery is crucial for virtual reality, autonomous navigation, and immersive media applications. Existing depth estimation methods designed for perspective-view imagery fail when applied to 360-degree images due to different camera projections and distortions, whereas 360-degree methods perform inferior due to the lack of labeled data pairs. We propose a new depth estimation framework that utilizes unlabeled 360-degree data effectively. Our approach uses state-of-the-art perspective depth estimation models as teacher models to generate pseudo labels through a six-face cube projection technique, enabling efficient labeling of depth in 360-degree images. This method leverages the increasing availability of large datasets. Our approach includes two main stages: offline mask generation for invalid regions and an online semi-supervised joint training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Industrial Vision Systems and Defect Detection · Image Processing Techniques and Applications
