Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations
Jingguo Liu, Yijun Xu, Shigang Li, Jianfeng Li

TL;DR
This paper introduces a novel teacher-student model that fuses equirectangular and spherical representations for monocular panoramic depth estimation, effectively addressing distortion and disconnectivity issues.
Contribution
It proposes a spherical convolution kernel and a segmentation feature fusion method, improving depth estimation accuracy over existing approaches.
Findings
Outperforms existing methods on benchmark datasets.
Effectively handles distortion in panoramic images.
Learns latent depth features via teacher-student model.
Abstract
Disconnectivity and distortion are the two problems which must be coped with when processing 360 degrees equirectangular images. In this paper, we propose a method of estimating the depth of monocular panoramic image with a teacher-student model fusing equirectangular and spherical representations. In contrast with the existing methods fusing an equirectangular representation with a cube map representation or tangent representation, a spherical representation is a better choice because a sampling on a sphere is more uniform and can also cope with distortion more effectively. In this processing, a novel spherical convolution kernel computing with sampling points on a sphere is developed to extract features from the spherical representation, and then, a Segmentation Feature Fusion(SFF) methodology is utilized to combine the features with ones extracted from the equirectangular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Image Processing and Photogrammetry · 3D Surveying and Cultural Heritage
MethodsConvolution
