Distortion-aware Monocular Depth Estimation for Omnidirectional Images
Hong-Xiang Chen, Kunhong Li, Zhiheng Fu, Mengyi Liu and, Zonghao Chen, Yulan Guo

TL;DR
This paper introduces a distortion-aware monocular depth estimation network for indoor panoramic images, effectively addressing geometric distortions and achieving state-of-the-art results on the 360D dataset.
Contribution
The work proposes a novel distortion-aware module with deformable convolution and strip pooling, along with a spherical-aware weight matrix, to improve depth estimation on distorted omnidirectional images.
Findings
Achieves state-of-the-art performance on 360D dataset.
Effectively extracts semantic features from distorted panoramas.
Reduces supervision bias caused by distortion.
Abstract
A main challenge for tasks on panorama lies in the distortion of objects among images. In this work, we propose a Distortion-Aware Monocular Omnidirectional (DAMO) dense depth estimation network to address this challenge on indoor panoramas with two steps. First, we introduce a distortion-aware module to extract calibrated semantic features from omnidirectional images. Specifically, we exploit deformable convolution to adjust its sampling grids to geometric variations of distorted objects on panoramas and then utilize a strip pooling module to sample against horizontal distortion introduced by inverse gnomonic projection. Second, we further introduce a plug-and-play spherical-aware weight matrix for our objective function to handle the uneven distribution of areas projected from a sphere. Experiments on the 360D dataset show that the proposed method can effectively extract semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsStrip Pooling · Convolution · Deformable Convolution
