ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation
Chuanqing Zhuang, Zhengda Lu, Yiqun Wang, Jun Xiao, Ying Wang

TL;DR
ACDNet introduces an adaptively combined dilated convolution approach with channel-wise attention for improved monocular panoramic depth estimation, effectively handling distortion and expanding receptive fields.
Contribution
The paper proposes a novel ACDNet that adaptively combines dilated convolutions with channel-wise attention to enhance depth estimation accuracy for panoramic images.
Findings
Outperforms current SOTA methods on three datasets
Effectively captures cross-channel contextual information
Handles distortion in equirectangular projection
Abstract
Depth estimation is a crucial step for 3D reconstruction with panorama images in recent years. Panorama images maintain the complete spatial information but introduce distortion with equirectangular projection. In this paper, we propose an ACDNet based on the adaptively combined dilated convolution to predict the dense depth map for a monocular panoramic image. Specifically, we combine the convolution kernels with different dilations to extend the receptive field in the equirectangular projection. Meanwhile, we introduce an adaptive channel-wise fusion module to summarize the feature maps and get diverse attention areas in the receptive field along the channels. Due to the utilization of channel-wise attention in constructing the adaptive channel-wise fusion module, the network can capture and leverage the cross-channel contextual information efficiently. Finally, we conduct depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
MethodsDilated Convolution · Convolution
