TL;DR
This paper introduces a neural contourlet network that combines spectral and spatial cues for improved monocular 360-degree depth estimation, effectively capturing geometric structures and outperforming existing methods.
Contribution
It proposes a novel neural network architecture integrating contourlet transform for explicit geometric cue extraction in 360-degree depth estimation.
Findings
Outperforms state-of-the-art methods on three datasets.
Achieves faster convergence in training.
Effectively fuses spectral and spatial information.
Abstract
For a monocular 360 image, depth estimation is a challenging because the distortion increases along the latitude. To perceive the distortion, existing methods devote to designing a deep and complex network architecture. In this paper, we provide a new perspective that constructs an interpretable and sparse representation for a 360 image. Considering the importance of the geometric structure in depth estimation, we utilize the contourlet transform to capture an explicit geometric cue in the spectral domain and integrate it with an implicit cue in the spatial domain. Specifically, we propose a neural contourlet network consisting of a convolutional neural network and a contourlet transform branch. In the encoder stage, we design a spatial-spectral fusion module to effectively fuse two types of cues. Contrary to the encoder, we employ the inverse contourlet transform with learned low-pass…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
