TL;DR
UniFuse introduces a unidirectional feature fusion method combining equirectangular and cubemap projections for panorama depth estimation, achieving state-of-the-art results with improved efficiency and generalization.
Contribution
The paper proposes a novel unidirectional fusion framework and an improved fusion module for panorama depth estimation, outperforming bidirectional methods in efficiency and accuracy.
Findings
Achieves state-of-the-art performance on four datasets.
More efficient than bidirectional fusion approaches.
Demonstrates better generalization and lower model complexity.
Abstract
Learning depth from spherical panoramas is becoming a popular research topic because a panorama has a full field-of-view of the environment and provides a relatively complete description of a scene. However, applying well-studied CNNs for perspective images to the standard representation of spherical panoramas, i.e., the equirectangular projection, is suboptimal, as it becomes distorted towards the poles. Another representation is the cubemap projection, which is distortion-free but discontinued on edges and limited in the field-of-view. This paper introduces a new framework to fuse features from the two projections, unidirectionally feeding the cubemap features to the equirectangular features only at the decoding stage. Unlike the recent bidirectional fusion approach operating at both the encoding and decoding stages, our fusion scheme is much more efficient. Besides, we also designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
