TL;DR
Sphere-Depth introduces a benchmark for evaluating the robustness of monocular depth estimation models on spherical images under camera pose variations, highlighting performance degradation and providing a standardized evaluation protocol.
Contribution
The paper presents a new benchmark, evaluation protocol, and dataset splits for assessing depth estimation models' robustness to pose variations in spherical images.
Findings
Models degrade significantly under pose perturbations.
Spherical-aware models still face performance drops with pose variations.
A depth calibration protocol improves metric depth evaluation.
Abstract
Reliable depth estimation from spherical images is crucial for 360{\deg} vision in robotic navigation and immersive scene understanding. However, the onboard spherical camera can experience unintentional pose variations in real-world robotic platforms that, along with the geometric distortions inherent in equirectangular projections, significantly impact the effectiveness of depth estimation. To study this issue, a novel public benchmark, called Sphere-Depth, is introduced to systematically evaluate the robustness of monocular depth estimation models from equirectangular images in a reproducible way. Camera pose perturbations are simulated and used to assess the performance of a popular perspective-based model, Depth Anything, and of spherical-aware models such as Depth Anywhere, ACDNet, Bifuse++, and SliceNet. Furthermore, to ensure meaningful evaluation across models, a depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
