Depth Anything in $360^\circ$: Towards Scale Invariance in the Wild

Hualie Jiang; Ziyang Song; Zhiqiang Lou; Rui Xu; Minglang Tan

arXiv:2512.22819·cs.CV·December 30, 2025

Depth Anything in $360^\circ$: Towards Scale Invariance in the Wild

Hualie Jiang, Ziyang Song, Zhiqiang Lou, Rui Xu, Minglang Tan

PDF

Open Access

TL;DR

This paper introduces DA360, a panoramic depth estimation model that achieves scale invariance and seamless spherical depth maps, significantly improving zero-shot outdoor and indoor depth estimation performance.

Contribution

We propose a novel scale-invariance learning approach for panoramic depth estimation and integrate circular padding to improve spatial coherence, advancing zero-shot generalization in open-world environments.

Findings

01

Over 50% error reduction on indoor benchmarks

02

Over 10% error reduction on outdoor datasets

03

30% improvement over PanDA in zero-shot panoramic depth estimation

Abstract

Panoramic depth estimation provides a comprehensive solution for capturing complete $36 0^{\circ}$ environmental structural information, offering significant benefits for robotics and AR/VR applications. However, while extensively studied in indoor settings, its zero-shot generalization to open-world domains lags far behind perspective images, which benefit from abundant training data. This disparity makes transferring capabilities from the perspective domain an attractive solution. To bridge this gap, we present Depth Anything in $36 0^{\circ}$ (DA360), a panoramic-adapted version of Depth Anything V2. Our key innovation involves learning a shift parameter from the ViT backbone, transforming the model's scale- and shift-invariant output into a scale-invariant estimate that directly yields well-formed 3D point clouds. This is complemented by integrating circular padding into the DPT decoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Neural Network Applications