UniDAC: Universal Metric Depth Estimation for Any Camera
Girish Chandar Ganesan, Yuliang Guo, Liu Ren, Xiaoming Liu

TL;DR
UniDAC is a universal monocular depth estimation framework that generalizes across diverse camera types using a single model, decoupling relative depth and scale estimation for robust performance.
Contribution
It introduces a novel decoupling approach and a lightweight scale estimation module, enabling universal robustness and cross-camera generalization in depth estimation.
Findings
Achieves state-of-the-art cross-camera generalization performance.
Outperforms prior methods across all evaluated datasets.
Uses a distortion-aware positional embedding for ERP cameras.
Abstract
Monocular metric depth estimation (MMDE) is a core challenge in computer vision, playing a pivotal role in real-world applications that demand accurate spatial understanding. Although prior works have shown promising zero-shot performance in MMDE, they often struggle with generalization across diverse camera types, such as fisheye and cameras. Recent advances have addressed this through unified camera representations or canonical representation spaces, but they require either including large-FoV camera data during training or separately trained models for different domains. We propose UniDAC, an MMDE framework that presents universal robustness in all domains and generalizes across diverse cameras using a single model. We achieve this by decoupling metric depth estimation into relative depth prediction and spatially varying scale estimation, enabling robust performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
