UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu,, Siyuan Li, Luc Van Gool, Fisher Yu

TL;DR
UniDepth is a universal monocular depth estimation model that predicts metric 3D scenes from single images across various domains without additional info, outperforming domain-specific methods in zero-shot tests.
Contribution
UniDepth introduces a novel approach with a self-promptable camera module and pseudo-spherical output, enabling robust zero-shot monocular depth estimation across diverse datasets.
Findings
Outperforms existing methods in zero-shot evaluations
Generalizes well across ten diverse datasets
Does not require domain-specific training data
Abstract
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepth, capable of reconstructing metric 3D scenes from solely single images across domains. Departing from the existing MMDE methods, UniDepth directly predicts metric 3D points from the input image at inference time without any additional information, striving for a universal and flexible MMDE solution. In particular, UniDepth implements a self-promptable camera module predicting dense camera representation to condition depth features. Our model exploits a pseudo-spherical output representation, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Industrial Vision Systems and Defect Detection · Optical measurement and interference techniques
