DiverseDepth: Affine-invariant Depth Prediction Using Diverse Data
Wei Yin, Xinlong Wang, Chunhua Shen, Yifan Liu, Zhi Tian, Songcen Xu,, Changming Sun, Dou Renyin

TL;DR
DiverseDepth introduces a monocular depth estimation method that predicts affine-invariant depth, enabling accurate 3D shape reconstruction across diverse scenes with strong generalization, leveraging a new large-scale dataset and multi-curriculum training.
Contribution
The paper presents a novel dataset and a depth prediction method that generalizes well across diverse scenes by predicting affine-invariant depth, unlike previous metric or relative depth approaches.
Findings
Outperforms previous methods on 8 datasets in zero-shot tests
Reconstructs high-quality 3D shapes from predicted depth
Demonstrates strong generalization to diverse scenes
Abstract
We present a method for depth estimation with monocular images, which can predict high-quality depth on diverse scenes up to an affine transformation, thus preserving accurate shapes of a scene. Previous methods that predict metric depth often work well only for a specific scene. In contrast, learning relative depth (information of being closer or further) can enjoy better generalization, with the price of failing to recover the accurate geometric shape of the scene. In this work, we propose a dataset and methods to tackle this dilemma, aiming to predict accurate depth up to an affine transformation with good generalization to diverse scenes. First we construct a large-scale and diverse dataset, termed Diverse Scene Depth dataset (DiverseDepth), which has a broad range of scenes and foreground contents. Compared with previous learning objectives, i.e., learning metric depth or relative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
MethodsTest
