DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui, Johannes Schusterbauer, Ulrich Prestel, Pingchuan, Ma, Dmytro Kotovenko, Olga Grebenkova, Stefan Andreas Baumann and, Vincent Tao Hu, Bj\"orn Ommer

TL;DR
DepthFM introduces a flow matching approach for monocular depth estimation that improves efficiency and performance by leveraging distribution transport, external knowledge, and synthetic data, achieving competitive zero-shot results.
Contribution
This paper pioneers the use of flow matching in monocular depth estimation, enhancing training and sampling efficiency while integrating external knowledge and synthetic data.
Findings
Achieves competitive zero-shot performance on natural scene benchmarks.
Improves sampling efficiency compared to existing generative methods.
Provides reliable depth confidence estimates.
Abstract
Current discriminative depth estimation methods often produce blurry artifacts, while generative approaches suffer from slow sampling due to curvatures in the noise-to-depth transport. Our method addresses these challenges by framing depth estimation as a direct transport between image and depth distributions. We are the first to explore flow matching in this field, and we demonstrate that its interpolation trajectories enhance both training and sampling efficiency while preserving high performance. While generative models typically require extensive training data, we mitigate this dependency by integrating external knowledge from a pre-trained image diffusion model, enabling effective transfer even across differing objectives. To further boost our model performance, we employ synthetic data and utilize image-depth pairs generated by a discriminative model on an in-the-wild image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Medical Image Segmentation Techniques
MethodsDiffusion
