The Midas Touch for Metric Depth

Yu Ma; Zizhan Guo; Zuyi Xiong; Haoran Zhang; Yi Feng; Hongbo Zhao; Hanli Wang; Rui Fan

arXiv:2605.11578·cs.CV·May 13, 2026

TL;DR

The paper introduces MTD, a method that converts relative depth to metric depth using sparse 3D data, improving accuracy, consistency, and efficiency for 3D applications.

Contribution

The paper presents a mathematically interpretable approach that combines segment-wise scale recovery via sparse graph optimization with pixel-wise refinement using a discontinuity-aware geodesic cost.

Findings

01. MTD achieves substantial accuracy improvements over previous depth completion and depth estimation methods.

02. It generalizes strongly across different scenes.

03. Its lightweight, plug-and-play design facilitates deployment in diverse 3D tasks.

Abstract

Recent advances have markedly improved the cross-scene generalization of relative depth estimation, yet its practical applicability remains limited by the absence of metric scale, local inconsistencies, and low computational efficiency. To address these issues, we present Midas Touch for Depth (MTD), a mathematically interpretable approach that converts relative depth into metric depth using only extremely sparse 3D data. To eliminate local scale inconsistencies, it applies a segment-wise recovery strategy via sparse graph optimization, followed by a pixel-wise refinement strategy using a discontinuity-aware geodesic cost. MTD exhibits strong generalization and achieves substantial accuracy improvements over previous depth completion and depth estimation methods. Moreover, its lightweight, plug-and-play design facilitates deployment and integration on…
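For orientation, the simplest way to attach metric scale to a relative depth map is a single global scale-and-shift fit against the sparse metric samples. The sketch below shows that common baseline only; it is not the MTD algorithm, whose segment-wise graph optimization and geodesic refinement are not specified in this listing. The function names `fit_scale_shift` and `to_metric` are hypothetical, and the code assumes the relative values are affinely related to metric depth.

```python
from typing import List, Sequence, Tuple


def fit_scale_shift(relative: Sequence[float],
                    metric: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of metric ~= scale * relative + shift.

    `relative` holds relative-depth values sampled at the sparse points,
    `metric` holds the known metric depths at the same points.
    """
    n = len(relative)
    if n != len(metric) or n < 2:
        raise ValueError("need at least two paired samples")

    mean_r = sum(relative) / n
    mean_m = sum(metric) / n

    # Closed-form 1D linear regression: scale = cov(r, m) / var(r).
    var_r = sum((r - mean_r) ** 2 for r in relative)
    if var_r == 0.0:
        raise ValueError("relative samples are constant; scale is undetermined")
    cov_rm = sum((r - mean_r) * (m - mean_m)
                 for r, m in zip(relative, metric))

    scale = cov_rm / var_r
    shift = mean_m - scale * mean_r
    return scale, shift


def to_metric(relative_map: Sequence[Sequence[float]],
              scale: float, shift: float) -> List[List[float]]:
    """Apply the fitted global transform to every pixel of a 2D depth map."""
    return [[scale * v + shift for v in row] for row in relative_map]
```

A single global transform like this cannot correct scale errors that vary from region to region, which is the local inconsistency the abstract says the segment-wise and pixel-wise stages are meant to remove.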

Code & Models

Repositories

https://mias.group/MTD
github