Distill Any Depth: Distillation Creates a Stronger Monocular Depth   Estimator

Xiankang He; Dongyan Guo; Hongji Li; Ruibo Li; Ying Cui; Chi Zhang

arXiv:2502.19204·cs.CV·April 22, 2025

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Xiankang He, Dongyan Guo, Hongji Li, Ruibo Li, Ying Cui, Chi Zhang

PDF

Open Access 1 Repo 4 Models

TL;DR

This paper improves zero-shot monocular depth estimation by analyzing depth normalization strategies, proposing a cross-context distillation method that combines global and local cues, and integrating diffusion-based priors for more robust supervision.

Contribution

It introduces a systematic analysis of depth normalization in pseudo-label distillation and proposes Cross-Context Distillation with diffusion-based priors to enhance depth estimation.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets.

02

Omitting normalization can reduce noise impact in recent distillation paradigms.

03

Combining global and local cues improves pseudo-label quality.

Abstract

Recent advances in zero-shot monocular depth estimation(MDE) have significantly improved generalization by unifying depth distributions through normalized depth representations and by leveraging large-scale unlabeled data via pseudo-label distillation. However, existing methods that rely on global depth normalization treat all depth values equally, which can amplify noise in pseudo-labels and reduce distillation effectiveness. In this paper, we present a systematic analysis of depth normalization strategies in the context of pseudo-label distillation. Our study shows that, under recent distillation paradigms (e.g., shared-context distillation), normalization is not always necessary, as omitting it can help mitigate the impact of noisy supervision. Furthermore, rather than focusing solely on how depth information is represented, we propose Cross-Context Distillation, which integrates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Westlake-AGI-Lab/Distill-Any-Depth
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition