Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

Runze Liu; Dongchen Zhu; Guanghui Zhang; Yue Xu; Wenjun Shi; Xiaolin Zhang; Lei Wang; and Jiamao Li

arXiv:2406.09782·cs.CV·October 29, 2025·1 cites

Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

Runze Liu, Dongchen Zhu, Guanghui Zhang, Yue Xu, Wenjun Shi, Xiaolin Zhang, Lei Wang, and Jiamao Li

PDF

Open Access

TL;DR

This paper introduces a robust unsupervised monocular depth estimation method using a hierarchical feature-guided diffusion model, improving depth learning and consistency in noisy real-world images.

Contribution

It proposes a novel hierarchical feature-guided denoising module and an implicit depth consistency loss within a diffusion-based framework for depth estimation.

Findings

01

Outperforms existing generative-based depth estimation models.

02

Demonstrates high robustness on diverse datasets.

03

Ensures scale consistency in video sequences.

Abstract

Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and inherent limitations of the camera. Therefore, it is particularly important to develop a robust depth estimation model. Benefiting from the training strategies of generative networks, generative-based methods often exhibit enhanced robustness. In light of this, we employ a well-converging diffusion model among generative networks for unsupervised monocular depth estimation. Additionally, we propose a hierarchical feature-guided denoising module. This model significantly enriches the model's capacity for learning and interpreting depth distribution by fully leveraging image features to guide the denoising process. Furthermore, we explore the implicit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Optical measurement and interference techniques · Advanced Vision and Imaging

MethodsDiffusion