Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation
Yifan Mao, Jian Liu, Xianming Liu

TL;DR
This paper presents Stealing Stable Diffusion (SSD), a novel method that uses synthetic data and self-training to improve monocular depth estimation in challenging conditions like low-light or rain, leveraging semantic priors for robustness.
Contribution
The paper introduces SSD, combining stable diffusion-generated synthetic images, self-training, and DINOv2 encoder integration to enhance depth estimation in adverse environments.
Findings
Improved depth estimation accuracy on nuScenes and Oxford RobotCar datasets.
Effective use of synthetic challenging-condition images for training.
Enhanced scene understanding through semantic priors.
Abstract
Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy conditions due to the absence of diverse training data. This paper introduces a novel approach named Stealing Stable Diffusion (SSD) prior for robust monocular depth estimation. The approach addresses this limitation by utilizing stable diffusion to generate synthetic images that mimic challenging conditions. Additionally, a self-training mechanism is introduced to enhance the model's depth estimation capability in such challenging environments. To enhance the utilization of the stable diffusion prior further, the DINOv2 encoder is integrated into the depth model architecture, enabling the model to leverage rich semantic priors and improve its scene…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Industrial Vision Systems and Defect Detection
MethodsDiffusion
