Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang, Jiaqi Li, Yachuan Huang, Yiran Wang, Jinghong Zheng,, Liao Shen, and Zhiguo Cao

TL;DR
This paper introduces a novel training framework for monocular depth estimation that improves robustness on non-Lambertian surfaces by guiding predictions in the gradient domain, employing lighting augmentation, and using a variational autoencoder-based fusion module.
Contribution
It proposes non-Lambertian surface regional guidance, lighting augmentation, and a variational autoencoder fusion module to enhance depth estimation on reflective surfaces.
Findings
33.39% accuracy improvement on Booster dataset
5.21% accuracy improvement on Mirror3D dataset
State-of-the-art 90.75 delta1.05 on TRICKY2024
Abstract
In the field of monocular depth estimation (MDE), many models with excellent zero-shot performance in general scenes emerge recently. However, these methods often fail in predicting non-Lambertian surfaces, such as transparent or mirror (ToM) surfaces, due to the unique reflective properties of these regions. Previous methods utilize externally provided ToM masks and aim to obtain correct depth maps through direct in-painting of RGB images. These methods highly depend on the accuracy of additional input masks, and the use of random colors during in-painting makes them insufficiently robust. We are committed to incrementally enabling the baseline model to directly learn the uniqueness of non-Lambertian surface regions for depth estimation through a well-designed training framework. Therefore, we propose non-Lambertian surface regional guidance, which constrains the predictions of MDE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications
MethodsSparse Evolutionary Training
