MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery

Yu-Wei Zhang; Tongju Han; Lipeng Gao; Mingqiang Wei; Hui Liu; Changbao Li; and Caiming Zhang

arXiv:2508.19555·cs.CV·August 28, 2025

MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery

Yu-Wei Zhang, Tongju Han, Lipeng Gao, Mingqiang Wei, Hui Liu, Changbao Li, and Caiming Zhang

PDF

TL;DR

MonoRelief V2 is an advanced model that leverages both synthetic and real data, including pseudo-real images generated by AI, to accurately recover 2.5D reliefs from single images with complex variations.

Contribution

It introduces a novel training approach combining pseudo-real and real data, significantly improving relief recovery accuracy over previous synthetic-only models.

Findings

01

Achieves state-of-the-art depth and normal prediction performance

02

Effectively utilizes pseudo-real images for training

03

Demonstrates robustness across complex material and illumination variations

Abstract

This paper presents MonoRelief V2, an end-to-end model designed for directly recovering 2.5D reliefs from single images under complex material and illumination variations. In contrast to its predecessor, MonoRelief V1 [1], which was solely trained on synthetic data, MonoRelief V2 incorporates real data to achieve improved robustness, accuracy and efficiency. To overcome the challenge of acquiring large-scale real-world dataset, we generate approximately 15,000 pseudo real images using a text-to-image generative model, and derive corresponding depth pseudo-labels through fusion of depth and normal predictions. Furthermore, we construct a small-scale real-world dataset (800 samples) via multi-view reconstruction and detail refinement. MonoRelief V2 is then progressively trained on the pseudo-real and real-world datasets. Comprehensive experiments demonstrate its state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.