DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance
Linxuan Xin, Zheng Zhang, Jinfu Wei, Wei Gao, Duan Gao

TL;DR
DreamPBR is a diffusion-based framework that generates high-resolution, diverse, and controllable spatially-varying PBR materials guided by text and multi-modal inputs, overcoming limitations of prior methods.
Contribution
It introduces a novel diffusion model with multi-modal guidance for high-quality, controllable SVBRDF generation, integrating vision-language models and material priors.
Findings
Supports tileable material generation.
Enables diverse and high-quality material creation.
Provides versatile control through multi-modal guidance.
Abstract
Prior material creation methods had limitations in producing diverse results mainly because reconstruction-based methods relied on real-world measurements and generation-based methods were trained on relatively small material datasets. To address these challenges, we propose DreamPBR, a novel diffusion-based generative framework designed to create spatially-varying appearance properties guided by text and multi-modal controls, providing high controllability and diversity in material generation. Key to achieving diverse and high-quality PBR material generation lies in integrating the capabilities of recent large-scale vision-language models trained on billions of text-image pairs, along with material priors derived from hundreds of PBR material samples. We utilize a novel material Latent Diffusion Model (LDM) to establish the mapping between albedo maps and the corresponding latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsLatent Diffusion Model · Convolution · Diffusion
