TL;DR
DealMaTe introduces a simplified diffusion framework for multi-dimensional material transfer that eliminates text guidance, reduces computational costs, and achieves high-fidelity results across diverse objects and lighting conditions.
Contribution
Proposes DealMaTe, a novel diffusion-based method using depth, normal, and lighting images for material transfer without text or reference networks.
Findings
Achieves high-fidelity material transfer across various objects and lighting conditions.
Reduces inference latency and computational costs with optimized attention mechanisms.
Demonstrates stable and harmonious results without modifying base model weights.
Abstract
Recently, diffusion-based material transfer methods rely on image fine-tuning or complex architectures with auxiliary networks but face challenges such as text dependency, additional computational costs, and feature misalignment. To address these limitations, we propose \textbf{DealMaTe}, using \underline{\textbf{de}}pth, norm\underline{\textbf{a}}l, and \underline{\textbf{l}}ighting images for \underline{\textbf{ma}}terial \underline{\textbf{t}}ransf\underline{\textbf{e}}r. DealMaTe is a simplified diffusion framework that eliminates text guidance and reference networks. We design a lightweight 3D information injection method, Multi-Dim 3D Shader LoRA, which, without modifying the base model weights, enables compatible control conditions and achieves harmonious and stable results. Additionally, we optimize the attention mechanism with Shader Causal Mutual Attention and key-value (KV)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
