VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu, Chaohui Yu, Chenjie Cao, Wen Qian, Fan Wang

TL;DR
VCD-Texture introduces a variance alignment framework for 3D-2D co-denoising in texture synthesis, effectively addressing modal gaps and improving high-fidelity texturing of 3D objects guided by text prompts.
Contribution
The paper proposes a novel variance alignment based 3D-2D collaborative denoising framework that unifies latent feature learning and addresses variance bias, advancing texture synthesis quality.
Findings
Outperforms existing methods in texture quality metrics.
Constructs a new benchmark dataset for texture synthesis evaluation.
Demonstrates superior results through comprehensive experiments.
Abstract
Recent research on texture synthesis for 3D shapes benefits a lot from dramatically developed 2D text-to-image diffusion models, including inpainting-based and optimization-based approaches. However, these methods ignore the modal gap between the 2D diffusion model and 3D objects, which primarily render 3D objects into 2D images and texture each image separately. In this paper, we revisit the texture synthesis and propose a Variance alignment based 3D-2D Collaborative Denoising framework, dubbed VCD-Texture, to address these issues. Formally, we first unify both 2D and 3D latent feature learning in diffusion self-attention modules with re-projected 3D attention receptive fields. Subsequently, the denoised multi-view 2D latent features are aggregated into 3D space and then rasterized back to formulate more consistent 2D predictions. However, the rasterization process suffers from an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Handwritten Text Recognition Techniques
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training · Diffusion · Inpainting
