CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
Hwan Heo, Jangyeong Kim, Seongyeong Lee, Jeong A Wi, Junyoung Choi,, Sangjun Ahn

TL;DR
CaPa is a novel two-stage framework that efficiently generates high-fidelity 3D textured meshes from multi-view inputs, addressing issues of inconsistency, slow speed, and low quality in existing methods.
Contribution
It introduces a carve-and-paint approach with a geometry generator and a novel texture synthesis method, achieving fast, high-quality 3D asset creation.
Findings
Generates 3D assets in under 30 seconds.
Produces 4K high-resolution textures.
Outperforms existing methods in fidelity and stability.
Abstract
The synthesis of high-quality 3D assets from textual or visual inputs has become a central objective in modern generative modeling. Despite the proliferation of 3D generation algorithms, they frequently grapple with challenges such as multi-view inconsistency, slow generation times, low fidelity, and surface reconstruction problems. While some studies have addressed some of these issues, a comprehensive solution remains elusive. In this paper, we introduce \textbf{CaPa}, a carve-and-paint framework that generates high-fidelity 3D assets efficiently. CaPa employs a two-stage process, decoupling geometry generation from texture synthesis. Initially, a 3D latent diffusion model generates geometry guided by multi-view inputs, ensuring structural consistency across perspectives. Subsequently, leveraging a novel, model-agnostic Spatially Decoupled Attention, the framework synthesizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Materials and Mechanics · Additive Manufacturing and 3D Printing Technologies · Interactive and Immersive Displays
MethodsSoftmax · Attention Is All You Need · Diffusion · Latent Diffusion Model · Inpainting
