MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model
Minghua Liu, Chong Zeng, Xinyue Wei, Ruoxi Shi, Linghao Chen, Chao Xu,, Mengqi Zhang, Zhaoning Wang, Xiaoshuai Zhang, Isabella Liu, Hongzhi Wu, Hao, Su

TL;DR
MeshFormer is a novel 3D reconstruction model that leverages explicit 3D structure, normal map guidance, and SDF supervision to efficiently generate high-quality textured meshes from sparse views, enabling fast single-image-to-3D and text-to-3D applications.
Contribution
The paper introduces MeshFormer, which combines 3D sparse voxels, transformers, and 3D convolutions with normal map guidance and SDF supervision for improved 3D mesh reconstruction.
Findings
Achieves high-quality textured meshes with fine details.
Enables fast single-image-to-3D and text-to-3D reconstruction.
Reduces training costs compared to previous methods.
Abstract
Open-world 3D reconstruction models have recently garnered significant attention. However, without sufficient 3D inductive bias, existing methods typically entail expensive training costs and struggle to extract high-quality 3D meshes. In this work, we introduce MeshFormer, a sparse-view reconstruction model that explicitly leverages 3D native structure, input guidance, and training supervision. Specifically, instead of using a triplane representation, we store features in 3D sparse voxels and combine transformers with 3D convolutions to leverage an explicit 3D structure and projective bias. In addition to sparse-view RGB input, we require the network to take input and generate corresponding normal maps. The input normal maps can be predicted by 2D diffusion models, significantly aiding in the guidance and refinement of the geometry's learning. Moreover, by combining Signed Distance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Computational Geometry and Mesh Generation
MethodsDiffusion
