Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation
Qiao Yu, Xianzhi Li, Yuan Tang, Xu Han, Long Hu, Yixue Hao, Min Chen

TL;DR
Fancy123 introduces a plug-and-play framework with enhancement modules and unprojection to improve the quality, fidelity, and consistency of 3D meshes generated from a single image, outperforming existing methods.
Contribution
It proposes novel enhancement modules and an unprojection operation that significantly improve 3D mesh quality from a single image, with seamless integration into existing methods.
Findings
Achieves state-of-the-art performance in 3D mesh generation from a single image.
Enhancement modules improve multiview consistency and fidelity to input images.
Unprojection discards blurry mesh colors, enhancing clarity.
Abstract
Generating 3D meshes from a single image is an important but ill-posed task. Existing methods mainly adopt 2D multiview diffusion models to generate intermediate multiview images, and use the Large Reconstruction Model (LRM) to create the final meshes. However, the multiview images exhibit local inconsistencies, and the meshes often lack fidelity to the input image or look blurry. We propose Fancy123, featuring two enhancement modules and an unprojection operation to address the above three issues, respectively. The appearance enhancement module deforms the 2D multiview images to realign misaligned pixels for better multiview consistency. The fidelity enhancement module deforms the 3D mesh to match the input image. The unprojection of the input image and deformed multiview images onto LRM's generated mesh ensures high clarity, discarding LRM's predicted blurry-looking mesh colors.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · 3D Surveying and Cultural Heritage
MethodsDiffusion · ADaptive gradient method with the OPTimal convergence rate
