Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image
Pufan Li, Bi'an Du, Wei Hu

TL;DR
This paper introduces a novel method for 3D object generation from a single image that combines geometry and perception priors, achieving better multiview consistency and detail without additional training.
Contribution
The proposed approach integrates geometry and perception priors into Gaussian-based 3D generation, enabling detailed and consistent 3D reconstructions from a single image without extra training.
Findings
Outperforms existing methods in novel view synthesis
Achieves robust multiview consistency
Produces detailed 3D reconstructions
Abstract
Generating realistic 3D objects from single-view images requires natural appearance, 3D consistency, and the ability to capture multiple plausible interpretations of unseen regions. Existing approaches often rely on fine-tuning pretrained 2D diffusion models or directly generating 3D information through fast network inference or 3D Gaussian Splatting, but their results generally suffer from poor multiview consistency and lack geometric detail. To tackle these issues, we present a novel method that seamlessly integrates geometry and perception information without requiring additional model training to reconstruct detailed 3D objects from a single image. Specifically, we incorporate geometry and perception priors to initialize the Gaussian branches and guide their parameter optimization. The geometry prior captures the rough 3D shapes, while the perception prior utilizes the 2D pretrained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Surveying and Cultural Heritage · Computer Graphics and Visualization Techniques
MethodsDiffusion
