Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image

Pufan Li; Bi'an Du; Wei Hu

arXiv:2506.21152·cs.CV·October 21, 2025

Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image

Pufan Li, Bi'an Du, Wei Hu

PDF

Open Access

TL;DR

This paper introduces a novel method for 3D object generation from a single image that combines geometry and perception priors, achieving better multiview consistency and detail without additional training.

Contribution

The proposed approach integrates geometry and perception priors into Gaussian-based 3D generation, enabling detailed and consistent 3D reconstructions from a single image without extra training.

Findings

01

Outperforms existing methods in novel view synthesis

02

Achieves robust multiview consistency

03

Produces detailed 3D reconstructions

Abstract

Generating realistic 3D objects from single-view images requires natural appearance, 3D consistency, and the ability to capture multiple plausible interpretations of unseen regions. Existing approaches often rely on fine-tuning pretrained 2D diffusion models or directly generating 3D information through fast network inference or 3D Gaussian Splatting, but their results generally suffer from poor multiview consistency and lack geometric detail. To tackle these issues, we present a novel method that seamlessly integrates geometry and perception information without requiring additional model training to reconstruct detailed 3D objects from a single image. Specifically, we incorporate geometry and perception priors to initialize the Gaussian branches and guide their parameter optimization. The geometry prior captures the rough 3D shapes, while the perception prior utilizes the 2D pretrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Surveying and Cultural Heritage · Computer Graphics and Visualization Techniques

MethodsDiffusion