F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting
Yuxin Wang, Qianyi Wu, Dan Xu

TL;DR
F3D-Gaus introduces a novel pipeline for 3D-aware image generation from monocular datasets, leveraging Gaussian Splatting and cycle-aggregative constraints to improve realism, consistency, and efficiency.
Contribution
The paper presents a new feed-forward method with cycle-aggregative Gaussian Splatting for monocular 3D-aware generation, enhancing quality and consistency over existing approaches.
Findings
Achieves high-quality, multi-view consistent 3D generation from monocular data.
Significantly improves training and inference efficiency.
Enhances fine detail rendering using geometry-aware refinement.
Abstract
This paper tackles the problem of generalizable 3D-aware generation from monocular datasets, e.g., ImageNet. The key challenge of this task is learning a robust 3D-aware representation without multi-view or dynamic data, while ensuring consistent texture and geometry across different viewpoints. Although some baseline methods are capable of 3D-aware generation, the quality of the generated images still lags behind state-of-the-art 2D generation approaches, which excel in producing high-quality, detailed images. To address this severe limitation, we propose a novel feed-forward pipeline based on pixel-aligned Gaussian Splatting, coined as F3D-Gaus, which can produce more realistic and reliable 3D renderings from monocular inputs. In addition, we introduce a self-supervised cycle-aggregative constraint to enforce cross-view consistency in the learned 3D representation. This training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
