RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting
Ruocheng Wu, Haolan He, Yufei Wang, Zhihao Li, Bihan Wen

TL;DR
RealisticDreamer introduces Guidance Score Distillation, leveraging pretrained Video Diffusion Models to improve 3D Gaussian Splatting from sparse views by incorporating depth and semantic guidance, leading to better multi-view consistency.
Contribution
The paper proposes Guidance Score Distillation, a novel framework that enhances 3D Gaussian Splatting with multi-view priors from VDMs and unified guidance to prevent overfitting in sparse view scenarios.
Findings
Outperforms existing methods on multiple datasets.
Improves multi-view consistency in sparse view settings.
Effectively incorporates depth and semantic guidance.
Abstract
3D Gaussian Splatting (3DGS) has recently gained great attention in the 3D scene representation for its high-quality real-time rendering capabilities. However, when the input comprises sparse training views, 3DGS is prone to overfitting, primarily due to the lack of intermediate-view supervision. Inspired by the recent success of Video Diffusion Models (VDM), we propose a framework called Guidance Score Distillation (GSD) to extract the rich multi-view consistency priors from pretrained VDMs. Building on the insights from Score Distillation Sampling (SDS), GSD supervises rendered images from multiple neighboring views, guiding the Gaussian splatting representation towards the generative direction of VDM. However, the generative direction often involves object motion and random camera trajectories, making it challenging for direct supervision in the optimization process. To address this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Video Coding and Compression Technologies · Generative Adversarial Networks and Image Synthesis
