LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors
Hanyang Yu, Xiaoxiao Long, Ping Tan

TL;DR
LM-Gaussian leverages large-scale vision model priors to enable high-quality 3D scene reconstruction from sparse images, reducing data needs and improving detail preservation.
Contribution
The paper introduces LM-Gaussian, a novel method that uses stereo, diffusion, and video priors to enhance sparse-view 3D Gaussian Splatting reconstructions.
Findings
Achieves high-quality 3D reconstructions with fewer images.
Outperforms existing methods on public datasets.
Preserves scene details effectively during sparse-view reconstruction.
Abstract
We aim to address sparse-view reconstruction of a 3D scene by leveraging priors from large-scale vision models. While recent advancements such as 3D Gaussian Splatting (3DGS) have demonstrated remarkable successes in 3D reconstruction, these methods typically necessitate hundreds of input images that densely capture the underlying scene, making them time-consuming and impractical for real-world applications. However, sparse-view reconstruction is inherently ill-posed and under-constrained, often resulting in inferior and incomplete outcomes. This is due to issues such as failed initialization, overfitting on input images, and a lack of details. To mitigate these challenges, we introduce LM-Gaussian, a method capable of generating high-quality reconstructions from a limited number of images. Specifically, we propose a robust initialization module that leverages stereo priors to aid in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy Techniques in Biomedical and Chemical Research · Video Surveillance and Tracking Methods · Image Enhancement Techniques
MethodsDiffusion
