MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction
Yaopeng Lou, Liao Shen, Tianqi Liu, Jiaqi Li, Zihao Huang, Huiqiang Sun, Zhiguo Cao

TL;DR
MuGS is a novel method for view synthesis that combines multi-view stereo and monocular depth cues with Gaussian representations, achieving state-of-the-art results across various scenarios and input configurations.
Contribution
It introduces a generalized Gaussian splatting approach integrating multi-view and monocular features, with a new depth fusion mechanism and reference-view loss for improved reconstruction.
Findings
State-of-the-art performance on DTU, RealEstate10K, LLFF, and Mip-NeRF 360 datasets.
Effective handling of diverse baseline settings, including sparse views and large baselines.
Fast training and inference with 3D Gaussian representations.
Abstract
We present Multi-Baseline Gaussian Splatting (MuGS), a generalized feed-forward approach for novel view synthesis that effectively handles diverse baseline settings, including sparse input views with both small and large baselines. Specifically, we integrate features from Multi-View Stereo (MVS) and Monocular Depth Estimation (MDE) to enhance feature representations for generalizable reconstruction. Next, We propose a projection-and-sampling mechanism for deep depth fusion, which constructs a fine probability volume to guide the regression of the feature map. Furthermore, We introduce a reference-view loss to improve geometry and optimization efficiency. We leverage 3D Gaussian representations to accelerate training and inference time while enhancing rendering quality. MuGS achieves state-of-the-art performance across multiple baseline settings and diverse scenarios ranging from simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
