MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
Yifan Liu, Keyu Fan, Weihao Yu, Chenxin Li, Hao Lu, Yixuan Yuan

TL;DR
MonoSplat introduces a novel framework that leverages monocular depth priors and multi-view feature fusion to improve the generalization and quality of 3D Gaussian splatting for real-time rendering.
Contribution
It presents MonoSplat, a new method combining monocular depth models with multi-view feature adaptation for robust 3D Gaussian reconstruction without scene-specific training.
Findings
Outperforms existing methods in reconstruction quality.
Demonstrates superior generalization on diverse datasets.
Maintains computational efficiency with minimal parameters.
Abstract
Recent advances in generalizable 3D Gaussian Splatting have demonstrated promising results in real-time high-fidelity rendering without per-scene optimization, yet existing approaches still struggle to handle unfamiliar visual content during inference on novel scenes due to limited generalizability. To address this challenge, we introduce MonoSplat, a novel framework that leverages rich visual priors from pre-trained monocular depth foundation models for robust Gaussian reconstruction. Our approach consists of two key components: a Mono-Multi Feature Adapter that transforms monocular features into multi-view representations, coupled with an Integrated Gaussian Prediction module that effectively fuses both feature types for precise Gaussian generation. Through the Adapter's lightweight attention mechanism, features are seamlessly aligned and aggregated across views while preserving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · Computational Geometry and Mesh Generation
MethodsSoftmax · Attention Is All You Need · Adapter
