GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
Chubin Zhang, Hongliang Song, Yi Wei, Yu Chen, Jiwen Lu, Yansong Tang

TL;DR
GeoLRM is a geometry-aware 3D reconstruction model that efficiently generates high-quality 3D assets with dense views using a novel transformer architecture and explicit geometric relationships, all within limited GPU memory.
Contribution
The paper introduces GeoLRM, a novel 3D-aware transformer that incorporates geometric relationships and deformable cross-attention for scalable, high-quality 3D Gaussian generation from limited images.
Findings
Outperforms existing models on dense view inputs
Efficiently generates 512k Gaussians with only 11 GB GPU memory
Demonstrates practical 3D generation applications
Abstract
In this work, we introduce the Geometry-Aware Large Reconstruction Model (GeoLRM), an approach which can predict high-quality assets with 512k Gaussians and 21 input images in only 11 GB GPU memory. Previous works neglect the inherent sparsity of 3D structure and do not utilize explicit geometric relationships between 3D and 2D images. This limits these methods to a low-resolution representation and makes it difficult to scale up to the dense views for better quality. GeoLRM tackles these issues by incorporating a novel 3D-aware transformer structure that directly processes 3D points and uses deformable cross-attention mechanisms to effectively integrate image features into 3D representations. We implement this solution through a two-stage pipeline: initially, a lightweight proposal network generates a sparse set of 3D anchor points from the posed image inputs; subsequently, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Cell Image Analysis Techniques
MethodsSparse Evolutionary Training
