GeoQuery: Geometry-Query Diffusion for Sparse-View Reconstruction
Xiao Cao, Yuze Li, Youmin Zhang, Jiayu Song, Cheng Yan, Wen Li, Lixin Duan

TL;DR
GeoQuery introduces a geometry-guided diffusion framework with a novel attention mechanism to improve sparse-view 3D reconstruction and view synthesis, addressing artifacts caused by corrupted features.
Contribution
It proposes a geometry-guided cross-view attention mechanism that enhances feature retrieval and robustness in sparse-view 3D reconstruction.
Findings
Effective in reducing artifacts in sparse-view synthesis
Improves robustness of 3D reconstruction under view sparsity
Seamlessly integrates into existing diffusion pipelines
Abstract
3D Gaussian Splatting (3DGS) has emerged as a prominent paradigm for 3D reconstruction and novel view synthesis. However, it remains vulnerable to severe artifacts when trained under sparse-view constraints. While recent methods attempt to rectify artifacts in rendered views using image diffusion models, they typically rely on multi-view self-attention to retrieve information from reference images. We observe that this mechanism often fails when the rendered novel views output by 3DGS are heavily corrupted: damaged query features lead to erroneous cross-view retrieval, resulting in inconsistent rendering refinement. To address this, we propose GeoQuery, a geometry-guided diffusion framework that integrates generative priors with explicit geometric cues via a novel Geometry-guided Cross-view Attention (GCA) mechanism. First, by leveraging predicted depth maps and camera poses, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
