H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

Heng Jia; Linchao Zhu; Na Zhao

arXiv:2508.03118·cs.CV·August 6, 2025

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

Heng Jia, Linchao Zhu, Na Zhao

PDF

Open Access

TL;DR

H3R introduces a hybrid multi-view correspondence framework combining volumetric latent fusion and attention mechanisms, significantly improving generalizable 3D reconstruction speed and accuracy across diverse datasets.

Contribution

The paper proposes a novel hybrid approach that integrates explicit geometric constraints with implicit attention-based refinement, enhancing generalization and convergence speed in 3D reconstruction.

Findings

01

Achieves state-of-the-art performance on multiple benchmarks.

02

Converges twice as fast as existing methods.

03

Demonstrates robustness with variable high-resolution views.

Abstract

Despite recent advances in feed-forward 3D Gaussian Splatting, generalizable 3D reconstruction remains challenging, particularly in multi-view correspondence modeling. Existing approaches face a fundamental trade-off: explicit methods achieve geometric precision but struggle with ambiguous regions, while implicit methods provide robustness but suffer from slow convergence. We present H3R, a hybrid framework that addresses this limitation by integrating volumetric latent fusion with attention-based feature aggregation. Our framework consists of two complementary components: an efficient latent volume that enforces geometric consistency through epipolar constraints, and a camera-aware Transformer that leverages Pl\"ucker coordinates for adaptive correspondence refinement. By integrating both paradigms, our approach enhances generalization while converging 2 $\times$ faster than existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Advanced Optical Sensing Technologies