PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence
Zequn Chen, Jiezhi Yang, Heng Yang

TL;DR
PreF3R is a real-time, pose-free 3D reconstruction method from image sequences that enables efficient novel-view rendering without camera calibration, using a dense Gaussian parameter prediction and a spatial memory network.
Contribution
It introduces a novel pose-free, feed-forward 3D reconstruction approach that extends pair-wise 3D structure reconstruction to sequential inputs with a spatial memory network.
Findings
Reconstructs 3D Gaussian field at 20 FPS in real-time.
Effective for pose-free, novel-view synthesis.
Shows robust generalization to unseen scenes.
Abstract
We present PreF3R, Pose-Free Feed-forward 3D Reconstruction from an image sequence of variable length. Unlike previous approaches, PreF3R removes the need for camera calibration and reconstructs the 3D Gaussian field within a canonical coordinate frame directly from a sequence of unposed images, enabling efficient novel-view rendering. We leverage DUSt3R's ability for pair-wise 3D structure reconstruction, and extend it to sequential multi-view input via a spatial memory network, eliminating the need for optimization-based global alignment. Additionally, PreF3R incorporates a dense Gaussian parameter prediction head, which enables subsequent novel-view synthesis with differentiable rasterization. This allows supervising our model with the combination of photometric loss and pointmap regression loss, enhancing both photorealism and structural accuracy. Given a sequence of ordered images,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Advanced Image Processing Techniques
