PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction

Muhua Zhu; Xinhao Jin; Chengbo Wang; Yongcong Zhang; Yifei Xue; Tie Ji; Yizhen Lao

arXiv:2508.04236·cs.CV·December 25, 2025

PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction

Muhua Zhu, Xinhao Jin, Chengbo Wang, Yongcong Zhang, Yifei Xue, Tie Ji, Yizhen Lao

PDF

TL;DR

PIS3R introduces a deep 3D reconstruction-based image stitching method that effectively handles very large parallax, producing geometrically accurate and seamless wide images suitable for 3D vision tasks.

Contribution

The paper presents a novel deep 3D reconstruction approach with a visual geometry transformer and point-conditioned diffusion for large parallax image stitching.

Findings

01

Outperforms existing methods in large parallax scenarios

02

Produces geometrically consistent stitched images

03

Enables direct application to 3D vision tasks

Abstract

Image stitching aim to align two images taken from different viewpoints into one seamless, wider image. However, when the 3D scene contains depth variations and the camera baseline is significant, noticeable parallax occurs-meaning the relative positions of scene elements differ substantially between views. Most existing stitching methods struggle to handle such images with large parallax effectively. To address this challenge, in this paper, we propose an image stitching solution called PIS3R that is robust to very large parallax based on the novel concept of deep 3D reconstruction. First, we apply visual geometry grounded transformer to two input images with very large parallax to obtain both intrinsic and extrinsic parameters, as well as the dense 3D scene reconstruction. Subsequently, we reproject reconstructed dense point cloud onto a designated reference view using the recovered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.