PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction
Jiahui Ren, Mochu Xiang, Jiajun Zhu, Yuchao Dai

TL;DR
PanoSplatt3R is a novel panorama reconstruction method that operates without pose information by leveraging perspective pretraining and a new rotary positional embedding technique, achieving superior results in view synthesis and depth estimation.
Contribution
It introduces a pose-free panorama reconstruction approach that adapts perspective pretraining to the panoramic domain with a novel RoPE rolling technique for improved generalization.
Findings
Outperforms state-of-the-art methods in novel view synthesis
Achieves high-quality depth estimation without pose data
Demonstrates strong generalization across benchmarks
Abstract
Wide-baseline panorama reconstruction has emerged as a highly effective and pivotal approach for not only achieving geometric reconstruction of the surrounding 3D environment, but also generating highly realistic and immersive novel views. Although existing methods have shown remarkable performance across various benchmarks, they are predominantly reliant on accurate pose information. In real-world scenarios, the acquisition of precise pose often requires additional computational resources and is highly susceptible to noise. These limitations hinder the broad applicability and practicality of such methods. In this paper, we present PanoSplatt3R, an unposed wide-baseline panorama reconstruction method. We extend and adapt the foundational reconstruction pretrainings from the perspective domain to the panoramic domain, thus enabling powerful generalization capabilities. To ensure a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
