PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences
Lanbo Xu, Liang Guo, Caigui Jiang, Cheng Wang

TL;DR
PAS3R is a novel pose-adaptive streaming framework for 3D scene reconstruction from long videos, dynamically weighting frames based on motion and scene change to improve accuracy and stability.
Contribution
It introduces a motion-aware update mechanism and trajectory-consistent training objectives for improved long-term 3D reconstruction stability.
Findings
Significantly improves trajectory accuracy in long sequences
Enhances depth estimation and point cloud quality
Maintains competitive performance on shorter sequences
Abstract
Online monocular 3D reconstruction enables dense scene recovery from streaming video but remains fundamentally limited by the stability-adaptation dilemma: the reconstruction model must rapidly incorporate novel viewpoints while preserving previously accumulated scene structure. Existing streaming approaches rely on uniform or attention-based update mechanisms that often fail to account for abrupt viewpoint transitions, leading to trajectory drift and geometric inconsistencies over long sequences. We introduce PAS3R, a pose-adaptive streaming reconstruction framework that dynamically modulates state updates according to camera motion and scene structure. Our key insight is that frames contributing significant geometric novelty should exert stronger influence on the reconstruction state, while frames with minor viewpoint variation should prioritize preserving historical context. PAS3R…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Stabilization · Advanced Vision and Imaging · Advanced Image Processing Techniques
