GaVS: 3D-Grounded Video Stabilization via Temporally-Consistent Local Reconstruction and Rendering

Zinuo You; Stamatios Georgoulis; Anpei Chen; Siyu Tang; Dengxin Dai

arXiv:2506.23957·cs.GR·July 21, 2025

GaVS: 3D-Grounded Video Stabilization via Temporally-Consistent Local Reconstruction and Rendering

Zinuo You, Stamatios Georgoulis, Anpei Chen, Siyu Tang, Dengxin Dai

PDF

TL;DR

GaVS introduces a 3D-grounded video stabilization method that achieves temporally-consistent local reconstructions and rendering, reducing distortions and cropping issues, and improving overall stabilization quality.

Contribution

The paper presents a novel 3D-grounded approach reformulating video stabilization as local reconstruction and rendering, with test-time finetuning for temporal consistency and scene extrapolation to avoid cropping.

Findings

01

Outperforms state-of-the-art methods in geometry consistency.

02

Produces higher quality stabilized videos with less distortion.

03

Validated by user study showing improved user experience.

Abstract

Video stabilization is pivotal for video processing, as it removes unwanted shakiness while preserving the original user motion intent. Existing approaches, depending on the domain they operate, suffer from several issues (e.g. geometric distortions, excessive cropping, poor generalization) that degrade the user experience. To address these issues, we introduce \textbf{GaVS}, a novel 3D-grounded approach that reformulates video stabilization as a temporally-consistent `local reconstruction and rendering' paradigm. Given 3D camera poses, we augment a reconstruction model to predict Gaussian Splatting primitives, and finetune it at test-time, with multi-view dynamics-aware photometric supervision and cross-frame regularization, to produce temporally-consistent local reconstructions. The model are then used to render each stabilized frame. We utilize a scene extrapolation module to avoid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.