StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional Warps
Lang Nie, Chunyu Lin, Kang Liao, Yun Zhang, Shuaicheng Liu, Yao Zhao

TL;DR
StabStitch++ introduces an unsupervised online video stitching framework that simultaneously achieves spatial alignment and temporal stabilization, effectively reducing warping shake and enhancing visual quality in real-time applications.
Contribution
The paper proposes a novel bidirectional warping approach with a differentiable decomposition module and a warp smoothing model for unsupervised, real-time video stitching with stabilization.
Findings
Outperforms existing methods in stitching quality and robustness.
Effectively reduces warping shake in stitched videos.
Enables real-time online video stitching with improved stability.
Abstract
We retarget video stitching to an emerging issue, named warping shake, which unveils the temporal content shakes induced by sequentially unsmooth warps when extending image stitching to video stitching. Even if the input videos are stable, the stitched video can inevitably cause undesired warping shakes and affect the visual experience. To address this issue, we propose StabStitch++, a novel video stitching framework to realize spatial stitching and temporal stabilization with unsupervised learning simultaneously. First, different from existing learning-based image stitching solutions that typically warp one image to align with another, we suppose a virtual midplane between original image planes and project them onto it. Concretely, we design a differentiable bidirectional decomposition module to disentangle the homography transformation and incorporate it into our spatial warp, evenly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
MethodsALIGN
