LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
Xijun Wang, Xin Li, Bingchen Li, Zhibo Chen

TL;DR
LiftVSR introduces an efficient hybrid temporal modeling framework for video super-resolution that achieves state-of-the-art results with significantly reduced computational resources, leveraging only four RTX 4090 GPUs.
Contribution
The paper proposes LiftVSR, a novel VSR framework combining dynamic temporal attention and attention memory cache for improved long-term coherence and efficiency.
Findings
State-of-the-art performance on VSR benchmarks
Significantly lower computational costs
Effective long-term temporal consistency
Abstract
Diffusion models have significantly advanced video super-resolution (VSR) by enhancing perceptual quality, largely through elaborately designed temporal modeling to ensure inter-frame consistency. However, existing methods usually suffer from limited temporal coherence and prohibitively high computational costs (e.g., typically requiring over 8 NVIDIA A100-80G GPUs), especially for long videos. In this work, we propose LiftVSR, an efficient VSR framework that leverages and elevates the image-wise diffusion prior from PixArt-$\alpha$, achieving state-of-the-art results using only 4$\times$RTX 4090 GPUs. To balance long-term consistency and efficiency, we introduce a hybrid temporal modeling mechanism that decomposes temporal learning into two complementary components: (i) Dynamic Temporal Attention (DTA) for fine-grained temporal modeling within short frame segment (, low…
Taxonomy
Topics: Advanced Image Processing Techniques · Image and Video Quality Assessment · Generative Adversarial Networks and Image Synthesis
Methods: Softmax · Attention Is All You Need · Diffusion
