Fast Online Video Super-Resolution with Deformable Attention Pyramid
Dario Fuoli, Martin Danelljan, Radu Timofte, Luc Van Gool

TL;DR
This paper introduces a fast, efficient online video super-resolution method using a deformable attention pyramid, achieving real-time performance with high accuracy by dynamically focusing on relevant spatial locations.
Contribution
The work presents a novel recurrent VSR architecture with a deformable attention pyramid that reduces computational cost while maintaining high super-resolution quality.
Findings
Outperforms EDVR-M on standard benchmarks
Achieves over 3x speed-up compared to state-of-the-art methods
Reduces processing time and computational complexity
Abstract
Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient, yet effective frame alignment and fusion modules remain central problems. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Fast Online Video Super-Resolution with Deformable Attention Pyramid· youtube
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging
