RealViformer: Investigating Attention for Real-World Video Super-Resolution
Yuehan Zhang, Angela Yao

TL;DR
RealViformer introduces a novel channel-attention-based framework for real-world video super-resolution that effectively handles artifacts, outperforming existing methods in accuracy and efficiency.
Contribution
The paper proposes RealViformer, a new VSR model leveraging channel attention and covariance-based techniques to improve artifact robustness and computational efficiency.
Findings
Outperforms state-of-the-art on real-world VSR datasets
Uses fewer parameters and has faster runtimes
Channel attention is less sensitive to artifacts than spatial attention
Abstract
In real-world video super-resolution (VSR), videos suffer from in-the-wild degradations and artifacts. VSR methods, especially recurrent ones, tend to propagate artifacts over time in the real-world setting and are more vulnerable than image super-resolution. This paper investigates the influence of artifacts on commonly used covariance-based attention mechanisms in VSR. Comparing the widely-used spatial attention, which computes covariance over space, versus the channel attention, we observe that the latter is less sensitive to artifacts. However, channel attention leads to feature redundancy, as evidenced by the higher covariance among output channels. As such, we explore simple techniques such as the squeeze-excite mechanism and covariance-based rescaling to counter the effects of high channel covariance. Based on our findings, we propose RealViformer. This channel-attention-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Image and Signal Denoising Methods
MethodsSoftmax · Attention Is All You Need
