Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
Zhongwei Qiu, Huan Yang, Jianlong Fu, Dongmei Fu

TL;DR
This paper introduces a Frequency-Transformer model for compressed video super-resolution that leverages joint space-time-frequency self-attention on spectral maps to effectively restore high-quality frames from degraded compressed videos.
Contribution
The novel Frequency-Transformer conducts fine-grained self-attention in the spectral domain, improving texture extraction and transfer in compressed video super-resolution tasks.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Effective separation of textures from artifacts in spectral domain.
Divided attention scheme yields best enhancement results.
Abstract
Compressed video super-resolution (VSR) aims to restore high-resolution frames from compressed low-resolution counterparts. Most recent VSR approaches often enhance an input frame by borrowing relevant textures from neighboring video frames. Although some progress has been made, there are grand challenges to effectively extract and transfer high-quality textures from compressed videos where most frames are usually highly degraded. In this paper, we propose a novel Frequency-Transformer for compressed video super-resolution (FTVSR) that conducts self-attention over a joint space-time-frequency domain. First, we divide a video frame into patches, and transform each patch into DCT spectral maps in which each channel represents a frequency band. Such a design enables a fine-grained level self-attention on each frequency band, so that real visual texture can be distinguished from artifacts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Image and Signal Denoising Methods
