Flow-Guided Sparse Transformer for Video Deblurring

Jing Lin; Yuanhao Cai; Xiaowan Hu; Haoqian Wang; Youliang Yan; Xueyi; Zou; Henghui Ding; Yulun Zhang; Radu Timofte; Luc Van Gool

arXiv:2201.01893·eess.IV·May 31, 2022·22 cites

Flow-Guided Sparse Transformer for Video Deblurring

Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi, Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Flow-Guided Sparse Transformer framework for video deblurring that effectively captures long-range dependencies and non-local self-similarity, outperforming existing methods on standard datasets.

Contribution

The novel FGST framework with a flow-guided sparse self-attention module and recurrent embedding mechanism enhances long-range temporal modeling in video deblurring.

Findings

01

Outperforms state-of-the-art methods on DVD and GOPRO datasets.

02

Produces more visually pleasing results in real video deblurring.

03

Effectively captures long-range dependencies and non-local self-similarity.

Abstract

Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring. However, CNN-based methods show limitations in capturing long-range dependencies and modeling non-local self-similarity. In this paper, we propose a novel framework, Flow-Guided Sparse Transformer (FGST), for video deblurring. In FGST, we customize a self-attention module, Flow-Guided Sparse Window-based Multi-head Self-Attention (FGSW-MSA). For each $q u er y$ element on the blurry reference frame, FGSW-MSA enjoys the guidance of the estimated optical flow to globally sample spatially sparse yet highly related $k ey$ elements corresponding to the same scene patch in neighboring frames. Besides, we present a Recurrent Embedding (RE) mechanism to transfer information from past frames and strengthen long-range temporal dependencies. Comprehensive experiments demonstrate that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

linjing7/VR-Baseline
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis

MethodsAttention Is All You Need · Linear Layer · Cosine Annealing · Weight Decay · Absolute Position Encodings · Linear Warmup With Cosine Annealing · Residual Connection · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam