Flow-Guided Sparse Transformer for Video Deblurring
Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi, Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool

TL;DR
This paper introduces a Flow-Guided Sparse Transformer framework for video deblurring that effectively captures long-range dependencies and non-local self-similarity, outperforming existing methods on standard datasets.
Contribution
The novel FGST framework with a flow-guided sparse self-attention module and recurrent embedding mechanism enhances long-range temporal modeling in video deblurring.
Findings
Outperforms state-of-the-art methods on DVD and GOPRO datasets.
Produces more visually pleasing results in real video deblurring.
Effectively captures long-range dependencies and non-local self-similarity.
Abstract
Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring. However, CNN-based methods show limitations in capturing long-range dependencies and modeling non-local self-similarity. In this paper, we propose a novel framework, Flow-Guided Sparse Transformer (FGST), for video deblurring. In FGST, we customize a self-attention module, Flow-Guided Sparse Window-based Multi-head Self-Attention (FGSW-MSA). For each element on the blurry reference frame, FGSW-MSA enjoys the guidance of the estimated optical flow to globally sample spatially sparse yet highly related elements corresponding to the same scene patch in neighboring frames. Besides, we present a Recurrent Embedding (RE) mechanism to transfer information from past frames and strengthen long-range temporal dependencies. Comprehensive experiments demonstrate that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
MethodsAttention Is All You Need · Linear Layer · Cosine Annealing · Weight Decay · Absolute Position Encodings · Linear Warmup With Cosine Annealing · Residual Connection · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam
