Aggregating Nearest Sharp Features via Hybrid Transformers for Video Deblurring
Wei Shang, Dongwei Ren, Yi Yang, Wangmeng Zuo

TL;DR
This paper introduces a hybrid Transformer-based approach for video deblurring that leverages both neighboring frames and interspersed sharp frames, improving restoration quality in real-world scenarios.
Contribution
It proposes a novel method combining local and global Transformers for feature aggregation, utilizing sharp frame detection and extending to event-driven deblurring.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Effective in real-world scenarios with interspersed sharp frames
Extensible to event-driven video deblurring
Abstract
Video deblurring methods, aiming at recovering consecutive sharp frames from a given blurry video, usually assume that the input video suffers from consecutively blurry frames. However, in real-world scenarios captured by modern imaging devices, sharp frames often interspersed within the video, providing temporally nearest sharp features that can aid in the restoration of blurry frames. In this work, we propose a video deblurring method that leverages both neighboring frames and existing sharp frames using hybrid Transformers for feature aggregation. Specifically, we first train a blur-aware detector to distinguish between sharp and blurry frames. Then, a window-based local Transformer is employed for exploiting features from neighboring frames, where cross attention is beneficial for aggregating features from neighboring frames without explicit spatial alignment. To aggregate nearest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Image and Signal Denoising Methods
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Layer Normalization
