Aggregating Nearest Sharp Features via Hybrid Transformers for Video Deblurring

Wei Shang; Dongwei Ren; Yi Yang; Wangmeng Zuo

arXiv:2309.07054 · cs.CV · December 2, 2024 · 1 citation

TL;DR

This paper introduces a hybrid Transformer-based approach for video deblurring that leverages both neighboring frames and interspersed sharp frames, improving restoration quality in real-world scenarios.

Contribution

It proposes a novel method combining local and global Transformers for feature aggregation, utilizing sharp frame detection and extending to event-driven deblurring.

Findings

01 Outperforms state-of-the-art methods on benchmark datasets

02 Effective in real-world scenarios with interspersed sharp frames

03 Extensible to event-driven video deblurring

Abstract

Video deblurring methods, aiming at recovering consecutive sharp frames from a given blurry video, usually assume that every frame of the input video is blurry. However, in real-world scenarios captured by modern imaging devices, sharp frames are often interspersed within the video, providing temporally nearest sharp features that can aid in the restoration of blurry frames. In this work, we propose a video deblurring method that leverages both neighboring frames and existing sharp frames using hybrid Transformers for feature aggregation. Specifically, we first train a blur-aware detector to distinguish between sharp and blurry frames. Then, a window-based local Transformer is employed for exploiting features from neighboring frames, where cross attention is beneficial for aggregating features from neighboring frames without explicit spatial alignment. To aggregate nearest…
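The idea of "temporally nearest sharp" frames can be illustrated with a small lookup. This is only a sketch under stated assumptions, not the authors' implementation: it assumes a per-frame boolean label is already available (for example, a thresholded output of a blur-aware detector), and the function name `nearest_sharp_frames` and the `is_sharp` list are hypothetical. It shows only how the nearest sharp frame on each side of a frame could be located; the feature aggregation itself is not covered.

```python
from typing import List, Optional, Tuple


def nearest_sharp_frames(
    is_sharp: List[bool],
) -> List[Tuple[Optional[int], Optional[int]]]:
    """For every frame, return (previous_sharp_index, next_sharp_index).

    `is_sharp[i]` is a hypothetical per-frame label (assumed to come from
    some sharp/blurry detector). None means no sharp frame exists on that
    side. A frame is never reported as its own neighbor.
    """
    n = len(is_sharp)
    prev_sharp: List[Optional[int]] = [None] * n
    next_sharp: List[Optional[int]] = [None] * n

    # Forward pass: remember the most recent sharp frame seen so far.
    last = None
    for i in range(n):
        prev_sharp[i] = last
        if is_sharp[i]:
            last = i

    # Backward pass: remember the closest sharp frame still ahead.
    nxt = None
    for i in range(n - 1, -1, -1):
        next_sharp[i] = nxt
        if is_sharp[i]:
            nxt = i

    return list(zip(prev_sharp, next_sharp))


# Toy sequence: frames 0 and 3 are sharp, the rest are blurry.
labels = [True, False, False, True, False]
refs = nearest_sharp_frames(labels)
for idx, (before, after) in enumerate(refs):
    print(idx, before, after)
```

Two linear passes keep the lookup at O(n) in the number of frames, so it would add negligible cost in front of any restoration network.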

Code & Models

Repositories

shangwei5/stgtn (PyTorch, official)

Taxonomy

Topics: Advanced Image Processing Techniques · Image Processing Techniques and Applications · Image and Signal Denoising Methods

Methods: Multi-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Layer Normalization