Pay Attention to Hidden States for Video Deblurring: Ping-Pong Recurrent Neural Networks and Selective Non-Local Attention
JoonKyu Park, Seungjun Nah, Kyoung Mu Lee

TL;DR
This paper introduces a novel recurrent neural network architecture with specialized modules to enhance video deblurring by effectively utilizing hidden states and aligning features, achieving state-of-the-art results.
Contribution
The paper proposes Ping-Pong RNN and Selective Non-Local Attention modules to improve hidden state utilization and feature alignment in video deblurring models.
Findings
Achieves state-of-the-art performance on standard benchmarks.
Effectively handles strong motion blur in real-world videos.
Enhances RNN representation power with new modules.
Abstract
Video deblurring models exploit information in the neighboring frames to remove blur caused by the motion of the camera and the objects. Recurrent Neural Networks~(RNNs) are often adopted to model the temporal dependency between frames via hidden states. When motion blur is strong, however, hidden states are hard to deliver proper information due to the displacement between different frames. While there have been attempts to update the hidden states, it is difficult to handle misaligned features beyond the receptive field of simple modules. Thus, we propose 2 modules to supplement the RNN architecture for video deblurring. First, we design Ping-Pong RNN~(PPRNN) that acts on updating the hidden states by referring to the features from the current and the previous time steps alternately. PPRNN gathers relevant information from the both features in an iterative and balanced manner by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Digital Media Forensic Detection
