Pyramid Feature Alignment Network for Video Deblurring

Leitian Tao; Zhenzhong Chen

arXiv:2203.14556·cs.CV·March 29, 2022

Pyramid Feature Alignment Network for Video Deblurring

Leitian Tao, Zhenzhong Chen

PDF

Open Access

TL;DR

The paper introduces PFAN, a novel pyramid feature alignment network that improves video deblurring by multi-scale feature extraction and cascade guided deformable alignment, effectively handling complex motions and misalignments.

Contribution

The paper presents a multi-scale feature extraction strategy with Structure-to-Detail Downsampling and a cascade guided deformable alignment for more accurate video deblurring.

Findings

01

PFAN outperforms state-of-the-art methods in accuracy.

02

The multi-scale approach effectively handles complex motions.

03

The method achieves competitive speed with superior performance.

Abstract

Video deblurring remains a challenging task due to various causes of blurring. Traditional methods have considered how to utilize neighboring frames by the single-scale alignment for restoration. However, they typically suffer from misalignment caused by severe blur. In this work, we aim to better utilize neighboring frames with high efficient feature alignment. We propose a Pyramid Feature Alignment Network (PFAN) for video deblurring. First, the multi-scale feature of blurry frames is extracted with the strategy of Structure-to-Detail Downsampling (SDD) before alignment. This downsampling strategy makes the edges sharper, which is helpful for alignment. Then we align the feature at each scale and reconstruct the image at the corresponding scale. This strategy effectively supervises the alignment at each scale, overcoming the problem of propagated errors from the above scales at the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Deformable Convolution · Convolution · ALIGN