Pyramid Feature Alignment Network for Video Deblurring
Leitian Tao, Zhenzhong Chen

TL;DR
The paper introduces PFAN, a novel pyramid feature alignment network that improves video deblurring by multi-scale feature extraction and cascade guided deformable alignment, effectively handling complex motions and misalignments.
Contribution
The paper presents a multi-scale feature extraction strategy with Structure-to-Detail Downsampling and a cascade guided deformable alignment for more accurate video deblurring.
Findings
PFAN outperforms state-of-the-art methods in accuracy.
The multi-scale approach effectively handles complex motions.
The method achieves competitive speed with superior performance.
Abstract
Video deblurring remains a challenging task due to various causes of blurring. Traditional methods have considered how to utilize neighboring frames by the single-scale alignment for restoration. However, they typically suffer from misalignment caused by severe blur. In this work, we aim to better utilize neighboring frames with high efficient feature alignment. We propose a Pyramid Feature Alignment Network (PFAN) for video deblurring. First, the multi-scale feature of blurry frames is extracted with the strategy of Structure-to-Detail Downsampling (SDD) before alignment. This downsampling strategy makes the edges sharper, which is helpful for alignment. Then we align the feature at each scale and reconstruct the image at the corresponding scale. This strategy effectively supervises the alignment at each scale, overcoming the problem of propagated errors from the above scales at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Deformable Convolution · Convolution · ALIGN
