TL;DR
This paper introduces a novel spatio-temporal learning framework using a modified 3D convolutional network within a GAN architecture to effectively deblur videos affected by camera or object motion, achieving state-of-the-art results.
Contribution
The paper proposes DBLRNet with modified 3D convolution for joint spatial-temporal learning and integrates it into a GAN framework with content loss for improved video deblurring.
Findings
Achieves state-of-the-art performance on standard benchmarks.
Effectively models spatio-temporal features for deblurring.
Outperforms existing methods in video deblurring accuracy.
Abstract
Camera shake or target movement often leads to undesired blur effects in videos captured by a hand-held camera. Despite significant efforts having been devoted to video-deblur research, two major challenges remain: 1) how to model the spatio-temporal characteristics across both the spatial domain (i.e., image plane) and temporal domain (i.e., neighboring frames), and 2) how to restore sharp image details w.r.t. the conventionally adopted metric of pixel-wise errors. In this paper, to address the first challenge, we propose a DeBLuRring Network (DBLRNet) for spatial-temporal learning by applying a modified 3D convolution to both spatial and temporal domains. Our DBLRNet is able to capture jointly spatial and temporal information encoded in neighboring frames, which directly contributes to improved video deblur performance. To tackle the second challenge, we leverage the developed DBLRNet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
