TL;DR
This paper introduces a multi-stage learning-based method for raw video denoising under low light, utilizing explicit frame alignment, fusion, and adversarial training with gradient masks to produce coherent, high-quality videos.
Contribution
It proposes a novel multi-stage framework combining alignment, fusion, and adversarial loss with gradient masks for improved raw video denoising.
Findings
Outperforms state-of-the-art denoising methods in quality.
Produces temporally coherent and realistic videos.
Effectively utilizes neighboring frames without direct alignment of distant frames.
Abstract
In this paper, we propose a learning-based approach for denoising raw videos captured under low lighting conditions. We propose to do this by first explicitly aligning the neighboring frames to the current frame using a convolutional neural network (CNN). We then fuse the registered frames using another CNN to obtain the final denoised frame. To avoid directly aligning the temporally distant frames, we perform the two processes of alignment and fusion in multiple stages. Specifically, at each stage, we perform the denoising process on three consecutive input frames to generate the intermediate denoised frames which are then passed as the input to the next stage. By performing the process in multiple stages, we can effectively utilize the information of neighboring frames without directly aligning the temporally distant frames. We train our multi-stage system using an adversarial loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGAN Hinge Loss
