Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel Transformer
Wulian Yun, Mengshi Qi, Chuanming Wang, Huiyuan Fu, Huadong Ma

TL;DR
This paper introduces a dual-stage transformer-based approach for video denoising that captures long-range dependencies and refines results through coarse-to-fine processing, outperforming existing CNN-based methods.
Contribution
The paper proposes a novel dual-stage transformer architecture with spatial-channel encoding and multi-scale residuals for improved video denoising.
Findings
Significant performance improvements over state-of-the-art methods.
Effective modeling of long-range spatial and channel dependencies.
Robust results across multiple datasets.
Abstract
Video denoising aims to recover high-quality frames from the noisy video. While most existing approaches adopt convolutional neural networks~(CNNs) to separate the noise from the original visual content, however, CNNs focus on local information and ignore the interactions between long-range regions in the frame. Furthermore, most related works directly take the output after basic spatio-temporal denoising as the final result, leading to neglect the fine-grained denoising process. In this paper, we propose a Dual-stage Spatial-Channel Transformer for coarse-to-fine video denoising, which inherits the advantages of both Transformer and CNNs. Specifically, DSCT is proposed based on a progressive dual-stage architecture, namely a coarse-level and a fine-level stage to extract dynamic features and static features, respectively. At both stages, a Spatial-Channel Encoding Module is designed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Image Processing Techniques and Applications · Sparse and Compressive Sensing Techniques
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Absolute Position Encodings · Label Smoothing · Multi-Head Attention · Layer Normalization · Softmax · Dense Connections · Dropout
