Towards an Effective and Efficient Transformer for Rain-by-snow Weather Removal
Tao Gao, Yuanbo Wen, Kaihao Zhang, Peng Cheng, and Ting Chen

TL;DR
RSFormer is a novel Transformer-based model designed for rain-by-snow weather removal, combining convolutional and attention mechanisms to achieve efficient and effective image restoration with new datasets and superior performance.
Contribution
The paper introduces RSFormer, a Transformer-like convolutional network with a global-local self-attention sampling mechanism, and provides two new datasets for rain-by-snow removal.
Findings
RSFormer outperforms existing methods in accuracy and efficiency.
It reduces parameters and inference time compared to Restormer.
Extensive experiments validate its effectiveness on new datasets.
Abstract
Rain-by-snow weather removal is a specialized task in weather-degraded image restoration aiming to eliminate coexisting rain streaks and snow particles. In this paper, we propose RSFormer, an efficient and effective Transformer that addresses this challenge. Initially, we explore the proximity of convolution networks (ConvNets) and vision Transformers (ViTs) in hierarchical architectures and experimentally find they perform approximately at intra-stage feature learning. On this basis, we utilize a Transformer-like convolution block (TCB) that replaces the computationally expensive self-attention while preserving attention characteristics for adapting to input content. We also demonstrate that cross-stage progression is critical for performance improvement, and propose a global-local self-attention sampling mechanism (GLASM) that down-/up-samples features while capturing both global and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Advanced Image Fusion Techniques · Advanced Vision and Imaging
MethodsMulti-Head Attention · Dense Connections · Label Smoothing · Convolution · Adam · Softmax · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Residual Connection
