Towards an Effective and Efficient Transformer for Rain-by-snow Weather   Removal

Tao Gao; Yuanbo Wen; Kaihao Zhang; Peng Cheng; and Ting Chen

arXiv:2304.02860·cs.CV·October 30, 2023·1 cites

Towards an Effective and Efficient Transformer for Rain-by-snow Weather Removal

Tao Gao, Yuanbo Wen, Kaihao Zhang, Peng Cheng, and Ting Chen

PDF

Open Access 1 Repo

TL;DR

RSFormer is a novel Transformer-based model designed for rain-by-snow weather removal, combining convolutional and attention mechanisms to achieve efficient and effective image restoration with new datasets and superior performance.

Contribution

The paper introduces RSFormer, a Transformer-like convolutional network with a global-local self-attention sampling mechanism, and provides two new datasets for rain-by-snow removal.

Findings

01

RSFormer outperforms existing methods in accuracy and efficiency.

02

It reduces parameters and inference time compared to Restormer.

03

Extensive experiments validate its effectiveness on new datasets.

Abstract

Rain-by-snow weather removal is a specialized task in weather-degraded image restoration aiming to eliminate coexisting rain streaks and snow particles. In this paper, we propose RSFormer, an efficient and effective Transformer that addresses this challenge. Initially, we explore the proximity of convolution networks (ConvNets) and vision Transformers (ViTs) in hierarchical architectures and experimentally find they perform approximately at intra-stage feature learning. On this basis, we utilize a Transformer-like convolution block (TCB) that replaces the computationally expensive self-attention while preserving attention characteristics for adapting to input content. We also demonstrate that cross-stage progression is critical for performance improvement, and propose a global-local self-attention sampling mechanism (GLASM) that down-/up-samples features while capturing both global and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chdwyb/rsformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Advanced Image Fusion Techniques · Advanced Vision and Imaging

MethodsMulti-Head Attention · Dense Connections · Label Smoothing · Convolution · Adam · Softmax · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Residual Connection