TL;DR
RainMamba introduces a novel SSMs-based video deraining network utilizing a Hilbert scanning mechanism and contrastive locality learning, achieving efficient and effective rain removal in videos with improved local and temporal feature modeling.
Contribution
The paper proposes an improved SSMs-based network with a Hilbert scanning mechanism and contrastive locality learning for better local and temporal feature extraction in video deraining.
Findings
Effective rain streak and raindrop removal demonstrated on multiple datasets.
Outperforms existing methods in efficiency and accuracy.
Code and results publicly available for reproducibility.
Abstract
The outdoor vision systems are frequently contaminated by rain streaks and raindrops, which significantly degenerate the performance of visual tasks and multimedia applications. The nature of videos exhibits redundant temporal cues for rain removal with higher stability. Traditional video deraining methods heavily rely on optical flow estimation and kernel-based manners, which have a limited receptive field. Yet, transformer architectures, while enabling long-term dependencies, bring about a significant increase in computational complexity. Recently, the linear-complexity operator of the state space models (SSMs) has contrarily facilitated efficient long-term temporal modeling, which is crucial for rain streaks and raindrops removal in videos. Unexpectedly, its uni-dimensional sequential process on videos destroys the local correlations across the spatio-temporal dimension by distancing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
