Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining

Shangquan Sun; Wenqi Ren; Juxiang Zhou; Shu Wang; Jianhou Gan; Xiaochun Cao

arXiv:2505.16811·cs.CV·May 23, 2025

Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining

Shangquan Sun, Wenqi Ren, Juxiang Zhou, Shu Wang, Jianhou Gan, Xiaochun Cao

PDF

Open Access

TL;DR

This paper introduces a semi-supervised, dual-branch state-space model with a dynamic stacking filter for improved real-world video deraining, enhancing generalization and supporting downstream vision tasks.

Contribution

It proposes a novel dual-branch spatio-temporal state-space model with a dynamic stacking filter and median stacking loss for semi-supervised video deraining in real-world scenarios.

Findings

01

Outperforms existing methods on synthetic and real-world benchmarks

02

Improves object detection and tracking in rainy conditions

03

Demonstrates efficiency and superior visual quality

Abstract

Significant progress has been made in video restoration under rainy conditions over the past decade, largely propelled by advancements in deep learning. Nevertheless, existing methods that depend on paired data struggle to generalize effectively to real-world scenarios, primarily due to the disparity between synthetic and authentic rain effects. To address these limitations, we propose a dual-branch spatio-temporal state-space model to enhance rain streak removal in video sequences. Specifically, we design spatial and temporal state-space model layers to extract spatial features and incorporate temporal dependencies across frames, respectively. To improve multi-frame feature fusion, we derive a dynamic stacking filter, which adaptively approximates statistical filters for superior pixel-wise feature refinement. Moreover, we develop a median stacking loss to enable semi-supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Image and Video Quality Assessment · Visual Attention and Saliency Detection