Frames2Residual: Spatiotemporal Decoupling for Self-Supervised Video Denoising

Mingjie Ji; Zhan Shi; Kailai Zhou; Zixuan Fu; Xun Cao

arXiv:2603.10417·cs.CV·March 12, 2026

Frames2Residual: Spatiotemporal Decoupling for Self-Supervised Video Denoising

Mingjie Ji, Zhan Shi, Kailai Zhou, Zixuan Fu, Xun Cao

PDF

Open Access

TL;DR

Frames2Residual introduces a novel spatiotemporal decoupling framework for self-supervised video denoising, effectively separating temporal consistency modeling from spatial texture recovery to improve performance.

Contribution

The paper proposes a two-stage decoupling approach that explicitly separates temporal and spatial denoising processes, enhancing self-supervised video denoising performance.

Findings

01

Outperforms existing self-supervised methods on sRGB and raw video benchmarks.

02

Effectively models inter-frame temporal consistency while preserving intra-frame spatial details.

03

Demonstrates significant improvements through extensive experiments.

Abstract

Self-supervised video denoising methods typically extend image-based frameworks into the temporal dimension, yet they often struggle to integrate inter-frame temporal consistency with intra-frame spatial specificity. Existing Video Blind-Spot Networks (BSNs) require noise independence by masking the center pixel, this constraint prevents the use of spatial evidence for texture recovery, thereby severing spatiotemporal correlations and causing texture loss. To address this, we propose Frames2Residual (F2R), a spatiotemporal decoupling framework that explicitly divides self-supervised training into two distinct stages: blind temporal consistency modeling and non-blind spatial texture recovery. In Stage 1, a blind temporal estimator learns inter-frame consistency using a frame-wise blind strategy, producing a temporally consistent anchor. In Stage 2, a non-blind spatial refiner leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis