XYScanNet: A State Space Model for Single Image Deblurring

Hanzhou Liu; Chengkai Liu; Jiacong Xu; Peng Jiang; Mi Lu

arXiv:2412.10338·cs.CV·June 24, 2025

XYScanNet: A State Space Model for Single Image Deblurring

Hanzhou Liu, Chengkai Liu, Jiacong Xu, Peng Jiang, Mi Lu

PDF

Open Access

TL;DR

XYScanNet introduces a novel state space model with a slice-and-scan strategy and a vision state space module, significantly improving perceptual quality in single image deblurring while maintaining competitive distortion metrics.

Contribution

The paper proposes a new slice-and-scan strategy and a vision state space module, advancing state space models for improved image deblurring performance.

Findings

01

Enhances KID by 17% over nearest competitor

02

Maintains competitive distortion metrics

03

Significantly improves perceptual quality

Abstract

Deep state-space models (SSMs), like recent Mamba architectures, are emerging as a promising alternative to CNN and Transformer networks. Existing Mamba-based restoration methods process visual data by leveraging a flatten-and-scan strategy that converts image patches into a 1D sequence before scanning. However, this scanning paradigm ignores local pixel dependencies and introduces spatial misalignment by positioning distant pixels incorrectly adjacent, which reduces local noise-awareness and degrades image sharpness in low-level vision tasks. To overcome these issues, we propose a novel slice-and-scan strategy that alternates scanning along intra- and inter-slices. We further design a new Vision State Space Module (VSSM) for image deblurring, and tackle the inefficiency challenges of the current Mamba-based vision module. Building upon this, we develop XYScanNet, an SSM architecture…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Digital Media Forensic Detection · Image and Signal Denoising Methods

MethodsAttention Is All You Need · Linear Layer · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Adam · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Label Smoothing · Dense Connections · Byte Pair Encoding