GPSMamba: A Global Phase and Spectral Prompt-guided Mamba for Infrared Image Super-Resolution
Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi

TL;DR
GPSMamba introduces a novel framework for infrared image super-resolution that leverages global phase and spectral prompts, non-causal supervision, and spectral attention to enhance detail and global coherence.
Contribution
It proposes a new architecture combining semantic-frequency prompts with non-causal supervision, overcoming limitations of causal models in infrared image restoration.
Findings
Achieves state-of-the-art performance on infrared super-resolution tasks.
Effectively preserves global structure and spectral fidelity.
Demonstrates robustness across diverse infrared datasets.
Abstract
Infrared Image Super-Resolution (IRSR) is challenged by the low contrast and sparse textures of infrared data, requiring robust long-range modeling to maintain global coherence. While State-Space Models like Mamba offer proficiency in modeling long-range dependencies for this task, their inherent 1D causal scanning mechanism fragments the global context of 2D images, hindering fine-detail restoration. To address this, we propose Global Phase and Spectral Prompt-guided Mamba (GPSMamba), a framework that synergizes architectural guidance with non-causal supervision. First, our Adaptive Semantic-Frequency State Space Module (ASF-SSM) injects a fused semantic-frequency prompt directly into the Mamba block, integrating non-local context to guide reconstruction. Then, a novel Thermal-Spectral Attention and Phase Consistency Loss provides explicit, non-causal supervision to enforce global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Image Fusion Techniques · Generative Adversarial Networks and Image Synthesis
