SwinFSR: Stereo Image Super-Resolution using SwinIR and Frequency Domain Knowledge
Ke Chen, Liangyan Li, Huan Liu, Yunzhe Li, Congling Tang, Jun Chen

TL;DR
SwinFSR is a novel stereo image super-resolution method that combines SwinIR architecture with frequency domain knowledge and introduces a new cross-attention module for efficient stereo view fusion.
Contribution
The paper extends SwinIR with frequency domain features and proposes RCAM for improved stereo image fusion, enhancing super-resolution performance.
Findings
Outperforms existing methods in accuracy and efficiency
Effective integration of frequency domain knowledge
Reduced computational cost with RCAM
Abstract
Stereo Image Super-Resolution (stereoSR) has attracted significant attention in recent years due to the extensive deployment of dual cameras in mobile phones, autonomous vehicles and robots. In this work, we propose a new StereoSR method, named SwinFSR, based on an extension of SwinIR, originally designed for single image restoration, and the frequency domain knowledge obtained by the Fast Fourier Convolution (FFC). Specifically, to effectively gather global information, we modify the Residual Swin Transformer blocks (RSTBs) in SwinIR by explicitly incorporating the frequency domain knowledge using the FFC and employing the resulting residual Swin Fourier Transformer blocks (RSFTBs) for feature extraction. Besides, for the efficient and accurate fusion of stereo views, we propose a new cross-attention module referred to as RCAM, which achieves highly competitive performance while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Photoacoustic and Ultrasonic Imaging
MethodsAttention Is All You Need · Adam · Label Smoothing · Dropout · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Concatenated Skip Connection · Convolution · Softmax · Stochastic Depth
