PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Hansheng Guo, Juncheng Li, Guangwei Gao, Zhi Li, Tieyong Zeng

TL;DR
This paper introduces PFT-SSR, a Transformer-based model that leverages cross-view and intra-view information for stereo image super-resolution, achieving state-of-the-art performance.
Contribution
The paper proposes a novel Parallax Fusion Transformer with cross-view and intra-view modules, enhancing stereo image super-resolution beyond existing methods.
Findings
Outperforms most state-of-the-art methods
Achieves competitive super-resolution results
Demonstrates effectiveness of the Transformer-based architecture
Abstract
Stereo image super-resolution aims to boost the performance of image super-resolution by exploiting the supplementary information provided by binocular systems. Although previous methods have achieved promising results, they did not fully utilize the information of cross-view and intra-view. To further unleash the potential of binocular images, in this letter, we propose a novel Transformerbased parallax fusion module called Parallax Fusion Transformer (PFT). PFT employs a Cross-view Fusion Transformer (CVFT) to utilize cross-view information and an Intra-view Refinement Transformer (IVRT) for intra-view feature refinement. Meanwhile, we adopted the Swin Transformer as the backbone for feature extraction and SR reconstruction to form a pure Transformer architecture called PFT-SSR. Extensive experiments and ablation studies show that PFT-SSR achieves competitive results and outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Absolute Position Encodings · Softmax · Residual Connection · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer
