PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution

Hansheng Guo; Juncheng Li; Guangwei Gao; Zhi Li; Tieyong Zeng

arXiv:2303.13807·cs.CV·March 27, 2023·1 cites

PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution

Hansheng Guo, Juncheng Li, Guangwei Gao, Zhi Li, Tieyong Zeng

PDF

Open Access

TL;DR

This paper introduces PFT-SSR, a Transformer-based model that leverages cross-view and intra-view information for stereo image super-resolution, achieving state-of-the-art performance.

Contribution

The paper proposes a novel Parallax Fusion Transformer with cross-view and intra-view modules, enhancing stereo image super-resolution beyond existing methods.

Findings

01

Outperforms most state-of-the-art methods

02

Achieves competitive super-resolution results

03

Demonstrates effectiveness of the Transformer-based architecture

Abstract

Stereo image super-resolution aims to boost the performance of image super-resolution by exploiting the supplementary information provided by binocular systems. Although previous methods have achieved promising results, they did not fully utilize the information of cross-view and intra-view. To further unleash the potential of binocular images, in this letter, we propose a novel Transformerbased parallax fusion module called Parallax Fusion Transformer (PFT). PFT employs a Cross-view Fusion Transformer (CVFT) to utilize cross-view information and an Intra-view Refinement Transformer (IVRT) for intra-view feature refinement. Meanwhile, we adopted the Swin Transformer as the backbone for feature extraction and SR reconstruction to form a pure Transformer architecture called PFT-SSR. Extensive experiments and ablation studies show that PFT-SSR achieves competitive results and outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Absolute Position Encodings · Softmax · Residual Connection · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer