A New Dataset and Transformer for Stereoscopic Video Super-Resolution
Hassan Imani, Md Baharul Islam, Lai-Kuan Wong

TL;DR
This paper introduces Trans-SVSR, a Transformer-based model for stereoscopic video super-resolution that preserves stereo and temporal consistency, and provides a new high-quality dataset for benchmarking.
Contribution
The paper proposes a novel Transformer architecture with spatio-temporal self-attention and optical flow layers for SVSR, and introduces a new stereoscopic video dataset, SVSR-Set.
Findings
Trans-SVSR achieves competitive performance on multiple datasets.
The proposed model effectively preserves stereo and temporal consistency.
The new dataset enables better benchmarking of SVSR methods.
Abstract
Stereo video super-resolution (SVSR) aims to enhance the spatial resolution of the low-resolution video by reconstructing the high-resolution video. The key challenges in SVSR are preserving the stereo-consistency and temporal-consistency, without which viewers may experience 3D fatigue. There are several notable works on stereoscopic image super-resolution, but there is little research on stereo video super-resolution. In this paper, we propose a novel Transformer-based model for SVSR, namely Trans-SVSR. Trans-SVSR comprises two key novel components: a spatio-temporal convolutional self-attention layer and an optical flow-based feed-forward layer that discovers the correlation across different video frames and aligns the features. The parallax attention mechanism (PAM) that uses the cross-view information to consider the significant disparities is used to fuse the stereo views. Due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image and Video Quality Assessment
