A New Dataset and Transformer for Stereoscopic Video Super-Resolution

Hassan Imani; Md Baharul Islam; Lai-Kuan Wong

arXiv:2204.10039·cs.CV·April 22, 2022

A New Dataset and Transformer for Stereoscopic Video Super-Resolution

Hassan Imani, Md Baharul Islam, Lai-Kuan Wong

PDF

Open Access 1 Repo

TL;DR

This paper introduces Trans-SVSR, a Transformer-based model for stereoscopic video super-resolution that preserves stereo and temporal consistency, and provides a new high-quality dataset for benchmarking.

Contribution

The paper proposes a novel Transformer architecture with spatio-temporal self-attention and optical flow layers for SVSR, and introduces a new stereoscopic video dataset, SVSR-Set.

Findings

01

Trans-SVSR achieves competitive performance on multiple datasets.

02

The proposed model effectively preserves stereo and temporal consistency.

03

The new dataset enables better benchmarking of SVSR methods.

Abstract

Stereo video super-resolution (SVSR) aims to enhance the spatial resolution of the low-resolution video by reconstructing the high-resolution video. The key challenges in SVSR are preserving the stereo-consistency and temporal-consistency, without which viewers may experience 3D fatigue. There are several notable works on stereoscopic image super-resolution, but there is little research on stereo video super-resolution. In this paper, we propose a novel Transformer-based model for SVSR, namely Trans-SVSR. Trans-SVSR comprises two key novel components: a spatio-temporal convolutional self-attention layer and an optical flow-based feed-forward layer that discovers the correlation across different video frames and aligns the features. The parallax attention mechanism (PAM) that uses the cross-view information to consider the significant disparities is used to fuse the stereo views. Due to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

h-deep/trans-svsr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image and Video Quality Assessment