OSRT: Omnidirectional Image Super-Resolution with Distortion-aware   Transformer

Fanghua Yu; Xintao Wang; Mingdeng Cao; Gen Li; Ying Shan; Chao Dong

arXiv:2302.03453·eess.IV·February 10, 2023·1 cites

OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer

Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong

PDF

Open Access 1 Repo

TL;DR

This paper introduces OSRT, a distortion-aware Transformer for omnidirectional image super-resolution that models realistic degradation and uses data augmentation to achieve state-of-the-art results.

Contribution

The paper proposes a realistic fisheye downsampling method and a distortion-aware Transformer for improved omnidirectional image super-resolution.

Findings

01

OSRT outperforms previous methods by about 0.2dB on PSNR.

02

Fisheye downsampling better mimics real-world imaging.

03

Data augmentation significantly boosts performance.

Abstract

Omnidirectional images (ODIs) have obtained lots of research interest for immersive experiences. Although ODIs require extremely high resolution to capture details of the entire scene, the resolutions of most ODIs are insufficient. Previous methods attempt to solve this issue by image super-resolution (SR) on equirectangular projection (ERP) images. However, they omit geometric properties of ERP in the degradation process, and their models can hardly generalize to real ERP images. In this paper, we propose Fisheye downsampling, which mimics the real-world imaging process and synthesizes more realistic low-resolution samples. Then we design a distortion-aware Transformer (OSRT) to modulate ERP distortions continuously and self-adaptively. Without a cumbersome process, OSRT outperforms previous methods by about 0.2dB on PSNR. Moreover, we propose a convenient data augmentation strategy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fanghua-yu/osrt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image and Video Quality Assessment

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Adam · Softmax