OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer
Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong

TL;DR
This paper introduces OSRT, a distortion-aware Transformer for omnidirectional image super-resolution that models realistic degradation and uses data augmentation to achieve state-of-the-art results.
Contribution
The paper proposes a realistic fisheye downsampling method and a distortion-aware Transformer for improved omnidirectional image super-resolution.
Findings
OSRT outperforms previous methods by about 0.2dB on PSNR.
Fisheye downsampling better mimics real-world imaging.
Data augmentation significantly boosts performance.
Abstract
Omnidirectional images (ODIs) have obtained lots of research interest for immersive experiences. Although ODIs require extremely high resolution to capture details of the entire scene, the resolutions of most ODIs are insufficient. Previous methods attempt to solve this issue by image super-resolution (SR) on equirectangular projection (ERP) images. However, they omit geometric properties of ERP in the degradation process, and their models can hardly generalize to real ERP images. In this paper, we propose Fisheye downsampling, which mimics the real-world imaging process and synthesizes more realistic low-resolution samples. Then we design a distortion-aware Transformer (OSRT) to modulate ERP distortions continuously and self-adaptively. Without a cumbersome process, OSRT outperforms previous methods by about 0.2dB on PSNR. Moreover, we propose a convenient data augmentation strategy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image and Video Quality Assessment
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Adam · Softmax
