ST360IQ: No-Reference Omnidirectional Image Quality Assessment with Spherical Vision Transformers
Nafiseh Jabbari Tofighi, Mohamed Hedi Elfkir, Nevrez Imamoglu, Cagri, Ozcinar, Erkut Erdem, Aykut Erdem

TL;DR
The paper introduces ST360IQ, a no-reference omnidirectional image quality assessment method using spherical vision transformers that effectively predicts perceived image quality by analyzing salient viewports.
Contribution
It proposes a novel spherical vision transformer-based model for no-reference 360 image quality assessment, addressing high-resolution and spherical distortion challenges.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Correlates well with human perceived image quality
Effective in analyzing salient viewports for quality prediction
Abstract
Omnidirectional images, aka 360 images, can deliver immersive and interactive visual experiences. As their popularity has increased dramatically in recent years, evaluating the quality of 360 images has become a problem of interest since it provides insights for capturing, transmitting, and consuming this new media. However, directly adapting quality assessment methods proposed for standard natural images for omnidirectional data poses certain challenges. These models need to deal with very high-resolution data and implicit distortions due to the spherical form of the images. In this study, we present a method for no-reference 360 image quality assessment. Our proposed ST360IQ model extracts tangent viewports from the salient parts of the input omnidirectional image and employs a vision-transformers based module processing saliency selective patches/tokens that estimates a quality score…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Advanced Image Fusion Techniques
