ExIQA: Explainable Image Quality Assessment Using Distortion Attributes

Sepehr Kazemi Ranjbar; Emad Fatemizadeh

arXiv:2409.06853·cs.CV·September 12, 2024

ExIQA: Explainable Image Quality Assessment Using Distortion Attributes

Sepehr Kazemi Ranjbar, Emad Fatemizadeh

PDF

Open Access

TL;DR

This paper introduces ExIQA, an explainable BIQA method that predicts distortion attributes using Vision-Language Models, enabling accurate and transparent image quality assessment without reference images.

Contribution

The paper proposes a novel attribute-based distortion identification approach using VLMs, generating a large dataset, and achieving state-of-the-art performance in blind image quality assessment.

Findings

01

Achieves state-of-the-art results on multiple datasets.

02

Demonstrates strong zero-shot generalization.

03

Provides explainability through attribute-based distortion inference.

Abstract

Blind Image Quality Assessment (BIQA) aims to develop methods that estimate the quality scores of images in the absence of a reference image. In this paper, we approach BIQA from a distortion identification perspective, where our primary goal is to predict distortion types and strengths using Vision-Language Models (VLMs), such as CLIP, due to their extensive knowledge and generalizability. Based on these predicted distortions, we then estimate the quality score of the image. To achieve this, we propose an explainable approach for distortion identification based on attribute learning. Instead of prompting VLMs with the names of distortions, we prompt them with the attributes or effects of distortions and aggregate this information to infer the distortion strength. Additionally, we consider multiple distortions per image, making our method more scalable. To support this, we generate a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Industrial Vision Systems and Defect Detection · Advanced Image Processing Techniques

MethodsContrastive Language-Image Pre-training