TIER: Text-Image Encoder-based Regression for AIGC Image Quality   Assessment

Jiquan Yuan; Xinyan Cao; Jinming Che; Qinyuan Wang; Sen Liang; Wei; Ren; Jinlong Lin; Xixin Cao

arXiv:2401.03854·cs.CV·January 12, 2024·1 cites

TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment

Jiquan Yuan, Xinyan Cao, Jinming Che, Qinyuan Wang, Sen Liang, Wei, Ren, Jinlong Lin, Xixin Cao

PDF

Open Access 1 Repo

TL;DR

This paper introduces TIER, a novel framework that leverages text prompts and generated images using encoders to improve AI-generated image quality assessment, outperforming existing methods.

Contribution

The paper proposes a text-image encoder-based regression framework that incorporates text prompts into quality assessment, addressing limitations of previous image-only approaches.

Findings

01

TIER outperforms baseline methods on multiple AIGCIQA datasets.

02

Incorporating text prompts improves the accuracy of image quality prediction.

03

Extensive experiments validate the effectiveness of the proposed approach.

Abstract

Recently, AIGC image quality assessment (AIGCIQA), which aims to assess the quality of AI-generated images (AIGIs) from a human perception perspective, has emerged as a new topic in computer vision. Unlike common image quality assessment tasks where images are derived from original ones distorted by noise, blur, and compression, \textit{etc.}, in AIGCIQA tasks, images are typically generated by generative models using text prompts. Considerable efforts have been made in the past years to advance AIGCIQA. However, most existing AIGCIQA methods regress predicted scores directly from individual generated images, overlooking the information contained in the text prompts of these images. This oversight partially limits the performance of these AIGCIQA methods. To address this issue, we propose a text-image encoder-based regression (TIER) framework. Specifically, we process the generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiquan123/TIER
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Advanced Image Fusion Techniques · Visual Attention and Saliency Detection