TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan, Xinyan Cao, Jinming Che, Qinyuan Wang, Sen Liang, Wei, Ren, Jinlong Lin, Xixin Cao

TL;DR
This paper introduces TIER, a novel framework that leverages text prompts and generated images using encoders to improve AI-generated image quality assessment, outperforming existing methods.
Contribution
The paper proposes a text-image encoder-based regression framework that incorporates text prompts into quality assessment, addressing limitations of previous image-only approaches.
Findings
TIER outperforms baseline methods on multiple AIGCIQA datasets.
Incorporating text prompts improves the accuracy of image quality prediction.
Extensive experiments validate the effectiveness of the proposed approach.
Abstract
Recently, AIGC image quality assessment (AIGCIQA), which aims to assess the quality of AI-generated images (AIGIs) from a human perception perspective, has emerged as a new topic in computer vision. Unlike common image quality assessment tasks where images are derived from original ones distorted by noise, blur, and compression, \textit{etc.}, in AIGCIQA tasks, images are typically generated by generative models using text prompts. Considerable efforts have been made in the past years to advance AIGCIQA. However, most existing AIGCIQA methods regress predicted scores directly from individual generated images, overlooking the information contained in the text prompts of these images. This oversight partially limits the performance of these AIGCIQA methods. To address this issue, we propose a text-image encoder-based regression (TIER) framework. Specifically, we process the generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Fusion Techniques · Visual Attention and Saliency Detection
