Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content

Shushi Wang; Zicheng Zhang; Chunyi Li; Wei Wang; Liya Ma; Fengjiao Chen; Xiaoyu Li; Xuezhi Cao; Guangtao Zhai; Xiaohong Liu

arXiv:2511.16908·cs.CV·April 2, 2026

Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content

Shushi Wang, Zicheng Zhang, Chunyi Li, Wei Wang, Liya Ma, Fengjiao Chen, Xiaoyu Li, Xuezhi Cao, Guangtao Zhai, Xiaohong Liu

PDF

TL;DR

Q-Real is a new dataset and benchmark for fine-grained evaluation of realism and plausibility in AI-generated images, aiming to improve model assessment and optimization.

Contribution

The paper introduces Q-Real, a detailed dataset with annotations and a benchmark for evaluating AI-generated images along realism and plausibility dimensions.

Findings

01

Q-Real dataset contains 3,088 annotated images.

02

Benchmark evaluates multi-modal large language models on judgment and grounding tasks.

03

Fine-tuning MLLMs on Q-Real improves their evaluation capabilities.

Abstract

Quality assessment of AI-generated content is crucial for evaluating model capability and guiding model optimization. However, most existing quality assessment datasets and models provide only a single quality score, which is too coarse to offer targeted guidance for improving generative models. In current applications of AI-generated images, realism and plausibility are two critical dimensions, and with the emergence of unified generation-understanding models, fine-grained evaluation along these dimensions becomes especially effective for improving generative performance. Therefore, we introduce Q-Real, a novel dataset for fine-grained evaluation of realism and plausibility in AI-generated images. Q-Real consists of 3,088 images generated by popular text-to-image models. For each image, we annotate the locations of major entities and provide a set of judgment questions and attribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.