Evaluating Text-to-Image Generative Models: An Empirical Study on Human   Image Synthesis

Muxi Chen; Yi Liu; Jian Yi; Changran Xu; Qiuxia Lai; Hongliang Wang,; Tsung-Yi Ho; Qiang Xu

arXiv:2403.05125·cs.CV·October 29, 2024·3 cites

Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis

Muxi Chen, Yi Liu, Jian Yi, Changran Xu, Qiuxia Lai, Hongliang Wang,, Tsung-Yi Ho, Qiang Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces an empirical evaluation framework for text-to-image models focusing on human image synthesis, assessing image quality, concept accuracy, and fairness, with new datasets and predictive models to improve understanding and development.

Contribution

It presents a novel evaluation framework, an aesthetic score prediction model, and a dataset with annotated low-quality regions for automatic defect detection in human image synthesis.

Findings

01

Developed an aesthetic score prediction model.

02

Created a dataset with low-quality region annotations.

03

Analyzed biases related to gender, race, and age.

Abstract

In this paper, we present an empirical study introducing a nuanced evaluation framework for text-to-image (T2I) generative models, applied to human image synthesis. Our framework categorizes evaluations into two distinct groups: first, focusing on image qualities such as aesthetics and realism, and second, examining text conditions through concept coverage and fairness. We introduce an innovative aesthetic score prediction model that assesses the visual appeal of generated images and unveils the first dataset marked with low-quality regions in generated human images to facilitate automatic defect detection. Our exploration into concept coverage probes the model's effectiveness in interpreting and rendering text-based concepts accurately, while our analysis of fairness reveals biases in model outputs, with an emphasis on gender, race, and age. While our study is grounded in human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cure-lab/evaluateaigc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Advanced Technology in Applications · Image Processing and 3D Reconstruction