Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Muxi Chen, Yi Liu, Jian Yi, Changran Xu, Qiuxia Lai, Hongliang Wang,, Tsung-Yi Ho, Qiang Xu

TL;DR
This paper introduces an empirical evaluation framework for text-to-image models focusing on human image synthesis, assessing image quality, concept accuracy, and fairness, with new datasets and predictive models to improve understanding and development.
Contribution
It presents a novel evaluation framework, an aesthetic score prediction model, and a dataset with annotated low-quality regions for automatic defect detection in human image synthesis.
Findings
Developed an aesthetic score prediction model.
Created a dataset with low-quality region annotations.
Analyzed biases related to gender, race, and age.
Abstract
In this paper, we present an empirical study introducing a nuanced evaluation framework for text-to-image (T2I) generative models, applied to human image synthesis. Our framework categorizes evaluations into two distinct groups: first, focusing on image qualities such as aesthetics and realism, and second, examining text conditions through concept coverage and fairness. We introduce an innovative aesthetic score prediction model that assesses the visual appeal of generated images and unveils the first dataset marked with low-quality regions in generated human images to facilitate automatic defect detection. Our exploration into concept coverage probes the model's effectiveness in interpreting and rendering text-based concepts accurately, while our analysis of fairness reveals biases in model outputs, with an emphasis on gender, race, and age. While our study is grounded in human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Advanced Technology in Applications · Image Processing and 3D Reconstruction
