TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation
Tan M. Dinh, Rang Nguyen, Binh-Son Hua

TL;DR
This paper introduces TISE, a comprehensive evaluation framework with new metrics for assessing text-to-image synthesis, addressing current evaluation issues and providing more consistent, human-aligned rankings.
Contribution
It proposes a set of improved and new metrics for better evaluation of text-to-image models, including multi-object scenarios, and releases a benchmarking toolbox.
Findings
Enhanced metrics like IS* improve calibration accuracy.
New metrics for multi-object evaluation show better alignment with human judgment.
Benchmarking results demonstrate more consistent method rankings.
Abstract
In this paper, we conduct a study on the state-of-the-art methods for text-to-image synthesis and propose a framework to evaluate these methods. We consider syntheses where an image contains a single or multiple objects. Our study outlines several issues in the current evaluation pipeline: (i) for image quality assessment, a commonly used metric, e.g., Inception Score (IS), is often either miscalibrated for the single-object case or misused for the multi-object case; (ii) for text relevance and object accuracy assessment, there is an overfitting phenomenon in the existing R-precision (RP) and Semantic Object Accuracy (SOA) metrics, respectively; (iii) for multi-object case, many vital factors for evaluation, e.g., object fidelity, positional alignment, counting alignment, are largely dismissed; (iv) the ranking of the methods based on current metrics is highly inconsistent with real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Advanced Neural Network Applications · Machine Learning in Materials Science
