Trade-offs in Image Generation: How Do Different Dimensions Interact?

Sicheng Zhang; Binzhu Xie; Zhonghao Yan; Yuli Zhang; Donghao Zhou; Xiaofei Chen; Shi Qiu; Jiaqi Liu; Guoyang Xie; Zhichao Lu

arXiv:2507.22100·cs.CV·July 31, 2025

Trade-offs in Image Generation: How Do Different Dimensions Interact?

Sicheng Zhang, Binzhu Xie, Zhonghao Yan, Yuli Zhang, Donghao Zhou, Xiaofei Chen, Shi Qiu, Jiaqi Liu, Guoyang Xie, Zhichao Lu

PDF

1 Datasets

TL;DR

This paper introduces TRIG-Bench and TRIGScore to quantify and analyze complex trade-offs among multiple dimensions in image generation models, providing a comprehensive evaluation framework.

Contribution

It presents a new benchmark dataset and metric for fine-grained analysis of trade-offs in image generation, along with a visualization system for model capabilities.

Findings

01

DTM reveals trade-offs among model dimensions

02

Fine-tuning on DTM improves model performance

03

TRIGScore adapts to various evaluation dimensions

Abstract

Model performance in text-to-image (T2I) and image-to-image (I2I) generation often depends on multiple aspects, including quality, alignment, diversity, and robustness. However, models' complex trade-offs among these dimensions have rarely been explored due to (1) the lack of datasets that allow fine-grained quantification of these trade-offs, and (2) the use of a single metric for multiple dimensions. To bridge this gap, we introduce TRIG-Bench (Trade-offs in Image Generation), which spans 10 dimensions (Realism, Originality, Aesthetics, Content, Relation, Style, Knowledge, Ambiguity, Toxicity, and Bias), contains 40,200 samples, and covers 132 pairwise dimensional subsets. Furthermore, we develop TRIGScore, a VLM-as-judge metric that automatically adapts to various dimensions. Based on TRIG-Bench and TRIGScore, we evaluate 14 models across T2I and I2I tasks. In addition, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

TRIG-bench/TRIG
dataset· 6 dl
6 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.