ColorConceptBench: A Benchmark for Probabilistic Color-Concept Understanding in Text-to-Image Models

Chenxi Ruan; Yihan Hou; Yu Xiao; Guosheng Hu; Wei Zeng

arXiv:2601.16836·cs.CV·May 12, 2026

TL;DR

ColorConceptBench is a new benchmark that evaluates how well text-to-image models understand implicit color concepts, revealing significant gaps in their semantic comprehension.

Contribution

We introduce ColorConceptBench, a systematic benchmark for assessing probabilistic color-concept associations in T2I models, focusing on implicit and abstract semantics.

Findings

01. Performance varies substantially across semantic categories.

02. Models show a significant lack of sensitivity to abstract semantics.

03. Performance gaps persist even with guidance scaling.

Abstract

Text-to-image (T2I) models have advanced considerably in generating high-quality images from textual descriptions. However, their ability to associate colors with concepts remains largely constrained to explicit color names or codes, while their capacity to handle implicit concepts, such as emotions and visual states, remains underexplored. To address this gap, we introduce ColorConceptBench, an expert-annotated benchmark that systematically evaluates color-concept associations through probabilistic color distributions. ColorConceptBench moves beyond explicit color specifications by examining how models interpret 1,281 implicit color concepts, grounded in 6,584 human annotations. Our evaluation of nine leading T2I models reveals that performance varies substantially across semantic categories, and models exhibit a significant lack of sensitivity to abstract semantics. These…
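To make the idea of comparing probabilistic color distributions concrete, here is a minimal sketch. The abstract does not name the metric the benchmark uses, so the Jensen-Shannon divergence below is an assumption chosen for illustration, and the color bins, counts, and function names are hypothetical rather than taken from the paper.

```python
import math


def normalize(counts):
    """Turn raw per-color counts into a probability distribution."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must sum to a positive value")
    return {color: n / total for color, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two color distributions.

    Returns a value in [0, 1]: 0 for identical distributions,
    1 for distributions with no shared color bins.
    """
    colors = set(p) | set(q)
    # Mixture distribution m = (p + q) / 2 over the union of color bins.
    m = {c: 0.5 * (p.get(c, 0.0) + q.get(c, 0.0)) for c in colors}

    def kl(a):
        # KL(a || m); bins where a is zero contribute nothing.
        return sum(
            a[c] * math.log2(a[c] / m[c]) for c in colors if a.get(c, 0.0) > 0
        )

    return 0.5 * kl(p) + 0.5 * kl(q)


# Hypothetical data: how annotators and a model's generated images
# distribute color for one implicit concept (counts are made up).
human = normalize({"red": 6, "orange": 3, "blue": 1})
model = normalize({"red": 2, "orange": 1, "blue": 7})

print(f"JS divergence: {js_divergence(human, model):.3f}")
```

A lower score here would mean the model's color usage for the concept sits closer to the human-annotated distribution; any actual scoring in ColorConceptBench may differ from this sketch.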
