SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation
Carolin Holtermann, Florian Schneider, Anne Lauscher

TL;DR
This paper investigates how multilingual text-to-image models often prioritize surface language features over prompt semantics, leading to stereotypical visual outputs, and introduces a new measure to quantify this Surface-over-Semantics behavior.
Contribution
It provides the first comprehensive analysis of Surface-over-Semantics tendencies in T2I models across multiple languages and cultures, introducing a novel quantification method.
Findings
Most models exhibit strong surface-level biases in at least two languages.
Surface tendencies increase across encoder layers.
Surface biases often correlate with stereotypical visual depictions.
Abstract
Text-to-image (T2I) models are increasingly employed by users worldwide. However, prior research has pointed to the high sensitivity of T2I towards particular input languages - when faced with languages other than English (i.e., different surface forms of the same prompt), T2I models often produce culturally stereotypical depictions, prioritizing the surface over the prompt's semantics. Yet a comprehensive analysis of this behavior, which we dub Surface-over-Semantics (SoS), is missing. We present the first analysis of T2I models' SoS tendencies. To this end, we create a set of prompts covering 171 cultural identities, translated into 14 languages, and use it to prompt seven T2I models. To quantify SoS tendencies across models, languages, and cultures, we introduce a novel measure and analyze how the tendencies we identify manifest visually. We show that all but one model exhibit strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Data Visualization and Analytics
