Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein, Cristina Luna-Jim\'enez, Fernando, Fern\'andez-Mart\'inez

TL;DR
This paper examines how current text-to-image models lack the use of figurative and subjective language, which limits their artistic and creative potential, and suggests incorporating more subjective data for improvement.
Contribution
It characterizes the language used in training data for text-to-image models and advocates for including subjective and artistic language to enhance generative capabilities.
Findings
Training data is predominantly objective and descriptive.
Current models lack figurative and subjective language.
Incorporating artistic language could improve creativity.
Abstract
The impressive capacity shown by recent text-to-image diffusion models to generate high-quality pictures from textual input prompts has leveraged the debate about the very definition of art. Nonetheless, these models have been trained using text data collected from content-based labelling protocols that focus on describing the items and actions in an image but neglect any subjective appraisal. Consequently, these automatic systems need rigorous descriptions of the elements and the pictorial style of the image to be generated, otherwise failing to deliver. As potential indicators of the actual artistic capabilities of current generative models, we characterise the sentimentality, objectiveness and degree of abstraction of publicly available text data used to train current text-to-image diffusion models. Considering the sharp difference observed between their language style and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Image Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
