What Do AI-Generated Images Want?
Amanda Wasielewski

TL;DR
This paper explores the agency of AI-generated images, arguing they desire specificity and concreteness due to their abstract nature, and critiques the misconception that text and images are directly interchangeable in multimodal models.
Contribution
It reframes Mitchell's question in the context of AI image generation, highlighting the abstract nature of AI images and critiquing the assumptions about text-image interchangeability.
Findings
AI images are fundamentally abstract and desire concreteness.
Multimodal models obscure the representational differences between text and images.
The pipeline from text to image creates an illusion of direct transformation.
Abstract
W.J.T. Mitchell's influential essay 'What do pictures want?' shifts the theoretical focus away from the interpretative act of understanding pictures and from the motivations of the humans who create them to the possibility that the picture itself is an entity with agency and wants. In this article, I reframe Mitchell's question in light of contemporary AI image generation tools to ask: what do AI-generated images want? Drawing from art historical discourse on the nature of abstraction, I argue that AI-generated images want specificity and concreteness because they are fundamentally abstract. Multimodal text-to-image models, which are the primary subject of this article, are based on the premise that text and image are interchangeable or exchangeable tokens and that there is a commensurability between them, at least as represented mathematically in data. The user pipeline that sees…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Digital Media and Philosophy · Cybernetics and Technology in Society
