ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations
Kailas Vodrahalli, James Zou

TL;DR
This paper introduces ArtWhisperer, a large dataset of human interactions with text-to-image AI models, revealing diverse user strategies and proposing a new metric to measure AI steerability across different image types.
Contribution
The paper presents a novel dataset of over 50,000 human-AI interactions and introduces a metric for quantifying AI steerability based on user interaction sequences.
Findings
Users generate diverse prompts for similar images.
Prompt diversity persists even with improved prompts.
City and natural images are more steerable than artistic ones.
Abstract
As generative AI becomes more prevalent, it is important to study how human users interact with such models. In this work, we investigate how people use text-to-image models to generate desired target images. To study this interaction, we created ArtWhisperer, an online game where users are given a target image and are tasked with iteratively finding a prompt that creates a similar-looking image as the target. Through this game, we recorded over 50,000 human-AI interactions; each interaction corresponds to one text prompt created by a user and the corresponding generated image. The majority of these are repeated interactions where a user iterates to find the best prompt for their target image, making this a unique sequential dataset for studying human-AI collaborations. In an initial analysis of this dataset, we identify several characteristics of prompt interactions and user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Data Visualization and Analytics
