I Prompt, it Generates, we Negotiate. Exploring Text-Image Intertextuality in Human-AI Co-Creation of Visual Narratives with VLMs

Mengyao Guo; Kexin Nie; Ze Gao; Black Sun; Xueyang Wang; Jinda Han; Xingting Wu

arXiv:2511.03375 · cs.HC · November 6, 2025

Open Access

TL;DR

This study explores how humans and AI collaboratively create visual narratives using VLMs, revealing strategies, collaboration patterns, and challenges in understanding and leveraging text-image intertextuality.

Contribution

It provides an empirical analysis of text-image intertextuality in human-AI co-creation and proposes design implications for role-based AI storytelling assistants.

Findings

01

Users harness AI's semantic surplus by recognizing meaningful visual content beyond literal descriptions and iteratively refining prompts.

02

Four distinct collaboration patterns identified; fsQCA reveals three pathways to successful collaboration: Educational Collaborator, Technical Expert, and Visual Thinker.

03

Challenges include cultural representation gaps and visual consistency issues.

Abstract

Creating meaningful visual narratives through human-AI collaboration requires understanding how text-image intertextuality emerges when textual intentions meet AI-generated visuals. We conducted a three-phase qualitative study with 15 participants using GPT-4o to investigate how novices navigate sequential visual narratives. Our findings show that users develop strategies to harness AI's semantic surplus by recognizing meaningful visual content beyond literal descriptions, iteratively refining prompts, and constructing narrative significance through complementary text-image relationships. We identified four distinct collaboration patterns and, through fuzzy-set qualitative comparative analysis (fsQCA), discovered three pathways to successful intertextual collaboration: Educational Collaborator, Technical Expert, and Visual Thinker. However, participants faced challenges, including cultural representation gaps, visual…

Taxonomy

Topics: Ethics and Social Impacts of AI · Innovative Human-Technology Interaction · AI in Service Interactions