I Prompt, it Generates, we Negotiate. Exploring Text-Image Intertextuality in Human-AI Co-Creation of Visual Narratives with VLMs

Published: November 5, 2025 | arXiv ID: 2511.03375v1

By: Mengyao Guo, Kexin Nie, Ze Gao and more

Potential Business Impact:

Helps people tell stories with AI pictures.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Creating meaningful visual narratives through human-AI collaboration requires understanding how text-image intertextuality emerges when textual intentions meet AI-generated visuals. We conducted a three-phase qualitative study with 15 participants using GPT-4o to investigate how novices navigate sequential visual narratives. Our findings show that users develop strategies to harness AI's semantic surplus by recognizing meaningful visual content beyond literal descriptions, iteratively refining prompts, and constructing narrative significance through complementary text-image relationships. We identified four distinct collaboration patterns and, using fuzzy-set qualitative comparative analysis (fsQCA), discovered three pathways to successful intertextual collaboration: Educational Collaborator, Technical Expert, and Visual Thinker. However, participants faced challenges, including cultural representation gaps, visual consistency issues, and difficulties translating narrative concepts into visual prompts. These findings contribute to HCI research by providing an empirical account of text-image intertextuality in human-AI co-creation and proposing design implications for role-based AI assistants that better support iterative, human-led creative processes in visual storytelling.

Interaction-Augmented Instruction: Modeling the Synergy of Prompts and Interactions in Human-GenAI Collaboration

Human-Computer Interaction

Helps AI understand instructions better with clicks.

30 Oct 2025 2

89%

Prompting Generative AI with Interaction-Augmented Instructions

Human-Computer Interaction

Makes AI understand your instructions better.

4 Mar 2025 2

89%

From Prompting to Partnering: Personalization Features for Human-LLM Interactions

Human-Computer Interaction

Makes AI easier to use and understand.

2 Mar 2025 0

Country of Origin

🇨🇳 🇦🇺 🇺🇸 China, Australia, United States

Page Count

38 pages
