Prompting Generative AI with Interaction-Augmented Instructions
By: Leixian Shen , Haotian Li , Yifang Wang and more
Potential Business Impact:
Makes AI understand your instructions better.
The emergence of generative AI (GenAI) models, including large language models and text-to-image models, has significantly advanced the synergy between humans and AI with not only their outstanding capability but more importantly, the intuitive communication method with text prompts. Though intuitive, text-based instructions suffer from natural languages' ambiguous and redundant nature. To address the issue, researchers have explored augmenting text-based instructions with interactions that facilitate precise and effective human intent expression, such as direct manipulation. However, the design strategy of interaction-augmented instructions lacks systematic investigation, hindering our understanding and application. To provide a panorama of interaction-augmented instructions, we propose a framework to analyze related tools from why, when, who, what, and how interactions are applied to augment text-based instructions. Notably, we identify four purposes for applying interactions, including restricting, expanding, organizing, and refining text instructions. The design paradigms for each purpose are also summarized to benefit future researchers and practitioners.
Similar Papers
Interaction-Augmented Instruction: Modeling the Synergy of Prompts and Interactions in Human-GenAI Collaboration
Human-Computer Interaction
Helps AI understand instructions better with clicks.
Expanding the Generative AI Design Space through Structured Prompting and Multimodal Interfaces
Human-Computer Interaction
Helps small businesses make ads easily.
When Teams Embrace AI: Human Collaboration Strategies in Generative Prompting in a Creative Design Task
Human-Computer Interaction
Helps teams create art by talking to AI.