TalkSketch: Multimodal Generative AI for Real-time Sketch Ideation with Speech

Published: November 8, 2025 | arXiv ID: 2511.05817v2

By: Weiyan Shi, Sunaya Upadhyay, Geraldine Quek and more

Potential Business Impact:

Draw and talk to create design ideas faster.

Business Areas:
Intelligent Systems, Artificial Intelligence, Data and Analytics, Science and Engineering

Sketching is a widely used medium for generating and exploring early-stage design concepts. While generative AI (GenAI) chatbots are increasingly used for idea generation, designers often struggle to craft effective prompts and find it difficult to express evolving visual concepts through text alone. In a formative study (N=6), we examined how designers use GenAI during ideation and found that text-based prompting disrupts creative flow. To address these issues, we developed TalkSketch, an embedded multimodal AI sketching system that integrates freehand drawing with real-time speech input. TalkSketch aims to support a more fluid ideation process by capturing verbal descriptions during sketching and generating context-aware AI responses. Our work highlights the potential of GenAI tools to engage with the design process itself rather than focusing on output.

Country of Origin
🇸🇬 🇺🇸 United States, Singapore

Page Count
16 pages

Category
Computer Science:
Human-Computer Interaction