Exploring Student Choice and the Use of Multimodal Generative AI in Programming Learning
By: Xinying Hou, Ruiwei Xiao, Runlong Ye and more
Potential Business Impact:
Helps students learn coding with AI that sees and hears.
The broad adoption of Generative AI (GenAI) is impacting Computer Science education, and recent studies have identified both benefits and potential concerns when students use it to learn programming. However, most existing work focuses on GenAI tools that primarily support text-to-text interaction. Recently, GenAI applications have begun supporting multiple modes of communication, known as multimodality. In this work, we explored how undergraduate programming novices choose and work with multimodal GenAI tools, and the criteria behind their choices. We selected a commercially available multimodal GenAI platform because it supports multiple input and output modalities, including text, audio, image upload, and real-time screen-sharing. Through 16 think-aloud sessions that combined participant observation with follow-up semi-structured interviews, we investigated students' modality choices when completing programming problems and the criteria underlying those choices. With multimodal communication emerging as the future of AI in education, this work aims to spark continued exploration of how students interact with multimodal GenAI in the context of CS education.
Similar Papers
Enhancing Higher Education with Generative AI: A Multimodal Approach for Personalised Learning
Human-Computer Interaction
Helps students learn and teachers grade faster.
GenAI Voice Mode in Programming Education
Computers and Society
Helps students with disabilities learn to code.
Examining the Usage of Generative AI Models in Student Learning Activities for Software Programming
Software Engineering
Helps students learn better with AI, not just copy.