Score: 1

ClassMind: Scaling Classroom Observation and Instructional Feedback with Multimodal AI

Published: September 22, 2025 | arXiv ID: 2509.18020v1

By: Ao Qu , Yuxi Wen , Jiayi Zhang and more

BigTech Affiliations: Johns Hopkins University Massachusetts Institute of Technology

Potential Business Impact:

Helps teachers improve by analyzing class videos.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

Classroom observation -- one of the most effective methods for teacher development -- remains limited due to high costs and a shortage of expert coaches. We present ClassMind, an AI-driven classroom observation system that integrates generative AI and multimodal learning to analyze classroom artifacts (e.g., class recordings) and deliver timely, personalized feedback aligned with pedagogical practices. At its core is AVA-Align, an agent framework that analyzes long classroom video recordings to generate temporally precise, best-practice-aligned feedback to support teacher reflection and improvement. Our three-phase study involved participatory co-design with educators, development of a full-stack system, and field testing with teachers at different stages of practice. Teachers highlighted the system's usefulness, ease of use, and novelty, while also raising concerns about privacy and the role of human judgment, motivating deeper exploration of future human--AI coaching partnerships. This work illustrates how multimodal AI can scale expert coaching and advance teacher development.

Country of Origin
🇺🇸 United States

Page Count
22 pages

Category
Computer Science:
Human-Computer Interaction