A Multimodal Dataset of Student Oral Presentations with Sensors and Evaluation Data

Published: January 12, 2026 | arXiv ID: 2601.07576v1

By: Alvaro Becerra, Ruth Cobos, Roberto Daza

Oral presentation skills are a critical component of higher education, yet comprehensive datasets capturing real-world student performance across multiple modalities remain scarce. To address this gap, we present SOPHIAS (Student Oral Presentation monitoring for Holistic Insights & Analytics using Sensors), a 12-hour multimodal dataset containing recordings of 50 oral presentations (each a 10-15-minute presentation followed by a 5-15-minute Q&A session) delivered by 65 undergraduate and master's students at the Universidad Autónoma de Madrid. SOPHIAS integrates eight synchronized sensor streams from high-definition webcams, ambient and webcam audio, eye-tracking glasses, smartwatch physiological sensors, and clicker, keyboard, and mouse interactions. In addition, the dataset includes the presentation slides and rubric-based teacher, peer, and self-assessments, along with timestamped contextual annotations. The presentations were recorded in real classroom settings, preserving authentic student behaviors, interactions, and physiological responses. SOPHIAS enables the exploration of relationships between multimodal behavioral and physiological signals and presentation performance, supports the study of peer assessment, and provides a benchmark for developing automated feedback and Multimodal Learning Analytics tools. The dataset is publicly available for research through GitHub and Science Data Bank.
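
Because the dataset's value lies in its synchronized streams, a typical first analysis step is aligning the modalities on a shared timeline. The sketch below is a minimal illustration of that idea, assuming hypothetical CSV exports (the file names, column names, and directory layout are not specified in the abstract and should be checked against the dataset's GitHub documentation); it pairs each eye-tracking sample with the most recent smartwatch reading using a timestamp-based merge.

```python
# Minimal sketch: aligning two hypothetical SOPHIAS sensor streams by timestamp.
# File and column names below are assumptions for illustration only; consult the
# dataset's documentation on GitHub / Science Data Bank for the actual layout.
import pandas as pd

# Hypothetical CSV exports: smartwatch physiological data and eye-tracking gaze samples.
hr = pd.read_csv("presentation_01/smartwatch_heart_rate.csv", parse_dates=["timestamp"])
gaze = pd.read_csv("presentation_01/eyetracker_gaze.csv", parse_dates=["timestamp"])

# merge_asof requires both frames to be sorted by the merge key.
hr = hr.sort_values("timestamp")
gaze = gaze.sort_values("timestamp")

# Attach to each gaze sample the most recent heart-rate reading,
# tolerating at most one second of offset between the two streams.
aligned = pd.merge_asof(
    gaze, hr, on="timestamp", direction="backward",
    tolerance=pd.Timedelta("1s"),
)

print(aligned.head())
```

A backward-looking merge with a small tolerance is one reasonable choice here because physiological signals are sampled far less frequently than gaze data; other alignment strategies (resampling to a common rate, interpolation) may suit different analyses.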

Category
Computer Science:
Human-Computer Interaction