Autonomous AI Surveillance: Multimodal Deep Learning for Cognitive and Behavioral Monitoring
By: Ameer Hamza , Zuhaib Hussain But , Umar Arif and more
Potential Business Impact:
Spots students sleeping or on phones.
This study presents a novel classroom surveillance system that integrates multiple modalities, including drowsiness, tracking of mobile phone usage, and face recognition,to assess student attentiveness with enhanced precision.The system leverages the YOLOv8 model to detect both mobile phone and sleep usage,(Ghatge et al., 2024) while facial recognition is achieved through LResNet Occ FC body tracking using YOLO and MTCNN.(Durai et al., 2024) These models work in synergy to provide comprehensive, real-time monitoring, offering insights into student engagement and behavior.(S et al., 2023) The framework is trained on specialized datasets, such as the RMFD dataset for face recognition and a Roboflow dataset for mobile phone detection. The extensive evaluation of the system shows promising results. Sleep detection achieves 97. 42% mAP@50, face recognition achieves 86. 45% validation accuracy and mobile phone detection reach 85. 89% mAP@50. The system is implemented within a core PHP web application and utilizes ESP32-CAM hardware for seamless data capture.(Neto et al., 2024) This integrated approach not only enhances classroom monitoring, but also ensures automatic attendance recording via face recognition as students remain seated in the classroom, offering scalability for diverse educational environments.(Banada,2025)
Similar Papers
Modular Deep Learning Framework for Assistive Perception: Gaze, Affect, and Speaker Identification
CV and Pattern Recognition
Helps computers see and hear to understand you.
Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models
CV and Pattern Recognition
Helps cars tell if drivers are sleepy.
Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection
CV and Pattern Recognition
Helps self-driving cars see roads and signs better.