InsideOut: An EfficientNetV2-S Based Deep Learning Framework for Robust Multi-Class Facial Emotion Recognition
By: Ahsan Farabi , Israt Khandaker , Ibrahim Khalil Shanto and more
Potential Business Impact:
Helps computers understand emotions better, even when faces are hidden.
Facial Emotion Recognition (FER) is a key task in affective computing, enabling applications in human-computer interaction, e-learning, healthcare, and safety systems. Despite advances in deep learning, FER remains challenging due to occlusions, illumination and pose variations, subtle intra-class differences, and dataset imbalance that hinders recognition of minority emotions. We present InsideOut, a reproducible FER framework built on EfficientNetV2-S with transfer learning, strong data augmentation, and imbalance-aware optimization. The approach standardizes FER2013 images, applies stratified splitting and augmentation, and fine-tunes a lightweight classification head with class-weighted loss to address skewed distributions. InsideOut achieves 62.8% accuracy with a macro averaged F1 of 0.590 on FER2013, showing competitive results compared to conventional CNN baselines. The novelty lies in demonstrating that efficient architectures, combined with tailored imbalance handling, can provide practical, transparent, and reproducible FER solutions.
Similar Papers
Multi-modal Transfer Learning for Dynamic Facial Emotion Recognition in the Wild
CV and Pattern Recognition
Helps computers understand emotions from faces better.
Evaluating Open-Source Vision Language Models for Facial Emotion Recognition against Traditional Deep Learning Models
CV and Pattern Recognition
Makes computers understand emotions from blurry pictures.
ExpressNet-MoE: A Hybrid Deep Neural Network for Emotion Recognition
CV and Pattern Recognition
Helps computers understand your feelings better.