Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks and CLIP: Application to 8th ABAW Challenge
By: Josep Cabacas-Maso , Elena Ortega-Beltrán , Ismael Benito-Altamirano and more
Potential Business Impact:
Helps computers understand emotions and faces better.
We present our contribution to the 8th ABAW challenge at CVPR 2025, where we tackle valence-arousal estimation, emotion recognition, and facial action unit detection as three independent challenges. Our approach leverages the well-known Dual-Direction Attention Mixed Feature Network (DDAMFN) for all three tasks, achieving results that surpass the proposed baselines. Additionally, we explore the use of CLIP for the emotion recognition challenge as an additional experiment. We provide insights into the architectural choices that contribute to the strong performance of our methods.
Similar Papers
Emotion Recognition with CLIP and Sequential Learning
CV and Pattern Recognition
Helps computers understand your feelings better.
Feature-Based Dual Visual Feature Extraction Model for Compound Multimodal Emotion Recognition
CV and Pattern Recognition
Helps computers understand emotions from faces and voices.
Bimodal Connection Attention Fusion for Speech Emotion Recognition
Sound
Helps computers understand feelings from voices and words.