Score: 0

Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms

Published: March 25, 2025 | arXiv ID: 2503.19677v1

By: Niketa Penumajji

Potential Business Impact:

Helps computers understand how you feel from your voice.

Business Areas:

Speech Recognition Data and Analytics, Software

This paper explores the application of Convolutional Neural Networks CNNs for classifying emotions in speech through Mel Spectrogram representations of audio files. Traditional methods such as Gaussian Mixture Models and Hidden Markov Models have proven insufficient for practical deployment, prompting a shift towards deep learning techniques. By transforming audio data into a visual format, the CNN model autonomously learns to identify intricate patterns, enhancing classification accuracy. The developed model is integrated into a user-friendly graphical interface, facilitating realtime predictions and potential applications in educational environments. The study aims to advance the understanding of deep learning in speech emotion recognition, assess the models feasibility, and contribute to the integration of technology in learning contexts

Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition

Audio and Speech Processing

Helps computers understand your feelings from your voice.

26 Aug 2025 1

89%

Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks

Sound

Helps computers better understand sounds like music.

9 Sep 2025 0

89%

Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks

Sound

Helps computers better hear and sort sounds.

9 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

5 pages

Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms

Helps computers understand how you feel from your voice.

Technical Abstract

Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition

Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks

Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks