Dynamic Stress Detection: A Study of Temporal Progression Modelling of Stress in Speech
By: Vishakha Lall, Yisi Liu
Potential Business Impact:
Helps computers hear when people are stressed.
Detecting psychological stress from speech is critical in high-pressure settings. While prior work has leveraged acoustic features for stress detection, most treat stress as a static label. In this work, we model stress as a temporally evolving phenomenon influenced by historical emotional state. We propose a dynamic labelling strategy that derives fine-grained stress annotations from emotional labels and introduce cross-attention-based sequential models, a Unidirectional LSTM and a Transformer Encoder, to capture temporal stress progression. Our approach achieves notable accuracy gains on MuSE (+5%) and StressID (+18%) over existing baselines, and generalises well to a custom real-world dataset. These results highlight the value of modelling stress as a dynamic construct in speech.
Similar Papers
Human Feedback Driven Dynamic Speech Emotion Recognition
Sound
Makes cartoon characters show real feelings.
StressTest: Can YOUR Speech LM Handle the Stress?
Computation and Language
Helps computers understand meaning from spoken emphasis.
Cross-Modality Investigation on WESAD Stress Classification
Machine Learning (CS)
Detects stress from body signals with high accuracy.