Infant Cry Detection In Noisy Environment Using Blueprint Separable Convolutions and Time-Frequency Recurrent Neural Network
By: Haolin Yu, Yanxiong Li
Potential Business Impact:
Helps machines tell if a baby is crying.
Infant cry detection is a crucial component of baby care system. In this paper, we propose a lightweight and robust method for infant cry detection. The method leverages blueprint separable convolutions to reduce computational complexity, and a time-frequency recurrent neural network for adaptive denoising. The overall framework of the method is structured as a multi-scale convolutional recurrent neural network, which is enhanced by efficient spatial attention mechanism and contrast-aware channel attention module, and acquire local and global information from the input feature of log Mel-spectrogram. Multiple public datasets are adopted to create a diverse and representative dataset, and environmental corruption techniques are used to generate the noisy samples encountered in real-world scenarios. Results show that our method exceeds many state-of-the-art methods in accuracy, F1-score, and complexity under various signal-to-noise ratio conditions. The code is at https://github.com/fhfjsd1/ICD_MMSP.
Similar Papers
Infant Cry Detection Using Causal Temporal Representation
Sound
Helps machines hear baby cries in noisy places.
Making deep neural networks work for medical audio: representation, compression and domain adaptation
Sound
Helps doctors hear sickness in baby cries.
Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks
Sound
Helps computers hear singing pitch better.