Score: 2

Infant Cry Detection In Noisy Environment Using Blueprint Separable Convolutions and Time-Frequency Recurrent Neural Network

Published: August 26, 2025 | arXiv ID: 2508.19308v1

By: Haolin Yu, Yanxiong Li

Potential Business Impact:

Helps machines tell if a baby is crying.

Business Areas:

Baby Community and Lifestyle

Infant cry detection is a crucial component of baby care system. In this paper, we propose a lightweight and robust method for infant cry detection. The method leverages blueprint separable convolutions to reduce computational complexity, and a time-frequency recurrent neural network for adaptive denoising. The overall framework of the method is structured as a multi-scale convolutional recurrent neural network, which is enhanced by efficient spatial attention mechanism and contrast-aware channel attention module, and acquire local and global information from the input feature of log Mel-spectrogram. Multiple public datasets are adopted to create a diverse and representative dataset, and environmental corruption techniques are used to generate the noisy samples encountered in real-world scenarios. Results show that our method exceeds many state-of-the-art methods in accuracy, F1-score, and complexity under various signal-to-noise ratio conditions. The code is at https://github.com/fhfjsd1/ICD_MMSP.

Infant Cry Detection Using Causal Temporal Representation

Sound

Helps machines hear baby cries in noisy places.

8 Mar 2025 2

85%

Making deep neural networks work for medical audio: representation, compression and domain adaptation

Sound

Helps doctors hear sickness in baby cries.

24 May 2025 1

85%

Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks

Sound

Helps computers hear singing pitch better.

8 Apr 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

6 pages

Infant Cry Detection In Noisy Environment Using Blueprint Separable Convolutions and Time-Frequency Recurrent Neural Network

Helps machines tell if a baby is crying.

Technical Abstract

Infant Cry Detection Using Causal Temporal Representation

Making deep neural networks work for medical audio: representation, compression and domain adaptation

Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks