Score: 0

An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models

Published: August 21, 2025 | arXiv ID: 2508.15334v1

By: Guirui Zhong , Qing Wang , Jun Du and more

Potential Business Impact:

Finds broken machine sounds better.

Business Areas:
Speech Recognition Data and Analytics, Software

Anomalous Sound Detection (ASD) aims at identifying anomalous sounds from machines and has gained extensive research interests from both academia and industry. However, the uncertainty of anomaly location and much redundant information such as noise in machine sounds hinder the improvement of ASD system performance. This paper proposes a novel audio feature of filter banks with evenly distributed intervals, ensuring equal attention to all frequency ranges in the audio, which enhances the detection of anomalies in machine sounds. Moreover, based on pre-trained models, this paper presents a parameter-free feature enhancement approach to remove redundant information in machine audio. It is believed that this parameter-free strategy facilitates the effective transfer of universal knowledge from pre-trained tasks to the ASD task during model fine-tuning. Evaluation results on the Detection and Classification of Acoustic Scenes and Events (DCASE) 2024 Challenge dataset demonstrate significant improvements in ASD performance with our proposed methods.

Country of Origin
🇨🇳 China

Page Count
13 pages

Category
Computer Science:
Sound