Score: 1

Noise Supervised Contrastive Learning and Feature-Perturbed for Anomalous Sound Detection

Published: September 17, 2025 | arXiv ID: 2509.13853v2

By: Shun Huang, Zhihua Fang, Liang He

Potential Business Impact:

Finds weird noises without being fooled.

Business Areas:
Speech Recognition Data and Analytics, Software

Unsupervised anomalous sound detection aims to detect unknown anomalous sounds by training a model using only normal audio data. Despite advancements in self-supervised methods, the issue of frequent false alarms when handling samples of the same type from different machines remains unresolved. This paper introduces a novel training technique called one-stage supervised contrastive learning (OS-SCL), which significantly addresses this problem by perturbing features in the embedding space and employing a one-stage noisy supervised contrastive learning approach. On the DCASE 2020 Challenge Task 2, it achieved 94.64\% AUC, 88.42\% pAUC, and 89.24\% mAUC using only Log-Mel features. Additionally, a time-frequency feature named TFgram is proposed, which is extracted from raw audio. This feature effectively captures critical information for anomalous sound detection, ultimately achieving 95.71\% AUC, 90.23\% pAUC, and 91.23\% mAUC. The source code is available at: \underline{www.github.com/huangswt/OS-SCL}.

Country of Origin
🇨🇳 China

Repos / Data Links

Page Count
6 pages

Category
Computer Science:
Sound