
SaD: A Scenario-Aware Discriminator for Speech Enhancement

Published: August 30, 2025 | arXiv ID: 2509.00405v1

By: Xihao Yuan, Siqi Liu, Yan Chen and more

BigTech Affiliations: Huawei

Potential Business Impact:

Makes noisy voices sound clear in any place.

Business Areas:
Speech Recognition Data and Analytics, Software

Generative adversarial network (GAN)-based models have shown strong performance in speech enhancement. However, current optimization strategies for these models focus mainly on refining the generator architecture or improving the quality evaluation metrics used by the discriminator, and they often overlook the rich contextual information inherent in diverse scenarios. In this paper, we propose a scenario-aware discriminator (SaD) that captures scene-specific features and performs frequency-domain division, enabling a more accurate quality assessment of the enhanced speech produced by the generator. We conducted experiments on three representative models using two publicly available datasets. The results show that our method adapts to various generator architectures without altering their structure, yielding further performance gains in speech enhancement across different scenarios.
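To make the two ideas named in the abstract more concrete (frequency-domain division and scene-specific conditioning), here is a minimal NumPy sketch. It is not the paper's architecture, which this summary does not describe. The function names `split_bands`, `init_params`, and `scenario_aware_score`, the per-band feature (mean log-magnitude), and the softmax weighting by a scene embedding are all assumptions chosen for illustration.

```python
import numpy as np


def split_bands(spec, n_bands):
    """Split a (freq_bins, frames) magnitude spectrogram into contiguous
    frequency sub-bands along the frequency axis (hypothetical helper)."""
    return np.array_split(spec, n_bands, axis=0)


def init_params(n_scenes, n_bands, seed=0):
    """Random, untrained parameters; a real discriminator would learn these."""
    rng = np.random.default_rng(seed)
    return {
        "n_bands": n_bands,
        "scene_emb": rng.normal(size=(n_scenes, n_bands)),  # one vector per scene
        "scale": 1.0,
        "bias": 0.0,
    }


def scenario_aware_score(spec, scene_id, params):
    """Toy quality score in (0, 1): per-band features are weighted by a
    scene-dependent softmax, so each scene emphasizes different bands."""
    bands = split_bands(spec, params["n_bands"])
    # One scalar feature per band: mean log-magnitude (an assumed feature).
    feats = np.array([np.log1p(b).mean() for b in bands])
    # Scene embedding -> softmax weights over bands.
    scene_vec = params["scene_emb"][scene_id]
    weights = np.exp(scene_vec - scene_vec.max())
    weights = weights / weights.sum()
    logit = params["scale"] * float(np.dot(weights, feats)) + params["bias"]
    return 1.0 / (1.0 + np.exp(-logit))


# Usage: score a random "enhanced" spectrogram under one scene label.
rng = np.random.default_rng(1)
spec = np.abs(rng.normal(size=(257, 100)))  # 257 frequency bins, 100 frames
params = init_params(n_scenes=3, n_bands=4)
score = scenario_aware_score(spec, scene_id=0, params=params)
```

In an adversarial setup, a learned version of such a score would stand in for the discriminator's output, which is consistent with the abstract's claim that the generator architecture itself does not need to change.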

Country of Origin
🇨🇳 China

Page Count
5 pages

Category
Computer Science: Sound