HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

Published: March 21, 2025 | arXiv ID: 2503.17141v2

By: Ekaterina Dmitrieva, Maksim Kaledin

Potential Business Impact:

Makes phone calls clearer on slow phones.

Business Areas:

Internet Radio Media and Entertainment, Music and Audio

Speech enhancement techniques have become core technologies in mobile devices and voice software. Still, modern deep learning solutions often require a large amount of computational resources, which makes them challenging to use on low-resource devices. We present HiFi-Stream, an optimized version of the recently published HiFi++ model. Our experiments demonstrate that HiFi-Stream preserves most of the quality of the original model while reducing its size and computational complexity relative to HiFi++, making it one of the smallest and fastest models available. The model is evaluated in a streaming setting, where it demonstrates superior performance in comparison to modern baselines.