Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers
By: Manan Mittal , Thomas Deppisch , Joseph Forrer and more
Potential Business Impact:
Focus on one voice in noisy places.
We propose a novel mixture of experts framework for field-of-view enhancement in binaural signal matching. Our approach enables dynamic spatial audio rendering that adapts to continuous talker motion, allowing users to emphasize or suppress sounds from selected directions while preserving natural binaural cues. Unlike traditional methods that rely on explicit direction-of-arrival estimation or operate in the Ambisonics domain, our signal-dependent framework combines multiple binaural filters in an online manner using implicit localization. This allows for real-time tracking and enhancement of moving sound sources, supporting applications such as speech focus, noise reduction, and world-locked audio in augmented and virtual reality. The method is agnostic to array geometry offering a flexible solution for spatial audio capture and personalized playback in next-generation consumer audio devices.
Similar Papers
Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers
Sound
Focus on sounds, block out noise.
Field of View Enhanced Signal Dependent Binauralization with Mixture of Experts Framework for Continuous Source Motion
Sound
Focus on sounds you want, block others.
In-the-wild Audio Spatialization with Flexible Text-guided Localization
Sound
Makes game sounds move with your head.