Field of View Enhanced Signal Dependent Binauralization with Mixture of Experts Framework for Continuous Source Motion
By: Manan Mittal, Thomas Deppisch, Joseph Forrer, and more
Potential Business Impact:
Focus on sounds you want, block others.
We propose a novel mixture-of-experts framework for field-of-view enhancement in binaural signal matching. Our approach enables dynamic spatial audio rendering that adapts to continuous talker motion, allowing users to emphasize or suppress sounds from selected directions while preserving natural binaural cues. Unlike traditional methods that rely on explicit direction-of-arrival estimation or operate in the Ambisonics domain, our signal-dependent framework combines multiple binaural filters online using implicit localization. This allows real-time tracking and enhancement of moving sound sources, supporting applications such as speech focus, noise reduction, and world-locked audio in augmented and virtual reality. The method is agnostic to array geometry, offering a flexible solution for spatial audio capture and personalized playback in next-generation consumer audio devices.
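The abstract does not give the gating rule or the filter design, so the following is only a rough sketch of what "combining multiple binaural filters online using implicit localization" could look like for a single frequency bin. Everything in it is an assumption made for illustration and not the authors' method: the class name `MixtureOfBinauralExperts`, the per-expert steering vectors, the matched-filter score, the softmax gate with a `temperature`, the recursive `smoothing` of the weights, and the per-expert `fov_gains` used to emphasize or suppress directions.

```python
import math


def inner(a, b):
    """Hermitian inner product a^H b of two equal-length vectors."""
    return sum(ai.conjugate() * bi for ai, bi in zip(a, b))


def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of real scores."""
    top = max(scores)
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class MixtureOfBinauralExperts:
    """Hypothetical per-bin mixture of binaural filters (illustrative only).

    Each expert k has an assumed steering vector (used only for scoring),
    a left and a right filter, and a field-of-view gain.
    """

    def __init__(self, steering, filters_left, filters_right, fov_gains,
                 smoothing=0.8, temperature=0.1):
        self.steering = steering
        self.filters_left = filters_left
        self.filters_right = filters_right
        self.fov_gains = fov_gains
        self.smoothing = smoothing        # assumed recursive averaging factor
        self.temperature = temperature    # assumed softmax sharpness
        self.weights = [1.0 / len(steering)] * len(steering)

    def process(self, x):
        """Process one microphone frame x (one complex value per microphone)."""
        energy = inner(x, x).real + 1e-12
        # Implicit localization (assumed form): score each expert by the
        # normalized matched-filter energy, with no explicit DOA estimate.
        scores = [abs(inner(d, x)) ** 2 / (inner(d, d).real * energy)
                  for d in self.steering]
        instantaneous = softmax(scores, self.temperature)
        # Online update: smooth the gate over frames so that the weights
        # follow a moving source without jumping between experts.
        a = self.smoothing
        self.weights = [a * w + (1.0 - a) * p
                        for w, p in zip(self.weights, instantaneous)]
        # Weighted combination of the expert outputs, scaled per expert by
        # the field-of-view gain (1 keeps a direction, 0 suppresses it).
        left = sum(w * g * inner(h, x) for w, g, h in
                   zip(self.weights, self.fov_gains, self.filters_left))
        right = sum(w * g * inner(h, x) for w, g, h in
                    zip(self.weights, self.fov_gains, self.filters_right))
        return left, right


# Toy usage: two microphones, two experts, second direction suppressed.
demo = MixtureOfBinauralExperts(
    steering=[[1 + 0j, 1 + 0j], [1 + 0j, -1 + 0j]],
    filters_left=[[0.5, 0.5], [0.5, -0.5]],
    filters_right=[[0.5, 0.5], [0.5, -0.5]],
    fov_gains=[1.0, 0.0],
)
for _ in range(10):
    demo_out = demo.process([1 + 0j, 1 + 0j])
```

In this toy setup the gate weights drift toward whichever expert best explains the incoming frames, which is one simple way to track a moving talker without estimating its direction explicitly. A real system would presumably run such a combination per frequency bin with filters designed for the actual array and head-related transfer functions.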
Similar Papers
Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers
Sound
Focus on sounds, block out noise.
Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers
Sound
Focus on one voice in noisy places.
FoleySpace: Vision-Aligned Binaural Spatial Audio Generation
Sound
Makes videos sound like you're really there.