Score: 1

RFOP: Rethinking Fusion and Orthogonal Projection for Face-Voice Association

Published: December 2, 2025 | arXiv ID: 2512.02860v1

By: Abdul Hannan , Furqan Malik , Hina Jabbar and more

Potential Business Impact:

Helps computers match faces to voices in different languages.

Business Areas:

Speech Recognition Data and Analytics, Software

Face-voice association in multilingual environment challenge 2026 aims to investigate the face-voice association task in multilingual scenario. The challenge introduces English-German face-voice pairs to be utilized in the evaluation phase. To this end, we revisit the fusion and orthogonal projection for face-voice association by effectively focusing on the relevant semantic information within the two modalities. Our method performs favorably on the English-German data split and ranked 3rd in the FAME 2026 challenge by achieving the EER of 33.1.