Different Speech Translation Models Encode and Translate Speaker Gender Differently

Published: June 2, 2025 | arXiv ID: 2506.02172v1

By: Dennis Fucci, Marco Gaido, Matteo Negri and more

Potential Business Impact:

Traditional speech translation models encode speaker gender, but some newer ones don't.

Business Areas:

Translation Service, Professional Services

Recent studies on interpreting the hidden states of speech models have shown their ability to capture speaker-specific features, including gender. Does this finding also hold for speech translation (ST) models? If so, what are the implications for the speaker's gender assignment in translation? We address these questions from an interpretability perspective, using probing methods to assess gender encoding across diverse ST models. Results on three language directions (English-French/Italian/Spanish) indicate that while traditional encoder-decoder models capture gender information, newer architectures -- integrating a speech encoder with a machine translation system via adapters -- do not. We also demonstrate that low gender encoding capabilities result in systems' tendency toward a masculine default, a translation bias that is more pronounced in newer architectures.

Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation

Computation and Language

Translates speech, guessing gender from sound, not just pitch.

26 Nov 2025

90%

Addressing speaker gender bias in large scale speech translation systems

Computation and Language

Fixes translation mistakes for female speakers.

10 Jan 2025

88%

Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation

Sound

Translates talking while knowing who speaks.

4 Feb 2025

Page Count

15 pages
