Improving the Speaker Anonymization Evaluation's Robustness to Target Speakers with Adversarial Learning
By: Carlos Franzreb, Arnab Das, Tim Polzehl and more
Potential Business Impact:
Keeps your voice private when it's changed.
The current privacy evaluation for speaker anonymization often overestimates privacy when a same-gender target selection algorithm (TSA) is used, even though this TSA leaks the speaker's gender and should therefore be more vulnerable to attack. We hypothesize that this occurs because the evaluation does not account for the fact that anonymized speech contains information from both the source and target speakers. To address this, we propose adding a target classifier to the evaluation that measures the influence of target speaker information; this information can then be removed with adversarial learning. Experiments demonstrate that this approach is effective for multiple anonymizers, particularly when a same-gender TSA is used, leading to a more reliable privacy assessment.
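The adversarial idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes one common formulation in which the evaluation's speaker encoder is trained to minimize the source-speaker classification loss while maximizing the target classifier's loss. The function names, the weight `lam`, and the use of plain cross-entropy are all assumptions made for illustration.

```python
import math


def cross_entropy(logits, label):
    """Cross-entropy of one example, via a numerically stable log-sum-exp."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def encoder_loss(source_logits, source_label, target_logits, target_label, lam=1.0):
    """Hypothetical adversarial objective for the speaker encoder.

    The encoder is rewarded for identifying the source speaker and
    penalized when the target classifier can still identify the target
    speaker. `lam` (an assumed hyperparameter) weights the adversarial term.
    """
    source_term = cross_entropy(source_logits, source_label)
    target_term = cross_entropy(target_logits, target_label)
    return source_term - lam * target_term


if __name__ == "__main__":
    # Confident, correct source prediction; target classifier is uncertain.
    print(encoder_loss([4.0, 0.0], 0, [0.0, 0.0], 1, lam=0.5))
```

In this formulation the target classifier itself would still be trained to minimize its own cross-entropy; only the encoder sees the negated term, which is the effect a gradient reversal layer produces in a neural network framework.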
Similar Papers
Target speaker anonymization in multi-speaker recordings
Audio and Speech Processing
Hides one person's voice in group talks.
Any-to-any Speaker Attribute Perturbation for Asynchronous Voice Anonymization
Sound
Makes your voice sound like someone else.
You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks
Audio and Speech Processing
Makes it harder to hide who is talking.