Score: 1

Evaluating Identity Leakage in Speaker De-Identification Systems

Published: August 19, 2025 | arXiv ID: 2508.14012v1

By: Seungmin Seo , Oleg Aulov , Afzal Godil and more

Potential Business Impact:

Makes voices sound like someone else.

Business Areas:
Identity Management Information Technology, Privacy and Security

Speaker de-identification aims to conceal a speaker's identity while preserving intelligibility of the underlying speech. We introduce a benchmark that quantifies residual identity leakage with three complementary error rates: equal error rate, cumulative match characteristic hit rate, and embedding-space similarity measured via canonical correlation analysis and Procrustes analysis. Evaluation results reveal that all state-of-the-art speaker de-identification systems leak identity information. The highest performing system in our evaluation performs only slightly better than random guessing, while the lowest performing system achieves a 45% hit rate within the top 50 candidates based on CMC. These findings highlight persistent privacy risks in current speaker de-identification technologies.

Page Count
5 pages

Category
Computer Science:
Sound