SCDF: A Speaker Characteristics DeepFake Speech Dataset for Bias Analysis
By: Vojtěch Staněk, Karel Srna, Anton Firc and more
Potential Business Impact:
Helps check whether fake-voice detectors work equally well for all speakers.
Despite growing attention to deepfake speech detection, the aspects of bias and fairness remain underexplored in the speech domain. To address this gap, we introduce the Speaker Characteristics Deepfake (SCDF) dataset: a novel, richly annotated resource enabling systematic evaluation of demographic biases in deepfake speech detection. SCDF contains over 237,000 utterances in a balanced representation of both male and female speakers spanning five languages and a wide age range. We evaluate several state-of-the-art detectors and show that speaker characteristics significantly influence detection performance, revealing disparities across sex, language, age, and synthesizer type. These findings highlight the need for bias-aware development and provide a foundation for building non-discriminatory deepfake detection systems aligned with ethical and regulatory standards.
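To make the idea of a per-group disparity concrete, the following is a minimal sketch of computing an equal error rate (EER) separately for each speaker group. It is not the authors' evaluation code: the record fields (`score`, `label`, `sex`), the helper names, and the toy scores are all hypothetical and chosen only for illustration. It assumes a higher score means "more likely deepfake".

```python
from collections import defaultdict


def equal_error_rate(scores, labels):
    """Approximate EER by sweeping thresholds over the observed scores.

    Assumption: label 1 = deepfake, label 0 = bona fide, and a higher
    score means the detector considers the utterance more likely fake.
    """
    fakes = [s for s, y in zip(scores, labels) if y == 1]
    reals = [s for s, y in zip(scores, labels) if y == 0]
    if not fakes or not reals:
        raise ValueError("each group needs both deepfake and bona fide samples")

    best_gap, eer = None, None
    # Candidate thresholds: every observed score plus one above the maximum.
    candidates = sorted(set(scores)) + [max(scores) + 1.0]
    for t in candidates:
        miss_rate = sum(s < t for s in fakes) / len(fakes)      # deepfakes scored below t
        false_alarm = sum(s >= t for s in reals) / len(reals)   # bona fide scored at or above t
        gap = abs(miss_rate - false_alarm)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (miss_rate + false_alarm) / 2
    return eer


def eer_by_group(records, key):
    """Group records by a speaker attribute and compute the EER per group."""
    groups = defaultdict(lambda: ([], []))
    for r in records:
        scores, labels = groups[r[key]]
        scores.append(r["score"])
        labels.append(r["label"])
    return {g: equal_error_rate(s, l) for g, (s, l) in groups.items()}


# Made-up toy scores, NOT taken from SCDF or any real detector.
records = [
    {"sex": "female", "score": 0.9, "label": 1},
    {"sex": "female", "score": 0.8, "label": 1},
    {"sex": "female", "score": 0.2, "label": 0},
    {"sex": "female", "score": 0.1, "label": 0},
    {"sex": "male", "score": 0.9, "label": 1},
    {"sex": "male", "score": 0.3, "label": 1},
    {"sex": "male", "score": 0.4, "label": 0},
    {"sex": "male", "score": 0.1, "label": 0},
]

per_group = eer_by_group(records, "sex")
print(per_group)
```

The same helper could be called with a different key (for example a hypothetical `language` or `age_band` field) to compare groups along the other characteristics the abstract mentions; a large gap between group EERs is the kind of disparity such an analysis would flag.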
Similar Papers
SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms
Machine Learning (CS)
Finds fake videos and voices online.
Descriptor: Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR)
Audio and Speech Processing
Makes fake voices sound more real.
Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models
Sound
Tests tools that find fake voices.