Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models
By: Sandipana Dowerah , Atharva Kulkarni , Ajinkya Kulkarni and more
Potential Business Impact:
Tests tools that find fake voices.
Parallel to the development of advanced deepfake audio generation, audio deepfake detection has also seen significant progress. However, a standardized and comprehensive benchmark is still missing. To address this, we introduce Speech DeepFake (DF) Arena, the first comprehensive benchmark for audio deepfake detection. Speech DF Arena provides a toolkit to uniformly evaluate detection systems, currently across 14 diverse datasets and attack scenarios, standardized evaluation metrics and protocols for reproducibility and transparency. It also includes a leaderboard to compare and rank the systems to help researchers and developers enhance their reliability and robustness. We include 14 evaluation sets, 12 state-of-the-art open-source and 3 proprietary detection systems. Our study presents many systems exhibiting high EER in out-of-domain scenarios, highlighting the need for extensive cross-domain evaluation. The leaderboard is hosted on Huggingface1 and a toolkit for reproducing results across the listed datasets is available on GitHub.
Similar Papers
AUDDT: Audio Unified Deepfake Detection Benchmark Toolkit
Audio and Speech Processing
Tests if fake voices fool computers.
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection
Multimedia
Finds fake videos and sounds better.
SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms
Machine Learning (CS)
Finds fake videos and voices online.