Multilingual and Multi-Accent Jailbreaking of Audio LLMs
By: Jaechul Roh, Virat Shejwalkar, Amir Houmansadr
Potential Business Impact:
Shows how attackers can trick audio AI with harmful requests spoken in many languages and accents.
Large Audio Language Models (LALMs) have significantly advanced audio understanding but introduce critical security risks, particularly through audio jailbreaks. While prior work has focused on English-centric attacks, we expose a far more severe vulnerability: adversarial multilingual and multi-accent audio jailbreaks, where linguistic and acoustic variations dramatically amplify attack success. In this paper, we introduce Multi-AudioJail, the first systematic framework to exploit these vulnerabilities through (1) a novel dataset of adversarially perturbed multilingual/multi-accent audio jailbreak prompts, and (2) a hierarchical evaluation pipeline revealing how acoustic perturbations (e.g., reverberation, echo, and whisper effects) interact with cross-lingual phonetics to cause jailbreak success rates (JSRs) to surge by up to +57.25 percentage points (e.g., a reverberated Kenyan-accented attack on MERaLiON). Crucially, our work further reveals that multimodal LLMs are inherently more vulnerable than unimodal systems: attackers need only exploit the weakest link (e.g., non-English audio inputs) to compromise the entire model, which we demonstrate empirically with multilingual audio-only attacks achieving 3.1x higher success rates than text-only attacks. We plan to release our dataset to spur research into cross-modal defenses, urging the community to address this expanding attack surface in multimodality as LALMs evolve.
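The abstract refers to acoustic perturbations (reverberation, echo) and to a jailbreak success rate (JSR) reported in percentage points. As a rough illustration only, the Python sketch below shows one common way such perturbations can be applied to a raw waveform and how a JSR delta could be computed. The function names, parameter values, and 16 kHz sample rate are assumptions made for this sketch, not the paper's actual Multi-AudioJail implementation.

    import numpy as np

    SAMPLE_RATE = 16_000  # assumed sample rate; the abstract does not specify one

    def add_reverb(wave, decay=0.5, ir_seconds=0.3):
        # Convolve with a synthetic exponentially decaying impulse response,
        # a common stand-in for room reverberation.
        n = int(ir_seconds * SAMPLE_RATE)
        impulse = decay ** (np.arange(n) / (0.05 * SAMPLE_RATE))
        out = np.convolve(wave, impulse)[: len(wave)]
        return out / (np.max(np.abs(out)) + 1e-9)  # renormalize amplitude

    def add_echo(wave, delay_seconds=0.25, gain=0.6):
        # Mix in a single delayed, attenuated copy of the signal.
        d = int(delay_seconds * SAMPLE_RATE)  # assumes a nonzero delay
        out = wave.astype(float)
        out[d:] += gain * wave[:-d]
        return out / (np.max(np.abs(out)) + 1e-9)

    def jailbreak_success_rate(judge_verdicts):
        # JSR as a percentage: the fraction of prompts a judge marks as
        # successfully eliciting harmful compliance from the model.
        return 100.0 * sum(judge_verdicts) / len(judge_verdicts)

    # A "+57.25 percentage point surge" then corresponds to:
    # jailbreak_success_rate(perturbed) - jailbreak_success_rate(clean) == 57.25

In a pipeline like the one described, each perturbed multilingual prompt would be fed to the target LALM and the response judged for harmful compliance, with the JSR compared between clean and perturbed audio.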
Similar Papers
Beyond Text: Multimodal Jailbreaking of Vision-Language and Audio Models through Perceptually Simple Transformations
Cryptography and Security
Tricks AI into producing harmful content using simple image and audio changes.
Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models
Cryptography and Security
Shows a universal attack that jailbreaks multimodal AI models.
StyleBreak: Revealing Alignment Vulnerabilities in Large Audio-Language Models via Style-Aware Audio Jailbreak
Sound
Jailbreaks audio AI models by changing the speaking style.