Score: 2

Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods

Published: September 17, 2025 | arXiv ID: 2509.13785v1

By: Bingshen Mu , Pengcheng Guo , Zhaokai Sun and more

Potential Business Impact:

Teaches computers to talk in many languages.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

This paper summarizes the Interspeech2025 Multilingual Conversational Speech Language Model (MLC-SLM) challenge, which aims to advance the exploration of building effective multilingual conversational speech LLMs (SLLMs). We provide a detailed description of the task settings for the MLC-SLM challenge, the released real-world multilingual conversational speech dataset totaling approximately 1,604 hours, and the baseline systems for participants. The MLC-SLM challenge attracts 78 teams from 13 countries to participate, with 489 valid leaderboard results and 14 technical reports for the two tasks. We distill valuable insights on building multilingual conversational SLLMs based on submissions from participants, aiming to contribute to the advancement of the community.


Page Count
5 pages

Category
Electrical Engineering and Systems Science:
Audio and Speech Processing