Score: 0

The Eloquence team submission for task 1 of MLC-SLM challenge

Published: July 25, 2025 | arXiv ID: 2507.19308v1

By: Lorenzo Concina , Jordi Luque , Alessio Brutti and more

Potential Business Impact:

Helps computers understand many languages spoken.

In this paper, we present our studies and experiments carried out for the task 1 of the Challenge and Workshop on Multilingual Conversational Speech Language Model (MLC-SLM), which focuses on advancing multilingual conversational speech recognition through the development of speech language models architectures. Given the increasing relevance of real-world conversational data for building robust Spoken Dialogue Systems, we explore three approaches to multilingual ASR. First, we conduct an evaluation of the official baseline to better understand its strengths and limitations, by training two projectors (linear and qformer) with different foundation models. Second we leverage the SLAM-ASR framework to train a custom multilingual linear projector. Finally we investigate the role of contrastive learning and the extended conversational context in enhancing the robustness of recognition.

Country of Origin
🇬🇧 United Kingdom

Page Count
4 pages

Category
Computer Science:
Sound