Score: 1

Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems

Published: December 19, 2025 | arXiv ID: 2512.17648v1

By: Marco Gaido , Sara Papi , Mauro Cettolo and more

Potential Business Impact:

Lets computers translate talking as it happens.

Business Areas:

Simulation Software

Streaming Speech-to-Text Translation (StreamST) requires producing translations concurrently with incoming speech, imposing strict latency constraints and demanding models that balance partial-information decision-making with high translation quality. Research efforts on the topic have so far relied on the SimulEval repository, which is no longer maintained and does not support systems that revise their outputs. In addition, it has been designed for simulating the processing of short segments, rather than long-form audio streams, and it does not provide an easy method to showcase systems in a demo. As a solution, we introduce simulstream, the first open-source framework dedicated to unified evaluation and demonstration of StreamST systems. Designed for long-form speech processing, it supports not only incremental decoding approaches, but also re-translation methods, enabling for their comparison within the same framework both in terms of quality and latency. In addition, it also offers an interactive web interface to demo any system built within the tool.

Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture

Computation and Language

Translates talking instantly, faster and smarter.

16 Apr 2025 2

87%

SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation

Computation and Language

Translates talking instantly, like a real-time interpreter.

22 Apr 2025 1

87%

Direct Simultaneous Translation Activation for Large Audio-Language Models

Sound

Translates talking instantly, even mid-sentence.

19 Sep 2025 0

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

17 pages

Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems

Lets computers translate talking as it happens.

Technical Abstract

Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture

SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation

Direct Simultaneous Translation Activation for Large Audio-Language Models