Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data
By: Wei Zou, Sen Yang, Yu Bao, and more
Potential Business Impact:
Translates between languages without needing example sentence pairs.
The rise of Large Language Models (LLMs) has reshaped machine translation (MT), but multilingual MT still relies heavily on parallel data for supervised fine-tuning (SFT), facing challenges like data scarcity for low-resource languages and catastrophic forgetting. To address these issues, we propose TRANS-ZERO, a self-play framework that leverages only monolingual data and the intrinsic multilingual knowledge of LLMs. TRANS-ZERO combines Genetic Monte-Carlo Tree Search (G-MCTS) with preference optimization, achieving strong translation performance that rivals supervised methods. Experiments demonstrate that this approach not only matches the performance of models trained on large-scale parallel data but also excels in non-English translation directions. Further analysis reveals that G-MCTS itself significantly enhances translation quality by exploring semantically consistent candidates through iterative translations, providing a robust foundation for the framework's success.
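To make the abstract's pipeline concrete, below is a minimal, hypothetical sketch of the core loop it describes: growing a small tree of translation candidates from monolingual source text, scoring each candidate by the semantic consistency of its round-trip (back) translation, and keeping a best/worst pair for later preference optimization. The function names (`g_mcts_search`, `round_trip_reward`, `build_preference_pair`) and the toy stand-in translators are illustrative assumptions, not the paper's actual interfaces or algorithmic details.

```python
# Hypothetical sketch of the Trans-Zero idea (not the authors' code): explore
# translation candidates with an MCTS-style search over monolingual source text,
# reward candidates by round-trip semantic consistency, and keep a preference pair.
import random
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

@dataclass
class Node:
    text: str            # candidate translation in the target language
    score: float = 0.0   # semantic-consistency reward from round-trip translation
    visits: int = 0
    children: List["Node"] = field(default_factory=list)

def round_trip_reward(src: str, hyp: str,
                      back_translate: Callable[[str], str],
                      similarity: Callable[[str, str], float]) -> float:
    """Score a candidate by how well its back-translation matches the source."""
    return similarity(src, back_translate(hyp))

def g_mcts_search(src: str,
                  propose: Callable[[str, List[str]], str],
                  back_translate: Callable[[str], str],
                  similarity: Callable[[str, str], float],
                  iterations: int = 16,
                  beam: int = 4) -> List[Node]:
    """Grow a shallow candidate tree; 'genetic' here is assumed to mean that new
    candidates are proposed conditioned on the current best ones."""
    root = Node(text="")
    pool: List[Node] = [root]
    for _ in range(iterations):
        # Selection: pick a promising parent (visit-discounted score, small jitter).
        parent = max(pool, key=lambda n: n.score / (1 + n.visits) + random.random() * 1e-3)
        parent.visits += 1
        # Expansion: propose a new candidate from the source plus best siblings.
        context = [c.text for c in sorted(pool, key=lambda n: -n.score)[:beam] if c.text]
        child = Node(text=propose(src, context))
        # Evaluation: reward is the semantic consistency of the round trip.
        child.score = round_trip_reward(src, child.text, back_translate, similarity)
        parent.children.append(child)
        pool.append(child)
    return [n for n in pool if n.text]

def build_preference_pair(candidates: List[Node]) -> Tuple[str, str]:
    """Best vs. worst candidate as a (chosen, rejected) pair for preference optimization."""
    ranked = sorted(candidates, key=lambda n: n.score, reverse=True)
    return ranked[0].text, ranked[-1].text

if __name__ == "__main__":
    # Trivial stand-ins so the sketch runs without an actual LLM.
    def toy_propose(src: str, context: List[str]) -> str:
        return src[::-1] + random.choice(["", "!", "?"])   # fake "translation"
    def toy_back_translate(hyp: str) -> str:
        return hyp.rstrip("!?")[::-1]
    def toy_similarity(a: str, b: str) -> float:
        return sum(x == y for x, y in zip(a, b)) / max(len(a), len(b), 1)

    cands = g_mcts_search("hello world", toy_propose, toy_back_translate, toy_similarity)
    chosen, rejected = build_preference_pair(cands)
    print("chosen:", chosen, "| rejected:", rejected)
```

In a real setting, `propose` and `back_translate` would be calls to the same multilingual LLM and `similarity` a semantic scorer, with the resulting (chosen, rejected) pairs fed into a standard preference-optimization step.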
Similar Papers
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning
Computation and Language
Makes computer translations better without needing examples.
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Computation and Language
Translates 60 languages better, including Chinese.
Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation
Computation and Language
Helps computers learn many languages faster.