Score: 1

Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains

Published: October 9, 2025 | arXiv ID: 2510.07877v1

By: Md. Faiyaz Abdullah Sayeedi , Md. Mahbub Alam , Subhey Sadi Rahman and more

Potential Business Impact:

Checks if AI translators are fair and good.

Business Areas:

Translation Service Professional Services

The rise of Large Language Models (LLMs) has redefined Machine Translation (MT), enabling context-aware and fluent translations across hundreds of languages and textual domains. Despite their remarkable capabilities, LLMs often exhibit uneven performance across language families and specialized domains. Moreover, recent evidence reveals that these models can encode and amplify different biases present in their training data, posing serious concerns for fairness, especially in low-resource languages. To address these gaps, we introduce Translation Tangles, a unified framework and dataset for evaluating the translation quality and fairness of open-source LLMs. Our approach benchmarks 24 bidirectional language pairs across multiple domains using different metrics. We further propose a hybrid bias detection pipeline that integrates rule-based heuristics, semantic similarity filtering, and LLM-based validation. We also introduce a high-quality, bias-annotated dataset based on human evaluations of 1,439 translation-reference pairs. The code and dataset are accessible on GitHub: https://github.com/faiyazabdullah/TranslationTangles

Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation

Computation and Language

Helps computers translate rare languages better.

2 Apr 2025 0

91%

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Computation and Language

Translates 60 languages better, even Chinese.

10 Nov 2025 2

90%

Testing the Limits of Machine Translation from One Book

Computation and Language

Helps computers translate rare languages better.

8 Aug 2025 1

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

24 pages

Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains

Checks if AI translators are fair and good.

Technical Abstract

Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Testing the Limits of Machine Translation from One Book