Evaluating Large Language Models on Urdu Idiom Translation
By: Muhammad Farmal Khan, Mousumi Akter
Potential Business Impact:
Helps computers translate tricky sayings from Urdu.
Idiomatic translation remains a significant challenge in machine translation, especially for low resource languages such as Urdu, and has received limited prior attention. To advance research in this area, we introduce the first evaluation datasets for Urdu to English idiomatic translation, covering both Native Urdu and Roman Urdu scripts and annotated with gold-standard English equivalents. We evaluate multiple open-source Large Language Models (LLMs) and Neural Machine Translation (NMT) systems on this task, focusing on their ability to preserve idiomatic and cultural meaning. Automatic metrics including BLEU, BERTScore, COMET, and XCOMET are used to assess translation quality. Our findings indicate that prompt engineering enhances idiomatic translation compared to direct translation, though performance differences among prompt types are relatively minor. Moreover, cross script comparisons reveal that text representation substantially affects translation quality, with Native Urdu inputs producing more accurate idiomatic translations than Roman Urdu.
Similar Papers
AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
Computation and Language
Finds fake writing in Urdu.
Irony Detection in Urdu Text: A Comparative Study Using Machine Learning Models and Large Language Models
Computation and Language
Helps computers understand jokes in Urdu.
Evaluating Large Language Models for Code Translation: Effects of Prompt Language and Prompt Design
Software Engineering
Helps computers rewrite code between languages.