Ara-HOPE: Human-Centric Post-Editing Evaluation for Dialectal Arabic to Modern Standard Arabic Translation
By: Abdullah Alabdullah, Lifeng Han, Chenghua Lin
Potential Business Impact:
Helps evaluate machine translations of Arabic dialects into standard Arabic.
Dialectal Arabic to Modern Standard Arabic (DA-MSA) translation is a challenging task in Machine Translation (MT) due to significant lexical, syntactic, and semantic divergences between Arabic dialects and MSA. Existing automatic evaluation metrics and general-purpose human evaluation frameworks struggle to capture dialect-specific MT errors, hindering progress in translation assessment. This paper introduces Ara-HOPE, a human-centric post-editing evaluation framework designed to systematically address these challenges. The framework includes a five-category error taxonomy and a decision-tree annotation protocol. In a comparative evaluation of three MT systems (Arabic-centric Jais, general-purpose GPT-3.5, and baseline NLLB-200), Ara-HOPE reveals systematic performance differences among them. The results show that dialect-specific terminology and semantic preservation remain the most persistent challenges in DA-MSA translation. Ara-HOPE establishes a new framework for evaluating Dialectal Arabic MT quality and provides actionable guidance for improving dialect-aware MT systems.
Similar Papers
Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation
Computation and Language
Helps computers translate everyday Arabic speech.
Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation
Computation and Language
Translates spoken Arabic into written Arabic.
AHaSIS: Shared Task on Sentiment Analysis for Arabic Dialects
Computation and Language
Helps understand customer feelings in Arabic hotel reviews.