Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis
By: Ahan Prasannakumar Shetty
Potential Business Impact:
Makes English and Hindi speak to each other better.
Machine translation has become a critical tool for bridging linguistic gaps, especially between languages as different as English and Hindi. This paper evaluates several machine translation models for translation between English and Hindi. We assess these models with a diverse set of automatic evaluation metrics, both lexical and machine-learning-based. Our evaluation uses an English-Hindi parallel corpus of 18,000+ entries and a custom FAQ dataset of questions drawn from government websites. The study aims to show how effectively different machine translation approaches handle both general and specialized language domains. Results vary across metrics, highlighting strengths and areas for improvement in current translation systems.
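To make the notion of a lexical metric concrete, here is a minimal sketch of clipped n-gram precision, the word-overlap idea underlying metrics such as BLEU. The abstract does not name the specific metrics used, so this is an illustration only and not the paper's evaluation code; the function names and the example sentences are hypothetical, and tokenization is plain whitespace splitting.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(hypothesis, reference, n=1):
    """Fraction of hypothesis n-grams that also occur in the reference.

    Each n-gram is credited at most as often as it appears in the reference.
    Assumption: whitespace tokenization, a single reference translation.
    """
    hyp = ngrams(hypothesis.split(), n)
    ref = ngrams(reference.split(), n)
    total = sum(hyp.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref[gram]) for gram, count in hyp.items())
    return matched / total


# Hypothetical example pair (not taken from the paper's datasets).
reference = "मुझे हिंदी पसंद है"
hypothesis = "मुझे हिंदी बहुत पसंद है"

unigram = clipped_precision(hypothesis, reference, n=1)
bigram = clipped_precision(hypothesis, reference, n=2)
print(f"unigram precision: {unigram:.2f}, bigram precision: {bigram:.2f}")
```

A score like this rewards only exact surface matches, which is one reason evaluations often pair lexical metrics with learned ones that can credit paraphrases.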
Similar Papers
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
Computation and Language
Tests Hindi AI to understand language better.
HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples
Computation and Language
Helps AI understand Indian languages and pictures better.
Topic Modeling in Marathi
Computation and Language
Helps computers understand Indian languages better.