Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models
By: Ramakrishna Appicharla , Baban Gain , Santanu Pal and more
Potential Business Impact:
Makes computer translations understand more context.
Despite the popularity of the large language models (LLMs), their application to machine translation is relatively underexplored, especially in context-aware settings. This work presents a literature review of context-aware translation with LLMs. The existing works utilise prompting and fine-tuning approaches, with few focusing on automatic post-editing and creating translation agents for context-aware machine translation. We observed that the commercial LLMs (such as ChatGPT and Tower LLM) achieved better results than the open-source LLMs (such as Llama and Bloom LLMs), and prompt-based approaches serve as good baselines to assess the quality of translations. Finally, we present some interesting future directions to explore.
Similar Papers
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation
Computation and Language
Helps computers translate rare languages better.
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Computation and Language
Translates whole books, not just sentences.
Testing the Limits of Machine Translation from One Book
Computation and Language
Helps computers translate rare languages better.