Please Translate Again: Two Simple Experiments on Whether Human-Like Reasoning Helps Translation
By: Di Wu, Seth Aycock, Christof Monz
Potential Business Impact:
Computers translate better by fixing their own work.
Large Language Models (LLMs) demonstrate strong reasoning capabilities on many tasks, often by explicitly decomposing a task via Chain-of-Thought (CoT) reasoning. Recent work on LLM-based translation designs hand-crafted prompts to decompose translation, or trains models to incorporate intermediate steps. Translating Step-by-step (Briakou et al., 2024), for instance, introduces a multi-step prompting strategy that decomposes and refines translation with LLMs, achieving state-of-the-art results on the WMT24 test data. In this work, we scrutinise this strategy's effectiveness. Empirically, we find no clear evidence that performance gains stem from explicitly decomposing the translation process via CoT, at least for the models tested; and we show that simply prompting LLMs to 'translate again' and self-refine yields even better results than human-like step-by-step prompting. While the decomposition influences translation behaviour, faithfulness to the decomposition has both positive and negative effects on translation. Our analysis therefore suggests a divergence between the optimal translation strategies for humans and LLMs.
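To make the 'translate again' idea concrete, here is a minimal sketch of a two-pass draft-and-refine prompt in Python. The `call_llm` helper and the prompt wording are illustrative assumptions, not the paper's exact prompts or API; the point is only that the second pass asks the model to retranslate given its own draft, with no hand-crafted decomposition steps.

```python
def call_llm(prompt: str) -> str:
    """Placeholder: wire this to your chat-completion client of choice."""
    raise NotImplementedError("Connect an LLM API here.")


def translate_again(source_text: str, src_lang: str, tgt_lang: str) -> str:
    """Two-pass translation: draft, then prompt the model to 'translate again'
    and refine its own output."""
    # Pass 1: plain draft translation.
    draft = call_llm(
        f"Translate the following {src_lang} text into {tgt_lang}:\n\n{source_text}"
    )
    # Pass 2: show the model the source and its own draft, and ask for a retranslation.
    refined = call_llm(
        f"Source ({src_lang}):\n{source_text}\n\n"
        f"Draft translation ({tgt_lang}):\n{draft}\n\n"
        f"Please translate again: produce an improved, more faithful and fluent "
        f"{tgt_lang} translation of the source."
    )
    return refined
```

In this sketch the refinement step sees both the source and the first draft, so self-refinement can correct the draft rather than start from scratch.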
Similar Papers
Chain-of-Thought Reasoning Improves Context-Aware Translation with Large Language Models
Computation and Language
Helps computers translate sentences better.
New Trends for Modern Machine Translation with Large Reasoning Models
Computation and Language
Translates languages with deeper understanding and fewer mistakes.
Scaling Test-time Compute for Low-resource Languages: Multilingual Reasoning in LLMs
Computation and Language
Helps computers reason in any language.