Score: 1

Explicit Learning and the LLM in Machine Translation

Published: March 12, 2025 | arXiv ID: 2503.09454v4

By: Malik Marmonier, Rachel Bawden, Benoît Sagot

Potential Business Impact:

Computers learn new languages from books.

Business Areas:
Language Learning Education

This study explores an LLM's ability to learn new languages using explanations found in a grammar book, a process we term "explicit learning." To rigorously assess this ability, we design controlled translation experiments between English and constructed languages generated, through specific cryptographic means, from Latin or French. Contrary to previous studies, our results demonstrate that LLMs do possess a measurable capacity for explicit learning. This ability, however, diminishes as the complexity of the linguistic phenomena to be learned increases. Supervised fine-tuning on ad hoc chains of thought significantly enhances LLM performance but struggles to generalize to typologically novel or more complex linguistic features. These findings point to the need for more diverse training sets and alternative fine-tuning strategies to further improve explicit learning by LLMs, benefiting low-resource languages typically described in grammar books but lacking extensive corpora.

Repos / Data Links

Page Count
51 pages

Category
Computer Science:
Computation and Language