Score: 3

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Published: May 27, 2025 | arXiv ID: 2505.21505v1

By: Shimao Zhang , Zhejian Lai , Xiang Liu and more

BigTech Affiliations: Microsoft

Potential Business Impact:

Helps computers learn many languages better.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Multilingual Alignment is an effective and representative paradigm to enhance LLMs' multilingual capabilities, which transfers the capabilities from the high-resource languages to the low-resource languages. Meanwhile, some researches on language-specific neurons reveal that there are language-specific neurons that are selectively activated in LLMs when processing different languages. This provides a new perspective to analyze and understand LLMs' mechanisms more specifically in multilingual scenarios. In this work, we propose a new finer-grained neuron identification algorithm, which detects language neurons~(including language-specific neurons and language-related neurons) and language-agnostic neurons. Furthermore, based on the distributional characteristics of different types of neurons, we divide the LLMs' internal process for multilingual inference into four parts: (1) multilingual understanding, (2) shared semantic space reasoning, (3) multilingual output space transformation, and (4) vocabulary space outputting. Additionally, we systematically analyze the models before and after alignment with a focus on different types of neurons. We also analyze the phenomenon of ''Spontaneous Multilingual Alignment''. Overall, our work conducts a comprehensive investigation based on different types of neurons, providing empirical results and valuable insights for better understanding multilingual alignment and multilingual capabilities of LLMs.

Language-specific Neurons Do Not Facilitate Cross-Lingual Transfer

Computation and Language

Helps computers understand less common languages better.

21 Mar 2025 2

91%

From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment

Computation and Language

Tests how well computers understand different languages.

20 Jul 2025 1

90%

Cross-Lingual Generalization and Compression: From Language-Specific to Shared Neurons

Computation and Language

Computers learn to understand words in many languages.

2 Jun 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 🇨🇳 United States, China

Repos / Data Links

github.com github.com github.com github.com

Page Count

18 pages

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Helps computers learn many languages better.

Technical Abstract

Language-specific Neurons Do Not Facilitate Cross-Lingual Transfer

From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment

Cross-Lingual Generalization and Compression: From Language-Specific to Shared Neurons