Neuron-Guided Interpretation of Code LLMs: Where, Why, and How?
By: Zhe Yin, Xiaodong Gu, Beijun Shen
Code language models excel on code intelligence tasks, yet their internal interpretability is underexplored. Existing neuron interpretability techniques from NLP are suboptimal for source code due to the formal, hierarchical, and executable nature of programming languages. We empirically investigate code LLMs at the neuron level, localizing language-specific neurons (selectively responsive to one language) and concept layers (feed-forward layers encoding language-agnostic code representations). We analyze Llama-3.1-8B and Qwen2.5-Coder-32B on multilingual inputs in C++, Java, Python, Go, and JavaScript, measuring neuron selectivity and layerwise contributions during generation. We find (1) neurons specialized for individual languages alongside a universal subset supporting general-purpose generation; and (2) lower layers mainly encode language-specific syntax, while middle layers capture semantic abstractions shared across languages, emerging as concept layers. We demonstrate utility on three tasks: neuron-guided fine-tuning for code generation, clone detection via concept-layer embeddings, and concept-layer-guided transfer for code summarization, each yielding consistent gains in multilingual settings.
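The abstract only outlines the analysis, so here is a minimal, hypothetical sketch of one ingredient: scoring neurons for language selectivity from recorded feed-forward activations. The activation collection step (e.g., forward hooks on the model) is omitted, and the tensor shapes, threshold, and selectivity formula below are illustrative assumptions rather than the paper's exact procedure.

import torch

LANGS = ["cpp", "java", "python", "go", "javascript"]

# Placeholder activations; in practice these would come from forward hooks
# on a code LLM while it processes prompts in each language.
# acts[lang]: (num_samples, num_neurons) mean FFN activations per prompt.
torch.manual_seed(0)
acts = {lang: torch.rand(100, 4096) for lang in LANGS}

# Mean activation of each neuron per language: (num_langs, num_neurons).
mean_acts = torch.stack([acts[lang].mean(dim=0) for lang in LANGS])

# Selectivity index: how strongly the top language dominates the others.
# A neuron counts as "language-specific" when one language's mean activation
# clearly exceeds the average of the remaining languages.
top, top_idx = mean_acts.max(dim=0)
rest = (mean_acts.sum(dim=0) - top) / (len(LANGS) - 1)
selectivity = (top - rest) / (top + rest + 1e-8)

threshold = 0.5  # hypothetical cutoff; with random placeholder data few or no
                 # neurons will clear it.
specific = (selectivity > threshold).nonzero(as_tuple=True)[0]
for n in specific[:10]:
    print(f"neuron {n.item():5d} prefers {LANGS[top_idx[n]]}, "
          f"selectivity={selectivity[n].item():.2f}")

Replacing the random tensors with activations captured per language would let the same scoring pass rank candidate language-specific neurons for further inspection.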
Similar Papers
How Programming Concepts and Neurons Are Shared in Code Language Models
Computation and Language
Helps computers understand many programming languages, much like they understand English.
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Software Engineering
Helps computers write computer programs from your words.