Understanding Chain-of-Thought Effectiveness in Code Generation: An Empirical and Information-Theoretic Analysis

Published: December 10, 2025 | arXiv ID: 2512.09679v1

By: Naizhu Jin, Zhong Li, Guang Yang and more

Large language models (LLMs) achieve strong performance on code generation, but the mechanisms by which Chain-of-Thought (CoT) prompting helps remain unclear. We present a systematic empirical and information-theoretic study of CoT effectiveness in neural code generation, evaluating five paradigms (Zero-Shot, Zero-Shot CoT, Self-Planning, Structured CoT, Reasoning-CoT) across six Python benchmarks, a multilingual benchmark with 12 programming languages, and six models from 7B to 480B parameters, using conditional mutual information $I(Y;C|X)$ as a conceptual lens. Our results show that externally guided CoT consistently outperforms direct generation, with structured methods improving Pass@1 by 5–12% on average while using substantially fewer tokens than reflective reasoning, and that CoT benefits depend on language type systems and model capacity. We further find that reasoning quality is critical: high-quality structured CoT from strong generators yields significantly higher accuracy than lightweight alternatives with the same template, whereas naive Zero-Shot CoT can even degrade performance. These findings provide practical guidance for choosing CoT strategies based on model capacity, language characteristics, and task complexity.
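For context on the two quantities the abstract references: conditional mutual information $I(Y;C|X) = H(Y|X) - H(Y|X,C)$ measures how much the reasoning chain $C$ reduces uncertainty about the generated code $Y$ beyond what the prompt $X$ already provides, and Pass@1 is the fraction of problems solved by a single sample. Below is a minimal Python sketch of the standard unbiased pass@k estimator (Chen et al., 2021) commonly used in such evaluations; the function name and the sample counts in the usage line are illustrative assumptions, not details taken from this paper.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., 2021): the probability
    that at least one of k samples, drawn without replacement from n
    generations of which c are correct, passes the unit tests."""
    if n - c < k:
        # Every possible k-subset must contain at least one correct sample.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# Illustrative usage: 10 samples per problem, 3 pass the tests.
print(pass_at_k(n=10, c=3, k=1))  # 0.3 (for k=1 this reduces to c / n)
```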

Category: Computer Science / Software Engineering