Efficient Reasoning via Thought-Training and Thought-Free Inference
By: Canhui Wu, Qiong Cao, Chao Xue, and more
Potential Business Impact:
Computers learn to think without showing their work.
Recent advances in large language models (LLMs) have leveraged explicit Chain-of-Thought (CoT) prompting to improve reasoning accuracy. However, most existing methods primarily compress verbose reasoning outputs. These Long-to-Short transformations aim to improve efficiency, but still rely on explicit reasoning during inference. In this work, we introduce 3TF (Thought-Training and Thought-Free inference), a framework for efficient reasoning that takes a Short-to-Long perspective. We first train a hybrid model that can operate in both reasoning and non-reasoning modes, and then further train it on CoT-annotated data to internalize structured reasoning, while enforcing concise, thought-free outputs at inference time using the no-reasoning mode. Unlike compression-based approaches, 3TF improves the reasoning quality of non-reasoning outputs, enabling models to perform rich internal reasoning implicitly while keeping external outputs short. Empirically, 3TF-trained models obtain large improvements on reasoning benchmarks under thought-free inference, demonstrating that high-quality reasoning can be learned and executed implicitly without explicit step-by-step generation.
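To make the two-mode setup concrete, below is a minimal sketch (not the authors' code) of how 3TF-style training and inference data might be formatted: the same question appears in a reasoning mode whose training target exposes the chain of thought, and in a no-reasoning mode whose target is only the concise answer used at inference time. The mode tags and the <think>...</think> delimiters are illustrative assumptions; the paper's actual special tokens may differ.

```python
# Illustrative sketch of 3TF-style data formatting, assuming hypothetical
# mode tags and think delimiters (not confirmed by the paper).

REASONING_MODE = "<|reasoning|>"        # assumed tag for thought-training mode
NO_REASONING_MODE = "<|no_reasoning|>"  # assumed tag for thought-free mode


def format_thought_training_example(question: str, cot: str, answer: str) -> str:
    """Training example that internalizes structured reasoning:
    the CoT is present in the target, wrapped in think delimiters."""
    return (
        f"{REASONING_MODE} Question: {question}\n"
        f"<think>{cot}</think>\n"
        f"Answer: {answer}"
    )


def format_thought_free_example(question: str, answer: str) -> str:
    """No-reasoning-mode example: the target is the concise answer only,
    with no visible chain of thought."""
    return (
        f"{NO_REASONING_MODE} Question: {question}\n"
        f"Answer: {answer}"
    )


if __name__ == "__main__":
    q = "A train travels 60 km in 1.5 hours. What is its average speed?"
    cot = "Speed = distance / time = 60 km / 1.5 h = 40 km/h."
    a = "40 km/h"

    # During thought-training, both views of the same item can be mixed so the
    # hybrid model learns to reason internally yet answer concisely.
    print(format_thought_training_example(q, cot, a))
    print("---")
    # At inference time, only the no-reasoning mode is used (thought-free output).
    print(format_thought_free_example(q, a))
```

In this reading, efficiency comes from the inference-time choice of mode rather than from compressing an already-generated reasoning trace, which is the Short-to-Long perspective the abstract contrasts with Long-to-Short compression methods.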
Similar Papers
Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models
Computation and Language
Makes AI think smarter, not longer.
Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information
Computation and Language
Makes AI think faster with less information.
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
Artificial Intelligence
Makes computers think deeper to solve hard problems.