CtrlCoT: Dual-Granularity Chain-of-Thought Compression for Controllable Reasoning

Published: January 28, 2026 | arXiv ID: 2601.20467v1

By: Zhenxuan Fan, Jie Cao, Yang Dai and more

Potential Business Impact:

Makes AI think faster without losing answers.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Chain-of-thought (CoT) prompting improves LLM reasoning but incurs high latency and memory cost because of verbose traces, which motivates CoT compression that preserves correctness. Existing methods either shorten CoTs at the semantic level, which is often conservative, or prune tokens aggressively, which can miss task-critical cues and degrade accuracy. Moreover, combining the two is non-trivial because of sequential dependency, task-agnostic pruning, and distribution mismatch. We propose CtrlCoT, a dual-granularity CoT compression framework that harmonizes semantic abstraction and token-level pruning through three components: Hierarchical Reasoning Abstraction produces CoTs at multiple semantic granularities; Logic-Preserving Distillation trains a logic-aware pruner to retain indispensable reasoning cues (e.g., numbers and operators) across pruning ratios; and Distribution-Alignment Generation aligns compressed traces with fluent inference-time reasoning styles to avoid fragmentation. On MATH-500 with Qwen2.5-7B-Instruct, CtrlCoT uses 30.7% fewer tokens while achieving accuracy 7.6 percentage points higher than the strongest baseline, demonstrating more efficient and reliable reasoning. Our code will be publicly available at https://github.com/fanzhenxuan/Ctrl-CoT.
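To make the idea of retaining "indispensable reasoning cues (e.g., numbers and operators)" under a pruning ratio more concrete, here is a minimal toy sketch. It is not the CtrlCoT implementation: the paper trains a logic-aware pruner, whereas this example swaps in a hand-written regex rule and takes per-token importance scores as given. The names `prune_cot`, `PROTECTED`, `scores`, and `keep_ratio` are hypothetical and chosen only for illustration.

```python
import re

# Assumption: a token is "protected" if it contains a digit or a math operator.
# This regex stands in for the trained logic-aware pruner described in the abstract.
PROTECTED = re.compile(r"[0-9+\-*/=<>^%]")

def prune_cot(tokens, scores, keep_ratio):
    """Keep about keep_ratio of the tokens, never dropping protected ones."""
    if not 0.0 <= keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in [0, 1]")
    if len(tokens) != len(scores):
        raise ValueError("tokens and scores must have the same length")
    budget = round(len(tokens) * keep_ratio)
    protected = {i for i, t in enumerate(tokens) if PROTECTED.search(t)}
    # Whatever budget remains goes to the highest-scoring unprotected tokens.
    others = sorted(
        (i for i in range(len(tokens)) if i not in protected),
        key=lambda i: scores[i],
        reverse=True,
    )
    extra = max(0, budget - len(protected))
    keep = protected | set(others[:extra])
    # Emit survivors in their original order so the trace stays readable.
    return [t for i, t in enumerate(tokens) if i in keep]

tokens = "So we add 3 + 4 to get 7 apples".split()
scores = [0.1, 0.1, 0.9, 0.0, 0.0, 0.0, 0.2, 0.8, 0.0, 0.5]  # made-up importances
print(prune_cot(tokens, scores, keep_ratio=0.6))
# ['add', '3', '+', '4', 'get', '7']
```

In this sketch, protected tokens survive even when the requested ratio is too small to hold them all, so the ratio is treated as a target rather than a hard limit. Whether CtrlCoT makes the same trade-off is not stated in the abstract.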

Chain Of Thought Compression: A Theoritical Analysis

Artificial Intelligence

Makes AI think faster without extra words.

29 Jan 2026 0

92%

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

Computation and Language

Makes AI think faster and smarter.

28 Oct 2025 3

92%

CAC-CoT: Connector-Aware Compact Chain-of-Thought for Efficient Reasoning Data Synthesis Across Dual-System Cognitive Tasks

Artificial Intelligence

Makes AI solve hard problems faster and better.

26 Aug 2025 1

Page Count

16 pages
