Score: 0

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

Published: October 13, 2025 | arXiv ID: 2510.11545v1

By: Jiayu Ding , Lei Cui , Li Dong and more

Potential Business Impact:

Protects smart computer thinking from being copied.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Recent advances in Large Language Models (LLMs) show that extending the length of reasoning chains significantly improves performance on complex tasks. While revealing these reasoning traces helps users better follow, verify, and learn from the model's problem-solving process, it also makes them highly vulnerable to unauthorized distillation. To mitigate this risk, proprietary model providers often adopt aggressive protection strategies, such as replacing detailed reasoning with brief summaries, which deprive users of valuable intermediate information. To address this trade-off, we propose PART, an information-preserving antidistillation reformulation of reasoning traces. Motivated by the difference between how humans understand reasoning traces and how LLMs exploit them for supervised fine-tuning, we design a simple but effective two-step reformulation: removing self-talk behaviors and reordering sub-conclusions. A small auxiliary model is trained to perform this reformulation, incurring minimal computational overhead. Extensive experiments demonstrate that PART consistently disrupts distillation across student models of different sizes and types on various reasoning benchmarks. For instance, when training on reformulated traces, even the performance of a large 32B student model decreases from 54.17 to 46.88 on AIME 2024, corresponding to a 13.5% degradation.

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

Computation and Language

Makes AI learn faster by focusing on thinking steps.

24 Dec 2025 4

89%

From Reasoning LLMs to BERT: A Two-Stage Distillation Framework for Search Relevance

Information Retrieval

Makes online shopping search faster and smarter.

13 Oct 2025 1

89%

Beyond Scaling Law: A Data-Efficient Distillation Framework for Reasoning

Machine Learning (CS)

Teaches computers to think better with less data.

13 Aug 2025 1

View PDF Login to Bookmark

Page Count

16 pages

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

Protects smart computer thinking from being copied.

Technical Abstract

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

From Reasoning LLMs to BERT: A Two-Stage Distillation Framework for Search Relevance

Beyond Scaling Law: A Data-Efficient Distillation Framework for Reasoning