Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
By: Khushboo Thaker, Yony Bresler
Potential Business Impact:
Teaches small computers to write accurate database answers.
Deploying accurate Text-to-SQL systems at the enterprise level faces a difficult trilemma involving cost, security, and performance. Current solutions force enterprises to choose between expensive, proprietary Large Language Models (LLMs) and low-performing Small Language Models (SLMs). Efforts to improve SLMs often rely on distilling reasoning from LLMs using unstructured Chain-of-Thought (CoT) traces, a process that remains inherently ambiguous. Instead, we hypothesize that a formal, structured reasoning representation provides a clearer, more reliable teaching signal, since the Text-to-SQL task requires explicit and precise logical steps. To evaluate this hypothesis, we propose Struct-SQL, a novel Knowledge Distillation (KD) framework that trains an SLM to emulate a powerful LLM teacher. Specifically, we adopt the query execution plan as a formal blueprint from which to derive this structured reasoning. Our SLM, distilled with structured CoT, achieves an absolute improvement of 8.1% over an unstructured CoT distillation baseline. A detailed error analysis reveals that a key factor in this gain is a marked reduction in syntactic errors. This demonstrates that teaching a model to reason over a structured logical blueprint yields more reliable SQL generation in SLMs.
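To make the core idea concrete, the minimal sketch below shows one possible way to turn a query execution plan into a structured reasoning trace paired with the gold SQL, which could then serve as a distillation target for the student SLM. It uses SQLite's EXPLAIN QUERY PLAN and an illustrative prompt layout; the paper's actual plan representation, prompt format, and training pipeline are not described on this page, so the schema, function names, and formatting here are assumptions.

```python
# Hypothetical sketch: derive a structured CoT distillation target from a query
# execution plan. SQLite and the prompt layout are illustrative assumptions only.
import sqlite3


def execution_plan_steps(conn: sqlite3.Connection, sql: str) -> list[str]:
    """Return the query's execution plan as an ordered list of readable steps."""
    rows = conn.execute(f"EXPLAIN QUERY PLAN {sql}").fetchall()
    return [row[3] for row in rows]  # last column holds the textual plan detail


def build_structured_cot(question: str, sql: str, steps: list[str]) -> str:
    """Combine the question, plan-derived reasoning steps, and gold SQL into one
    training target that a student SLM could be fine-tuned to reproduce."""
    reasoning = "\n".join(f"Step {i + 1}: {s}" for i, s in enumerate(steps))
    return f"Question: {question}\nReasoning:\n{reasoning}\nSQL: {sql}"


if __name__ == "__main__":
    # Toy in-memory schema so the example is self-contained and runnable.
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        "CREATE TABLE courses (id INTEGER PRIMARY KEY, name TEXT);"
        "CREATE TABLE enrollments (student_id INTEGER, course_id INTEGER);"
    )
    question = "How many students are enrolled in each course?"
    sql = (
        "SELECT c.name, COUNT(e.student_id) FROM courses c "
        "JOIN enrollments e ON c.id = e.course_id GROUP BY c.name"
    )
    print(build_structured_cot(question, sql, execution_plan_steps(conn, sql)))
```

In this sketch, the plan's ordered scan, join, and grouping steps stand in for the "formal blueprint" the abstract describes; a real pipeline would generate such targets for a training set and fine-tune the SLM on them.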
Similar Papers
Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation
Artificial Intelligence
Teaches computers to think better, step-by-step.
Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring
Computation and Language
Teaches small computers to think like big ones.
Effectiveness of Chain-of-Thought in Distilling Reasoning Capability from Large Language Models
Computation and Language
Teaches small computers to think like big ones.