
MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL

Published: October 29, 2025 | arXiv ID: 2510.25510v1

By: Zekun Xu, Siyu Xia, Chuhuai Yue and more

BigTech Affiliations: Meituan

Potential Business Impact:

Teaches computers to understand questions and find answers.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

As large language models (LLMs) are increasingly used for Text-to-SQL tasks, Reinforcement Learning (RL) has become a common method for improving performance. Existing methods rely primarily on static execution feedback, which restricts real-time error correction. Integrating multi-turn tool invocation with dynamic feedback could significantly improve adaptability and robustness, and ultimately model performance. To address this limitation, we propose MTIR-SQL, a Multi-turn Tool-Integrated Reasoning reinforcement learning framework for Text-to-SQL. Our approach introduces an execution-aware multi-turn reasoning paradigm that incorporates database execution feedback at each reasoning step, enabling context-sensitive query generation and progressive refinement throughout the reasoning process. The framework extends the GRPO algorithm to accommodate complex multi-turn interaction scenarios. Because MTIR training is unstable and the model distribution can deviate significantly from that of the initial model, we enhance GRPO by adding a trajectory filtering mechanism and removing the KL loss constraint. Experimental results show that MTIR-SQL, with 4B parameters, achieves 64.4% accuracy on BIRD Dev and 84.6% execution accuracy on SPIDER Dev, significantly outperforming existing approaches.
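The two ideas in the abstract, execution feedback at each turn and a GRPO-style update with trajectory filtering and no KL term, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The names `execute_sql`, `multi_turn_rollout`, `scripted_policy`, and `group_advantages` are hypothetical, the policy model is replaced by a scripted stand-in, and the filtering rule (invalid trajectories get zero advantage and are excluded from the group statistics) is an assumption about what trajectory filtering might look like.

```python
import sqlite3
from statistics import mean, pstdev


def execute_sql(conn, sql):
    """Run one query and return (ok, feedback) to feed into the next turn."""
    try:
        return True, conn.execute(sql).fetchall()
    except sqlite3.Error as exc:
        return False, f"error: {exc}"


def multi_turn_rollout(conn, propose, max_turns=3):
    """Call propose(history) for a SQL string each turn until a query executes.

    `propose` is a hypothetical stand-in for the policy model.
    """
    history = []
    for _ in range(max_turns):
        sql = propose(history)
        ok, feedback = execute_sql(conn, sql)
        history.append((sql, ok, feedback))
        if ok:
            break
    return history


def scripted_policy(history):
    """Toy policy: misspells a column first, then corrects it after feedback."""
    if not history:
        return "SELECT nme FROM users"
    return "SELECT name FROM users ORDER BY id"


def group_advantages(rewards, valid):
    """Group-relative advantages (GRPO-style) with assumed trajectory filtering.

    Trajectories flagged invalid are left out of the mean/std and receive
    zero advantage. No KL penalty term is included.
    """
    kept = [r for r, v in zip(rewards, valid) if v]
    if len(kept) < 2:
        return [0.0] * len(rewards)
    mu, sigma = mean(kept), pstdev(kept)
    if sigma == 0:
        return [0.0] * len(rewards)
    return [(r - mu) / sigma if v else 0.0 for r, v in zip(rewards, valid)]
```

In this sketch, a rollout against a small in-memory SQLite table first produces an execution error, which is appended to the history, and then a corrected query. A real system would condition the model's next generation on that history and would define rewards and validity flags according to the paper's own criteria.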

Process-Supervised Reinforcement Learning for Interactive Multimodal Tool-Use Agents

Computation and Language

Teaches computers to use tools with voice commands.

17 Sep 2025 3

90%

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

Machine Learning (CS)

Teaches AI to solve math problems step-by-step.

18 Nov 2025 2

90%

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Machine Learning (CS)

Makes AI better at solving hard math problems.

2 Sep 2025 1


Country of Origin

🇨🇳 China

Page Count

22 pages
