Score: 1

A Multi-agent Text2SQL Framework using Small Language Models and Execution Feedback

Published: December 21, 2025 | arXiv ID: 2512.18622v1

By: Thanh Dat Hoang , Thanh Trung Huynh , Matthias Weidlich and more

Potential Business Impact:

Lets small AI understand complex data questions.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Text2SQL, the task of generating SQL queries from natural language text, is a critical challenge in data engineering. Recently, Large Language Models (LLMs) have demonstrated superior performance for this task due to their advanced comprehension and generation capabilities. However, privacy and cost considerations prevent companies from using Text2SQL solutions based on external LLMs offered as a service. Rather, small LLMs (SLMs) that are openly available and can hosted in-house are adopted. These SLMs, in turn, lack the generalization capabilities of larger LLMs, which impairs their effectiveness for complex tasks such as Text2SQL. To address these limitations, we propose MATS, a novel Text2SQL framework designed specifically for SLMs. MATS uses a multi-agent mechanism that assigns specialized roles to auxiliary agents, reducing individual workloads and fostering interaction. A training scheme based on reinforcement learning aligns these agents using feedback obtained during execution, thereby maintaining competitive performance despite a limited LLM size. Evaluation results using on benchmark datasets show that MATS, deployed on a single- GPU server, yields accuracy that are on-par with large-scale LLMs when using significantly fewer parameters. Our source code and data are available at https://github.com/thanhdath/mats-sql.

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

Machine Learning (CS)

Finds the right database for your questions.

8 Aug 2025 1

90%

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

Machine Learning (CS)

Finds the right database for your questions.

8 Aug 2025 1

90%

MageSQL: Enhancing In-context Learning for Text-to-SQL Applications with Large Language Models

Databases

Helps computers understand questions to find data.

2 Apr 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

13 pages

A Multi-agent Text2SQL Framework using Small Language Models and Execution Feedback

Lets small AI understand complex data questions.

Technical Abstract

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

MageSQL: Enhancing In-context Learning for Text-to-SQL Applications with Large Language Models