Reflective Reasoning for SQL Generation
By: Isabelle Mohr , Joao Gandarela , John Dujany and more
Potential Business Impact:
Helps computers write correct database questions.
Robust text-to-SQL over complex, real-world databases remains brittle even with modern LLMs: iterative refinement often introduces syntactic and semantic drift, corrections tend to be non-transferable across queries, and naive use of large context windows scales poorly. We propose a controlled text-to-SQL framework built around reflective refinement. Instead of repeatedly rewriting the current SQL instance, the system decomposes generation into typed stages and applies feedback as persistent updates to the stage-level generation mechanism. A Reflection-Refinement Loop localizes violations to the responsible stage maximize preservation of previously validated constraints and support monotonic improvement over a query set. The method operates without gold SQL by combining interpreter-based checks with LLM-based semantic coverage verification as epistemic judges. Experiments on Spider and BIRD demonstrate consistent gains over strong prompting baselines, robust convergence within a small refinement budget, and improved execution accuracy across both frontier and open-weight model families.
Similar Papers
Rethinking Text-to-SQL: Dynamic Multi-turn SQL Interaction for Real-world Database Exploration
Databases
Helps computers understand changing questions about data.
Text-to-SQL as Dual-State Reasoning: Integrating Adaptive Context and Progressive Generation
Computation and Language
Helps computers understand complex database questions better.
DeepEye-SQL: A Software-Engineering-Inspired Text-to-SQL Framework
Databases
Makes computers turn words into database answers.