Score: 0

CSR-RAG: An Efficient Retrieval System for Text-to-SQL on the Enterprise Scale

Published: January 10, 2026 | arXiv ID: 2601.06564v1

By: Rajpreet Singh , Novak Boškov , Lawrence Drabeck and more

Potential Business Impact:

Finds the right data in huge computer lists.

Business Areas:
Semantic Search Internet Services

Natural language to SQL translation (Text-to-SQL) is one of the long-standing problems that has recently benefited from advances in Large Language Models (LLMs). While most academic Text-to-SQL benchmarks request schema description as a part of natural language input, enterprise-scale applications often require table retrieval before SQL query generation. To address this need, we propose a novel hybrid Retrieval Augmented Generation (RAG) system consisting of contextual, structural, and relational retrieval (CSR-RAG) to achieve computationally efficient yet sufficiently accurate retrieval for enterprise-scale databases. Through extensive enterprise benchmarks, we demonstrate that CSR-RAG achieves up to 40% precision and over 80% recall while incurring a negligible average query generation latency of only 30ms on commodity data center hardware, which makes it appropriate for modern LLM-based enterprise-scale systems.

Country of Origin
🇩🇪 Germany

Page Count
8 pages

Category
Computer Science:
Computation and Language