Score: 2

AutoLink: Autonomous Schema Exploration and Expansion for Scalable Schema Linking in Text-to-SQL at Scale

Published: November 21, 2025 | arXiv ID: 2511.17190v1

By: Ziyang Wang , Yuanlei Zheng , Zhenbiao Cao and more

Potential Business Impact:

Helps computers understand databases without seeing them all.

Business Areas:

Semantic Search Internet Services

For industrial-scale text-to-SQL, supplying the entire database schema to Large Language Models (LLMs) is impractical due to context window limits and irrelevant noise. Schema linking, which filters the schema to a relevant subset, is therefore critical. However, existing methods incur prohibitive costs, struggle to trade off recall and noise, and scale poorly to large databases. We present \textbf{AutoLink}, an autonomous agent framework that reformulates schema linking as an iterative, agent-driven process. Guided by an LLM, AutoLink dynamically explores and expands the linked schema subset, progressively identifying necessary schema components without inputting the full database schema. Our experiments demonstrate AutoLink's superior performance, achieving state-of-the-art strict schema linking recall of \textbf{97.4\%} on Bird-Dev and \textbf{91.2\%} on Spider-2.0-Lite, with competitive execution accuracy, i.e., \textbf{68.7\%} EX on Bird-Dev (better than CHESS) and \textbf{34.9\%} EX on Spider-2.0-Lite (ranking 2nd on the official leaderboard). Crucially, AutoLink exhibits \textbf{exceptional scalability}, \textbf{maintaining high recall}, \textbf{efficient token consumption}, and \textbf{robust execution accuracy} on large schemas (e.g., over 3,000 columns) where existing methods severely degrade-making it a highly scalable, high-recall schema-linking solution for industrial text-to-SQL systems.

X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs

Machine Learning (CS)

Helps computers understand questions to get data.

7 Sep 2025 0

90%

LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL

Computation and Language

Helps computers understand many databases to answer questions.

24 Mar 2025 2

89%

Rethinking Schema Linking: A Context-Aware Bidirectional Retrieval Approach for Text-to-SQL

Computation and Language

Helps computers find the right data for questions.

16 Oct 2025 3

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

22 pages

AutoLink: Autonomous Schema Exploration and Expansion for Scalable Schema Linking in Text-to-SQL at Scale

Helps computers understand databases without seeing them all.

Technical Abstract

X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs

LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL

Rethinking Schema Linking: A Context-Aware Bidirectional Retrieval Approach for Text-to-SQL