Score: 0

SQUiD: Synthesizing Relational Databases from Unstructured Text

Published: May 25, 2025 | arXiv ID: 2505.19025v1

By: Mushtari Sadia , Zhenning Yang , Yunming Xiao and more

Potential Business Impact:

Turns messy text into organized lists.

Business Areas:
Database Data and Analytics, Software

Relational databases are central to modern data management, yet most data exists in unstructured forms like text documents. To bridge this gap, we leverage large language models (LLMs) to automatically synthesize a relational database by generating its schema and populating its tables from raw text. We introduce SQUiD, a novel neurosymbolic framework that decomposes this task into four stages, each with specialized techniques. Our experiments show that SQUiD consistently outperforms baselines across diverse datasets.

Country of Origin
πŸ‡ΊπŸ‡Έ United States

Page Count
24 pages

Category
Computer Science:
Databases