Score: 0

Data-Aware Socratic Query Refinement in Database Systems

Published: August 7, 2025 | arXiv ID: 2508.05061v1

By: Ruiyuan Zhang, Chrysanthi Kosyfaki, Xiaofang Zhou

Potential Business Impact:

Helps computers understand your questions better.

In this paper, we propose Data-Aware Socratic Guidance (DASG), a dialogue-based query enhancement framework that embeds \linebreak interactive clarification as a first-class operator within database systems to resolve ambiguity in natural language queries. DASG treats dialogue as an optimization decision, asking clarifying questions only when the expected execution cost reduction exceeds the interaction overhead. The system quantifies ambiguity through linguistic fuzziness, schema grounding confidence, and projected costs across relational and vector backends. Our algorithm selects the optimal clarifications by combining semantic relevance, catalog-based information gain, and potential cost reduction. We evaluate our proposed framework on three datasets. The results show that DASG demonstrates improved query precision while maintaining efficiency, establishing a cooperative analytics paradigm where systems actively participate in query formulation rather than passively translating user requests.

Country of Origin
🇭🇰 Hong Kong

Page Count
6 pages

Category
Computer Science:
Databases