AgenticData: An Agentic Data Analytics System for Heterogeneous Data
By: Ji Sun, Guoliang Li, Peiyao Zhou and more
Potential Business Impact:
Lets computers answer questions from any data.
Existing unstructured data analytics systems rely on experts to write code and manage complex analysis workflows, which makes them both expensive and time-consuming. To address these challenges, we introduce AgenticData, an agentic data analytics system that lets users simply pose natural language (NL) questions and that autonomously analyzes data sources across multiple domains, including both unstructured and structured data. First, AgenticData employs a feedback-driven planning technique that automatically converts an NL query into a semantic plan composed of relational and semantic operators. We propose a multi-agent collaboration strategy that uses a data profiling agent to discover relevant data, a semantic cross-validation agent for iterative optimization based on feedback, and a smart memory agent to maintain short-term context and long-term knowledge. Second, we propose a semantic optimization model to refine and execute semantic plans effectively. We evaluated AgenticData on three benchmarks. The experimental results show that it achieved superior accuracy on both easy and difficult tasks, significantly outperforming state-of-the-art methods.
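To make the idea of a semantic plan "composed of relational and semantic operators" more concrete, the following is a minimal illustrative sketch in Python. It is not AgenticData's implementation: the class names `RelationalFilter` and `SemanticFilter`, the `execute_plan` helper, and the keyword-based `keyword_judge` function (a toy stand-in for a language-model judgment) are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, object]


@dataclass
class RelationalFilter:
    """Hypothetical relational operator: exact predicate over structured columns."""
    predicate: Callable[[Row], bool]

    def run(self, rows: List[Row]) -> List[Row]:
        return [r for r in rows if self.predicate(r)]


@dataclass
class SemanticFilter:
    """Hypothetical semantic operator: keeps rows whose text satisfies an NL condition.

    `judge` stands in for a model call; here it is a plain Python function.
    """
    column: str
    condition: str
    judge: Callable[[str, str], bool]

    def run(self, rows: List[Row]) -> List[Row]:
        return [r for r in rows if self.judge(str(r[self.column]), self.condition)]


# Toy keyword table used instead of a real language model (assumption).
KEYWORDS = {"mentions a delay": ("late", "delayed")}


def keyword_judge(text: str, condition: str) -> bool:
    """Return True if any keyword registered for the condition occurs in the text."""
    return any(k in text.lower() for k in KEYWORDS.get(condition, ()))


def execute_plan(plan, rows: List[Row]) -> List[Row]:
    """Run the operators in order, feeding each operator's output to the next."""
    for op in plan:
        rows = op.run(rows)
    return rows


rows = [
    {"id": 1, "year": 2022, "text": "Package arrived late."},
    {"id": 2, "year": 2023, "text": "Delivery was delayed by a week."},
    {"id": 3, "year": 2024, "text": "Great product, fast shipping."},
]

# A possible plan for "Which reviews since 2023 mention a delay?"
plan = [
    RelationalFilter(lambda r: r["year"] >= 2023),
    SemanticFilter("text", "mentions a delay", keyword_judge),
]

result_ids = [r["id"] for r in execute_plan(plan, rows)]
print(result_ids)
```

In this sketch the cheap relational filter runs before the semantic one, so fewer rows reach the (normally expensive) model-backed step. Operator ordering of this kind is one sort of decision a plan optimizer might make, though the abstract does not specify how AgenticData's semantic optimization model works.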
Similar Papers
Supporting Dynamic Agentic Workloads: How Data and Agents Interact
Multiagent Systems
Helps AI teams share information faster and smarter.
Autonomous Data Agents: A New Opportunity for Smart Data
Artificial Intelligence
Makes computers automatically turn messy data into useful knowledge.
Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems
Databases
Lets computers build smart data plans alone.