Score: 1

Data Product MCP: Chat with your Enterprise Data

Published: January 13, 2026 | arXiv ID: 2601.08687v1

By: Marco Tonnarelli , Filippo Scaramuzza , Simon Harrer and more

Potential Business Impact:

Lets AI agents get data safely and easily.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

Computational data governance aims to make the enforcement of governance policies and legal obligations more efficient and reliable. Recent advances in natural language processing and agentic AI offer ways to improve how organizations share and use data. But many barriers remain. Today's tools require technical skills and multiple roles to discover, request, and query data. Automating data access using enterprise AI agents is limited by the means to discover and autonomously access distributed data. Current solutions either compromise governance or break agentic workflows through manual approvals. To close this gap, we introduce Data Product MCP integrated in a data product marketplace. This data marketplace, already in use at large enterprises, enables AI agents to find, request, and query enterprise data products while enforcing data contracts in real time without lowering governance standards. The system is built on the Model Context Protocol (MCP) and links the AI-driven marketplace with cloud platforms such as Snowflake, Databricks, and Google Cloud Platform. It supports semantic discovery of data products based on business context, automates access control by validating generated queries against approved business purposes using AI-driven checks, and enforces contracts in real time by blocking unauthorized queries before they run. We assessed the system with feedback from $n=16$ experts in data governance. Our qualitative evaluation demonstrates effectiveness through enterprise scenarios such as customer analytics. The findings suggest that Data Product MCP reduces the technical burden for data analysis without weakening governance, filling a key gap in enterprise AI adoption.

Country of Origin
🇬🇧 🇳🇱 United Kingdom, Netherlands

Page Count
7 pages

Category
Computer Science:
Emerging Technologies