AI-assisted JSON Schema Creation and Mapping

Published: August 7, 2025 | arXiv ID: 2508.05192v1

By: Felix Neubauer, Jürgen Pleiss, Benjamin Uekermann

Potential Business Impact:

Lets non-experts create, modify, and map JSON Schemas using natural-language descriptions.

Model-Driven Engineering (MDE) places models at the core of system and data engineering processes. In the context of research data, these models are typically expressed as schemas that define the structure and semantics of datasets. However, many domains still lack standardized models, and creating them remains a significant barrier, especially for non-experts. We present a hybrid approach that combines large language models (LLMs) with deterministic techniques to enable JSON Schema creation, modification, and schema mapping based on natural language inputs by the user. These capabilities are integrated into the open-source tool MetaConfigurator, which already provides visual model editing, validation, code generation, and form generation from models. For data integration, we generate schema mappings from heterogeneous JSON, CSV, XML, and YAML data using LLMs, while ensuring scalability and reliability through deterministic execution of generated mapping rules. The applicability of our work is demonstrated in an application example in the field of chemistry. By combining natural language interaction with deterministic safeguards, this work significantly lowers the barrier to structured data modeling and data integration for non-experts.
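
The split between LLM-generated mapping rules and their deterministic execution can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the rule format (`source`, `target`, optional `transform`), the helper names, and the sample chemistry record are hypothetical and are not taken from MetaConfigurator or the paper. The idea is only that a model proposes a small set of declarative rules once, and plain code then applies them to every record without further model calls.

```python
from typing import Any

# Hypothetical rule format (an assumption, not MetaConfigurator's actual format):
# each rule copies a value from a dotted source path to a dotted target path,
# optionally applying a named, whitelisted transform.
MAPPING_RULES = [
    {"source": "compound.name", "target": "substance.label"},
    {"source": "compound.mass_g", "target": "substance.mass.value", "transform": "float"},
]

# Only these transforms may be referenced by a rule, so executing a rule
# never runs arbitrary generated code.
TRANSFORMS = {"float": float, "int": int, "str": str}


def get_path(data: dict[str, Any], path: str) -> Any:
    """Read a nested value using a dotted path such as 'a.b.c'."""
    current: Any = data
    for key in path.split("."):
        current = current[key]
    return current


def set_path(data: dict[str, Any], path: str, value: Any) -> None:
    """Write a nested value, creating intermediate dicts as needed."""
    keys = path.split(".")
    current = data
    for key in keys[:-1]:
        current = current.setdefault(key, {})
    current[keys[-1]] = value


def apply_mapping(record: dict[str, Any], rules: list[dict[str, str]]) -> dict[str, Any]:
    """Deterministically apply the mapping rules to one input record."""
    result: dict[str, Any] = {}
    for rule in rules:
        value = get_path(record, rule["source"])
        transform = rule.get("transform")
        if transform is not None:
            value = TRANSFORMS[transform](value)
        set_path(result, rule["target"], value)
    return result


source_record = {"compound": {"name": "ethanol", "mass_g": "46.07"}}
mapped = apply_mapping(source_record, MAPPING_RULES)
print(mapped)
```

Because the rules are plain data, they can be reviewed by a person, stored alongside the schemas, and re-run on large datasets at no extra model cost, which is one plausible reading of the scalability and reliability claim in the abstract.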

Country of Origin
🇩🇪 Germany


Page Count
5 pages

Category
Computer Science: Software Engineering