AI-assisted JSON Schema Creation and Mapping
By: Felix Neubauer, Jürgen Pleiss, Benjamin Uekermann
Potential Business Impact:
Helps anyone create data rules with words.
Model-Driven Engineering (MDE) places models at the core of system and data engineering processes. In the context of research data, these models are typically expressed as schemas that define the structure and semantics of datasets. However, many domains still lack standardized models, and creating them remains a significant barrier, especially for non-experts. We present a hybrid approach that combines large language models (LLMs) with deterministic techniques to enable JSON Schema creation, modification, and schema mapping based on natural language inputs by the user. These capabilities are integrated into the open-source tool MetaConfigurator, which already provides visual model editing, validation, code generation, and form generation from models. For data integration, we generate schema mappings from heterogeneous JSON, CSV, XML, and YAML data using LLMs, while ensuring scalability and reliability through deterministic execution of generated mapping rules. The applicability of our work is demonstrated in an application example in the field of chemistry. By combining natural language interaction with deterministic safeguards, this work significantly lowers the barrier to structured data modeling and data integration for non-experts.
Similar Papers
A Model-Driven Engineering Approach to AI-Powered Healthcare Platforms
Software Engineering
Helps doctors share patient data safely for better cures.
Schema Generation for Large Knowledge Graphs Using Large Language Models
Artificial Intelligence
Helps computers build knowledge maps automatically.
A Roadmap for Tamed Interactions with Large Language Models
Software Engineering
Makes AI answers more trustworthy and predictable.