Knowledge-Driven Hallucination in Large Language Models: An Empirical Study on Process Modeling
By: Humam Kourani , Anton Antonov , Alessandro Berti and more
Potential Business Impact:
AI sometimes makes up facts, even when told otherwise.
The utility of Large Language Models (LLMs) in analytical tasks is rooted in their vast pre-trained knowledge, which allows them to interpret ambiguous inputs and infer missing information. However, this same capability introduces a critical risk of what we term knowledge-driven hallucination: a phenomenon where the model's output contradicts explicit source evidence because it is overridden by the model's generalized internal knowledge. This paper investigates this phenomenon by evaluating LLMs on the task of automated process modeling, where the goal is to generate a formal business process model from a given source artifact. The domain of Business Process Management (BPM) provides an ideal context for this study, as many core business processes follow standardized patterns, making it likely that LLMs possess strong pre-trained schemas for them. We conduct a controlled experiment designed to create scenarios with deliberate conflict between provided evidence and the LLM's background knowledge. We use inputs describing both standard and deliberately atypical process structures to measure the LLM's fidelity to the provided evidence. Our work provides a methodology for assessing this critical reliability issue and raises awareness of the need for rigorous validation of AI-generated artifacts in any evidence-based domain.
Similar Papers
From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation
Software Engineering
AI helps businesses manage work better.
Evaluating the Process Modeling Abilities of Large Language Models -- Preliminary Foundations and Results
Computation and Language
Helps computers create better step-by-step plans.
Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities
Image and Video Processing
Fixes AI mistakes in medical pictures.