Evaluating the Process Modeling Abilities of Large Language Models -- Preliminary Foundations and Results
By: Peter Fettke, Constantin Houy
Potential Business Impact:
Helps computers create better step-by-step plans.
Large language models (LLM) have revolutionized the processing of natural language. Although first benchmarks of the process modeling abilities of LLM are promising, it is currently under debate to what extent an LLM can generate good process models. In this contribution, we argue that the evaluation of the process modeling abilities of LLM is far from being trivial. Hence, available evaluation results must be taken carefully. For example, even in a simple scenario, not only the quality of a model should be taken into account, but also the costs and time needed for generation. Thus, an LLM does not generate one optimal solution, but a set of Pareto-optimal variants. Moreover, there are several further challenges which have to be taken into account, e.g. conceptualization of quality, validation of results, generalizability, and data leakage. We discuss these challenges in detail and discuss future experiments to tackle these challenges scientifically.
Similar Papers
On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks
Databases
Teaches computers to understand how things work.
Evaluation of LLMs for Process Model Analysis and Optimization
Artificial Intelligence
Helps computers find mistakes in business plans.
Knowledge-Driven Hallucination in Large Language Models: An Empirical Study on Process Modeling
Artificial Intelligence
AI sometimes makes up facts, even when told otherwise.