A Hybrid Approach for EMF Code Generation:Code Templates Meet Large Language Models
By: Xiao He , Ru Chen , Zeqing Zhang and more
Template-based and LLM-based code generation are both key enablers of automated software development. The former provides correctness guarantees but are rigid for complex requirements, whereas LLMs offer high flexibility at the risk of producing faulty code.This paper proposes iEcoreGen, a hybrid approach that integrates Eclipse Modeling Framework (EMF) and LLMs. In EMF, an Ecore model defines a system structure and acts as a blueprint for code-generation.iEcoreGen decomposes requirements to derive operation specifications, uses EMF's template-based generator to produce initial Java code, and serializes specifications into docstrings. LLMs are then invoked to complete and fix unimplemented methods. We assessed iEcoreGen on twenty code-generation tasks across five LLMs. It surpasses LLM-only baselines on pass@k and performs on par with them on compilation@k. An ablation study clarified the contribution of each component of iEcoreGen. Overall, the findings indicate that LLM-enhanced model-driven development is a promising path toward more efficient software automation.
Similar Papers
A Survey on Code Generation with LLM-based Agents
Software Engineering
Computers write and fix computer programs themselves.
Designing Empirical Studies on LLM-Based Code Generation: Towards a Reference Framework
Software Engineering
Makes AI code writing easier to test fairly.
From Understanding to Excelling: Template-Free Algorithm Design through Structural-Functional Co-Evolution
Software Engineering
Creates better computer programs than people can.