Seeking Specifications: The Case for Neuro-Symbolic Specification Synthesis
By: George Granberry, Wolfgang Ahrendt, Moa Johansson
Potential Business Impact:
Helps computers understand code better.
This work is concerned with the generation of formal specifications from code, using Large Language Models (LLMs) in combination with symbolic methods. Concretely, in our study, the programming language is C, the specification language is ACSL, and the LLM is Deepseek-R1. In this context, we address two research directions, namely the specification of intent vs. implementation on the one hand, and the combination of symbolic analyses with LLMs on the other hand. For the first, we investigate how the absence or presence of bugs in the code impacts the generated specifications, as well as whether and how a user can direct the LLM to specify intent or implementation, respectively. For the second, we investigate the impact of results from symbolic analyses on the specifications generated by the LLM. The LLM prompts are augmented with outputs from two formal methods tools in the Frama-C ecosystem, Pathcrawler and EVA. We demonstrate how the addition of symbolic analysis to the workflow impacts the quality of annotations.
Similar Papers
A Neurosymbolic Approach to Natural Language Formalization and Verification
Computation and Language
Makes AI follow rules perfectly, like a robot lawyer.
A Short Survey on Formalising Software Requirements using Large Language Models
Software Engineering
Helps computers write perfect software code.
LLM-Enhanced Symbolic Control for Safety-Critical Applications
Systems and Control
Teaches robots to follow spoken instructions safely.