Seeking Specifications: The Case for Neuro-Symbolic Specification Synthesis

Published: April 29, 2025 | arXiv ID: 2504.21061v1

By: George Granberry, Wolfgang Ahrendt, Moa Johansson

Potential Business Impact:

Helps tools derive formal specifications from C code, which could support automated verification.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

This work is concerned with the generation of formal specifications from code, using Large Language Models (LLMs) in combination with symbolic methods. Concretely, in our study, the programming language is C, the specification language is ACSL, and the LLM is DeepSeek-R1. In this context, we address two research directions: the specification of intent vs. implementation, and the combination of symbolic analyses with LLMs. For the first, we investigate how the absence or presence of bugs in the code impacts the generated specifications, as well as whether and how a user can direct the LLM to specify intent or implementation, respectively. For the second, we investigate the impact of results from symbolic analyses on the specifications generated by the LLM. The LLM prompts are augmented with outputs from two formal methods tools in the Frama-C ecosystem, PathCrawler and EVA. We demonstrate how the addition of symbolic analysis to the workflow impacts the quality of annotations.
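
To make the distinction between specifying intent and specifying implementation concrete, the following is a minimal sketch in C with ACSL contracts. It is a hypothetical illustration and is not taken from the paper: the functions max_of, max_of_buggy, intent_holds, and check_examples are assumed names, and the seeded bug (a flipped comparison) is invented for this example. For the correct function, the intent-style and implementation-style contracts coincide. For the buggy one, the contract shown records what the body actually computes, which contradicts what the function name suggests.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical example, not from the paper.
   The body is correct, so a contract describing the intent and one
   describing the implementation are the same. */
/*@ assigns \nothing;
  @ ensures \result >= a && \result >= b;
  @ ensures \result == a || \result == b;
  @*/
int max_of(int a, int b) {
    return a > b ? a : b;
}

/* Deliberately buggy variant: the comparison is flipped, so the body
   returns the smaller value. The contract below is implementation-style:
   it records what the code does, not what the name promises. */
/*@ assigns \nothing;
  @ ensures \result <= a && \result <= b;
  @ ensures \result == a || \result == b;
  @*/
int max_of_buggy(int a, int b) {
    return a < b ? a : b;
}

/* Runtime mirror of the intent-style postcondition, so the gap between
   intent and implementation can be observed without a verifier. */
bool intent_holds(int a, int b, int result) {
    return result >= a && result >= b && (result == a || result == b);
}

/* Small self-check: the buggy variant violates the intent on (3, 7). */
void check_examples(void) {
    assert(intent_holds(3, 7, max_of(3, 7)));
    assert(!intent_holds(3, 7, max_of_buggy(3, 7)));
}
```

An intent-style contract attached to max_of_buggy would fail to verify and thereby expose the bug, whereas the implementation-style contract above would verify and silently document the faulty behavior. A deductive verifier such as the Frama-C WP plugin could in principle check these contracts statically; that step is assumed here and not shown.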