TypePilot: Leveraging the Scala Type System for Secure LLM-generated Code
By: Alexander Sternfeld, Andrei Kucharavy, Ljiljana Dolamic
Potential Business Impact:
Fixes computer code to stop security problems.
Large language Models (LLMs) have shown remarkable proficiency in code generation tasks across various programming languages. However, their outputs often contain subtle but critical vulnerabilities, posing significant risks when deployed in security-sensitive or mission-critical systems. This paper introduces TypePilot, an agentic AI framework designed to enhance the security and robustness of LLM-generated code by leveraging strongly typed and verifiable languages, using Scala as a representative example. We evaluate the effectiveness of our approach in two settings: formal verification with the Stainless framework and general-purpose secure code generation. Our experiments with leading open-source LLMs reveal that while direct code generation often fails to enforce safety constraints, just as naive prompting for more secure code, our type-focused agentic pipeline substantially mitigates input validation and injection vulnerabilities. The results demonstrate the potential of structured, type-guided LLM workflows to improve the SotA of the trustworthiness of automated code generation in high-assurance domains.
Similar Papers
StackPilot: Autonomous Function Agents for Scalable and Environment-Free Code Execution
Programming Languages
Checks if computer code written by AI works.
RulePilot: An LLM-Powered Agent for Security Rule Generation
Cryptography and Security
Automates security rules, saving experts time.
Agent-based code generation for the Gammapy framework
Software Engineering
Helps scientists write computer code for research.