
Static Analysis as a Feedback Loop: Enhancing LLM-Generated Code Beyond Correctness

Published: August 20, 2025 | arXiv ID: 2508.14419v1

By: Scott Blyth, Sherlock A. Licorish, Christoph Treude and more

Potential Business Impact:

Makes computer code safer and easier to read.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Large language models (LLMs) have demonstrated impressive capabilities in code generation, achieving high scores on benchmarks such as HumanEval and MBPP. However, these benchmarks primarily assess functional correctness and neglect broader dimensions of code quality, including security, reliability, readability, and maintainability. In this work, we systematically evaluate the ability of LLMs to generate high-quality code across multiple dimensions using the PythonSecurityEval benchmark. We introduce an iterative static analysis-driven prompting algorithm that leverages Bandit and Pylint to identify and resolve code quality issues. Our experiments with GPT-4o show substantial improvements: security issues reduced from >40% to 13%, readability violations from >80% to 11%, and reliability warnings from >50% to 11% within ten iterations. These results demonstrate that LLMs, when guided by static analysis feedback, can significantly enhance code quality beyond functional correctness.
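The iterative, static analysis-driven prompting described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the helper names (`run_json_tool`, `bandit_issues`, `pylint_issues`, `build_prompt`, `refine`), the prompt wording, and the stopping rule are hypothetical, and `ask_model` stands in for whatever LLM client is used. The two wrappers assume the `bandit` and `pylint` command-line tools are installed and emit JSON reports.

```python
import json
import subprocess
import tempfile
from pathlib import Path
from typing import Callable, List, Tuple

Analyzer = Callable[[str], List[str]]


def run_json_tool(cmd: List[str], source: str):
    """Write source to a temp file, run a CLI analyzer on it, parse its JSON stdout."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "candidate.py"
        path.write_text(source, encoding="utf-8")
        # Both tools exit non-zero when they report issues, so check=True is not used.
        proc = subprocess.run(cmd + [str(path)], capture_output=True, text=True)
    return json.loads(proc.stdout or "null")


def bandit_issues(source: str) -> List[str]:
    """Security findings from Bandit (assumes the `bandit` CLI is on PATH)."""
    report = run_json_tool(["bandit", "-q", "-f", "json"], source) or {}
    return [
        f"bandit {r['test_id']} line {r['line_number']}: {r['issue_text']}"
        for r in report.get("results", [])
    ]


def pylint_issues(source: str) -> List[str]:
    """Readability and reliability findings from Pylint (assumes `pylint` is on PATH)."""
    report = run_json_tool(["pylint", "--output-format=json"], source) or []
    return [f"pylint {m['symbol']} line {m['line']}: {m['message']}" for m in report]


def build_prompt(source: str, issues: List[str]) -> str:
    """Hypothetical prompt format: the code followed by the analyzer findings."""
    findings = "\n".join(f"- {issue}" for issue in issues)
    return (
        "Revise the Python code below to resolve every listed issue "
        "without changing its behaviour. Return only the code.\n\n"
        f"Code:\n{source}\n\nIssues:\n{findings}\n"
    )


def refine(
    source: str,
    analyzers: List[Analyzer],
    ask_model: Callable[[str], str],
    max_iterations: int = 10,
) -> Tuple[str, List[int]]:
    """Loop: analyze, feed findings back to the model, re-analyze.

    Returns the final code and the issue count observed after each analysis pass.
    """
    def analyze(code: str) -> List[str]:
        return [msg for analyzer in analyzers for msg in analyzer(code)]

    issues = analyze(source)
    history = [len(issues)]
    for _ in range(max_iterations):
        if not issues:
            break  # nothing left to fix
        source = ask_model(build_prompt(source, issues))
        issues = analyze(source)
        history.append(len(issues))
    return source, history
```

A call such as `refine(code, [bandit_issues, pylint_issues], ask_model)` would run up to ten rounds, matching the iteration budget mentioned in the abstract. Keeping the analyzers and the model as plain callables is a design choice made here so the loop can be exercised with stubs.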

Assessing the Quality and Security of AI-Generated Code: A Quantitative Analysis

Software Engineering

Finds bugs and security risks in AI-written code.

20 Aug 2025

90%

Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements

Software Engineering

Fixes computer code bugs automatically and faster.

12 Jun 2025

89%

Do Code LLMs Do Static Analysis?

Software Engineering

Computers can't yet understand code like humans.

17 May 2025


Country of Origin

🇳🇿 🇦🇺 🇸🇬 New Zealand, Australia, Singapore

Repos / Data Links

github.com

Page Count

10 pages
