Score: 2

A Causal Perspective on Measuring, Explaining and Mitigating Smells in \llm-Generated Code

Published: November 19, 2025 | arXiv ID: 2511.15817v1

By: Alejandro Velasco , Daniel Rodriguez-Cardenas , Dipin Khati and more

BigTech Affiliations: Microsoft

Potential Business Impact:

Helps computers write better, cleaner code.

Business Areas:
Simulation Software

Recent advances in large language models (LLMs) have accelerated their adoption in software engineering contexts. However, concerns persist about the structural quality of the code they produce. In particular, LLMs often replicate poor coding practices, introducing code smells (i.e., patterns that hinder readability, maintainability, or design integrity). Although prior research has examined the detection or repair of smells, we still lack a clear understanding of how and when these issues emerge in generated code. This paper addresses this gap by systematically measuring, explaining and mitigating smell propensity in LLM-generated code. We build on the Propensity Smelly Score (PSC), a probabilistic metric that estimates the likelihood of generating particular smell types, and establish its robustness as a signal of structural quality. Using PSC as an instrument for causal analysis, we identify how generation strategy, model size, model architecture and prompt formulation shape the structural properties of generated code. Our findings show that prompt design and architectural choices play a decisive role in smell propensity and motivate practical mitigation strategies that reduce its occurrence. A user study further demonstrates that PSC helps developers interpret model behavior and assess code quality, providing evidence that smell propensity signals can support human judgement. Taken together, our work lays the groundwork for integrating quality-aware assessments into the evaluation and deployment of LLMs for code.

Country of Origin
πŸ‡§πŸ‡© πŸ‡ΊπŸ‡Έ Bangladesh, United States

Page Count
12 pages

Category
Computer Science:
Software Engineering