Specification and Detection of LLM Code Smells
By: Brahim Mahmoudi, Zacharie Chenail-Larcher, Naouel Moha, et al.
Large Language Models (LLMs) have gained massive popularity in recent years and are increasingly integrated into software systems for diverse purposes. However, integrating them poorly into source code can undermine software quality, and, to our knowledge, there is no formal catalog of code smells specific to coding practices for LLM inference. In this paper, we introduce the concept of LLM code smells and, drawing on the relevant literature, formalize five recurrent problematic coding practices related to LLM inference in software systems. We extend the detection tool SpecDetect4AI to cover the newly defined LLM code smells and use it to validate their prevalence on a dataset of 200 open-source LLM systems. Our results show that LLM code smells affect 60.50% of the analyzed systems, with a detection precision of 86.06%.
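The abstract does not enumerate the five formalized smells, so the sketch below is only a hypothetical illustration of the kind of LLM-inference coding practice such a catalog might flag: a completion call with unexplained sampling parameters and no failure handling, next to a refactored variant. The endpoint URL, function names, and parameter choices are invented for illustration and are not taken from the paper or from SpecDetect4AI.

```python
import requests

API_URL = "https://example.com/v1/completions"  # hypothetical endpoint, not from the paper

# Smelly version (illustrative only): unexplained magic-number sampling
# parameters inlined at the call site, no timeout, and no handling of
# network or API failures.
def summarize_smelly(text: str) -> str:
    resp = requests.post(API_URL, json={
        "prompt": "Summarize: " + text,
        "temperature": 0.73,  # magic value with no rationale
        "max_tokens": 256,
    })
    return resp.json()["text"]  # raises on any malformed or error response

# Refactored version: named configuration, an explicit timeout, and
# failure handling around the inference call.
DEFAULT_PARAMS = {"temperature": 0.7, "max_tokens": 256}

def summarize(text: str, params: dict | None = None) -> str | None:
    payload = {"prompt": "Summarize: " + text, **(params or DEFAULT_PARAMS)}
    try:
        resp = requests.post(API_URL, json=payload, timeout=30)
        resp.raise_for_status()
        return resp.json().get("text")
    except requests.RequestException:
        return None  # let the caller decide how to degrade gracefully
```

A static detector in the spirit of SpecDetect4AI could presumably flag the first variant by matching inference calls that lack a timeout or surrounding error handling, though the paper's actual detection rules are not described in this abstract.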
Similar Papers
Investigating The Smells of LLM Generated Code
Software Engineering
Investigates code smells in code generated by LLMs.
Clean Code, Better Models: Enhancing LLM Performance with Smell-Cleaned Dataset
Software Engineering
Enhances LLM performance by training on a dataset cleaned of code smells.
Teaching Code Refactoring Using LLMs
Software Engineering
Uses LLMs to teach students code refactoring.