EvoGPT: Enhancing Test Suite Robustness via LLM-Based Generation and Genetic Optimization
By: Lior Broide, Roni Stern
Potential Business Impact:
Finds bugs in computer code better.
Large Language Models (LLMs) have recently emerged as promising tools for automated unit test generation. We introduce a hybrid framework called EvoGPT that integrates LLM-based test generation with evolutionary search techniques to create diverse, fault-revealing unit tests. Unit tests are initially generated with diverse temperature sampling to maximize behavioral and test suite diversity, followed by a generation-repair loop and coverage-guided assertion enhancement. The resulting test suites are evolved using genetic algorithms, guided by a fitness function prioritizing mutation score over traditional coverage metrics. This design emphasizes the primary objective of unit testing-fault detection. Evaluated on multiple open-source Java projects, EvoGPT achieves an average improvement of 10% in both code coverage and mutation score compared to LLMs and traditional search-based software testing baselines. These results demonstrate that combining LLM-driven diversity, targeted repair, and evolutionary optimization produces more effective and resilient test suites.
Similar Papers
Mutation-Guided Unit Test Generation with a Large Language Model
Software Engineering
Finds more software bugs with smarter tests.
Large Language Models for Unit Test Generation: Achievements, Challenges, and the Road Ahead
Software Engineering
Helps computers write better code tests automatically.
MultiGA: Leveraging Multi-Source Seeding in Genetic Algorithms
Neural and Evolutionary Computing
Combines many AI minds to solve hard problems.