Execution-Feedback Driven Test Generation from SWE Issues
By: Toufique Ahmed , Jatin Ganhotra , Avraham Shinnar and more
Potential Business Impact:
Fixes computer bugs by creating tests automatically.
A software engineering issue (SWE issue) is easier to resolve when accompanied by a reproduction test. Unfortunately, most issues do not come with functioning reproduction tests, so this paper explores how to generate them automatically. The primary challenge in this setting is that the code to be tested is either missing or wrong, as evidenced by the existence of the issue in the first place. This has held back test generation for this setting: without the correct code to execute, it is difficult to leverage execution feedback to generate good tests. This paper introduces novel techniques for leveraging execution feedback to get around this problem, implemented in a new reproduction test generator called e-Otter++. Experiments show that e-Otter++ represents a leap ahead in the state-of-the-art for this problem, generating tests with an average fail-to-pass rate of 63% on the TDD-Bench Verified benchmark.
Similar Papers
Otter: Generating Tests from Issues to Validate SWE Patches
Software Engineering
Creates tests for fixing computer bugs.
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
Software Engineering
Helps computers learn to fix software bugs faster.
SWE-Exp: Experience-Driven Software Issue Resolution
Software Engineering
Computers learn from past mistakes to fix code.