SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization
By: Revanth Gangi Reddy , Ye Liu , Wenting Zhao and more
Maintaining large-scale, multilingual codebases hinges on accurately localizing issues, which requires mapping natural-language error descriptions to the relevant functions that need to be modified. However, existing ranking approaches are often Python-centric and perform a single-pass search over the codebase. This work introduces SweRank+, a framework that couples SweRankMulti, a cross-lingual code ranking tool, with SweRankAgent, an agentic search setup, for iterative, multi-turn reasoning over the code repository. SweRankMulti comprises a code embedding retriever and a listwise LLM reranker, and is trained using a carefully curated large-scale issue localization dataset spanning multiple popular programming languages. SweRankAgent adopts an agentic search loop that moves beyond single-shot localization with a memory buffer to reason and accumulate relevant localization candidates over multiple turns. Our experiments on issue localization benchmarks spanning various languages demonstrate new state-of-the-art performance with SweRankMulti, while SweRankAgent further improves localization over single-pass ranking.
Similar Papers
SweRank: Software Issue Localization with Code Ranking
Software Engineering
Finds software bugs faster and cheaper.
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
Software Engineering
Helps computers fix code in many languages.
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
Software Engineering
Tests AI's ability to write and fix code.