Score: 0

Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures

Published: December 5, 2025 | arXiv ID: 2512.05908v1

By: Amirkia Rafiei Oskooei , S. Selcan Yukcu , Mehmet Cevheri Bozoglan and more

Potential Business Impact:

Finds software bugs faster by reading summaries.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Bug localization in multi-repository microservice architectures is challenging due to the semantic gap between natural language bug reports and code, LLM context limitations, and the need to first identify the correct repository. We propose reframing this as a natural language reasoning task by transforming codebases into hierarchical NL summaries and performing NL-to-NL search instead of cross-modal retrieval. Our approach builds context-aware summaries at file, directory, and repository levels, then uses a two-phase search: first routing bug reports to relevant repositories, then performing top-down localization within those repositories. Evaluated on DNext, an industrial system with 46 repositories and 1.1M lines of code, our method achieves Pass@10 of 0.82 and MRR of 0.50, significantly outperforming retrieval baselines and agentic RAG systems like GitHub Copilot and Cursor. This work demonstrates that engineered natural language representations can be more effective than raw source code for scalable bug localization, providing an interpretable repository -> directory -> file search path, which is vital for building trust in enterprise AI tools by providing essential transparency.

Leveraging Large Language Model for Information Retrieval-based Bug Localization

Software Engineering

Finds computer code errors faster using smart language.

1 Aug 2025 2

90%

Progressive Code Integration for Abstractive Bug Report Summarization

Software Engineering

Helps fix computer bugs faster by reading code.

29 Nov 2025 1

89%

Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?

Software Engineering

Fixes computer code errors using summaries.

24 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇹🇷 Turkey

Page Count

9 pages

Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures

Finds software bugs faster by reading summaries.

Technical Abstract

Leveraging Large Language Model for Information Retrieval-based Bug Localization

Progressive Code Integration for Abstractive Bug Report Summarization

Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?