Score: 1

GALA: Can Graph-Augmented Large Language Model Agentic Workflows Elevate Root Cause Analysis?

Published: August 17, 2025 | arXiv ID: 2508.12472v1

By: Yifang Tian , Yaming Liu , Zichun Chong and more

Potential Business Impact:

Finds computer problems and tells you how to fix them.

Root cause analysis (RCA) in microservice systems is challenging, requiring on-call engineers to rapidly diagnose failures across heterogeneous telemetry such as metrics, logs, and traces. Traditional RCA methods often focus on single modalities or merely rank suspect services, falling short of providing actionable diagnostic insights with remediation guidance. This paper introduces GALA, a novel multi-modal framework that combines statistical causal inference with LLM-driven iterative reasoning for enhanced RCA. Evaluated on an open-source benchmark, GALA achieves substantial improvements over state-of-the-art methods of up to 42.22% accuracy. Our novel human-guided LLM evaluation score shows GALA generates significantly more causally sound and actionable diagnostic outputs than existing methods. Through comprehensive experiments and a case study, we show that GALA bridges the gap between automated failure diagnosis and practical incident resolution by providing both accurate root cause identification and human-interpretable remediation guidance.

Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

Artificial Intelligence

Fixes phone network problems faster using smart AI.

29 Jul 2025 2

90%

MicroRCA-Agent: Microservice Root Cause Analysis Method Based on Large Language Model Agents

Artificial Intelligence

Finds computer problems faster by reading logs.

19 Sep 2025 1

89%

TAMO:Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent with Multi-Modality Observation Data in Cloud-Native Systems

Artificial Intelligence

Fixes computer problems automatically by understanding clues.

29 Apr 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇦 Canada

Page Count

12 pages

GALA: Can Graph-Augmented Large Language Model Agentic Workflows Elevate Root Cause Analysis?

Finds computer problems and tells you how to fix them.

Technical Abstract

Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

MicroRCA-Agent: Microservice Root Cause Analysis Method Based on Large Language Model Agents

TAMO:Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent with Multi-Modality Observation Data in Cloud-Native Systems