Agentic Diagnostic Reasoning over Telecom and Datacenter Infrastructure
By: Nicolas Tacheny
Potential Business Impact:
Fixes computer network problems automatically.
Large-scale telecom and datacenter infrastructures rely on multi-layered service and resource models, where failures propagate across physical and logical components and affect multiple customers. Traditional approaches to root cause analysis(RCA) rely on hard-coded graph traversal algorithms or rule-based correlation engines, which are costly to maintain and tightly coupled to the infrastructure model. In this work, we introduce an agentic diagnostic framework where a Large Language Model (LLM) performs step-wise investigation using a constrained tool space exposed through the Model Context Protocol (MCP). Instead of embedding causal logic or traversal algorithms into the application, the agent autonomously navigates the infrastructure model by invoking tools for service lookup, dependency retrieval, structured and unstructured data, and event analysis, and impact discovery. We define an investigation protocol that structures the agent's reasoning and ensures grounding, reproducibility, and safe handling of missing or ambiguous information. This work lays the foundation for autonomous incident resolution and change impact mitigation. Future systems will not only diagnose and remediate infrastructure failures, but also predict the impact of planned changes on services and customers, enabling operators to mitigate risks before executing maintenance operations.
Similar Papers
Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks
Artificial Intelligence
Fixes phone network problems faster using smart AI.
GALA: Can Graph-Augmented Large Language Model Agentic Workflows Elevate Root Cause Analysis?
Artificial Intelligence
Finds computer problems and tells you how to fix them.
RCA Copilot: Transforming Network Data into Actionable Insights via Large Language Models
Networking and Internet Architecture
Finds computer problems and tells you how to fix them.