Score: 0

TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs

Published: January 6, 2026 | arXiv ID: 2601.02632v1

By: Alireza Ezaz , Ghazal Khodabandeh , Majid Babaei and more

Potential Business Impact:

Lets computers understand messy software logs easily.

Business Areas:
Application Performance Management Data and Analytics, Software

Execution traces are a critical source of information for understanding, debugging, and optimizing complex software systems. However, traces from OS kernels or large-scale applications like Chrome or MySQL are massive and difficult to analyze. Existing tools rely on predefined analyses, and custom insights often require writing domain-specific scripts, which is an error-prone and time-consuming task. This paper introduces TAAF (Trace Abstraction and Analysis Framework), a novel approach that combines time-indexing, knowledge graphs (KGs), and large language models (LLMs) to transform raw trace data into actionable insights. TAAF constructs a time-indexed KG from trace events to capture relationships among entities such as threads, CPUs, and system resources. An LLM then interprets query-specific subgraphs to answer natural-language questions, reducing the need for manual inspection and deep system expertise. To evaluate TAAF, we introduce TraceQA-100, a benchmark of 100 questions grounded in real kernel traces. Experiments across three LLMs and multiple temporal settings show that TAAF improves answer accuracy by up to 31.2%, particularly in multi-hop and causal reasoning tasks. We further analyze where graph-grounded reasoning helps and where limitations remain, offering a foundation for next-generation trace analysis tools.

Country of Origin
🇨🇦 Canada

Page Count
12 pages

Category
Computer Science:
Software Engineering