TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs

Alireza Ezaz, Ghazal Khodabandeh, Majid Babaei, and Naser Ezzati-Jivan

arXiv:2601.02632 · cs.SE · January 7, 2026

Open Access

TL;DR

TAAF is a new framework that combines knowledge graphs and large language models to analyze complex software execution traces, making it easier to extract insights and answer questions from massive trace data.

Contribution

TAAF integrates time-indexed knowledge graphs with LLMs for trace analysis, enabling natural-language querying of traces and improved insight extraction.

Findings

01. Up to 31.2% improvement in answer accuracy

02. Effective in multi-hop and causal reasoning tasks

03. The TraceQA-100 benchmark facilitates evaluation of trace analysis methods

Abstract

Execution traces are a critical source of information for understanding, debugging, and optimizing complex software systems. However, traces from OS kernels or large-scale applications like Chrome or MySQL are massive and difficult to analyze. Existing tools rely on predefined analyses, and custom insights often require writing domain-specific scripts, which is an error-prone and time-consuming task. This paper introduces TAAF (Trace Abstraction and Analysis Framework), a novel approach that combines time-indexing, knowledge graphs (KGs), and large language models (LLMs) to transform raw trace data into actionable insights. TAAF constructs a time-indexed KG from trace events to capture relationships among entities such as threads, CPUs, and system resources. An LLM then interprets query-specific subgraphs to answer natural-language questions, reducing the need for manual inspection and…
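
To make the idea of a time-indexed KG and a query-specific subgraph concrete, here is a minimal Python sketch. It is not the authors' implementation: the class `TimeIndexedKG`, the `Edge` fields, the relation names (`runs_on`, `opens`), the entity labels, and the prompt layout are all hypothetical choices made for illustration. It assumes each trace event can be reduced to a timestamped (source, relation, destination) triple, and it stops at building the text that would be handed to an LLM; no model is called.

```python
from bisect import bisect_left, bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Edge:
    """One timestamped relationship derived from a trace event (hypothetical schema)."""
    ts: int        # event timestamp; the unit is an assumption
    src: str       # e.g. "thread:101"
    relation: str  # e.g. "runs_on"
    dst: str       # e.g. "cpu:0"


class TimeIndexedKG:
    """Toy knowledge graph whose edges are kept sorted by timestamp."""

    def __init__(self):
        self._ts = []     # sorted timestamps, parallel to self._edges
        self._edges = []  # edges in timestamp order

    def add_event(self, ts, src, relation, dst):
        # Insert at the position that keeps both lists sorted by time.
        i = bisect_right(self._ts, ts)
        self._ts.insert(i, ts)
        self._edges.insert(i, Edge(ts, src, relation, dst))

    def subgraph(self, start, end, entity=None):
        """Return edges with start <= ts <= end, optionally touching one entity."""
        lo = bisect_left(self._ts, start)
        hi = bisect_right(self._ts, end)
        window = self._edges[lo:hi]
        if entity is None:
            return window
        return [e for e in window if entity in (e.src, e.dst)]


def to_prompt(question, edges):
    """Serialize a subgraph as plain-text facts followed by the question."""
    facts = "\n".join(f"[{e.ts}] {e.src} -{e.relation}-> {e.dst}" for e in edges)
    return f"Trace facts:\n{facts}\n\nQuestion: {question}"


# Illustrative events; the values are made up for this sketch.
kg = TimeIndexedKG()
kg.add_event(100, "thread:101", "runs_on", "cpu:0")
kg.add_event(250, "thread:102", "runs_on", "cpu:1")
kg.add_event(180, "thread:101", "opens", "file:/tmp/a")
kg.add_event(400, "thread:101", "runs_on", "cpu:1")

sub = kg.subgraph(100, 300, entity="thread:101")
prompt = to_prompt("What did thread 101 do between t=100 and t=300?", sub)
print(prompt)
```

Keeping the edges sorted by timestamp lets a time-window lookup use binary search rather than a scan of the whole trace, which is one plausible reason to index a graph by time when traces are massive.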

Taxonomy

Topics: Software System Performance and Reliability · Software Engineering Research · Topic Modeling