TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs
Alireza Ezaz, Ghazal Khodabandeh, Majid Babaei, and Naser Ezzati-Jivan

TL;DR
TAAF is a new framework that combines knowledge graphs and large language models to analyze complex software execution traces, making it easier to extract insights and answer questions from massive trace data.
Contribution
It introduces a novel integration of time-indexed knowledge graphs with LLMs for trace analysis, enabling natural language querying and improved insight extraction.
Findings
Up to 31.2% improvement in answer accuracy
Effective in multi-hop and causal reasoning tasks
Benchmark TraceQA-100 facilitates evaluation of trace analysis methods
Abstract
Execution traces are a critical source of information for understanding, debugging, and optimizing complex software systems. However, traces from OS kernels or large-scale applications like Chrome or MySQL are massive and difficult to analyze. Existing tools rely on predefined analyses, and custom insights often require writing domain-specific scripts, which is an error-prone and time-consuming task. This paper introduces TAAF (Trace Abstraction and Analysis Framework), a novel approach that combines time-indexing, knowledge graphs (KGs), and large language models (LLMs) to transform raw trace data into actionable insights. TAAF constructs a time-indexed KG from trace events to capture relationships among entities such as threads, CPUs, and system resources. An LLM then interprets query-specific subgraphs to answer natural-language questions, reducing the need for manual inspection and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Engineering Research · Topic Modeling
