VarParser: Unleashing the Neglected Power of Variables for LLM-based Log Parsing
Jinrui Sun, Tong Jia, Minghua He, Ying Li

TL;DR
VarParser introduces a variable-centric approach to log parsing with LLMs, capturing variable contributions for improved accuracy and efficiency, addressing limitations of constant-focused methods.
Contribution
The paper presents a novel variable-centric log parsing strategy that leverages variable contributions and adaptive learning to enhance parsing accuracy and efficiency.
Findings
Achieves higher parsing accuracy than existing methods.
Reduces LLM invocation costs significantly.
Improves log parsing efficiency on large-scale datasets.
Abstract
Logs serve as a primary source of information for engineers to diagnose failures in large-scale online service systems. Log parsing, which extracts structured events from massive unstructured log data, is a critical first step for downstream tasks like anomaly detection and failure diagnosis. With advances in large language models (LLMs), leveraging their strong text understanding capabilities has proven effective for accurate log parsing. However, existing LLM-based log parsers all focus on the constant part of logs, ignoring the potential contribution of the variable part to log parsing. This constant-centric strategy brings four key problems. First, inefficient log grouping and sampling with only constant information. Second, a relatively large number of LLM invocations due to constant-based cache, leading to low log parsing accuracy and efficiency. Third, a relatively large number…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Business Process Modeling and Analysis
