A Story About Cohesion and Separation: Label-Free Metric for Log Parser Evaluation
Qiaolin Qin, Jianchen Zhao, Heng Li, Weiyi Shang, Ettore Merlo

TL;DR
This paper introduces PMSS, a label-free metric for evaluating log parsers that correlates well with traditional label-based metrics, enabling more flexible and industry-relevant parser assessment.
Contribution
The study proposes a novel label-free evaluation metric, PMSS, for log parsers, addressing limitations of existing label-dependent metrics and demonstrating its effectiveness and correlation with traditional measures.
Findings
PMSS correlates strongly with FGA and FTA metrics.
PMSS achieves comparable performance to label-based metrics with only 2.1% difference.
PMSS enables parser evaluation without relying on ground-truth labels.
Abstract
Log parsing converts log messages into structured event templates, allowing for automated log analysis and reducing manual inspection effort. To select the most compatible parser for a specific system, multiple evaluation metrics are commonly used for performance comparisons. However, existing evaluation metrics heavily rely on labeled log data, which limits prior studies to a fixed set of datasets and hinders parser evaluations and selections in the industry. Further, we discovered that different versions of ground-truth used in existing studies can lead to inconsistent performance conclusions. Motivated by these challenges, we propose a novel label-free template-level metric, PMSS (parser medoid silhouette score), to evaluate log parser performance. PMSS evaluates both parser grouping and template quality with medoid silhouette analysis and Levenshtein distance within a near-linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Business Process Modeling and Analysis
