LUNAR: Unsupervised LLM-based Log Parsing

Junjie Huang; Zhihan Jiang; Zhuangbin Chen; Michael R. Lyu

arXiv:2406.07174 · cs.SE · August 9, 2024

TL;DR

LUNAR introduces an unsupervised, LLM-based log parsing method that leverages contrastive analysis across log groups to improve accuracy and scalability without relying on labeled data.

Contribution

The paper presents a novel unsupervised approach using contrastive analysis and a hybrid ranking scheme to enhance LLM-based log parsing performance.

Findings

01 Outperforms state-of-the-art log parsers in accuracy
02 Demonstrates high efficiency on large-scale datasets
03 Provides a scalable, off-the-shelf log parsing solution

Abstract

Log parsing serves as an essential prerequisite for various log analysis tasks. Recent advancements in this field have improved parsing accuracy by leveraging the semantics in logs through fine-tuning large language models (LLMs) or learning from in-context demonstrations. However, these methods heavily depend on labeled examples to achieve optimal performance. In practice, collecting sufficient labeled data is challenging due to the large scale and continuous evolution of logs, leading to performance degradation of existing log parsers after deployment. To address this issue, we propose LUNAR, an unsupervised LLM-based method for efficient and off-the-shelf log parsing. Our key insight is that while LLMs may struggle with direct log parsing, their performance can be significantly enhanced through comparative analysis across multiple logs that differ only in their parameter parts. We…
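
The key insight above, that contrasting logs which differ only in their parameter parts makes the template easier to see, can be illustrated with a small sketch. This is not LUNAR's method: LUNAR prompts an LLM with such log groups, whereas the hypothetical `contrast_template` function below is a plain token-level stand-in that assumes whitespace tokenization, equal token counts, and `<*>` as the placeholder. The sample logs are invented for illustration.

```python
from typing import List

PLACEHOLDER = "<*>"  # assumed placeholder symbol for parameter positions


def contrast_template(logs: List[str]) -> str:
    """Derive a template by contrasting logs token by token.

    Positions where every log has the same token are kept as constants;
    positions where tokens differ are treated as parameters.
    """
    token_rows = [log.split() for log in logs]
    if not token_rows:
        raise ValueError("need at least one log")
    length = len(token_rows[0])
    if any(len(row) != length for row in token_rows):
        raise ValueError("logs must have the same number of tokens")

    template = []
    # zip(*rows) walks the logs column by column (one token position at a time).
    for column in zip(*token_rows):
        template.append(column[0] if len(set(column)) == 1 else PLACEHOLDER)
    return " ".join(template)


# Invented example logs that differ only in their parameter parts.
logs = [
    "Connection from 10.0.0.1 closed after 35 ms",
    "Connection from 10.0.0.7 closed after 120 ms",
    "Connection from 192.168.1.4 closed after 8 ms",
]
print(contrast_template(logs))  # Connection from <*> closed after <*> ms
```

A single log gives no contrast, so the sketch returns it unchanged; the template only emerges once several logs with varying parameters are compared, which is the intuition the abstract describes.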

Code & Models

Repositories

jun-jie-huang/lunar (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis