LUNAR: Unsupervised LLM-based Log Parsing
Junjie Huang, Zhihan Jiang, Zhuangbin Chen, Michael R. Lyu

TL;DR
LUNAR introduces an unsupervised, LLM-based log parsing method that leverages contrastive analysis across log groups to improve accuracy and scalability without relying on labeled data.
Contribution
The paper presents a novel unsupervised approach using contrastive analysis and a hybrid ranking scheme to enhance LLM-based log parsing performance.
Findings
Outperforms state-of-the-art log parsers in accuracy
Demonstrates high efficiency on large-scale datasets
Provides scalable, off-the-shelf log parsing solution
Abstract
Log parsing serves as an essential prerequisite for various log analysis tasks. Recent advancements in this field have improved parsing accuracy by leveraging the semantics in logs through fine-tuning large language models (LLMs) or learning from in-context demonstrations. However, these methods heavily depend on labeled examples to achieve optimal performance. In practice, collecting sufficient labeled data is challenging due to the large scale and continuous evolution of logs, leading to performance degradation of existing log parsers after deployment. To address this issue, we propose LUNAR, an unsupervised LLM-based method for efficient and off-the-shelf log parsing. Our key insight is that while LLMs may struggle with direct log parsing, their performance can be significantly enhanced through comparative analysis across multiple logs that differ only in their parameter parts. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
