A Comparative Study on Large Language Models for Log Parsing
Merve Astekin, Max Hort, Leon Moonen

TL;DR
This study evaluates the effectiveness of various large language models, including free and proprietary ones, in automating log parsing tasks across multiple open-source projects, revealing that smaller free models can perform competitively.
Contribution
It provides a comparative analysis of six recent LLMs for log parsing, highlighting the potential of free-to-use models in this domain.
Findings
Free models like CodeLlama outperform GPT-3.5 in log template extraction.
Free-to-use models are comparable to paid models in log parsing accuracy.
Smaller, code-specialized models show significant promise for log parsing tasks.
Abstract
Background: Log messages provide valuable information about the status of software systems. This information is provided in an unstructured fashion and automated approaches are applied to extract relevant parameters. To ease this process, log parsing can be applied, which transforms log messages into structured log templates. Recent advances in language models have led to several studies that apply ChatGPT to the task of log parsing with promising results. However, the performance of other state-of-the-art large language models (LLMs) on the log parsing task remains unclear. Aims: In this study, we investigate the current capability of state-of-the-art LLMs to perform log parsing. Method: We select six recent LLMs, including both paid proprietary (GPT-3.5, Claude 2.1) and four free-to-use open models, and compare their performance on system logs obtained from a selection of mature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Cosine Annealing · Residual Connection · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Softmax · Linear Layer · Adam
