LLM4Log: A Systematic Review of Large Language Model-based Log Analysis
Zeyang Ma, Jinqiu Yang, Tse-Hsun Chen

TL;DR
This paper systematically reviews the use of large language models in log analysis, covering tasks from log parsing to anomaly detection, and discusses challenges for real-world deployment.
Contribution
It provides a comprehensive taxonomy, summarizes design patterns, and analyzes evaluation practices for LLM-based log analysis methods.
Findings
145 papers identified across seven log analysis tasks
Common design patterns include prompting, retrieval grounding, and fine-tuning
Highlights open challenges like robustness, grounding, and deployment reliability
Abstract
Software systems generate massive, evolving, semi-structured logs that are central to reliability engineering and AIOps, yet difficult to analyze at scale under drift and limited labels. Recent advances in pretrained Transformer models and instruction-tuned large language models (LLMs) have reshaped log analysis by enabling semantic generalization and cross-source evidence integration, but also introducing deployment risks such as context limits, latency and cost, privacy constraints, and hallucinations. This paper presents LLM4Log, a systematic review of LLM-based log analysis across the end-to-end pipeline, from upstream logging-statement generation and maintenance to log parsing/structuring and downstream tasks including anomaly detection, failure prediction, root cause analysis, and log summarization. Following a structured search and manual screening protocol, we completed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
