LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

Peng Cai; Reza Ryan; and Nickson M. Karie

arXiv:2510.24031·cs.AI·October 29, 2025

LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

Peng Cai, Reza Ryan, and Nickson M. Karie

PDF

TL;DR

LLMLogAnalyzer is a novel clustering-based chatbot that enhances log analysis by leveraging large language models and machine learning, significantly improving accuracy, robustness, and usability across diverse cybersecurity log tasks.

Contribution

This paper introduces a modular framework combining LLMs and ML algorithms to address log analysis challenges, outperforming existing chatbots in accuracy and robustness.

Findings

01

Achieves 39-68% performance improvements over state-of-the-art chatbots

02

Reduces result variability by 93% in ROUGE-1 scores

03

Effective across four domain-specific log datasets

Abstract

System logs are a cornerstone of cybersecurity, supporting proactive breach prevention and post-incident investigations. However, analyzing vast amounts of diverse log data remains significantly challenging, as high costs, lack of in-house expertise, and time constraints make even basic analysis difficult for many organizations. This study introduces LLMLogAnalyzer, a clustering-based log analysis chatbot that leverages Large Language Models (LLMs) and Machine Learning (ML) algorithms to simplify and streamline log analysis processes. This innovative approach addresses key LLM limitations, including context window constraints and poor structured text handling capabilities, enabling more effective summarization, pattern extraction, and anomaly detection tasks. LLMLogAnalyzer is evaluated across four distinct domain logs and various tasks. Results demonstrate significant performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.