Try with Simpler -- An Evaluation of Improved Principal Component Analysis in Log-based Anomaly Detection
Lin Yang, Junjie Chen, Shutao Gao, Zhihao Gong, Hongyu Zhang, Yue, Kang, Huaan Li

TL;DR
This paper evaluates an optimized traditional PCA method for log-based anomaly detection, demonstrating it can rival deep learning approaches in effectiveness while being more resource-efficient and stable with limited data.
Contribution
The study introduces a lightweight semantic-based enhancement to PCA, showing traditional methods can be improved to match deep learning performance in anomaly detection.
Findings
Optimized PCA achieves similar detection accuracy as deep learning methods.
The enhanced PCA is more stable with limited training data.
Traditional techniques can be effectively improved with small adaptations.
Abstract
The rapid growth of deep learning (DL) has spurred interest in enhancing log-based anomaly detection. This approach aims to extract meaning from log events (log message templates) and develop advanced DL models for anomaly detection. However, these DL methods face challenges like heavy reliance on training data, labels, and computational resources due to model complexity. In contrast, traditional machine learning and data mining techniques are less data-dependent and more efficient but less effective than DL. To make log-based anomaly detection more practical, the goal is to enhance traditional techniques to match DL's effectiveness. Previous research in a different domain (linking questions on Stack Overflow) suggests that optimized traditional techniques can rival state-of-the-art DL methods. Drawing inspiration from this concept, we conducted an empirical study. We optimized the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data Quality and Management · Software System Performance and Reliability
MethodsPrincipal Components Analysis
