DistAD: Software Anomaly Detection Based on Execution Trace Distribution
Shiyi Kong, Jun Ai, Minyan Lu, Shuguang Wang, W. Eric Wong

TL;DR
DistAD is a novel runtime anomaly detection method for complex software systems that leverages execution trace distributions and deep learning, achieving high accuracy with minimal overhead.
Contribution
It introduces a trace distribution-based anomaly detection framework using Bi-LSTM and OCNN, addressing data imbalance and labeled data scarcity issues.
Findings
Achieves over 70% accuracy in anomaly detection
Reaches 90% recall in normal states
Maintains overheads within twice the normal execution cost
Abstract
Modern software systems have become increasingly complex, which makes them difficult to test and validate. Detecting software partial anomalies in complex systems at runtime can assist with handling unintended software behaviors, avoiding catastrophic software failures and improving software runtime availability. These detection techniques aim to identify the manifestation of faults (anomalies) before they ultimately lead to unavoidable failures, thus, supporting the following runtime fault-tolerant techniques. In this work, we propose a novel anomaly detection method named DistAD, which is based on the distribution of software runtime dynamic execution traces. Unlike other existing works using key performance indicators, the execution trace is collected during runtime via intrusive instrumentation. Instrumentation are controlled following a sampling mechanism to avoid excessive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Engineering Research · Anomaly Detection Techniques and Applications
