SAQL: A Stream-based Query System for Real-Time Abnormal System Behavior Detection
Peng Gao, Xusheng Xiao, Ding Li, Zhichun Li, Kangkook Jee, Zhenyu Wu,, Chung Hwan Kim, Sanjeev R. Kulkarni, Prateek Mittal

TL;DR
SAQL is a real-time, stream-based query system designed for detecting abnormal system behaviors in large-scale enterprise environments, enabling timely cyber attack detection with a specialized query language.
Contribution
The paper introduces SAQL, a novel domain-specific query language and system for real-time anomaly detection over large-scale system event streams, addressing latency and efficiency challenges.
Findings
Detection latency under 2 seconds
Supports 110,000 events per second
Efficient memory utilization compared to existing systems
Abstract
Recently, advanced cyber attacks, which consist of a sequence of steps that involve many vulnerabilities and hosts, compromise the security of many well-protected businesses. This has led to the solutions that ubiquitously monitor system activities in each host (big data) as a series of events, and search for anomalies (abnormal behaviors) for triaging risky events. Since fighting against these attacks is a time-critical mission to prevent further damage, these solutions face challenges in incorporating expert knowledge to perform timely anomaly detection over the large-scale provenance data. To address these challenges, we propose a novel stream-based query system that takes as input, a real-time event feed aggregated from multiple hosts in an enterprise, and provides an anomaly query engine that queries the event feed to identify abnormal behaviors based on the specified anomalies.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Software System Performance and Reliability · Data Quality and Management
