Leveraging large language models for SQL behavior-based database intrusion detection
Meital Shlezinger, Shay Akirav, Lei Zhou, Liang Guo, Avi Kessel, and Guoliang Li

TL;DR
This paper presents a two-tiered SQL anomaly detection system leveraging DistilBERT, combining unsupervised and supervised learning to effectively identify both internal and external database intrusions with minimal data labeling.
Contribution
Introduces a novel hybrid approach using DistilBERT for SQL anomaly detection, improving accuracy and efficiency in identifying sophisticated database threats.
Findings
Effective detection of in-scope and out-of-scope SQL anomalies
High precision in internal attack detection with limited labeled data
Reduced false positives compared to traditional methods
Abstract
Database systems are extensively used to store critical data across various domains. However, the frequency of abnormal database access behaviors, such as database intrusion by internal and external attacks, continues to rise. Internal masqueraders often have greater organizational knowledge, making it easier to mimic employee behavior effectively. In contrast, external masqueraders may behave differently due to their lack of familiarity with the organization. Current approaches lack the granularity needed to detect anomalies at the operational level, frequently misclassifying entire sequences of operations as anomalies, even though most operations are likely to represent normal behavior. On the other hand, some anomalous behaviors often resemble normal activities, making them difficult for existing detection methods to identify. This paper introduces a two-tiered anomaly detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Spam and Phishing Detection
