A recommender system for efficient discovery of new anomalies in large-scale access logs
Heju Jiang, Scott Algatt, Parvez Ahammad

TL;DR
Helios is a novel recommender system that efficiently discovers unknown anomalies in large-scale access logs with minimal supervision, significantly accelerating security policy management and anomaly detection.
Contribution
The paper introduces Helios, a new recommender system that detects unseen anomalies in access logs without prior user or item information, using a bootstrapping approach and rank statistics.
Findings
Helios analyzes 4.6 billion records in under 60 minutes.
It accelerates anomaly discovery by 1 to 3 orders of magnitude.
The system is flexible with customizable metrics and measurement fields.
Abstract
We present a novel, non-standard recommender system for large-scale security policy management(SPM). Our system Helios discovers and recommends unknown and unseen anomalies in large-scale access logs with minimal supervision and no starting information on users and items. Typical recommender systems assume availability of user- and item-related information, but such information is not usually available in access logs. To resolve this problem, we first use discrete categorical labels to construct categorical combinations from access logs in a bootstrapping manner. Then, we utilize rank statistics of entity rank and order categorical combinations for recommendation. From a double-sided cold start, with minimal supervision, Helios learns to recommend most salient anomalies at large-scale, and provides visualizations to security experts to explain rationale behind the recommendations. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Spam and Phishing Detection
