Jasmine: A New Active Learning Approach to Combat Cybercrime
Jan Klein, Sandjai Bhulai, Mark Hoogendoorn, Rob van der Mei

TL;DR
Jasmine is a hybrid active learning method for cybersecurity that dynamically adjusts its query strategy to improve intrusion detection accuracy with minimal labeled data.
Contribution
It introduces a novel dynamic updating mechanism that learns the optimal query strategy during the labeling process, unlike static methods.
Findings
Jasmine outperforms static query strategies in robustness and accuracy.
Dynamic updating leads to more effective use of limited labeled data.
Jasmine achieves consistent improvements over existing active learning approaches.
Abstract
Over the past decade, the advent of cybercrime has accelarated the research on cybersecurity. However, the deployment of intrusion detection methods falls short. One of the reasons for this is the lack of realistic evaluation datasets, which makes it a challenge to develop techniques and compare them. This is caused by the large amounts of effort it takes for a cyber analyst to classify network connections. This has raised the need for methods (i) that can learn from small sets of labeled data, (ii) that can make predictions on large sets of unlabeled data, and (iii) that request the label of only specially selected unlabeled data instances. Hence, Active Learning (AL) methods are of interest. These approaches choose specific unlabeled instances by a query function that are expected to improve overall classification performance. The resulting query observations are labeled by a human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Spam and Phishing Detection
