Multi-Level Fine-Tuning, Data Augmentation, and Few-Shot Learning for Specialized Cyber Threat Intelligence
Markus Bayer, Tobias Frey, Christian Reuter

TL;DR
This paper presents a novel approach combining transfer learning, data augmentation, and few-shot learning to develop high-quality classifiers for cyber threat intelligence from minimal labeled data, demonstrated on a real-world dataset.
Contribution
It introduces a system that effectively trains classifiers for emerging cybersecurity events using very limited labeled data, outperforming standard and state-of-the-art methods.
Findings
F1 score increased by over 21 points compared to standard training.
F1 score increased by more than 18 points over a state-of-the-art few-shot method.
Achieved near-parity with classifiers trained on 1800 instances using only 32 instances.
Abstract
Gathering cyber threat intelligence from open sources is becoming increasingly important for maintaining and achieving a high level of security as systems become larger and more complex. However, these open sources are often subject to information overload. It is therefore useful to apply machine learning models that condense the amount of information to what is necessary. Yet, previous studies and applications have shown that existing classifiers are not able to extract specific information about emerging cybersecurity events due to their low generalization ability. Therefore, we propose a system to overcome this problem by training a new classifier for each new incident. Since this requires a lot of labelled data using standard training methods, we combine three different low-data regime techniques - transfer learning, data augmentation, and few-shot learning - to train a high-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Cybercrime and Law Enforcement Studies
