Harnessing TI Feeds for Exploitation Detection
Kajal Patel, Zubair Shafiq, Mateus Nogueira, Daniel Sadoc, Menasch\'e, Enrico Lovat, Taimur Kashif, Ashton Woiwood, Matheus, Martins

TL;DR
This paper introduces a machine learning pipeline that leverages embedding techniques and supervised classification to automatically detect vulnerability exploitation events from diverse Threat Intelligence feeds, aiding proactive cybersecurity measures.
Contribution
It presents a novel approach combining Doc2Vec and BERT embeddings with supervised learning to identify exploitation events in loosely structured TI feeds, validated on 191 feeds.
Findings
Accurately detects exploitation events using only past data for training.
Effective across multiple TI feeds, including unseen ones.
Supports downstream vulnerability risk assessment tasks.
Abstract
Many organizations rely on Threat Intelligence (TI) feeds to assess the risk associated with security threats. Due to the volume and heterogeneity of data, it is prohibitive to manually analyze the threat information available in different loosely structured TI feeds. Thus, there is a need to develop automated methods to vet and extract actionable information from TI feeds. To this end, we present a machine learning pipeline to automatically detect vulnerability exploitation from TI feeds. We first model threat vocabulary in loosely structured TI feeds using state-of-the-art embedding techniques (Doc2Vec and BERT) and then use it to train a supervised machine learning classifier to detect exploitation of security vulnerabilities. We use our approach to identify exploitation events in 191 different TI feeds. Our longitudinal evaluation shows that it is able to accurately identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection
