Malware Knowledge Graph Generation

Sharmishtha Dutta; Nidhi Rastogi; Destin Yee; Chuqiao Gu; Qicheng Ma

arXiv:2102.05583·cs.CR·February 11, 2021

Malware Knowledge Graph Generation

Sharmishtha Dutta, Nidhi Rastogi, Destin Yee, Chuqiao Gu, Qicheng Ma

PDF

TL;DR

This paper introduces TINKER, a novel open-source knowledge graph for cybersecurity threat intelligence, generated from unstructured threat reports to facilitate better understanding and prediction of cyber threats.

Contribution

The paper presents the creation of TINKER, the first open-source security knowledge graph built from natural language threat reports using malware ontology and RDF triples.

Findings

01

TINKER contains RDF triples from 83 threat reports (2006-2021).

02

It demonstrates the feasibility of converting unstructured threat data into a structured knowledge graph.

03

Challenges in ontology alignment and data annotation are discussed.

Abstract

Cyber threat and attack intelligence information are available in non-standard format from heterogeneous sources. Comprehending them and utilizing them for threat intelligence extraction requires engaging security experts. Knowledge graphs enable converting this unstructured information from heterogeneous sources into a structured representation of data and factual knowledge for several downstream tasks such as predicting missing information and future threat trends. Existing large-scale knowledge graphs mainly focus on general classes of entities and relationships between them. Open-source knowledge graphs for the security domain do not exist. To fill this gap, we've built \textsf{TINKER} - a knowledge graph for threat intelligence (\textbf{T}hreat \textbf{IN}telligence \textbf{K}nowl\textbf{E}dge g\textbf{R}aph). \textsf{TINKER} is generated using RDF triples describing entities and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.