TL;DR
This paper presents CTiKG, a context-aware framework that improves the extraction of entity-relation triples from cybersecurity reports, enhancing the accuracy and robustness of threat intelligence knowledge graphs.
Contribution
The introduction of CTiKG, a hybrid NLP pipeline utilizing SecureBERT+ and domain ontology, to improve entity and relation extraction from unstructured CTI reports.
Findings
Achieved 3-4% improvements in NER performance.
Achieved up to 8% improvements in relation extraction.
Validated robustness on multiple benchmark datasets.
Abstract
Cybersecurity Knowledge Graphs (CKGs) unify diverse Cyber Threat Intelligence (CTI) sources into structured, queryable formats, offering scalable solutions for automating proactive and real-time security responses. Their increasing adoption has significantly enhanced the workflow and decision-making efficiency of security professionals. However, constructing CKGs requires extracting entity-relation triples from unstructured CTI reports, a task hindered by complex report structure, domain-specific language, and semantic ambiguity. As a result, existing pipeline-based approaches often suffer from error propagation, reducing extraction accuracy and limiting generalizability. This paper introduces the Context-aware Threat Intelligence Knowledge Graph (CTiKG) framework, a pipeline architecture designed to accurately extract and classify threat entities and their relationships from CTI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
