Towards a scalable AI-driven framework for data-independent Cyber Threat Intelligence Information Extraction
Olga Sorokoletova, Emanuele Antonioni, Giordano Colò

TL;DR
This paper presents 0-CTI, a scalable, modular AI framework utilizing Transformer-based NLP techniques for effective cyber threat intelligence information extraction, capable of operating with or without annotated data, and aligning outputs with industry standards.
Contribution
The introduction of 0-CTI, a novel modular framework supporting both supervised and zero-shot learning for CTI information extraction, adaptable to various data availability scenarios.
Findings
The supervised Entity Extractor outperforms the current state of the art in cyber Entity Extraction.
The framework enables fully dataless operation via zero-shot methods.
Outputs are aligned with the STIX standard for cybersecurity information exchange.
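To make the STIX alignment concrete, the following is a minimal sketch of how extracted entities and relations could be serialized as a STIX 2.1 bundle. It is not the 0-CTI implementation: the helper names (`make_sdo`, `make_relationship`, `build_bundle`), the input tuples, and the example entity names are all hypothetical. Only the JSON field names follow the STIX 2.1 specification, and only the Python standard library is used.

```python
import json
import uuid
from datetime import datetime, timezone


def stix_id(object_type):
    # STIX identifiers have the form "<object-type>--<UUID>".
    return f"{object_type}--{uuid.uuid4()}"


def make_sdo(object_type, name, timestamp, **extra):
    # Build a STIX Domain Object with the common required properties.
    sdo = {
        "type": object_type,
        "spec_version": "2.1",
        "id": stix_id(object_type),
        "created": timestamp,
        "modified": timestamp,
        "name": name,
    }
    sdo.update(extra)
    return sdo


def make_relationship(source, relationship_type, target, timestamp):
    # Build a STIX Relationship Object linking two previously built objects.
    return {
        "type": "relationship",
        "spec_version": "2.1",
        "id": stix_id("relationship"),
        "created": timestamp,
        "modified": timestamp,
        "relationship_type": relationship_type,
        "source_ref": source["id"],
        "target_ref": target["id"],
    }


def build_bundle(entities, relations):
    # entities: list of (name, stix_type) pairs (hypothetical extractor output).
    # relations: list of (source_name, relationship_type, target_name) triples.
    now = datetime.now(timezone.utc)
    timestamp = now.strftime("%Y-%m-%dT%H:%M:%S.%f")[:-3] + "Z"
    by_name = {}
    for name, stix_type in entities:
        # STIX 2.1 requires "is_family" on malware objects; False is an assumption.
        extra = {"is_family": False} if stix_type == "malware" else {}
        by_name[name] = make_sdo(stix_type, name, timestamp, **extra)
    objects = list(by_name.values())
    for source_name, relationship_type, target_name in relations:
        objects.append(
            make_relationship(
                by_name[source_name], relationship_type, by_name[target_name], timestamp
            )
        )
    return {"type": "bundle", "id": stix_id("bundle"), "objects": objects}


# Hypothetical extractor output for one sentence of a CTI report.
bundle = build_bundle(
    entities=[("ExampleGroup", "threat-actor"), ("ExampleRAT", "malware")],
    relations=[("ExampleGroup", "uses", "ExampleRAT")],
)
print(json.dumps(bundle, indent=2))
```

Keying objects by entity name keeps the sketch short, but a real pipeline would likely need to disambiguate entities that share a name or appear under aliases.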
Abstract
Cyber Threat Intelligence (CTI) is critical for mitigating threats to organizations, governments, and institutions, yet the necessary data are often dispersed across diverse formats. AI-driven solutions for CTI Information Extraction (IE) typically depend on high-quality, annotated data, which are not always available. This paper introduces 0-CTI, a scalable AI-based framework designed for efficient CTI Information Extraction. Leveraging advanced Natural Language Processing (NLP) techniques, particularly Transformer-based architectures, the proposed system processes complete text sequences of CTI reports to extract a cyber ontology of named entities and their relationships. Our contribution is the development of 0-CTI, the first modular framework for CTI Information Extraction that supports both supervised and zero-shot learning. Unlike existing state-of-the-art models that rely…
Taxonomy
Methods, Ontology
