TSTEM: A Cognitive Platform for Collecting Cyber Threat Intelligence in the Wild
Prasasthy Balasubramanian, Sadaf Nazari, Danial Khosh Kholgh, Alireza, Mahmoodi, Justin Seby, Panos Kostakos

TL;DR
This paper introduces TSTEM, an open-source, cloud-based platform that efficiently collects, processes, and shares cyber threat intelligence from online sources in real-time, utilizing advanced NLP and microservice architecture.
Contribution
The study presents a novel containerized microservice platform, TSTEM, integrating NLP, cloud infrastructure, and automation for real-time cyber threat intelligence extraction and sharing.
Findings
High accuracy (>98%) in IOC classification and extraction.
Real-time processing within less than a minute.
Effective multi-stage IOC extraction methodology.
Abstract
The extraction of cyber threat intelligence (CTI) from open sources is a rapidly expanding defensive strategy that enhances the resilience of both Information Technology (IT) and Operational Technology (OT) environments against large-scale cyber-attacks. While previous research has focused on improving individual components of the extraction process, the community lacks open-source platforms for deploying streaming CTI data pipelines in the wild. To address this gap, the study describes the implementation of an efficient and well-performing platform capable of processing compute-intensive data pipelines based on the cloud computing paradigm for real-time detection, collecting, and sharing CTI from different online sources. We developed a prototype platform (TSTEM), a containerized microservice architecture that uses Tweepy, Scrapy, Terraform, ELK, Kafka, and MLOps to autonomously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligence, Security, War Strategy · Cognitive Computing and Networks
