The CTI Echo Chamber: Fragmentation, Overlap, and Vendor Specificity in Twenty Years of Cyber Threat Reporting
Manuel Suarez-Roman, Francesco Marchiori, Mauro Conti, Juan Tapiador

TL;DR
This paper presents a large-scale analysis of two decades of open-source Cyber Threat Intelligence reports, revealing fragmentation, biases, and overlaps in the industry using an LLM-based data extraction pipeline.
Contribution
It introduces a high-precision, automated pipeline for structuring and analyzing extensive CTI reports, providing insights into industry biases and threat actor dynamics.
Findings
Low overlap between vendor reports indicates fragmented intelligence sources.
Identification of geographic and sectoral biases in CTI reporting.
Quantification of threat actor-motivation-victim relationships over 20 years.
Abstract
Despite the high volume of open-source Cyber Threat Intelligence (CTI), our understanding of long-term threat actor-victim dynamics remains fragmented due to inconsistent reporting standards and the lack of structured datasets containing comprehensive analytic information. In this paper, we present a large-scale automated analysis of open-source CTI reports spanning two decades. We develop a high-precision, LLM-based pipeline to ingest and structure 16,096 reports, extracting key entities such as attributed threat actors, motivations, victims, reporting vendors, and technical indicators (IoCs and TTPs). Our analysis quantifies the evolution of CTI information density and specialization, characterizing patterns that relate specific threat actors to motivations and victim profiles. Furthermore, we perform a meta-analysis of the CTI industry itself. We identify a fragmented ecosystem of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCybercrime and Law Enforcement Studies · Information and Cyber Security · Cybersecurity and Cyber Warfare Studies
