The Use of Large Language Models (LLM) for Cyber Threat Intelligence (CTI) in Cybercrime Forums
Vanessa Clairoux-Trepanier, Isa-May Beauchamp, Estelle Ruellan,, Masarah Paquet-Clouston, Serge-Olivier Paquette, Eric Clay

TL;DR
This study evaluates the effectiveness of GPT-3.5-turbo in extracting cyber threat intelligence from cybercrime forums, demonstrating high accuracy and identifying areas for improvement in prompt design.
Contribution
It provides an empirical assessment of LLM performance in CTI extraction from cybercrime forums, highlighting practical accuracy metrics and enhancement strategies.
Findings
LLM achieved 96.23% accuracy in CTI extraction.
Prompt design impacts LLM performance and accuracy.
LLMs are relevant tools for cyber threat intelligence analysis.
Abstract
Large language models (LLMs) can be used to analyze cyber threat intelligence (CTI) data from cybercrime forums, which contain extensive information and key discussions about emerging cyber threats. However, to date, the level of accuracy and efficiency of LLMs for such critical tasks has yet to be thoroughly evaluated. Hence, this study assesses the performance of an LLM system built on the OpenAI GPT-3.5-turbo model [8] to extract CTI information. To do so, a random sample of more than 700 daily conversations from three cybercrime forums - XSS, Exploit_in, and RAMP - was extracted, and the LLM system was instructed to summarize the conversations and predict 10 key CTI variables, such as whether a large organization and/or a critical infrastructure is being targeted, with only simple human-language instructions. Then, two coders reviewed each conversation and evaluated whether the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCybercrime and Law Enforcement Studies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Multi-Head Attention · {Dispute@FaQ-s}How to file a dispute with Expedia? · Cosine Annealing · Weight Decay
