EUREKHA: Enhancing User Representation for Key Hackers Identification in Underground Forums
Abdoul Nasser Hassane Amadou, Anas Motii, Saida Elouardi, EL Houcine, Bergou

TL;DR
EUREKHA is a novel approach that combines large language models, graph neural networks, and topic modeling to accurately identify key hackers in underground forums, significantly outperforming existing methods.
Contribution
The paper introduces EUREKHA, a new method integrating LLMs, GNNs, and topic modeling for improved key hacker identification in underground forums.
Findings
Fine-tuned LLMs outperform state-of-the-art methods.
Combining LLMs with GNNs improves accuracy and F1-score.
EUREKHA achieves 6-10% performance gains over existing approaches.
Abstract
Underground forums serve as hubs for cybercriminal activities, offering a space for anonymity and evasion of conventional online oversight. In these hidden communities, malicious actors collaborate to exchange illicit knowledge, tools, and tactics, driving a range of cyber threats from hacking techniques to the sale of stolen data, malware, and zero-day exploits. Identifying the key instigators (i.e., key hackers), behind these operations is essential but remains a complex challenge. This paper presents a novel method called EUREKHA (Enhancing User Representation for Key Hacker Identification in Underground Forums), designed to identify these key hackers by modeling each user as a textual sequence. This sequence is processed through a large language model (LLM) for domain-specific adaptation, with LLMs acting as feature extractors. These extracted features are then fed into a Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · User Authentication and Security Systems · Digital and Cyber Forensics
MethodsGraph Neural Network
