Discovering Signals from Web Sources to Predict Cyber Attacks
Palash Goyal, KSM Tozammel Hossain, Ashok Deb, Nazgol Tavabi, Nathan, Bartley, Andr'es Abeliuk, Emilio Ferrara, Kristina Lerman

TL;DR
This paper presents machine learning methods that analyze signals from web sources, including dark web discussions, to predict cyber attacks, aiming to enhance early warning systems and cybersecurity defenses.
Contribution
It introduces deep neural networks and autoregressive models that utilize external web signals for cyber attack prediction, a novel approach in this domain.
Findings
Significant increase in F1 scores for top predicted attack signals
Effective forecasting of cyber attacks using web-based signals
Potential for deployment as an early warning system
Abstract
Cyber attacks are growing in frequency and severity. Over the past year alone we have witnessed massive data breaches that stole personal information of millions of people and wide-scale ransomware attacks that paralyzed critical infrastructure of several countries. Combating the rising cyber threat calls for a multi-pronged strategy, which includes predicting when these attacks will occur. The intuition driving our approach is this: during the planning and preparation stages, hackers leave digital traces of their activities on both the surface web and dark web in the form of discussions on platforms like hacker forums, social media, blogs and the like. These data provide predictive signals that allow anticipating cyber attacks. In this paper, we describe machine learning techniques based on deep neural networks and autoregressive time series models that leverage external signals from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Complex Network Analysis Techniques · Spam and Phishing Detection
