STRisk: A Socio-Technical Approach to Assess Hacking Breaches Risk
Hicham Hammouchi, Narjisse Nejjari, Ghita Mezzour, Mounir Ghogho,, Houda Benbrahim

TL;DR
This paper introduces STRisk, a socio-technical predictive system that combines technical indicators and social media factors to accurately assess hacking breach risks for organizations, achieving high predictive performance.
Contribution
We develop a novel socio-technical model incorporating social media data into breach risk prediction, improving accuracy over purely technical approaches.
Findings
Achieved over 98% AUC in breach prediction.
Social features like spreadability and agreeability are key predictors.
Technical features such as open ports and expired certificates are highly influential.
Abstract
Data breaches have begun to take on new dimensions and their prediction is becoming of great importance to organizations. Prior work has addressed this issue mainly from a technical perspective and neglected other interfering aspects such as the social media dimension. To fill this gap, we propose STRisk which is a predictive system where we expand the scope of the prediction task by bringing into play the social media dimension. We study over 3800 US organizations including both victim and non-victim organizations. For each organization, we design a profile composed of a variety of externally measured technical indicators and social factors. In addition, to account for unreported incidents, we consider the non-victim sample to be noisy and propose a noise correction approach to correct mislabeled organizations. We then build several machine learning models to predict whether an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
