SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity

Shae McFadden; Myles Foley; Elizabeth Bates; Ilias Tsingenopoulos; Sanyam Vyas; Vasilios Mavroudis; Chris Hicks; Fabio Pierazzi

arXiv:2602.08690·cs.LG·February 10, 2026

SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity

Shae McFadden, Myles Foley, Elizabeth Bates, Ilias Tsingenopoulos, Sanyam Vyas, Vasilios Mavroudis, Chris Hicks, Fabio Pierazzi

PDF

Open Access

TL;DR

This paper systematically identifies common methodological pitfalls in applying deep reinforcement learning to cybersecurity, analyzing 66 papers and demonstrating their practical impact through controlled experiments, with recommendations for more robust systems.

Contribution

It is the first comprehensive systematic review highlighting 11 pitfalls in DRL for cybersecurity and providing actionable guidance to improve research rigor and deployment.

Findings

01

Over five pitfalls per paper on average

02

Demonstrated impact of pitfalls in cyber defense and malware creation

03

Provided actionable recommendations for each identified pitfall

Abstract

Deep Reinforcement Learning (DRL) has achieved remarkable success in domains requiring sequential decision-making, motivating its application to cybersecurity problems. However, transitioning DRL from laboratory simulations to bespoke cyber environments can introduce numerous issues. This is further exacerbated by the often adversarial, non-stationary, and partially-observable nature of most cybersecurity tasks. In this paper, we identify and systematize 11 methodological pitfalls that frequently occur in DRL for cybersecurity (DRL4Sec) literature across the stages of environment modeling, agent training, performance evaluation, and system deployment. By analyzing 66 significant DRL4Sec papers (2018-2025), we quantify the prevalence of each pitfall and find an average of over five pitfalls per paper. We demonstrate the practical impact of these pitfalls using controlled experiments in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Information and Cyber Security · Network Security and Intrusion Detection