The Role of Learning in Attacking ML-based Network Intrusion Detection
Kyle Domico, Jean-Charles Noirot Ferrand, Patrick McDaniel

TL;DR
This paper introduces lightweight reinforcement learning agents that efficiently evaluate the robustness of ML-based network intrusion detection systems, including non-differentiable models, with high success rates and scalability.
Contribution
It presents a novel RL-based approach for offline training of evasion strategies that can be deployed without gradient computations, enabling scalable robustness testing.
Findings
Agents achieve up to 58.1% attack success rate.
Up to 1,042X improvement in attack throughput over gradient-based methods.
RL agents maintain effectiveness on non-differentiable models with 29.8% success.
Abstract
Machine learning (ML)-based network intrusion detection is susceptible to attacks that perturb malicious network flows to evade detection. Existing approaches to evaluating the robustness of these models rely on gradient-based optimization that are computationally expensive and restricted to differentiable model architectures. This limits their practicality for continuous, large-scale evaluation. To address this, we develop lightweight adversarial agents trained via reinforcement learning (RL) that decouples the cost of learning an evasion strategy from the cost of executing it. These agents learn offline to perturb malicious NetFlow records to evade surrogate intrusion detection models, encoding the resulting strategy into a reusable policy that requires no gradient computation at deployment. We evaluate our approach on four NetFlow datasets spanning enterprise, cloud, and IoT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
