DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns
Trung V. Phan, Tri Gia Nguyen, Thomas Bauschert

TL;DR
DeepStage is a DRL framework that uses graph neural networks and stage estimation to provide adaptive, stage-aware autonomous defense against APTs, outperforming baseline methods.
Contribution
It introduces a hierarchical DRL approach with stage inference and graph embeddings for effective multi-stage APT defense.
Findings
DeepStage achieves an F1-score of 0.887 in identifying attacker stages.
It attains an 84.7% mitigation success rate in realistic testbeds.
Outperforms risk-aware DRL baseline by 21.8% in F1-score.
Abstract
This paper presents DeepStage, a deep reinforcement learning (DRL) framework for adaptive and stage-aware defense against Advanced Persistent Threats (APTs). The enterprise environment is formulated as a partially observable Markov decision process (POMDP), in which host provenance and network telemetry are fused into unified provenance graphs. Building on our prior work (StageFinder), DeepStage employs a graph neural network encoder and an LSTM-based stage estimator to infer probabilistic attacker stages aligned with the MITRE ATT&CK framework. The resulting stage beliefs, together with graph embeddings, are used to guide a hierarchical Proximal Policy Optimization (PPO) agent that selects defense actions across monitoring, access control, containment, and remediation. Experiments in a realistic enterprise testbed with CALDERA-driven APT playbooks show that DeepStage achieves an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
