DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns

Trung V. Phan; Tri Gia Nguyen; Thomas Bauschert

arXiv:2603.16969·cs.CR·May 5, 2026

DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns

Trung V. Phan, Tri Gia Nguyen, Thomas Bauschert

PDF

TL;DR

DeepStage is a DRL framework that uses graph neural networks and stage estimation to provide adaptive, stage-aware autonomous defense against APTs, outperforming baseline methods.

Contribution

It introduces a hierarchical DRL approach with stage inference and graph embeddings for effective multi-stage APT defense.

Findings

01

DeepStage achieves an F1-score of 0.887 in identifying attacker stages.

02

It attains an 84.7% mitigation success rate in realistic testbeds.

03

Outperforms risk-aware DRL baseline by 21.8% in F1-score.

Abstract

This paper presents DeepStage, a deep reinforcement learning (DRL) framework for adaptive and stage-aware defense against Advanced Persistent Threats (APTs). The enterprise environment is formulated as a partially observable Markov decision process (POMDP), in which host provenance and network telemetry are fused into unified provenance graphs. Building on our prior work (StageFinder), DeepStage employs a graph neural network encoder and an LSTM-based stage estimator to infer probabilistic attacker stages aligned with the MITRE ATT&CK framework. The resulting stage beliefs, together with graph embeddings, are used to guide a hierarchical Proximal Policy Optimization (PPO) agent that selects defense actions across monitoring, access control, containment, and remediation. Experiments in a realistic enterprise testbed with CALDERA-driven APT playbooks show that DeepStage achieves an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.