AEGIS: White-Box Attack Path Generation using LLMs and Training Effectiveness Evaluation for Large-Scale Cyber Defence Exercises

Ivan K. Tung; Yu Xiang Shi; Alex Chien; Wenkai Liu; Lawrence Zheng

arXiv:2601.22720 · cs.CR · February 2, 2026

TL;DR

AEGIS is a system that automates attack path generation for cyber defence exercises using large language models and Monte Carlo Tree Search, reducing development time and matching human-authored scenarios in training effectiveness.

Contribution

The paper introduces AEGIS, a novel system combining LLMs and white-box validation to generate attack paths without pre-existing vulnerability graphs, streamlining scenario creation.

Findings

01 AEGIS-generated attack paths are comparable to human-authored scenarios in training effectiveness.

02 Automating exploit discovery reduces scenario development from months to days.

03 The system was validated at CIDeX 2025, a large-scale cyber defence exercise.

Abstract

Creating attack paths for cyber defence exercises requires substantial expert effort. Existing automation requires vulnerability graphs or exploit sets curated in advance, limiting where it can be applied. We present AEGIS, a system that generates attack paths using LLMs, white-box access, and Monte Carlo Tree Search over real exploit execution. LLM-based search discovers exploits dynamically without pre-existing vulnerability graphs, while white-box access enables validating exploits in isolation before committing to attack paths. Evaluation at CIDeX 2025, a large-scale exercise spanning 46 IT hosts, showed that AEGIS-generated paths are comparable to human-authored scenarios across four dimensions of training experience (perceived learning, engagement, believability, challenge). Results were measured with a validated questionnaire extensible to general simulation-based training. By…
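To make the search idea in the abstract concrete, here is a minimal sketch of Monte Carlo Tree Search over a toy host graph. It is not the authors' implementation. Every name in it (`NETWORK`, `VALIDATED`, `try_exploit`, `search`) is hypothetical. The LLM-driven exploit discovery is replaced by a fixed adjacency table, and real exploit execution is replaced by a lookup in a set of pre-validated hops.

```python
import math
import random

# Hypothetical toy network: host -> hosts an attacker could try next.
# In the paper this candidate set comes from LLM-based search; here it is fixed.
NETWORK = {
    "entry": ["web", "mail"],
    "web": ["db", "target"],
    "mail": ["files"],
    "db": ["target"],
    "files": [],
    "target": [],
}
# Hypothetical stand-in for exploits that were validated in isolation.
VALIDATED = {("entry", "web"), ("entry", "mail"), ("web", "db"),
             ("mail", "files"), ("db", "target")}


def try_exploit(src, dst):
    """Placeholder for real exploit execution: a simple table lookup."""
    return (src, dst) in VALIDATED


class Node:
    def __init__(self, host, parent=None, blocked=False):
        self.host, self.parent, self.blocked = host, parent, blocked
        self.children = {}
        self.visits = 0
        self.value = 0.0

    def untried(self):
        # A hop whose exploit failed cannot be extended further.
        if self.blocked:
            return []
        return [h for h in NETWORK[self.host] if h not in self.children]


def ucb1(parent, child, c=1.4):
    # Average reward plus an exploration bonus for rarely visited children.
    explore = math.sqrt(math.log(parent.visits) / child.visits)
    return child.value / child.visits + c * explore


def rollout(host, goal, rng, max_depth=10):
    # Random walk along working exploits; reward 1.0 if the goal is reached.
    for _ in range(max_depth):
        if host == goal:
            return 1.0
        options = [h for h in NETWORK[host] if try_exploit(host, h)]
        if not options:
            return 0.0
        host = rng.choice(options)
    return 1.0 if host == goal else 0.0


def search(start, goal, iterations=200, seed=0):
    rng = random.Random(seed)
    root = Node(start)
    for _ in range(iterations):
        node = root
        # Selection: descend through fully expanded nodes using UCB1.
        while not node.untried() and node.children:
            node = max(node.children.values(), key=lambda ch: ucb1(node, ch))
        untried = node.untried()
        if untried:
            # Expansion: attempt one new hop, then simulate from it.
            nxt = rng.choice(untried)
            ok = try_exploit(node.host, nxt)
            child = Node(nxt, parent=node, blocked=not ok)
            node.children[nxt] = child
            reward = rollout(nxt, goal, rng) if ok else 0.0
            node = child
        else:
            # Terminal node: success only if it is the goal and reachable.
            reward = 1.0 if node.host == goal and not node.blocked else 0.0
        # Backpropagation: update statistics up to the root.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Read off the most-visited path as the proposed attack path.
    path, node = [root.host], root
    while node.children:
        node = max(node.children.values(), key=lambda ch: ch.visits)
        path.append(node.host)
    return path
```

Calling `search("entry", "target")` on this toy graph settles on the path entry -> web -> db -> target, because the direct web -> target hop is not in the validated set and the mail branch ends in a dead end. The reward here is binary reachability, which is an assumption of the sketch; a real system would likely score paths on more than whether the goal was reached.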

Taxonomy

Topics: Information and Cyber Security · Network Security and Intrusion Detection · Advanced Malware Detection Techniques