Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven Deep Reinforcement Learning
Zahra Aref, Sheng Wei, Narayan B. Mandayam

TL;DR
This paper introduces a cognitive hierarchy-driven deep reinforcement learning framework for enhancing real-time security operations in cloud environments, effectively modeling human-AI interactions to improve threat mitigation.
Contribution
It develops a novel CHT-DQN framework integrating cognitive hierarchy theory into reinforcement learning for better SOC decision support against APTs.
Findings
CHT-DQN outperforms standard DQN in simulations with complex attack graphs.
Human-in-the-loop evaluation shows improved alignment with attacker strategies.
Behavioral analysis reveals human decision biases consistent with Prospect Theory.
Abstract
Given the complexity of multi-tenant cloud environments and the growing need for real-time threat mitigation, Security Operations Centers (SOCs) must adopt AI-driven adaptive defense mechanisms to counter Advanced Persistent Threats (APTs). However, SOC analysts face challenges in handling adaptive adversarial tactics, requiring intelligent decision-support frameworks. We propose a Cognitive Hierarchy Theory-driven Deep Q-Network (CHT-DQN) framework that models interactive decision-making between SOC analysts and AI-driven APT bots. The SOC analyst (defender) operates at cognitive level-1, anticipating attacker strategies, while the APT bot (attacker) follows a level-0 policy. By incorporating CHT into DQN, our framework enhances adaptive SOC defense using Attack Graph (AG)-based reinforcement learning. Simulation experiments across varying AG complexities show that CHT-DQN consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection
MethodsADaptive gradient method with the OPTimal convergence rate · Dense Connections · Q-Learning · Convolution · Deep Q-Network · ALIGN
