Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven   Deep Reinforcement Learning

Zahra Aref; Sheng Wei; Narayan B. Mandayam

arXiv:2502.16054·cs.CR·April 22, 2025

Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven Deep Reinforcement Learning

Zahra Aref, Sheng Wei, Narayan B. Mandayam

PDF

Open Access

TL;DR

This paper introduces a cognitive hierarchy-driven deep reinforcement learning framework for enhancing real-time security operations in cloud environments, effectively modeling human-AI interactions to improve threat mitigation.

Contribution

It develops a novel CHT-DQN framework integrating cognitive hierarchy theory into reinforcement learning for better SOC decision support against APTs.

Findings

01

CHT-DQN outperforms standard DQN in simulations with complex attack graphs.

02

Human-in-the-loop evaluation shows improved alignment with attacker strategies.

03

Behavioral analysis reveals human decision biases consistent with Prospect Theory.

Abstract

Given the complexity of multi-tenant cloud environments and the growing need for real-time threat mitigation, Security Operations Centers (SOCs) must adopt AI-driven adaptive defense mechanisms to counter Advanced Persistent Threats (APTs). However, SOC analysts face challenges in handling adaptive adversarial tactics, requiring intelligent decision-support frameworks. We propose a Cognitive Hierarchy Theory-driven Deep Q-Network (CHT-DQN) framework that models interactive decision-making between SOC analysts and AI-driven APT bots. The SOC analyst (defender) operates at cognitive level-1, anticipating attacker strategies, while the APT bot (attacker) follows a level-0 policy. By incorporating CHT into DQN, our framework enhances adaptive SOC defense using Attack Graph (AG)-based reinforcement learning. Simulation experiments across varying AG complexities show that CHT-DQN consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection

MethodsADaptive gradient method with the OPTimal convergence rate · Dense Connections · Q-Learning · Convolution · Deep Q-Network · ALIGN