Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses

Wu Yichao; Wang Yirui; Ding Panpan; Wang Hailong; Zhu Bingqian; Liu Chun

arXiv:2510.20314·cs.CR·October 24, 2025

Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses

Wu Yichao, Wang Yirui, Ding Panpan, Wang Hailong, Zhu Bingqian, Liu Chun

PDF

Open Access

TL;DR

This survey comprehensively reviews adversarial attacks and defenses in deep reinforcement learning, highlighting current challenges, methodologies, and future research directions to improve security and robustness in dynamic environments.

Contribution

It provides a systematic classification of adversarial attacks and summarizes existing defense strategies, offering insights into their advantages and limitations.

Findings

01

Detailed analysis of attack methods targeting state, action, reward, and model spaces.

02

Summary of robustness training strategies including adversarial training and detection.

03

Discussion of future research needs in generalization, scalability, and explainability.

Abstract

With the wide application of deep reinforcement learning (DRL) techniques in complex fields such as autonomous driving, intelligent manufacturing, and smart healthcare, how to improve its security and robustness in dynamic and changeable environments has become a core issue in current research. Especially in the face of adversarial attacks, DRL may suffer serious performance degradation or even make potentially dangerous decisions, so it is crucial to ensure their stability in security-sensitive scenarios. In this paper, we first introduce the basic framework of DRL and analyze the main security challenges faced in complex and changing environments. In addition, this paper proposes an adversarial attack classification framework based on perturbation type and attack target and reviews the mainstream adversarial attack methods against DRL in detail, including various attack methods such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Smart Grid Security and Resilience · Explainable Artificial Intelligence (XAI)