Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan, Yasaman Esfandiari, Xian Yeow Lee, Aakanksha, Soumik, Sarkar

TL;DR
This paper demonstrates that deep reinforcement learning agents can be made more robust against actuator attacks by employing adversarial training focused on the action space, addressing vulnerabilities in control applications.
Contribution
It introduces an adversarial training method targeting action space perturbations to enhance the robustness of DRL agents in control tasks.
Findings
Adversarial training improves agent robustness against actuator attacks.
Robustified agents maintain performance under adversarial conditions.
Method applicable to various control scenarios.
Abstract
Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are becoming prevalent in various sectors of modern society such as transportation, industrial, and power grids. Recent studies in deep reinforcement learning (DRL) have demonstrated its benefits in a large variety of data-driven decisions and control applications. As reliance on ML-enabled systems grows, it is imperative to study the performance of these systems under malicious state and actuator attacks. Traditional control systems employ resilient/fault-tolerant controllers that counter these attacks by correcting the system via error observations. However, in some applications, a resilient controller may not be sufficient to avoid a catastrophic failure. Ideally, a robust approach is more useful in these scenarios where a system is inherently robust (by design) to adversarial attacks. While robust control has a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
