Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network

Lucas Schott; Hatem Hajri; Sylvain Lamprier

arXiv:2104.03154·cs.LG·October 4, 2022


TL;DR

This paper introduces a novel environment disturbance method for deep reinforcement learning that uses gradient-based attacks on the critic network, yielding faster and more effective robustness improvements than existing adversarial reinforcement learning approaches.

Contribution

The paper proposes a new environment attack method that applies gradient-based adversarial attacks to the critic network, avoiding complex attacker policies and enhancing robustness efficiently.

Findings

01

Our method outperforms existing adversarial RL approaches in robustness enhancement.

02

It is faster and computationally lighter than previous methods.

03

Results show significant robustness improvements in tested environments.

Abstract

To improve the policy robustness of deep reinforcement learning agents, a line of recent work focuses on producing disturbances of the environment. Existing approaches in the literature for generating meaningful disturbances of the environment are adversarial reinforcement learning methods. These methods frame the problem as a two-player game between the protagonist agent, which learns to perform a task in an environment, and the adversary agent, which learns to disturb the protagonist by modifying the environment. Both protagonist and adversary are trained with deep reinforcement learning algorithms. As an alternative, we propose in this paper to build on gradient-based adversarial attacks, usually used for classification tasks, and apply them to the critic network of the protagonist to identify efficient disturbances of the environment. Rather than learning an…
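The idea of attacking the critic rather than training an adversary can be illustrated with a small sketch. It is not the authors' implementation: `toy_critic`, `numerical_gradient`, and `fgsm_environment_attack` are hypothetical names, the critic is a hand-written function instead of a trained network, the gradient is estimated by finite differences instead of backpropagation, and the sketch assumes that the environment quantities being disturbed appear directly in the critic's state input. The attack is a single FGSM-style step that moves those quantities in the direction that lowers the critic's value estimate.

```python
from typing import Callable, List, Sequence

Critic = Callable[[Sequence[float], Sequence[float]], float]


def numerical_gradient(f: Callable[[List[float]], float],
                       x: Sequence[float], h: float = 1e-5) -> List[float]:
    """Central finite-difference gradient of a scalar function f at x."""
    grad = []
    for i in range(len(x)):
        up, down = list(x), list(x)
        up[i] += h
        down[i] -= h
        grad.append((f(up) - f(down)) / (2.0 * h))
    return grad


def sign(v: float) -> int:
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def fgsm_environment_attack(critic: Critic, state: Sequence[float],
                            action: Sequence[float],
                            epsilon: float) -> List[float]:
    """One FGSM-style step on the state, in the direction that decreases Q(s, a).

    Assumption: the returned state can be written back into the environment
    as a disturbance, bounded by epsilon in each coordinate.
    """
    grad = numerical_gradient(lambda s: critic(s, action), state)
    return [s_i - epsilon * sign(g_i) for s_i, g_i in zip(state, grad)]


def toy_critic(state: Sequence[float], action: Sequence[float]) -> float:
    """Hypothetical stand-in for a learned critic Q(s, a)."""
    return -(state[0] ** 2) + 2.0 * state[1] - 0.5 * action[0] ** 2


state = [1.0, 0.5]
action = [0.2]
disturbed_state = fgsm_environment_attack(toy_critic, state, action, epsilon=0.1)
value_before = toy_critic(state, action)
value_after = toy_critic(disturbed_state, action)
```

In this toy setting the disturbed state has a lower critic value than the original one, which is the property the attack is meant to produce. With a neural critic, the finite-difference estimate would normally be replaced by the gradient obtained from automatic differentiation.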
