TL;DR
This paper introduces a novel environment disturbance method for deep reinforcement learning that uses gradient-based attacks on the critic network, resulting in faster, more effective robustness improvements compared to existing adversarial approaches.
Contribution
The paper proposes a new environment attack method based on gradient-based adversarial attacks on the critic network, avoiding complex attacker policies and enhancing robustness efficiently.
Findings
Our method outperforms existing adversarial RL approaches in robustness enhancement.
It is faster and computationally lighter than previous methods.
Results show significant robustness improvements in tested environments.
Abstract
To improve policy robustness of deep reinforcement learning agents, a line of recent works focus on producing disturbances of the environment. Existing approaches of the literature to generate meaningful disturbances of the environment are adversarial reinforcement learning methods. These methods set the problem as a two-player game between the protagonist agent, which learns to perform a task in an environment, and the adversary agent, which learns to disturb the protagonist via modifications of the considered environment. Both protagonist and adversary are trained with deep reinforcement learning algorithms. Alternatively, we propose in this paper to build on gradient-based adversarial attacks, usually used for classification tasks for instance, that we apply on the critic network of the protagonist to identify efficient disturbances of the environment. Rather than learning an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
