Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning

Hang Xu; Xinghua Qu; Zinovi Rabinovich

arXiv:2304.12151 · cs.LG · April 25, 2023 · 1 citation

TL;DR

This paper presents a resource-efficient, knowledge-sharing-based policy resilience mechanism for reinforcement learning that quickly diagnoses and recovers from environment poisoning attacks, maintaining policy performance.

Contribution

It introduces a federated, meta-learning approach for policy resilience, enabling rapid detection and recovery from poisoning attacks in RL policies.

Findings

01 Effective in restoring policy performance after poisoning

02 Applicable to both model-based and model-free RL algorithms

03 Demonstrates efficiency and robustness in empirical evaluations

Abstract

This paper investigates policy resilience to training-environment poisoning attacks on reinforcement learning (RL) policies, with the goal of recovering the deployment performance of a poisoned RL policy. Because policy resilience is an add-on concern for RL algorithms, it should be resource-efficient, time-conserving, and widely applicable without compromising the performance of RL algorithms. This paper proposes such a policy-resilience mechanism based on the idea of knowledge sharing. We divide policy resilience into three stages: preparation, diagnosis, and recovery. Specifically, we design the mechanism as a federated architecture coupled with meta-learning, pursuing efficient extraction and sharing of environment knowledge. With the shared knowledge, a poisoned agent can quickly identify the deployment condition and accordingly recover its policy…
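
The three stages named in the abstract can be illustrated with a toy sketch. This is not the authors' mechanism: every name below (`prepare`, `diagnose`, `recover`, `greedy_policy`, the `tol` and `step` parameters) is hypothetical, environment knowledge is assumed to be a simple table of estimated rewards per (state, action) pair, plain averaging stands in for federated aggregation, and a single interpolation step stands in for meta-learned adaptation.

```python
from statistics import mean

# Assumption: "environment knowledge" is a dict mapping (state, action) -> estimated reward.

def greedy_policy(model):
    """Pick, for each state, the action with the highest estimated reward."""
    best = {}
    for (state, action), reward in model.items():
        if state not in best or reward > model[(state, best[state])]:
            best[state] = action
    return best

def prepare(local_models):
    """Preparation: average the agents' local estimates into shared knowledge."""
    keys = set().union(*local_models)
    return {k: mean(m[k] for m in local_models if k in m) for k in keys}

def diagnose(shared, trained, observed, tol=0.1):
    """Diagnosis: flag poisoning when a few deployment samples fit the shared
    knowledge better than the agent's own training-time model by more than tol."""
    err_trained = mean(abs(r - trained[k]) for k, r in observed.items())
    err_shared = mean(abs(r - shared[k]) for k, r in observed.items())
    return err_trained - err_shared > tol

def recover(shared, observed, step=0.5):
    """Recovery: move shared knowledge toward the deployment samples, then re-plan."""
    adapted = dict(shared)
    for k, r in observed.items():
        adapted[k] += step * (r - adapted[k])
    return greedy_policy(adapted)

# Toy data: one state, two actions; agent_c was trained on poisoned rewards.
agent_a = {("s0", "left"): 1.0, ("s0", "right"): 0.0}
agent_b = {("s0", "left"): 0.9, ("s0", "right"): 0.1}
agent_c = {("s0", "left"): 0.0, ("s0", "right"): 1.0}
deployment_samples = {("s0", "left"): 1.0, ("s0", "right"): 0.0}

shared = prepare([agent_a, agent_b, agent_c])
poisoned_policy = greedy_policy(agent_c)
c_is_poisoned = diagnose(shared, agent_c, deployment_samples)
a_is_poisoned = diagnose(shared, agent_a, deployment_samples)
recovered_policy = recover(shared, deployment_samples)
```

In this toy setting the poisoned agent initially prefers `right`, is flagged by `diagnose`, and prefers `left` after `recover`, while the clean agent is not flagged. The sketch only mirrors the preparation, diagnosis, and recovery split; the abstract does not specify the paper's actual models, aggregation rule, or adaptation procedure.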

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Smart Grid Security and Resilience