A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning

Yinbo Yu; Jiajia Liu; Shouqing Li; Kepu Huang; Xudong Feng

arXiv:2205.02589·cs.LG·December 13, 2022

A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning

Yinbo Yu, Jiajia Liu, Shouqing Li, Kepu Huang, Xudong Feng

PDF

Open Access

TL;DR

This paper introduces a novel temporal-pattern backdoor attack on deep reinforcement learning that uses sequences of observations as triggers, demonstrating high effectiveness and stealthiness in cloud computing tasks.

Contribution

It proposes a new backdoor attack leveraging temporal constraints in DRL, which is more stealthy and controllable than traditional single-observation triggers.

Findings

01

Achieves 97.8% clean data accuracy

02

Attains 97.5% attack success rate

03

Effective in cloud job scheduling tasks

Abstract

Deep reinforcement learning (DRL) has made significant achievements in many real-world applications. But these real-world applications typically can only provide partial observations for making decisions due to occlusions and noisy sensors. However, partial state observability can be used to hide malicious behaviors for backdoors. In this paper, we explore the sequential nature of DRL and propose a novel temporal-pattern backdoor attack to DRL, whose trigger is a set of temporal constraints on a sequence of observations rather than a single observation, and effect can be kept in a controllable duration rather than in the instant. We validate our proposed backdoor attack to a typical job scheduling task in cloud computing. Numerous experimental results show that our backdoor can achieve excellent effectiveness, stealthiness, and sustainability. Our backdoor's average clean data accuracy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Smart Grid Security and Resilience · Privacy-Preserving Technologies in Data