TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning
Mingxuan Zhang, Oubo Ma, Kang Wei, Songze Li, Shouling Ji

TL;DR
TooBadRL systematically optimizes backdoor triggers in deep reinforcement learning by adjusting injection timing, trigger dimensions, and manipulation strength, significantly improving attack success rates with minimal impact on normal performance.
Contribution
This paper introduces TooBadRL, the first framework to systematically optimize DRL backdoor triggers across multiple aspects, enhancing attack effectiveness.
Findings
Outperforms baseline methods in attack success rate
Maintains high normal task performance
Effective across multiple DRL algorithms and benchmarks
Abstract
Deep reinforcement learning (DRL) has achieved remarkable success in a wide range of sequential decision-making applications, including robotics, healthcare, smart grids, and finance. Recent studies reveal that adversaries can implant backdoors into DRL agents during the training phase. These backdoors can later be activated by specific triggers during deployment, compelling the agent to execute targeted actions and potentially leading to severe consequences, such as drone crashes or vehicle collisions. However, existing backdoor attacks utilize simplistic and heuristic trigger configurations, overlooking the critical impact of trigger design on attack effectiveness. To address this gap, we introduce TooBadRL, the first framework to systematically optimize DRL backdoor triggers across three critical aspects: injection timing, trigger dimension, and manipulation magnitude. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Malware Detection Techniques
