TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning

Mingxuan Zhang; Oubo Ma; Kang Wei; Songze Li; Shouling Ji

arXiv:2506.09562·cs.CR·November 19, 2025

TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning

Mingxuan Zhang, Oubo Ma, Kang Wei, Songze Li, Shouling Ji

PDF

Open Access

TL;DR

TooBadRL systematically optimizes backdoor triggers in deep reinforcement learning by adjusting injection timing, trigger dimensions, and manipulation strength, significantly improving attack success rates with minimal impact on normal performance.

Contribution

This paper introduces TooBadRL, the first framework to systematically optimize DRL backdoor triggers across multiple aspects, enhancing attack effectiveness.

Findings

01

Outperforms baseline methods in attack success rate

02

Maintains high normal task performance

03

Effective across multiple DRL algorithms and benchmarks

Abstract

Deep reinforcement learning (DRL) has achieved remarkable success in a wide range of sequential decision-making applications, including robotics, healthcare, smart grids, and finance. Recent studies reveal that adversaries can implant backdoors into DRL agents during the training phase. These backdoors can later be activated by specific triggers during deployment, compelling the agent to execute targeted actions and potentially leading to severe consequences, such as drone crashes or vehicle collisions. However, existing backdoor attacks utilize simplistic and heuristic trigger configurations, overlooking the critical impact of trigger design on attack effectiveness. To address this gap, we introduce TooBadRL, the first framework to systematically optimize DRL backdoor triggers across three critical aspects: injection timing, trigger dimension, and manipulation magnitude. Specifically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Malware Detection Techniques