BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems

Jing Fang; Saihao Yan; Xueyu Yin; Yinbo Yu; Chunwei Tian; and Jiajia Liu

arXiv:2501.01593·cs.AI·July 21, 2025

BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems

Jing Fang, Saihao Yan, Xueyu Yin, Yinbo Yu, Chunwei Tian, and Jiajia Liu

PDF

Open Access

TL;DR

This paper introduces BLAST, a stealthy backdoor attack targeting multi-agent reinforcement learning systems by embedding malicious behavior in a single agent, exploiting spatiotemporal patterns to achieve system-wide compromise.

Contribution

The paper presents a novel backdoor attack method that is stealthy, leverages a single agent, and manipulates reward functions to attack cooperative multi-agent systems effectively.

Findings

01

BLAST achieves high attack success rates in multiple algorithms and environments.

02

The attack maintains low performance variance, indicating stealthiness.

03

It effectively bypasses existing defense mechanisms.

Abstract

Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will perform malicious actions leading to failures or malicious goals. However, existing backdoor attacks suffer from several issues, e.g., instant trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, in this paper, we propose a novel backdoor leverage attack against c-MADRL, BLAST, which attacks the entire multi-agent team by embedding the backdoor only in a single agent. Firstly, we introduce adversary spatiotemporal behavior patterns as the backdoor trigger rather than manual-injected fixed visual patterns or instant status and control the period to perform malicious actions. This method can guarantee the stealthiness and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning