Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems
Mingyang Geng, Kele Xu, Yiying Li, Shuqi Liu, Bo Ding, Huaimin Wang

TL;DR
This paper introduces FT-Attn, an attention-based fault-tolerant algorithm for multi-agent reinforcement learning that enhances robustness against faulty or malicious agents, improving coordination in noisy and complex environments.
Contribution
The paper proposes a novel attention-based fault-tolerant method that enables multi-agent systems to effectively communicate and adapt despite adversarial or faulty agents.
Findings
FT-Attn outperforms previous methods in complex environments.
It adapts well to noisy environments without parameter tuning.
It handles scenarios requiring multiple correct observations simultaneously.
Abstract
The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. In many real-world applications, the agents can only acquire a partial view of the world. However, in realistic settings, one or more agents that show arbitrarily faulty or malicious behavior may suffice to let the current coordination mechanisms fail. In this paper, we study a practical scenario considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. Under these circumstances, learning an optimal policy becomes particularly challenging, even in the unrealistic case that an agent's policy can be made conditional upon all other agents' observations. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) algorithm which selects correct and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Adaptive Dynamic Programming Control
MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention
