Attention-based Fault-tolerant Approach for Multi-agent Reinforcement   Learning Systems

Mingyang Geng; Kele Xu; Yiying Li; Shuqi Liu; Bo Ding; Huaimin Wang

arXiv:1910.02240·cs.AI·October 10, 2019

Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems

Mingyang Geng, Kele Xu, Yiying Li, Shuqi Liu, Bo Ding, Huaimin Wang

PDF

Open Access

TL;DR

This paper introduces FT-Attn, an attention-based fault-tolerant algorithm for multi-agent reinforcement learning that enhances robustness against faulty or malicious agents, improving coordination in noisy and complex environments.

Contribution

The paper proposes a novel attention-based fault-tolerant method that enables multi-agent systems to effectively communicate and adapt despite adversarial or faulty agents.

Findings

01

FT-Attn outperforms previous methods in complex environments.

02

It adapts well to noisy environments without parameter tuning.

03

It handles scenarios requiring multiple correct observations simultaneously.

Abstract

The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. In many real-world applications, the agents can only acquire a partial view of the world. However, in realistic settings, one or more agents that show arbitrarily faulty or malicious behavior may suffice to let the current coordination mechanisms fail. In this paper, we study a practical scenario considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. Under these circumstances, learning an optimal policy becomes particularly challenging, even in the unrealistic case that an agent's policy can be made conditional upon all other agents' observations. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) algorithm which selects correct and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Adaptive Dynamic Programming Control

MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention