Directionality Reinforcement Learning to Operate Multi-Agent System   without Communication

Fumito Uwano; Keiki Takadama

arXiv:2110.05773·cs.MA·October 13, 2021

Directionality Reinforcement Learning to Operate Multi-Agent System without Communication

Fumito Uwano, Keiki Takadama

PDF

Open Access

TL;DR

This paper introduces directionality reinforcement learning (DRL), a decentralized method enabling multi-agent cooperation without communication or observation, improving efficiency and learning yielding behaviors in maze tasks.

Contribution

The paper proposes a novel DRL approach that enhances multi-agent cooperation without communication, demonstrating its effectiveness through maze experiments.

Findings

01

DRL outperforms previous methods in maze solving time

02

Agents learn to yield for others through directionality

03

DRL achieves multi-agent learning with low costs regardless of agent number

Abstract

This paper establishes directionality reinforcement learning (DRL) technique to propose the complete decentralized multi-agent reinforcement learning method which can achieve cooperation based on each agent's learning: no communication and no observation. Concretely, DRL adds the direction "agents have to learn to reach the farthest goal among reachable ones" to learning agents to operate the agents cooperatively. Furthermore, to investigate the effectiveness of the DRL, this paper compare Q-learning agent with DRL with previous learning agent in maze problems. Experimental results derive that (1) DRL performs better than the previous method in terms of the spending time, (2) the direction makes agents learn yielding action for others, and (3) DRL suggests achieving multiagent learning with few costs for any number of agents.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research