Recursive Reasoning Graph for Multi-Agent Reinforcement Learning

Xiaobai Ma; David Isele; Jayesh K. Gupta; Kikuo Fujimura; Mykel J. Kochenderfer

arXiv:2203.02844 · cs.LG · March 8, 2022

TL;DR

This paper introduces the Recursive Reasoning Graph (R2G), a novel multi-agent reinforcement learning algorithm that enhances agents' ability to anticipate others' responses, leading to improved cooperation and competition in complex multi-agent environments.

Contribution

The paper proposes R2G, a recursive reasoning model used within a centralized-training-decentralized-execution framework, and reports state-of-the-art performance on multiple multi-agent particle and robotics games.

Findings

01. R2G outperforms existing algorithms in multi-agent particle games.

02. R2G demonstrates superior performance in robotics simulation environments.

03. Recursive reasoning improves strategic interactions among agents.

Abstract

Multi-agent reinforcement learning (MARL) provides an efficient way for simultaneously learning policies for multiple agents interacting with each other. However, in scenarios requiring complex interactions, existing algorithms can suffer from an inability to accurately anticipate the influence of self-actions on other agents. Incorporating an ability to reason about other agents' potential responses can allow an agent to formulate more effective strategies. This paper adopts a recursive reasoning model in a centralized-training-decentralized-execution framework to help learning agents better cooperate with or compete against others. The proposed algorithm, referred to as the Recursive Reasoning Graph (R2G), shows state-of-the-art performance on multiple multi-agent particle and robotics games.
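To make the idea of anticipating another agent's response concrete, the following is a minimal sketch on a hypothetical two-agent matrix game. It is not the R2G algorithm from the paper: the payoff tables and the function names `level0_action`, `best_response`, and `level1_action` are assumptions introduced purely for illustration. A level-0 agent ignores how the other agent will react, while a level-1 agent picks the action whose anticipated response gives it the best payoff.

```python
# Illustrative only: one step of recursive reasoning in a 2x2 matrix game.
# Payoffs and function names are hypothetical, not taken from the paper.

def level0_action(own_payoff):
    """Choose the action with the best average payoff,
    assuming the other agent acts uniformly at random."""
    averages = [sum(row) / len(row) for row in own_payoff]
    return max(range(len(averages)), key=averages.__getitem__)

def best_response(other_payoff, my_action):
    """Return the other agent's payoff-maximizing reply to my fixed action."""
    row = other_payoff[my_action]
    return max(range(len(row)), key=row.__getitem__)

def level1_action(own_payoff, other_payoff):
    """Choose the action whose anticipated reply yields the highest own payoff."""
    values = [
        own_payoff[a][best_response(other_payoff, a)]
        for a in range(len(own_payoff))
    ]
    return max(range(len(values)), key=values.__getitem__)

# Rows index agent 1's action, columns index agent 2's action.
A = [[4.0, 0.0], [2.0, 1.0]]  # agent 1's payoffs (assumed)
B = [[0.0, 1.0], [1.0, 0.0]]  # agent 2's payoffs (assumed)

naive = level0_action(A)        # ignores agent 2's reaction
anticipating = level1_action(A, B)  # reasons about agent 2's reply
```

In this toy game the two choices differ: the naive agent is drawn to the action with the highest average payoff, but agent 2's best reply to that action leaves agent 1 with nothing, so the anticipating agent picks the other action instead.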

Taxonomy

Topics: Reinforcement Learning in Robotics