Coordinated Exploration via Intrinsic Rewards for Multi-Agent   Reinforcement Learning

Shariq Iqbal; Fei Sha

arXiv:1905.12127·cs.LG·May 25, 2021·34 cites

Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning

Shariq Iqbal, Fei Sha

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical framework for multi-agent reinforcement learning that uses coordinated intrinsic rewards and dynamic exploration strategies to improve performance in sparse reward environments.

Contribution

It proposes a novel hierarchical approach that enables multi-agent coordination through intrinsic rewards and adaptive exploration mode selection.

Findings

01

Outperforms state-of-the-art methods in cooperative sparse reward tasks.

02

Effectively adapts exploration strategies to different multi-stage tasks.

03

Enhances coordination among agents through learned intrinsic reward mechanisms.

Abstract

Solving tasks with sparse rewards is one of the most important challenges in reinforcement learning. In the single-agent setting, this challenge is addressed by introducing intrinsic rewards that motivate agents to explore unseen regions of their state spaces; however, applying these techniques naively to the multi-agent setting results in agents exploring independently, without any coordination among themselves. Exploration in cooperative multi-agent settings can be accelerated and improved if agents coordinate their exploration. In this paper we introduce a framework for designing intrinsic rewards which consider what other agents have explored such that the agents can coordinate. Then, we develop an approach for learning how to dynamically select between several exploration modalities to maximize extrinsic rewards. Concretely, we formulate the approach as a hierarchical policy where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shariqiqbal2810/Multi-Explore
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Robot Manipulation and Learning