Collaborative Training of Heterogeneous Reinforcement Learning Agents in   Environments with Sparse Rewards: What and When to Share?

Alain Andres; Esther Villar-Rodriguez; Javier Del Ser

arXiv:2202.12174·cs.LG·February 25, 2022

Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?

Alain Andres, Esther Villar-Rodriguez, Javier Del Ser

PDF

Open Access 1 Repo

TL;DR

This paper explores collaborative training of heterogeneous reinforcement learning agents using intrinsic motivation and transfer learning, demonstrating improved exploration and learning efficiency in complex, sparse reward environments.

Contribution

It introduces a novel framework for sharing parameters among heterogeneous agents with different state access, combining intrinsic motivation and transfer learning for enhanced exploration.

Findings

01

Collaborative framework outperforms independent learning in complex scenarios.

02

Proper modulation of intrinsic and extrinsic rewards is crucial.

03

Heterogeneous agents benefit from shared parameters with minimal computational overhead.

Abstract

In the early stages of human life, babies develop their skills by exploring different scenarios motivated by their inherent satisfaction rather than by extrinsic rewards from the environment. This behavior, referred to as intrinsic motivation, has emerged as one solution to address the exploration challenge derived from reinforcement learning environments with sparse rewards. Diverse exploration approaches have been proposed to accelerate the learning process over single- and multi-agent problems with homogeneous agents. However, scarce studies have elaborated on collaborative learning frameworks between heterogeneous agents deployed into the same environment, but interacting with different instances of the latter without any prior knowledge. Beyond the heterogeneity, each agent's characteristics grant access only to a subset of the full state space, which may hide different exploration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aklein1995/heterogeneous_agents_curiosity_vizdoom
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing