Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres, Esther Villar-Rodriguez, Javier Del Ser

TL;DR
This paper explores collaborative training of heterogeneous reinforcement learning agents using intrinsic motivation and transfer learning, demonstrating improved exploration and learning efficiency in complex, sparse reward environments.
Contribution
It introduces a novel framework for sharing parameters among heterogeneous agents with different state access, combining intrinsic motivation and transfer learning for enhanced exploration.
Findings
Collaborative framework outperforms independent learning in complex scenarios.
Proper modulation of intrinsic and extrinsic rewards is crucial.
Heterogeneous agents benefit from shared parameters with minimal computational overhead.
Abstract
In the early stages of human life, babies develop their skills by exploring different scenarios motivated by their inherent satisfaction rather than by extrinsic rewards from the environment. This behavior, referred to as intrinsic motivation, has emerged as one solution to address the exploration challenge derived from reinforcement learning environments with sparse rewards. Diverse exploration approaches have been proposed to accelerate the learning process over single- and multi-agent problems with homogeneous agents. However, scarce studies have elaborated on collaborative learning frameworks between heterogeneous agents deployed into the same environment, but interacting with different instances of the latter without any prior knowledge. Beyond the heterogeneity, each agent's characteristics grant access only to a subset of the full state space, which may hide different exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing
