Experimental Evidence that Empowerment May Drive Exploration in   Sparse-Reward Environments

Francesco Massari; Martin Biehl; Lisa Meeden; Ryota Kanai

arXiv:2107.07031·cs.AI·July 16, 2021

Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments

Francesco Massari, Martin Biehl, Lisa Meeden, Ryota Kanai

PDF

Open Access

TL;DR

This paper provides experimental evidence that empowerment-based intrinsic motivation can effectively promote exploration in sparse-reward reinforcement learning environments, comparable to curiosity-driven methods.

Contribution

It introduces an empowerment-inspired agent and compares it with a curiosity-based agent, demonstrating empowerment's potential in driving exploration.

Findings

01

Both agents benefit similarly from intrinsic rewards.

02

Empowerment can be an effective exploration strategy.

03

Experimental results support empowerment's role in sparse environments.

Abstract

Reinforcement Learning (RL) is known to be often unsuccessful in environments with sparse extrinsic rewards. A possible countermeasure is to endow RL agents with an intrinsic reward function, or 'intrinsic motivation', which rewards the agent based on certain features of the current sensor state. An intrinsic reward function based on the principle of empowerment assigns rewards proportional to the amount of control the agent has over its own sensors. We implemented a variation on a recently proposed intrinsically motivated agent, which we refer to as the 'curious' agent, and an empowerment-inspired agent. The former leverages sensor state encoding with a variational autoencoder, while the latter predicts the next sensor state via a variational information bottleneck. We compared the performance of both agents to that of an advantage actor-critic baseline in four sparse reward grid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Advanced Bandit Algorithms Research