Information Content Exploration

Jacob Chmura; Hasham Burhani; Xiao Qi Shi

arXiv:2310.06777·cs.LG·October 11, 2023

Information Content Exploration

Jacob Chmura, Hasham Burhani, Xiao Qi Shi

PDF

Open Access

TL;DR

This paper introduces an information-theoretic intrinsic reward for reinforcement learning that enhances exploration efficiency, especially in sparse reward environments, and demonstrates superior performance over existing methods in challenging games.

Contribution

The paper proposes a novel intrinsic reward based on information content that improves exploration and sample efficiency, with an extension to continuous state spaces.

Findings

01

Outperforms Curiosity Driven Learning and RND in various games

02

Enhances exploration efficiency in sparse reward environments

03

Extends to continuous state spaces with improved sample efficiency

Abstract

Sparse reward environments are known to be challenging for reinforcement learning agents. In such environments, efficient and scalable exploration is crucial. Exploration is a means by which an agent gains information about the environment. We expand on this topic and propose a new intrinsic reward that systemically quantifies exploratory behavior and promotes state coverage by maximizing the information content of a trajectory taken by an agent. We compare our method to alternative exploration based intrinsic reward techniques, namely Curiosity Driven Learning and Random Network Distillation. We show that our information theoretic reward induces efficient exploration and outperforms in various games, including Montezuma Revenge, a known difficult task for reinforcement learning. Finally, we propose an extension that maximizes information content in a discretely compressed latent space…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Evolutionary Game Theory and Cooperation