Fixed $\beta$-VAE Encoding for Curious Exploration in Complex 3D   Environments

Auguste Lehuger; Matthew Crosby

arXiv:2105.08568·cs.LG·May 19, 2021

Fixed $\beta$-VAE Encoding for Curious Exploration in Complex 3D Environments

Auguste Lehuger, Matthew Crosby

PDF

Open Access

TL;DR

This paper demonstrates that using a fixed $eta$-VAE encoding with curiosity-driven exploration significantly improves performance and sample efficiency in complex 3D environments and Atari games, outperforming traditional feature methods.

Contribution

It introduces a fixed $eta$-VAE encoding approach for curiosity in complex environments, showing improved exploration and sample efficiency over existing methods.

Findings

01

Achieved 22% gain in sample efficiency in Animal-AI environment.

02

Outperformed random features and inverse-dynamics features on Atari Breakout.

03

Enabled solving previously unsolved exploration tasks with curriculum learning.

Abstract

Curiosity is a general method for augmenting an environment reward with an intrinsic reward, which encourages exploration and is especially useful in sparse reward settings. As curiosity is calculated using next state prediction error, the type of state encoding used has a large impact on performance. Random features and inverse-dynamics features are generally preferred over VAEs based on previous results from Atari and other mostly 2D environments. However, unlike VAEs, they may not encode sufficient information for optimal behaviour, which becomes increasingly important as environments become more complex. In this paper, we use the sparse reward 3D physics environment Animal-AI, to demonstrate how a fixed $β$ -VAE encoding can be used effectively with curiosity. We combine this with curriculum learning to solve the previously unsolved exploration intensive detour tasks while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference