On the Importance of Exploration for Generalization in Reinforcement   Learning

Yiding Jiang; J. Zico Kolter; Roberta Raileanu

arXiv:2306.05483·cs.LG·June 12, 2023·5 cites

On the Importance of Exploration for Generalization in Reinforcement Learning

Yiding Jiang, J. Zico Kolter, Roberta Raileanu

PDF

Open Access

TL;DR

This paper emphasizes the critical role of exploration strategies in enhancing generalization in deep reinforcement learning, introducing a novel method called EDE that leverages distributional ensembles to improve performance on high-dimensional benchmarks.

Contribution

The paper introduces EDE, a value-based exploration method using distributional ensembles, demonstrating state-of-the-art results in RL generalization benchmarks.

Findings

01

Exploration improves generalization to unseen environments.

02

EDE outperforms existing methods on Procgen and Crafter benchmarks.

03

Ensemble-based exploration effectively captures epistemic uncertainty.

Abstract

Existing approaches for improving generalization in deep reinforcement learning (RL) have mostly focused on representation learning, neglecting RL-specific aspects such as exploration. We hypothesize that the agent's exploration strategy plays a key role in its ability to generalize to new environments. Through a series of experiments in a tabular contextual MDP, we show that exploration is helpful not only for efficiently finding the optimal policy for the training environments but also for acquiring knowledge that helps decision making in unseen environments. Based on these observations, we propose EDE: Exploration via Distributional Ensemble, a method that encourages exploration of states with high epistemic uncertainty through an ensemble of Q-value distributions. Our algorithm is the first value-based approach to achieve state-of-the-art on both Procgen and Crafter, two benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications