Uncertainty-Aware Reward-Free Exploration with General Function   Approximation

Junkai Zhang; Weitong Zhang; Dongruo Zhou; Quanquan Gu

arXiv:2406.16255·cs.LG·July 2, 2024

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Junkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan Gu

PDF

Open Access 1 Repo

TL;DR

This paper introduces GFA-RFE, an uncertainty-aware reward-free reinforcement learning algorithm that improves sample efficiency by handling heterogeneous uncertainty, with theoretical guarantees and strong empirical performance.

Contribution

The paper proposes GFA-RFE, a novel reward-free RL algorithm that incorporates uncertainty-awareness and provides theoretical sample complexity bounds.

Findings

01

GFA-RFE outperforms existing reward-free RL algorithms.

02

Theoretical sample complexity bound is established for GFA-RFE.

03

Empirical results show GFA-RFE is competitive with state-of-the-art algorithms.

Abstract

Mastering multiple tasks through exploration and learning in an environment poses a significant challenge in reinforcement learning (RL). Unsupervised RL has been introduced to address this challenge by training policies with intrinsic rewards rather than extrinsic rewards. However, current intrinsic reward designs and unsupervised RL algorithms often overlook the heterogeneous nature of collected samples, thereby diminishing their sample efficiency. To overcome this limitation, in this paper, we propose a reward-free RL algorithm called \alg. The key idea behind our algorithm is an uncertainty-aware intrinsic reward for exploring the environment and an uncertainty-weighted learning process to handle heterogeneous uncertainty in different samples. Theoretically, we show that in order to find an $ϵ$ -optimal policy, GFA-RFE needs to collect $\tilde{O} (H^2 \log N_{\mathcal F}…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uclaml/gfa-rfe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReservoir Engineering and Simulation Methods