Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan Gu

TL;DR
This paper introduces GFA-RFE, an uncertainty-aware reward-free reinforcement learning algorithm that improves sample efficiency by handling heterogeneous uncertainty, with theoretical guarantees and strong empirical performance.
Contribution
The paper proposes GFA-RFE, a novel reward-free RL algorithm that incorporates uncertainty-awareness and provides theoretical sample complexity bounds.
Findings
GFA-RFE outperforms existing reward-free RL algorithms.
Theoretical sample complexity bound is established for GFA-RFE.
Empirical results show GFA-RFE is competitive with state-of-the-art algorithms.
Abstract
Mastering multiple tasks through exploration and learning in an environment poses a significant challenge in reinforcement learning (RL). Unsupervised RL has been introduced to address this challenge by training policies with intrinsic rewards rather than extrinsic rewards. However, current intrinsic reward designs and unsupervised RL algorithms often overlook the heterogeneous nature of collected samples, thereby diminishing their sample efficiency. To overcome this limitation, in this paper, we propose a reward-free RL algorithm called \alg. The key idea behind our algorithm is an uncertainty-aware intrinsic reward for exploring the environment and an uncertainty-weighted learning process to handle heterogeneous uncertainty in different samples. Theoretically, we show that in order to find an -optimal policy, GFA-RFE needs to collect $\tilde{O} (H^2 \log N_{\mathcal F}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods
