Query The Agent: Improving sample efficiency through epistemic   uncertainty estimation

Julian Alverio; Boris Katz; Andrei Barbu

arXiv:2210.02585·cs.LG·October 7, 2022

Query The Agent: Improving sample efficiency through epistemic uncertainty estimation

Julian Alverio, Boris Katz, Andrei Barbu

PDF

Open Access

TL;DR

This paper introduces Query The Agent (QTA), an algorithm that enhances sample efficiency in goal-conditioned reinforcement learning by estimating epistemic uncertainty across the state space and focusing on uncertain areas.

Contribution

The paper presents a novel algorithm, QTA, utilizing Predictive Uncertainty Networks (PUN) to estimate epistemic uncertainty and improve sample efficiency in reinforcement learning.

Findings

01

QTA significantly outperforms existing methods in sample efficiency.

02

PUN effectively estimates epistemic uncertainty across states.

03

QTA accelerates value function learning through targeted exploration.

Abstract

Curricula for goal-conditioned reinforcement learning agents typically rely on poor estimates of the agent's epistemic uncertainty or fail to consider the agents' epistemic uncertainty altogether, resulting in poor sample efficiency. We propose a novel algorithm, Query The Agent (QTA), which significantly improves sample efficiency by estimating the agent's epistemic uncertainty throughout the state space and setting goals in highly uncertain areas. Encouraging the agent to collect data in highly uncertain states allows the agent to improve its estimation of the value function rapidly. QTA utilizes a novel technique for estimating epistemic uncertainty, Predictive Uncertainty Networks (PUN), to allow QTA to assess the agent's uncertainty in all previously observed states. We demonstrate that QTA offers decisive sample efficiency improvements over preexisting methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Adversarial Robustness in Machine Learning