Learning Uncertainty-Aware Temporally-Extended Actions

Joongkyu Lee; Seung Joon Park; Yunhao Tang; Min-hwan Oh

arXiv:2402.05439·cs.LG·February 9, 2024·1 cites

Learning Uncertainty-Aware Temporally-Extended Actions

Joongkyu Lee, Seung Joon Park, Yunhao Tang, Min-hwan Oh

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces UTE, a new reinforcement learning algorithm that uses uncertainty measurement to improve temporally-extended actions, leading to better policy learning in complex environments.

Contribution

The paper presents UTE, a novel uncertainty-aware algorithm that enhances action repetition by strategically balancing exploration and exploitation.

Findings

01

UTE outperforms existing action repetition methods.

02

UTE mitigates performance degradation caused by sub-optimal action repetition.

03

Experimental results show improved learning efficiency in Gridworld and Atari environments.

Abstract

In reinforcement learning, temporal abstraction in the action space, exemplified by action repetition, is a technique to facilitate policy learning through extended actions. However, a primary limitation in previous studies of action repetition is its potential to degrade performance, particularly when sub-optimal actions are repeated. This issue often negates the advantages of action repetition. To address this, we propose a novel algorithm named Uncertainty-aware Temporal Extension (UTE). UTE employs ensemble methods to accurately measure uncertainty during action extension. This feature allows policies to strategically choose between emphasizing exploration or adopting an uncertainty-averse approach, tailored to their specific needs. We demonstrate the effectiveness of UTE through experiments in Gridworld and Atari 2600 environments. Our findings show that UTE outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oh-lab/UTE-Uncertainty-aware-Temporal-Extension-
pytorch

Videos

Learning Uncertainty-Aware Temporally-Extended Actions· underline

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Anomaly Detection Techniques and Applications · Data Stream Mining Techniques