Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning

David Yunis; Justin Jung; Falcon Dai; Matthew Walter

arXiv:2309.04459·cs.LG·November 1, 2024

TL;DR

This paper introduces a skill-generation method for sparse-reward reinforcement learning that discretizes the action space and applies NLP-inspired tokenization, yielding more efficient exploration and stronger performance than baseline skill-generation methods.

Contribution

It proposes a new approach combining action space clustering and tokenization to generate skills, reducing pretraining time and improving exploration in continuous action spaces.
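
The two components named above can be illustrated with a minimal sketch. This listing says only "clustering" and "tokenization", so the specific choices here (k-means for clustering, byte-pair-encoding merges for tokenization) and the function names `kmeans_discretize` and `bpe_skills` are assumptions made for illustration, not a description of the authors' released code.

```python
from collections import Counter

import numpy as np


def kmeans_discretize(actions, k, iters=20, seed=0):
    """Cluster continuous actions (N x D array) into k centroids.

    Assumption: plain k-means stands in for the "clustering" step.
    Returns (centroids, labels), where labels[i] is the discrete id of actions[i].
    """
    rng = np.random.default_rng(seed)
    centroids = actions[rng.choice(len(actions), size=k, replace=False)].copy()
    for _ in range(iters):
        dists = np.linalg.norm(actions[:, None, :] - centroids[None, :, :], axis=-1)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = actions[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    # Recompute labels against the final centroids.
    dists = np.linalg.norm(actions[:, None, :] - centroids[None, :, :], axis=-1)
    return centroids, dists.argmin(axis=1)


def bpe_skills(sequences, num_merges):
    """Greedy BPE-style merges over sequences of discrete action ids.

    Assumption: byte-pair encoding stands in for the "tokenization" step.
    Each token is a tuple of primitive action ids; each merge yields one
    candidate skill (a longer tuple). Returns the skills in merge order.
    """
    seqs = [[(a,) for a in seq] for seq in sequences]
    skills = []
    for _ in range(num_merges):
        pairs = Counter()
        for seq in seqs:
            pairs.update(zip(seq, seq[1:]))
        if not pairs:
            break
        (left, right), _count = pairs.most_common(1)[0]
        merged = left + right
        skills.append(merged)
        new_seqs = []
        for seq in seqs:
            out, i = [], 0
            while i < len(seq):
                if i + 1 < len(seq) and seq[i] == left and seq[i + 1] == right:
                    out.append(merged)
                    i += 2
                else:
                    out.append(seq[i])
                    i += 1
            new_seqs.append(out)
        seqs = new_seqs
    return skills


# Toy usage: two well-separated blobs of 2-D actions, then merges on id sequences.
toy_actions = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
toy_centroids, toy_labels = kmeans_discretize(toy_actions, k=2)
toy_skills = bpe_skills([[0, 1, 0, 1, 0, 1], [0, 1, 2]], num_merges=2)
```

In this sketch a skill such as `(0, 1, 0, 1)` would be executed by replaying the centroid action for each id in order, so a downstream policy would choose among a finite set of multi-step skills rather than among raw continuous actions.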

Findings

01 Outperforms baseline skill-generation methods in challenging domains

02 Requires significantly less computation for skill creation and online rollouts

03 Effective in sparse-reward reinforcement learning tasks

Abstract

Exploration in sparse-reward reinforcement learning is difficult due to the requirement of long, coordinated sequences of actions in order to achieve any reward. Moreover, in continuous action spaces there are an infinite number of possible actions, which only increases the difficulty of exploration. One class of methods designed to address these issues forms temporally extended actions, often called skills, from interaction data collected in the same domain, and optimizes a policy on top of this new action space. Typically such methods require a lengthy pretraining phase, especially in continuous action spaces, in order to form the skills before reinforcement learning can begin. Given prior evidence that the full range of the continuous action space is not required in such tasks, we propose a novel approach to skill-generation with two components. First we discretize the action space…

Code & Models

Repositories

dyunis/subwords_as_skills (PyTorch, official)

Videos

Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning · SlidesLive

Taxonomy

Topics: Reinforcement Learning in Robotics · Software Engineering Research