Hierarchical Universal Value Function Approximators

Rushiv Arora

arXiv:2410.08997·cs.LG·October 29, 2024

Hierarchical Universal Value Function Approximators

Rushiv Arora

PDF

Open Access

TL;DR

This paper introduces hierarchical universal value function approximators (H-UVFAs) for hierarchical reinforcement learning, enabling better scaling, planning, and generalization in multi-goal settings by leveraging the options framework.

Contribution

It extends universal value function approximators to hierarchical settings using the options framework, with new methods for learning hierarchical embeddings and demonstrating improved generalization.

Findings

01

H-UVFAs outperform UVFAs in generalization tasks

02

Developed supervised and reinforcement learning methods for hierarchical embeddings

03

Enabled scaling and planning benefits in hierarchical reinforcement learning

Abstract

There have been key advancements to building universal approximators for multi-goal collections of reinforcement learning value functions -- key elements in estimating long-term returns of states in a parameterized manner. We extend this to hierarchical reinforcement learning, using the options framework, by introducing hierarchical universal value function approximators (H-UVFAs). This allows us to leverage the added benefits of scaling, planning, and generalization expected in temporal abstraction settings. We develop supervised and reinforcement learning methods for learning embeddings of the states, goals, options, and actions in the two hierarchical value functions: $Q (s, g, o; θ)$ and $Q (s, g, o, a; θ)$ . Finally we demonstrate generalization of the HUVFAs and show they outperform corresponding UVFAs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms