Quasimetric Value Functions with Dense Rewards

Khadichabonu Valieva; Bikramjit Banerjee

arXiv:2409.08724·cs.LG·September 16, 2024

Quasimetric Value Functions with Dense Rewards

Khadichabonu Valieva, Bikramjit Banerjee

PDF

Open Access

TL;DR

This paper demonstrates that the quasimetric property of goal-conditioned value functions in reinforcement learning is preserved under dense rewards, enabling more efficient training and improved sample complexity in challenging robotics tasks.

Contribution

It shows that the triangle inequality for quasimetric value functions holds with dense rewards under certain conditions, expanding the applicability of goal-conditioned RL architectures.

Findings

01

Dense rewards can preserve the quasimetric structure in GCRL.

02

Training with dense rewards outperforms sparse rewards in benchmark tasks.

03

Dense reward functions satisfying the key condition improve sample efficiency.

Abstract

As a generalization of reinforcement learning (RL) to parametrizable goals, goal conditioned RL (GCRL) has a broad range of applications, particularly in challenging tasks in robotics. Recent work has established that the optimal value function of GCRL $Q^{*} (s, a, g)$ has a quasimetric structure, leading to targetted neural architectures that respect such structure. However, the relevant analyses assume a sparse reward setting -- a known aggravating factor to sample complexity. We show that the key property underpinning a quasimetric, viz., the triangle inequality, is preserved under a dense reward setting as well. Contrary to earlier findings where dense rewards were shown to be detrimental to GCRL, we identify the key condition necessary for triangle inequality. Dense reward functions that satisfy this condition can only improve, never worsen, sample complexity. This opens up…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFunctional Equations Stability Results · Nonlinear Differential Equations Analysis · Optimization and Variational Analysis