Quasimetric Value Functions with Dense Rewards
Khadichabonu Valieva, Bikramjit Banerjee

TL;DR
This paper demonstrates that the quasimetric property of goal-conditioned value functions in reinforcement learning is preserved under dense rewards, enabling more efficient training and improved sample complexity in challenging robotics tasks.
Contribution
It shows that the triangle inequality for quasimetric value functions holds with dense rewards under certain conditions, expanding the applicability of goal-conditioned RL architectures.
Findings
Dense rewards can preserve the quasimetric structure in GCRL.
Training with dense rewards outperforms sparse rewards in benchmark tasks.
Dense reward functions satisfying the key condition improve sample efficiency.
Abstract
As a generalization of reinforcement learning (RL) to parametrizable goals, goal conditioned RL (GCRL) has a broad range of applications, particularly in challenging tasks in robotics. Recent work has established that the optimal value function of GCRL has a quasimetric structure, leading to targetted neural architectures that respect such structure. However, the relevant analyses assume a sparse reward setting -- a known aggravating factor to sample complexity. We show that the key property underpinning a quasimetric, viz., the triangle inequality, is preserved under a dense reward setting as well. Contrary to earlier findings where dense rewards were shown to be detrimental to GCRL, we identify the key condition necessary for triangle inequality. Dense reward functions that satisfy this condition can only improve, never worsen, sample complexity. This opens up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Equations Stability Results · Nonlinear Differential Equations Analysis · Optimization and Variational Analysis
