Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning

Bo Liu; Yihao Feng; Qiang Liu; Peter Stone

arXiv:2208.08133 · cs.LG · January 23, 2023 · 1 citation

TL;DR

This paper introduces Metric Residual Networks (MRN), a novel neural architecture for goal-conditioned reinforcement learning that leverages metric properties to significantly improve sample efficiency across various benchmarks.

Contribution

The paper proposes MRN, a new neural network architecture that decomposes the action-value function to better satisfy metric properties, enhancing sample efficiency in GCRL tasks.

Findings

01 MRN outperforms existing architectures in 12 benchmark environments.

02 MRN achieves higher sample efficiency than state-of-the-art methods.

03 The architecture is theoretically grounded in metric properties of the value function.

Abstract

Goal-conditioned reinforcement learning (GCRL) has a wide range of potential real-world applications, including manipulation and navigation problems in robotics. Especially in such robotics tasks, sample efficiency is of the utmost importance for GCRL since, by default, the agent is only rewarded when it reaches its goal. While several methods have been proposed to improve the sample efficiency of GCRL, one relatively under-studied approach is the design of neural architectures to support sample efficiency. In this work, we introduce a novel neural architecture for GCRL that achieves significantly better sample efficiency than the commonly-used monolithic network architecture. The key insight is that the optimal action-value function Q^*(s, a, g) must satisfy the triangle inequality in a specific sense. Furthermore, we introduce the metric residual network (MRN) that deliberately…
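The abstract is cut off before it describes the architecture, so the following is only an illustrative sketch of how a value function can be made to respect a triangle inequality by construction, not the authors' implementation. It assumes the negated action-value is written as a distance between an embedding of the state-action pair and an embedding of the goal, built from a symmetric part plus an asymmetric residual. The names `W_SYM`, `W_ASYM`, `symmetric_part`, `asymmetric_part`, and `distance` are hypothetical, and fixed random linear maps stand in for learned networks.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumption: fixed random linear maps stand in for learned embedding networks.
DIM_IN, DIM_EMB = 6, 4
W_SYM = rng.normal(size=(DIM_IN, DIM_EMB))
W_ASYM = rng.normal(size=(DIM_IN, DIM_EMB))


def symmetric_part(x, y):
    """Euclidean distance between embeddings (symmetric in x and y)."""
    return float(np.linalg.norm(x @ W_SYM - y @ W_SYM))


def asymmetric_part(x, y):
    """Largest positive coordinate gap: not symmetric, but still
    satisfies the triangle inequality."""
    return float(np.max(np.maximum(x @ W_ASYM - y @ W_ASYM, 0.0)))


def distance(x, y):
    """Hypothetical quasi-metric: symmetric part plus asymmetric residual."""
    return symmetric_part(x, y) + asymmetric_part(x, y)


def q_value(x, y):
    """Value estimate modelled as a negated distance (an assumption here)."""
    return -distance(x, y)


def max_triangle_violation(n_trials=1000):
    """Largest observed d(x, z) - d(x, y) - d(y, z) over random triples."""
    worst = 0.0
    for _ in range(n_trials):
        x, y, z = rng.normal(size=(3, DIM_IN))
        worst = max(worst, distance(x, z) - distance(x, y) - distance(y, z))
    return worst


if __name__ == "__main__":
    print("max violation:", max_triangle_violation())
```

Because both parts individually obey the triangle inequality, their sum does too, so the constraint holds for any weights rather than having to be learned from data. The asymmetric term matters because reaching a goal from a state can cost more or less than the reverse trip.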

Taxonomy

Topics: Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety