Learning Context-aware Task Reasoning for Efficient Meta-reinforcement   Learning

Haozhe Wang; Jiale Zhou; Xuming He

arXiv:2003.01373·cs.LG·May 3, 2022·1 cites

Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning

Haozhe Wang, Jiale Zhou, Xuming He

PDF

Open Access

TL;DR

This paper introduces a novel meta-reinforcement learning approach that decomposes the problem into exploration, inference, and fulfillment, utilizing deep networks and a graph-based encoder to improve efficiency and reduce overfitting.

Contribution

The work proposes a new meta-RL strategy with task decomposition, self-supervised exploration, and a graph-based encoder, addressing sampling inefficiency and overfitting issues.

Findings

01

Enhances exploration for task inference

02

Improves sample efficiency during training and testing

03

Reduces meta-overfitting in meta-RL

Abstract

Despite recent success of deep network-based Reinforcement Learning (RL), it remains elusive to achieve human-level efficiency in learning novel tasks. While previous efforts attempt to address this challenge using meta-learning strategies, they typically suffer from sampling inefficiency with on-policy RL algorithms or meta-overfitting with off-policy learning. In this work, we propose a novel meta-RL strategy to address those limitations. In particular, we decompose the meta-RL problem into three sub-tasks, task-exploration, task-inference and task-fulfillment, instantiated with two deep network agents and a task encoder. During meta-training, our method learns a task-conditioned actor network for task-fulfillment, an explorer network with a self-supervised reward shaping that encourages task-informative experiences in task-exploration, and a context-aware graph-based task encoder for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)