Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning
Haozhe Wang, Jiale Zhou, Xuming He

TL;DR
This paper introduces a novel meta-reinforcement learning approach that decomposes the problem into exploration, inference, and fulfillment, utilizing deep networks and a graph-based encoder to improve efficiency and reduce overfitting.
Contribution
The work proposes a new meta-RL strategy with task decomposition, self-supervised exploration, and a graph-based encoder, addressing sampling inefficiency and overfitting issues.
Findings
Enhances exploration for task inference
Improves sample efficiency during training and testing
Reduces meta-overfitting in meta-RL
Abstract
Despite recent success of deep network-based Reinforcement Learning (RL), it remains elusive to achieve human-level efficiency in learning novel tasks. While previous efforts attempt to address this challenge using meta-learning strategies, they typically suffer from sampling inefficiency with on-policy RL algorithms or meta-overfitting with off-policy learning. In this work, we propose a novel meta-RL strategy to address those limitations. In particular, we decompose the meta-RL problem into three sub-tasks, task-exploration, task-inference and task-fulfillment, instantiated with two deep network agents and a task encoder. During meta-training, our method learns a task-conditioned actor network for task-fulfillment, an explorer network with a self-supervised reward shaping that encourages task-informative experiences in task-exploration, and a context-aware graph-based task encoder for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
