Meta-Reinforcement Learning for Heuristic Planning

Ricardo Luna Gutierrez; Matteo Leonetti

arXiv:2107.02603·cs.AI·July 7, 2021·1 cites

Meta-Reinforcement Learning for Heuristic Planning

Ricardo Luna Gutierrez, Matteo Leonetti

PDF

Open Access

TL;DR

This paper introduces an information-theoretic task selection method for meta-reinforcement learning, which enhances training efficiency and test performance by choosing relevant and diverse training tasks.

Contribution

The paper proposes ITTS, a novel task selection algorithm that improves meta-RL training by optimizing task relevance and diversity, leading to better generalization.

Findings

01

ITTS improves meta-RL performance across multiple experiments.

02

Task relevance and diversity are crucial for effective meta-RL training.

03

ITTS outperforms random and other selection strategies.

Abstract

In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of tasks to prepare for and learn faster in new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution of test tasks and hence all used in training. We show that given a set of training tasks, learning can be both faster and more effective (leading to better performance in the test tasks), if the training tasks are appropriately selected. We propose a task selection algorithm, Information-Theoretic Task Selection (ITTS), based on information theory, which optimizes the set of tasks used for training in meta-RL, irrespectively of how they are generated. The algorithm establishes which training tasks are both sufficiently relevant for the test tasks, and different enough from one another. We reproduce different meta-RL experiments from the literature and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Software Engineering Research