Fast Inference and Transfer of Compositional Task Structures for   Few-shot Task Generalization

Sungryull Sohn; Hyunjae Woo; Jongwook Choi; lyubing qiang; Izzeddin; Gur; Aleksandra Faust; Honglak Lee

arXiv:2205.12648·cs.LG·May 26, 2022·1 cites

Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization

Sungryull Sohn, Hyunjae Woo, Jongwook Choi, lyubing qiang, Izzeddin, Gur, Aleksandra Faust, Honglak Lee

PDF

Open Access

TL;DR

This paper introduces a method for few-shot reinforcement learning that infers common subtask graph structures to enable faster adaptation to new complex tasks, outperforming existing algorithms.

Contribution

The paper proposes a multi-task subtask graph inferencer that captures shared high-level task structures to improve task inference and generalization in few-shot settings.

Findings

01

Faster adaptation to unseen tasks compared to existing methods

02

Effective inference of subtask graph structures from training tasks

03

Improved performance in 2D grid-world and web navigation domains

Abstract

We tackle real-world problems with complex structures beyond the pixel-based game or simulator. We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph that defines a set of subtasks and their dependencies that are unknown to the agent. Different from the previous meta-rl methods trying to directly infer the unstructured task embedding, our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks, and use it as a prior to improve the task inference in testing. Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks than various existing algorithms such as meta reinforcement learning,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications