Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization
Sungryull Sohn, Hyunjae Woo, Jongwook Choi, lyubing qiang, Izzeddin, Gur, Aleksandra Faust, Honglak Lee

TL;DR
This paper introduces a method for few-shot reinforcement learning that infers common subtask graph structures to enable faster adaptation to new complex tasks, outperforming existing algorithms.
Contribution
The paper proposes a multi-task subtask graph inferencer that captures shared high-level task structures to improve task inference and generalization in few-shot settings.
Findings
Faster adaptation to unseen tasks compared to existing methods
Effective inference of subtask graph structures from training tasks
Improved performance in 2D grid-world and web navigation domains
Abstract
We tackle real-world problems with complex structures beyond the pixel-based game or simulator. We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph that defines a set of subtasks and their dependencies that are unknown to the agent. Different from the previous meta-rl methods trying to directly infer the unstructured task embedding, our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks, and use it as a prior to improve the task inference in testing. Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks than various existing algorithms such as meta reinforcement learning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications
