Feudal Reinforcement Learning by Reading Manuals

Kai Wang; Zhonghao Wang; Mo Yu; Humphrey Shi

arXiv:2110.06477·cs.AI·October 14, 2021

Feudal Reinforcement Learning by Reading Manuals

Kai Wang, Zhonghao Wang, Mo Yu, Humphrey Shi

PDF

Open Access

TL;DR

This paper introduces a Feudal Reinforcement Learning model with a manager-worker structure that improves reasoning from manuals by generating multi-hop plans, achieving strong results without manual curriculum design.

Contribution

The paper proposes a novel FRL framework with a multi-hop plan generator that better aligns high-level reasoning with low-level actions, applicable across diverse environments.

Findings

01

Achieves competitive performance on RTFM and Messenger tasks.

02

Effectively alleviates semantic mismatch between instructions and actions.

03

Does not require human-designed curriculum learning.

Abstract

Reading to act is a prevalent but challenging task which requires the ability to reason from a concise instruction. However, previous works face the semantic mismatch between the low-level actions and the high-level language descriptions and require the human-designed curriculum to work properly. In this paper, we present a Feudal Reinforcement Learning (FRL) model consisting of a manager agent and a worker agent. The manager agent is a multi-hop plan generator dealing with high-level abstract information and generating a series of sub-goals in a backward manner. The worker agent deals with the low-level perceptions and actions to achieve the sub-goals one by one. In comparison, our FRL model effectively alleviate the mismatching between text-level inference and low-level perceptions and actions; and is general to various forms of environments, instructions and manuals; and our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques