Procedures as Programs: Hierarchical Control of Situated Agents through Natural Language
Shuyan Zhou, Pengcheng Yin, Graham Neubig

TL;DR
This paper introduces a hierarchical procedural formalism and modular network architecture for natural language command of situated agents, significantly improving performance and data efficiency in instruction following tasks.
Contribution
It presents a novel formalism of procedures as programs and a hierarchical modular network model for NL command execution, advancing beyond shallow hierarchies in prior work.
Findings
Outperforms reactive baselines on IQA and ALFRED datasets
More data-efficient than existing models
Enables fast iterative development of agent control systems
Abstract
When humans conceive how to perform a particular task, they do so hierarchically: splitting higher-level tasks into smaller sub-tasks. However, in the literature on natural language (NL) command of situated agents, most works have treated the procedures to be executed as flat sequences of simple actions, or any hierarchies of procedures have been shallow at best. In this paper, we propose a formalism of procedures as programs, a powerful yet intuitive method of representing hierarchical procedural knowledge for agent command and control. We further propose a modeling paradigm of hierarchical modular networks, which consist of a planner and reactors that convert NL intents to predictions of executable programs and probe the environment for information necessary to complete the program execution. We instantiate this framework on the IQA and ALFRED datasets for NL instruction following.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · AI-based Problem Solving and Planning · Software Engineering Techniques and Practices
