ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu,, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg

TL;DR
ProgPrompt leverages large language models with programmatic prompts to generate feasible, situated robot task plans across environments and robot capabilities, achieving high success in virtual and real-world tasks.
Contribution
This work introduces a novel programmatic prompting approach that enables LLMs to generate contextually feasible robot plans without exhaustive enumeration or free-form text.
Findings
Achieves state-of-the-art success rates in VirtualHome household tasks.
Successfully deployed on a physical robot arm for tabletop tasks.
Provides concrete prompt design recommendations through ablation studies.
Abstract
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
