Generating Executable Action Plans with Environmentally-Aware Language   Models

Maitrey Gramopadhye; Daniel Szafir

arXiv:2210.04964·cs.RO·May 3, 2023

Generating Executable Action Plans with Environmentally-Aware Language Models

Maitrey Gramopadhye, Daniel Szafir

PDF

Open Access 1 Repo

TL;DR

This paper introduces an environmentally-aware approach to generate executable action plans with Large Language Models by incorporating scene context and object relations, significantly improving plan correctness and executability.

Contribution

The authors propose integrating environmental objects and relations into LLMs for action plan generation, enhancing plan feasibility and disambiguation capabilities.

Findings

01

310% improvement in executability

02

147% improvement in correctness

03

Effective use of environment-aware inputs

Abstract

Large Language Models (LLMs) trained using massive text datasets have recently shown promise in generating action plans for robotic agents from high level text queries. However, these models typically do not consider the robot's environment, resulting in generated plans that may not actually be executable, due to ambiguities in the planned actions or environmental constraints. In this paper, we propose an approach to generate environmentally-aware action plans that agents are better able to execute. Our approach involves integrating environmental objects and object relations as additional inputs into LLM action plan generation to provide the system with an awareness of its surroundings, resulting in plans where each generated action is mapped to objects present in the scene. We also design a novel scoring function that, along with generating the action steps and associating them with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hri-ironlab/scene_aware_language_planner
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsBalanced Selection