Generating Executable Action Plans with Environmentally-Aware Language Models
Maitrey Gramopadhye, Daniel Szafir

TL;DR
This paper introduces an environmentally-aware approach to generate executable action plans with Large Language Models by incorporating scene context and object relations, significantly improving plan correctness and executability.
Contribution
The authors propose integrating environmental objects and relations into LLMs for action plan generation, enhancing plan feasibility and disambiguation capabilities.
Findings
310% improvement in executability
147% improvement in correctness
Effective use of environment-aware inputs
Abstract
Large Language Models (LLMs) trained using massive text datasets have recently shown promise in generating action plans for robotic agents from high level text queries. However, these models typically do not consider the robot's environment, resulting in generated plans that may not actually be executable, due to ambiguities in the planned actions or environmental constraints. In this paper, we propose an approach to generate environmentally-aware action plans that agents are better able to execute. Our approach involves integrating environmental objects and object relations as additional inputs into LLM action plan generation to provide the system with an awareness of its surroundings, resulting in plans where each generated action is mapped to objects present in the scene. We also design a novel scoring function that, along with generating the action steps and associating them with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsBalanced Selection
