One STEP at a time: Language Agents are Stepwise Planners
Minh Nguyen, Ehsan Shareghi

TL;DR
This paper introduces STEP, a framework that improves language agents' planning abilities by learning from past experiences, leading to better task performance in dynamic environments.
Contribution
The paper presents a novel framework, STEP, that enhances language agents' planning by integrating experience-based learning and structured components for task decomposition, action generation, evaluation, and memory.
Findings
STEP outperforms state-of-the-art models on ScienceWorld benchmark
Achieves an overall score of 67.4 and completes 12 out of 18 tasks
Demonstrates improved planning and task-solving in dynamic environments
Abstract
Language agents have shown promising adaptability in dynamic environments to perform complex tasks. However, despite the versatile knowledge embedded in large language models, these agents still fall short when it comes to tasks that require planning. We introduce STEP, a novel framework designed to efficiently learn from previous experiences to enhance the planning capabilities of language agents in future steps. Concretely, STEP functions through four interconnected components. First, the Planner takes on the task, breaks it down into subtasks and provides relevant insights. Then the Executor generates action candidates, while the Evaluator ensures the actions align with learned rules from previous experiences. Lastly, Memory stores experiences to inform future decisions. In the ScienceWorld benchmark, our results show that STEP consistently outperforms state-of-the-art models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage, Linguistics, Cultural Analysis · Syntax, Semantics, Linguistic Variation · linguistics and terminology studies
MethodsALIGN
