AutoGPT+P: Affordance-based Task Planning with Large Language Models
Timo Birr, Christoph Pohl, Abdelrahman Younes, Tamim Asfour

TL;DR
AutoGPT+P introduces an affordance-based scene representation combined with symbolic planning, enabling large language models to effectively generate and execute plans for complex tasks, including those with incomplete information.
Contribution
It presents a novel system that integrates affordance-based scene understanding with LLM-driven planning, improving success rates and handling incomplete information better than existing methods.
Findings
Achieves 98% success rate on standard tasks, surpassing SayCan.
Handles incomplete information with 79% success on new dataset.
Automatically generates object-affordance mappings using ChatGPT.
Abstract
Recent advances in task planning leverage Large Language Models (LLMs) to improve generalizability by combining such models with classical planning algorithms to address their inherent limitations in reasoning capabilities. However, these approaches face the challenge of dynamically capturing the initial state of the task planning problem. To alleviate this issue, we propose AutoGPT+P, a system that combines an affordance-based scene representation with a planning system. Affordances encompass the action possibilities of an agent on the environment and objects present in it. Thus, deriving the planning domain from an affordance-based scene representation allows symbolic planning with arbitrary objects. AutoGPT+P leverages this representation to derive and execute a plan for a task specified by the user in natural language. In addition to solving planning tasks under a closed-world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
