P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task
Weiye Xu, Min Wang, Wengang Zhou, and Houqiang Li

TL;DR
P-RAG is a novel iterative retrieval-augmented generation method that progressively accumulates task-specific knowledge for embodied AI tasks, improving planning and execution without relying on ground-truth data.
Contribution
The paper introduces P-RAG, a new iterative retrieval scheme that enhances LLM-based planning for embodied tasks by progressively updating knowledge without ground-truth.
Findings
P-RAG achieves competitive results without ground-truth data.
Iterative retrieval improves task planning performance.
Granular retrieval of similar situations enhances reference quality.
Abstract
Embodied Everyday Task is a popular task in the embodied AI community, requiring agents to make a sequence of actions based on natural language instructions and visual observations. Traditional learning-based approaches face two challenges. Firstly, natural language instructions often lack explicit task planning. Secondly, extensive training is required to equip models with knowledge of the task environment. Previous works based on Large Language Model (LLM) either suffer from poor performance due to the lack of task-specific knowledge or rely on ground truth as few-shot samples. To address the above limitations, we propose a novel approach called Progressive Retrieval Augmented Generation (P-RAG), which not only effectively leverages the powerful language processing capabilities of LLMs but also progressively accumulates task-specific knowledge without ground-truth. Compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Human Pose and Action Recognition · Human Motion and Animation
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Attention Dropout · WordPiece · Dense Connections · Residual Connection · Linear Layer · Multi-Head Attention · Linear Warmup With Linear Decay · Adam
