Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented   Generation

Prakhar Verma; Sukruta Prakash Midigeshi; Gaurav Sinha; Arno Solin,; Nagarajan Natarajan; Amit Sharma

arXiv:2410.20753·cs.CL·February 5, 2025

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

Prakhar Verma, Sukruta Prakash Midigeshi, Gaurav Sinha, Arno Solin,, Nagarajan Natarajan, Amit Sharma

PDF

Open Access

TL;DR

Plan*RAG introduces a test-time planning framework for retrieval-augmented generation that improves multi-hop reasoning by externalizing reasoning plans as DAGs, enabling systematic exploration, precise retrievals, and efficiency.

Contribution

It presents a novel externalized reasoning plan structure for RAG, enhancing multi-hop reasoning, grounding, and efficiency over existing methods.

Findings

01

Outperforms recent RAG variants on multi-hop benchmarks

02

Maintains comparable computational costs

03

Enables systematic reasoning path exploration

Abstract

We introduce Plan*RAG, a novel framework that enables structured multi-hop reasoning in retrieval-augmented generation (RAG) through test-time reasoning plan generation. While existing approaches such as ReAct maintain reasoning chains within the language model's context window, we observe that this often leads to plan fragmentation and execution failures. Our key insight is that by isolating the reasoning plan as a directed acyclic graph (DAG) outside the LM's working memory, we can enable (1) systematic exploration of reasoning paths, (2) atomic subqueries enabling precise retrievals and grounding, and (3) efficiency through parallel execution and bounded context window utilization. Moreover, Plan*RAG's modular design allows it to be integrated with existing RAG methods, thus providing a practical solution to improve current RAG systems. On standard multi-hop reasoning benchmarks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Speech and dialogue systems · Algorithms and Data Compression

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Adam · Linear Layer · Attention Dropout · Dropout · Weight Decay · Dense Connections · Byte Pair Encoding · BART · Layer Normalization