PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large   Language Models as Decision Makers

Myeonghwa Lee; Seonho An; Min-Soo Kim

arXiv:2406.12430·cs.CL·June 19, 2024

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers

Myeonghwa Lee, Seonho An, Min-Soo Kim

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces PlanRAG, a novel decision-making framework using large language models that generate plans and retrieve data iteratively, and presents a new benchmark for evaluating decision QA in complex scenarios.

Contribution

It proposes PlanRAG, a new iterative plan-then-retrieval method for decision making with LLMs, and introduces the Decision QA benchmark based on video game scenarios.

Findings

01

PlanRAG outperforms existing methods by 15.8% and 7.4% in two scenarios.

02

The Decision QA benchmark enables evaluation of decision-making capabilities.

03

The approach effectively combines planning and data retrieval for complex decision tasks.

Abstract

In this paper, we conduct a study to utilize LLMs as a solution for decision making that requires complex data analysis. We define Decision QA as the task of answering the best decision, $d_{b es t}$ , for a decision-making question $Q$ , business rules $R$ and a database $D$ . Since there is no benchmark that can examine Decision QA, we propose Decision QA benchmark, DQA. It has two scenarios, Locating and Building, constructed from two video games (Europa Universalis IV and Victoria 3) that have almost the same goal as Decision QA. To address Decision QA effectively, we also propose a new RAG technique called the iterative plan-then-retrieval augmented generation (PlanRAG). Our PlanRAG-based LM generates the plan for decision making as the first step, and the retriever generates the queries for data analysis as the second step. The proposed method outperforms the state-of-the-art iterative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

myeon9h/planrag
noneOfficial

Videos

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay