Retrieve-Plan-Generation: An Iterative Planning and Answering Framework   for Knowledge-Intensive LLM Generation

Yuanjie Lyu; Zihan Niu; Zheyong Xie; Chao Zhang; Tong Xu; Yang Wang,; Enhong Chen

arXiv:2406.14979·cs.CL·October 10, 2024

Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang,, Enhong Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces the Retrieve-Plan-Generation framework, an iterative approach that improves knowledge-intensive LLM outputs by planning and selecting relevant information, reducing errors and off-topic content.

Contribution

It proposes a novel iterative planning and answer framework with multi-task prompt-tuning, enhancing relevance and accuracy in knowledge-intensive LLM tasks.

Findings

01

RPG outperforms baselines across 5 tasks.

02

Iterative planning improves answer relevance.

03

Multi-task prompt-tuning enables efficient implementation.

Abstract

Despite the significant progress of large language models (LLMs) in various tasks, they often produce factual errors due to their limited internal knowledge. Retrieval-Augmented Generation (RAG), which enhances LLMs with external knowledge sources, offers a promising solution. However, these methods can be misled by irrelevant paragraphs in retrieved documents. Due to the inherent uncertainty in LLM generation, inputting the entire document may introduce off-topic information, causing the model to deviate from the central topic and affecting the relevance of the generated content. To address these issues, we propose the Retrieve-Plan-Generation (RPG) framework. RPG generates plan tokens to guide subsequent generation in the plan stage. In the answer stage, the model selects relevant fine-grained paragraphs based on the plan and uses them for further answer generation. This plan-answer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haruhi-sudo/RPG
pytorchOfficial

Videos

Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation· underline

Taxonomy

TopicsSemantic Web and Ontologies