RetroLLM: Empowering Large Language Models to Retrieve Fine-grained   Evidence within Generation

Xiaoxi Li; Jiajie Jin; Yujia Zhou; Yongkang Wu; Zhonghua Li; Qi Ye,; Zhicheng Dou

arXiv:2412.11919·cs.CL·December 17, 2024·2 cites

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yongkang Wu, Zhonghua Li, Qi Ye,, Zhicheng Dou

PDF

Open Access 1 Repo

TL;DR

RetroLLM is a unified framework that enhances large language models by enabling them to retrieve and generate fine-grained evidence directly from a corpus, reducing hallucinations and improving accuracy in open-domain QA tasks.

Contribution

It introduces a joint retrieval-generation approach with hierarchical constraints and forward-looking decoding, addressing limitations of existing retrieval-augmented methods.

Findings

01

Outperforms existing methods on five open-domain QA datasets

02

Reduces irrelevant decoding through hierarchical FM-Index constraints

03

Improves evidence accuracy with forward-looking constrained decoding

Abstract

Large language models (LLMs) exhibit remarkable generative capabilities but often suffer from hallucinations. Retrieval-augmented generation (RAG) offers an effective solution by incorporating external knowledge, but existing methods still face several limitations: additional deployment costs of separate retrievers, redundant input tokens from retrieved text chunks, and the lack of joint optimization of retrieval and generation. To address these issues, we propose \textbf{RetroLLM}, a unified framework that integrates retrieval and generation into a single, cohesive process, enabling LLMs to directly generate fine-grained evidence from the corpus with constrained decoding. Moreover, to mitigate false pruning in the process of constrained evidence generation, we introduce (1) hierarchical FM-Index constraints, which generate corpus-constrained clues to identify a subset of relevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sunnynexus/retrollm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsPruning