Refiner: Restructure Retrieval Content Efficiently to Advance   Question-Answering Capabilities

Zhonghao Li; Xuming Hu; Aiwei Liu; Kening Zheng; Sirui Huang; Hui; Xiong

arXiv:2406.11357·cs.CL·April 30, 2025·1 cites

Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities

Zhonghao Li, Xuming Hu, Aiwei Liu, Kening Zheng, Sirui Huang, Hui, Xiong

PDF

Open Access 1 Repo 1 Models

TL;DR

Refiner is an end-to-end extract-and-restructure method that enhances retrieval-augmented generation by reorganizing document content, significantly improving question-answering accuracy and efficiency in LLM systems.

Contribution

It introduces a novel Restructure module that adaptively extracts and reorganizes relevant information, outperforming existing methods in QA tasks.

Findings

01

Achieves 80.5% token reduction in retrieval content.

02

Improves multi-hop QA accuracy by up to 7%.

03

Outperforms state-of-the-art RAG and compression approaches.

Abstract

Large Language Models (LLMs) are limited by their parametric knowledge, leading to hallucinations in knowledge-extensive tasks. To address this, Retrieval-Augmented Generation (RAG) incorporates external document chunks to expand LLM knowledge. Furthermore, compressing information from document chunks through extraction or summarization can improve LLM performance. Nonetheless, LLMs still struggle to notice and utilize scattered key information, a problem known as the "lost-in-the-middle" syndrome. Therefore, we typically need to restructure the content for LLM to recognize the key information. We propose $Refiner$ , an end-to-end extract-and-restructure paradigm that operates in the post-retrieval process of RAG. $Refiner$ leverages a single decoder-only LLM to adaptively extract query-relevant contents verbatim along with the necessary context, and section them based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allen-li1231/refiner-rag
pytorchOfficial

Models

🤗
al1231/Refiner-7B
model· 12 dl
12 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Image Processing and 3D Reconstruction · Machine Learning and Algorithms

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay