Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities
Zhonghao Li, Xuming Hu, Aiwei Liu, Kening Zheng, Sirui Huang, Hui, Xiong

TL;DR
Refiner is an end-to-end extract-and-restructure method that enhances retrieval-augmented generation by reorganizing document content, significantly improving question-answering accuracy and efficiency in LLM systems.
Contribution
It introduces a novel Restructure module that adaptively extracts and reorganizes relevant information, outperforming existing methods in QA tasks.
Findings
Achieves 80.5% token reduction in retrieval content.
Improves multi-hop QA accuracy by up to 7%.
Outperforms state-of-the-art RAG and compression approaches.
Abstract
Large Language Models (LLMs) are limited by their parametric knowledge, leading to hallucinations in knowledge-extensive tasks. To address this, Retrieval-Augmented Generation (RAG) incorporates external document chunks to expand LLM knowledge. Furthermore, compressing information from document chunks through extraction or summarization can improve LLM performance. Nonetheless, LLMs still struggle to notice and utilize scattered key information, a problem known as the "lost-in-the-middle" syndrome. Therefore, we typically need to restructure the content for LLM to recognize the key information. We propose , an end-to-end extract-and-restructure paradigm that operates in the post-retrieval process of RAG. leverages a single decoder-only LLM to adaptively extract query-relevant contents verbatim along with the necessary context, and section them based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Image Processing and 3D Reconstruction · Machine Learning and Algorithms
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay
