Hierarchical Document Refinement for Long-context Retrieval-augmented Generation

Jiajie Jin; Xiaoxi Li; Guanting Dong; Yuyao Zhang; Yutao Zhu; Yongkang Wu; Zhonghua Li; Qi Ye; Zhicheng Dou

arXiv:2505.10413·cs.CL·May 16, 2025

TL;DR

This paper introduces LongRefiner, a hierarchical document refinement method that improves long-context retrieval-augmented generation by reducing computational costs and noise, while maintaining high performance across multiple QA datasets.

Contribution

LongRefiner is a novel, efficient plug-and-play refiner that leverages document structure and multi-task learning to enhance long-text RAG performance with significantly lower costs.

Findings

01. Achieves competitive accuracy on seven QA datasets.

02. Uses 10x lower computational cost and latency than the best baseline.

03. Further analysis indicates it is scalable, efficient, and effective for real-world long-text RAG applications.

Abstract

Real-world RAG applications often encounter long-context input scenarios, where redundant information and noise result in higher inference costs and reduced performance. To address these challenges, we propose LongRefiner, an efficient plug-and-play refiner that leverages the inherent structural characteristics of long documents. LongRefiner employs dual-level query analysis, hierarchical document structuring, and adaptive refinement through multi-task learning on a single foundation model. Experiments on seven QA datasets demonstrate that LongRefiner achieves competitive performance in various scenarios while incurring 10x lower computational cost and latency than the best baseline. Further analysis validates that LongRefiner is scalable, efficient, and effective, providing practical insights for real-world long-text RAG applications. Our code is available at…
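To make the idea of structure-aware refinement concrete, here is a minimal toy sketch. It is not the authors' implementation: LongRefiner uses a trained foundation model, whereas this sketch substitutes plain word overlap for every learned component. The `Section` class, the `# ` heading convention, and the `build_hierarchy`, `overlap`, and `refine` functions are all hypothetical names introduced for illustration. The sketch only shows the general pattern of organizing a long document into sections, scoring content against the query at section and paragraph level, and keeping the best paragraphs under a word budget.

```python
import re
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Section:
    """Hypothetical hierarchy node: a heading plus the paragraphs under it."""
    title: str
    paragraphs: List[str] = field(default_factory=list)


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_hierarchy(document: str) -> List[Section]:
    """Group lines into sections; a line starting with '# ' opens a new
    section (an assumed convention for this toy example)."""
    sections: List[Section] = []
    current = Section(title="")
    for raw in document.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("# "):
            if current.title or current.paragraphs:
                sections.append(current)
            current = Section(title=line[2:])
        else:
            current.paragraphs.append(line)
    if current.title or current.paragraphs:
        sections.append(current)
    return sections


def overlap(query_tokens: Set[str], text: str) -> float:
    """Fraction of query tokens found in the text (stand-in for a learned scorer)."""
    if not query_tokens:
        return 0.0
    return len(query_tokens & tokenize(text)) / len(query_tokens)


def refine(document: str, query: str, max_words: int) -> List[str]:
    """Keep the highest-scoring paragraphs within a word budget, in document order."""
    q = tokenize(query)
    scored = []
    for s_idx, sec in enumerate(build_hierarchy(document)):
        # Coarse score: relevance of the whole section to the query.
        sec_score = overlap(q, sec.title + " " + " ".join(sec.paragraphs))
        for p_idx, para in enumerate(sec.paragraphs):
            # Fine score: section relevance plus the paragraph's own relevance.
            scored.append((sec_score + overlap(q, para), s_idx, p_idx, para))
    scored.sort(key=lambda item: -item[0])
    kept, used = [], 0
    for score, s_idx, p_idx, para in scored:
        n_words = len(para.split())
        if score == 0 or used + n_words > max_words:
            continue  # skip irrelevant paragraphs and ones that exceed the budget
        kept.append((s_idx, p_idx, para))
        used += n_words
    kept.sort()  # restore original document order
    return [para for _, _, para in kept]


DOC = """# Climate
Paris has a mild climate.
Rain falls all year.
# History
Paris was founded by the Parisii tribe.
The Eiffel Tower opened in 1889."""

print(refine(DOC, "When did the Eiffel Tower open?", max_words=8))
# prints ['The Eiffel Tower opened in 1889.']
```

With a larger budget, both paragraphs from the relevant section are kept while the unrelated section is dropped entirely. This is the kind of noise reduction the abstract describes, although the real system decides what to keep with a learned model rather than token overlap.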

Code & Models

Repositories

ignorejjj/longrefiner (PyTorch, official)

Videos

Hierarchical Document Refinement for Long-context Retrieval-augmented Generation · Underline

Taxonomy

Topics: Recommender Systems and Techniques · Web Data Mining and Analysis · Semantic Web and Ontologies

Methods: Attention Is All You Need · Linear Warmup With Linear Decay · Layer Normalization · Byte Pair Encoding · Attention Dropout · Softmax · WordPiece · Linear Layer · Weight Decay