AttentionRetriever: Attention Layers are Secretly Long Document Retrievers

David Jiahao Fu; Lam Thanh Do; Jiayu Li; Kevin Chen-Chuan Chang

arXiv:2602.12278·cs.IR·February 13, 2026

AttentionRetriever: Attention Layers are Secretly Long Document Retrievers

David Jiahao Fu, Lam Thanh Do, Jiayu Li, Kevin Chen-Chuan Chang

PDF

Open Access

TL;DR

AttentionRetriever introduces an attention-based, entity-aware retrieval model that significantly improves long document retrieval accuracy for LLMs, addressing key challenges like context-awareness and scope of retrieval.

Contribution

The paper presents a novel attention and entity-based retrieval model specifically designed for long documents, outperforming existing models in accuracy and efficiency.

Findings

01

Outperforms existing retrieval models on long document datasets

02

Maintains efficiency comparable to dense retrieval models

03

Effectively addresses context-awareness and scope challenges

Abstract

Retrieval augmented generation (RAG) has been widely adopted to help Large Language Models (LLMs) to process tasks involving long documents. However, existing retrieval models are not designed for long document retrieval and fail to address several key challenges of long document retrieval, including context-awareness, causal dependence, and scope of retrieval. In this paper, we proposed AttentionRetriever, a novel long document retrieval model that leverages attention mechanism and entity-based retrieval to build context-aware embeddings for long document and determine the scope of retrieval. With extensive experiments, we found AttentionRetriever is able to outperform existing retrieval models on long document retrieval datasets by a large margin while remaining as efficient as dense retrieval models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Topic Modeling · Multimodal Machine Learning Applications