ELITE: Embedding-Less retrieval with Iterative Text Exploration

Zhangyu Wang; Siyuan Gao; Rong Zhou; Hao Wang; Li Ning

arXiv:2505.11908·cs.CL·May 20, 2025

ELITE: Embedding-Less retrieval with Iterative Text Exploration

Zhangyu Wang, Siyuan Gao, Rong Zhou, Hao Wang, Li Ning

PDF

Open Access 1 Repo

TL;DR

This paper introduces ELITE, a novel embedding-less retrieval method that uses iterative text exploration and logical inference to improve long-context question answering, outperforming existing methods with less computational overhead.

Contribution

ELITE is the first embedding-free retrieval framework that leverages LLMs' logical reasoning for efficient, accurate long-context information retrieval without explicit graph structures.

Findings

01

Outperforms strong baselines on long-context QA benchmarks

02

Reduces storage and runtime by over an order of magnitude

03

Effectively retrieves logically related information without embeddings

Abstract

Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented Generation (RAG) mitigates this by retrieving relevant information from an external corpus. However, existing RAG systems often rely on embedding-based retrieval trained on corpus-level semantic similarity, which can lead to retrieving content that is semantically similar in form but misaligned with the question's true intent. Furthermore, recent RAG variants construct graph- or hierarchy-based structures to improve retrieval accuracy, resulting in significant computation and storage overhead. In this paper, we propose an embedding-free retrieval framework. Our method leverages the logical inferencing ability of LLMs in retrieval using iterative search space…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tjzvbokbnft/ELITE-Embedding-Less-retrieval-with-Iterative-Text-Exploration
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Layer Normalization · Softmax · Attention Dropout · WordPiece · Residual Connection · Linear Layer · Byte Pair Encoding