ELITE: Embedding-Less retrieval with Iterative Text Exploration
Zhangyu Wang, Siyuan Gao, Rong Zhou, Hao Wang, Li Ning

TL;DR
This paper introduces ELITE, a novel embedding-less retrieval method that uses iterative text exploration and logical inference to improve long-context question answering, outperforming existing methods with less computational overhead.
Contribution
ELITE is the first embedding-free retrieval framework that leverages LLMs' logical reasoning for efficient, accurate long-context information retrieval without explicit graph structures.
Findings
Outperforms strong baselines on long-context QA benchmarks
Reduces storage and runtime by over an order of magnitude
Effectively retrieves logically related information without embeddings
Abstract
Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented Generation (RAG) mitigates this by retrieving relevant information from an external corpus. However, existing RAG systems often rely on embedding-based retrieval trained on corpus-level semantic similarity, which can lead to retrieving content that is semantically similar in form but misaligned with the question's true intent. Furthermore, recent RAG variants construct graph- or hierarchy-based structures to improve retrieval accuracy, resulting in significant computation and storage overhead. In this paper, we propose an embedding-free retrieval framework. Our method leverages the logical inferencing ability of LLMs in retrieval using iterative search space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Layer Normalization · Softmax · Attention Dropout · WordPiece · Residual Connection · Linear Layer · Byte Pair Encoding
