QuOTE: Question-Oriented Text Embeddings
Andrew Neeser, Kaylen Latimer, Aadyant Khatri, Chris Latimer, and, Naren Ramakrishnan

TL;DR
QuOTE introduces question-augmented text embeddings to improve document retrieval accuracy in RAG systems by aligning representations with query semantics through hypothetical questions.
Contribution
It proposes a novel question-oriented embedding method that enhances document representation for retrieval-augmented generation systems.
Findings
Significantly improves retrieval accuracy across benchmarks
Enhances performance in multi-hop question-answering tasks
Demonstrates versatility of question generation in indexing
Abstract
We present QuOTE (Question-Oriented Text Embeddings), a novel enhancement to retrieval-augmented generation (RAG) systems, aimed at improving document representation for accurate and nuanced retrieval. Unlike traditional RAG pipelines, which rely on embedding raw text chunks, QuOTE augments chunks with hypothetical questions that the chunk can potentially answer, enriching the representation space. This better aligns document embeddings with user query semantics, and helps address issues such as ambiguity and context-dependent relevance. Through extensive experiments across diverse benchmarks, we demonstrate that QuOTE significantly enhances retrieval accuracy, including in multi-hop question-answering tasks. Our findings highlight the versatility of question generation as a fundamental indexing strategy, opening new avenues for integrating question generation into retrieval-based AI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Adam · Softmax · Dropout · Weight Decay · BART · WordPiece · Layer Normalization
