ARAGOG: Advanced RAG Output Grading
Matou\v{s} Eibich, Shivay Nagpal, Alexander Fred-Ojala

TL;DR
This paper evaluates various Retrieval-Augmented Generation methods, highlighting the effectiveness of Hypothetical Document Embedding and Sentence Window Retrieval in improving retrieval precision, and introduces the ARAGOG framework for RAG output grading.
Contribution
It provides an extensive experimental comparison of RAG techniques, identifying the most effective methods and introducing the ARAGOG framework for output grading.
Findings
HyDE and LLM reranking improve retrieval precision
MMR and Cohere rerank show no significant advantage
Sentence Window Retrieval is most effective for retrieval
Abstract
Retrieval-Augmented Generation (RAG) is essential for integrating external knowledge into Large Language Model (LLM) outputs. While the literature on RAG is growing, it primarily focuses on systematic reviews and comparisons of new state-of-the-art (SoTA) techniques against their predecessors, with a gap in extensive experimental comparisons. This study begins to address this gap by assessing various RAG methods' impacts on retrieval precision and answer similarity. We found that Hypothetical Document Embedding (HyDE) and LLM reranking significantly enhance retrieval precision. However, Maximal Marginal Relevance (MMR) and Cohere rerank did not exhibit notable advantages over a baseline Naive RAG system, and Multi-query approaches underperformed. Sentence Window Retrieval emerged as the most effective for retrieval precision, despite its variable performance on answer similarity. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications
MethodsAttention Is All You Need · Byte Pair Encoding · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax · WordPiece · Linear Layer · Dense Connections · Attention Dropout · Residual Connection · Linear Warmup With Linear Decay
