CONFLARE: CONFormal LArge language model REtrieval
Pouria Rouzrokh, Shahriar Faghani, Cooper U. Gamble, Moein Shariatnia,, Bradley J. Erickson

TL;DR
This paper introduces CONFLARE, a framework that applies conformal prediction to retrieval-augmented generation, quantifying uncertainty in retrieval to improve trustworthiness of large language models.
Contribution
It proposes a novel four-step conformal prediction framework for RAG, including a Python package for practical implementation without human intervention.
Findings
Provides a method to set similarity score thresholds with specified confidence levels.
Ensures the true answer is included in the retrieved context with high probability.
Offers a practical tool for improving RAG reliability in real-world applications.
Abstract
Retrieval-augmented generation (RAG) frameworks enable large language models (LLMs) to retrieve relevant information from a knowledge base and incorporate it into the context for generating responses. This mitigates hallucinations and allows for the updating of knowledge without retraining the LLM. However, RAG does not guarantee valid responses if retrieval fails to identify the necessary information as the context for response generation. Also, if there is contradictory content, the RAG response will likely reflect only one of the two possible responses. Therefore, quantifying uncertainty in the retrieval process is crucial for ensuring RAG trustworthiness. In this report, we introduce a four-step framework for applying conformal prediction to quantify retrieval uncertainty in RAG frameworks. First, a calibration set of questions answerable from the knowledge base is constructed. Each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · WordPiece · Weight Decay · Byte Pair Encoding · Linear Layer · Dense Connections · Attention Dropout · Residual Connection
