CONFLARE: CONFormal LArge language model REtrieval

Pouria Rouzrokh; Shahriar Faghani; Cooper U. Gamble; Moein Shariatnia,; Bradley J. Erickson

arXiv:2404.04287·cs.CL·April 9, 2024·3 cites

CONFLARE: CONFormal LArge language model REtrieval

Pouria Rouzrokh, Shahriar Faghani, Cooper U. Gamble, Moein Shariatnia,, Bradley J. Erickson

PDF

Open Access 1 Repo

TL;DR

This paper introduces CONFLARE, a framework that applies conformal prediction to retrieval-augmented generation, quantifying uncertainty in retrieval to improve trustworthiness of large language models.

Contribution

It proposes a novel four-step conformal prediction framework for RAG, including a Python package for practical implementation without human intervention.

Findings

01

Provides a method to set similarity score thresholds with specified confidence levels.

02

Ensures the true answer is included in the retrieved context with high probability.

03

Offers a practical tool for improving RAG reliability in real-world applications.

Abstract

Retrieval-augmented generation (RAG) frameworks enable large language models (LLMs) to retrieve relevant information from a knowledge base and incorporate it into the context for generating responses. This mitigates hallucinations and allows for the updating of knowledge without retraining the LLM. However, RAG does not guarantee valid responses if retrieval fails to identify the necessary information as the context for response generation. Also, if there is contradictory content, the RAG response will likely reflect only one of the two possible responses. Therefore, quantifying uncertainty in the retrieval process is crucial for ensuring RAG trustworthiness. In this report, we introduce a four-step framework for applying conformal prediction to quantify retrieval uncertainty in RAG frameworks. First, a calibration set of questions answerable from the knowledge base is constructed. Each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mayo-radiology-informatics-lab/conflare
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · WordPiece · Weight Decay · Byte Pair Encoding · Linear Layer · Dense Connections · Attention Dropout · Residual Connection