SFR-RAG: Towards Contextually Faithful LLMs

Xuan-Phi Nguyen; Shrey Pandit; Senthil Purushwalkam; Austin Xu; Hailin; Chen; Yifei Ming; Zixuan Ke; Silvio Savarese; Caiming Xong; Shafiq Joty

arXiv:2409.09916·cs.CL·September 17, 2024·2 cites

SFR-RAG: Towards Contextually Faithful LLMs

Xuan-Phi Nguyen, Shrey Pandit, Senthil Purushwalkam, Austin Xu, Hailin, Chen, Yifei Ming, Zixuan Ke, Silvio Savarese, Caiming Xong, Shafiq Joty

PDF

Open Access

TL;DR

This paper introduces SFR-RAG, a small instruction-tuned LLM designed for contextually faithful generation, minimizing hallucinations, and evaluated through a new comprehensive benchmarking framework called ContextualBench.

Contribution

We develop SFR-RAG, a small LLM with enhanced contextual understanding and hallucination reduction, and establish ContextualBench for consistent evaluation of RAG models.

Findings

01

SFR-RAG-9B outperforms larger models on multiple benchmarks.

02

The model is resilient to contextual alterations.

03

Maintains competitive performance in instruction-following and function-calling.

Abstract

Retrieval Augmented Generation (RAG), a paradigm that integrates external contextual information with large language models (LLMs) to enhance factual accuracy and relevance, has emerged as a pivotal area in generative AI. The LLMs used in RAG applications are required to faithfully and completely comprehend the provided context and users' questions, avoid hallucination, handle unanswerable, counterfactual or otherwise low-quality and irrelevant contexts, perform complex multi-hop reasoning and produce reliable citations. In this paper, we introduce SFR-RAG, a small LLM that is instruction-tuned with an emphasis on context-grounded generation and hallucination minimization. We also present ContextualBench, a new evaluation framework compiling multiple popular and diverse RAG benchmarks, such as HotpotQA and TriviaQA, with consistent RAG settings to ensure reproducibility and consistency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLibrary Science and Information Systems · Digital Rights Management and Security · Mathematics, Computing, and Information Processing

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Layer Normalization · WordPiece · Dropout · Attention Dropout · BART · Dense Connections