LettuceDetect: A Hallucination Detection Framework for RAG Applications
\'Ad\'am Kov\'acs, G\'abor Recski

TL;DR
LettuceDetect is a novel hallucination detection framework for RAG systems that leverages extended context models, outperforming previous methods in accuracy and efficiency, suitable for real-world deployment.
Contribution
It introduces LettuceDetect, a token-classification model using ModernBERT with extended context, significantly improving hallucination detection in RAG systems.
Findings
Achieves 79.22% F1 score on RAGTruth benchmark.
Outperforms previous encoder-based models by 14.8%.
Processes 30-60 examples per second on a single GPU.
Abstract
Retrieval Augmented Generation (RAG) systems remain vulnerable to hallucinated answers despite incorporating external knowledge sources. We present LettuceDetect a framework that addresses two critical limitations in existing hallucination detection methods: (1) the context window constraints of traditional encoder-based methods, and (2) the computational inefficiency of LLM based approaches. Building on ModernBERT's extended context capabilities (up to 8k tokens) and trained on the RAGTruth benchmark dataset, our approach outperforms all previous encoder-based models and most prompt-based models, while being approximately 30 times smaller than the best models. LettuceDetect is a token-classification model that processes context-question-answer triples, allowing for the identification of unsupported claims at the token level. Evaluations on the RAGTruth corpus demonstrate an F1 score of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗KRLabsOrg/lettucedect-base-modernbert-en-v1model· 4.8k dl· ♡ 174.8k dl♡ 17
- 🤗KRLabsOrg/lettucedect-large-modernbert-en-v1model· 858 dl· ♡ 29858 dl♡ 29
- 🤗KRLabsOrg/lettucedect-210m-eurobert-de-v1model· 8 dl8 dl
- 🤗KRLabsOrg/lettucedect-610m-eurobert-de-v1model· 19 dl· ♡ 119 dl♡ 1
- 🤗KRLabsOrg/lettucedect-210m-eurobert-cn-v1model· 168 dl168 dl
- 🤗KRLabsOrg/lettucedect-610m-eurobert-cn-v1model· 3 dl· ♡ 13 dl♡ 1
- 🤗KRLabsOrg/lettucedect-610m-eurobert-fr-v1model· 3 dl· ♡ 13 dl♡ 1
- 🤗KRLabsOrg/lettucedect-210m-eurobert-fr-v1model· 3 dl· ♡ 13 dl♡ 1
- 🤗KRLabsOrg/lettucedect-210m-eurobert-it-v1model· 2 dl2 dl
- 🤗KRLabsOrg/lettucedect-610m-eurobert-it-v1model· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychosomatic Disorders and Their Treatments · Functional Brain Connectivity Studies · Psychedelics and Drug Studies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Linear Layer · Layer Normalization · Byte Pair Encoding · WordPiece · Dense Connections · Attention Dropout · Residual Connection
