RAGulator: Lightweight Out-of-Context Detectors for Grounded Text   Generation

Ian Poey; Jiajun Liu; Qishuai Zhong; Adrien Chenailler

arXiv:2411.03920·cs.CL·November 7, 2024

RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation

Ian Poey, Jiajun Liu, Qishuai Zhong, Adrien Chenailler

PDF

Open Access 1 Models

TL;DR

This paper introduces RAGulator, a lightweight model based on DeBERTa, designed to efficiently detect out-of-context LLM outputs in real-time, facilitating safer enterprise deployment of RAG applications.

Contribution

The work presents a resource-efficient, high-performance out-of-context detector using minimal preprocessing, emphasizing practical deployment considerations.

Findings

01

DeBERTa outperforms other models in this task

02

The model is fast and requires no additional feature engineering

03

Effective with minimal resource usage

Abstract

Real-time detection of out-of-context LLM outputs is crucial for enterprises looking to safely adopt RAG applications. In this work, we train lightweight models to discriminate LLM-generated text that is semantically out-of-context from retrieved text documents. We preprocess a combination of summarisation and semantic textual similarity datasets to construct training data using minimal resources. We find that DeBERTa is not only the best-performing model under this pipeline, but it is also fast and does not require additional text preprocessing or feature engineering. While emerging work demonstrates that generative LLMs can also be fine-tuned and used in complex data pipelines to achieve state-of-the-art performance, we note that speed and resource limits are important considerations for on-premise deployment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
ipoeyke/ragulator-deberta-v3-large
model· 3 dl
3 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · ADaptive gradient method with the OPTimal convergence rate · Linear Layer · Softmax · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Layer Normalization · Linear Warmup With Linear Decay · WordPiece