VERITAS: A Unified Approach to Reliability Evaluation
Rajkumar Ramamurthy, Meghana Arakkal Rajeev, Oliver Molenschot, James, Zou, Nazneen Rajani

TL;DR
VERITAS is a versatile hallucination detection framework for large language models that balances accuracy, cost, and latency, achieving state-of-the-art results across multiple benchmarks.
Contribution
It introduces VERITAS, a flexible hallucination detection model that performs well across diverse contexts and approaches GPT-4 turbo's performance with lower costs.
Findings
VERITAS outperforms similar-sized models by 10% on hallucination detection benchmarks.
VERITAS approaches GPT-4 turbo performance in LLM-as-a-judge settings.
The model balances flexibility, cost, and latency effectively.
Abstract
Large language models (LLMs) often fail to synthesize information from their context to generate an accurate response. This renders them unreliable in knowledge intensive settings where reliability of the output is key. A critical component for reliable LLMs is the integration of a robust fact-checking system that can detect hallucinations across various formats. While several open-access fact-checking models are available, their functionality is often limited to specific tasks, such as grounded question-answering or entailment verification, and they perform less effectively in conversational settings. On the other hand, closed-access models like GPT-4 and Claude offer greater flexibility across different contexts, including grounded dialogue verification, but are hindered by high costs and latency. In this work, we introduce VERITAS, a family of hallucination detection models designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Safety Analysis · Software Reliability and Analysis Research
MethodsLinear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Attention Is All You Need · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings
