MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Liyan Tang; Philippe Laban; Greg Durrett

arXiv:2404.10774·cs.CL·October 2, 2024·3 cites

TL;DR

MiniCheck introduces small, cost-effective fact-checking models trained on synthetic data that achieve GPT-4-level accuracy in verifying LLM outputs against grounding documents, significantly reducing computational costs.

Contribution

This work presents a method to create efficient fact-checking models using synthetic training data, enabling high performance at a fraction of the usual computational expense.

Findings

01 MiniCheck-FT5 (770M parameters) outperforms all systems of comparable size.

02 Reaches GPT-4-level accuracy on fact-checking tasks.

03 Costs 400x less than GPT-4-based checking.

Abstract

Recognizing if LLM output can be grounded in evidence is central to many tasks in NLP: retrieval-augmented generation, summarization, document-grounded dialogue, and more. Current approaches to this kind of fact-checking are based on verifying each piece of a model generation against potential evidence using an LLM. However, this process can be very computationally expensive, requiring many calls to a model to check a single response. In this work, we show how to build small fact-checking models that have GPT-4-level performance but for 400x lower cost. We do this by constructing synthetic training data with GPT-4, which involves creating realistic yet challenging instances of factual errors via a structured generation procedure. Training on this data teaches models to check each fact in the claim and recognize synthesis of information across sentences. For evaluation, we unify datasets…
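The checking loop described in the abstract, verifying each piece of a generation against potential evidence, can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the helper names (`split_sentences`, `chunk_document`, `check_response`), the 50-word chunk size, the 0.5 threshold, and the max-over-chunks aggregation are all assumptions made for the example. The scorer `overlap_score` is a toy lexical stand-in so that the block runs without any model; in practice it would be replaced by a trained checker that returns a support probability for a (document chunk, claim) pair.

```python
import re
from typing import Callable, List, Tuple


def split_sentences(text: str) -> List[str]:
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def chunk_document(document: str, max_words: int = 50) -> List[str]:
    """Split the grounding document into consecutive chunks of at most max_words words."""
    words = document.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def _tokens(text: str) -> set:
    """Lowercased alphanumeric tokens of a string."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_score(chunk: str, claim: str) -> float:
    """Toy stand-in scorer (an assumption, not a real checker):
    fraction of the claim's tokens that also appear in the chunk."""
    claim_tokens = _tokens(claim)
    if not claim_tokens:
        return 0.0
    return len(claim_tokens & _tokens(chunk)) / len(claim_tokens)


def check_response(
    document: str,
    response: str,
    score_fn: Callable[[str, str], float] = overlap_score,
    threshold: float = 0.5,
) -> List[Tuple[str, float, bool]]:
    """Score every sentence of the response against every document chunk.

    A sentence counts as supported if its best-scoring chunk reaches the threshold.
    Returns (sentence, best_score, supported) triples.
    """
    chunks = chunk_document(document)
    results = []
    for claim in split_sentences(response):
        best = max((score_fn(chunk, claim) for chunk in chunks), default=0.0)
        results.append((claim, best, best >= threshold))
    return results


if __name__ == "__main__":
    doc = "The Eiffel Tower is in Paris. It was completed in 1889."
    resp = "The Eiffel Tower is in Paris. It was built by aliens on Mars."
    for sentence, score, supported in check_response(doc, resp):
        print(f"{supported!s:5}  {score:.2f}  {sentence}")
```

The number of scorer calls grows with (sentences in the response) x (chunks in the document), which is why the per-call cost of the checker matters so much when the scorer is a large LLM.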

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Data Quality and Management

Methods: Attention Is All You Need · Dropout · Adam · Position-Wise Feed-Forward Layer · Linear Layer · Layer Normalization · Byte Pair Encoding · Absolute Position Encodings · Multi-Head Attention · Dense Connections