FactLens: Benchmarking Fine-Grained Fact Verification

Kushan Mitra; Dan Zhang; Sajjadur Rahman; Estevam Hruschka

arXiv:2411.05980·cs.CL·June 3, 2025

FactLens: Benchmarking Fine-Grained Fact Verification

Kushan Mitra, Dan Zhang, Sajjadur Rahman, Estevam Hruschka

PDF

Open Access 1 Video

TL;DR

FactLens introduces a benchmark for fine-grained fact verification that breaks down complex claims into sub-claims, enabling more precise error detection and transparency in verifying LLM outputs.

Contribution

This paper presents FactLens, a novel benchmark with metrics and evaluators for assessing the quality of sub-claims in fine-grained fact verification tasks.

Findings

01

Automated evaluators align well with human judgments.

02

Sub-claim characteristics significantly affect verification performance.

03

High-quality, manually curated ground truth data enhances benchmark reliability.

Abstract

Large Language Models (LLMs) have shown impressive capability in language generation and understanding, but their tendency to hallucinate and produce factually incorrect information remains a key limitation. To verify LLM-generated contents and claims from other sources, traditional verification approaches often rely on holistic models that assign a single factuality label to complex claims, potentially obscuring nuanced errors. In this paper, we advocate for a shift towards fine-grained verification, where complex claims are broken down into smaller sub-claims for individual verification, allowing for more precise identification of inaccuracies, improved transparency, and reduced ambiguity in evidence retrieval. However, generating sub-claims poses challenges, such as maintaining context and ensuring semantic equivalence with respect to the original claim. We introduce FactLens, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FactLens: Benchmarking Fine-Grained Fact Verification· underline

Taxonomy

TopicsSemantic Web and Ontologies · Scientific Computing and Data Management · Biomedical Text Mining and Ontologies