CaseFacts: A Benchmark for Legal Fact-Checking and Precedent Retrieval

Akshith Reddy Putta; Jacob Devasier; Chengkai Li

arXiv:2601.17230·cs.CL·April 21, 2026

CaseFacts: A Benchmark for Legal Fact-Checking and Precedent Retrieval

Akshith Reddy Putta, Jacob Devasier, Chengkai Li

PDF

TL;DR

CaseFacts is a new benchmark dataset designed to evaluate legal fact-checking and precedent retrieval systems by challenging them to verify claims against U.S. Supreme Court cases, considering semantic and temporal complexities.

Contribution

This paper introduces CaseFacts, a novel benchmark dataset for legal fact-checking that incorporates complex claim synthesis and a semantic similarity heuristic for verifying legal overrulings.

Findings

01

State-of-the-art LLMs find the task challenging.

02

Web search augmentation degrades performance due to noisy data.

03

The dataset enables research into legal fact verification systems.

Abstract

Automated Fact-Checking has largely focused on verifying general knowledge against static corpora, overlooking high-stakes domains like law where truth is evolving and technically complex. We introduce CaseFacts, a benchmark for verifying colloquial legal claims against U.S. Supreme Court precedents. Unlike existing resources that map formal texts to formal texts, CaseFacts challenges systems to bridge the semantic gap between layperson assertions and technical jurisprudence while accounting for temporal validity. The dataset consists of 6,294 claims categorized as Supported, Refuted, or Overruled. We construct this benchmark using a multi-stage pipeline that leverages Large Language Models (LLMs) to synthesize claims from expert case summaries, employing a novel semantic similarity heuristic to efficiently identify and verify complex legal overrulings. Experiments with state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.