NewsScope: Schema-Grounded Cross-Domain News Claim Extraction with Open Models

Nidhi Pandya

arXiv:2601.08852·cs.CL·January 15, 2026

NewsScope: Schema-Grounded Cross-Domain News Claim Extraction with Open Models

Nidhi Pandya

PDF

Open Access

TL;DR

NewsScope introduces a new dataset, benchmark, and fine-tuned open model for schema-grounded news claim extraction across multiple domains, demonstrating high accuracy and generalization capabilities with practical deployment costs.

Contribution

The paper presents NewsScope, a novel cross-domain dataset, benchmark, and fine-tuned LLaMA-based model for structured news claim extraction with improved accuracy and generalization.

Findings

01

Achieves 89.4% human-evaluated accuracy on claim extraction.

02

Outperforms GPT-4o-mini on political claims with 94.3% accuracy.

03

Grounding filter increases accuracy to 91.6%, reducing the gap to state-of-the-art.

Abstract

Automated news verification requires structured claim extraction, but existing approaches either lack schema compliance or generalize poorly across domains. This paper presents NewsScope, a cross-domain dataset, benchmark, and fine-tuned model for schema-grounded news claim extraction. The dataset contains 455 articles across politics, health, science/environment, and business, consisting of 395 in-domain articles and 60 out-of-source articles for generalization testing. LLaMA 3.1 8B was fine-tuned using LoRA on 315 training examples and evaluated on held-out in-domain (80 articles) and out-of-source (60 articles) test sets. Human evaluation on 400 claims shows NewsScope achieves 89.4% human-evaluated accuracy compared to GPT-4o-mini's 93.7% (p=0.07). NewsScope outperforms GPT-4o-mini on political claims (94.3% vs. 87.8%). A numeric grounding filter further improves accuracy to 91.6%,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Computational and Text Analysis Methods