ClaimPT: A Portuguese Dataset of Annotated Claims in News Articles
Ricardo Campos, Raquel Sequeira, Sara Nerea, In\^es Cantante, Diogo Folques, Lu\'is Filipe Cunha, Jo\~ao Canavilhas, Ant\'onio Branco, Al\'ipio Jorge, S\'ergio Nunes, Nuno Guimar\~aes, Purifica\c{c}\~ao Silvano

TL;DR
ClaimPT is a new annotated dataset of Portuguese news articles focused on factual claims, designed to support automated fact-checking and misinformation research in low-resource languages.
Contribution
This paper introduces ClaimPT, the first Portuguese news claim dataset with high-quality annotations, and provides baseline models for claim detection.
Findings
ClaimPT contains 1,308 articles and 6,875 claim annotations.
Baseline models establish initial benchmarks for claim detection in Portuguese news.
The dataset supports future NLP applications in fact-checking and misinformation detection.
Abstract
Fact-checking remains a demanding and time-consuming task, still largely dependent on manual verification and unable to match the rapid spread of misinformation online. This is particularly important because debunking false information typically takes longer to reach consumers than the misinformation itself; accelerating corrections through automation can therefore help counter it more effectively. Although many organizations perform manual fact-checking, this approach is difficult to scale given the growing volume of digital content. These limitations have motivated interest in automating fact-checking, where identifying claims is a crucial first step. However, progress has been uneven across languages, with English dominating due to abundant annotated data. Portuguese, like other languages, still lacks accessible, licensed datasets, limiting research, NLP developments and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
