Automated Claim Matching with Large Language Models: Empowering   Fact-Checkers in the Fight Against Misinformation

Eun Cheol Choi; Emilio Ferrara

arXiv:2310.09223·cs.CL·October 16, 2023·2 cites

Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation

Eun Cheol Choi, Emilio Ferrara

PDF

Open Access

TL;DR

This paper presents FACT-GPT, a framework that uses large language models to automate claim matching in fact-checking, improving efficiency and supporting fact-checkers against misinformation with publicly available datasets and models.

Contribution

Introduces FACT-GPT, a novel LLM-based framework for automating claim matching in fact-checking, with fine-tuning techniques and publicly released resources.

Findings

01

Fine-tuned LLMs match larger pre-trained models in claim matching

02

The framework closely aligns with human annotations

03

Provides datasets and models for further research

Abstract

In today's digital era, the rapid spread of misinformation poses threats to public well-being and societal trust. As online misinformation proliferates, manual verification by fact checkers becomes increasingly challenging. We introduce FACT-GPT (Fact-checking Augmentation with Claim matching Task-oriented Generative Pre-trained Transformer), a framework designed to automate the claim matching phase of fact-checking using Large Language Models (LLMs). This framework identifies new social media content that either supports or contradicts claims previously debunked by fact-checkers. Our approach employs GPT-4 to generate a labeled dataset consisting of simulated social media posts. This data set serves as a training ground for fine-tuning more specialized LLMs. We evaluated FACT-GPT on an extensive dataset of social media content related to public health. The results indicate that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Hate Speech and Cyberbullying Detection

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Byte Pair Encoding · Label Smoothing · Softmax · Residual Connection · Absolute Position Encodings · Layer Normalization