How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open   Models

Jaeyoung Lee; Ximing Lu; Jack Hessel; Faeze Brahman; Youngjae Yu,; Yonatan Bisk; Yejin Choi; Saadia Gabriel

arXiv:2407.00369·cs.CL·July 2, 2024

How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models

Jaeyoung Lee, Ximing Lu, Jack Hessel, Faeze Brahman, Youngjae Yu,, Yonatan Bisk, Yejin Choi, Saadia Gabriel

PDF

Open Access

TL;DR

This paper explores how large multimodal models can be trained for fact verification and misinformation detection without continual updates, using knowledge transfer techniques evaluated on multiple benchmarks.

Contribution

It demonstrates that knowledge transfer strategies can enhance multimodal fact-checking performance on recent benchmarks without ongoing model updates.

Findings

01

Knowledge transfer improves Fakeddit accuracy by up to 1.7%.

02

Knowledge transfer improves Mocheg accuracy by up to 2.9%.

03

Models can benefit from existing benchmarks and explanations without continual retraining.

Abstract

Given the growing influx of misinformation across news and social media, there is a critical need for systems that can provide effective real-time verification of news claims. Large language or multimodal model based verification has been proposed to scale up online policing mechanisms for mitigating spread of false and harmful content. While these can potentially reduce burden on human fact-checkers, such efforts may be hampered by foundation model training data becoming outdated. In this work, we test the limits of improving foundation model performance without continual updating through an initial study of knowledge transfer using either existing intra- and inter- domain benchmarks or explanations generated from large language models (LLMs). We evaluate on 12 public benchmarks for fact-checking and misinformation detection as well as two other tasks relevant to content moderation --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Speech and dialogue systems