CoVLM: Leveraging Consensus from Vision-Language Models for   Semi-supervised Multi-modal Fake News Detection

Devank; Jayateja Kalla; Soma Biswas

arXiv:2410.04426·cs.CV·October 8, 2024

CoVLM: Leveraging Consensus from Vision-Language Models for Semi-supervised Multi-modal Fake News Detection

Devank, Jayateja Kalla, Soma Biswas

PDF

Open Access 1 Repo

TL;DR

This paper introduces CoVLM, a semi-supervised framework that leverages vision-language models to detect out-of-context fake news by generating pseudo-labels from unlabeled image-text pairs, reducing reliance on extensive labeled data.

Contribution

The paper presents a novel semi-supervised method that automatically determines thresholds for pseudo-labeling in fake news detection, improving performance with limited labeled data.

Findings

01

Effective pseudo-label generation for unlabeled data

02

Improved detection accuracy over state-of-the-art methods

03

Robustness across challenging datasets

Abstract

In this work, we address the real-world, challenging task of out-of-context misinformation detection, where a real image is paired with an incorrect caption for creating fake news. Existing approaches for this task assume the availability of large amounts of labeled data, which is often impractical in real-world, since it requires extensive manual intervention and domain expertise. In contrast, since obtaining a large corpus of unlabeled image-text pairs is much easier, here, we propose a semi-supervised protocol, where the model has access to a limited number of labeled image-text pairs and a large corpus of unlabeled pairs. Additionally, the occurrence of fake news being much lesser compared to the real ones, the datasets tend to be highly imbalanced, thus making the task even more challenging. Towards this goal, we propose a novel framework, Consensus from Vision-Language Models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

devank3/CoVLM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Sentiment Analysis and Opinion Mining