Fighting Fire with Fire: Adversarial Prompting to Generate a   Misinformation Detection Dataset

Shrey Satapara; Parth Mehta; Debasis Ganguly; Sandip Modha

arXiv:2401.04481·cs.CL·January 10, 2024·1 cites

Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset

Shrey Satapara, Parth Mehta, Debasis Ganguly, Sandip Modha

PDF

Open Access

TL;DR

This paper introduces an LLM-based method to automatically generate a misinformation detection dataset by creating summaries with controlled factual errors, facilitating scalable training of detection models.

Contribution

It proposes a novel prompting technique to generate labeled misinformation data from trusted news articles, reducing manual annotation efforts.

Findings

01

Generated datasets improve misinformation detection accuracy

02

Controlled factual errors enhance model training

03

LLM-based dataset creation is scalable and effective

Abstract

The recent success in language generation capabilities of large language models (LLMs), such as GPT, Bard, Llama etc., can potentially lead to concerns about their possible misuse in inducing mass agitation and communal hatred via generating fake news and spreading misinformation. Traditional means of developing a misinformation ground-truth dataset does not scale well because of the extensive manual effort required to annotate the data. In this paper, we propose an LLM-based approach of creating silver-standard ground-truth datasets for identifying misinformation. Specifically speaking, given a trusted news article, our proposed approach involves prompting LLMs to automatically generate a summarised version of the original article. The prompts in our proposed approach act as a controlling mechanism to generate specific types of factual incorrectness in the generated summaries, e.g.,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Natural Language Processing Techniques

MethodsSparse Evolutionary Training · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Discriminative Fine-Tuning · Attention Dropout · Softmax · Dropout