Model Generalization on COVID-19 Fake News Detection

Yejin Bang; Etsuko Ishii; Samuel Cahyawijaya; Ziwei Ji; Pascale Fung

arXiv:2101.03841·cs.CL·January 12, 2021·6 cites

Model Generalization on COVID-19 Fake News Detection

Yejin Bang, Etsuko Ishii, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung

PDF

Open Access

TL;DR

This paper develops robust transformer-based models for COVID-19 fake news detection, emphasizing model generalization across different datasets and proposing methods like robust loss functions and influence-based data cleansing.

Contribution

It introduces two approaches—robust fine-tuning and influence-based data cleansing—to improve model robustness and generalization in COVID-19 fake news detection.

Findings

01

Achieved 98.13% W-F1 on shared task

02

Attained 54.33% W-F1 on external test set with data cleansing

03

Highlighted the importance of model generalization in fake news detection

Abstract

Amid the pandemic COVID-19, the world is facing unprecedented infodemic with the proliferation of both fake and real information. Considering the problematic consequences that the COVID-19 fake-news have brought, the scientific community has put effort to tackle it. To contribute to this fight against the infodemic, we aim to achieve a robust model for the COVID-19 fake-news detection task proposed at CONSTRAINT 2021 (FakeNews-19) by taking two separate approaches: 1) fine-tuning transformers based language models with robust loss functions and 2) removing harmful training instances through influence calculation. We further evaluate the robustness of our models by evaluating on different COVID-19 misinformation test set (Tweets-19) to understand model generalization ability. With the first approach, we achieve 98.13% for weighted F1 score (W-F1) for the shared task, whereas 38.18% W-F1…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Topic Modeling