Machine Learning-based Automatic Annotation and Detection of COVID-19 Fake News
Mohammad Majid Akhtar, Bibhas Sharma, Ishan Karunanayake, Rahat, Masood, Muhammad Ikram, Salil S. Kanhere

TL;DR
This paper presents an automated machine learning approach to detect COVID-19 misinformation on Twitter, incorporating textual, user, and bot activity features to improve accuracy and analyze bot behavior.
Contribution
It introduces a scalable data labeling method using verified fact-checks and combines multiple feature types for enhanced misinformation detection.
Findings
Achieved 82% precision and 96% recall in classification
Bots contributed to approximately 10% of misinformation tweets
Behavior of bots varies over time during misinformation campaigns
Abstract
COVID-19 impacted every part of the world, although the misinformation about the outbreak traveled faster than the virus. Misinformation spread through online social networks (OSN) often misled people from following correct medical practices. In particular, OSN bots have been a primary source of disseminating false information and initiating cyber propaganda. Existing work neglects the presence of bots that act as a catalyst in the spread and focuses on fake news detection in 'articles shared in posts' rather than the post (textual) content. Most work on misinformation detection uses manually labeled datasets that are hard to scale for building their predictive models. In this research, we overcome this challenge of data scarcity by proposing an automated approach for labeling data using verified fact-checked statements on a Twitter dataset. In addition, we combine textual features with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Advanced Malware Detection Techniques
