COVID-19 Infodemic. Understanding content features in detecting fake news using a machine learning approach

Vimala Balakrishnan; Lee Zing Hii; Eric Laporte

arXiv:2605.06435·cs.CL·May 11, 2026

COVID-19 Infodemic. Understanding content features in detecting fake news using a machine learning approach

Vimala Balakrishnan, Lee Zing Hii, Eric Laporte

PDF

TL;DR

This study explores how textual and linguistic content features can be effectively used with traditional machine learning models to detect fake news during the COVID-19 pandemic.

Contribution

It investigates the impact of content features like bigrams and POS tags on fake news detection, highlighting their effectiveness with classical ML algorithms.

Findings

01

Random Forest achieved the highest accuracy among tested models.

02

Textual and linguistic features individually improve fake news detection.

03

Combining textual and linguistic features did not significantly enhance detection performance.

Abstract

The use of content features, particularly textual and linguistic for fake news detection is under-researched, despite empirical evidence showing the features could contribute to differentiating real and fake news. To this end, this study investigates a selection of content features such as word bigrams, part of speech distribution etc. to improve fake news detection. We performed a series of experiments on a new dataset gathered during the COVID-19 pandemic and using Decision Tree, K-Nearest Neighbor, Logistic Regression, Support Vector Machine and Random Forest. Random Forest yielded the best results, followed closely by Support Vector Machine, across all setups. In general, both the textual and linguistic features were found to improve fake news detection when used separately, however, combining them into a single model did not improve the detection significantly. Differences were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.