Detection of fake news on CoViD-19 on Web Search Engines

V. Mazzeo; A. Rapisarda; G. Giuffrida

arXiv:2103.11804·cs.LG·June 8, 2021·1 cites

Detection of fake news on CoViD-19 on Web Search Engines

V. Mazzeo, A. Rapisarda, G. Giuffrida

PDF

Open Access

TL;DR

This paper presents a machine learning approach to detect fake news related to COVID-19 on web search engines by analyzing textual content and URL features, demonstrating improved classification performance.

Contribution

It introduces a combined feature-based method using textual and URL features to enhance fake news detection on search engine results during the COVID-19 pandemic.

Findings

01

Lexical and host-based URL features improve detection accuracy

02

Resampling techniques address class imbalance effectively

03

Combined features outperform individual feature sets in classification

Abstract

In early January 2020, after China reported the first cases of the new coronavirus (SARS-CoV-2) in the city of Wuhan, unreliable and not fully accurate information has started spreading faster than the virus itself. Alongside this pandemic, people have experienced a parallel infodemic, i.e., an overabundance of information, some of which misleading or even harmful, that has widely spread around the globe. Although Social Media are increasingly being used as information source, Web Search Engines, like Google or Yahoo!, still represent a powerful and trustworthy resource for finding information on the Web. This is due to their capability to capture the largest amount of information, helping users quickly identify the most relevant, useful, although not always the most reliable, results for their search queries. This study aims to detect potential misleading and fake contents by capturing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining