Automatic Detection of Fake News

Ver\'onica P\'erez-Rosas; Bennett Kleinberg; Alexandra Lefevre; Rada; Mihalcea

arXiv:1708.07104·cs.CL·August 24, 2017·377 cites

Automatic Detection of Fake News

Ver\'onica P\'erez-Rosas, Bennett Kleinberg, Alexandra Lefevre, Rada, Mihalcea

PDF

Open Access

TL;DR

This paper introduces new datasets and machine learning experiments for automatically detecting fake news across multiple domains, addressing the challenge of misinformation in online media.

Contribution

It provides two novel datasets for fake news detection and explores linguistic differences, along with experiments on building and analyzing fake news classifiers.

Findings

01

New datasets covering seven news domains

02

Linguistic differences between fake and legitimate news identified

03

Effective fake news detection models developed

Abstract

The proliferation of misleading information in everyday access media outlets such as social media feeds, news blogs, and online newspapers have made it challenging to identify trustworthy news sources, thus increasing the need for computational tools able to provide insights into the reliability of online content. In this paper, we focus on the automatic identification of fake content in online news. Our contribution is twofold. First, we introduce two novel datasets for the task of fake news detection, covering seven different news domains. We describe the collection, annotation, and validation process in detail and present several exploratory analysis on the identification of linguistic differences in fake and legitimate news content. Second, we conduct a set of learning experiments to build accurate fake news detectors. In addition, we provide comparative analyses of the automatic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Authorship Attribution and Profiling