Exploring Text Representations for Online Misinformation

Martins Samuel Dogo

arXiv:2412.18618·cs.CL·December 30, 2024

Exploring Text Representations for Online Misinformation

Martins Samuel Dogo

PDF

Open Access

TL;DR

This paper introduces novel textual features and methods for detecting online misinformation, leveraging thematic disparities and clustering to improve fake news identification using machine learning and NLP techniques.

Contribution

It presents a new approach for extracting thematic features from news articles and demonstrates the effectiveness of topic-based classification and clustering for misinformation detection.

Findings

01

Thematic features significantly improve fake news detection accuracy.

02

Clustering reduces the need for labeled datasets in misinformation detection.

03

Topic features enhance understanding of misinformation patterns.

Abstract

Mis- and disinformation, commonly collectively called fake news, continue to menace society. Perhaps, the impact of this age-old problem is presently most plain in politics and healthcare. However, fake news is affecting an increasing number of domains. It takes many different forms and continues to shapeshift as technology advances. Though it arguably most widely spreads in textual form, e.g., through social media posts and blog articles. Thus, it is imperative to thwart the spread of textual misinformation, which necessitates its initial detection. This thesis contributes to the creation of representations that are useful for detecting misinformation. Firstly, it develops a novel method for extracting textual features from news articles for misinformation detection. These features harness the disparity between the thematic coherence of authentic and false news stories. In other words,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection