Exploring Text Representations for Online Misinformation
Martins Samuel Dogo

TL;DR
This paper introduces novel textual features and methods for detecting online misinformation, leveraging thematic disparities and clustering to improve fake news identification using machine learning and NLP techniques.
Contribution
It presents a new approach for extracting thematic features from news articles and demonstrates the effectiveness of topic-based classification and clustering for misinformation detection.
Findings
Thematic features significantly improve fake news detection accuracy.
Clustering reduces the need for labeled datasets in misinformation detection.
Topic features enhance understanding of misinformation patterns.
Abstract
Mis- and disinformation, commonly collectively called fake news, continue to menace society. Perhaps, the impact of this age-old problem is presently most plain in politics and healthcare. However, fake news is affecting an increasing number of domains. It takes many different forms and continues to shapeshift as technology advances. Though it arguably most widely spreads in textual form, e.g., through social media posts and blog articles. Thus, it is imperative to thwart the spread of textual misinformation, which necessitates its initial detection. This thesis contributes to the creation of representations that are useful for detecting misinformation. Firstly, it develops a novel method for extracting textual features from news articles for misinformation detection. These features harness the disparity between the thematic coherence of authentic and false news stories. In other words,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection
