A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics
Harry Wang, Sharath Chandra Guntuku

TL;DR
This paper introduces a weakly-supervised, graph-based method leveraging BERT embeddings to efficiently identify COVID-19 misinformation topics from social media, reducing manual effort and improving detection accuracy.
Contribution
It presents novel BERT-based graph search algorithms (BWGS and BMDWGS) for misinformation detection, enabling effective topic extraction with minimal supervision.
Findings
Effective in low-resource settings
Outperforms baseline methods
Identifies prevalent misconceptions
Abstract
The COVID-19 pandemic has been accompanied by an `infodemic' -- of accurate and inaccurate health information across social media. Detecting misinformation amidst dynamically changing information landscape is challenging; identifying relevant keywords and posts is arduous due to the large amount of human effort required to inspect the content and sources of posts. We aim to reduce the resource cost of this process by introducing a weakly-supervised iterative graph-based approach to detect keywords, topics, and themes related to misinformation, with a focus on COVID-19. Our approach can successfully detect specific topics from general misinformation-related seed words in a few seed texts. Our approach utilizes the BERT-based Word Graph Search (BWGS) algorithm that builds on context-based neural network embeddings for retrieving misinformation-related posts. We utilize Latent Dirichlet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Sentiment Analysis and Opinion Mining
