Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
Julia Wu, Venkatesh Sivaraman, Dheekshita Kumar, Juan M. Banda and, David Sontag

TL;DR
This paper introduces an unsupervised, iterative method for extracting clinically relevant information from social media data during health emergencies, enabling rapid identification of emerging topics without manual labeling.
Contribution
The authors develop a novel unsupervised approach combining heuristic filtering, topic modeling, and concept extraction to identify relevant clinical content from large-scale social media data.
Findings
Successfully extracted relevant clinical topics from 52 million tweets.
Identified emerging COVID-19 topics on a weekly basis.
Facilitated knowledge transfer among healthcare professionals.
Abstract
The rapid evolution of the COVID-19 pandemic has underscored the need to quickly disseminate the latest clinical knowledge during a public-health emergency. One surprisingly effective platform for healthcare professionals (HCPs) to share knowledge and experiences from the front lines has been social media (for example, the "#medtwitter" community on Twitter). However, identifying clinically-relevant content in social media without manual labeling is a challenge because of the sheer volume of irrelevant data. We present an unsupervised, iterative approach to mine clinically relevant information from social media data, which begins by heuristically filtering for HCP-authored texts and incorporates topic modeling and concept extraction with MetaMap. This approach identifies granular topics and tweets with high clinical relevance from a set of about 52 million COVID-19-related tweets from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Social Media in Health Education · Misinformation and Its Impacts
