Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach
Jia Xue (University of Toronto), Junxiang Chen (University of, Pittsburgh), Ran Hu (University of Toronto), Chen Chen (University of, Toronto), ChengDa Zheng (University of Toronto), Xiaoqian Liu (Chinese, Academy of Sciences), Tingshao Zhu (China Academy of Science)

TL;DR
This study analyzes 4 million COVID-19 related tweets using machine learning to identify key topics, sentiments, and themes, providing insights into public concerns and emotions during the early pandemic period.
Contribution
It applies Latent Dirichlet Allocation to large-scale Twitter data to uncover dominant discussion topics and sentiments related to COVID-19, demonstrating the utility of social media analysis for public health monitoring.
Findings
Identified 13 discussion topics and 5 themes in COVID-19 tweets.
Dominant sentiments include anticipation, trust, anger, and fear.
Public fear is heightened around new cases and deaths.
Abstract
The objective of the study is to examine coronavirus disease (COVID-19) related discussions, concerns, and sentiments that emerged from tweets posted by Twitter users. We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 1 to April 21 in 2020. We use a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigram, bigrams, salient topics and themes, and sentiments in the collected Tweets. Popular unigrams include "virus," "lockdown," and "quarantine." Popular bigrams include "COVID-19," "stay home," "corona virus," "social distancing," and "new cases." We identify 13 discussion topics and categorize them into five different themes, such as "public health measures to slow the spread of COVID-19," "social stigma associated with COVID-19," "coronavirus news…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Data-Driven Disease Surveillance · Public Relations and Crisis Communication
