Word frequency and sentiment analysis of twitter messages during Coronavirus pandemic
Nikhil Kumar Rajput, Bhavya Ahuja Grover, Vipin Kumar Rathi, Riya, Bansal

TL;DR
This study analyzes Twitter messages during the COVID-19 pandemic using word frequency and sentiment analysis, revealing dominant neutral sentiment and modeling word usage with power law distributions.
Contribution
It introduces a combined statistical and sentiment analysis approach specifically applied to COVID-19 related Twitter data, with validation of word frequency models.
Findings
Majority of tweets are neutral in sentiment
Negative sentiment accounts for only 2.57% of tweets
Word usage follows a power law distribution
Abstract
The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and information sharing. This paper deals by analyzing a vast dataset of Twitter messages related to this disease, starting from January 2020. Two approaches were used: a statistical analysis of word frequencies and a sentiment analysis to gauge user attitudes. Word frequencies are modeled using unigrams, bigrams, and trigrams, with power law distribution as the fitting model. The validity of the model is confirmed through metrics like Sum of Squared Errors (SSE), R-squared (), and Root Mean Squared Error (RMSE). High and low SSE/RMSE values indicate a good fit for the model. Sentiment analysis is conducted to understand the general emotional tone of Twitter users messages. The results reveal that a majority of tweets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Digital Marketing and Social Media · Mental Health via Writing
MethodsStochastic Steady-state Embedding
