On two existing approaches to statistical analysis of social media data
Martina Patone, Li-Chun Zhang

TL;DR
This paper reviews two main approaches for analyzing social media data statistically, addressing challenges like data-object mismatch and indirect measures, and discusses their applications and limitations.
Contribution
It systematically compares two existing social media analysis approaches, clarifying their methodologies, data quality issues, and practical challenges.
Findings
Analysis approach 1 focuses on social media objects directly observed.
Analysis approach 2 uses pseudo survey datasets to approximate population units.
The paper highlights key data quality challenges in social media analysis.
Abstract
Using social media data for statistical analysis of general population faces commonly two basic obstacles: firstly, social media data are collected for different objects than the population units of interest; secondly, the relevant measures are typically not available directly but need to be extracted by algorithms or machine learning techniques. In this paper we examine and summarise two existing approaches to statistical analysis based on social media data, which can be discerned in the literature. In the first approach, analysis is applied to the social media data that are organised around the objects directly observed in the data; in the second one, a different analysis is applied to a constructed pseudo survey dataset, aimed to transform the observed social media data to a set of units from the target population. We elaborate systematically the relevant data quality frameworks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Human Mobility and Location-Based Analysis · Opinion Dynamics and Social Influence
