#Santiago is not #Chile, or is it? A Model to Normalize Social Media Impact
Eduardo Graells-Garrido, Barbara Poblete

TL;DR
This paper presents a methodology to analyze geographic biases in social media content, specifically Twitter data from Chile, revealing centralism and regional representation during a national election.
Contribution
It introduces a vector space model-based approach to classify micro-posts by location, highlighting social media's geographic biases and providing a tool for normalization.
Findings
Classifiers outperform non-geographically-diverse baseline at regional level
Virtual population on Twitter is spatially representative of the physical population
Approach shows potential but needs testing on broader datasets
Abstract
Online social networks are known to be demographically biased. Currently there are questions about what degree of representativity of the physical population they have, and how population biases impact user-generated content. In this paper we focus on centralism, a problem affecting Chile. Assuming that local differences exist in a country, in terms of vocabulary, we built a methodology based on the vector space model to find distinctive content from different locations, and use it to create classifiers to predict whether the content of a micro-post is related to a particular location, having in mind a geographically diverse selection of micro-posts. We evaluate them in a case study where we analyze the virtual population of Chile that participated in the Twitter social network during an event of national relevance: the municipal (local governments) elections held in 2012. We observe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Social Media and Politics · Social Capital and Networks
