Building a Location-Based Set of Social Media Users
Christopher Marks, Tauhid Zaman

TL;DR
This paper presents an iterative expand-classify method using maximum likelihood estimation on a factor graph to efficiently identify and classify social media users by location, enabling situational awareness and social unrest detection.
Contribution
The paper introduces a novel expand-classify approach with a factor graph model for building location-specific social media user sets, improving accuracy and efficiency over existing methods.
Findings
Achieves accurate user collection for locations with less than 500,000 inhabitants.
Collects several thousand users within a few hours.
Effectively detects social unrest through content analysis.
Abstract
In many instances one may want to gain situational awareness in an environment by monitoring the content of local social media users. Often the challenge is how to build a set of users from a target location. Here we introduce a method for building such a set of users by using an \emph{expand-classify} approach which begins with a small set of seed users from the target location and then iteratively collects their neighbors and then classifies their locations. We perform this classification using maximum likelihood estimation on a factor graph model which incorporates features of the user profile and also social network connections. We show that maximum likelihood estimation reduces to solving a minimum cut problem on an appropriately defined graph. We are able to obtain several thousand users within a few hours for many diverse locations using our approach. Using geo-located data, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Complex Network Analysis Techniques · Data-Driven Disease Surveillance
