How Do People Differ? A Social Media Approach
Vincent Wong, Yaneer Bar-Yam

TL;DR
This study analyzes Reddit text data to identify behavioral patterns and differences among users, revealing how pronoun usage relates to discussion topics and overall user heterogeneity.
Contribution
It introduces a methodology combining dimension reduction and linguistic analysis to characterize human behavioral heterogeneity on social media.
Findings
Pronouns characterize major behavioral dimensions.
Pronoun patterns overlap with discussion topics.
Patterns reveal relationships between user attributes and language use.
Abstract
Research from a variety of fields including psychology and linguistics have found correlations and patterns in personal attributes and behavior, but efforts to understand the broader heterogeneity in human behavior have not yet integrated these approaches and perspectives with a cohesive methodology. Here we extract patterns in behavior and relate those patterns together in a high-dimensional picture. We use dimension reduction to analyze word usage in text data from the online discussion platform Reddit. We find that pronouns can be used to characterize the space of the two most prominent dimensions that capture the greatest differences in word usage, even though pronouns were not included in the determination of those dimensions. These patterns overlap with patterns of topics of discussion to reveal relationships between pronouns and topics that can describe the user population. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence · Complex Network Analysis Techniques · Misinformation and Its Impacts
