Transcending the Attention Paradigm: Representation Learning from Geospatial Social Media Data
Nick DiSanto, Anthony Corso, Benjamin Sanders, Gavin Harding

TL;DR
This paper demonstrates that simple, location-specific word embeddings can uncover meaningful patterns in large-scale social media data, challenging the need for complex models in natural language understanding.
Contribution
It introduces a method for analyzing geospatial social media data using Bag-of-Word embeddings, revealing latent geographic influences without relying on advanced algorithms.
Findings
Geographic location significantly influences online communication patterns.
Simple embeddings can uncover hidden social and spatial insights.
Complex models are not always necessary for pattern recognition in noisy data.
Abstract
While transformers have pioneered attention-driven architectures as a cornerstone of language modeling, their dependence on explicitly contextual information underscores limitations in their abilities to tacitly learn overarching textual themes. This study challenges the heuristic paradigm of performance benchmarking by investigating social media data as a source of distributed patterns. In stark contrast to networks that rely on capturing complex long-term dependencies, models of online data inherently lack structure and are forced to detect latent structures in the aggregate. To properly represent these abstract relationships, this research dissects empirical social media corpora into their elemental components, analyzing over two billion tweets across population-dense locations. We create Bag-of-Word embedding specific to each city and compare their respective representations. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Human Mobility and Location-Based Analysis · Geographic Information Systems Studies
