Textiverse: A Scalable Visual Analytics System for Exploring Geotagged and Timestamped Text Corpora
Caroline Berger, Hanjun Xian, Krishna Madhavan, Niklas Elmqvist

TL;DR
Textiverse is a scalable system that visualizes large geotagged and timestamped text data on maps, enabling analysis of social media and reports through efficient data processing and interactive visualization.
Contribution
It introduces a scalable pipeline and visualization method for exploring massive geotagged textual data on maps, outperforming existing approaches in speed and scale.
Findings
Successfully visualized 489,151 NSF awards geographically.
Analyzed 1.2 million Twitter posts about Android.
Demonstrated real-time data processing and visualization capabilities.
Abstract
We propose Textiverse, a big data approach for mining geotagged timestamped textual data on a map, such as for Twitter feeds, crime reports, or restaurant reviews. We use a scalable data management pipeline that extracts keyphrases from online databases in parallel. We speed up this time-consuming step so that it outpaces the content creation rate of popular social media. The result is presented in a web-based interface that integrates with Google Maps to visualize textual content of massive scale. The visual design is based on aggregating spatial regions into discrete sites and rendering each such site as a circular tag cloud. To demonstrate the intended use of our technique, we first show how it can be used to characterize the U.S.\ National Science Foundation funding status based on all 489,151 awards. We then apply the same technique on visually representing a more spatially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Data Visualization and Analytics · Web Data Mining and Analysis
