Wikipedia graph mining: dynamic structure of collective memory
Volodymyr Miz, Kirell Benzi, Benjamin Ricaud, Pierre Vandergheynst

TL;DR
This paper introduces a scalable, distributed graph-based model inspired by Hebbian learning to extract and analyze collective events and memories from Wikipedia visitor activity data, revealing dynamic patterns and evolving interests.
Contribution
It presents a novel distributed event extraction model for graph-structured data, specifically applied to Wikipedia, capturing collective activity patterns and their evolution over time.
Findings
The model effectively identifies meaningful event clusters.
It demonstrates scalability with data size and graph complexity.
The analysis reveals evolving collective interests and shared memories.
Abstract
Wikipedia is the biggest encyclopedia ever created and the fifth most visited website in the world. Tens of millions of people surf it every day, seeking answers to various questions. Collective user activity on its pages leaves publicly available footprints of human behavior, making Wikipedia an excellent source for analysis of collective behavior. In this work, we propose a distributed graph-based event extraction model, inspired by the Hebbian learning theory. The model exploits collective effect of the dynamics to discover events. We focus on data-streams with underlying graph structure and perform several large-scale experiments on the Wikipedia visitor activity data. We show that the presented model is scalable regarding time-series length and graph density, providing a distributed implementation of the proposed algorithm. We extract dynamical patterns of collective activity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration · Topic Modeling · Advanced Graph Neural Networks
