Subgraph Frequencies: Mapping the Empirical and Extremal Geography of Large Graph Collections
Johan Ugander, Lars Backstrom, Jon Kleinberg

TL;DR
This paper introduces a coordinate system based on subgraph frequencies to analyze large collections of social graphs, revealing both theoretical constraints and empirical patterns, and demonstrating its utility in classification tasks.
Contribution
It develops a graph homomorphism-based framework for representing graph collections and characterizes empirical and extremal properties using models and linear programming.
Findings
Real social graphs lie near a one-dimensional curve in subgraph frequency space.
A simple stochastic model closely tracks the concentration of real social graphs.
The coordinate system can effectively distinguish graphs of different origins.
Abstract
A growing set of on-line applications are generating data that can be viewed as very large collections of small, dense social graphs -- these range from sets of social groups, events, or collaboration projects to the vast collection of graph neighborhoods in large social networks. A natural question is how to usefully define a domain-independent coordinate system for such a collection of graphs, so that the set of possible structures can be compactly represented and understood within a common space. In this work, we draw on the theory of graph homomorphisms to formulate and analyze such a representation, based on computing the frequencies of small induced subgraphs within each graph. We find that the space of subgraph frequencies is governed both by its combinatorial properties, based on extremal results that constrain all graphs, as well as by its empirical properties, manifested in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Advanced Graph Neural Networks
