VoG: Summarizing and Understanding Large Graphs

Danai Koutra; U Kang; Jilles Vreeken; Christos Faloutsos

arXiv:1406.3411·cs.SI·June 16, 2014

VoG: Summarizing and Understanding Large Graphs

Danai Koutra, U Kang, Jilles Vreeken, Christos Faloutsos

PDF

Open Access

TL;DR

VoG introduces a principled approach to succinctly summarize large graphs by constructing a vocabulary of common subgraph types and selecting the most informative ones based on the MDL principle, demonstrated on real-world graphs.

Contribution

It provides a formal encoding scheme, an efficient algorithm for graph summarization, and demonstrates effectiveness on large real-world graphs.

Findings

01

Effective summarization of multi-million-edge graphs

02

Demonstrated scalability on real datasets like Flickr and web graphs

03

Improved understanding of large graph structures

Abstract

How can we succinctly describe a million-node graph with a few simple sentences? How can we measure the "importance" of a set of discovered subgraphs in a large graph? These are exactly the problems we focus on. Our main ideas are to construct a "vocabulary" of subgraph-types that often occur in real graphs (e.g., stars, cliques, chains), and from a set of subgraphs, find the most succinct description of a graph in terms of this vocabulary. We measure success in a well-founded way by means of the Minimum Description Length (MDL) principle: a subgraph is included in the summary if it decreases the total description length of the graph. Our contributions are three-fold: (a) formulation: we provide a principled encoding scheme to choose vocabulary subgraphs; (b) algorithm: we develop \method, an efficient method to minimize the description cost, and (c) applicability: we report…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Topic Modeling