Densest Subgraph in Streaming and MapReduce
Bahman Bahmani, Ravi Kumar, Sergei Vassilvitskii

TL;DR
This paper introduces new streaming and MapReduce algorithms for finding dense subgraphs in large graphs, providing near-optimal solutions with proven guarantees and demonstrating scalability through extensive experiments.
Contribution
It presents novel algorithms for densest subgraph detection in streaming and parallel models with provable approximation guarantees.
Findings
Algorithms make O((log n)/log (1+epsilon)) passes
Guarantee within a factor 2(1+epsilon) of optimal density
Extensive experiments show scalability on real-world graphs
Abstract
The problem of finding locally dense components of a graph is an important primitive in data analysis, with wide-ranging applications from community mining to spam detection and the discovery of biological network modules. In this paper we present new algorithms for finding the densest subgraph in the streaming model. For any epsilon>0, our algorithms make O((log n)/log (1+epsilon)) passes over the input and find a subgraph whose density is guaranteed to be within a factor 2(1+epsilon) of the optimum. Our algorithms are also easily parallelizable and we illustrate this by realizing them in the MapReduce model. In addition we perform extensive experimental evaluation on massive real-world graphs showing the performance and scalability of our algorithms in practice.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Advanced Graph Neural Networks · Data Management and Algorithms
