Sketch-based Influence Maximization and Computation: Scaling up with Guarantees
Edith Cohen, Daniel Delling, Thomas Pajor, Renato F. Werneck

TL;DR
This paper introduces a scalable sketch-based algorithm for influence maximization in large networks, achieving significant speedups while maintaining theoretical guarantees and high practical influence estimation accuracy.
Contribution
The authors develop SKIM, a novel sketch-based greedy algorithm for influence maximization that scales to billion-edge graphs with guaranteed approximation ratios.
Findings
SKIM scales to graphs with billions of edges.
It achieves 10-100x speedup over existing greedy methods.
Influence oracles enable fast influence queries with linear preprocessing.
Abstract
Propagation of contagion through networks is a fundamental process. It is used to model the spread of information, influence, or a viral infection. Diffusion patterns can be specified by a probabilistic model, such as Independent Cascade (IC), or captured by a set of representative traces. Basic computational problems in the study of diffusion are influence queries (determining the potency of a specified seed set of nodes) and Influence Maximization (identifying the most influential seed set of a given size). Answering each influence query involves many edge traversals, and does not scale when there are many queries on very large graphs. The gold standard for Influence Maximization is the greedy algorithm, which iteratively adds to the seed set a node maximizing the marginal gain in influence. Greedy has a guaranteed approximation ratio of at least (1-1/e) and actually produces a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics
