Efficient Approximation Algorithms for Adaptive Seed Minimization

Jing Tang; Keke Huang; Xiaokui Xiao; Laks V.S. Lakshmanan; Xueyan; Tang; Aixin Sun; and Andrew Lim

arXiv:1907.09668·cs.SI·August 1, 2019

Efficient Approximation Algorithms for Adaptive Seed Minimization

Jing Tang, Keke Huang, Xiaokui Xiao, Laks V.S. Lakshmanan, Xueyan, Tang, Aixin Sun, and Andrew Lim

PDF

TL;DR

This paper introduces ASTI, an efficient adaptive seed minimization algorithm that selects seed nodes in multiple batches to influence a target number of users in social networks, with proven approximation guarantees.

Contribution

The paper presents the first adaptive seed minimization algorithm with provable approximation guarantees and practical efficiency, outperforming existing non-adaptive methods.

Findings

01

ASTI achieves near-optimal influence with fewer seed nodes.

02

ASTI runs in expected polynomial time, scalable to large networks.

03

Experimental results show ASTI outperforms competing algorithms in effectiveness and efficiency.

Abstract

As a dual problem of influence maximization, the seed minimization problem asks for the minimum number of seed nodes to influence a required number $η$ of users in a given social network $G$ . Existing algorithms for seed minimization mostly consider the non-adaptive setting, where all seed nodes are selected in one batch without observing how they may influence other users. In this paper, we study seed minimization in the adaptive setting, where the seed nodes are selected in several batches, such that the choice of a batch may exploit information about the actual influence of the previous batches. We propose a novel algorithm, ASTI, which addresses the adaptive seed minimization problem in $O (\frac{η \cdot ( m + n )}{ε ^{2}} ln n)$ expected time and offers an approximation guarantee of $\frac{( l n η + 1 ) ^{2}}{( 1 - ( 1 - 1/ b ) ^{b} ) ( 1 - 1/ e ) ( 1 - ε )}$ in expectation,…

Tables3

Table 1. Table 1. Frequently used notations.

Notation	Description
$G = (V, E)$	a graph $G$ with node set $V$ and edge set $E$
$n, m$	the number of nodes and edges in $G$
$η$	the threshold for the targeted number of nodes to be activated
$I (S), 𝔼 [I (S)]$	the spread of a seed set $S$ and its expectation
$Γ (S), 𝔼 [Γ (S)]$	the truncated spread of $S$ and its expectation
$G_{i} = (V_{i}, E_{i})$	the $i$ -th residual graph, where $G_{1} = G$
$n_{i}, m_{i}$	the number of nodes and edges in $G_{i}$
$η_{i}$	the shortfall in activating $η$ nodes in the $i$ -th round, i.e., $η_{i} = η - (n - n_{i})$
$I (S ∣ S_{i - 1})$	the marginal spread of $S$ on top of $S_{i - 1}$ , i.e., the spread of $S$ in $G_{i}$
$Γ (S ∣ S_{i - 1})$	the marginal truncated spread of $S$ on top of $S_{i - 1}$ , i.e., $Γ (S ∣ S_{i - 1}) = \min {I (S ∣ S_{i - 1}), η_{i}}$
$\tilde{Γ} (S ∣ S_{i - 1})$	a binary estimator with value $η_{i}$ if $S \cap R \neq \emptyset$ and $0$ otherwise
$R, ℛ$	a random mRR-set and a set of mRR-sets
$Λ_{ℛ} (v)$	the number of mRR-sets in $ℛ$ covered by $v$
$v^{*}, v^{⋄}, v^{\circ}$	the optimal node maximizing $Λ_{ℛ} (v)$ , $𝔼 [\tilde{Γ} (v ∣ S_{i - 1})]$ , and $𝔼 [Γ (v ∣ S_{i - 1})]$ , respectively
${OPT}_{i}$	the optimum of $𝔼 [\tilde{Γ} (v ∣ S_{i - 1})]$ , i.e., ${OPT}_{i} = \max_{v} 𝔼 [\tilde{Γ} (v ∣ S_{i - 1})]$
$ϕ, Φ, Ω$	a specific realization, a random realization, and the realization space
$π, π^{*}$	a random policy, and an optimal policy

Table 2. Table 2. Dataset details. ( K = 𝟏𝟎 𝟑 , M = 𝟏𝟎 𝟔 formulae-sequence K superscript 10 3 M superscript 10 6 \boldsymbol{\textrm{K}=10^{3},\textrm{M}=10^{6}} )

Dataset	$𝒏$	$𝒎$	Type	Avg. deg.	LWCC size
NetHEPT	15.2K	31.4K	undirected	4.18	6.80K
Epinions	132K	841K	directed	13.4	119K
Youtube	1.13M	2.99M	undirected	5.29	1.13M
LiveJournal	4.85M	69.0M	directed	28.5	4.84M

Table 3. Table 3. Improvement ratio of ASTI over ATEUC

	$η / n$	0.01	0.05	0.1	0.15	0.2
IC Model	NetHEPT	N/A	40.8%	43.8%	43.0%	43.7%
	Epinions	N/A	N/A	50.7%	N/A	65.7%
	Youtube	0.0%	24.3%	N/A	37.5%	41.7%
	LiveJournal	N/A	43.0%	34.9%	N/A	33.0%
LT Model	NetHEPT	N/A	N/A	N/A	44.3%	47.5%
	Epinions	N/A	N/A	N/A	N/A	N/A
	Youtube	0.0%	39.5%	54.1%	N/A	47.9%
	LiveJournal	N/A	N/A	N/A	N/A	N/A

Equations180

E [I (S)] := E_{Φ \sim Ω} [I_{Φ} (S)] = ϕ \in Ω \sum I_{ϕ} (S) \cdot p (ϕ),

E [I (S)] := E_{Φ \sim Ω} [I_{Φ} (S)] = ϕ \in Ω \sum I_{ϕ} (S) \cdot p (ϕ),

π min E [∣ S (π, ϕ)∣] subject to I_{ϕ} (S (π, ϕ)) \geq η for all ϕ,

π min E [∣ S (π, ϕ)∣] subject to I_{ϕ} (S (π, ϕ)) \geq η for all ϕ,

Γ_{ϕ} (S) := min {I_{ϕ} (S), η} .

Γ_{ϕ} (S) := min {I_{ϕ} (S), η} .

I_{ϕ} (S ∣ S_{i - 1}) := I_{ϕ} (S \cup S_{i - 1}) - I_{ϕ} (S_{i - 1}),

I_{ϕ} (S ∣ S_{i - 1}) := I_{ϕ} (S \cup S_{i - 1}) - I_{ϕ} (S_{i - 1}),

Γ_{ϕ} (S ∣ S_{i - 1}) := Γ_{ϕ} (S \cup S_{i - 1}) - Γ_{ϕ} (S_{i - 1}) .

Γ_{ϕ} (S ∣ S_{i - 1})

Γ_{ϕ} (S ∣ S_{i - 1})

= min {I_{ϕ} (S ∣ S_{i - 1}), η_{i}} .

Δ (v ∣ S_{i - 1}) := E_{Φ \sim Ω_{i}} [Γ_{Φ} (v ∣ S_{i - 1})] .

Δ (v ∣ S_{i - 1}) := E_{Φ \sim Ω_{i}} [Γ_{Φ} (v ∣ S_{i - 1})] .

α^{⊥} E [I (v ∣ S)] \leq E [\tilde{I} (v ∣ S)] \leq α^{⊤} E [I (v ∣ S)],

α^{⊥} E [I (v ∣ S)] \leq E [\tilde{I} (v ∣ S)] \leq α^{⊤} E [I (v ∣ S)],

Δ (s_{i} ∣ S_{i - 1}) \geq α Δ (v ∣ S_{i - 1}) .

Δ (s_{i} ∣ S_{i - 1}) \geq α Δ (v ∣ S_{i - 1}) .

E [I (S)] = n \cdot Pr [R \cap S \neq = \emptyset] .

E [I (S)] = n \cdot Pr [R \cap S \neq = \emptyset] .

η \cdot Pr [R \cap S \neq = \emptyset] = \frac{η}{n} \cdot E [I (S)] = \frac{η}{n} ϕ \in Ω \sum I_{ϕ} (S) \cdot p (ϕ) .

η \cdot Pr [R \cap S \neq = \emptyset] = \frac{η}{n} \cdot E [I (S)] = \frac{η}{n} ϕ \in Ω \sum I_{ϕ} (S) \cdot p (ϕ) .

E [Γ (S)] = ϕ \in Ω \sum Γ_{ϕ} (S) \cdot p (ϕ) .

E [Γ (S)] = ϕ \in Ω \sum Γ_{ϕ} (S) \cdot p (ϕ) .

\frac{η}{n} \cdot I_{ϕ} (S) < min {I_{ϕ} (S), η} = Γ_{ϕ} (S) .

\frac{η}{n} \cdot I_{ϕ} (S) < min {I_{ϕ} (S), η} = Γ_{ϕ} (S) .

(1 - 1/ e) E [Γ (S)] \leq E [\tilde{Γ} (S)] \leq E [Γ (S)] .

(1 - 1/ e) E [Γ (S)] \leq E [\tilde{Γ} (S)] \leq E [Γ (S)] .

(1 - 1/ e) E [Γ (S ∣ S_{i - 1})] \leq E [\tilde{Γ} (S ∣ S_{i - 1})] \leq E [Γ (S ∣ S_{i - 1})] .

(1 - 1/ e) E [Γ (S ∣ S_{i - 1})] \leq E [\tilde{Γ} (S ∣ S_{i - 1})] \leq E [Γ (S ∣ S_{i - 1})] .

\frac{E [ Γ ( S ∣ S _{i - 1} )]}{E [ Γ ( S ^{'} ∣ S _{i - 1} )]} \geq (1 - 1/ e) \frac{E [ Γ ~ ( S ∣ S _{i - 1} )]}{E [ Γ ~ ( S ^{'} ∣ S _{i - 1} )]} .

\frac{E [ Γ ( S ∣ S _{i - 1} )]}{E [ Γ ( S ^{'} ∣ S _{i - 1} )]} \geq (1 - 1/ e) \frac{E [ Γ ~ ( S ∣ S _{i - 1} )]}{E [ Γ ~ ( S ^{'} ∣ S _{i - 1} )]} .

\frac{η _{i} Λ ^{l} ( v ^{*} )}{∣ R ∣} \leq E [\tilde{Γ} (v^{*} ∣ S_{i - 1})] .

\frac{η _{i} Λ ^{l} ( v ^{*} )}{∣ R ∣} \leq E [\tilde{Γ} (v^{*} ∣ S_{i - 1})] .

\frac{η _{i} Λ ^{u} ( v ^{\circ} )}{∣ R ∣} \geq E [\tilde{Γ} (v^{\circ} ∣ S_{i - 1})] .

\frac{η _{i} Λ ^{u} ( v ^{\circ} )}{∣ R ∣} \geq E [\tilde{Γ} (v^{\circ} ∣ S_{i - 1})] .

\frac{Δ ( v ^{*} ∣ S _{i - 1} )}{Δ ( v ^{\circ} ∣ S _{i - 1} )} \geq (1 - 1/ e) \frac{E [ Γ ~ ( v ^{*} ∣ S _{i - 1} )]}{E [ Γ ~ ( v ^{\circ} ∣ S _{i - 1} )]} .

\frac{Δ ( v ^{*} ∣ S _{i - 1} )}{Δ ( v ^{\circ} ∣ S _{i - 1} )} \geq (1 - 1/ e) \frac{E [ Γ ~ ( v ^{*} ∣ S _{i - 1} )]}{E [ Γ ~ ( v ^{\circ} ∣ S _{i - 1} )]} .

Δ (v^{*} ∣ S_{i - 1}) \geq \frac{Λ ^{l} ( v ^{*} )}{Λ ^{u} ( v ^{\circ} )} \cdot (1 - 1/ e) \cdot Δ (v^{\circ} ∣ S_{i - 1}) .

Δ (v^{*} ∣ S_{i - 1}) \geq \frac{Λ ^{l} ( v ^{*} )}{Λ ^{u} ( v ^{\circ} )} \cdot (1 - 1/ e) \cdot Δ (v^{\circ} ∣ S_{i - 1}) .

\displaystyle\Pr[\bar{X}>\mathbb{E}[\bar{X}]+\lambda]\leq\exp\Big{(}-\frac{\lambda^{2}{T}}{2\mathbb{E}[\bar{X}]+2\lambda/3}\Big{)},

\displaystyle\Pr[\bar{X}>\mathbb{E}[\bar{X}]+\lambda]\leq\exp\Big{(}-\frac{\lambda^{2}{T}}{2\mathbb{E}[\bar{X}]+2\lambda/3}\Big{)},

\displaystyle\Pr[\bar{X}<\mathbb{E}[\bar{X}]-\lambda]\leq\exp\Big{(}-\frac{\lambda^{2}{T}}{2\mathbb{E}[\bar{X}]}\Big{)}.

\displaystyle\Pr\Big{[}\mathbb{E}[\bar{X}]\cdot T<\Big{(}\sqrt{\bar{X}T+\tfrac{2\lambda}{9}}-\sqrt{\tfrac{\lambda}{2}}\Big{)}^{2}-\tfrac{\lambda}{18}\Big{]}\leq{\mathrm{e}}^{-\lambda},

\displaystyle\Pr\Big{[}\mathbb{E}[\bar{X}]\cdot T<\Big{(}\sqrt{\bar{X}T+\tfrac{2\lambda}{9}}-\sqrt{\tfrac{\lambda}{2}}\Big{)}^{2}-\tfrac{\lambda}{18}\Big{]}\leq{\mathrm{e}}^{-\lambda},

\displaystyle\Pr\Big{[}\mathbb{E}[\bar{X}]\cdot T>\Big{(}\sqrt{\bar{X}T+\tfrac{\lambda}{2}}+\sqrt{\tfrac{\lambda}{2}}\Big{)}^{2}\Big{]}\leq{\mathrm{e}}^{-\lambda}.

Γ_{ϕ} (S_{i - 1}) = Γ_{ϕ} (V) if and only if Γ_{ϕ^{'}} (S_{i - 1}) = Γ_{ϕ^{'}} (V),

Γ_{ϕ} (S_{i - 1}) = Γ_{ϕ} (V) if and only if Γ_{ϕ^{'}} (S_{i - 1}) = Γ_{ϕ^{'}} (V),

Γ_{ϕ} (v ∣ S_{i - 1}) \geq 0,

Δ (v ∣ S_{j - 1}) \geq Δ (v ∣ S_{i - 1}),

Δ (v ∣ S_{j - 1}; S_{i - 1}) \geq Δ (v ∣ S_{i - 1}),

ϕ \in Ω_{j} (ϕ_{i}) \sum p (ϕ) = p (ϕ_{i}) .

ϕ \in Ω_{j} (ϕ_{i}) \sum p (ϕ) = p (ϕ_{i}) .

Γ_{ϕ} (v ∣ S_{i - 1}) = min {∣ V_{ϕ} (v ∣ S_{i - 1})∣, η_{i}} .

Γ_{ϕ} (v ∣ S_{i - 1}) = min {∣ V_{ϕ} (v ∣ S_{i - 1})∣, η_{i}} .

Γ_{ϕ} (v ∣ S_{j - 1}) = min {∣ V_{ϕ} (v ∣ S_{j - 1})∣, η_{j}} \geq Γ_{ϕ} (v ∣ S_{i - 1}),

Γ_{ϕ} (v ∣ S_{j - 1}) = min {∣ V_{ϕ} (v ∣ S_{j - 1})∣, η_{j}} \geq Γ_{ϕ} (v ∣ S_{i - 1}),

Δ (v ∣ S_{j - 1})

Δ (v ∣ S_{j - 1})

\geq ϕ \in Ω_{j} \sum Γ_{ϕ} (v ∣ S_{i - 1}) \cdot p (ϕ)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Efficient Approximation Algorithms for Adaptive Seed Minimization

Jing Tang

0000-0002-0785-707X

Dept. of Ind. Syst. Engg. and Mgmt.National University of Singapore

[email protected]

,

Keke Huang

School of Comp. Sci. and Engg.Nanyang Technological University

[email protected]

,

Xiaokui Xiao

School of ComputingNational University of Singapore

[email protected]

,

Laks V.S. Lakshmanan

Department of Computer ScienceUniversity of British Columbia

[email protected]

,

Xueyan Tang

School of Computer Science and EngineeringNanyang Technological University

[email protected]

,

Aixin Sun

School of Computer Science and EngineeringNanyang Technological University

[email protected]

and

Andrew Lim

Dept. of Ind. Syst. Engg. and Mgmt.National University of Singapore

[email protected]

Abstract.

As a dual problem of influence maximization, the seed minimization problem asks for the minimum number of seed nodes to influence a required number $\eta$ of users in a given social network $G$ . Existing algorithms for seed minimization mostly consider the non-adaptive setting, where all seed nodes are selected in one batch without observing how they may influence other users.

In this paper, we study seed minimization in the adaptive setting, where the seed nodes are selected in several batches, such that the choice of a batch may exploit information about the actual influence of the previous batches. We propose a novel algorithm, ASTI, which addresses the adaptive seed minimization problem in $O\Big{(}\frac{\eta\cdot(m+n)}{\varepsilon^{2}}\ln n\Big{)}$ expected time and offers an approximation guarantee of $\frac{(\ln\eta+1)^{2}}{(1-(1-1/b)^{b})(1-1/{\mathrm{e}})(1-\varepsilon)}$ in expectation, where $\eta$ is the targeted number of influenced nodes, $b$ is size of each seed node batch, and $\varepsilon\in(0,1)$ is a user-specified parameter. To the best of our knowledge, ASTI is the first algorithm that provides such an approximation guarantee without incurring prohibitive computation overhead. With extensive experiments on a variety of datasets, we demonstrate the effectiveness and efficiency of ASTI over competing methods.

Seed Minimization; Sampling; Approximation Algorithm

††ccs: Information systems Data mining††ccs: Information systems Social advertising††ccs: Information systems Social networks††ccs: Theory of computation Probabilistic computation††ccs: Theory of computation Submodular optimization and polymatroids

1. Introduction

Social networks are becoming increasingly popular for people to discuss and share their thoughts and comments towards public topics. Based on the established relations among individuals, ideas and opinions can be spread over social networks via a word-of-mouth effect. To exploit this effect for advertising, advertisers often provide free samples of their products to selected social network users, in exchange for those users to promote those products and create a cascade of influence to other users. In such a setting, advertisers might want to know the minimum number of free samples required to be given away, so as to draw sufficient attention. Goyal et al. (Goyal et al., 2013) are the first to formulate this problem as a seed minimization problem, which asks for the minimum number of seed nodes (i.e., users who receive free samples) needed to influence at least a required number $\eta$ of users, taking into account the randomness in the influence propagation process.

Existing work on seed minimization mostly focuses on the non-adaptive setting (Goyal et al., 2013; Zhang et al., 2014; Han et al., 2017), which requires that all seed nodes should be selected in one batch without observing the actual influence of any node, i.e., no randomness in the influence propagation process can be removed until all seed nodes are fixed. As a consequence of the non-adaptiveness, these solutions may return a seed set that fails to influence at least $\eta$ nodes in the actual propagation process, or may select an excessive number of seed nodes that generate an actual influence spread much larger than required.

To address the above issues, Vaswani and Lakshmanan (Vaswani and Lakshmanan, 2016) propose to consider seed minimization under the adaptive setting, where (i) the seed nodes are selected one by one, and (ii) before selecting the $i$ -th seed node, the actual influence of the first $i-1$ seed nodes can be observed, i.e., we may optimize the choice of the $i$ -th seed node to influence those users that have not been influenced by the previous $i-1$ seed nodes. Such an adaptive strategy ensures that (i) the seed set returned always achieves the required number of influenced users (since the actual influence of each seed node is known after selection), and (ii) the number of seed nodes would not be excessive (because we can stop selecting seed nodes as soon as the targeted influence is achieved). We note that similar adaptive approaches have also been adopted by other practical problems, such as influence maximization (Yadav et al., 2016), sensor placement (Asadpour et al., 2008), active learning (Chen and Krause, 2013), and object detection (Chen et al., 2014).

To our knowledge, the only existing solution for adaptive seed minimization is by Vaswani and Lakshmanan (Vaswani and Lakshmanan, 2016). As we discuss in Section 2.4, however, the solution in (Vaswani and Lakshmanan, 2016) requires that the expected influence of any seed set should be estimated with extremely high accuracy, which results in prohibitive computation overhead. Furthermore, the solution does not provide any non-trivial approximation guarantee, due to an ineffective approach used to select each seed node under the adaptive setting. Therefore, it remains an open problem to devise efficient approximation algorithms for adaptive seed minimization.

In this paper, we address the above open problem with ASTI, a novel framework tailored for adaptive seed minimization. The key idea of ASTI is to adaptively choose the seed node with the maximum expected truncated influence spread in each round of seed selection. Specifically, given a diffusion model $M$ that captures the uncertainty of influence propagation in $G$ , we consider the set $\Omega$ of all possible realizations, each of which represents a possible scenario of influence propagation among the nodes in $G$ . For each possible realization $\phi\in\Omega$ , the influence spread of a seed set $S$ , denoted as $I_{\phi}(S)$ is the number of nodes influenced by $S$ , while the truncated influence spread of $S$ is defined as $\Gamma_{\phi}(S)=\min\{\eta,I_{\phi}(S)\}$ . We consider $\Gamma_{\phi}(S)$ instead of $I_{\phi}(S)$ because, intuitively, the extra influence spread beyond $\eta$ is useless for fulfilling the requirement on influence. (In fact, as we show in Section 2.4, the extra influence spread may even lead to incorrect choice of seed nodes, and hence, it has to be ignored.)

When developing algorithms under the ASTI framework, the key challenge that we face is the design of methods to accurately estimate a seed set $S$ ’s expected truncated influence spread over a given set of possible realizations. We show that existing methods (Huang et al., 2017; Nguyen et al., 2016; Tang et al., 2015, 2018b, 2014; Borgs et al., 2014) for estimating un-truncated influence spread cannot be applied in our truncated setting, since they are unable to take into account the effect of truncation by $\eta$ . Motivated by this, we propose a novel sampling method based on the concept of multi-root reverse reachable (mRR) sets, and prove that our method provides non-trivial guarantees in terms of the efficiency and accuracy of truncated influence estimation. Building upon this sampling method, we develop TRIM, an algorithm for maximizing truncated influence spread with a provable approximation guarantee of $(1-1/{\mathrm{e}})(1-\varepsilon)$ . We show that instantiating ASTI using TRIM leads to strong theoretical guarantees for adaptive seed minimization, and TRIM can be extended into a batched version TRIM-B that selects a batch of $b$ nodes in each round, so as to accelerate seed selection.

In summary, we make the following contributions:

•

ASTI**, a general framework.** We analyze the characteristics of adaptive seed minimization, based on which we propose a general framework ASTI tailored for the problem.

•

mRR-set, a novel sampling method. ASTI requires accurate estimation of truncated influence spreads, for which the existing sampling methods are either inefficient or ineffective. To address this challenge, we propose a novel sampling method, mRR, which is able to estimate the truncated influence spread in a cost-effective manner.

•

TRIM**, an efficient algorithm for truncated influence maximization.** A key step of ASTI is to identify a set of nodes with the maximum expected truncated influence spread, for which we propose the TRIM algorithm based on mRR-sets. With a rigorous theoretical analysis, we show that ASTI instantiated by TRIM returns a $\frac{(\ln\eta+1)^{2}}{(1-1/{\mathrm{e}})(1-\varepsilon)}$ -approximate solution for adaptive seed minimization with expected time complexity of $O\big{(}\frac{\eta\cdot(m+n)}{\varepsilon^{2}}\ln n\big{)}$ .

•

TRIM-B**, the batched version of TRIM.** For further performance gain, we extend TRIM into a batched version TRIM-B that selects seed nodes in a predefined batch size $b$ in each round. ASTI instantiated by TRIM-B provides an approximation guarantee of $\frac{(\ln\eta+1)^{2}}{(1-(1-1/b)^{b})(1-1/{\mathrm{e}})(1-\varepsilon)}$ with the same time complexity as TRIM.

•

An extensive set of experiments. We experimentally evaluate ASTI instantiated by TRIM and TRIM-B against the state-of-the-art non-adaptive algorithm ATEUC (Han et al., 2017), and show that (i) our solutions are much more effective in minimizing the number of seed nodes needed and ensuring that the required influence spread is achieved, and (ii) our solutions are able to efficiently handle social networks with millions of nodes and edges.

2. Preliminaries

This section formally defines the problem of adaptive seed minimization, and reviews the existing solutions. Table 1 summarizes the notations that are frequently used. For ease of exposition, our discussions focus on the independent cascade (IC) model (Kempe et al., 2003), which is one of the most widely adopted propagation models in the literature. But we note that our algorithms can be easily extended to other propagation models, such as the linear threshold model (Kempe et al., 2003) and the topic-aware models (Barbieri et al., 2012).

2.1. Influence Propagation and Realization

Let $G$ be a social network with a node set $V$ and a directed edge set $E$ , where $|V|=n$ and $|E|=m$ . For any edge $\langle u,v\rangle\in E$ , we refer to $u$ as an incoming neighbor of $v$ , and $v$ as an outgoing neighbor of $u$ . Each edge $e=\langle u,v\rangle$ is associated with a propagation probability $p(e)\in(0,1]$ . We refer to such a social network as a probabilistic social network.

Given a node set $S\subseteq V$ , the influence propagation initiated by $S$ under the independent cascade (IC) model (Kempe et al., 2003) is modeled as a discrete-time stochastic process as follows. At time slot $t_{0}$ (the subscript indicates the index of the time slot), all nodes in $S$ are activated while all other nodes are inactive. Suppose that node $u$ is first activated at slot $t_{i}$ , then $u$ has one chance to activate each outgoing neighbor $v$ with the probability $p(u,v)$ at slot $t_{i+1}$ , after which $u$ remains active. This influence propagation process continues until no more inactive nodes can be activated. As to the linear threshold (LT) model, it demands that for each node $v\in G$ , the propagation probabilities of all edges ending at $v$ sum up to no more than $1$ . With a given node set $S$ , LT model works in a similar discrete-time stochastic procedure as follows. At time slot $t_{0}$ , each node $v\in G$ is assigned with a threshold $\lambda_{v}$ sampled uniformly from $[0,1]$ , and only nodes in $S$ are activated. At time slot $t_{i}$ , we check all inactive node $u$ of its incoming edges from activated neighbors that if the sum of their propagation probabilities is no smaller than $\lambda_{u}$ . If it is, then $u$ is activated; otherwise $u$ remains inactive. This influence propagation process terminates once there is no further node activated. Let $I(S)$ be the total number of active nodes in $G$ when the influence propagation terminates. We refer to $S$ as the seed set, and $I(S)$ as the spread of $S$ .

Alternatively, the influence propagation process can also be described by the live edge procedure (Kempe et al., 2003). Specifically, for each edge $e\in E$ , we independently flip a coin of head probability $p(e)$ to decide whether the edge $e$ is live or blocked to generate a sample of influence propagation. All the blocked edges are removed and the remaining graph is referred to as a realization of the probabilistic social network $G$ , denoted as $\phi$ . Note that there are $2^{m}$ distinct possible realizations. Let $\Omega$ be the set of all possible realizations (i.e., the sample space) such that $\lvert\Omega\rvert=2^{m}$ , and $\Phi\sim\Omega$ denote that $\Phi$ is a realization randomly sampled from $\Omega$ . Given a realization $\phi\in\Omega$ , the spread of any seed set $S\subseteq V$ under $\phi$ is the total number of nodes that are reachable from $S$ , denoted as $I_{\phi}(S)$ . Thus, for any seed set $S$ , its expected spread $\mathbb{E}[I(S)]$ is defined as

[TABLE]

where $p(\phi)$ is the probability for realization $\phi$ to occur. In other words, the expected spread of $S$ is the (weighted) average spread over all the realizations in $\Omega$ .

2.2. Adaptive Seed Minimization

Given a probabilistic social network $G=(V,E)$ and a threshold $\eta\in[1,n]$ , the seed minimization problem aims to select a minimum number of seed nodes to influence at least $\eta$ nodes. In the conventional “non-adaptive” setting, seed minimization requires selecting a node set $S$ such that $\mathbb{E}[I(S)]\geq\eta$ , without any knowledge of realization that would occur in the actual influence propagation process. As a consequence, the selected $S$ may influence fewer than $\eta$ nodes for some realizations or much more than $\eta$ nodes for some other realizations, both of which are undesirable scenarios.

Meanwhile, the adaptive strategy (i.e., a recursive select-observe-select procedure) has been shown to be more effective than the non-adaptive (i.e., just select based on model) strategy in many real-world applications (Asadpour et al., 2008; Chen and Krause, 2013; Chen et al., 2014). Specifically, an adaptive strategy first selects a node $u$ from graph $G$ , and then observes the set of nodes activated by choosing node $u$ as a seed node. Based on this observation, the strategy would choose the next node as one that could influence as many currently inactive nodes as possible. This procedure is carried out in an recursive manner, until at least $\eta$ active nodes are observed.

Figure 1 illustrates the adaptive strategy. Figures 1(a) and 1(b) show a social graph $G$ and one possible realization $\phi$ of $G$ , respectively. Let $\eta=4$ and $\phi$ be the actual realization of influence propagation (which is unknown apriori). Figure 1(c) indicates that we first select node $v_{1}$ (in dark gray) as a seed node. Note that node $v_{1}$ influences nodes $v_{4}$ and $v_{6}$ (in light gray), with each bold (resp. dashed) arrow denoting a successful (resp. failed) step of influence. In addition, the thin arrows in Figures 1(c)–1(d) correspond to influence attempts which are not yet revealed. Since the number of nodes influenced by $v_{1}$ is less than $\eta$ , we continue to select the second seed node. Figure 1(d) shows that we select $v_{3}$ , which results in a total of $5$ active nodes, reaching the threshold $\eta$ . Then, the adaptive seed selection process terminates.

In this paper, we aim to study seed selection strategies (referred to as policies) for adaptive seed minimization (ASM), which is formally defined as follows:

Definition 2.1 (Adaptive Seed Minimization).

Given a probabilistic social graph $G=(V,E)$ and a threshold $\eta\in[1,n]$ , the adaptive seed minimization problem aims to identify a policy $\pi$ that minimizes the expected number of seed nodes required to achieve an influence spread of at least $\eta$ on possible realizations $\phi\in\Omega$ , i.e.,

[TABLE]

where $S(\pi,\phi)$ is the seed set selected by $\pi$ under realization $\phi$ and $\mathbb{E}[\lvert S(\pi,\phi)\rvert]=\sum_{\phi\in\Omega}\lvert S(\pi,\phi)\rvert\cdot p(\phi)$ .

Note that when the propagation probability of every edge in $G$ is $1$ , ASM reduces to the deterministic version of seed minimization, which is shown to be NP-hard (Goyal et al., 2013). Therefore, finding an optimal policy for ASM is also NP-hard.

2.3. Truncated Influence Spread

Note that, in ASM, the influence spread in excess of the threshold $\eta$ has no value. Accordingly, we introduce the notion of truncated influence spread as follows.

Definition 2.2 (Truncated Influence Spread).

Given a seed set $S$ and a threshold $\eta$ , the truncated influence spread $\Gamma_{\phi}(S)$ of $S$ under a realization $\phi$ is the smaller one between $I_{\phi}(S)$ and $\eta$ , i.e.,

[TABLE]

Recall that ASM requires considering the influence spreads of nodes when the actual influence of some other nodes has been observed. Therefore, we also introduce the notion of marginal truncated influence spread as follows. Let $V_{1}=V$ and $G_{1}=G$ . Let $V_{i}$ be the subset of nodes that remain inactive after round $(i-1)$ , $G_{i}$ be the subgraph of $G$ induced by $V_{i}$ . We refer to $G_{i}$ as the $i$ -th residual graph. For example, in Figure 1, after round $1$ , only nodes $v_{2},v_{3},v_{5}$ remain inactive, so $V_{2}=\{v_{2},v_{3},v_{5}\}$ and $G_{2}=(V_{2},E_{2})$ denotes the induced subgraph containing the thin edge $\langle v_{3},v_{5}\rangle$ .

Let $S_{i}$ be the set of nodes selected as seeds by a policy in the first $i$ rounds. Similar to the definition of $\Omega$ , we denote $\Omega_{i}$ as the set of all possible realizations in the $i$ -th round. Then, for a node set $S\subseteq V_{i}$ , we define the marginal spread $I_{\phi}(S\mid S_{i-1})$ as the additional spread that $S$ provides on top of $S_{i-1}$ under realization $\phi\in\Omega_{i}$ , and define truncated marginal spread $\Gamma_{\phi}(S\mid S_{i-1})$ accordingly, i.e.,

[TABLE]

Note that $I_{\phi}(S\mid S_{i-1})$ is exactly the influence spread of $S$ in the residual graph $G_{i}$ under realization $\phi$ .

Let $n_{i}=\lvert V_{i}\rvert$ be the number of nodes in $G_{i}$ , i.e., $I_{\phi}(S_{i-1})=n-n_{i}$ nodes have been activated by the end of round $i-1$ , based on the partial realization revealed so far. Define $\eta_{i}=\eta-(n-n_{i})$ . This is the amount by which the policy falls short of the target $\eta$ in the beginning of round $i$ . Before reaching the threshold $\eta$ , i.e., $\Gamma_{\phi}(S_{i-1})=I_{\phi}(S_{i-1})<\eta$ , we can rewrite $\Gamma_{\phi}(S\mid S_{i-1})$ as

[TABLE]

Then, $\Gamma_{\phi}(S\mid S_{i-1})$ can be easily computed in the residual graph $G_{i}$ . For brevity, we define $\Gamma_{\phi}(v\mid S_{i-1}):=\Gamma_{\phi}(\{v\}\mid S_{i-1})$ for a singleton node set $\{v\}$ .

Finally, we define the expected marginal truncated spread $\Delta(v\mid S_{i-1})$ as

[TABLE]

In other words, the expected marginal truncated spread of a node $v$ is defined based on the “lift” in the expected number of active nodes that $v$ brings on top of previously selected seeds, over all realizations consistent with what has been observed in previous rounds.

2.4. Existing Solutions

Golovin and Krause (Golovin and Krause, 2017) study the adaptive stochastic minimum cost coverage problem, which can be regarded as a variant of ASM in the case where there exists an oracle that accurately reports the expected marginal truncated spread for any given seed set. They propose to adopt a greed policy as follows. First, select the node $s_{1}$ with the largest expected truncated spread, i.e., $\Delta(s_{1}\mid S_{0})\geq\Delta(v\mid S_{0})$ for all $v\in V$ . Then, observe the actual nodes that are activated by $s_{1}$ during the stochastic process, and remove them from $G$ to induce the residual graph $G_{2}$ . After that, identify the node $s_{2}$ with the maximum expected marginal truncated spread $\Delta(s_{2}\mid S_{1})$ in the residual graph $G_{2}$ . This process continues, such that each round selects the node with the largest expected marginal truncated spread, until we observe that no less than $\eta$ nodes have been influenced.

Golovin and Krause (Golovin and Krause, 2017) show that the above greedy policy returns a $(\ln\eta+1)^{2}$ -approximate solution to the optimum.111Golovin and Krause claim that the approximation guarantee is $(\ln\eta+1)$ in an earlier version of their work (Golovin and Krause, 2011), but point out that the proof has gaps in a revised version (Golovin and Krause, 2017). Whether the logarithmic bound holds is an interesting open problem. This approximation guarantee, however, does not lead to a practical algorithm for the ASM problem, because (i) it requires the help from an oracle to exactly identify the node with the maximum expected marginal truncated spread in each round, but (ii) computing the exact expected spread of any node set is #P-hard (Chen et al., 2010a).

Motivated by this observation, Vaswani and Lakshmanan (Vaswani and Lakshmanan, 2016) attempt to extend Golovin and Krause’s method by replacing the oracle with an spread estimator with bounded errors. In particular, they assume that for any node set $S$ , the estimation ${\mathbb{E}}[\tilde{I}(v\mid S)]$ of the marginal gain $\mathbb{E}[I(v\mid S)]:=\mathbb{E}[I(S\cup\{v\})]-\mathbb{E}[I(S)]$ should satisfy

[TABLE]

where ${\alpha^{\top}}/{\alpha^{\bot}}$ denotes the multiplicative error in calculating the marginal gains. Unfortunately, this requirement on the spread estimation is so stringent that no existing methods for influence estimation could fulfill the requirement without incurring prohibitive estimation overhead. To explain, suppose that the expected marginal spread $\mathbb{E}[I(v\mid S)]$ of a node $v$ on top of $S$ is small. In that case, Equation (7) would only allow a trivial amount of estimation error, which is rather difficult to achieve by existing methods for spread estimation.

In addition, the algorithm in (Vaswani and Lakshmanan, 2016) attempts to select the node with the largest marginal spread in each round, instead of the node with the maximum marginal truncated spread. As a consequence, even when there exists an efficient estimator that provides highly accurate spread estimation, the algorithm in (Vaswani and Lakshmanan, 2016) would still fail to achieve the type of approximation guarantee in (Golovin and Krause, 2017), which the theoretical analysis in (Golovin and Krause, 2017) is based on the notion of truncated spreads. We illustrate this issue with an example.

Example 2.3.

Consider Figure 2(a), which shows a social graph $G$ with four nodes and four directed edges. The number on each edge indicates the propagation probability of the edge. $G$ has four possible realizations $\phi_{1}$ , $\phi_{2}$ , $\phi_{3}$ , and $\phi_{4}$ in total, as shown in Figures 2(b)–2(e). Each realization has an equal probability of $0.25$ to happen. Assume that $\eta=2$ . Then, the expected spread of node $v_{1}$ is $\mathbb{E}[I(v_{1})]=0.25\times(3+3+4+1)=2.75$ , which is larger than that of the other three nodes. Thus, when the vanilla expected spread is adopted as the measure, node $v_{1}$ will be selected as the first seed node. On realizations $\phi_{1}$ , $\phi_{2}$ , and $\phi_{3}$ , $v_{1}$ is qualified to influence at least $\eta=2$ users. However, there is a probability of $0.25$ that $\phi_{4}$ happens, in which case $v_{1}$ can only influence itself, and hence, one additional seed node is required. Overall, $2\times 0.25+1\times(1-0.25)=1.25$ seed nodes are selected in expectation.

Now observe that the expected truncated spread of nodes $v_{1}$ , $v_{2}$ , $v_{3}$ , and $v_{4}$ are $1.75$ , $2$ , $2$ , and $1$ , respectively. Therefore, when the expected truncated spread is adopted as the measure, either $v_{2}$ or $v_{3}$ is selected as the first seed node, which can influence $2$ users under all four realizations. This demonstrates that, for ASM, choosing nodes based on expected truncated spreads is more effective than that based on vanilla expected spreads. $\square$

In recent work (Han et al., 2018), Han et al. study the problem of adaptive influence maximization, which also considers the adaptive setting, but aims to identify a predefined number of $k$ seed nodes that could influence the maximum number of users in $G$ in expectation. At the first glance, it may seem that we can modify the adaptive influence maximization algorithms to solve the adaptive seed minimization problem, in the same way that existing work (Goyal et al., 2013) transforms non-adaptive influence maximizing algorithms to address non-adaptive seed minimization. This approach, however, does not work because the algorithm in (Han et al., 2018) is designed based on vanilla expected marginal spreads. Instead, ASM requires considering truncated expected marginal spreads, as we previously discussed. As a consequence, the algorithm in (Han et al., 2018) cannot be adopted in our setting.

3. Our Solution

3.1. Algorithmic Framework

We propose a general framework, referred to as ASTI, to address the ASM problem. Algorithm 1 shows the details. Given a probabilistic social graph $G$ and a threshold $\eta$ , ASTI aims to return a seed set $S$ such that $\Gamma(S)\geq\eta$ , where $\Gamma(S)$ is the truncated influence spread of $S$ (i.e., the smaller one of the threshold $\eta$ and the number of active nodes influenced by $S$ ). In a nutshell, ASTI iteratively (i) selects the node to maximize the expected marginal truncated spread (Line 1), (ii) observes the newly influenced nodes (Line 1), and then (iii) updates the corresponding information (Lines 1–1). The process stops when at least $\eta$ nodes are activated (Line 1).

The key step of ASTI is truncated influence maximization that targets at identifying a node to maximize the expected marginal truncated spread (Line 1). If an $\alpha$ -approximate solution for truncated influence maximization is obtained in each round (Line 1), ASTI provides a non-trivial approximation guarantee, as shown in the following theorem.

Theorem 3.1.

Suppose $\pi$ is an $\alpha$ -approximate greedy policy, for some $\alpha\in(0,1]$ , i.e., for any $G_{i}$ and $v\in V_{i}$ , it selects a node $s_{i}$ satisfying

[TABLE]

Then $\pi$ achieves an approximation ratio of $\frac{(\ln\eta+1)^{2}}{\alpha}$ to the optimal adaptive seed minimization policy.

The proof222The formal proofs of all theoretical results are given in Appendix B. of Theorem 3.1 is based on adaptive submodular optimization (Golovin and Krause, 2017). Theorem 3.1 requires that the policy should be an $\alpha$ -approximate greedy one with respect to the expected marginal truncated spread $\Delta(v\mid S_{i-1})$ . The challenge for designing such an $\alpha$ -approximate greedy policy lies in how to develop a proper sampling method for estimating the truncated influence spread.

3.2. Truncated Influence Maximization

According to Theorem 3.1, in order to provide the theoretical guarantee, the algorithm is supposed to identify a node whose truncated marginal spread is an $\alpha$ -approximation to the maximum truncated marginal spread in each round. At a first glance, it seems that we can utilize Borgs et al.’s reverse influence sampling method (Borgs et al., 2014). Unfortunately, in what follows, we show that Borgs et al.’s sampling method (Borgs et al., 2014) fails to estimate the truncated influence spread accurately.

Specifically, Borgs et al. (Borgs et al., 2014) propose to generate random reverse reachable (RR) sets for influence maximization. Compared with the Monte-Carlo simulation (Kempe et al., 2003), RR-sets can dramatically accelerate the seed selection process while retaining the same approximation guarantees for influence maximization (Borgs et al., 2014). In particular, a random RR-set of $G$ is generated by first selecting a node $v\in V$ uniformly at random, and then taking the nodes that can reach $v$ in a random realization. Evidently, a random RR-set is a subgraph of the corresponding random realization $\Phi$ , which is generated by performing a reverse breadth first search (BFS) on $\Phi$ starting from the random node $v$ . A random RR-set $R$ is an unbiased spread estimator, i.e., for any seed set $S$ ,

[TABLE]

Unfortunately, RR-sets fail to estimate truncated influence spread accurately. Intuitively, the expectation of this estimator for truncated influence spread of $S$ is

[TABLE]

Recall that the true expected truncated influence spread is

[TABLE]

Obviously, for any $\phi\in\Omega$ , unless $I_{\phi}(S)=n$ ,

[TABLE]

Specifically, consider the case that $I_{\phi}(S)\leq\eta$ for all $\phi$ . Then, this estimator is biased with a discount ${\eta}/{n}$ , which is extremely inaccurate when $\eta\ll n$ . In practice, $\eta$ is likely to be a fraction of $n$ , since even a set of ten thousand seed nodes has been found to influence less than half population on many datasets (Nguyen et al., 2017). These facts indicate that RR-sets are highly biased for estimating truncated influence spread. As a consequence, the state-of-the-art algorithms (Huang et al., 2017; Nguyen et al., 2016; Tang et al., 2015, 2018b, 2014) for influence maximization that utilize RR-sets (Borgs et al., 2014) cannot provide theoretical guarantees for truncated influence maximization. In turn, this means that these algorithms cannot be fashioned to solve ASM with approximation guarantees. To address this issue, we propose a novel sampling approach that generates multi-root reverse reachable (mRR) sets which can estimate the truncated influence spread efficiently and effectively. The algorithm utilizing mRR-sets is referred to as TRIM 333TRuncated Influence Maximization.. We rigorously show that TRIM can provide strong theoretical guarantees for truncated influence maximization and thus ASTI instantiated with TRIM is guaranteed to approximate ASM within a constant ratio.

3.3. Multi-Root Reverse Reachable Set

If we generate $n$ correlated RR-sets such that (i) they start from $n$ distinct nodes, and (ii) the materialization of each edge is consistent in all the RR-sets, then merging these RR-sets (with duplicates removed) as well as the edge statuses forms a realization sample. Based on this observation, if we generate $k$ $(k<n)$ correlated RR-sets using the same rule, then merging them as a $k$ -root RR-set is likely to estimate the truncated influence spread more accurately compared against a vanilla RR-set. To explain how multi-root reverse reachable (mRR) set works, we first introduce its definition.

Definition 3.2 (Random mRR-set).

Let $\Phi$ be a random realization of $G$ sampled from the realization space and $K$ be a size- $k$ node set selected uniformly at random from $V$ . A random mRR-set is the set of nodes in $\Phi$ that can reach $K$ . (That is, for each node $v$ in the mRR-set, there is a directed path in $\Phi$ from $v$ to some node in $K$ .)

By definition, the key difference between an mRR-set and an RR-set is that the former has multiple roots whereas the latter has one single root only. Similar to the generation of RR-sets, a random mRR-set can be generated by:

(1)

Choose a set of $k$ nodes $K\subseteq V$ uniformly at random; 2. (2)

Perform a stochastic reverse breadth first search (BFS) that starts from $K$ and follows the incoming edges of each node. Insert into $R$ all nodes that are traversed during the stochastic BFS.

A natural question is how to decide the size of $k$ for truncated spread estimation? The setting of $k$ yields a tradeoff between efficiency and accuracy in that a larger $k$ provides more accurate estimation but takes more computational resources. Through the aforementioned analysis of RR-set, we find that the high-efficiency of RR-set comes from its “binary” property. In particular, a random RR-set $R$ estimates the influence spread of any node set $S$ as $n$ if $R\cap S\neq 0$ , and as [math] otherwise. To avoid maintaining the edge statuses, our mRR-set estimator shall retain this binary property. That is, it estimates the truncated influence spread of $S$ as $\eta$ if and only if $S$ intersects this mRR-set, and as [math] otherwise. For a given $k$ -RR-set $R$ , if a node $v\in R$ , then $v$ can reach at least one of the $k$ starting nodes. Then, its influence spread is estimated to be at least $n/k$ and thus its estimated truncated influence spread is at least $\min\{n/k,\eta\}$ . By setting $n/k\geq\eta$ , the estimated truncated influence spread is $\eta$ .

On the other hand, to improve the accuracy, $k$ should be set as large as possible. So we choose $k=n/\eta$ . However, ${n}/{\eta}$ is not an integer in general. To address this issue, we adopt a randomized rounding approach. To generate a mRR-set, we randomly choose a set $K$ of nodes such that its size $k$ equals $\lfloor\frac{n}{\eta}\rfloor+1$ with probability $\frac{n}{\eta}-\lfloor\frac{n}{\eta}\rfloor$ , and equals $\lfloor\frac{n}{\eta}\rfloor$ otherwise. Then, the expectation of $k$ is ${n}/{\eta}$ . However, we note that when $k=\lfloor\frac{n}{\eta}\rfloor+1$ , the possible value of the estimated truncated influence spread is no longer binary (i.e., [math] or $\eta$ ). To address such a new challenge, we define an estimator $\tilde{\Gamma}(S)$ as $\tilde{\Gamma}(S)=\eta$ if and only if $S\cap R\neq\emptyset$ , and $\tilde{\Gamma}(S)=0$ otherwise. At the first glance, it seems that the relationship between $\mathbb{E}[{\Gamma}(S)]$ and $\mathbb{E}[\tilde{\Gamma}(S)]$ is unclear. Fortunately, the following theorem shows that under the above setting of $k$ such that $\mathbb{E}[k]={n}/{\eta}$ , the ratio of ${\mathbb{E}[\tilde{\Gamma}(S)]}$ and ${\mathbb{E}[{\Gamma}(S)]}$ is in the range of $[1-1/{\mathrm{e}},1]$ .

Theorem 3.3.

Let $k^{\bot}=\lfloor\frac{n}{\eta}\rfloor$ and $r={n}/{\eta}-k^{\bot}$ be the integer and fractional part of ${n}/{\eta}$ , respectively. For any mRR-set, if we sample $k$ nodes such that $k=k^{\bot}+1$ with probability $r$ and $k=k^{\bot}$ otherwise, then

[TABLE]

Theorem 3.3 states that $\tilde{\Gamma}$ is a biased but sufficiently accurate estimator of the expected truncated influence spread $\mathbb{E}[\Gamma(S)]$ . In fact, this estimator also works for any residual graph $G_{i}$ . Specifically, let $\tilde{\Gamma}(S\mid S_{i-1})$ be the estimated truncated spread of $S$ in $G_{i}$ with respect to $\eta_{i}$ , the lowered target corresponding to graph $G_{i}$ . Recall that $\eta_{i}=\eta-(n-n_{i})$ . We have the following corollary.

Corollary 3.4.

In the residual graph $G_{i}$ , let $k^{\bot}=\lfloor\frac{n_{i}}{\eta_{i}}\rfloor$ and $r={n_{i}}/{\eta_{i}}-k^{\bot}$ be the integer and fractional part of ${n_{i}}/{\eta_{i}}$ , respectively. For each mRR-set, if we sample $k$ nodes such that $k=k^{\bot}+1$ with probability $r$ and $k=k^{\bot}$ otherwise, then

[TABLE]

Furthermore, for any two sets $S,S^{\prime}\subseteq V_{i}$ , it holds that

[TABLE]

Now, we can construct a $(1-1/{\mathrm{e}})(1-\varepsilon)$ -approximate greedy policy using the estimator $\tilde{\Gamma}$ built upon mRR-sets.

Remark. It is worth pointing out that our randomized rounding approach for choosing $k$ is critical for achieving the above approximation bound. Specifically, if we fix $k$ to be $\lfloor\frac{n}{\eta}\rfloor$ , following the proof methodology of Theorem 3.3, we may derive that the ratio of ${\mathbb{E}[\tilde{\Gamma}(S)]}$ to ${\mathbb{E}[{\Gamma}(S)]}$ will be in the range of $[1-1/\sqrt{{\mathrm{e}}},1]$ . On the other hand, if we fix $k$ to be $\lfloor\frac{n}{\eta}\rfloor+1$ , the ratio of ${\mathbb{E}[\tilde{\Gamma}(S)]}$ to ${\mathbb{E}[{\Gamma}(S)]}$ will be in the range of $[1-1/{\mathrm{e}},2]$ . Both settings yield much coarser bounds than our setting that uses a smart randomized rounding approach.

3.4. The Design of TRIM

Algorithm 2 presents the details of TRIM that can return a $(1-1/{\mathrm{e}})(1-\varepsilon)$ -approximate solution for truncated influence maximization for any input graph $G_{i}$ and error threshold $\varepsilon$ . TRIM is similar in spirit to OPIM-C which is the state-of-the-art algorithm for influence maximization (Tang et al., 2018b). Specifically, OPIM-C uses two disjoint groups of random RR-sets, among which one group is used to derive the solution and the other is used to verify its quality. We customize TRIM by utilizing one group of mRR-sets, which would be more efficient for selecting a singleton seed set as pointed out in (Huang et al., 2017). In a nutshell, TRIM starts from a small number of mRR-sets and iteratively increases the mRR-set number until a satisfactory solution is identified. Next, we discuss the details of TRIM.

In the mRR-set sampling stage (Lines 2 and 2), each mRR-set is started from a random set $K$ of nodes whose size $k$ is an independent random number. Recall that $k$ is $\lfloor\frac{n_{i}}{\eta_{i}}\rfloor+1$ with probability $\frac{n_{i}}{\eta_{i}}-\lfloor\frac{n_{i}}{\eta_{i}}\rfloor$ and $\lfloor\frac{n_{i}}{\eta_{i}}\rfloor$ otherwise. Given a set $\mathcal{R}$ of random mRR-sets, we say that a node $v$ covers a mRR-set $R\in\mathcal{R}$ if $v\in R$ , and we define the coverage of $v$ in $\mathcal{R}$ , denoted as $\Lambda_{\mathcal{R}}(v)$ , as the number of mRR-sets in $\mathcal{R}$ that are covered by $v$ . Based on the mRR-sets generated, TRIM identifies the node $v^{\ast}\in V_{i}$ that covers the largest number of mRR-sets in $\mathcal{R}$ (Line 2). Let $v^{\circ}$ be the optimal node such that $\Delta(v^{\circ}\mid S_{i-1})=\max_{v\in V_{i}}\Delta(v\mid S_{i-1})$ . Then, $\Lambda_{\mathcal{R}}(v^{\circ})$ is bounded by $\Lambda_{\mathcal{R}}(v^{\ast})$ . According to Lemma A.2 in Appendix A, with high probability, $\Lambda^{l}(v^{\ast})$ (Line 2) is a lower bound on the expected coverage of $v^{\ast}$ in $\mathcal{R}$ , which indicates that

[TABLE]

Similarly, with high probability, $\Lambda^{u}(v^{\circ})$ (Line 2) is an upper bound on the expected coverage of $v^{\circ}$ in $\mathcal{R}$ . Thus,

[TABLE]

In addition, by Equation (11) in Corollary 3.4, we know that

[TABLE]

Combining Equations (12)–(14), we can derive a quantitative relationship between $\Delta(v^{\ast}\mid S_{i-1})$ and $\Delta(v^{\circ}\mid S_{i-1})$ such that with high probability

[TABLE]

Therefore, the final guarantee is $(1-1/{\mathrm{e}}){\Lambda^{l}(v^{\ast})}/{\Lambda^{u}(v^{\circ})}$ . Note that in our stopping condition of ${\Lambda^{l}(v^{\ast})}/{\Lambda^{u}(v^{\circ})}\geq 1-\hat{\varepsilon}$ (Line 2), we use $\hat{\varepsilon}$ (defined in Line 2) to correct the error on Equations (12) and (13) (with low failure probability). This proves the $(1-1/{\mathrm{e}})(1-\varepsilon)$ approximation ratio of $\Delta(v^{\ast}\mid S_{i-1})$ .

3.5. Theoretical Analysis

Before we proceed to the theoretical analysis, we first present the hardness of ASM.

Lemma 3.5.

Given a probabilistic social network $G=(V,E)$ with $|V|=n$ and a threshold $\eta\in[1,n]$ , for any $\xi>0$ , adaptive seed minimization cannot be approximated within a ratio of $(1-\xi)\ln\eta$ in polynomial time unless $\mathrm{NP}\subseteq\mathrm{DTIME}(n^{O(\log\log n)})$ .

Approximation Guarantee. Theorem 3.1 indicates that any $\alpha$ -approximation greedy policy $\pi$ could achieve an approximation ratio of $\frac{(\ln\eta+1)^{2}}{\alpha}$ . We examine the potential of TRIM to serve the role of such a policy. To cope with the randomness of seed selection algorithms (due to sampling), we use the notion of expected approximation guarantee, which considers the average case. We first obtain the approximation ratio of TRIM for each round of seed selection.

Lemma 3.6.

For the $i$ -th round of seed selection in $G_{i}$ , TRIM returns a $(1-1/{\mathrm{e}})(1-\varepsilon)$ -approximate solution to the optimum.444Here, $\alpha$ -approximation indicates that $\mathbb{E}[\tfrac{1}{\Delta(v^{\ast}\mid S_{i-1})}]\leq\tfrac{1}{\alpha}\cdot\tfrac{1}{\Delta(v^{\circ}\mid S_{i-1})}$ , which is required by Theorem 3.1 for a randomized algorithm through a detailed check of the proof of Theorem 40 in (Golovin and Krause, 2017).

Combining Theorem 3.1 and Lemma 3.6, we obtain the approximation guarantee of ASTI.

Theorem 3.7.

ASTI* with the instantiation of TRIM achieves an expected approximation ratio of $\frac{(\ln\eta+1)^{2}}{(1-1/{\mathrm{e}})(1-\varepsilon)}$ .*

Time Complexity. The time complexity of TRIM is dominated by the procedure for generating mRR-sets. Intuitively, this is based on (i) how much time is used for generating a random mRR-set, and (ii) how many mRR-sets are generated. In what follows, we show their relationship. In particular, for the $i$ -th round of seed selection in $G_{i}$ , let ${\operatorname{OPT}}_{i}$ (resp. $v^{\diamond}$ ) be the optimum (resp. optimal node) of $\mathbb{E}[\tilde{\Gamma}(v\mid S_{i-1})]$ , i.e., ${\operatorname{OPT}}_{i}=\mathbb{E}[\tilde{\Gamma}(v^{\diamond}\mid S_{i-1})]=\max_{v}{\mathbb{E}[\tilde{\Gamma}(v\mid S_{i-1})]}$ . (Note that $v^{\ast}$ maximizes $\Lambda_{\mathcal{R}}(v)$ , $v^{\diamond}$ maximizes $\mathbb{E}[\tilde{\Gamma}(v\mid S_{i-1})]$ , and $v^{\circ}$ maximizes $\Delta(v\mid S_{i-1})$ .) We first show the expected time used for generating a random mRR-set in the following lemma.

Lemma 3.8.

For the $i$ -th round of seed selection in $G_{i}$ , the expected time complexity for generating a random mRR-set is $O\big{(}\frac{{\operatorname{OPT}}_{i}}{\eta_{i}}m_{i}\big{)}$ .

Now, we present the following lemma that gives the expected number of mRR-sets generated by TRIM. The proof is similar to that of OPIM-C (Tang et al., 2018b).

Lemma 3.9.

For the $i$ -th round of seed selection in $G_{i}$ , the expected number of mRR-sets TRIM generated is $O\big{(}\frac{\eta_{i}\ln{n_{i}}}{\varepsilon^{2}{\operatorname{OPT}}_{i}}\big{)}$ .555In general, it is $O\big{(}\frac{\eta_{i}\ln{({n_{i}}/{\varepsilon})}}{\varepsilon^{2}{\operatorname{OPT}}_{i}}\big{)}$ . Here, we assume that $\varepsilon\in\Omega\big{(}\frac{1}{\operatorname{poly}(n_{i})}\big{)}$ .

Finally, we provide the expected time complexity of TRIM in the following lemma.

Lemma 3.10.

For the $i$ -th round of seed selection in $G_{i}$ , TRIM achieves an expected time complexity of $O\big{(}\frac{m_{i}+n_{i}}{\varepsilon^{2}}\ln{n_{i}}\big{)}$ .

At the first glance, the expected time complexity of TRIM is counterintuitive. In particular, the expected root size of $n_{i}/\eta_{i}$ in the $i$ -th round is increasing with $i$ . It seems that the time complexity of TRIM is more likely to increase with $i$ . However, Lemma 3.10 just tells us the opposite. This is due to either the residual graph $G_{i}$ being reduced significantly (Lemma 3.8) or the mRR-set size being reduced considerably (Lemma 3.9). Overall, the time complexity of TRIM in each round can be independent of the number of initially selected nodes. There are at most $\eta$ rounds in total, we can derive the expected time complexity of ASTI instantiated with TRIM.

Theorem 3.11.

ASTI* with the instantiation of TRIM has an expected time complexity of $O\big{(}\frac{\eta\cdot(m+n)}{\varepsilon^{2}}\ln{n}\big{)}$ .*

4. Extensions

TRIM selects one node in each round until at least $\eta$ users are influenced. Therefore, the seed selection phase in ASTI instantiated by TRIM can be quite time consuming due to that the marginal (truncated) spread of a singleton node set is potentially small which may (i) involve in many rounds to achieve the target $\eta$ , and (ii) generate a large number of mRR-sets for constructing an $\alpha$ -approximate solution in each round. To mitigate the enormous overhead, we propose a batched version of TRIM, referred to as TRIM-B 666TRuncated Influence Maximization in the Batched model. algorithm, to accelerate the node selection process of ASTI.

4.1. Batched Version of TRIM

Algorithm 3 shows the details of the TRIM-B algorithm. TRIM-B generalizes TRIM by selecting a fixed number of $b$ seeds in each round, where $b$ is an input parameter to determine the batch size. Specifically, TRIM-B first generates a small number of random mRR-sets and then uses a greedy algorithm for maximum coverage (Vazirani, 2003) to identify a size- $b$ seed set $S_{b}$ to cover mRR-sets with an approximation guarantee of $\rho_{b}=1-(1-1/b)^{b}$ (Line 3). If $S_{b}$ meets the condition (Line 3), TRIM-B terminates; otherwise, the number of mRR-sets is doubled until a qualified $S_{b}$ is derived. Consequently, the approximation ratio of TRIM-B is $\rho_{b}(1-1/{\mathrm{e}})(1-\varepsilon)$ . Note that when the batch size $b$ is $1$ , TRIM-B degenerates to TRIM.

The major differences in the design between TRIM-B and TRIM are as follows. First, in TRIM-B, the definitions of variables $\theta_{\max}$ and $\theta_{\circ}$ are involved with $\rho_{b}$ and $b$ for generalization, as shown in Line 3 and Line 3, respectively. Second, to obtain the upper bound on the coverage of the optimal solution $S_{b}^{\circ}$ in $\mathcal{R}$ , the coverage of $\Lambda_{\mathcal{R}}(S_{b})$ is divided by $\rho_{b}$ (Line 3). Third, the ratio in the stop condition is updated to be $\rho_{b}(1-\hat{\varepsilon})$ (Line 3).

4.2. Theoretical Analysis

The theoretical analysis of TRIM-B can be obtained by generalizing the properties of TRIM.

Approximation Guarantee. To establish the overall approximation guarantee, we first analyze the approximation ratio of TRIM-B in each round of seed selection.

Lemma 4.1.

For the $i$ -th round of seed selection in $G_{i}$ , TRIM-B returns a $\rho_{b}(1-1/{\mathrm{e}})(1-\varepsilon)$ -approximate solution, where $\rho_{b}=1-(1-1/b)^{b}$ .

Combining Theorem 3.1 and Lemma 4.1, we obtain the approximation guarantee of TRIM-B.

Theorem 4.2.

ASTI* with the instantiation of TRIM-B achieves an expected approximation ratio of $\frac{(\ln\eta+1)^{2}}{\rho_{b}(1-1/{\mathrm{e}})(1-\varepsilon)}$ .*

Remark. Note that there exists a gap between the optimal policy in the sequential model and the optimal policy in the batched model, which is known as the adaptivity gap (Golovin and Krause, 2017). Adaptivity gap quantifies the performance difference between the optimal adaptive policy and the optimal non-adaptive policy. To explain, a size- $b$ seed set is selected as a batch ( $b\geq 1$ ) in TRIM-B without observing the realization of any seed therein. This selection is an non-adaptive process compared to that of $b=1$ in TRIM. As a consequence, there exists an adaptivity gap between the two algorithms if the batch size $b>1$ . However, to the best of our knowledge, this adaptivity gap remains unknown in viral marketing applications, which makes it hard to quantify the difference between the optimal policy in the sequential model and that in the batched model. Meanwhile, the existing bound of adaptivity gap of $(1-1/{\mathrm{e}})$ in (Chen and Krause, 2013) is not applicable to adaptive seed minimization. It holds only if the nodes in social graph $G$ are independent, which, however, is not true.

Time Complexity. The time complexity of TRIM-B depends on three factors: (i) the time for generating a random mRR-set, (ii) the number of mRR-sets generated, and (iii) the time to derive a size- $b$ seed set. The expected time used for generating a random mRR-set is given in Lemma 3.8. We now show the number of mRR-sets generated.

Lemma 4.3.

For the $i$ -th round of seed selection in $G_{i}$ , the expected number of mRR-sets TRIM-B generates is $O\Big{(}\frac{\eta_{i}\ln{\binom{n_{i}}{b}}}{\varepsilon^{2}{\operatorname{OPT}}_{b,i}}\Big{)}$ , where ${\operatorname{OPT}}_{b,i}$ denotes the maximum expected truncated spread among all the size- $b$ seed sets in $G_{i}$ .

On the other hand, the greedy algorithm for identifying the size- $b$ seed set runs in time linear to the total size of its input (Vazirani, 2003), i.e., $\sum_{R\in\mathcal{R}}\lvert R\rvert$ . Meanwhile, the total number of mRR-sets examined in all the iterations is within twice of that in the last iteration. According to Wald’s equation (Wald, 1947), the expected time complexity of the greedy procedure is $O(\mathbb{E}[\lvert\mathcal{R}\rvert]\cdot\mathbb{E}[\lvert R\rvert])$ , which is dominated by that for generating mRR-sets. Consequently, by Lemma 3.8 and Lemma 4.3, the expected time used in the $i$ -th round of TRIM-B is $O\big{(}\tfrac{b(m_{i}+n_{i})\ln n_{i}}{\varepsilon^{2}}\big{)}$ . There are at most $O(\eta/b)$ rounds in total. Based on the analysis above, the expected time complexity of TRIM-B is given in the following theorem.

Theorem 4.4.

ASTI* with the instantiation of TRIM-B achieves an expected time complexity of $O\big{(}\frac{\eta\cdot(m+n)}{\varepsilon^{2}}\ln{n}\big{)}$ .*

5. Additional Related Work

In Section 2.4, we have discussed the work (Vaswani and Lakshmanan, 2016) most related to ours. In what follows, we survey other relevant work in the literature.

Influence maximization, as the dual problem of seed minimization, seeks to identify a set of $k$ seed nodes with the maximum expected spread. Domingos and Richardson (Domingos and Richardson, 2001; Richardson and Domingos, 2002) are the first to study viral marketing from an algorithmic perspective. After that, Kempe et al. (Kempe et al., 2003) formulate the influence maximization problem and propose a greedy algorithm that returns $(1-1/{\mathrm{e}}-\epsilon)$ -approximation for several influence diffusion models, by utilizing Monte Carlo simulations. Subsequently, there has been a large body of research on improved algorithms for influence maximization (Kim et al., 2013; Chen et al., 2010b, a, 2009; Goyal et al., 2011a; Jung et al., 2012; Wang et al., 2010; Leskovec et al., 2007; Kempe et al., 2003, 2005; Borgs et al., 2014; Tang et al., 2015, 2018b, 2014; Nguyen et al., 2016; Huang et al., 2017; Tang et al., 2017, 2018a; Galhotra et al., 2016; Arora et al., 2017; Cheng et al., 2014; Cohen et al., 2014; Goyal et al., 2011b; Zhou et al., 2013). Among them, some recent work (Borgs et al., 2014; Huang et al., 2017; Nguyen et al., 2016; Tang et al., 2015, 2018b, 2014) focuses on algorithms that ensure $(1-1/{\mathrm{e}}-\varepsilon)$ -approximations by utilizing the reverse influence sampling technique (Borgs et al., 2014).

Seed minimization, which has mainly been studied from the non-adaptive perspective, aims at finding a minimum-size set of seed nodes to achieve a given threshold of expected spread. Chen (Chen, 2009) investigates seed minimization under a variant of the linear threshold model, where each node is assigned with a fixed threshold. Chen shows that the problem cannot be approximated within a ratio of $O(2^{\log^{1-\epsilon}n})$ unless $\mathrm{NP}\subseteq\mathrm{DTIME}(n^{\operatorname{polylog}(n)})$ as the expected spread function under the fixed threshold model is not submodular. After that, Long and Wong (Long and Wong, 2011) study seed minimization under the widely used independent cascade and linear threshold models. Goyal et al. (Goyal et al., 2013) provide a bi-criteria approximation algorithms for seed minimization. Zhang et al. (Zhang et al., 2014) then improve the theoretical results by removing the bi-criteria restriction. However, the requirements of these algorithms are either impractical or extremely stringent, which makes these algorithms vastly ineffective in practice. Han et al. (Han et al., 2017) propose the ATEUC algorithm for non-adaptive seed minimization by utilizing reverse influence sampling for estimating the spreads of nodes. However, the expected time complexity of the algorithm is unknown, and its worst-case time complexity is prohibitively large. As we show in the experiments, our adaptive algorithm is more effective than these non-adaptive algorithms in terms of the number of seed nodes required.

Finally, there is a series of recent work (Vaswani and Lakshmanan, 2016; Horel and Singer, 2015; Badanidiyuru et al., 2016; Seeman and Singer, 2013; Han et al., 2018) that focuses on adaptive influence maximization. Recall that, as analyzed in Section 3.1, to construct approximate solutions for adaptive seed minimization, some approximation algorithms for truncated influence maximization are required. However, the algorithms for adaptive influence maximization generally target at maximizing the influence spread in each round, which cannot provide theoretical guarantees for truncated influence maximization, as we point out in Section 3.2. As a consequence, techniques developed for adaptive influence maximization are inapplicable to the adaptive seed minimization problem. In addition, in the case of influence maximization, going adaptive does not really boost the spread significantly, as confirmed by the experiments in (Han et al., 2018). However, it shall be observed in our experiments that going adaptive provides a substantial advantage for seed minimization.

6. Experiments

This section evaluates the performance of the proposed algorithms against the state of the art. All the experiments are conducted on a Linux machine with an Intel Xeon 2.6GHz CPU and 64GB RAM. For fair comparison, we first randomly generate $20$ possible realizations for each dataset, and then measure the performance of each algorithm on those $20$ realizations and report the average performance.

6.1. Experimental Setting

Datasets. The experiments are conducted on four datasets, i.e., NetHEPT, Epinions, Youtube, and LiveJournal. NetHEPT (Chen et al., 2009) represents the academic collaboration networks of ”High Energy Physics - Theory” area. The rest of the three are real-life social networks from (Leskovec and Krevl, 2014). Table 2 summarizes the details of the four datasets. Note that an undirected edge is transformed into two directed edges. There does exist any isolated node in the four tested datasets. Furthermore, the number of nodes in the largest weakly connected component (LWCC) indicates that nodes are highly interconnected, especially for the three social networks. As shown in Figure 3, all the four datasets have a power law degree distribution. The largest dataset that has been used for adaptive seed minimization in the literature contains $75k$ nodes and $500k$ edges (Vaswani and Lakshmanan, 2016), which is far smaller than LiveJournal. To the best of our knowledge, LiveJournal with millions of nodes and edges is the largest dataset ever tested in adaptive seed minimization experiments.

Algorithms. We evaluate six algorithms: ASTI, ASTI-2, ASTI-4, ASTI-8, AdaptIM and ATEUC (Han et al., 2017). ASTI- $b$ is ASTI instantiated by TRIM-B with the batch sizes of $b$ . (Note that ASTI is the version with a batch size of $1$ .) AdaptIM is modified from the AdaptIM-1 method proposed in (Han et al., 2018) for the adaptive influence maximization problem. It iteratively runs a non-adaptive influence maximization algorithm (i.e., EPIC (Han et al., 2018)) to select the node that maximizes the expected marginal influence spread on the residual graphs, until the desired threshold is reached. AdaptIM differs from our ASTI algorithm in that it greedily selects the node to maximize the influence spread instead of the truncated influence spread. The batch size of AdaptIM is set to $1$ by default. As introduced in Section 5, ATEUC is the state of the art for the non-adaptive seed minimization problem. By comparing ASTI with ATEUC, we aim to prove the advantage of adaptivity over non-adaptivity in terms of the effectiveness. Meanwhile, three batched algorithms, i.e., ASTI-2, ASTI-4, ASTI-8, are compared with both ASTI and ATEUC to study how the batch size would affect the efficiency and effectiveness. For AdaptIM, we obtain the source code of AdaptIM-1 from the authors (Han et al., 2018) with some necessary modifications (e.g., stop condition). For the other five algorithms, we implement them in C++ strictly following the algorithm description and compile them with the same optimization options.

Parameter Settings. In our experiments, all the algorithms are tested under both the Independent Cascade (IC) model and the Linear Threshold (LT) model. Following the common setting in the literature (Tang et al., 2014; Arora et al., 2017), we set the approximation parameter $\varepsilon=0.5$ for the five adaptive algorithms. For those parameters in ATEUC, we use the values recommended in (Han et al., 2017). For each dataset, we set the edge probability $p(\langle u,v\rangle)=\frac{1}{\mathrm{indeg}_{v}}$ where $\mathrm{indeg}_{v}$ is the in-degree of node $v$ .

The performance metrics measured include the number of seeds selected and the corresponding running time. To better understand the performance of the algorithms, we design the large $\eta$ setting of the threshold for NetHEPT, Epinions, and Youtube, i.e., $\frac{\eta}{n}=\{0.01,0.05,0.1,0.15,0.2\}$ , where $n$ is the number of nodes in the social network. Observing that around $2K$ nodes are required on LiveJournal under the large $\eta$ setting which is not convenient for exhibition, we thus use a tailored small $\eta$ setting, i.e., $\frac{\eta}{n}=\{0.01,0.02,0.03,0.04,0.05\}$ for LiveJournal.

6.2. Results under the IC model

Seed Size vs. Threshold. Figure 4 reports the number of seeds selected by the six algorithms for different thresholds $\eta$ under the IC model. As can be seen, ASTI selects far fewer seed nodes than ATEUC does, especially when the threshold $\eta$ becomes larger. In general, ATEUC selects around $30\%$ – $40\%$ more nodes than ASTI does on all the four datasets. In particular, with a threshold $\eta/n=0.2$ on dataset Epinions, ASTI selects $116.95$ seed nodes on average while ATEUC needs $193.8$ seed nodes (i.e., $65.7\%$ more nodes). For the sake of clarity, Table 3 shows the exact improvement ratio of ASTI over ATEUC on the number of seed nodes for the corresponding five thresholds under both the IC and LT model. Note that there exist many points (indicated by N/A) where the actual number of nodes activated by the seed set returned by ATEUC does not reach the required threshold under some realizations. This is because ATEUC selects a node set $S$ such that $\mathbb{E}[I(S)]\geq\eta$ but may influence fewer than $\eta$ nodes under some realizations, whereas our adaptive algorithms always ensure that at least $\eta$ nodes are influenced by the returned node set under every realization. We shall explore this in more detail in Section 6.4. These facts support the superiority of adaptive algorithms over non-adaptive algorithms. We also observe that the number of nodes selected by AdaptIM is close to that of ASTI, which indicates that AdaptIM is empirically effective in seed minimization. However, it does not provide any approximation guarantees in terms of the number of nodes selected. Another interesting observation is that ASTI-2, ASTI-4, and ASTI-8 slightly increase the number of seed nodes selected compared with ASTI and still select nodes far less than ATEUC does for most of the cases. This confirms that adaptive algorithms by utilizing the information of partial realizations are more effective than non-adaptive algorithms.

Running Time vs. Threshold. Figure 5 presents the results of running time against the threshold under the IC model. As the results show, ATEUC runs faster than the other five adaptive algorithms on the four datasets when the threshold $\eta$ is large. The main reason is that adaptive algorithms involve multiple rounds of seed selection whereas only one round is required for non-adaptive algorithms. Observe that the running time of ATEUC generally decreases with the increase of the threshold $\eta$ , unlike the results of the five adaptive algorithms. The reason lies in the design of ATEUC. Specifically, ATEUC selects two seed set candidates $S_{u}$ and $S_{l}$ , which are taken as the upper bound and lower bound on the number of seed nodes in the optimal solution. Only when the condition $|S_{u}|\leq 2|S_{l}|$ is satisfied, the candidate set $S_{u}$ is returned as the solution; otherwise ATEUC will continue to refine $S_{u}$ and $S_{l}$ (Han et al., 2017). The larger the threshold, the more seed nodes are required, and the more easily this stop condition is met, which explains the unique running time pattern of ATEUC. We also observe that AdaptIM runs around $10$ – $20$ times slower than ASTI for all cases. Particularly, AdaptIM cannot finish within $72$ hours when $\eta/n=0.05$ under the IC model on the LiveJournal dataset (see Figure 5(d)). This demonstrates that AdaptIM is significantly inferior to ASTI in terms of computational overheads. The reason behind this is that ASTI selects the node to maximize the expected marginal truncated spread, while AdaptIM attempts to maximize the expected marginal influence spread. Specifically, recall that the expected number of mRR-sets generated by ASTI is proportional to $\eta_{i}/{\operatorname{OPT}}_{i}$ . Meanwhile, the expected number of RR-sets generated by AdaptIM is proportional to $n_{i}/{\operatorname{OPT}}_{i}^{\prime}$ , where ${\operatorname{OPT}}_{i}^{\prime}$ is the maximum expected marginal influence spread in the $i$ -th round of seed selection in $G_{i}$ . For the last few rounds of seed selection, we have ${\operatorname{OPT}}_{i}^{\prime}\approx{\operatorname{OPT}}_{i}\approx\eta_{i}\ll n_{i}$ , which indicates that the number of mRR-sets generated by ASTI is much smaller than the number of RR-sets generated by AdaptIM. Consequently, ASTI runs remarkably faster than AdaptIM. As such, ASTI is more preferable than AdaptIM, as the former provides significantly better efficiency and approximation guarantees than the latter, while offering similar empirical effectiveness. Note that the batched algorithms, i.e., ASTI-2, ASTI-4, and ASTI-8, reduce the running time significantly, to around $30\%$ , $10\%$ , and $5\%$ of ASTI, which makes them quite competitive with ATEUC in terms of the efficiency, not to mention AdaptIM. In addition, as explained earlier, the terminal condition $|S_{u}|\leq 2|S_{l}|$ in ATEUC is easier satisfied when the threshold $\eta$ is larger, and hence, ATEUC runs faster along with the increase of $\eta$ . On the other hand, the running times of the adaptive algorithms increase with $\eta$ . Therefore, ASTI-4 and ASTI-8 outperform ATEUC on datasets Epinions and Youtube when $\eta$ is relatively small, but when the threshold $\eta/n=0.2$ , the running times of all three algorithms become similar, as shown in Figures 5(b) and 5(c). Recall that ASTI-8 selects far fewer seed nodes than ATEUC does. Therefore, ASTI-8 strikes a good balance between efficiency and effectiveness in the current setting. We also observe that the running time of ASTI-8 fluctuates from $\eta/n=0.01$ to $\eta/n=0.05$ on datasets Epinions and Youtube. This is due to the combined effects of the threshold and the batch size. In these cases, it needs no more than $8$ nodes to reach the thresholds. Consequently, ASTI-8 finishes selecting seed nodes within just one round. However, when $\eta/n$ increases from $0.01$ to $0.05$ , the root size of mRR-sets decreases. As a consequence, it takes relatively less time to generate a random mRR-set in practice, which leads to the decrease in running time.

6.3. Results under the LT model

Seed Size vs. Threshold. Figure 6 reports the number of nodes selected by different algorithms under the LT model. In general, the results show similar trends to those observed in Figure 4. Similarly, AdaptIM selects a close number of nodes as ASTI does on the four datasets, with negligible difference. ATEUC requires around $40\%$ more nodes than the five adaptive algorithms do. Details are displayed in Table 3. In addition, we also observe that ASTI-8 selects more nodes than ATEUC for several settings (e.g., $\eta/n=0.01$ on the Epionions and Youtube datasets). Through a careful analysis, we find that (i) all the algorithms select less nodes under the LT model than those under the IC model, and (ii) ASTI-8 selects $8$ seed nodes in a batch with influence spread much higher than the requirements. These observations clearly tell us that there is a tradeoff in the setting of batch size. Increasing the batch size will speed up the algorithms but may result in more nodes selected.

Running Time vs. Threshold. Figure 7 shows the results of running time for different thresholds under the LT model. The conclusions we summarize for Figure 5 are generally applicable to Figure 7 as well. The major differences lie in two aspects: (i) the running time under the LT model is shorter than that under the IC model under the same setting as it takes less time to generate a random mRR-set under the LT model than that under the IC model (as mentioned and analyzed in previous work (Arora et al., 2017; Tang et al., 2018b)), which is consistent with the results in Figure 6, (ii) ASTI-4 outperforms ATEUC on Epinions and ASTI-8 outperforms ATEUC on both Epinions and Youtube for all cases under the LT model. This fact indicates (i) the batched version of ASTI is more scalable than ATEUC does, and (ii) when the batch size $b$ is well-calibrated, ASTI can beat ATEUC in both efficiency and effectiveness.

6.4. Discussions on Spread Distribution

As discussed previously, non-adaptive algorithms may find solutions with influence spread far away from the requirement (i.e., either under-qualified or over-qualified). Figure 8 reports the spread distribution of $20$ realizations achieved by the ASTI and ATEUC algorithms on the NetHEPT dataset under the IC and LT models, respectively. The solid (red) line in the figure represents the spread threshold ( $153$ ) required. As shown, ATEUC fails to reach the threshold for $5$ and $6$ realizations under the IC and LT models, respectively, with corresponding percentages of $25\%$ and $30\%$ . In addition, for $5$ and $6$ realizations under the IC and LT models, respectively, the seed nodes selected by ATEUC produce influence spread much higher (over $50\%$ ) than the requirement. In contrast, ASTI meets the spread requirement for all the realizations under both the IC and LT models. Moreover, the spread produced by ASTI is generally kept close to the requirement. The spread exceeds the requirement by more than $50\%$ for only $2$ realizations under the LT model. These two over-qualified exceptions are due to that the last seed node selected achieves much higher spread than the gap to reach $\eta$ , which is rare to happen in practice. These observations indicate that non-adaptive algorithms are unreliable for seed minimization.

7. Conclusion

This paper studies the problem of adaptive seed minimization, and proposes algorithms that provide both strong theoretical guarantees and superior empirical effectiveness. Our approach is based on a novel ASTI framework instantiated by a truncated influence maximization algorithm TRIM, which has a provable approximation guarantee. The core of our TRIM algorithm is an elegant sampling method that generates random multi-root reverse reachable (mRR) sets for estimating the truncated influence spread. We also extend TRIM into its batched version TRIM-B to further improve the efficiency of seed selection. With extensive experiments on real data, we show that our solutions considerably outperform the state of the art for seed minimization under both the IC and LT diffusion models.

Acknowledgements.

This research is supported by Sponsor Singapore National Research Foundation under grant Grant #NRF-RSS2016-004, by Sponsor Singapore Ministry of Education Academic Research Fund Tier 2 under grant Grant #MOE2015-T2-2-069, by Sponsor National University of Singapore under an Grant #SUG, by Sponsor Singapore Ministry of Education Academic Research Fund Tier 1 under grant Grant #MOE2017-T1-002-024, and by a Grant #Discovery grant and a Grant #Discovery Accelerator Supplement grant from the Sponsor Natural Sciences and Engineering Research Council of Canada (NSERC) .

Appendix A Concentration Bounds

We show some useful martingale concentration bounds, i.e., the Chernoff-like bounds (Tang et al., 2015) and their variants (Tang et al., 2018b).

Lemma A.1 ((Tang

et al., 2015)).

Let $X_{1}-\mathbb{E}[X_{1}],\dots,X_{T}-\mathbb{E}[X_{T}]$ be a martingale difference sequence such that $X_{i}\in[0,1]$ for each $i$ . Let $\bar{X}=\frac{1}{{T}}\sum_{i=1}^{T}X_{i}$ . If $\mathbb{E}[X_{i}]$ is identical for every $i$ , i.e., $\mathbb{E}[X_{i}]=\mathbb{E}[\bar{X}]$ , then for any $\lambda\geq 0$ , we have

[TABLE]

Lemma A.2 ((Tang

et al., 2018b)).

Let $X_{1}-\mathbb{E}[X_{1}],\dots,X_{T}-\mathbb{E}[X_{T}]$ be a martingale difference sequence such that $X_{i}\in[0,1]$ for each $i$ . Let $\bar{X}=\frac{1}{{T}}\sum_{i=1}^{T}X_{i}$ . If $\mathbb{E}[X_{i}]$ is identical for every $i$ , i.e., $\mathbb{E}[X_{i}]=\mathbb{E}[\bar{X}]$ , then for any $\lambda\geq 0$ , we have

[TABLE]

Appendix B Proofs

We first introduce the following lemma that is used to prove Theorem 3.1.

Lemma B.1 ((Golovin and

Krause, 2017)).

If function $\Gamma$ satisfies all the following conditions:

•

there exists $Q$ such that $\Gamma_{\phi}(V)=Q$ for all $\phi$ ;

•

$\Gamma$ * is integer-valued;*

•

$\Gamma$ * is self-certifying;*

•

$\Gamma$ * is strong adaptive monotone;*

•

$\Gamma$ * is strong adaptive submodular;*

then an $\alpha$ -approximate greedy policy $\pi$ achieves an approximation ratio of $\frac{(\ln\eta+1)^{2}}{\alpha}$ .

Proof of Theorem 3.1.

Obviously, $\Gamma_{\phi}(V)=\eta$ for all $\phi$ and $\Gamma$ is an integer-valued function. Now, we need to prove that for any $v\in V_{i}$ , $\phi,\phi^{\prime}\in\Omega_{i}$ , and $j\leq i$

[TABLE]

where $\Delta(v\mid S_{j-1};S_{i-1}):=\mathbb{E}_{\Phi\sim\Omega_{i}}[\Gamma_{\Phi}(v\mid S_{j-1})]$ . Equation (20) represents self-certifying, Equation (21) describes strong monotonicity, Equations (22) and (23) capture strong adaptive submodularity.

Equation (20) obviously holds, i.e., if $\Gamma_{\phi}(S_{i-1})=\Gamma_{\phi}(V)=\eta$ , we must have $\Gamma_{\phi^{\prime}}(S_{i-1})=\eta=\Gamma_{\phi^{\prime}}(V)$ , and vice versa.

Equation (21) holds naturally as “selecting more nodes never hurts” the function $\Gamma$ .

Next, we prove Equation (22). Let $\phi_{i}$ be a realization of $G_{i}$ with probability $p(\phi_{i})$ according to the influence propagation. Let $\Omega_{j}(\phi_{i})$ be the subset realizations of $\Omega_{j}$ that are consistent with $\phi_{i}$ . That is, for every $\phi\in\Omega_{j}(\phi_{i})$ and every edge $e\in E_{i}$ , the statuses of $e$ are the same in $\phi$ and $\phi_{i}$ such that both are either live or blocked. Then, for any $\phi_{i}$ ,

[TABLE]

In addition, for any $\phi\in\Omega_{i}$ , let $V_{\phi}(v\mid S_{i-1})$ be the set of nodes activated by $v$ in $G_{i}$ . Thus, $\lvert V_{\phi}(v\mid S_{i-1})\rvert$ is the spread of $v$ in $G_{i}$ under realization $\phi$ . As a consequence, the marginal truncated spread of $v$ in $G_{i}$ under $\phi$ is

[TABLE]

Similarly, for any $\phi\in\Omega_{j}$ , we have

[TABLE]

where the inequality is due to $G_{i}\subseteq G_{j}$ and $\eta_{i}\leq\eta_{j}$ . Therefore,

[TABLE]

Finally, we prove Equation (23). For any $\phi\in\Omega_{i}$ , we have

[TABLE]

Taking the expectation over $\Phi\sim\Omega_{i}$ completes the proof. ∎

Proof of Theorem 3.3.

We prove the elementary version of Equation (9), i.e., for any given realization $\phi$ ,

[TABLE]

where the expectation is only taken over the randomness of root size $K$ .

Let $x=I_{\phi}(S)$ denote the number of nodes influenced by $S$ under $\phi$ . Let $p(x)$ be the probability that none of the $k$ nodes sampled can be influenced by $S$ , which is given by

[TABLE]

Then, by the definition of $\tilde{\Gamma}_{\phi}(S)$ , with probability $p(x)$ , $\tilde{\Gamma}_{\phi}(S)=0$ ; and with probability $1-p(x)$ , $\tilde{\Gamma}_{\phi}(S)=\eta$ . As a consequence, we have

[TABLE]

where the expectation on the right hand side is taken with respect to the randomness of $k$ . Let $f(x)$ be the ratio of $\mathbb{E}[\tilde{\Gamma}_{\phi}(S)]$ to ${\Gamma}_{\phi}(S)$ , which is given by

[TABLE]

Now, we need to prove that $1-1/{\mathrm{e}}\leq f(x)\leq 1$ . We consider the following two scenarios: (i) $x\geq\eta$ , and (ii) $x<\eta$ .

(i) $x\geq\eta$ : In this case, $f(x)=1-\mathbb{E}[p(x)]\leq 1$ . Meanwhile,

[TABLE]

As $1-y\leq{\mathrm{e}}^{-y}$ for any $0\leq y\leq 1$ , in the above equation,

[TABLE]

As $x\geq\eta$ by assumption, this implies that $f(x)\geq 1-1/{\mathrm{e}}$ .

(ii) $x<\eta$ : In this case, $f(x)=\eta\big{(}1-\mathbb{E}[p(x)]\big{)}/x$ . Take the derivative,

[TABLE]

Let $g(x)=p(x)-1-xp^{\prime}(x)$ . Take the derivative, when $x>0$ ,

[TABLE]

According to the definition of $p(x)$ , we can get that

[TABLE]

Thus, $g(x)$ decreases with $x$ , which indicates that $g(x)\leq g(0)=0$ . This implies that $f^{\prime}(x)\leq 0$ . As a consequence,

[TABLE]

where the last step above follows from the analysis for the case $x\geq\eta$ , by considering the special case $x=\eta$ .

Hence, the theorem is proved. ∎

Proof of Corollary 3.4.

Equation (10) follows directly from Theorem 3.3. By Equation (10),

[TABLE]

Hence, Equation (11) holds. ∎

Proof of Lemma 3.5.

We consider the special case of the adaptive seed minimization problem in which the probability $p(e)=1$ for each edge $e\in E$ . In this case, for any node $v\in V$ , the set of nodes influenced by $v$ is the set of nodes that can be reached by $v$ in $G$ , denoting as the cover set $S_{v}$ . Thus, for each node $v\in V$ , its cover set $S_{v}$ is deterministic. As a consequence, the adaptive seed minimization problem reduces to a set cover problem, i.e., aiming to find as few nodes as possible to cover at least $\eta$ nodes. Feige (Feige, 1998) has shown that no polynomial time algorithm can approximate the optimal solution of set cover within a ratio of $(1-\varepsilon)\ln\eta$ for any $\varepsilon>0$ unless $\mathrm{NP}\subseteq\mathrm{DTIME}(n^{O(\log\log n)})$ . Hence, lemma 3.5 holds on noting that ASM generalizes set cover. ∎

Proof of Lemma 3.6.

Let $\mathcal{E}$ be the following event:

[TABLE]

Note that $v^{\ast}$ is the seed node returned by the policy which is a random variable. Let $U_{t}$ be the set of possible seed nodes selected (but not necessarily returned) by TRIM in the $t$ -th iteration in which each node $u\in U_{t}$ has a probability $\Pr[u]$ such that $\sum_{u\in U_{t}}\Pr[u]=1$ , where $1\leq t\leq T$ . Let $U^{\ast}_{t}$ denote the set of random seed nodes returned at the $t$ -th iteration of TRIM, where $U^{\ast}_{t}\subseteq U_{t}$ . Therefore, the event $\mathcal{E}(v^{\ast})$ does not happen only if there exists a node $v^{\ast}_{t}\in U^{\ast}_{t}$ at iteration $t\in[1,T]$ satisfying that $\mathcal{E}(v^{\ast}_{t})$ does not happen.

If TRIM stops at the iteration $t=T$ , according to the setting of $\theta_{\max}$ and by (Tang et al., 2015), we have

[TABLE]

If TRIM stops at the iteration $t<T$ , for any node $v\in V_{i}$ , we define two events $\mathcal{E}_{1}(v)$ and $\mathcal{E}_{2}(v)$ as

[TABLE]

where $\mathbb{E}[\Lambda_{\mathcal{R}}(v)]={\lvert\mathcal{R}\rvert}\cdot\mathbb{E}[\tilde{\Gamma}(v\mid S_{i-1})]/{\eta_{i}}$ is the expected coverage of $v$ in $\mathcal{R}$ . Then, if $v$ is independent of $\mathcal{R}$ , by Lemma A.2, we have

[TABLE]

By a union bound that ensures all the $n_{i}$ nodes satisfying Equation (26), we have

[TABLE]

Meanwhile, $v^{\circ}$ is independent of $\mathcal{R}$ naturally. Thus, together with the fact that $\Lambda_{\mathcal{R}}(v^{\circ})\leq\Lambda_{\mathcal{R}}(v^{\ast})$ , by Equation (27)

[TABLE]

As a consequence, when TRIM stops at ${\Lambda^{l}(v^{\ast}_{t})}/{\Lambda^{u}(v^{\circ})}\geq 1-\hat{\varepsilon}$ , if the event $\mathcal{E}(v^{\ast}_{t})$ does not happen, then at least one of the events $\mathcal{E}_{1}(v^{\ast}_{t})$ and $\mathcal{E}_{2}(v^{\circ})$ does not happen. Thus, the event $\mathcal{E}(v^{\ast}_{t})$ does not happen for all $t<T$ with probability at most:

[TABLE]

Combining Equations (25) and (28) shows that the event $\mathcal{E}(v^{\ast})$ holds with probability at least $1-\delta$ . Thus, together with the Equation (11) in Corollary 3.4, we have

[TABLE]

Hence, the lemma is proved. ∎

Proof of Lemma 3.8.

For any node $v$ , $v$ is not visited by a random mRR-set $R$ if and only if $v\notin R$ . The probability for not visiting $v$ under a realization $\phi$ is $p(x_{v})$ , where $x_{v}=I_{\phi}(v\mid S_{i-1})$ is the number of nodes that can be activated by $v$ in $G_{i}$ under $\phi$ . On the other hand, if a node is visited, all of its incoming edges will be examined. Let $\mathrm{indeg}_{v}$ denote the number of $v$ ’s incoming edges. Then, the expected time complexity for generating a random mRR-set is

[TABLE]

where the expectation is over the randomness of both $k$ and $\Phi$ . In addition, we already know that

[TABLE]

Combining (29) and (30) gives

[TABLE]

Hence, the lemma is proved. ∎

Proof of Lemma 3.9.

Let $\varepsilon_{1}=\hat{\varepsilon}/{2}$ and ${\varepsilon}_{2}$ be the root of

[TABLE]

where $a=c\ln({4n_{i}T}/{\delta})$ for any $c\geq 1$ and $\delta=1/n_{i}$ . Let

[TABLE]

As $\hat{\varepsilon}=O(\varepsilon)$ , one can verify that $\theta^{\ast}=O\big{(}\frac{\eta_{i}\ln n_{i}}{\varepsilon^{2}{\operatorname{OPT}}_{i}}\big{)}$ .777Without loss of generality, we assume $\hat{\varepsilon}\leq 0.5$ . If $\hat{\varepsilon}>0.5$ , TRIM achieves a higher approximation of $0.5$ with $O\big{(}\frac{\eta_{i}\ln n_{i}}{{\operatorname{OPT}}_{i}}\big{)}$ mRR-sets. Define the events $\mathcal{E}_{1},\mathcal{E}_{2},\mathcal{E}_{3},\mathcal{E}_{4}$ as follows:

[TABLE]

Then, when a number of $\lvert\mathcal{R}\rvert=c\theta^{\ast}$ mRR-sets are generated, by Lemma A.1, it is easy to verify that any event $\mathcal{E}_{j}$ ( $1\leq j\leq 4$ ) does not happen with probability at most

[TABLE]

By the union bound, the probability that all the events $\mathcal{E}_{1},\mathcal{E}_{2},\mathcal{E}_{3},\mathcal{E}_{4}$ happen is at least $1-\delta^{c}$ .

If the events $\mathcal{E}_{1},\mathcal{E}_{2}$ happen,

[TABLE]

Thus, we have

[TABLE]

In addition, let

[TABLE]

According to the definition of $\varepsilon_{2}$ , we have

[TABLE]

Since $a_{1}\leq a$ , if event $\mathcal{E}_{4}$ happens (i.e., $\Lambda_{\mathcal{R}}(v^{\ast})\leq\Lambda_{l}$ ), then

[TABLE]

As a consequence, if event $\mathcal{E}_{3}$ also happens, we have

[TABLE]

Similarly, let

[TABLE]

According to the definition of $\varepsilon_{2}$ , we have

[TABLE]

Since $a_{2}\leq a$ , if event $\mathcal{E}_{3}$ happens (i.e., $\Lambda_{\mathcal{R}}(v^{\ast})\geq\Lambda_{u}$ ), then

[TABLE]

As a consequence, we have

[TABLE]

Putting it all together of (31), (32) and (33), we have

[TABLE]

Therefore, when a number of $c\theta^{\ast}$ mRR-sets are generated, TRIM does not stop only if at least one of the events in $\mathcal{E}_{1},\mathcal{E}_{2},\mathcal{E}_{3},\mathcal{E}_{4}$ does not happen, with probability at most $\delta^{c}$ .

Let $t$ be the first iteration that the number of mRR-sets generated by TRIM reaches $\theta^{\ast}$ such that $2^{t-2}\cdot\theta_{\circ}<\theta^{\ast}$ and $2^{t-1}\cdot\theta_{\circ}\geq\theta^{\ast}$ . From this iteration onward, the expected number of mRR-sets further generated is at most

[TABLE]

The first inequality is due to $2^{t-1}\cdot\theta_{\circ}<2\theta^{\ast}$ and $\delta\leq 1/2$ , and the second inequality is due to $-2^{z}+z\leq-z$ . If the algorithm stops before the $t$ -th iteration, there are at most $\theta^{\ast}$ random samples generated. Therefore, the expected number of random samples generated is less than $5\theta^{\ast}$ , which is $O\big{(}\frac{\eta_{i}\ln n_{i}}{\varepsilon^{2}{\operatorname{OPT}}_{i}}\big{)}$ .

Hence, the lemma is proved. ∎

Proof of Lemma 3.10.

The time complexity of TRIM is determined by that for generating mRR-sets. By Wald’s equation (Wald, 1947), the expected total time used for generating mRR-sets equals the expected number of mRR-sets generated, times the expected time used for generating one mRR-set. Thus, according to Lemmas 3.8 and 3.9, the expected time complexity of TRIM is $O(\frac{m_{i}+n_{i}}{\varepsilon^{2}}\ln{n_{i}})$ . ∎

Proof of Lemma 4.1.

Let $S^{\ast}$ be the seed set returned by the batched policy with $|S^{\ast}|=b$ and $S^{\circ}$ be the corresponding optimal seed set in the $i$ -th round. Let $\mathcal{E}_{b}$ be the following event:

[TABLE]

Let $S^{\ast}_{t}$ be the generalized definition of $v^{\ast}_{t}$ in Section 3.5. If $S^{\ast}$ is returned at $T$ -th iteration, based on the setting of $T$ and by (Tang et al., 2015), we still have

[TABLE]

If TRIM-B stops at the iteration $t<T$ , for any node $S\subseteq V_{i}$ obtained by greedy method with $|S|=b$ , we define two events $\mathcal{E}_{b,1}(S)$ and $\mathcal{E}_{b2}(v)$ as

[TABLE]

where $\mathbb{E}[\Lambda_{\mathcal{R}}(S)]={\lvert\mathcal{R}\rvert}\cdot\mathbb{E}[\tilde{\Gamma}(S\mid S_{i-1})]/{\eta_{i}}$ is the expected coverage of $S$ in $\mathcal{R}$ .

Based on Lemma A.2, we could have

[TABLE]

Similarly, by union bound for all $\binom{n_{i}}{b}$ candidates of size- $b$ node set, we could immediately have

[TABLE]

Let $S^{\circ}_{\mathcal{R}}$ be the size- $b$ seed set that could cover largest number of mRR-sets in $\mathcal{R}$ . Since $S$ is derived by Greedy method from $\mathcal{R}$ , by the property of greedy method, we have $\Lambda_{\mathcal{R}}(S)\geq\rho_{b}\Lambda_{\mathcal{R}}(S^{\circ}_{\mathcal{R}})\geq\rho_{b}\Lambda_{\mathcal{R}}(S^{\circ}).$ Then $\Lambda_{\mathcal{R}}(S)/\rho_{b}$ can be taken as the upper bound of $\Lambda_{\mathcal{R}}(S^{\circ})$ . Similarly, by Lemma A.2, we have following equation

[TABLE]

By following the analysis in Section 3.5, we acquire the fact that event $\mathcal{E}_{b}$ holds with at least $1-\delta$ probability where $\delta=1/n_{i}$ . By Corollary 3.4, the expected approximation ratio of TRIM-B is at least

[TABLE]

Hence, the lemma is proved. ∎

Appendix C Discussions on Influence Spread

Figure 9 reports the spread of the tested algorithms under the IC model (results under the LT model are similar). For the most parts, all the algorithms achieve a comparable spread on the four datasets. The major differences lie in $\eta/n=0.01$ on Epinions and Youtube. As observed, ASTI-8 (resp. ATEUC) achieves the largest (resp. smallest) spread among all algorithms. This is because the batch size $b$ is relatively large with regard to the small threshold, owing to which the spread of the 8-size seed set selected by ASTI-8 significantly overshoots $0.01n$ on Epinions and Youtube. Another interesting observation is that the spread achieved by ATEUC is slightly larger than each of the other five adaptive algorithms as the threshold becomes larger (not quite noticeable in the figure). This is because ATEUC selects considerably more seeds than the adaptive algorithms do, resulting in a larger spread at the cost of an excessive number of seeds. This is also supported by the results in Table 3.

Appendix D Discussions on Marginal Truncated Spread

To explore the property of the marginal truncated spread, we record the marginal spread of each seed node selected by adaptive algorithms under the $20$ realizations sampled. Figures 10 shows the result of each realization with $\eta/n=0.2$ on corresponding datasets (or $\eta/n=0.05$ on the LiveJournal dataset) under the IC model. (The result under the LT model is similar.) In general, the marginal spread diminishes along the index of the seed node, which is consistent with the property of submodularity as expected. Note that the spread fluctuation is due to the randomness of the tested realizations, i.e., in some particular realizations, some seed node selected later may influence more nodes than some seed node selected earlier.

Bibliography49

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1)
2Arora et al . (2017) Akhil Arora, Sainyam Galhotra, and Sayan Ranu. 2017. Debunking the Myths of Influence Maximization: An In-Depth Benchmarking Study. In Proc. ACM SIGMOD . 651–666.
3Asadpour et al . (2008) Arash Asadpour, Hamid Nazerzadeh, and Amin Saberi. 2008. Stochastic Submodular Maximization. In Proc. WINE . 477–489.
4Badanidiyuru et al . (2016) Ashwinkumar Badanidiyuru, Christos Papadimitriou, Aviad Rubinstein, Lior Seeman, and Yaron Singer. 2016. Locally Adaptive Optimization: Adaptive Seeding for Monotone Submodular Functions. In Proc. SODA . 414–429.
5Barbieri et al . (2012) Nicola Barbieri, Francesco Bonchi, and Giuseppe Manco. 2012. Topic-Aware Social Influence Propagation Models. In Proc. IEEE ICDM . 81–90.
6Borgs et al . (2014) Christian Borgs, Michael Brautbar, Jennifer Chayes, and Brendan Lucier. 2014. Maximizing Social Influence in Nearly Optimal Time. In Proc. SODA . 946–957.
7Chen (2009) Ning Chen. 2009. On the Approximability of Influence in Social Networks. SIAM Journal on Discrete Mathematics 23, 3 (2009), 1400–1415.
8Chen et al . (2010 a) Wei Chen, Chi Wang, and Yajun Wang. 2010 a. Scalable Influence Maximization for Prevalent Viral Marketing in Large-Scale Social Networks. In Proc. ACM KDD . 1029–1038.