Non-zero-sum Stackelberg Budget Allocation Game for Computational   Advertising

Daisuke Hatano; Yuko Kuroki; Yasushi Kawase; Hanna Sumita; Naonori; Kakimura; Ken-ichi Kawarabayashi

arXiv:1906.05998·cs.GT·June 18, 2019

Non-zero-sum Stackelberg Budget Allocation Game for Computational Advertising

Daisuke Hatano, Yuko Kuroki, Yasushi Kawase, Hanna Sumita, Naonori, Kakimura, Ken-ichi Kawarabayashi

PDF

Open Access

TL;DR

This paper introduces a novel Stackelberg game model for budget allocation in computational advertising, addressing competitive market dynamics and proposing algorithms with proven guarantees and efficiency.

Contribution

It formalizes a new Stackelberg budget allocation model with a bipartite influence structure and develops algorithms for finding strong equilibria, including approximation, heuristic, and exact methods.

Findings

01

Algorithms outperform baseline in real-world tests

02

Proposed methods effectively handle competitive influence

03

Exact algorithm works for disjoint customer cases

Abstract

Computational advertising has been studied to design efficient marketing strategies that maximize the number of acquired customers. In an increased competitive market, however, a market leader (a leader) requires the acquisition of new customers as well as the retention of her loyal customers because there often exists a competitor (a follower) who tries to attract customers away from the market leader. In this paper, we formalize a new model called the Stackelberg budget allocation game with a bipartite influence model by extending a budget allocation problem over a bipartite graph to a Stackelberg game. To find a strong Stackelberg equilibrium, a standard solution concept of the Stackelberg game, we propose two algorithms: an approximation algorithm with provable guarantees and an efficient heuristic algorithm. In addition, for a special case where customers are disjoint, we propose…

Tables2

Table 1. Table 1: Results averaged over 30 instances for real-world datasets.

MovieLens $((n, m, \| E \|) = (20, 844, 3506))$
$𝒟_{ℱ}$	( $k_{L}, k_{F}$ )	Greedy	Approx	Prop.
$𝒰 (0, 0.2)$	$(1, 2)$	37.05	37.05	37.05
$𝒰 (0, 0.2)$	$(2, 2)$	65.10	65.10	65.24
$𝒰 (0, 0.2)$	$(4, 2)$	114.22	114.22	114.22
$𝒰 (0.1, 0.9)$	$(1, 2)$	14.34	14.55	17.22
$𝒰 (0.1, 0.9)$	$(2, 2)$	24.22	29.97	31.87
$𝒰 (0.1, 0.9)$	$(4, 2)$	54.46	54.46	56.12

Table 2. Table 2: Results averaged over 30 instances for real-world datasets.

MovieLens $((n, m, \| E \|) = (20, 844, 3506))$
$𝒟_{ℱ}$	( $k_{L}, k_{F}$ )	Greedy	Approx	Prop.
$𝒰 (0, 0.2)$	$(1, 2)$	37.05	37.05	37.05
$𝒰 (0, 0.2)$	$(2, 2)$	65.10	65.10	65.24
$𝒰 (0, 0.2)$	$(4, 2)$	114.22	114.22	114.22
$𝒰 (0.1, 0.9)$	$(1, 2)$	14.34	14.55	17.22
$𝒰 (0.1, 0.9)$	$(2, 2)$	24.22	29.97	31.87
$𝒰 (0.1, 0.9)$	$(4, 2)$	54.46	54.46	56.12
Yahoo! Webscope $((n, m, \| E \|) = (50, 447, 871))$
$𝒟_{ℱ}$	( $k_{L}, k_{F}$ )	Greedy	Approx	Prop.
$𝒰 (0, 0.2)$	$(1, 2)$	5.42	5.42	5.46
$𝒰 (0, 0.2)$	$(2, 2)$	10.68	10.68	10.72
$𝒰 (0, 0.2)$	$(4, 2)$	19.56	19.56	19.55
$𝒰 (0.1, 0.9)$	$(1, 2)$	2.20	3.00	3.67
$𝒰 (0.1, 0.9)$	$(2, 2)$	5.03	6.36	7.05
$𝒰 (0.1, 0.9)$	$(4, 2)$	11.72	12.68	13.31

Equations38

P_{v} (z) = 1 - \prod_{u \in N_{v} : z_{u} = 1} (1 - p_{uv}),

P_{v} (z) = 1 - \prod_{u \in N_{v} : z_{u} = 1} (1 - p_{uv}),

f (x^{*}, y^{*}) \geq f (x, y)

f (x^{*}, y^{*}) \geq f (x, y)

f (z, y) = \sum_{v \in V} P_{v} (z) (1 - P_{F, v} (y)) .

f (z, y) = \sum_{v \in V} P_{v} (z) (1 - P_{F, v} (y)) .

g (z, y) = \sum_{v \in V}

g (z, y) = \sum_{v \in V}

f_{BR} (x) = max {f (x, y) ∣ y \in BR (x)} .

f_{BR} (x) = max {f (x, y) ∣ y \in BR (x)} .

max f_{BR} (x) s.t. x \in D_{L} .

max f_{BR} (x) s.t. x \in D_{L} .

Φ (x, y)

Φ (x, y)

= \sum_{v \in V} [P_{v} (x) (1 - P_{F, v} (y)) - P_{v} (y) (1 - P_{v} (x))] .

f (x^{'}, y^{'})

f (x^{'}, y^{'})

\geq α Φ (x^{*}, y^{*}) - ϵ + ϵ_{1} = α f (x^{*}, y^{*}) - (α ϵ_{2} - ϵ_{1} + ϵ),

\displaystyle\begin{array}[]{rl}\max&f(x,y)\\ {\rm s.t.}&x\in D_{L},\\ &g(x,y)\geq g(x,y^{\prime})\quad\forall y^{\prime}\in D_{F}.\end{array}

\displaystyle\begin{array}[]{rl}\max&f(x,y)\\ {\rm s.t.}&x\in D_{L},\\ &g(x,y)\geq g(x,y^{\prime})\quad\forall y^{\prime}\in D_{F}.\end{array}

\displaystyle\!\!\!\!\!\!\begin{array}[]{rl}\max&\sum_{z\in S_{L}}f(z,y^{*})x_{z}\\ \text{s.t.}&\sum_{z\in S_{L}}(g(z,y^{*})-g(z,y^{\prime}))x_{z}\geq 0\ \ \ \forall y^{\prime}\in D_{F},\\ &\sum_{z\in S_{L}}x_{z}=1,\\ &x_{z}\geq 0\quad\forall z\in S_{L}.\end{array}

\displaystyle\!\!\!\!\!\!\begin{array}[]{rl}\max&\sum_{z\in S_{L}}f(z,y^{*})x_{z}\\ \text{s.t.}&\sum_{z\in S_{L}}(g(z,y^{*})-g(z,y^{\prime}))x_{z}\geq 0\ \ \ \forall y^{\prime}\in D_{F},\\ &\sum_{z\in S_{L}}x_{z}=1,\\ &x_{z}\geq 0\quad\forall z\in S_{L}.\end{array}

0 \leq z \leq 1, \sum_{u \in U} z_{u} \leq k_{L} .

0 \leq z \leq 1, \sum_{u \in U} z_{u} \leq k_{L} .

\sum_{u \in S} z_{u} \leq q (S) (\forall S \subseteq U), z \geq 0.

\sum_{u \in S} z_{u} \leq q (S) (\forall S \subseteq U), z \geq 0.

f (x, y)

f (x, y)

g (x, y)

\displaystyle\begin{array}[]{rl}\max&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}(1-p_{F,uv}y^{*}_{u})r_{u}\\ \text{s.t.}&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}y^{\prime}_{u}r^{\prime}_{u}\geq 0,\\ &\ y^{\prime}_{u}=y^{*}_{u}-y_{u}\quad\forall u\in U,y\in D_{F},\\ &\ r^{\prime}_{u}=1-p^{\prime}_{uv}r_{u}\quad\forall u\in U,\\ &\ \sum_{u\in U}r_{u}\leq k_{L},\\ &\ r_{u}\in[0,1]\quad\forall u\in U.\end{array}

\displaystyle\begin{array}[]{rl}\max&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}(1-p_{F,uv}y^{*}_{u})r_{u}\\ \text{s.t.}&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}y^{\prime}_{u}r^{\prime}_{u}\geq 0,\\ &\ y^{\prime}_{u}=y^{*}_{u}-y_{u}\quad\forall u\in U,y\in D_{F},\\ &\ r^{\prime}_{u}=1-p^{\prime}_{uv}r_{u}\quad\forall u\in U,\\ &\ \sum_{u\in U}r_{u}\leq k_{L},\\ &\ r_{u}\in[0,1]\quad\forall u\in U.\end{array}

f (x, y)

f (x, y)

g (x, y)

\sum_{u \in U} r_{u} \leq k_{L}, 0 \leq r_{u} \leq 1 (\forall u \in U) .

\sum_{u \in U} r_{u} \leq k_{L}, 0 \leq r_{u} \leq 1 (\forall u \in U) .

\displaystyle\begin{array}[]{rl}\max&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}(1-p_{F,uv}y^{*}_{u})r_{u}\\ \text{s.t.}&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}y^{\prime}_{u}r^{\prime}_{u}\geq 0,\\ &\ y^{\prime}_{u}=y^{*}_{u}-y_{u}\quad\forall u\in U,y\in D_{F},\\ &\ r^{\prime}_{u}=1-p^{\prime}_{uv}r_{u}\quad\forall u\in U,\\ &\ \sum_{u\in U}r_{u}\leq k_{L},\\ &\ r_{u}\in[0,1]\quad\forall u\in U.\end{array}

\displaystyle\begin{array}[]{rl}\max&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}(1-p_{F,uv}y^{*}_{u})r_{u}\\ \text{s.t.}&\ \sum_{u\in U}\sum_{v\in N_{u}}p_{uv}y^{\prime}_{u}r^{\prime}_{u}\geq 0,\\ &\ y^{\prime}_{u}=y^{*}_{u}-y_{u}\quad\forall u\in U,y\in D_{F},\\ &\ r^{\prime}_{u}=1-p^{\prime}_{uv}r_{u}\quad\forall u\in U,\\ &\ \sum_{u\in U}r_{u}\leq k_{L},\\ &\ r_{u}\in[0,1]\quad\forall u\in U.\end{array}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConsumer Market Behavior and Pricing · Complex Network Analysis Techniques · Game Theory and Applications

Full text

\includeversion

fullpaper \excludeversionconference

Non-zero-sum Stackelberg Budget Allocation Game for Computational Advertising

Daisuke Hatano

RIKEN AIP [email protected]

Yuko Kuroki

The University of Tokyo [email protected]

Yasushi Kawase

Tokyo Institute of Technology and RIKEN AIP [email protected]

Hanna Sumita

Tokyo Metropolitan University [email protected]

Naonori Kakimura

Keio University [email protected]

Ken-ichi Kawarabayashi

National Institute of Informatics [email protected]

Abstract

Computational advertising has been studied to design efficient marketing strategies that maximize the number of acquired customers. In an increased competitive market, however, a market leader (a leader) requires the acquisition of new customers as well as the retention of her loyal customers because there often exists a competitor (a follower) who tries to attract customers away from the market leader. In this paper, we formalize a new model called the Stackelberg budget allocation game with a bipartite influence model by extending a budget allocation problem over a bipartite graph to a Stackelberg game. To find a strong Stackelberg equilibrium, a standard solution concept of the Stackelberg game, we propose two algorithms: an approximation algorithm with provable guarantees and an efficient heuristic algorithm. In addition, for a special case where customers are disjoint, we propose an exact algorithm based on linear programming. Our experiments using real-world datasets demonstrate that our algorithms outperform a baseline algorithm even when the follower is a powerful competitor.

1 Introduction

An aim of computational advertising is to find the best advertisement that can help build customers loyalty. More specifically, the purpose of advertisers is to devise an optimum allocation of budgets to media, such as newspapers, radio stations, TV, and websites, in order to maximize the number of activated customers. Recently, Alon et al. [1] proposed a model to deal with a simple case of the problem, called a bipartite influence model. In this study, we shall extend the model by integrating a game-theoretic framework, called the non-zero-sum Stackelberg game framework. Let us explain the model more precisely below.

In the bipartite influence model, we consider a bipartite graph where one side is a set of media, the other is a set of customers, and each edge is associated with a probability. Intuitively, each edge between a medium and a customer indicates that the customer is influenced by the medium with some given probability that depends on the budget allocated to the medium. We aim to allocate budgets on media so that the expected number of activated customers is maximized. The problem can be formulated as a combinatorial optimization problem. Constant-factor approximation algorithms for the problem have been developed in a framework of submodularity [1, 12, 13].

In this paper, we shall try to extend the above-mentioned model to deal with a situation of a duopoly where a market leader has occupied the market of a certain product for a long time and a competitor tries to break into the market. The competitor tries to grab the share of the market by aggressively marketing its product. On the other hand, the market leader wants to gain customers and retain her loyal customers simultaneously. This implies that the leader’s gain does not necessarily result in the competitor’s loss. In order to capture the dynamics of this market, we exploit a Stackelberg game [14] framework to model the interactions between the market leader and the competitor. The Stackelberg game is a two-player two-period game, in which one player (a leader) can commit to an action before the other player (a follower) plays an action. A standard solution concept of this game is the strong Stackelberg equilibrium, which is an optimal solution maximizing the leader’s utility under the constraint that the follower plays a best response to the leader’s action (i.e., intended to maximize the follower’s utility).

The Stackelberg game matches to model our problem setting because the leader wants to increase the number of activated customers, and at the same time, prevent the outflow of her customers, which is achieved by finding a strong Stackelberg equilibrium. In a strong Stackelberg equilibrium, the leader plays a mixed strategy and the follower plays a pure strategy, where pure strategy and mixed strategy correspond to a budget allocation and a probability distribution over the pure strategies, respectively.

In this paper, we propose a new model called the Stackelberg budget allocation game with a bipartite influence model, which is an extension of the budget allocation problem presented in [1]. The difficulties of our game lie in the leader’s utility function. Our game belongs to a non-zero-sum game, and the utility function is a submodular (nonlinear) function even when the follower’s action is fixed. It is hard to construct an approximation algorithms by the following reasons: (i) the cumbersome constraint that the follower optimally responds and (ii) the leader’s utility may be non-linearly changed by a follower’s strategy. Thus, existing techniques for submodular functions cannot be directly applied to our problem. Furthermore, the leader’s utility function is not necessarily monotone, that is, the utility does not always increase in the number of allocated budgets. This entails the increment of the number of pure strategies. To design an efficient algorithm is an arduous task.

In this paper, we propose three efficient algorithms:

•

We design an approximation algorithm with theoretical guarantee. The key idea to construct an approximation algorithm is to create a zero-sum game close to the original non-zero-sum game, and to find an approximate minimax strategy of the zero-sum game with the aid of submodularity.

•

We give an efficient heuristic algorithm that repeatedly finds a leader’s pure strategy greedily and uniformly picks from the pure strategies. The running time is polynomial in the leader’s budget. This heuristic can deal with a situation that the leader should not spend up her whole budget due to the non-monotonicity of the utility function. We also evaluate its performance by numerical experiments.

•

If the customers are disjoint, we prove that a strong Stackelberg equilibrium can be found efficiently even when the leader has exponentially many pure strategies by using the multiple linear programming (LP for short) formulations. The point in the disjoint case is that we can aggregate a leader’s mixed strategy to a fractional budget allocation. At the same time we can recover a mixed strategy in a compact representation without loss of the leader’s utility. This enables us to save memories to keep a mixed strategy and reduce the size of LP instances.

The rest of the paper is organized as follows: We describe related work in Section 2 and define notations in Section 3. We formalize our model and analyze its (mathematical) properties in Section 4. We then provide an approximation and a heuristic algorithms in Section 5, and provide an exact algorithm for the disjoint customers in Section 6. In Section 7, we empirically show the performance of our algorithm, and finally we conclude the study in Section 8.

2 Related work

Our problem setting can be viewed as a non-monotone non-zero-sum Stackelberg game with submodular functions. Vanek et al. [17] modeled a non-zero-sum Stackelberg game with submodular functions where the defender (the leader) cares about minimizing the loss of her utility. In our game, the leader maximizes her utility incorporating her loss against the follower’s action. Thus, the goal of the leader is different. Moreover, direct application of their technique to find a Stackelberg equilibrium does not seem to work well in our setting. Recently, Wilder et al. [18] extended a bipartite influence model to a zero-sum Stackelberg game, which is closely related to our problem setting. They proved that the problem is APX-hard, while it has FPTAS for some special cases.

In combinatorial optimization and machine learning, approximation algorithms for maximizing submodular functions under certain constraints have been extensively studied [10]. Our problem can be viewed as a submodular maximization under a best-response constraint, which is more cumbersome than typical constraints in the submodular maximization literature (e.g., cardinality constraint and knapsack constraint).

The budget allocation problem with the bipartite influence model has been extended in [11, 12, 7, 16]. In particular, some formulations have incorporated the view of the multi-agent system. Maehara et al. [11] extended a budget allocation in the bipartite influence model to a strategic form game, called the budget allocation game with a bipartite influence model. Hatano et al. [7] extended the budget allocation problem to the problem with two participants; advertiser and match maker. In the problem, there exist multiple advertisers who cooperatively maximize the influence on customers and single match maker who allocates slots of media to advertisers.

3 Preliminary

Let $\mathbb{Z}_{+}$ be the set of non-negative integers. For an integer $k\in\mathbb{Z}_{+}$ , let $[k]$ be the set $\{1,2,\ldots,k\}$ . In this section, we describe the budget allocation problem with a bipartite influence model and the Stackelberg game.

3.1 Bipartite influence model

Let $G=(U,V;E)$ be a bipartite graph, where $(U,V)$ is a bipartition and $E\subseteq U\times V$ is a set of edges. Each vertex $u\in U$ corresponds to a medium and $v\in V$ corresponds to a customer. Let $n$ and $m$ be the sizes of $U$ and $V$ , respectively. Each edge $uv\in E$ is associated with a probability $p_{uv}\in[0,1]$ , which means that allocating a budget to medium $u\in U$ activates customer $v\in V$ with probability $p_{uv}$ . We assume that the activation events are independent. The advertiser has a total available budget of $k\in\mathbb{Z}_{+}$ , and each medium $u\in U$ has a slot to which the advertiser can allocate her budget. The goal is to find the optimal budget allocation $z\in\{0,1\}^{U}$ with $\sum_{u\in U}z_{u}\leq k$ that maximizes the number of activated customers. Throughout this paper, we identify a set $S$ of media with its characteristic vector $z_{S}\in\{0,1\}^{U}$ . A probability that a customer $v\in V$ is activated by the advertiser’s trial from media in $U$ is given by

[TABLE]

where $N_{v}=\{u\mid uv\in E\}$ is the set of the neighbors of $v$ . The expected number of customers activated through the budget allocation $z$ is given by $\sum_{v\in V}P_{v}(z)$ . The objective of the budget allocation problem with a bipartite influence model is to find $z$ that maximizes $\sum_{v\in V}P_{v}(z)$ subject to $\sum_{u\in U}z_{u}\leq k$ .

The function $P_{v}(z)$ is shown to be a monotone submodular function [15]. Here, a function $f:\{0,1\}^{n}\rightarrow\mathbb{R}$ is submodular if it satisfies $f(x)+f(y)\geq f(x\vee y)+f(x\wedge y)$ for all $x,y\in\{0,1\}^{n}$ , where $x\vee y$ and $x\wedge y$ denote the vector of component-wise maxima and minima, respectively, i.e., $(x\vee y)_{i}=\max\{x_{i},y_{i}\}$ and $(x\wedge y)_{i}=\min\{x_{i},y_{i}\}$ . A function $f$ is monotone if it satisfies $f(x)\leq f(y)$ for all $x\leq y$ , i.e., $x_{i}\leq y_{i}$ for all $i\in[n]$ . Thus the budget allocation problem is a special case of the submodular maximization problem with a cardinality constraint, and it is well-known that the problem is NP-hard [3] and has a $(1-1/e)$ -approximation algorithm [13].

3.2 Stackelberg game

The Stackelberg game is played between two players: the leader and the follower. Both players can play a mixed strategy, but it is sufficient to consider that the follower plays a pure strategy. Let $S_{L}$ and $D_{F}$ be the sets of pure strategies of the leader and the follower, respectively. We denote the set of mixed strategies of the leader by $D_{L}=\{x\in[0,1]^{S_{L}}\mid\sum_{s\in S_{L}}x_{s}=1\}$ , each of which is a probability distribution on pure strategies in $S_{L}$ . We define $f:D_{L}\times D_{F}\rightarrow\mathbb{R}$ and $g:D_{L}\times D_{F}\rightarrow\mathbb{R}$ as utility functions of the leader and the follower, respectively. We define an instance of the game as $\mathcal{G}=(D_{L},D_{F},f,g)$ . Let $\mathrm{BR}(x)=\mathop{\rm arg\,max}\limits\nolimits_{y\in D_{F}}g(x,y)$ be the set of best responses of the follower against $x$ . In this game, the leader will commit to play a mixed strategy before the follower plays his strategy. Thus the leader needs to find a mixed strategy $x$ maximizing $f(x,y)$ under the constraint that the follower would choose a best-response pure strategy $y\in\mathrm{BR}(x)$ . More precisely, the goal of this game is to find a leader’s mixed strategy that forms a strong Stackelberg equilibrium, as indicated below.

Definition 3.1.

*A strong Stackelberg equilibrium of $\mathcal{G}$ is a pair $(x^{*},y^{*})$ that satisfies {conference} $f(x^{*},y^{*})\geq f(x,y)$

{fullpaper}*

[TABLE]

for all $x\in D_{L}$ , $y\in\mathrm{BR}(x)$ , and $y^{*}\in\mathrm{BR}(x^{*})$ .

4 Stackelberg budget allocation game

In this section, we extend the budget allocation problem with a bipartite influence model to a Stackelberg game. For any set $S_{L}$ and $s\in S_{L}$ , we denote by $\chi_{s}$ a characteristic vector in $\{0,1\}^{S_{L}}$ such that $(\chi_{s})_{s^{\prime}}=1$ for $s^{\prime}=s$ and $(\chi_{s})_{s^{\prime}}=0$ for $s^{\prime}\neq s$ ( $s^{\prime}\in S_{L}$ ). For a mixed strategy $x\in D_{L}$ , the support of $x$ is the set of pure strategies that is played with non-zero probability under $x$ , i.e., $\mathop{\rm supp}(x)=\{s\in S_{L}\mid x_{s}>0\}$ .

4.1 Definition

Let $G=(U,V;E)$ be a bipartite graph consisting of a set $U$ of $n$ media, a set $V$ of $m$ customers, and a set $E$ of edges between them. For each $uv\in E$ , we denote by $p_{uv}$ a probability that a customer $v$ is activated through a medium $u$ by a leader’s or a follower’s trial, and by $p_{F,uv}$ a probability that a medium $u$ activates a customer $v$ who has been already activated by the leader. Two probabilities intuitively mean that $p_{uv}$ is a basic activation probability in the market, and $p_{F,uv}$ is a probability that the follower recaptures customers who were activated by the leader. Let $k_{L}$ and $k_{F}$ be the budgets of the leader and the follower, respectively. An instance of the Stackelberg budget allocation game with a bipartite influence model is parameterized by $\phi=(G=(U,V;E),\{p_{uv}\}_{uv\in E},\{p_{F,uv}\}_{uv\in E},k_{L},k_{F})$ .

We construct a Stackelberg game $\mathcal{G}$ from an instance $\phi$ as follows. A pure strategy for the leader (respectively the follower) is a set of at most $k_{L}$ media (respectively $k_{F}$ media). $D_{L}$ and $D_{F}$ of the game $\mathcal{G}$ are defined by setting $S_{L}=\{z\in\{0,1\}^{U}\mid\sum_{u\in U}z_{u}\leq k_{L}\}$ (or equivalently $S_{L}=\{S\subseteq U\mid|S|\leq k_{L}\}$ ) and $D_{F}=\{y\in\{0,1\}^{U}\mid\sum_{u\in U}y_{u}\leq k_{F}\}$ .

Let $v\in V$ be any customer. Let $z$ and $y$ be a leader’s and a follower’s pure strategies, respectively. The probability that the leader activates $v$ is given by the equation (1). If $v$ is not activated by the leader, then the activation probability for the follower is given by the same basic probability, that is $P_{v}(y)$ . If $v$ is activated by the leader, then the probability that the follower attracts a customer $v\in V$ away from the leader is $P_{F,v}(y)=1-\prod_{u\in N_{v}:y_{u}=1}(1-p_{F,uv})$ .

Example 4.1.

We explain the difference between $p_{uv}$ and $p_{F,uv}$ . Consider a game instance illustrated in Figure 1. There are three media $u_{1},u_{2},u_{3}$ and four customers $v_{1},v_{2},v_{3},v_{4}$ . For an arbitrary edge $uv$ , $p_{uv}=0.8$ and $p_{F,uv}=0.5$ . The budget for the leader and the follower is $k_{L}=2$ and $k_{F}=1$ , respectively. At first, the leader plays a mixed strategy $x$ that chooses $\{u_{1},u_{2}\}$ w.p. $1$ . Suppose the situation in Figure 1(a) where $\{u_{1},u_{2}\}$ is chosen and $v_{1}$ , $v_{2}$ , and $v_{3}$ are activated w.p. 0.8, who are shown in gray. After that, the follower plays a pure strategy that chooses $\{u_{3}\}$ . In Figure 1(b), the customer $v_{2}$ switches to the follower w.p. $0.5$ if $v_{2}$ is activated by the leader, and otherwise $v_{2}$ is activated w.p. $0.8$ . Thus, the probability to activate $v_{2}$ is $0.96\cdot 0.5+0.04\cdot 0.8=0.512$ . In addition, $v_{4}$ is activated w.p. $0.8$ because $v_{4}$ is non-activated.

The utility functions $f$ and $g$ are given as follows. The expected number of customers that are activated by the leader but do not shift to the follower is given by

[TABLE]

The expected number of activated customers for the follower is given by

[TABLE]

When the leader uses a mixed strategy $x$ , we abuse the notation and write $P_{v}(x)=\mathbb{E}_{z\sim x}[P_{v}(z)]$ . Here, for a probability distribution $x$ over a domain $D$ , $z\sim x$ means that we sample $z\in D$ from the distribution $x$ . Similarly, we write $f(x,y)=\mathbb{E}_{z\sim x}[f(z,y)]=\sum_{v\in V}P_{v}(x)(1-P_{F,v}(y))$ , and $g(x,y)=\mathbb{E}_{z\sim x}[g(z,y)]=\sum_{v\in V}\bigr{(}P_{v}(x)P_{F,v}(y)+(1-P_{v}(x))P_{v}(y)\bigl{)}$ .

The goal of the Stackelberg budget allocation game with a bipartite influence model is to find a leader’s mixed strategy $x$ in a strong Stackelberg equilibrium of the game $\mathcal{G}$ . We define a function $f_{\texttt{BR}}$ that receives a mixed strategy $x\in D_{L}$ and returns the leader’s utility when the follower takes a best response, i.e.,

[TABLE]

We aim to solve

[TABLE]

Note that $x$ is an optimal solution to (2) if and only if $(x,y)$ is a strong Stackelberg equilibrium, where $y$ is a best response against $x$ . We can evaluate $f_{\texttt{BR}}(x)$ for $x\in S_{L}$ in ${\rm O}(|D_{F}|\cdot|E|\cdot|\mathop{\rm supp}(x)|)$ time by evaluating $f(x,y)$ $|D_{F}|$ times. To obtain the value of $f(x,y)$ , we evaluate $f(z,y)$ for $z\in\mathop{\rm supp}(x)$ , which takes ${\rm O}(|E|)$ time.

We now see that the leader’s optimal strategy may not be a pure strategy.

Example 4.2.

{fullpaper}

*Consider an instance with $U=\{u_{1},u_{2},u_{3}\}$ , $V=\{v_{1},v_{2},v_{3},v_{4}\}$ , $E=\{u_{1}v_{1},u_{1}v_{2},u_{2}v_{2},u_{2}v_{3},u_{3}v_{4}\}$ , and $k_{L}=k_{F}=1$ . For a set $S$ of media, $\chi_{S}$ denotes a unit vector in $\{0,1\}^{S_{L}}$ with $(\chi_{S})_{S}=1$ . The instance including activation probabilities is depicted in Figure 2(a).

{conference} Consider an instance depicted in Figure 2(a) with $k_{L}=k_{F}=1$ .

In this case, an optimal strategy for the leader is $x^{*}=0.5\chi_{\{u_{1}\}}+0.5\chi_{\{u_{2}\}}$ and $f_{\texttt{BR}}(x^{*})=1.1$ where the best response of the follower is $\{u_{1}\}$ . However, $f_{\texttt{BR}}(\chi_{\{u_{1}\}})=f_{\texttt{BR}}(\chi_{\{u_{2}\}})=0.6$ and $f_{\texttt{BR}}(\chi_{\{u_{3}\}})=0.599$ .*

We next see that the leader may not use the whole budget in her optimal strategy.

Example 4.3.

{fullpaper}

*Consider an instance depicted in Figure 2(b) where $U=\{u_{1},u_{2},u_{3}\}$ , $V=\{v_{1},v_{2}\}$ , $E=\{u_{1}v_{1},u_{2}v_{1},u_{2}v_{2},u_{3}v_{2}\}$ , $k_{L}=3$ , and $k_{F}=1$ . Also, $p_{u_{1}v_{1}}=p_{F,u_{2}v_{1}}=p_{F,u_{2}v_{2}}=p_{u_{3}v_{2}}=1$ and $p_{F,u_{1}v_{1}}=p_{u_{2}v_{1}}=p_{u_{2}v_{2}}=p_{F,u_{3}v_{2}}=0$ .

{conference} Consider an instance depicted in Figure 2(b) with $k_{L}=3$ and $k_{F}=1$ .

Then $f_{\texttt{BR}}(\chi_{U})=0$ while $f_{\texttt{BR}}(\chi_{\{u_{1}\}})=f_{\texttt{BR}}(\chi_{\{u_{3}\}})=1$ .*

{conference}

There also exists an instance without a pure Stackelberg equilibrium (see Example 4.4 in the upcoming full version).

{fullpaper} There also exists an instance without a pure Stackelberg equilibrium.

Example 4.4.

Consider an instance with $U=\{u_{1},u_{2},u_{3},u_{4}\}$ and $V=V_{1}\cup V_{2}\cup V_{3}\cup V_{4}$ where $V_{1}=\{v_{1,1},\dots,v_{1,10}\}$ and $V_{i}=\{v_{i,1},\dots,v_{i,6}\}$ $(i=2,3,4)$ . Let $E=\{uv\mid u=u_{i},v\in V_{i},i=1,2,3,4\}$ and let $p_{uv}=1$ and $p_{F,uv}=0.5$ for all $uv\in E$ . We have $|N_{v}|=1$ for all $v\in V$ . There exists a mixed Stackelberg equilibrium $(x,y)$ , where $x=\frac{1}{3}(\chi_{\{u_{1},u_{4}\}}+\chi_{\{u_{1},u_{2},u_{4}\}}+\chi_{\{u_{1},u_{3},u_{4}\}})$ and $y=\chi_{\{u_{2},u_{3}\}}$ . However, there is no pure Stackelberg equlibrium in this instance.

4.2 Hardness

In this subsection, we show hardness results. We observe that finding a leader’s optimal pure strategy when $k_{F}=0$ is equivalent to the optimal budget allocation problem. Thus, it is NP-hard to find the leader’s mixed strategy that forms a Stackelberg equilibrium even if $k_{F}=0$ , since our problem (2) when $k_{F}=0$ always has the leader’s optimal strategy that is pure. It is also known that the approximation ratio $1-1/e$ is best possible for the maximum coverage problem under the assumption that P $\neq$ NP [4]. Hence, our problem (2) is also inapproximable within ratio $1-1/e$ unless P $=$ NP.

Moreover, when $k_{F}$ is not a fixed constant, it is even NP-hard to evaluate $f_{\texttt{BR}}(x)$ for a given $x\in D_{L}$ . The proof is reducing from the maximum coverage problem, which is shown to be NP-hard (see e.g., [8]). Given an integer $k$ and a collection of sets $\mathcal{S}=\{S_{1},S_{2},\ldots,S_{n}\}$ , the maximum coverage problem is to find a subset $\mathcal{S}^{\prime}\subseteq\mathcal{S}$ of at most $k$ sets such that the number of covered elements $\left|\bigcup_{S_{i}\in\mathcal{S}^{\prime}}{S_{i}}\right|$ is maximized. {conference} See the upcoming full version for the proof.

Theorem 4.5.

It is NP-hard to compute $f_{\texttt{BR}}(x)$ for $x\in D_{L}$ .

{fullpaper}

Proof.

Let $(k,\mathcal{S})$ be any instance of the maximum coverage problem. We consider an instance of Stackelberg budget allocation problem: $U=\{S_{1},\dots,S_{n}\}$ , $V=\bigcup_{i=1}^{n}S_{i}$ , $E=\{S_{i}v\mid v\in S_{i},~{}i=1,\dots,n\}$ , and $p_{uv}=p_{F,uv}=1$ for each $uv\in E$ . Let $k_{L}=n$ and $k_{F}=k$ .

We fix a leader’s mixed strategy as $x=\chi_{U}$ . We denote $z$ be an all-one-vector. To evaluate $f_{\texttt{BR}}(x)$ , it is necessary to know $\max_{y\in D_{F}}g(x,y)$ since $f_{\texttt{BR}}(x)=|V|-\max_{y\in D_{F}}g(x,y)$ .

We show that there exists $\mathcal{S}^{\prime}$ that covers at least $\alpha$ elements if and only if $g(x,y)\geq\alpha$ for some $y\in D_{F}$ . By construction, we have $g(x,y)=\sum_{v\in V}P_{F,v}(y)$ . In addition, $P_{F,v}(y)=1$ if $v$ is covered by some set $S$ with $y_{S}=1$ , and $P_{F,v}(y)=0$ otherwise. Thus, $g(x,y)=\left|\bigcup_{S\in\mathop{\rm supp}(y)}{S}\right|$ .

Then, if $\mathcal{S}^{\prime}\subseteq\mathcal{S}$ that covers at least $\alpha$ elements, then $y\in D_{F}$ attains $g(x,y)\geq\alpha$ , where $y$ is defined by $y_{S}=1$ if $S\in\mathcal{S}^{\prime}$ and $y_{S}=0$ otherwise. Conversely, if $y$ satisfies $g(x,y)\geq\alpha$ , then $\mathop{\rm supp}(y)$ covers at least $\alpha$ elements. This completes the proof. ∎

5 Algorithms for non-disjoint customers

In this section, for the non-disjoint customers setting, which has no assumption about the graph structure, we propose two types of algorithms for (2). Let $\mathcal{G}$ be a game instance created from an instance $\phi=(G=(U,V;E),\{p_{uv}\}_{uv\in E},\{p_{F,uv}\}_{uv\in E},\allowbreak k_{L},k_{F})$ and let $\Lambda$ be its data size. Due to the hardness result (Theorem 4.5), in this section we assume that $k_{F}$ is a constant.

5.1 Approximation algorithm via zero-sum game

We shall approximately solve a game $\mathcal{G}$ by solving a zero-sum game close to $\mathcal{G}$ . The core idea of constructing such a zero-sum game is to keep the same set of best-responses of the follower for any strategy of the leader as $\mathcal{G}$ . Let us focus on the structure of $f$ and $g$ , which include the term $-\sum_{v\in V}P_{v}(x)P_{F,v}(y)$ and its negation, respectively. We define a utility function for the leader as

[TABLE]

Note that $C\coloneqq\max_{y\in D_{F}}\sum_{v\in V}P_{v}(y)\geq-\Phi(x,y)$ and we can compute $C$ in polynomial time since $|D_{F}|$ is polynomially bounded. Let $\mathcal{G}_{\Phi}$ be a zero-sum game $(D_{L},D_{F},\Phi,\allowbreak-\Phi)$ .

For reals $\alpha\in[0,1]$ and $\epsilon\geq 0$ , we call an algorithm $(\alpha,\epsilon)$ -approximation for $\mathcal{G}$ (resp. $\mathcal{G}_{\Phi}$ ) if it provides a strategy profile $(x^{\prime},y^{\prime})$ such that $y^{\prime}\in\mathrm{BR}(x^{\prime})$ and $f(x^{\prime},y^{\prime})\geq\alpha\cdot\max_{x\in D_{L}}f_{\texttt{BR}}(x)-\epsilon$ (resp. $\Phi(x^{\prime},y^{\prime})\geq\alpha\cdot\max_{x\in D_{L},y\in\mathrm{BR}(x)}\Phi(x,y)-\epsilon$ ). Such $(x^{\prime},y^{\prime})$ is called an $(\alpha,\epsilon)$ -approximate solution.

Lemma 5.1.

Let $(x^{\prime},y^{\prime})$ be an $(\alpha,\epsilon)$ -approximate solution of a zero-sum game $\mathcal{G}_{\Phi}$ , and let $(x^{*},y^{*})$ be a strong Stackelberg equilibrium of the original game $\mathcal{G}$ . Let $\epsilon_{1}\coloneqq\sum_{v\in V}(1-P_{v}(x^{\prime}))P_{v}(y^{\prime})$ and $\epsilon_{2}\coloneqq\sum_{v\in V}(1-P_{v}(x^{*}))P_{v}(y^{*})$ . Then $(x^{\prime},y^{\prime})$ is an $(\alpha,\alpha\epsilon_{2}-\epsilon_{1}+\epsilon)$ -approximate solution for the game $\mathcal{G}$ .

Proof.

We remark that $f(x,y)$ can be rewritten by $\Phi(x,y)$ as $f(x,y)=\Phi(x,y)+\sum_{v\in V}(1-P_{v}(x))P_{v}(y)$ . Let $(\tilde{x},\tilde{y})$ be the minimax strategy of $\mathcal{G}_{\Phi}$ . We have

[TABLE]

where the second inequality holds by $\Phi(\tilde{x},\tilde{y})=\max_{x\in D_{L},y\in\mathrm{BR}(x)}\Phi(x,y)\geq\Phi(x^{*},y^{*})$ . ∎

To find an approximate strong Stackelberg equilibrium, it suffices to find an approximate minimax strategy for $\mathcal{G}_{\Phi}$ . Note that since $|S_{L}|$ is an exponential size, finding a minimax strategy for $\mathcal{G}_{\Phi}$ is still intractable.

To this end, we use the multiplicative weight update method [2]. Based on this method, Kawase and Sumita [9] showed that, for any nonnegative monotone submodular functions $h_{1},\dots,h_{\nu}\colon\{0,1\}^{n}\to\mathbb{R}_{+}$ and $\epsilon>0$ , there exists an algorithm that finds a $(1-1/e-\epsilon)$ -approximate solution of $\max_{x\in D_{L}}\min_{i\in[\nu]}\mathbb{E}_{s\sim x}[h_{i}(s)]$ in polynomial time in $n$ , $\nu$ and $1/\epsilon$ . We set $h_{y}(z)=\Phi(z,y)+C$ for all pure strategies $z\in S_{L}$ and $y\in D_{F}$ . By the definition, $h_{y}$ is nonnegative monotone submodular for any $y\in D_{F}$ . Thus, we see that we can compute a $(1-1/e-\epsilon)$ -approximate solution for $\max_{x\in D_{L}}\min_{y\in D_{F}}(\Phi(x,y)+C)$ in polynomial time in $\Lambda$ and $1/\epsilon$ . This solution is $\bigl{(}1-1/e-\epsilon,(1/e+\epsilon)C\bigr{)}$ -approximate for $\mathcal{G}_{\Phi}$ . Therefore, by Lemma 5.1, we observe the following result.

Theorem 5.2.

For any $\epsilon>0$ , there exists a $(1-1/e-\epsilon,\beta)$ -approximation algorithm where $\beta=(1-1/e)\epsilon_{2}-\epsilon_{1}+(1/e+\epsilon)C$ and the running time is polynomial with respect to $\Lambda$ and $1/\epsilon$ .

5.2 Heuristic algorithm

In this subsection, we propose a heuristic algorithm. Intuitively, in the algorithm, the players fictitiously play a game $\ell$ times. Here $\ell$ is a parameter. Let us assume that the leader would know that the follower estimates the leader’s mixed strategy by observing the past budget allocations. In every phase, the leader needs to allocate her budgets so that the mixed strategy estimated by the follower maximizes the leader’s utility. The algorithm outputs a mixed strategy by repeating this phase $\ell$ times.

We describe informally our algorithm, which is summarized in Algorithm 1. The algorithm repeatedly computes $\ell$ pure strategies $\chi_{S_{1}},\ldots,\chi_{S_{\ell}}\in D_{L}$ , and outputs the best mixed strategy among $\frac{1}{i}(\chi_{S_{1}}+\cdots+\chi_{S_{i}})$ $(i=1,\dots,\ell)$ . At first round, $\chi_{S_{1}}$ is chosen to maximize $f_{\texttt{BR}}(x)$ . Each $\chi_{S_{i}}$ is computed greedily (lines 4–9).

In each round $i$ , we evaluate $f_{\texttt{BR}}$ ${\rm O}(n\cdot k_{L})$ times, and each evaluation of $f_{\texttt{BR}}$ takes ${\rm O}(|D_{F}|\cdot|E|\cdot i)$ time. Thus the total running time is ${\rm O}(|D_{F}|\cdot|E|\cdot n\cdot k_{L}\cdot\ell^{2})$ .

6 Algorithm for disjoint customers

In this section, we focus on the disjoint customers setting where each customer is interested in only one medium, i.e., $|N_{v}|=1$ for all $v\in V$ . This means that the utility functions $f,g$ are bilinear. In this special case, we propose an LP-based algorithm, and modify it so that it runs fast when $|D_{F}|$ is small. We denote by $\Lambda$ the data size of an input game instance $(G,\{p_{uv}\}_{uv\in E},\{p_{F,uv}\}_{uv\in E},k_{L},k_{F})$ . The following proposition is the main result in this section.

Proposition 6.1.

When $|N_{v}|=1$ for all $v\in V$ , we can find a strong Stackelberg equilibrium $(x,y)$ in polynomial time with respect to $|D_{F}|$ and $\Lambda$ .

As we will see in Section 6.1, it is easy to compute a strong Stackelberg equilibrium by a multiple LP formulation. The running time is polynomial with respect to $\lambda$ , $|S_{L}|$ , and $|D_{F}|$ . However, this is not sufficient since $|S_{L}|$ could be exponentially large with respect to $\lambda$ and $|D_{F}|$ . To remove the dependency on $|S_{L}|$ , we reduce the size of each LP in Section 6.2. The idea is a projection of a leader’s mixed strategy $x\in[0,1]^{S_{L}}$ onto a fractional budget allocation $r\in[0,1]^{U}$ .

6.1 Multiple LP formulation

We first describe a simple exact algorithm to solve (2). The problem (2) is rewritten as

[TABLE]

When we fix $y=y^{*}$ , LP (6) is equivalent to the following LP:

[TABLE]

The simple algorithm solves (6) exactly by solving (11) for each $y^{*}\in D_{F}$ . Each LP (11) is solvable in polynomial time with respect to $\Lambda$ , $|S_{L}|$ and $|D_{F}|$ , and the algorithm produces $|D_{F}|$ instances of LP (11). Thus this algorithm runs in polynomial time with respect to $\Lambda$ , $|S_{L}|$ , and $|D_{F}|$ .

6.2 Reduced formulation

Let $A$ be a matrix in $\{0,1\}^{U\times S_{L}}$ whose rows are all pure strategies. For notational convenience, we denote $p^{\prime}_{uv}=p_{uv}-p_{F,uv}$ for each $uv\in E$ . We denote by a fractional budget allocation $r\in[0,1]^{U}$ with $\sum_{u\in U}r_{u}\leq k_{L}$ . We remark that a fractional budget allocation is a different notion from a mixed strategy $x\in D_{L}$ ; the former is uniquely defined from the latter as $r_{u}=\sum_{S:u\in S}x_{S}\ (u\in U)$ , but the converse may not hold.

We first observe that $A$ projects a mixed strategy $x$ to a fractional budget allocation $Ax\in[0,1]^{U}$ . Let $Q=\{r\in[0,1]^{U}\mid r=Ax,x\in D_{L}\}$ .

Lemma 6.2.

For any vector $z$ , it holds that $z\in Q$ if and only if

[TABLE]

{fullpaper}

Proof.

For any set $S$ of media, we define $q(S)=\min\{|S|,k_{L}\}$ . It is not difficult to see that a vector $z$ is in $Q$ if and only if $z$ satisfies

[TABLE]

Moreover, $z$ satisfies (13) if and only if it satisfies (21). The only-if part is clear. To see the if part, assume that $\sum_{u\in S}z_{u}\leq q(S)$ for some $S$ . If $|S|\geq k_{L}$ , then it holds that $1^{\top}z\geq\sum_{u\in S}z_{u}>q(S)=k_{L}$ . Otherwise, i.e., $|S|<k_{L}$ , since $\sum_{u\in S}z_{u}>|S|$ , we have $z_{u}>1$ for some $u\in S$ . ∎

We can rewrite $P_{v}$ and $P_{F,v}$ as $P_{v}(z)=p_{uv}z_{u}$ and $P_{F,v}(y)=p_{F,uv}y_{u}$ , where $u$ is the only neighbor of $v$ . Then $f$ and $g$ are simplified as

[TABLE]

The utility functions $f(x,y)$ and $g(x,y)$ are bilinear. Moreover, they depend on a fractional budget allocation $Ax\in[0,1]^{U}$ rather than $x$ .

Lemma 6.3.

Assume that $|N_{v}|=1$ for all $v\in V$ . For each $x\in D_{L}$ and $y\in D_{F}$ , it holds that $f(x,y)=f(x^{\prime},y)$ and $g(x,y)=g(x^{\prime},y)$ for any $x\in D_{L}$ such that $Ax=Ax^{\prime}$ .

This lemma gives us an intuition that we solve (11) for a fractional budget allocation $r$ and recover a mixed strategy $x$ . We claim that LP (11) is polynomially equivalent to

[TABLE]

Indeed, if $(x,y)$ is an optimal solution for (11), then we obtain an optimal solution $(r,y)$ for (28) by setting $r=Ax$ . Conversely, let $(r,y)$ be any optimal solution for (28). We observe that $r\in Q$ by Lemma 6.8. If we can construct a mixed strategy $x\in D_{L}$ such that $r=Ax$ , then we see that $(x,y)$ is an optimal solution for (11) by Lemma 6.6. In the following, we show that we can recover $x\in D_{L}$ such that $r=Ax$ in polynomial time with respect to $\Lambda$ .

Lemma 6.4.

For any $r^{*}\in Q$ , there exists a polynomial-time algorithm that finds a mixed strategy $x\in D_{L}$ such that $|\mathop{\rm supp}(x)|\leq n+1$ and $r^{*}=\sum_{z\in\mathop{\rm supp}(x)}x_{z}z$ .

Note that the mixed strategy $x$ in the statement always exists by Carathéodory’s theorem. Lemma 6.7 holds even if some $|N_{v}|$ is not necessarily equal to one. A leader’s strategy in a Stackelberg equilibrium may have the support of a large size.

{fullpaper}

To show Lemma 6.7, we use a result by Grötschel et al. [5]. The separation problem for $Q$ is the problem that receives a vector $r^{*}$ and either asserts $r^{*}\in Q$ or finds a hyperplane $a^{\top}r=b$ such that $a^{\top}r^{*}>b$ and $a^{\top}r\leq b\ (\forall r\in Q)$ .

Theorem 6.5 (Grötschel et al. [5]).

If the separation problem for $Q$ can be solved in polynomial time, then there is a polynomial time algorithm that, on any input $r^{*}\in Q$ , computes $n+1$ pure strategies $z^{1},\ldots,z^{n+1}$ and coefficients $\lambda_{1},\ldots,\lambda_{n+1}\geq 0$ such that $r^{*}=\sum_{i=1}^{n+1}\lambda_{i}z^{i}$ and $\sum_{i=1}^{n+1}\lambda_{i}=1$ .

Proof of Lemma 6.7.

By Lemma 6.8, the separation problem for $Q$ can be solved in polynomial time as follows. We check the feasibility of the $2n+1$ inequalities in (21). If yes, then we assert $z^{*}\in Q$ ; otherwise, output any inequality that is not satisfied by $z^{*}$ .

Then by Theorem 6.5, we obtain a mixed strategy $x$ by setting $x_{z^{i}}=\lambda_{i}\ (i=1,\ldots,n+1)$ and other elements are zero. This strategy $x$ needs memory of polynomial size with respect to $\Lambda$ . ∎

Therefore, we can solve (6) by solving (28) and recovering a mixed strategy $x\in D_{L}$ for each $y^{*}\in D_{F}$ . This algorithm generates $|D_{F}|$ instances of LP (28) and each instance can be solved in polynomial time in $\Lambda$ and $|D_{F}|$ . Note that the data size of the LP (28) is bounded by polynomial in $\Lambda$ and $|D_{F}|$ . The recovered mixed strategy $x$ has polynomial size in $\Lambda$ . By summarizing the above arguments, Proposition 6.1 is proved.

{comment}

Let $A$ be a matrix in $\{0,1\}^{U\times S_{L}}$ whose rows are all pure strategies. For notational convenience, we denote $p^{\prime}_{uv}=p_{uv}-p_{F,uv}$ for each $uv\in E$ . We denote by a fractional budget allocation $r\in[0,1]^{U}$ with $\sum_{u\in U}r_{u}\leq k_{L}$ . We remark that a fractional budget allocation is a different notion from a mixed strategy $x\in D_{L}$ ; the former is uniquely defined from the latter as $r_{u}=\sum_{S:u\in S}x_{S}\ (u\in U)$ , but the converse may not hold. We can rewrite $P_{v}$ and $P_{F,v}$ as $P_{v}(z)=p_{uv}z_{u}$ and $P_{F,v}(y)=p_{F,uv}y_{u}$ , where $u$ is the only neighbor of $v$ . Then $f$ and $g$ are simplified as

[TABLE]

The utility functions $f(x,y)$ and $g(x,y)$ are bilinear. Moreover, they depend on a fractional budget allocation $Ax\in[0,1]^{U}$ rather than $x$ . We remark that $A$ projects a mixed strategy $x$ to a fractional budget allocation $Ax\in[0,1]^{U}$ .

Lemma 6.6.

Assume that $|N_{v}|=1$ for all $v\in V$ . For each $x\in D_{L}$ and $y\in D_{F}$ , it holds that $f(x,y)=f(x^{\prime},y)$ and $g(x,y)=g(x^{\prime},y)$ for any $x\in D_{L}$ such that $Ax=Ax^{\prime}$ .

This lemma gives us an intuition that we solve (2) for a fractional budget allocation $r$ and recover a mixed strategy $x$ . In the following, we show that for any $r$ such that $r=Ax$ for some $x\in D_{L}$ we can recover $x$ in polynomial time with respect to $\Lambda$ . Let $Q=\{r\in[0,1]^{U}\mid r=Ax,1^{\top}x=1,x\geq 0\}$ .

Lemma 6.7.

For any $r^{*}\in Q$ , there exists a polynomial-time algorithm that finds a mixed strategy $x\in D_{L}$ such that $|\mathop{\rm supp}(x)|\leq n+1$ and $r^{*}=\sum_{z\in\mathop{\rm supp}(x)}x_{z}z$ .

Note that the mixed strategy $x$ in the statement of the lemma always exists by Carathéodory’s theorem. To show the lemma, we will use the following property. This is used also in the proof of Proposition 6.1.

Lemma 6.8.

For any $r\in[0,1]^{U}$ , $r\in Q$ if and only if

[TABLE]

Proofs of Lemmas 6.7 and 6.8 are found in the anonymized supplemental materialLABEL:ft:url.

We claim that the LP (11) is equivalent to

[TABLE]

Indeed, if $(x,y)$ is an optimal solution for (11), then we obtain an optimal solution $(r,y)$ for (28) by setting $r=Ax$ . Conversely, let $(r,y)$ be any optimal solution for (28). We observe that $r\in Q$ by Lemma 6.8. Thus we can construct a mixed strategy $x$ by Lemma 6.7 in polynomial time. Then by Lemma 6.6, $(x,y)$ is an optimal solution for (11).

Note that the data size of the LP (28) is bounded by polynomial in $|D_{F}|$ and $\Lambda$ . In our LP-based algorithm, $|D_{F}|$ instances of LP (28) are generated, and each instance can be solved in polynomial in $\Lambda$ and $|D_{F}|$ . The recovered mixed strategy $x$ has polynomial size in $\Lambda$ . Therefore, Proposition 6.1 follows.

7 Experiments

In this section, we evaluate the performance of the proposed approximation algorithm and the heuristic algorithm on real-world datasets. We execute the approximation algorithm Approx (the algorithm based on MWU described in Section 5.1 with $100$ iterations and $\epsilon=0.5$ ), and the heuristic algorithm Prop. (Algorithm 1 with $\ell=10$ ). We compare the above algorithms with a baseline algorithm Greedy, which greedily chooses $k_{L}$ media to maximize $\sum_{v\in V}P_{v}(z)$ . We conduct a series of experiments on Movielens [6] and Yahoo! webscope [19] datasets to examine the leader’s utility. The dataset MovieLens is constructed from MovieLens 100K Dataset111http://grouplens.org/datasets/movielens/100k/ with 100,000 ratings ( $1$ to $5$ ) to 1,700 movies by 1,000 users. From the dataset, we select top $n$ frequently rated movies and constructed a bipartite graph $G$ with $n=20$ media (movies) and $m=844$ customers (users) with $|E|=3506$ edges. The dataset Yahoo! Webscope is constructed from Yahoo! Search Marketing Advertiser Bidding Data222https://webscope.sandbox.yahoo.com/catalog.php?datatype=a, which contains a bipartite graph between 1,000 search keywords and 10,475 accounts, where each edge represents one bid to advertisement on the keyword with the bid price. From the dataset, we select top $n$ frequently bidden keywords and constructed a bipartite graph $G$ , which has $n=50$ media (keywords) and $m=447$ customers (accounts) with $|E|=871$ edges. $\mathcal{U}(a,b)$ denotes an uniform distribution with maximum and minimum values $a$ and $b$ . For the above bipartite graphs, we set each basic activation probability as $p_{uv}\in\mathcal{U}(0,0.2)$ for $uv\in E$ as in Wilder and Vorobeychik [18]. We generate two types of instances; for each edge $uv\in E$ , the activation probability $p_{F,uv}$ is drawn from a distribution ${\cal D_{F}}=\mathcal{U}(0,0.2)$ in the first type of instances, whereas that is drawn from a distribution ${\cal D_{F}}=\mathcal{U}(0.1,0.9)$ in the second type of instances that models a scenario where the follower aims to take customers away from the leader. We set the leader’s budget as $k_{L}=1,2,4$ , whereas the follower’s budget is set to be $k_{F}=2$ . The results reported in Table 2 indicate that our algorithms clearly outperform Greedy especially when the follower is eager to strip the leader of her customers; that is, when ${\cal D_{F}}=\mathcal{U}(0.1,0.9)$ .

{conference}

{fullpaper}

8 Conclusion

We formalized a new model called the Stackelberg budget allocation game with a bipartite influence model. For the general case of our model, we proposed two algorithms: an approximation algorithm which has provable guarantee and a heuristic algorithm empirically outputs a better solution. We remark that, to the best of our knowledge, our approximation algorithm is the first algorithm with a provable guarantee for the non-zero sum submodular Stackelberg game. When the utility functions are bilinear, we proposed our LP-based algorithm and showed that it runs in polynomial time when the follower’s budget is constant. We remark that in this case, we can generalize the budget constraint to a matroid constraint and show a similar result. Finally, experimental results indicate that our approximation and heuristic algorithms empirically output good quality solutions especially in the setting that the follower is a powerful competitor.

Acknowledgments

This work was partially supported by JST ERATO Grant Number JPMJER1201, Japan, and JSPS KAKENHI Grant Numbers JP17K12744, JP18J23034, JP16K16005, JP17K12646, JP17K00028 and JP18H05291, Japan.

Bibliography19

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Noga Alon, Iftah Gamzu, and Moshe Tennenholtz. Optimizing budget allocation among channels and influencers. In Proceedings of the 21st World Wide Web Conference 2012, WWW 2012 , pages 381–388, 2012.
2[2] Sanjeev Arora, Elad Hazan, and Satyen Kale. The multiplicative weights update method: a meta-algorithm and applications. Theory of Computing , 8(1):121–164, 2012.
3[3] Gerard Cornuejols, Marshall L. Fisher, and George L. Nemhauser. Location of Bank Accounts to Optimize Float: An Analytic Study of Exact and Approximate Algorithms. Management Science , 23(8):789–810, 1977.
4[4] Uriel Feige. A threshold of ln n 𝑛 n for approximating set cover. Journal of the ACM , 45(4):634–652, 1998.
5[5] Martin Grötschel, László Lovász, and Alexander Schrijver. Geometric algorithms and combinatorial optimization , volume 2. Springer Science & Business Media, 2012.
6[6] Max Harper and Joseph A． Konstan. The movielens datasets: History and context. ACM Transactions on Interactive Intelligent Systems , 5(4), 2015. doi:10.1145/2827872 . · doi ↗
7[7] Daisuke Hatano, Takuro Fukunaga, Takanori Maehara, and Ken-ichi Kawarabayashi. Lagrangian decomposition algorithm for allocating marketing channels. In Proceedings of the 29th AAAI Conference on Artificial Intelligence, AAAI 2015 , pages 1144–1150, 2015.
8[8] Dorit S. Hochbaum, editor. Approximation Algorithms for NP-hard Problems . PWS Publishing Co., 1997.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Non-zero-sum Stackelberg Budget Allocation Game for Computational Advertising

Abstract

1 Introduction

2 Related work

3 Preliminary

3.1 Bipartite influence model

3.2 Stackelberg game

Definition 3.1**.**

4 Stackelberg budget allocation game

4.1 Definition

Example 4.1**.**

Example 4.2**.**

Example 4.3**.**

Example 4.4**.**

4.2 Hardness

Theorem 4.5**.**

Proof.

5 Algorithms for non-disjoint customers

5.1 Approximation algorithm via zero-sum game

Lemma 5.1**.**

Proof.

Theorem 5.2**.**

5.2 Heuristic algorithm

6 Algorithm for disjoint customers

Proposition 6.1**.**

6.1 Multiple LP formulation

6.2 Reduced formulation

Lemma 6.2**.**

Proof.

Lemma 6.3**.**

Lemma 6.4**.**

Theorem 6.5** (Grötschel et al. [5]).**

Proof of Lemma 6.7.

Lemma 6.6**.**

Lemma 6.7**.**

Lemma 6.8**.**

7 Experiments

8 Conclusion

Acknowledgments

Definition 3.1.

Example 4.1.

Example 4.2.

Example 4.3.

Example 4.4.

Theorem 4.5.

Lemma 5.1.

Theorem 5.2.

Proposition 6.1.

Lemma 6.2.

Lemma 6.3.

Lemma 6.4.

Theorem 6.5 (Grötschel et al. [5]).

Lemma 6.6.

Lemma 6.7.

Lemma 6.8.