Submodular Optimization Problems and Greedy Strategies: A Survey

Yajing Liu; Edwin K. P. Chong; Ali Pezeshki; Zhenliang Zhang

arXiv:1905.03308·math.OC·May 10, 2019·Discret. Event Dyn. Syst.


TL;DR

This survey reviews the effectiveness of greedy algorithms in submodular optimization problems, focusing on performance bounds and improvements for set and string submodular functions.

Contribution

It provides a comprehensive overview of performance bounds for greedy strategies in submodular optimization, including recent improvements and bounds related to curvature and Nash equilibria.

Findings

1. Performance bounds for greedy strategies are well-established.

2. Improved bounds are available considering curvature of the objective.

3. Batched greedy strategies and Nash equilibrium bounds are also analyzed.

Abstract

The greedy strategy is an approximation algorithm to solve optimization problems arising in decision making with multiple actions. How good is the greedy strategy compared to the optimal solution? In this survey, we mainly consider two classes of optimization problems where the objective function is submodular. The first is set submodular optimization, which is to choose a set of actions to optimize a set submodular objective function, and the second is string submodular optimization, which is to choose an ordered set of actions to optimize a string submodular function. Our emphasis here is on performance bounds for the greedy strategy in submodular optimization problems. Specifically, we review performance bounds for the greedy strategy, more general and improved bounds in terms of curvature, performance bounds for the batched greedy strategy, and performance bounds for Nash equilibria.

Equations

$$f(A)+f(B)=f(A\cup B)+f(A\cap B).$$

$$f(S)-f(\emptyset)=\sum_{s\in S}\bigl(f(\{s\})-f(\emptyset)\bigr).$$

$$f(S)=\omega(\emptyset)+\sum_{s\in S}\omega(s)$$

$$\text{maximize}\ \ f(M)\quad\text{subject to}\ \ M\in\mathcal{I},$$

$$\text{lower rank of }S=\text{lr}(S)=\min\{|B|:B\text{ is a basis of }S\},$$

$$\text{upper rank of }S=\text{ur}(S)=\max\{|B|:B\text{ is a basis of }S\}.$$

$$q(X,\mathcal{I})=\min\left\{\frac{\text{lr}(S)}{\text{ur}(S)}:S\subseteq X\text{ and }\text{ur}(S)>0\right\}$$

$$O\in\mathop{\mathrm{argmax}}_{M\in\mathcal{I}}f(M),$$

$$\frac{f(G)}{f(O)}\geq q(X,\mathcal{I}),$$

$$\mathcal{I}=\{\ldots,\{w,x\},\{u,v,w\},\{u,v,x\},\{u,w,x\},\{v,w,x\},\{u,v,w,x\}\}.$$

$$\frac{f(G)}{f(O)}\geq\frac{1}{1+p}.$$

$$\frac{f(G)-f(\emptyset)}{f(O)-f(\emptyset)}\geq\frac{1}{1+p}.$$

$$\frac{f(G)}{f(O)}\geq 1-\left(1-\frac{1}{K}\right)^{K}>1-\frac{1}{e},$$

$$f(\{a_{1},\ldots,a_{k}\})=\frac{1}{n}\sum_{i=1}^{n}\left(1-\prod_{j=1}^{k}\bigl(1-p_{i}(a_{j})\bigr)\right).$$

$$y_{i}=B_{i}x+w_{i},$$

$$f(\{B_{1},\ldots,B_{k}\})=H_{0}-H_{k}.$$

$$H_{k}=\frac{1}{2}\log\det(P_{k})+\frac{N}{2}\log(2\pi e),$$

$$P_{j}=\left(P_{j-1}^{-1}+\frac{1}{\sigma^{2}}B_{j}^{T}B_{j}\right)^{-1}$$

$$c(f):=\max_{j\in X:\,\varrho_{j}(\emptyset)\neq 0}\left\{1-\frac{\varrho_{j}(X\setminus\{j\})}{\varrho_{j}(\emptyset)}\right\}.$$

$$c(f):=\max_{j\in X:\,f(\{j\})\neq f(\emptyset)}\left\{\frac{(f(\{j\})-f(\emptyset))-(f(X)-f(X\setminus\{j\}))}{f(\{j\})-f(\emptyset)}\right\}.$$

$$\frac{f(G_{K})}{f(O)}\geq\frac{1}{c}\left[1-\left(1-\frac{c}{K}\right)^{k}\right],$$

$$\frac{f(G_{K})}{f(O)}\geq\frac{1}{1+c}.$$

$$\frac{f(G_{K})}{f(O)}\geq\frac{1}{c}\left[1-\left(1-\frac{c}{K}\right)^{K}\right]>\frac{1-e^{-c}}{c}.$$

$$f(\{a_{1},\ldots,a_{k}\})=1-\prod_{j=1}^{k}\bigl(1-p(a_{j})\bigr),$$

$$0<p(a_{[1]})\leq p(a_{[2]})\leq\cdots\leq p(a_{[N]})\leq 1.$$

$$c=\max_{j\in X}\left\{1-\frac{f(X)-f(X\setminus\{j\})}{f(\{j\})-f(\emptyset)}\right\}=1-\prod_{l=2}^{N}\bigl(1-p(a_{[l]})\bigr)<1,$$

$$f(B)-f(B\setminus\{a\})\geq f(B^{*})-f(A\setminus\{a\}),$$

$$g(A)=g(B^{*})+d_{A},$$

$$0\leq d_{A}\leq\min_{\substack{B:\,B\subset A,\ |B|=k\\ a:\,a\in B}}\left\{f(B)-f(B^{*})+f(A\setminus\{a\})-f(B\setminus\{a\})\right\}.$$

$$\frac{f(G_{K})}{f(O)}\geq\frac{1}{1+d},$$


Taxonomy

Topics: Complexity and Algorithms in Graphs · Optimization and Search Problems · Advanced Graph Theory Research

Full text

Yajing Liu (1), Edwin K. P. Chong (2), Ali Pezeshki (2), Zhenliang Zhang (3)

(1) National Renewable Energy Laboratory (NREL), Golden, CO

(2) Department of Electrical and Computer Engineering, and Department of Mathematics, Colorado State University, Fort Collins, CO

(3) Alibaba iDST, Seattle, WA


Keywords: Curvature, greedy strategy, Nash equilibrium, optimization, performance, submodular

1 Introduction

We are often faced with choosing a set of actions from a ground set of actions to optimize an objective function. Such problems arise in a multitude of applications of interest to discrete-event dynamic system researchers. A specific example is the task assignment problem (Streeter and Golovin 2008; Zhang et al. 2016; Liu et al. 2018d), one of the fundamental combinatorial optimization problems in the study of optimization or operations research. This problem involves a number of agents and a number of tasks. Each agent successfully accomplishes a task with a certain probability and the aim is to assign the available tasks to a given number of agents such that the probability of accomplishing the tasks is maximized.

When the number of agents is relatively small, we can use brute-force search (Paar and Pelzl 2010) to enumerate all possible candidate solutions to find the optimal solution. However, when the number of agents is large, it is impractical to enumerate all the possible candidate solutions. At this point, we have to resort to approximation methods. One of the most well-studied approximation methods is the greedy strategy (Nemhauser et al. 1978), which starts with the empty set and iteratively adds to the current solution set an element that results in the largest gain in the objective function while satisfying the constraints. The greedy strategy yields an approximation to an optimal solution in a reasonable amount of time. The downside is that there is often no theoretical guarantee for the greedy strategy. But when the problem has a special property called submodularity, the greedy strategy is provably guaranteed to produce a solution with an objective value at least a constant scalar times the optimum value. Celebrated results by Fisher et al. (1978) and Nemhauser et al. (1978) prove that when the objective function $f$ is a monotone submodular set function with $f(\emptyset)=0$, the greedy strategy yields a $1/2$-approximation for a general matroid and a $(1-e^{-1})$-approximation for a uniform matroid. (The term $\beta$-approximation means that $f(G)/f(O)\geq\beta$, where $G$ and $O$ denote a greedy solution and an optimal solution, respectively.)

For set optimization problems, the objective function is not influenced by the order of actions. However, a great number of problems in engineering and applied science aim to optimally choose a string (finite sequence) of actions over a finite horizon to maximize an objective function whose value depends on the order of actions. The problem arises in sequential decision making in engineering, economics, management science, and medicine. A motivating example is the problem of scheduling sensors to detect targets (Li et al. 2009). Suppose that a given number of sensors are distributed in a sensor field to detect a certain number of targets. The goal is to activate sensors sequentially to maximize the total coverage area. If the coverage region of each sensor remains constant over time, the total coverage area is not influenced by the order of the sensors activated, and the problem becomes a set optimization problem. However, if the sensors are moving, then the total coverage area depends on the order of the sensors activated, which makes the problem fall into the framework of string optimization problems. The optimal solution to a string optimization problem is characterized by dynamic programming via Bellman’s principle (Powell 2007). However, the approach suffers from the curse of dimensionality and is therefore impractical for many problems of interest. This motivates the study of approximation algorithms, among which the greedy strategy is easy to implement and has guaranteed performance bounds under certain conditions. For example, Streeter and Golovin (2008) prove that when the objective function is prefix and postfix monotone and has the diminishing-return property (as defined later in the paper), the greedy strategy yields a $(1-e^{-1})$ -approximation.

In this paper, we review the performance guarantees for greedy strategies in submodular maximization problems. The paper is organized as follows. In Section 2, we review results that are related to choosing sets of actions. This involves introducing set functions, the set optimization problem, performance bounds for the greedy strategy, examples, curvature, improved bounds, batched actions, and noncooperative games. In Section 3, we review results related to choosing strings of actions. This involves introducing new notation and terminology, the string optimization problem, performance of the greedy strategy, and applications. In Section 4, we conclude by listing a number of related papers that consider extensions and/or variations of greedy strategies and their performance bounds in combinatorial optimization problems.

2 Sets of Actions

In this section, we first introduce our notation for sets, properties of set functions, and set optimization problems. Then, we review various performance bounds for the greedy strategy.

2.1 Set Functions

Before we introduce functions defined on sets, we would like to introduce some similar and familiar properties for functions defined on real numbers. Consider a real function $f:\mathbb{R}\rightarrow\mathbb{R}$ . The function $f$ is said to be monotone and submodular if it satisfies properties 1 and 2 below, respectively:

1. Monotone: $\forall x\leq y\in\mathbb{R}$, $f(x)\leq f(y)$.

2. Submodular: $\forall x\leq y\in\mathbb{R}$, $\forall z\geq 0$, $f(x+z)-f(x)\geq f(y+z)-f(y)$.

The ‘monotone’ property here simply means being ‘nondecreasing’. The function in Fig. 1 satisfies the monotone property. From Fig. 1, we can see that the function is a concave function – adding $z$ to $x$ gains more than adding $z$ to $y$ , which tells us that the additional value accrued by adding a number to a smaller number is larger than adding it to a bigger number. This is consistent with the inequality $f(x+z)-f(x)\geq f(y+z)-f(y)$ for $x\leq y$ , so we say that ‘submodularity’ here boils down to ‘concavity’ in some sense.

In this paper, we want to go beyond the real line to a more general setting. Specifically, we will consider objective functions with multiple decision “actions” as arguments. The first setting is sets of actions, and the second one is strings (ordered sets) of actions. We introduce functions defined on sets first.

Let $X$ denote a ground set, which includes all possible actions. Let $2^{X}$ denote the power set of $X$ , which includes all possible subsets of $X$ . The size or cardinality of a set $S\in 2^{X}$ is denoted by $|S|$ , and the empty set is denoted by $\emptyset$ . Define a set function $f$ : $2^{X}\longrightarrow\mathbb{R}$ . The set function $f$ is said to be monotone and submodular if it satisfies properties i and ii below, respectively:

i. Monotone: $\forall A\subseteq B\subseteq X$, $f(A)\leq f(B)$.

ii. Submodular: $\forall A\subseteq B\subseteq X$ and $\forall j\in X\setminus B$, $f(A\cup\{j\})-f(A)\geq f(B\cup\{j\})-f(B)$.

Notice the similarity between these properties and those involving functions on the real line introduced earlier.

For convenience, we denote the incremental value of adding a set $T$ to the set $A\subseteq X$ as $\varrho_{T}(A)=f(A\cup T)-f(A)$ (following the notation in Conforti and Cornuéjols (1984)).

A set function $f$ : $2^{X}\longrightarrow\mathbb{R}$ is called a polymatroid set function (Boros et al. 2003) if it is monotone, submodular, and $f(\emptyset)=0$ . Submodularity in property ii means that the additional value accruing from an extra action decreases as the size of the input set increases, and is also called the diminishing-return property in economics. Submodularity has many equivalent definitions; for example, $f:2^{X}\longrightarrow\mathbb{R}$ is submodular if $\forall A,B\subseteq X$ , $f(A)+f(B)\geq f(A\cup B)+f(A\cap B)$ . For more equivalent definitions, see Nemhauser et al. (1978).

The set function $f$ is called supermodular if $-f$ is submodular. Moreover, $f$ is called modular if it is both submodular and supermodular, i.e., for any $A,B\subseteq X$,

$$f(A)+f(B)=f(A\cup B)+f(A\cap B).\tag{1}$$

By induction, (1) implies that for any $S\subseteq X$ ,

$$f(S)-f(\emptyset)=\sum_{s\in S}\bigl(f(\{s\})-f(\emptyset)\bigr).\tag{2}$$

By (2), $f-f(\emptyset)$ is additive when $f$ is modular. If $f(\emptyset)=0$ , then $f(S)=\sum_{s\in S}f(\{s\})$ , which implies that $f$ is additive. It is also easy to check that $f$ is modular iff for any subset $S\subseteq X$ ,

$$f(S)=\omega(\emptyset)+\sum_{s\in S}\omega(s)$$

for some weight function $\omega:X\rightarrow\mathbb{R}$ (Krause and Golovin 2012).

There are many non-trivial examples of submodular or supermodular set functions. We only consider submodular maximization problems in this paper, so we only give submodular function examples. For supermodular examples, see Lovász (1983). To easily understand submodularity, we provide a simple example as follows.

Example 1

Sensor Coverage. Let $X$ be a family of locations in space where we can place sensors. If a sensor is placed at a particular location in space, it covers a circular area around it as illustrated in Fig. 2. Let $f(S)$ denote the total area covered if we place sensors at locations $S\subseteq X$ (see Fig. 2). The set function $f$ is submodular. An instance of submodularity is illustrated in the figure. As can be seen, the gain in adding sensor $3$ after placing sensor $1$ is larger than the gain in adding sensor $3$ after placing sensors $1,2$ . ∎
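The two set-function properties can be checked by brute force on a small ground set. The following is a minimal sketch, not taken from the paper: it replaces the circular coverage areas of Example 1 with hypothetical discrete footprints (the `FOOTPRINT` table of grid cells is an assumption made for illustration), and the helper names `is_monotone` and `is_submodular` are likewise invented here.

```python
from itertools import combinations

def powerset(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = list(ground)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def is_monotone(f, ground):
    """Property i, checked via single-element additions (sufficient by induction)."""
    return all(f(A) <= f(A | {j})
               for A in powerset(ground) for j in ground - A)

def is_submodular(f, ground):
    """Property ii: the gain of adding j never grows when the base set grows."""
    subsets = list(powerset(ground))
    for A in subsets:
        for B in subsets:
            if A <= B:
                for j in ground - B:
                    if f(A | {j}) - f(A) < f(B | {j}) - f(B):
                        return False
    return True

# Hypothetical discretized footprints: each sensor covers a few grid cells.
FOOTPRINT = {1: {"a", "b", "c"}, 2: {"c", "d"}, 3: {"d", "e", "f"}}

def coverage(S):
    """Number of distinct cells covered by the sensors in S."""
    covered = set()
    for s in S:
        covered |= FOOTPRINT[s]
    return len(covered)

GROUND = frozenset(FOOTPRINT)
gain_after_1 = coverage({1, 3}) - coverage({1})          # 6 - 3 = 3
gain_after_12 = coverage({1, 2, 3}) - coverage({1, 2})   # 6 - 4 = 2
```

In this toy instance, adding sensor 3 after sensor 1 gains three cells, while adding it after sensors 1 and 2 gains only two, mirroring the diminishing-return behaviour described in the example.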

Submodular functions arise in many applications. Examples include the rank of a matrix viewed as a function of the chosen subset of its columns, weighted coverage functions, the rank function of a matroid, Shannon entropy, mutual information, cut capacity functions, and certain measurements on graphs (Lovász 1983; Krause and Golovin 2012).

2.2 Submodular Set Optimization Problem

Submodular set optimization plays an important role in combinatorial optimization. It has a wide range of applications, including generalized assignment (Shmoys and Tardos 1993; Cohen et al. 2006; Nauss 2003; Fleischer et al. 2006; Bator 1957; Korula et al. 2015; Vondrák 2008), matroid partition (Edmonds and Fulkerson 1965; Cunningham 1986; Knuth 1973), maximum cut (Goemans and Williamson 1995; Sahni and Gonzalez 1976), maximum coverage location (Church and Velle 1974; Khuller et al. 1999; Cornuéjols et al. 1977), multi-agent coverage problem (Sun et al. 2017), leader-selection problem in multi-agent systems (Clark and Poovendran 2011), welfare maximization (Korula et al. 2015; Vondrák 2008; Kapralov et al. 2013), and data summarization (Lin and Bilmes 2011; Badanidiyuru et al. 2014; Mirzasoleiman et al. 2017). The aim is to find a set of actions satisfying some constraints to maximize the objective function. The set optimization problem can be formulated as follows:

$$\text{maximize}\ \ f(M)\quad\text{subject to}\ \ M\in\mathcal{I},\tag{5}$$

where $\mathcal{I}$ is a non-empty collection of subsets of a finite set $X$ , and $f$ is a real-valued submodular set function defined on the power set $2^{X}$ of $X$ . Before proceeding any further with discussing optimization problem (5), we will need to introduce some concepts related to the constraint set $\mathcal{I}$ .

Let $X$ be a finite set, and $\mathcal{I}$ be a non-empty collection of subsets of $X$ . The collection $\mathcal{I}$ is said to be hereditary if it satisfies property i below and has the augmentation property if it satisfies property ii below:

i. Hereditary: For all $B\in\mathcal{I}$, any set $A\subseteq B$ is also in $\mathcal{I}$.

ii. Augmentation: For any $A,B\in\mathcal{I}$, if $|B|>|A|$, then there exists $j\in B\setminus A$ such that $A\cup\{j\}\in\mathcal{I}$.

The pair $(X,\mathcal{I})$ is called an independence system if it satisfies property i. In this case, the sets in $\mathcal{I}$ are called independent sets. A maximal independent set is an independent set that is not a subset of any other independent set (Conforti and Cornuéjols 1984). The independence system $(X,\mathcal{I})$ is called a matroid if it satisfies property ii (Edmonds 1970). The pair $(X,\mathcal{I})$ is called a uniform matroid if $\mathcal{I}=\{S\subseteq X:|S|\leq K\}$ for a given $K$ (Nemhauser et al. 1978). All maximal independent sets in a matroid have the same cardinality. We call this cardinality the rank of the matroid. In the uniform matroid above, the rank is $K$ .

Example 2

We now give three example collections to illustrate the notions of independence systems and matroids. Let $X=\{a,b,c\}$, $\mathcal{I}_{1}=\{\{a\},\{b\},\{a,c\},\{c\},\emptyset\}$, $\mathcal{I}_{2}=\{\{a\},\{a,b\}\}$, and $\mathcal{I}_{3}=\{\emptyset,\{a\},\{b\},\{a,b\}\}$. It is easy to check that $\mathcal{I}_{1}$ satisfies the hereditary property but not augmentation, $\mathcal{I}_{2}$ satisfies augmentation but not the hereditary property, and $\mathcal{I}_{3}$ satisfies both hereditary and augmentation properties. Hence, $(X,\mathcal{I}_{1})$ is an independence system, $(X,\mathcal{I}_{3})$ is a matroid, and $(X,\mathcal{I}_{2})$ is neither an independence system nor a matroid. The maximal independent sets in $(X,\mathcal{I}_{1})$ are $\{b\}$ and $\{a,c\}$, and $(X,\mathcal{I}_{3})$ only has one maximal independent set $\{a,b\}$. ∎
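The claims in Example 2 are small enough to verify mechanically. The sketch below is illustrative only; the function names `is_hereditary`, `has_augmentation`, and `fam` are assumptions introduced here, and each collection is encoded as a set of frozensets.

```python
from itertools import combinations

def is_hereditary(family):
    """Property i: every subset of a member is also a member."""
    return all(frozenset(sub) in family
               for B in family
               for r in range(len(B) + 1)
               for sub in combinations(B, r))

def has_augmentation(family):
    """Property ii: any larger member can extend any smaller member by one element."""
    return all(any(A | {j} in family for j in B - A)
               for A in family for B in family if len(B) > len(A))

def fam(*sets):
    """Build a collection from strings, e.g. "ac" stands for {a, c} and "" for the empty set."""
    return frozenset(frozenset(s) for s in sets)

I1 = fam("a", "b", "ac", "c", "")
I2 = fam("a", "ab")
I3 = fam("", "a", "b", "ab")

results = {name: (is_hereditary(I), has_augmentation(I))
           for name, I in [("I1", I1), ("I2", I2), ("I3", I3)]}
```

Here `results` maps each collection to a pair (hereditary, augmentation); only the collection with both entries true is a matroid.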

Let $(X,\mathcal{I})$ be an independence system where $\mathcal{I}$ is nonempty, and let $S\subseteq X$ be an arbitrary subset of $X$ . A basis of $S$ is a subset $B$ of $S$ that satisfies the following two conditions:

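i. It is an independent set; i.e., $B\in\mathcal{I}$.

ii. It is maximal; i.e., $B$ is not a subset of any other independent subset of $S$.

A subset $B$ satisfying the above two conditions is also called a maximal independent subset of $S$. Define

$$\text{lower rank of }S=\text{lr}(S)=\min\{|B|:B\text{ is a basis of }S\},$$

$$\text{upper rank of }S=\text{ur}(S)=\max\{|B|:B\text{ is a basis of }S\}.$$

Note that $\text{lr}(S)$ and $\text{ur}(S)$ might not be well defined, depending on $S$. Note also that in the definition above, $S$ is not necessarily in $\mathcal{I}$. The number

$$q(X,\mathcal{I})=\min\left\{\frac{\text{lr}(S)}{\text{ur}(S)}:S\subseteq X\text{ and }\text{ur}(S)>0\right\}$$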

is called the rank quotient of $(X,\mathcal{I})$ (Hausmann et al. 1980).

Example 3

To illustrate the concept of rank quotient, again consider the independence system $(X,\mathcal{I}_{1})$ given in Example 2. We now consider all the subsets of $X$ and calculate their lower and upper ranks. If $S$ is a singleton (i.e., $\{a\}$ , $\{b\}$ , or $\{c\}$ ), then $S$ has only one basis, which is $S$ itself. In this case, $\text{lr}(S)=\text{ur}(S)=1$ , which means that $\text{lr}(S)/\text{ur}(S)=1$ .

If $S=\{a,b\}$ , its bases are $\{a\}$ and $\{b\}$ . Again, $\text{lr}(S)=\text{ur}(S)=1$ , which means that $\text{lr}(S)/\text{ur}(S)=1$ . Note that $\{a,b\}$ is not a basis of $S$ because it does not belong to $\mathcal{I}_{1}$ . If $S=\{a,c\}$ , it has only one basis, which is itself, and again $\text{lr}(S)/\text{ur}(S)=1$ . If $S=\{b,c\}$ , its bases are $\{b\}$ and $\{c\}$ , in which case $\text{lr}(S)/\text{ur}(S)=1$ again.

If $S=\{a,b,c\}=X$ , the bases are $\{b\}$ and $\{a,c\}$ . So, $\text{lr}(S)=1$ and $\text{ur}(S)=2$ , which implies that $\text{lr}(S)/\text{ur}(S)=1/2$ .

Because the rank quotient is the smallest among the ratios calculated above, we deduce that $q(X,\mathcal{I}_{1})=1/2$ . ∎

Example 4

As in Example 3, we can similarly check that $q(X,\mathcal{I}_{3})=1$. In fact, the rank quotient of any matroid $(X,\mathcal{I})$ is equal to $1$, because for any subset $S\subseteq X$, $\text{lr}(S)=\text{ur}(S)$ (Edmonds 1966). The rank quotient of an independence system $(X,\mathcal{I})$ can be regarded as a measure of how much $(X,\mathcal{I})$ differs from being a matroid. ∎
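The rank-quotient computations of Examples 3 and 4 can be reproduced by enumerating the bases of every subset. This is a brute-force sketch for tiny ground sets only; the helper names `bases` and `rank_quotient` are hypothetical, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction
from itertools import combinations

def subsets(ground):
    """All subsets of the ground set, as frozensets."""
    items = sorted(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def bases(S, family):
    """Maximal independent subsets of S."""
    indep = [B for B in family if B <= S]
    return [B for B in indep if not any(B < C for C in indep)]

def rank_quotient(ground, family):
    """Minimum of lr(S)/ur(S) over all S with ur(S) > 0."""
    ratios = []
    for S in subsets(ground):
        sizes = [len(B) for B in bases(S, family)]
        if sizes and max(sizes) > 0:
            ratios.append(Fraction(min(sizes), max(sizes)))
    return min(ratios)

X = frozenset("abc")
I1 = frozenset(frozenset(s) for s in ["a", "b", "ac", "c", ""])
I3 = frozenset(frozenset(s) for s in ["", "a", "b", "ab"])

q1 = rank_quotient(X, I1)   # Fraction(1, 2), attained at S = X
q3 = rank_quotient(X, I3)   # Fraction(1, 1), as for any matroid
```

The enumeration is exponential in $|X|$, so it serves only as a check of the definitions, not as a practical method.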

For any independence system $(X,\mathcal{I})$ , if there exist matroids $(X,\mathcal{I}^{i})$ ( $1\leq i\leq p$ ) such that $\mathcal{I}=\mathcal{I}^{1}\cap\cdots\cap\mathcal{I}^{p}$ , then the pair $(X,\mathcal{I})$ is called the intersection of the matroids $(X,\mathcal{I}^{i})$ (Hausmann et al. 1980).

Finding the optimal solution to (5) in general is NP-hard. The greedy strategy provides a tractable way to approximately solve the problem: it starts with the empty set and incrementally adds to the current solution set the element giving the largest gain in the objective function under the constraints. Although the greedy strategy yields an approximate solution, its performance might be arbitrarily poor. However, when the optimization problem has the further special structure of being polymatroid, the greedy strategy has provable guarantees. The celebrated results by Fisher et al. (1978) and Nemhauser et al. (1978) show that the greedy strategy provides a good approximation to the optimal solution when the objective function is a polymatroid set function under both general matroid constraints and uniform matroid constraints. We will review the performance of the greedy strategy for (5) under different constraints in the following section.

2.3 Performance Bounds for Greedy Strategy

First we introduce definitions of the optimal strategy and the greedy strategy.

Optimal Set: Any set $O$ is called an optimal solution of Problem (5) if

$$O\in\mathop{\mathrm{argmax}}_{M\in\mathcal{I}}f(M),$$

where argmax denotes the collection of sets in $\mathcal{I}$ that maximize $f(\cdot)$.

Greedy Algorithm:

Input: A pair $(X,\mathcal{I})$ , a set function $f:2^{X}\rightarrow\mathbb{R}$

Output: A subset $G\in\mathcal{I}$

$G_{0}\leftarrow\emptyset$

For $i=1,2,\ldots$ ,

$g_{i}\leftarrow\mathop{\mathrm{argmax}}\limits_{a\in X\setminus G_{i-1}:\,G_{i-1}\cup\{a\}\in\mathcal{I}}f(G_{i-1}\cup\{a\})$. If such a $g_{i}$ exists, set $G_{i}=G_{i-1}\cup\{g_{i}\}$; otherwise, stop and set $G=G_{i-1}$.

Any output of the above algorithm is called a greedy solution. Note that there may exist more than one optimal solution or more than one greedy solution. How good is a greedy solution compared to an optimal solution in terms of the objective function? In the following theorems, we review performance bounds for the greedy strategy under different constraints. These bounds are worst-case performance bounds, so in many cases the greedy strategy performs much better than the bounds suggest.
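The greedy algorithm above translates almost line for line into code. The following is a minimal sketch under stated assumptions: the constraint is supplied as a membership oracle `is_independent` (a hypothetical interface, not from the paper), ties are broken by sorted order, and the toy instance uses made-up additive weights under a uniform matroid of rank 2.

```python
def greedy(ground, is_independent, f):
    """Greedy strategy: repeatedly add the feasible element with the largest f-value."""
    G = frozenset()
    while True:
        # Elements that keep the current set inside the constraint collection.
        feasible = [a for a in sorted(ground - G) if is_independent(G | {a})]
        if not feasible:
            return G  # no element can be added: stop
        best = max(feasible, key=lambda a: f(G | {a}))
        G = G | {best}

# Hypothetical instance: additive weights, uniform matroid of rank K = 2.
WEIGHT = {"a": 3.0, "b": 1.0, "c": 2.0}
K = 2
G = greedy(frozenset(WEIGHT),
           lambda S: len(S) <= K,
           lambda S: sum(WEIGHT[s] for s in S))
```

As in the algorithm statement, the loop stops only when no feasible element remains, so an element with zero gain is still added if it is feasible.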

Theorem 2.1

(Hausmann et al. 1980)

Let $(X,\mathcal{I})$ be an independence system. If $f$ is additive on $X$, i.e., $f(S)=\sum_{s\in S}f(\{s\})$ for any subset $S\subseteq X$, then any greedy solution $G$ satisfies

$$\frac{f(G)}{f(O)}\geq q(X,\mathcal{I}),\tag{7}$$

where $q(X,\mathcal{I})$ is the rank quotient defined in Section 2.2. Furthermore, for some function $f$ , (7) holds with equality.

Remark 1

When $(X,\mathcal{I})$ is a matroid, $q(X,\mathcal{I})=1$ . By Theorem 2.1, the greedy strategy is optimal when $(X,\mathcal{I})$ is a matroid and the objective function is additive.

Remark 2

When $(X,\mathcal{I})$ is the intersection of $p$ matroids, then $q(X,\mathcal{I})\geq 1/p$ (Hausmann et al. 1980). So when $p=1$ , i.e., $(X,\mathcal{I})$ is a matroid, the greedy strategy is optimal, which is consistent with Remark 1.

Example 5

We provide an example (we thank the anonymous reviewer for this example) to demonstrate the performance bound in Theorem 2.1. Let $X=\{s,t,u,v,w,x\}$, and consider the collection of subsets

$$\mathcal{I}=\{\ldots,\{w,x\},\{u,v,w\},\{u,v,x\},\{u,w,x\},\{v,w,x\},\{u,v,w,x\}\}.$$

Define a function $f$ such that $f(A)=\sum_{a\in A}f(\{a\})$ . Let $f(\{s\})=1.01,f(\{u\})=f(\{v\})=f(\{w\})=f(\{x\})=1$ , and $f(\{t\})=0$ .

It is easy to check that $(X,\mathcal{I})$ is an independence system. If $S=\{s,u,v,w,x\}$, it has bases $\{s\}$ and $\{u,v,w,x\}$, which results in $\text{lr}(S)/\text{ur}(S)=1/4$. Because the maximum cardinality of the maximal independent subsets of any $S\subseteq X$ is 4, $\text{lr}(S)/\text{ur}(S)\geq 1/4$ for any set $S\subseteq X$ with $\text{ur}(S)>0$. Therefore, $q(X,\mathcal{I})=1/4$. The greedy solution is $G=\{s,t\}$ with $f(G)=1.01$ and the optimal solution is $O=\{u,v,w,x\}$ with $f(O)=4$, which satisfy the bound $f(G)/f(O)\geq q(X,\mathcal{I})$. In fact, the bound holds with equality if we lower $f(\{s\})$ to exactly $1$. ∎
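The numbers in Example 5 can be reproduced in a few lines. Because the full collection $\mathcal{I}$ is not reproduced in this text, the sketch below assumes a hypothetical collection that is consistent with the stated facts: a set is taken to be independent when it is contained in $\{s,t\}$ or in $\{u,v,w,x\}$. This choice (the `BLOCKS` list) is an assumption for illustration and may differ from the collection used in the paper.

```python
from itertools import combinations

VALUE = {"s": 1.01, "t": 0.0, "u": 1.0, "v": 1.0, "w": 1.0, "x": 1.0}
X = frozenset(VALUE)
BLOCKS = [frozenset("st"), frozenset("uvwx")]   # assumed structure, see lead-in

def is_independent(S):
    """Assumed constraint: S lies entirely inside one of the two blocks."""
    return any(S <= block for block in BLOCKS)

def f(S):
    """Additive objective from Example 5."""
    return sum(VALUE[a] for a in S)

def greedy():
    G = frozenset()
    while True:
        feasible = [a for a in sorted(X - G) if is_independent(G | {a})]
        if not feasible:
            return G
        G = G | {max(feasible, key=lambda a: f(G | {a}))}

# Brute-force optimum over all independent sets (fine for six elements).
independent_sets = [frozenset(c) for r in range(len(X) + 1)
                    for c in combinations(sorted(X), r)
                    if is_independent(frozenset(c))]
G = greedy()
O = max(independent_sets, key=f)
ratio = f(G) / f(O)   # 1.01 / 4, just above the rank quotient 1/4
```

Greedy is lured by the slightly larger value of $s$ and can then only add $t$, which is why the ratio sits so close to the worst-case bound.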

The following theorem bounds the performance of the greedy strategy when $(X,\mathcal{I})$ is the intersection of $p$ matroids and $f$ is a polymatroid set function.

Theorem 2.2

(Fisher et al. 1978)

Let $(X,\mathcal{I})$ be the intersection of $p$ matroids and $f:2^{X}\rightarrow\mathbb{R}$ a polymatroid set function. Then any greedy solution $G$ satisfies

$$\frac{f(G)}{f(O)}\geq\frac{1}{1+p}.$$

Remark 3

The condition that $f$ is additive in Theorem 2.1 is stronger than the condition that $f$ is a polymatroid set function in Theorem 2.2, so the bound $1/p$ in Theorem 2.1 is stronger than the bound $1/(1+p)$ in Theorem 2.2.

Remark 4

The bound $1/(1+p)$ can be achieved for any positive integer $p$. When $p=1$, $(X,\mathcal{I})$ is a matroid, and the bound becomes $1/2$, which means that the greedy strategy yields a $1/2$-approximation for general matroid constraints.

Remark 5

Theorem 2.2 requires that $f(\emptyset)=0$. If $f(\emptyset)\neq 0$, the following performance bound holds:

$$\frac{f(G)-f(\emptyset)}{f(O)-f(\emptyset)}\geq\frac{1}{1+p}.$$

The following theorem provides a performance bound for the greedy strategy when $(X,\mathcal{I})$ is a uniform matroid and $f$ is a polymatroid set function.

Theorem 2.3

(Nemhauser et al. 1978)

Let $(X,\mathcal{I})$ be a uniform matroid and $f:2^{X}\rightarrow\mathbb{R}$ a polymatroid set function. Then any greedy solution $G_{K}$ satisfies

$$\frac{f(G_{K})}{f(O)}\geq 1-\left(1-\frac{1}{K}\right)^{K}>1-\frac{1}{e},$$

where $K$ is the rank of the uniform matroid and $e$ is the base of the natural logarithm.

Remark 6

The bound $1-(1-1/K)^{K}$ is stronger than the bound $1/(1+p)$ when $p=1$ in Theorem 2.2, because the uniform matroid is a special matroid.

Remark 7

The bound $1-(1-1/K)^{K}$ is decreasing in $K$ and tends to $1-1/e$ when $K$ goes to infinity. When $K=1$ , the bound becomes 1, which is consistent with the fact that the greedy strategy chooses the best action at each stage.
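The behaviour described in Remark 7 is easy to tabulate. This short sketch only evaluates the formula from Theorem 2.3; the function name `uniform_bound` and the range of $K$ are arbitrary choices made here.

```python
import math

def uniform_bound(K):
    """Greedy guarantee 1 - (1 - 1/K)^K for a uniform matroid of rank K."""
    return 1.0 - (1.0 - 1.0 / K) ** K

bounds = [uniform_bound(K) for K in range(1, 51)]   # K = 1, ..., 50
limit = 1.0 - 1.0 / math.e                          # about 0.632
```

The list starts at 1 for $K=1$, equals $0.75$ for $K=2$, and decreases toward `limit` while staying strictly above it.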

Remark 8

The bound $1-(1-1/K)^{K}$ is tight, which means that it can be achieved for each $K$ (Nemhauser et al. 1978).

Remark 9

By Theorem 2.2, the greedy strategy only achieves a $1/2$ -approximation under general matroid constraints. However, Calinescu et al. (2011) proved that a variant of the greedy strategy yields a $(1-1/e)$ -approximation under general matroid constraints.

2.4 Examples

We introduce two examples, a task assignment problem and an adaptive sensing problem, to illustrate polymatroid set functions. In both problems, $(X,\mathcal{I})$ is a uniform matroid and hence the greedy strategy yields a $(1-e^{-1})$-approximation.

Task Assignment Problem: The task assignment problem was posed by Streeter and Golovin (2008), and was also analyzed in Zhang et al. (2016) and Liu et al. (2018d). In this problem, there are $n$ subtasks and a set $X$ of $N$ agents. At each stage, a subtask $i$ is assigned to an agent $a$, who accomplishes the task with probability $p_{i}(a)$. Let $X_{i}(\{a_{1},a_{2},\ldots,a_{k}\})$ denote the Bernoulli random variable that signifies whether or not subtask $i$ has been accomplished after assigning the set of agents $\{a_{1},a_{2},\ldots,a_{k}\}$ over $k$ stages. Then $\frac{1}{n}\sum_{i=1}^{n}X_{i}(\{a_{1},a_{2},\ldots,a_{k}\})$ is the fraction of subtasks accomplished after $k$ stages by employing agents $\{a_{1},a_{2},\ldots,a_{k}\}$. The objective function $f$ for this problem is the expected value of this fraction, which can be written as

$$f(\{a_{1},\ldots,a_{k}\})=\frac{1}{n}\sum_{i=1}^{n}\left(1-\prod_{j=1}^{k}\bigl(1-p_{i}(a_{j})\bigr)\right).$$

The aim is to choose a set of agents to maximize this objective function.

Assume that $p_{i}(a)>0$ for any $a\in X$ . Then it is easy to check that $f$ is monotone, submodular, and $f(\emptyset)=0$ , which implies that $f$ is a polymatroid set function.
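To make the task assignment objective concrete, the sketch below evaluates $f$ on a small instance, runs the greedy strategy under a uniform matroid constraint of rank $K$, and compares it with a brute-force optimum. The probability table `P`, the agent names, and the choice $K=2$ are all hypothetical values chosen for illustration.

```python
from itertools import combinations

# Hypothetical success probabilities p_i(a): one row per agent, one column per subtask.
P = {"a1": [0.9, 0.1, 0.1],
     "a2": [0.1, 0.8, 0.2],
     "a3": [0.5, 0.5, 0.5],
     "a4": [0.2, 0.2, 0.7]}
N_SUBTASKS = 3

def f(S):
    """Expected fraction of subtasks accomplished by the agents in S."""
    total = 0.0
    for i in range(N_SUBTASKS):
        miss = 1.0                      # probability that subtask i is missed by all
        for a in S:
            miss *= 1.0 - P[a][i]
        total += 1.0 - miss
    return total / N_SUBTASKS

def greedy(K):
    """Pick K agents, each time adding the one with the largest resulting f."""
    G = []
    for _ in range(K):
        rest = [a for a in sorted(P) if a not in G]
        G.append(max(rest, key=lambda a: f(G + [a])))
    return G

K = 2
G = greedy(K)
# f is monotone, so an optimal set can be found among the sets of size exactly K.
O = max(combinations(sorted(P), K), key=f)
ratio = f(G) / f(O)
bound = 1.0 - (1.0 - 1.0 / K) ** K      # guarantee from Theorem 2.3
```

On any such instance `ratio` should lie between `bound` and 1; on many instances it is far closer to 1, which illustrates that the guarantee is a worst-case statement.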

Adaptive Sensing: As our second example application, we consider the adaptive sensing design problem posed in Zhang et al. (2016) and Liu et al. (2018d). Consider a signal of interest $x\in\mathbb{R}^{2}$ with normal prior distribution $\mathcal{N}(0,I)$, where $I$ is the $2\times 2$ identity matrix; our analysis easily generalizes to dimensions larger than $2$. Let $\mathbb{B}=\{\mathrm{Diag}(\sqrt{b},\sqrt{1-b}):b\in\{b_{1},\ldots,b_{N}\}\}$, where $b_{i}\in[0,1]$ for $1\leq i\leq N$. At each stage $i$, we make a measurement $y_{i}$ of the form

$$y_{i}=B_{i}x+w_{i},$$

where $B_{i}\in\mathbb{B}$ and $w_{i}$ represents i.i.d. Gaussian measurement noise with mean zero and covariance $\sigma^{2}I$ , independent of $x$ .

The objective function $f$ for this problem is the information gain, which can be written as

$f(\{B_{1},\ldots,B_{k}\})=H_{0}-H_{k}.$

Here, $H_{0}=\log(2\pi e)$ is the entropy of the prior distribution of $x$ (for a standard normal prior in dimension $d$ the entropy is $\frac{d}{2}\log(2\pi e)$, and here $d=2$), and $H_{k}$ is the entropy of the posterior distribution of $x$ given $\{y_{i}\}_{i=1}^{k}$; that is,

[TABLE]

where for $1\leq j\leq k$

[TABLE]

is the posterior covariance of $x$ given $\{y_{i}\}_{i=1}^{j}$ . The objective is to choose a set of measurements to maximize the information gain $f(\{B_{1},\ldots,B_{K}\})=H_{0}-H_{K}$ .

It is easy to check that $f$ is monotone, submodular, and $f(\emptyset)=0$ ; i.e., $f$ is a polymatroid set function.
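Because the prior covariance is the identity and every $B_{i}$ is diagonal, the posterior covariance stays diagonal and the information gain can be computed in closed form. The sketch below relies on the standard Gaussian linear-model update, which is brought in here as an assumption rather than quoted from the text: a measurement with parameter $b$ adds $b/\sigma^{2}$ and $(1-b)/\sigma^{2}$ to the two diagonal entries of the posterior precision matrix. The function name and the sample values of $b$ are illustrative.

```python
import math


def information_gain(bs, sigma=1.0):
    """H_0 - H_k for measurements Diag(sqrt(b), sqrt(1-b)), b in bs, prior N(0, I)."""
    # Diagonal entries of the posterior precision (inverse covariance) matrix.
    s = 1.0 + sum(b / sigma**2 for b in bs)
    t = 1.0 + sum((1.0 - b) / sigma**2 for b in bs)
    # The entropy reduction equals half the log-determinant of the precision matrix.
    return 0.5 * (math.log(s) + math.log(t))


gain_one = information_gain([0.3])
gain_two = information_gain([0.3, 0.9])
print(gain_one, gain_two)
```

In this form the gain depends on the chosen measurements only through the two sums, which makes monotonicity and the diminishing-return behavior easy to inspect numerically.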

2.5 Curvature

As we saw in Section 2.1, submodularity is a second-order property by analogy to concavity. If we can quantify this second-order property, then we can get tighter bounds. One way to quantify the second-order property is to use the total curvature, defined by Conforti and Cornuéjols (1984):

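$c(f)=\max_{j\in X:\,f(\{j\})\neq f(\emptyset)}\left\{1-\frac{f(X)-f(X\setminus\{j\})}{f(\{j\})-f(\emptyset)}\right\}.$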

To see that this is a second-order property, rewrite it in terms of differences of differences:

[TABLE]

For convenience, we use $c$ to denote $c(f)$ when there is no ambiguity. Note that $0\leq c\leq 1$ when $f$ is a polymatroid set function, and $c=0$ when $f$ is modular. When $f$ is modular, $f-f(\emptyset)$ is additive. If we consider $f-f(\emptyset)$ as the objective function, then the greedy strategy achieves optimality. Therefore, in the rest of the paper, when we assume that $f$ is a polymatroid set function, we only consider $c\in(0,1]$ .

Conforti and Cornuéjols (1984) provided performance bounds in terms of the total curvature for the greedy strategy under independence system, general matroid, and uniform matroid constraints, which we review next.

Theorem 2.4

(Conforti and Cornuéjols 1984)* If $(X,\mathcal{I})$ is an independence system with $\mathrm{ur}(X)=K$ and $\mathrm{lr}(X)=k$, and $f$ is a polymatroid set function with total curvature $c$, then any greedy solution $G_{K}$ satisfies*

[TABLE]

and this bound is tight for all $0<c\leq 1$ .

Theorem 2.5

(Conforti and Cornuéjols 1984)*

If $(X,\mathcal{I})$ is a matroid and $f$ is a polymatroid set function with total curvature $c$ , then any greedy solution $G_{K}$ satisfies*

[TABLE]

Moreover, if $(X,\mathcal{I})$ is a uniform matroid with rank $K$ , then any greedy solution $G_{K}$ satisfies

[TABLE]

Remark 10

When $(X,\mathcal{I})$ is a matroid, the bound $1/(1+c)$ is at least as strong as the bound $1/2$ in Theorem 2.2, because $c\in(0,1]$ when $f$ is a polymatroid set function and $1/(1+c)$ is decreasing in $c$; the two bounds coincide only when $c=1$.

Remark 11

The function $(1-e^{-c})/c$ is nonincreasing in $c$ , and therefore $(1-e^{-c})/c\in[1-e^{-1},1)$ when $f$ is a polymatroid set function. Also it is easy to check that $(1-e^{-c})/c\geq 1/(1+c)$ for $c\in(0,1]$ , which implies that the bound $(1-e^{-c})/c$ for the uniform matroid constraints is stronger than the bound $1/(1+c)$ for the general matroid constraints.

Remark 12

The two bounds in terms of the total curvature $c$ are both tight; for proofs, see Conforti and Cornuéjols (1984).

Remark 13

There are other notions of curvatures that can be used to characterize the second-order property of the set function $f$ , such as the greedy curvature defined by Conforti and Cornuéjols (1984) and the elemental curvature defined by Wang et al. (2014). Performance bounds in terms of the corresponding curvatures were also derived by Conforti and Cornuéjols (1984) and Wang et al. (2014) under different constraints.

Example: Consider again the task assignment example from Section 2.4. For convenience, we only consider the special case $n=1$ ; our analysis can be generalized to any $n\geq 2$ . For $n=1$ , we have

$f(\{a_{1},\ldots,a_{k}\})=1-\prod_{j=1}^{k}\left(1-p(a_{j})\right),$

where $p(\cdot)=p_{1}(\cdot)$ .

Let us order the elements of $X$ as $a_{[1]},a_{[2]},\ldots,a_{[N]}$ such that

[TABLE]

Then by the definition of the total curvature $c$ , we have

[TABLE]

which is consistent with our conclusion that $c\in[0,1]$ .
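The total curvature can also be evaluated directly from its definition. The sketch below does so for the single-subtask objective, assuming independent agents so that $f(A)=1-\prod_{a\in A}(1-p(a))$, and compares the result with the closed form $1-\prod_{a\neq a_{\min}}(1-p(a))$, where $a_{\min}$ is an agent with the smallest success probability. That closed form is derived here from the definition rather than quoted from the text, and the probabilities are illustrative.

```python
import math

p = [0.4, 0.6, 0.8, 0.9]  # illustrative success probabilities p(a_1), ..., p(a_N)


def f(agents):
    """Single-subtask objective: probability that at least one agent succeeds."""
    return 1.0 - math.prod(1.0 - p[a] for a in agents)


def total_curvature(f, ground):
    """max over j of 1 - (f(X) - f(X minus j)) / (f({j}) - f(empty))."""
    ground = frozenset(ground)
    worst = 0.0
    for j in ground:
        single = f({j}) - f(frozenset())
        if single == 0.0:
            continue  # elements with zero singleton gain are excluded
        last = f(ground) - f(ground - {j})
        worst = max(worst, 1.0 - last / single)
    return worst


c = total_curvature(f, range(len(p)))
# Closed form: drop the agent with the smallest success probability.
closed_form = 1.0 - math.prod(1.0 - q for q in sorted(p)[1:])
print(c, closed_form)
```

Adding more agents shrinks the remaining product, so the curvature of this objective approaches 1 as $N$ grows.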

2.6 Improved Bounds

The performance bounds of Conforti and Cornuéjols (1984), reviewed in Section 2.5, are the best bounds in terms of the total curvature $c$ for general matroid constraints and uniform matroid constraints, respectively. However, the total curvature $c$ depends on function values on sets outside the constraint matroid. If we are given a function defined only on the matroid, problem (5) still makes sense, but the bounds involving $c$ do not apply. Liu et al. (2018a, 2019) investigated modified bounds that overcome this drawback. The idea is first to extend a polymatroid set function defined on the matroid to one defined on the entire power set, and then apply the results from Conforti and Cornuéjols (1984). However, not every polymatroid function defined on the matroid can be extended to one defined on the entire power set.

Liu et al. (2019) first provide necessary and sufficient conditions for the existence of an incremental extension of a polymatroid set function defined on the uniform matroid of rank $k$ to one defined on the uniform matroid of rank $k+1$. Whenever a polymatroid objective function defined on a matroid can be extended to the entire power set, the greedy approximation bounds involving the total curvature of the extension apply. However, the bounds still depend on sets outside the matroid. Motivated by this, Liu et al. (2019) defined a new notion of curvature called partial curvature, involving only sets in the matroid. They derived necessary and sufficient conditions for an extension of the function to have a total curvature that is equal to the partial curvature. Moreover, they proved that the bounds in terms of the partial curvature are in general improved over the previous ones.

The following theorem states the necessary and sufficient conditions for the existence of an extension of a polymatroid set function defined on the uniform matroid of rank $k$ to one defined on the uniform matroid of rank $k+1$.

Theorem 2.6

(Liu et al. 2019)*

Let $f:\mathcal{I}\rightarrow\mathbb{R}$ be a polymatroid function defined on the uniform matroid of rank $k$ . Then $f$ can be extended to a polymatroid function $g$ defined on the uniform matroid of rank $k+1$ if and only if for any $A\subseteq X$ with $|A|=k+1$ , any $B\subset A$ with $|B|=k$ , and any $a\in B$ ,*

[TABLE]

where $B^{*}\in\mathop{\mathrm{argmax}}\limits_{\begin{subarray}{c}B:B\subset A,|B|=k\end{subarray}}f(B)$ .

Construction: If $f$ is extendable, then an extension $g$ can be constructed as follows: For any $A$ with $|A|\leq k$ , $g(A)=f(A)$ ; For any $A$ with $|A|=k+1$ ,

[TABLE]

where $d_{A}$ satisfies

[TABLE]

Note that when $f$ is already defined as a polymatroid function on all of $2^{X}$, $f$ itself is an extension of its restriction to $\mathcal{I}$. Therefore, $c(f)\geq d=\inf_{g\in\mathcal{E}_{f}}c(g)$, where $\mathcal{E}_{f}$ is the set of all polymatroid functions $g$ on $2^{X}$ that agree with $f$ on $\mathcal{I}$. So if a polymatroid set function defined on the matroid can be extended to one defined on the whole power set, applying the performance bounds in Theorem 2.5 results in the following theorem.

Theorem 2.7

(Liu et al. 2019)*

Let $(X,\mathcal{I})$ be a matroid of rank $K$ and $f:\mathcal{I}\rightarrow\mathbb{R}$ a polymatroid function. If there exists an extension of $f$ to the entire power set, then any greedy solution $G_{K}$ to problem $(\ref{setproblem})$ satisfies*

[TABLE]

where $d=\inf_{g\in\mathcal{E}_{f}}c(g)$ . In particular, when $(X,\mathcal{I})$ is a uniform matroid, any greedy solution $G_{K}$ to problem $(\ref{setproblem})$ satisfies

[TABLE]

Remark 14

The bounds $1/(1+d)$ and $(1-e^{-d})/d$ apply to problems where the objective function is a polymatroid function defined only for sets in the matroid and can be extended to one defined on the entire power set. However, these bounds still depend on sets not in the matroid, because of the way $d$ is defined.

Then Liu et al. (2019) defined a new curvature called the partial curvature $b(h)$ as follows:

[TABLE]

The partial curvature satisfies $b(f)\leq c(g)$ whenever $g$ is an extension of $f$ from $\mathcal{I}$ to $2^{X}$. The following theorem provides necessary and sufficient conditions for an extension $g$ to have $c(g)=b(f)$.

Theorem 2.8

(Liu et al. 2019)*

Let $(X,\mathcal{I})$ be a matroid and $f:\mathcal{I}\rightarrow\mathbb{R}$ a polymatroid function. Let $g:2^{X}\rightarrow\mathbb{R}$ be a polymatroid function that agrees with $f$ on $\mathcal{I}$ . Then $c(g)=b(f)$ if and only if*

[TABLE]

for any $a\in X$ , and equality holds for some $a\in X$ .

Liu et al. (2019) provided the following improved bounds for the greedy strategy if there exists an extension $g$ of $f$ such that $c(g)=b(f)$ .

Theorem 2.9

(Liu et al. 2019)*

Let $(X,\mathcal{I})$ be a matroid of rank $K$ . Let $g:2^{X}\rightarrow\mathbb{R}$ be a polymatroid function that agrees with $f$ on $\mathcal{I}$ such that $g(X)-g(X\setminus\{a\})\geq(1-b(f))g(\{a\})$ for any $a\in X$ with equality holding for some $a\in X$ . Then, any greedy solution $G_{K}$ to problem $(\ref{setproblem})$ satisfies*

[TABLE]

In particular, when $(X,\mathcal{I})$ is a uniform matroid, any greedy solution $G_{K}$ to problem $(\ref{setproblem})$ satisfies

[TABLE]

Remark 15

The bounds ${1}/({1+b(f)})$ and $(1-\left(1-{b(f)}/{K}\right)^{K})/b(f)$ do not depend on sets outside the matroid, so they apply to problems where the objective function is only defined on the matroid, provided that an extension that satisfies the assumptions in Theorem 2.8 exists. When $f$ is defined on the entire power set, $b(f)\leq c(f)$ , which implies that the bounds are stronger than those from Conforti and Cornuéjols (1984).

Next consider again the task assignment problem from Section 2.4. Liu et al. (2019) gave an extension $g$ of $f$ defined on the uniform matroid of rank $2$ to the whole power set with $c(g)=b(f)$ , which is reviewed as follows.

Example: Let $X=\{a_{1},a_{2},a_{3},a_{4}\}$, $p(a_{1})=0.4$, $p(a_{2})=0.6$, $p(a_{3})=0.8$, and $p(a_{4})=0.9$. Then, $f(A)$ is defined as in (10) for any $A=\{a_{i},\ldots,a_{k}\}\subseteq X$. Let $K=2$; then $\mathcal{I}=\{S\subseteq X:|S|\leq 2\}$. It is easy to show that $f:\mathcal{I}\rightarrow\mathbb{R}$ is a polymatroid function.

The polymatroid function $g$ constructed using (12) while satisfying (13) and (17) from Liu et al. (2019) is of the following form:

$g(\{a_{1},a_{2},a_{3}\})=f(\{a_{2},a_{3}\})+d_{\{a_{1},a_{2},a_{3}\}}=0.96,$

$g(\{a_{1},a_{2},a_{4}\})=f(\{a_{2},a_{4}\})+d_{\{a_{1},a_{2},a_{4}\}}=1,$

$g(\{a_{1},a_{3},a_{4}\})=f(\{a_{3},a_{4}\})+d_{\{a_{1},a_{3},a_{4}\}}=1.02,$

$g(\{a_{2},a_{3},a_{4}\})=f(\{a_{3},a_{4}\})+d_{\{a_{2},a_{3},a_{4}\}}=1.04,$

$g(X)=g(\{a_{2},a_{3},a_{4}\})+d_{X}=1.08$ .

The total curvature $c$ of $g:2^{X}\rightarrow\mathbb{R}$ is

[TABLE]

By Theorem 2.9, the greedy strategy for the task scheduling problem satisfies the bound $(1-(1-{b(f)}/{2})^{2})/b(f)=0.775$ , which is better than the previous bound $(1-(1-{c(f)}/{2})^{2})/c(f)=0.752$ .
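The numbers in this example can be reproduced with a few lines of Python. The sketch below takes the values of $g$ listed above, assumes that $g$ agrees with $f$ on singletons so that $g(\{a_{j}\})=p(a_{j})$, assumes $f(A)=1-\prod_{a\in A}(1-p(a))$ for the original objective on the full power set, and evaluates the total curvature and the uniform-matroid bound $(1-(1-c/K)^{K})/c$ for both functions. Variable names are illustrative.

```python
import math

p = {1: 0.4, 2: 0.6, 3: 0.8, 4: 0.9}
K = 2

# Values of the extension g needed for its total curvature.
g_full = 1.08
g_without = {1: 1.04, 2: 1.02, 3: 1.00, 4: 0.96}  # g(X minus {a_j})


def f(agents):
    """Original objective on the whole power set, assuming independent agents."""
    return 1.0 - math.prod(1.0 - p[a] for a in agents)


def uniform_bound(c, K):
    """Greedy bound (1 - (1 - c/K)^K) / c under a uniform matroid of rank K."""
    return (1.0 - (1.0 - c / K) ** K) / c


# Total curvature of g; singletons agree with f, so g({a_j}) = p(a_j).
c_g = max(1.0 - (g_full - g_without[j]) / p[j] for j in p)
# Total curvature of f viewed as a function on the whole power set.
X = set(p)
c_f = max(1.0 - (f(X) - f(X - {j})) / f({j}) for j in p)

print(round(c_g, 3), round(uniform_bound(c_g, K), 3))
print(round(c_f, 3), round(uniform_bound(c_f, K), 3))
```

Running the sketch gives a curvature of 0.9 for the extension and 0.992 for the original objective, and the resulting bounds agree with the values 0.775 and 0.752 quoted above.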

2.7 Batch Actions

Suppose we batch the selected actions into batches of size $k$ . What results is the $k$ -batch greedy strategy, which starts with the empty set and iteratively adds to the current solution set a batch of elements with the largest gain in the objective function under the constraints. The greedy strategy we considered in Sections 2.3–2.5 is a special case of the batched greedy with batch size equal to $1$ . Intuitively, larger $k$ should result in better performance, albeit at the expense of increasing computational complexity. But how do the previous bounds improve as a function of $k$ ? In this section, we review performance bounds for the $k$ -batch greedy strategy.

We start by introducing the $k$-batch greedy strategy. Consider again problem (5) and write the maximal cardinality of the sets in $\mathcal{I}$ as $K=k(l-1)+m$, where $l$ and $m$ are positive integers with $0<m\leq k$. Note that $m$ is not necessarily the remainder of $K/k$, because $m$ equals $k$ when $k$ divides $K$. The $k$-batch greedy strategy is as follows (Liu et al. 2018c, d):

Step 1: Let $S^{0}=\emptyset$ and $t=0$ .

Step 2: Select $J_{t+1}\subseteq X\setminus S^{t}$ such that $|J_{t+1}|=k$ , $S^{t}\cup J_{t+1}\in\mathcal{I}$ , and

[TABLE]

then set $S^{t+1}=S^{t}\cup J_{t+1}$ .

Step 3: If $t+1<l-1$ , set $t=t+1$ , and repeat Step 2.

Step 4: If $t+1=l-1$ , select $J_{l}\subseteq X\setminus S^{l-1}$ such that $|J_{l}|=m$ , $S^{l-1}\cup J_{l}\in\mathcal{I}$ , and

[TABLE]

Step 5: Return the set $S=S^{l-1}\cup J_{l}$ and terminate.

Any set generated by the above procedure is called a $k$-batch greedy solution. The strategy has $l$ steps in total: exactly $k$ actions are selected at each of the first $l-1$ steps, and the final step selects $m\leq k$ actions. Hausmann et al. (1980) investigated a similar batched strategy, called the $(\leq k)$-greedy strategy, in which at most $k$ actions are selected at each stage.
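The steps above can be sketched in Python for the special case of a uniform matroid, where the independence test reduces to a cardinality check. This is a minimal illustration rather than the implementation used in the cited papers: each batch is found by exhaustive search over all candidate batches, the objective is the single-subtask task assignment function under an independence assumption, and the probabilities and function names are hypothetical.

```python
import math
from itertools import combinations

p = [0.4, 0.6, 0.8, 0.9, 0.5]  # hypothetical success probabilities


def f(agents):
    """Single-subtask objective, assuming agents succeed independently."""
    return 1.0 - math.prod(1.0 - p[a] for a in agents)


def batch_greedy(f, ground, K, k):
    """k-batch greedy under the uniform matroid constraint |S| <= K."""
    l = -(-K // k)       # number of steps, ceil(K / k)
    m = K - k * (l - 1)  # size of the final batch, 0 < m <= k
    solution = frozenset()
    for step in range(1, l + 1):
        size = k if step < l else m
        remaining = sorted(set(ground) - solution)
        # Exhaustively pick the batch J that maximizes f(S union J),
        # which is equivalent to maximizing the gain f(S union J) - f(S).
        best = max(combinations(remaining, size),
                   key=lambda J: f(solution | set(J)))
        solution = solution | set(best)
    return solution


solution = batch_greedy(f, range(len(p)), K=3, k=2)
print(sorted(solution))
```

The exhaustive search over batches examines on the order of $\binom{N}{k}$ candidates per step, which reflects the increase in computational cost with the batch size.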

The performance of the $k$ -batch greedy strategy under uniform matroid constraints was first investigated by Nemhauser et al. (1978), stated as follows.

Theorem 2.10

(Nemhauser et al. 1978)* If $(X,\mathcal{I})$ is a uniform matroid of rank $K$ and $f$ is a polymatroid set function, then any $k$ -batch greedy solution $S$ satisfies*

[TABLE]

Remark 16

When $m=k$ , i.e., the batch size $k$ divides the rank $K$ , the bound is tight; see Nemhauser et al. (1978) for proof.

By introducing the total $k$ -batch curvature

[TABLE]

where $\hat{X}=\{I\subseteq X:\varrho_{I}(\emptyset)\neq 0\ \text{and}\ |I|=k\}$, Liu et al. (2018d) derived performance bounds in terms of $c_{k}$ for the $k$-batch greedy strategy under both general matroid and uniform matroid constraints, and investigated the monotonicity of the performance bounds with respect to the batch size $k$.

Theorem 2.11

(Liu et al. 2018d)* Assume that $f$ is a polymatroid set function. When $(X,\mathcal{I})$ is a general matroid, then any $k$ -batch greedy solution satisfies*

[TABLE]

When $(X,\mathcal{I})$ is a uniform matroid, then any $k$ -batch greedy solution satisfies

[TABLE]

Remark 17

When $k=1$, the bound for general matroid constraints becomes $1/(1+c)$ and the bound for uniform matroid constraints becomes $(1-(1-c/K)^{K})/c$, which is consistent with the results in Theorem 2.5.

Remark 18

The total $k$ -batch curvature is nonincreasing in $k$ , i.e., $c_{k_{2}}\leq c_{k_{1}}$ whenever $k_{2}\geq k_{1}$ (Liu et al. 2018d).

Remark 19

Based on Remark 18, we can discuss the monotonicity of the bounds for both general matroid and uniform matroid constraints. The bound $1/(1+c_{k})$ for general matroid constraints is nondecreasing in $k$. For uniform matroid constraints, when the batch size $k$ divides $K$, the bound becomes

[TABLE]

which is nondecreasing in $k$. Moreover,

[TABLE]

which means that the bound for uniform matroid constraints is better than the bound for general matroid constraints. However, if $k$ does not divide $K$ , the exponential bound might be worse than the harmonic bound. For example, when $K=100,k=80$ , and $c_{k}=0.6$ , the exponential bound is $0.5875$ , which is worse than the harmonic bound $0.6250$ (Liu et al. 2018d).
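The comparison in this example is easy to reproduce. The sketch below evaluates the harmonic bound $1/(1+c_{k})$ and the exponential bound in the form $(1-(1-\frac{c_{k}}{l}\cdot\frac{m}{k})(1-\frac{c_{k}}{l})^{l-1})/c_{k}$, which is the expression that appears in the task assignment discussion of this section, with $l$ and $m$ obtained from $K=k(l-1)+m$ and $0<m\leq k$. The function name is illustrative.

```python
def batch_bounds(K, k, c_k):
    """Harmonic and exponential bounds for the k-batch greedy strategy."""
    l = -(-K // k)       # ceil(K / k)
    m = K - k * (l - 1)  # 0 < m <= k
    harmonic = 1.0 / (1.0 + c_k)
    exponential = (
        1.0 - (1.0 - (c_k / l) * (m / k)) * (1.0 - c_k / l) ** (l - 1)
    ) / c_k
    return harmonic, exponential


harmonic, exponential = batch_bounds(K=100, k=80, c_k=0.6)
print(round(harmonic, 4), round(exponential, 4))
```

For $K=100$ and $k=80$ the decomposition gives $l=2$ and $m=20$, so the final batch is much smaller than the first one, which is what weakens the exponential bound in this case.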

Examples: Now consider again the task assignment and adaptive sensing problems from Section 2.4 to demonstrate that the total curvature $c_{k}$ decreases in $k$ and the performance bound for a uniform matroid increases in $k$ under the condition that the batch size $k$ divides the rank $K$ .

Task Assignment Problem: We still order the elements of $X$ as $a_{[1]},a_{[2]},\ldots,a_{[N]}$ such that

[TABLE]

Then by the definition of the total curvature $c_{k}$ , we have

[TABLE]

From the expression of $c_{k}$ , we can see that $c_{k}$ is nonincreasing in $k$ , but when $N$ is large, $c_{k}$ is close to 1 for each $k$ .

To numerically evaluate the relevant quantities here, Liu et al. (2018d) randomly generated a set $\{p(a_{i})\}_{i=1}^{30}$. In Fig. 3, they considered $K=20$ and batch sizes $k=1,2,\ldots,10$. Fig. 3 shows that the exponential bound for $k=3,6,8,9$ is worse than that for $k=1,2$, which illustrates our earlier remark that the exponential bound for the uniform matroid case is not necessarily monotone in $k$ even though $c_{k}$ is monotone in $k$. Fig. 3 also shows that the exponential bound $(1-(1-c_{k}/l\cdot m/k)(1-c_{k}/l)^{l-1})/c_{k}$ coincides with $(1-(1-c_{k}/l)^{l})/c_{k}$ for $k=1,2,4,5,10$ and is nondecreasing in $k$ over these values, which illustrates our remark that the exponential bound is nondecreasing in $k$ under the condition that $k$ divides $K$. Owing to the nature of the total curvature for this example, it is not easy to see that $c_{k}$ is nonincreasing in $k$ (all $c_{k}$ values here are very close to 1).

Adaptive Sensing: For convenience, set $\sigma=1$ . Then, we have

[TABLE]

where $X=\{B_{1},\ldots,B_{N}\}$ , $s=1+\sum_{i=1}^{N}e_{i}$ , and $t=1+\sum_{i=1}^{N}(1-e_{i})$ .

We already saw that the exponential bound for the uniform matroid case is not necessarily monotone in $k$ from the task assignment problem, so we will only consider the case when the batch size $k$ divides $K$ . Liu et al. (2018d) considered $K=24$ for $k=1,2,3,4,6,8$ in Fig. 4. The figure shows that the curvature decreases in $k$ and the exponential bound increases in $k$ since $k$ divides $K$ for $k=1,2,3,4,6,8$ , which again demonstrates the claim that $c_{k}$ decreases in $k$ and the exponential bound increases in $k$ under the condition that $k$ divides $K$ .

2.8 Noncooperative Games

In the previous sections, we reviewed performance bounds for greedy-type strategies in set submodular optimization problems. It turns out that similar techniques can be used to bound the performance of Nash equilibria in noncooperative games, specifically in utility maximization problems. The connection to the game setting is easy to imagine by associating the objective function in set optimization with a social utility function in games, greedy strategies with Nash equilibria, and batching with cooperation of subgroups in games. We first introduce some background on utility maximization problems and Nash equilibria.

A great number of interesting practical problems can be posed as utility maximization problems: these include facility location (Ahmed and Atamtürk 2011), traffic routing and congestion management (Arslan et al. 2007; He et al. 2007), sensor selection (Rowaihy et al. 2007; Liu et al. 2014), and network resource allocation (La and Anantharam 2002; Palomar and Chiang 2007). In a utility maximization problem, a set of users make decisions according to their own set of feasible strategies, resulting in an overall social utility value, such as profit, coverage, achieved data rate, and quality of service. The goal is to maximize the social utility function. Often, the users do not cooperate in selecting their strategies.

In general, it is impractical to find the optimal strategy maximizing the social utility function. However, it is feasible to consider scenarios where individual users or groups of users separately maximize their own private objective functions. The usual framework for studying such scenarios is game theory together with its celebrated notion of Nash equilibria. A Nash equilibrium is a set of strategies (deterministic or randomized) for which no user can improve its own private utility by changing its strategy unilaterally. Nash (1951) proved that any finite and non-cooperative game has at least one Nash equilibrium.

The performance of Nash equilibria compared with the optimal solution in submodular utility maximization problems was investigated by Vetta (2002). Based on the existing results, Liu et al. (2018b) established bounds for Nash equilibria when there is “grouping” among users, which is useful in understanding the role of cooperation and social ties in games. Before we review these results, we introduce some notation and terminology from Vetta (2002) and Liu et al. (2018b).

Suppose we have a set $\mathcal{N}=\{1,2,\ldots,N\}$ of $N$ users. Each element in $V_{i}$ $(i=1,\ldots,N)$ represents an act that user $i$ can take. We call a set of acts an action, and if an action $x_{i}\subseteq V_{i}$ is available to user $i$ we call it a feasible action. We denote by $\mathcal{X}_{i}$ the set of all feasible actions for user $i$ , i.e., $\mathcal{X}_{i}=\{x_{i}\subseteq V_{i}:x_{i}$ is a feasible action $\}$ , with $n_{i}=|\mathcal{X}_{i}|$ the cardinality of $\mathcal{X}_{i}$ . We call $\mathcal{X}_{i}$ the action space for user $i$ . A pure strategy is one in which the user takes a specific action. A mixed strategy is one in which the user takes actions according to some probability distribution. The set of mixed strategies is called the strategy space. We represent the strategy space for user $i$ by $\mathcal{S}_{i}=\{s_{i}\in\mathbb{R}^{n_{i}}:\sum_{j=1}^{n_{i}}s_{i}^{j}=1,s_{i}^{j}\geq 0\}$ , where $s_{i}=(s_{i}^{1},\ldots,s_{i}^{n_{i}})$ is called a strategy taken by user $i$ and $s_{i}^{j}\geq 0$ is the probability with which user $i$ takes action $j$ . When $s_{i}^{j}=1$ for some $j$ and $s_{i}^{l}=0$ for all $l\neq j$ , user $i$ is said to take a pure strategy. Otherwise, user $i$ takes a mixed strategy. Write $\mathcal{S}=\prod_{i=1}^{N}\mathcal{S}_{i}$ . The indexed set $S=(s_{1},\ldots,s_{N})$ , with $s_{i}\in\mathcal{S}_{i}$ and $i=1,\ldots,N$ , is called a strategy set of size $N$ in $\mathcal{S}$ .

Given a strategy set $S=(s_{1},\ldots,s_{N})\in\mathcal{S}$ , the set $S_{-i}=(s_{1},\ldots,s_{i-1},s_{i+1},\ldots,s_{N})$ is the subset of $S$ that contains strategies taken by all users except user $i$ , and $(S_{-i},s_{i}^{\prime})=(s_{1},\ldots,s_{i-1},s_{i}^{\prime},s_{i+1},\ldots,s_{N})$ is the strategy set that results from $S$ when user $i$ changes its strategy from $s_{i}$ to $s_{i}^{\prime}$ .

The expected social utility function and expected private utility function for user $i$ from strategies in $\mathcal{S}$ to real numbers are denoted by $\bar{\gamma}$ and $\bar{\alpha}_{i}$ , respectively. Define $\bar{\gamma}_{s_{i}}(S_{-i})=\bar{\gamma}(S)-\bar{\gamma}(S_{-i})$ for any set $S=(s_{1},\ldots,s_{N})\in\mathcal{S}$ and $s_{i}$ $(i=1,\ldots,N)$ .

Now we introduce the definition of a Nash equilibrium and a valid system, then review performance bounds for Nash equilibria under some conditions from Vetta (2002).

Definition 1

A strategy set $S\in\mathcal{S}$ is a Nash equilibrium if no user has an incentive to unilaterally change its strategy, i.e., for any user $i$ ,

[TABLE]

Assumption 1

(Vetta 2002)* The private utility of user $i$ ( $i=1,\ldots,N$ ) is at least as large as the loss in the social utility resulting from user $i$ dropping out of the game. That is, the system ( $\bar{\gamma},\{\bar{\alpha}_{i}\}_{i=1}^{N}$ ) has the property that for any strategy set $S=(s_{1},\ldots,s_{N})\in\mathcal{S}$ ,*

[TABLE]

Assumption 2

(Vetta 2002)* The sum of the private utilities of the system is not larger than the social utility, i.e., for any strategy set $S=(s_{1},\ldots,s_{N})\in\mathcal{S}$ ,*

[TABLE]

A utility system $(\bar{\gamma},\{\bar{\alpha}_{i}\}_{i=1}^{N})$ satisfying Assumptions 1 and 2 is called a valid system. We denote by $\Omega=(\omega_{1},\ldots,\omega_{N})$ the optimal strategy set in maximizing an expected utility function $\bar{\gamma}$ , and assume that $\Omega$ is composed of pure strategies $\omega_{i}\in\mathcal{S}_{i}$ , $i=1,\ldots,N$ . For convenience, we also use $\omega_{i}$ to denote the optimal action that user $i$ takes. Consider a strategy set $S=(s_{1},\ldots,s_{i})$ where $i=1,\ldots,N$ . Suppose that user $j$ ( $j=1,\ldots,i$ ) uses a mixed strategy $s_{j}$ that takes actions $x_{j}^{1},\ldots,x_{j}^{n_{j}}$ with probabilities $s_{j}^{1},\ldots,s_{j}^{n_{j}}$ . We use the notation $\Omega\cup S$ to represent the strategy in which user $j$ $(j=1,\ldots,i)$ takes the actions $\omega_{j}\cup x_{j}^{1},\ldots,\omega_{j}\cup x_{j}^{n_{j}}$ with probabilities $s_{j}^{1},\ldots,s_{j}^{n_{j}}$ , and user $j$ $(j=i+1,\ldots,N)$ takes the action $\omega_{j}$ , so $\Omega\cup S$ is well defined.

Theorem 2.12

(Vetta 2002)* For a valid utility system $(\bar{\gamma},\{\bar{\alpha}_{i}\}_{i=1}^{N})$ , if the expected social utility function $\bar{\gamma}$ is submodular, then for any Nash equilibrium $S\in\mathcal{S}$ we have*

[TABLE]

Remark 20

If $\bar{\gamma}$ is monotone, then $\bar{\gamma}_{s_{i}}({\Omega\cup S_{-i}})\geq 0$ and the above inequality shows that any Nash equilibrium achieves at least $1/2$ of the optimal social utility function value.
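To make the $1/2$ guarantee concrete, the following sketch enumerates the pure-strategy Nash equilibria of a tiny coverage game and compares the worst one with the social optimum. Everything in it is a hypothetical illustration: the action sets are invented, only pure strategies are considered, the social utility is the number of covered items (a monotone submodular function), and each private utility is taken to be the user's marginal contribution to the social utility, which is one choice that satisfies Assumptions 1 and 2 for such a function.

```python
from itertools import product

# Hypothetical coverage game: each user picks one action (a set of covered items).
ACTIONS = [
    [frozenset({1, 2}), frozenset({3, 4})],  # user 0
    [frozenset({1, 2}), frozenset({3})],     # user 1
]


def social(profile):
    """Social utility: number of distinct items covered by all chosen actions."""
    covered = set()
    for i, choice in enumerate(profile):
        if choice is not None:  # None means the user has dropped out
            covered |= ACTIONS[i][choice]
    return len(covered)


def private(profile, i):
    """Private utility of user i: its marginal contribution to the social utility."""
    dropped = profile[:i] + (None,) + profile[i + 1:]
    return social(profile) - social(dropped)


def is_pure_nash(profile):
    """True if no user can strictly gain by a unilateral change of action."""
    for i, options in enumerate(ACTIONS):
        for alt in range(len(options)):
            deviated = profile[:i] + (alt,) + profile[i + 1:]
            if private(deviated, i) > private(profile, i):
                return False
    return True


profiles = list(product(*(range(len(a)) for a in ACTIONS)))
optimum = max(social(s) for s in profiles)
equilibria = [s for s in profiles if is_pure_nash(s)]
worst = min(social(s) for s in equilibria)
print(optimum, worst)
```

In this instance one equilibrium is optimal and the other is not, and the suboptimal equilibrium still achieves more than half of the optimal social utility.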

By defining the curvature $c$ of the expected social utility function $\bar{\gamma}$ ,

[TABLE]

Vetta (2002) derived the following tighter performance bound in terms of the curvature for Nash equilibria.

Theorem 2.13

(Vetta 2002)* For a valid utility system $(\bar{\gamma},\{\bar{\alpha}_{i}\}_{i=1}^{N})$ , if the expected social utility function $\bar{\gamma}$ is monotone and submodular, then for any Nash equilibrium $S\in\mathcal{S}$ we have*

[TABLE]

Remark 21

When the expected social utility function $\bar{\gamma}$ is monotone and submodular, we have $c\in[0,1]$ , which implies that $\bar{\gamma}(S)\geq\bar{\gamma}(\Omega)/2$ .

Next we review performance bounds for group Nash equilibria defined by Liu et al. (2018b). They considered the case where the set of all users in the utility maximization system are divided into disjoint groups, and the users in the same group choose their strategies by maximizing their group utility function jointly.

Assume that the set of users $\mathcal{N}=\{1,\ldots,N\}$ is divided into $l$ disjoint groups, in which group $i$ ( $i=1,\ldots,l$ ) has users $\{m_{i}+1,\ldots,m_{i}+k_{i}\}$ , where $m_{i}=\sum_{j=1}^{i-1}k_{j}$ , $k_{j}$ is the number of users in group $j$ , and $\sum_{j=1}^{l}k_{j}=N$ . Let $s^{i}=(s_{m_{i}+1},\ldots,s_{m_{i}+k_{i}})$ denote the group strategy for group $i$ , where $s_{i}\in\mathcal{S}_{i}$ is the strategy for user $i$ . This includes the strategies taken by all the users in group $i$ ( $i=1,\ldots,l$ ). Let $S^{-i}$ denote the set of group strategies taken by all groups except for group $i$ and $(S^{-i},t^{i})$ denote the group strategy set obtained when group $i$ changes its group strategy from $s^{i}$ to $t^{i}$ . Let $\bar{\eta}_{i}$ denote the expected group utility function for group $i$ . Define $\bar{\gamma}_{s^{i}}(S^{-i})=\bar{\gamma}(S)-\bar{\gamma}(S^{-i})$ for any $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ and $s^{i}$ ( $i=1,\ldots,l$ ).

Definition 2

A strategy set $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ is a group Nash equilibrium of a utility system if no group can improve its group utility by unilaterally changing its group strategy, i.e., for any $i=1,\ldots,l$ ,

[TABLE]

where $t_{j}\in\mathcal{S}_{j}$ for $j=m_{i}+1,\ldots,m_{i}+k_{i}$ .

The utility system $(\bar{\gamma},\{\bar{\eta}_{i}\}_{i=1}^{l})$ is valid if it satisfies the following two assumptions (Liu et al. 2018b).

Assumption 3

The group utility of group $i$ is at least as large as the loss in the social utility resulting from all the users in group $i$ dropping out of the game. That is, the system $(\bar{\gamma},\{\bar{\eta}_{i}\}_{i=1}^{l})$ has the property that for any strategy set $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ ,

[TABLE]

Assumption 4

The sum of the group utilities of the system is not larger than the social utility, i.e., for any strategy set $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ ,

[TABLE]

Theorem 2.14

(Liu et al. 2018b)*

For a valid utility system $(\bar{\gamma},\{\bar{\eta}_{i}\}_{i=1}^{l})$ , if the expected social utility function $\bar{\gamma}$ is submodular, then any group Nash equilibrium $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ satisfies*

[TABLE]

To better characterize the relation of the social utility value of any group Nash equilibrium and that of the optimal solution $\Omega$ , Liu et al. (2018b) defined the group curvature $c_{k_{i}}$ of the social utility function for group $i$ as

[TABLE]

Theorem 2.15

(Liu et al. 2018b)*

For a valid utility system $(\bar{\gamma},\{\bar{\eta}_{i}\}_{i=1}^{l})$ , if the expected social utility function $\bar{\gamma}$ is monotone and submodular, then any group Nash equilibrium $S=(s^{1},\ldots,s^{l})\in\mathcal{S}$ satisfies*

[TABLE]

In particular, if $\mathcal{X}_{1}=\mathcal{X}_{2}=\cdots=\mathcal{X}_{N}$ , we have

[TABLE]

where $k^{*}=\min_{1\leq i\leq l}k_{i}$ .

Remark 22

When the expected social utility function $\bar{\gamma}$ is monotone and submodular, it is easy to check that $c_{k_{i}}\in[0,1]$, which implies that $1/(1+\max_{1\leq i\leq l}c_{k_{i}})\geq 1/2$.

Remark 23

When the expected social utility function $\bar{\gamma}$ is monotone and submodular, we have $\bar{\gamma}(S)\geq\bar{\gamma}(\Omega)/(1+\max_{1\leq i\leq l}c_{k_{i}})\geq\bar{\gamma}(\Omega)/(1+c)$. This shows that the bound for the case with grouping is tighter than that for the case without grouping. Of course, this is unsurprising, because grouping entails cooperation. Moreover, under the condition that each user has the same strategy space, the larger the value of $k_{i}$, the higher the degree of cooperation, and the tighter the lower bound.

3 Strings of Actions

In Section 2, we considered the optimization problem where the argument of the objective function is a set of actions. Suppose now that the objective function depends not only on the set of actions but also on the order of actions. We call the argument of the objective function a string of actions. In this section, we introduce notation and terminology for strings and string functions, formulate the string optimization problem, and review performance bounds for the greedy strategy and applications.

3.1 Notation and Terminology

Let $X$ be a set of all possible actions. We use $A=(a_{1},a_{2},\ldots,a_{k})$ ( $a_{i}\in X$ ) to denote a string of actions taken over $k$ consecutive stages. We define its length as $k$ , denoted by $|A|=k$ . Note that $k=0$ corresponds to the empty string, denoted by $A=\emptyset$ .

Let ${X}^{*}$ denote the set of all possible strings of actions. If two strings in ${X}^{*}$ are expressed by $M=(a_{1}^{m},a_{2}^{m},\ldots,a_{k_{1}}^{m})$ and $N=(a_{1}^{n},a_{2}^{n},\ldots,a_{k_{2}}^{n})$ , we write $M=N$ iff $k_{1}=k_{2}$ and $a_{i}^{m}=a_{i}^{n}$ for each $i=1,2,\ldots,k_{1}$ . Moreover, we define string concatenation as $M\oplus N=(a_{1}^{m},a_{2}^{m},\ldots,a_{k_{1}}^{m},a_{1}^{n},a_{2}^{n},\ldots,a_{k_{2}}^{n})$ .

We write $M\preceq N$ if we have $N=M\oplus L$ for some $L\in X^{*}$ . In this case, we also say that $M$ is a prefix of $N$ . We write $M\prec N$ if there exists a set of strings $L_{i}\in X^{*}$ such that $N=L_{1}\oplus(a_{1}^{m},\ldots,a_{i_{1}}^{m})\oplus L_{2}\oplus(a_{i_{1}+1}^{m},\ldots,a_{i_{2}}^{m})\oplus\cdots\oplus(a_{i_{k-1}+1}^{m},\ldots,a_{k_{1}}^{m})\oplus L_{k}$ . Note that $\prec$ is weaker than $\preceq$ , which means $M\preceq N$ implies $M\prec N$ , but the converse is not necessarily true.
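The string operations and the two order relations can be made concrete with a short Python sketch in which strings of actions are stored as tuples. The function names are illustrative; `is_embedded` implements the relation $\prec$ as an in-order embedding of $M$ into $N$, which is how the decomposition above reads when the strings $L_{i}$ are allowed to be empty.

```python
def concat(M, N):
    """String concatenation of M and N, with strings stored as tuples."""
    return tuple(M) + tuple(N)


def is_prefix(M, N):
    """True if M is a prefix of N, i.e. N equals M followed by some string L."""
    return tuple(N[:len(M)]) == tuple(M)


def is_embedded(M, N):
    """True if the actions of M appear in N in the same order (not necessarily adjacent)."""
    remaining = iter(N)
    # Each membership test consumes the iterator up to and including the match,
    # so later actions of M must be found strictly after earlier ones.
    return all(a in remaining for a in M)


M = ("a", "b")
N = ("a", "c", "b", "d")
print(is_prefix(M, N), is_embedded(M, N))
```

Here $M$ is embedded in $N$ but is not a prefix of it, which illustrates that the converse implication fails.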

Similar to the definition of a polymatroid set function in Section 2.1, we define a function from strings to real numbers, $f:X^{*}\to\mathbb{R}$ , a polymatroid string function if

i.

$f(\emptyset)=0$ .

ii.

$f$ has the prefix-monotone property: $\forall M,N\in X^{*},$ $f(M\oplus N)\geq f(M)$ .

iii.

$f$ has the diminishing-return property: $\forall M\preceq N\in X^{*},\forall a\in X$ , $f(M\oplus(a))-f(M)\geq f(N\oplus(a))-f(N)$ .

A function $f:X^{*}\to\mathbb{R}$ is postfix monotone if

$\forall M,N\in X^{*},\quad f(M\oplus N)\geq f(N).$

Notice the difference between the prefix-monotone property and postfix-monotone property.

Let $\mathcal{I}$ denote a collection of strings from $X^{*}$ . The pair $(X,\mathcal{I})$ is called a string matroid (Zhang et al. 2016) if $\mathcal{I}$ satisfies the following properties:

i. $\mathcal{I}$ is non-empty;

ii. Hereditary: $\forall M\in\mathcal{I}$, $N\prec M$ implies that $N\in\mathcal{I}$;

iii. Augmentation: $\forall M,N\in\mathcal{I}$ with $|M|<|N|$, there exists an element $x\in X$ in the string $N$ such that $M\oplus(x)\in\mathcal{I}$.

The length of the longest string in $\mathcal{I}$ is called the rank of $(X,\mathcal{I})$ . When $\mathcal{I}=\{A\in X^{*}:|A|\leq K\}$ , the pair $(X,\mathcal{I})$ is called a uniform string matroid of rank $K$ .
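As an illustration, the three string matroid properties can be verified by exhaustive enumeration for small finite collections. This is a sketch only; the helper names `subsequences`, `is_string_matroid`, `uniform_string_matroid`, and `rank` are hypothetical, and the approach is exponential in the string length.

```python
from itertools import combinations, product


def subsequences(m):
    """Yield every string obtained from m by deleting zero or more actions."""
    for r in range(len(m) + 1):
        for idx in combinations(range(len(m)), r):
            yield tuple(m[i] for i in idx)


def is_string_matroid(collection):
    """Check non-emptiness, the hereditary property, and augmentation."""
    feasible = set(collection)
    if not feasible:
        return False
    hereditary = all(n in feasible for m in feasible for n in subsequences(m))
    augmentation = all(
        any(m + (x,) in feasible for x in n)
        for m in feasible for n in feasible if len(m) < len(n)
    )
    return hereditary and augmentation


def uniform_string_matroid(actions, K):
    """All strings over `actions` of length at most K."""
    return {s for k in range(K + 1) for s in product(actions, repeat=k)}


def rank(collection):
    """Length of the longest string in the collection."""
    return max(len(s) for s in collection)


I = uniform_string_matroid(["a", "b"], 2)
print(is_string_matroid(I), rank(I))
```

A collection such as `{(), ("a", "b")}` fails the check because it omits the subsequence `("a",)`.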

3.2 String Optimization Problem

In this section, we first formulate the string optimization problem and define the greedy strategy. Then we review performance bounds for the greedy strategy under uniform string matroid constraints and general string matroid constraints.

In a variety of problems in engineering and applied science such as sequential decision making (Littman 1996; Roijers et al. 2013), adaptive sensing (Liu et al. 2014; Krause et al. 2008), and adaptive control (Jarvis 1975; Schlegel et al. 2005), we are faced with optimally choosing a string (ordered set) of actions over a finite horizon to maximize an objective function under some constraints. We call this class of optimization problems string optimization. For set optimization problems, the objective function is not influenced by the order of actions. However, for string optimization problems, the objective function depends on the order of actions. Let $f:X^{*}\to\mathbb{R}$ be an objective function. The goal is to find a string $M$ , with the constraint $M\in\mathcal{I}$ , to maximize the objective function:

$$\text{maximize } f(M)\quad\text{subject to } M\in\mathcal{I}, \tag{30}$$

where $X^{*}$ denotes the set of all possible strings of actions and $\mathcal{I}$ is a collection of strings from $X^{*}$ .

The solution to the string optimization problem can be characterized using dynamic programming via Bellman’s principle (Bertsekas 2005; Powell 2007). However, dynamic programming suffers from the curse of dimensionality and is therefore impractical for many problems of interest. Hence, we often turn to approximation techniques. One such technique is the greedy strategy, which at each stage selects an action that maximizes the step-wise gain in the objective function. The performance of the greedy strategy in string optimization problems has been investigated by Streeter and Golovin (2008), Zhang et al. (2016), and Liu et al. (2015); we review their results in this section.

Assume that the rank of $(X,\mathcal{I})$ is $K$ . We now define optimal and greedy strategies for problem (30) and some related notation.

Optimal String: Any string $O$ is called an optimal solution of Problem (30) if

$$O\in\mathop{\mathrm{argmax}}\limits_{M\in\mathcal{I}}f(M).$$

If $f$ is prefix monotone, then there exists at least one optimal string of length $K$ , denoted by $O_{K}=(o_{1},\ldots,o_{K})$ .

Greedy Algorithm:

Input: A string matroid $(X,\mathcal{I})$ of rank $K$ and a string function $f:X^{*}\rightarrow\mathbb{R}$

Output: A string $G_{K}\in\mathcal{I}$

$G_{0}\leftarrow\emptyset$

For $i=1,\ldots,K$ ,

$g_{i}\leftarrow\mathop{\mathrm{argmax}}\limits_{\begin{subarray}{c}a\in X,G_{i-1}\oplus(a)\in\mathcal{I}\end{subarray}}f(G_{i-1}\oplus(a))$
$G_{i}\leftarrow G_{i-1}\oplus(g_{i})$

Any output of the above algorithm is called a greedy solution. There may exist more than one greedy solution.
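The greedy algorithm above can be sketched in Python as follows, together with an exhaustive search for comparison. The coverage objective and the sets in `covers` are hypothetical illustration data (an order-independent special case of a string function), the constraint is passed as a membership test `feasible`, and ties are broken by the order of `actions`, which is one of several valid choices.

```python
from itertools import product


def greedy_string(f, actions, feasible, K):
    """Build a string stage by stage, appending the feasible action
    that maximizes f of the extended string."""
    g = ()
    for _ in range(K):
        candidates = [a for a in actions if feasible(g + (a,))]
        if not candidates:
            break
        best = max(candidates, key=lambda a: f(g + (a,)))
        g = g + (best,)
    return g


def optimal_string(f, actions, feasible, K):
    """Exhaustive search over all feasible strings of length at most K.
    Exponential in K; usable only for tiny instances."""
    best = ()
    for k in range(K + 1):
        for s in product(actions, repeat=k):
            if feasible(s) and f(s) > f(best):
                best = s
    return best


# Hypothetical data: each action covers a set of items; f counts covered items.
covers = {"A": {1, 2, 3, 4}, "B": {1, 2, 5}, "C": {3, 4, 6}}


def coverage(s):
    return len(set().union(*(covers[a] for a in s)))


K = 2
actions = ["A", "B", "C"]
in_uniform_matroid = lambda s: len(s) <= K  # uniform string matroid of rank K

g = greedy_string(coverage, actions, in_uniform_matroid, K)
opt = optimal_string(coverage, actions, in_uniform_matroid, K)
print(g, coverage(g), opt, coverage(opt))
```

In this instance the greedy string first selects the action with the largest single-stage coverage and then cannot reach the optimal value, yet its value stays above the fraction $1-(1-1/K)^{K}$ of the optimum.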

3.3 Performance Bounds for Greedy Strategy

Streeter and Golovin (2008) first derived performance bounds for the greedy strategy under uniform string matroid constraints, stated as follows.

Theorem 3.1

(Streeter and Golovin 2008)

Let $(X,\mathcal{I})$ be a uniform string matroid. If $f:X^{*}\rightarrow\mathbb{R}$ is a polymatroid string function and postfix monotone, then any greedy string $G_{K}$ satisfies

$$f(G_{K})\geq\left(1-\left(1-\frac{1}{K}\right)^{K}\right)f(O_{K})>(1-e^{-1})f(O_{K}).$$

Remark 24

The same bound holds if $f$ satisfies $f(G_{i}\oplus O_{K})\geq f(O_{K})$ for $i=1,\ldots,K-1$ , which is weaker than being postfix monotone.

Zhang et al. (2016) investigated performance bounds for the greedy strategy under both uniform string matroid and general string matroid constraints by defining the following curvatures.

The total backward curvature of $f$ is defined as (Zhang et al. 2016)

[TABLE]

When $f$ is postfix monotone and has the diminishing-return property, we have $0\leq\sigma\leq 1$. The total backward curvature is an upper bound on the second-order difference, over all possible actions $a$ and strings $M$. Next, Zhang et al. (2016) defined the total backward curvature of $f$ with respect to the optimal string $O_{K}$ by

[TABLE]

When $f$ is postfix monotone and string submodular, it is easy to prove that $0\leq\sigma(O)\leq\sigma\leq 1$ .

Theorem 3.2

(Zhang et al. 2016)

Let $(X,\mathcal{I})$ be a uniform string matroid of rank $K$. If $f:X^{*}\rightarrow\mathbb{R}$ is a polymatroid string function, then any greedy string $G_{K}$ satisfies

[TABLE]

Moreover, if $f$ is postfix monotone, then any greedy string $G_{K}$ satisfies

[TABLE]

Remark 25

When $f$ is polymatroid and postfix monotone, we have $0\leq\sigma\leq 1$ by (32). Hence $(1-(1-\sigma/K)^{K})/\sigma\geq 1-(1-1/K)^{K}$ and $(1-e^{-\sigma})/\sigma\geq 1-e^{-1}$, with equality when $\sigma=1$, so the bounds in Theorem 3.2 are at least as good as those in Theorem 3.1.
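The comparison in Remark 25 can be checked numerically. The sketch below evaluates the two factors quoted in the remark for a few values of $\sigma\in(0,1]$ and $K$; the function names are illustrative, and $\sigma=0$ is excluded because the curvature factor is then defined only as a limit.

```python
import math


def greedy_factor(K):
    """Curvature-free factor 1 - (1 - 1/K)^K."""
    return 1 - (1 - 1 / K) ** K


def curvature_factor(sigma, K):
    """Curvature-dependent factor (1 - (1 - sigma/K)^K) / sigma, for sigma in (0, 1]."""
    return (1 - (1 - sigma / K) ** K) / sigma


def curvature_factor_limit(sigma):
    """Horizon-free factor (1 - e^{-sigma}) / sigma, for sigma in (0, 1]."""
    return (1 - math.exp(-sigma)) / sigma


for K in (2, 5, 10):
    for sigma in (0.25, 0.5, 1.0):
        print(K, sigma,
              round(curvature_factor(sigma, K), 4),
              round(greedy_factor(K), 4))
```

Smaller curvature gives a larger factor, and the two factors coincide at $\sigma=1$.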

Theorem 3.3

(Zhang et al. 2016)

Let $(X,\mathcal{I})$ be a string matroid. If $f:X^{*}\rightarrow\mathbb{R}$ is a polymatroid string function, then any greedy string $G_{K}$ satisfies

[TABLE]

Moreover, if $f$ is postfix monotone, then any greedy string $G_{K}$ satisfies

[TABLE]

From Theorems 3.1 and 3.2, we can see that all the sufficient conditions obtained so far involve strings of length greater than $K$ , even though (30) involves only strings up to length $K$ . Liu et al. (2015) derived sufficient conditions, which only involve strings of length at most $K$ , to have the same bounds hold for uniform string matroid constraints, by defining the following conditions.

A function $f:X^{*}\to\mathbb{R}$ is $K$ -polymatroid if

i. $f(\emptyset)=0$.

ii. $f$ is $K$-monotone: $\forall M,N\in X^{*}$ with $|M|+|N|\leq K$, $f(M\oplus N)\geq f(M)$.

iii. $f$ is $K$-diminishing: $\forall M\preceq N\in X^{*}$ with $|N|\leq K-1$, $\forall a\in X$, $f(M\oplus(a))-f(M)\geq f(N\oplus(a))-f(N)$.

Let $G_{K}=(g_{1},\ldots,g_{K})$ and $\bar{O}_{K-i}=(o_{i+1},\ldots,o_{K})$ for $i=1,\ldots,K$ . Then, $f$ is $K$ -GO-concave (Liu et al. 2015) if for $1\leq i\leq K-1$ ,

[TABLE]

Theorem 3.4

(Liu et al. 2015)

Let $(X,\mathcal{I})$ be a uniform string matroid. If $f$ is $K$-polymatroid and $K$-GO-concave, then any greedy string satisfies

[TABLE]

By defining the curvature $\eta$ ,

[TABLE]

Liu et al. (2015) derived more general performance bounds in terms of the curvature.

Theorem 3.5

(Liu et al. 2015)

Let $(X,\mathcal{I})$ be a uniform string matroid. If $f$ is $K$-polymatroid and $K$-GO-concave, then any greedy string satisfies

[TABLE]

Remark 26

If $f$ is $K$ -GO-concave, then we have $0\leq\eta\leq 1$ .

Examples: We again consider the task assignment problem and adaptive sensing problem from Section 2.4 to give some sufficient conditions on the parameters of the problems to achieve the performance bound $(1-(1-1/K)^{K})$ .

Task Assignment Problem: We use $p_{i}^{j}(a)$ to denote the probability of accomplishing subtask $i$ at stage $j$ when it is assigned to agent $a\in X$. With $a_{j}$ denoting the agent selected at stage $j$, the objective function $f$ becomes

[TABLE]

For simplicity, we consider the case of $n=1$ (our results can easily be generalized to the case where $n>1$ ). For $n=1$ , the objective function $f$ reduces to

[TABLE]

and from here on we simply use $p^{j}(a_{j})$ in place of $p_{1}^{j}(a_{j})$ .

Note that the value of $f$ depends on the order in which the agents are selected when the probabilities vary from stage to stage. For example, suppose that we have two agents, Alice and Bob. In general, $p^{1}(\text{Alice})\neq p^{2}(\text{Alice})$, $p^{1}(\text{Bob})\neq p^{2}(\text{Bob})$, $p^{1}(\text{Alice})\neq p^{1}(\text{Bob})$, and $p^{2}(\text{Alice})\neq p^{2}(\text{Bob})$. Therefore, in general, $f((\text{Alice},\text{Bob}))\neq f((\text{Bob},\text{Alice}))$.
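A small numerical sketch of this order dependence follows. It assumes that, for a single subtask, the objective is the probability that the subtask is accomplished in at least one stage, $f((a_{1},\ldots,a_{k}))=1-\prod_{j=1}^{k}(1-p^{j}(a_{j}))$; this form and the probability values in `p` are assumptions made for illustration only.

```python
# Hypothetical stage-dependent success probabilities p[agent][stage index].
p = {
    "Alice": [0.9, 0.5],  # stage 1, stage 2
    "Bob": [0.6, 0.7],
}


def success_probability(agents):
    """Assumed objective: probability that the single subtask is accomplished
    in at least one stage, given the agent assigned at each stage."""
    failure = 1.0
    for stage, agent in enumerate(agents):
        failure *= 1.0 - p[agent][stage]
    return 1.0 - failure


f_ab = success_probability(("Alice", "Bob"))
f_ba = success_probability(("Bob", "Alice"))
print(f_ab, f_ba)
```

With these values, assigning Alice first uses each agent at the stage where that agent is strongest, so the two orderings yield different objective values.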

It is easy to check that $f$ is $K$ -monotone and $f(\emptyset)=0$ .

Assume that $p^{j}(a)\in[L(a),U(a)]$, where $L(a)=\min_{j}p^{j}(a)$ and $U(a)=\max_{j}p^{j}(a)$. By Zhang et al. (2016), a sufficient condition for $f$ to have the diminishing-return property is

[TABLE]

where

[TABLE]

Let $\hat{U}=\max_{a}{U(a)}$ and $\hat{L}=\min_{a}{L(a)}$ . By Liu et al. (2015), a sufficient condition for $f$ to be $K$ -diminishing is

[TABLE]

and a sufficient condition for $K$ -GO-concavity is

[TABLE]

When $p^{j}(a_{j})\geq 1/2$ for all $j$, conditions (40) and (41) hold automatically, but (39) is not necessarily satisfied. In that sense, the $K$-monotone, $K$-diminishing, and $K$-GO-concavity conditions of Theorem 3.4 are weaker sufficient conditions for achieving the bound $(1-(1-\frac{1}{K})^{K})$ than the prefix-monotone, diminishing-return, and postfix-monotone conditions of Theorem 3.1.

Adaptive Sensing: Consider the situation where the additive noise vectors are independent but not identically distributed. Assume that $w_{i}$ is a zero-mean Gaussian vector with covariance $\sigma_{i}I$, where $I$ denotes the identity matrix. Recall the problem formulation in Section 2.4. The objective function $f$ for this problem is as follows:

[TABLE]

where $P_{0}=I$ and for $1\leq j\leq k-1$ ,

[TABLE]

From the expression above, it is easy to check that the order of $B_{1},\ldots,B_{k}$ influences the objective function value under the assumption that $\sigma_{1},\ldots,\sigma_{k}$ take different values. For example,

[TABLE]

and

[TABLE]

If $\sigma_{1}\neq\sigma_{2}$ , then $f((A,B))\neq f((B,A))$ .

By Liu et al. (2015), some sufficient conditions for $f$ to be $K$ -polymatroid and $K$ -GO-concave are

[TABLE]

for $i=1,\ldots,K-1$ .

By Zhang et al. (2016), achieving the bound $(1-(1-1/K)^{K})$ requires both (42) and

[TABLE]

where $[a,b]$ is the interval that contains all the $\sigma_{i}$ ’s.

Comparing the sufficient conditions for achieving the same bound $(1-(1-1/K)^{K})$ from Liu et al. (2015) and Zhang et al. (2016), we see that the conditions from Liu et al. (2015) are weaker.

4 Final Remarks

In this survey, we considered two classes of submodular maximization problems: set submodular maximization and string submodular maximization. For set submodular optimization, we reviewed performance bounds for the greedy strategy under matroid constraints, improved performance bounds, and performance bounds for the batched greedy strategy. There are many other important results on the performance of the greedy strategy under different constraints and conditions. Wolsey (1982), Sviridenko (2004), and Kulik et al. (2009) derived performance bounds for the greedy strategy in submodular maximization problems subject to a knapsack constraint and multiple linear constraints. Bian et al. (2017) established performance bounds for the greedy strategy in monotone but nonsubmodular maximization problems under uniform matroid constraints. Researchers have also investigated performance bounds for variations of the greedy strategy. Calinescu et al. (2011) and Feldman et al. (2011) derived performance bounds for a randomized continuous greedy algorithm and a unified continuous greedy algorithm in monotone submodular maximization problems, respectively. Buchbinder et al. (2012) established performance bounds for an adaptive greedy algorithm in unconstrained submodular maximization problems. They also derived performance bounds for randomized greedy algorithms in nonmonotone submodular maximization problems (Buchbinder et al. 2014). Mirzasoleiman et al. (2016) considered submodular maximization problems in a distributed fashion and derived performance bounds for a two-stage greedy algorithm under matroid or knapsack constraints. Qu et al. (2015) proposed a distributed greedy strategy and showed that it has the same guarantee as the centralized greedy strategy.

For string submodular optimization problems, we reviewed performance bounds for the greedy strategy under matroid constraints. Some related results on performance bounds for greedy strategies in string submodular maximization problems were not reviewed in this paper. For example, Golovin and Krause (2011) considered a particular class of partially observable adaptive stochastic optimization problems, and established performance bounds for the greedy strategy by introducing the notion of adaptive submodularity. Tschiatschek et al. (2017) derived performance bounds for a modified greedy strategy in submodular string optimization problems under uniform string matroid constraints.

The scope of this study is limited to the performance of the greedy strategies in deterministic optimization problems where the objective function only involves actions. Potentially fruitful areas for further research include performance bounds for the greedy strategy in stochastic optimization problems, where the objective function involves states and control actions, and real-world applications of the performance bounds in the deterministic and stochastic settings.

Bibliography


1. Ahmed, S., and A. Atamtürk, 2011: Maximizing a class of submodular utility functions. Math Program, 128, 149–169.
2. Arslan, G., J. R. Marden, and J. S. Shamma, 2007: Autonomous vehicle-target assignment: a game-theoretical formulation. J Dyn Syst Meas Control, 129, 584–596.
3. Badanidiyuru, A., B. Mirzasoleiman, A. Karbasi, and A. Krause, 2014: Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 671–680.
4. Bator, F. M., 1957: The simple analytics of welfare maximization. Am Econ Rev, 47, 22–59.
5. Bertsekas, D. P., 2005: Dynamic programming and optimal control. 3rd ed., Athena Scientific.
6. Bian, A. A., J. M. Buhmann, A. Krause, and S. Tschiatschek, 2017: Guarantee for greedy maximization of non-submodular functions with applications. In: Proceedings of the 34th International Conference on Machine Learning, 498–507.
7. Boros, E., K. Elbassioni, and L. Khachiyan, 2003: An inequality for polymatroid functions and its applications. Discrete Appl Math, 131, 255–281.
8. Buchbinder, N., M. Feldman, J. Naor, and R. Schwartz, 2012: A tight linear time (1/2)-approximation for unconstrained submodular maximization. SIAM J Comput, 44, 255–281.