Submodular Optimization Problems and Greedy Strategies: A Survey
Yajing Liu, Edwin K. P. Chong, Ali Pezeshki, Zhenliang Zhang

TL;DR
This survey reviews the effectiveness of greedy algorithms in submodular optimization problems, focusing on performance bounds and improvements for set and string submodular functions.
Contribution
It provides a comprehensive overview of performance bounds for greedy strategies in submodular optimization, including recent improvements and bounds related to curvature and Nash equilibria.
Findings
Performance bounds for greedy strategies are well-established.
Improved bounds are available considering curvature of the objective.
Batched greedy strategies and Nash equilibrium bounds are also analyzed.
Abstract
The greedy strategy is an approximation algorithm to solve optimization problems arising in decision making with multiple actions. How good is the greedy strategy compared to the optimal solution? In this survey, we mainly consider two classes of optimization problems where the objective function is submodular. The first is set submodular optimization, which is to choose a set of actions to optimize a set submodular objective function, and the second is string submodular optimization, which is to choose an ordered set of actions to optimize a string submodular function. Our emphasis here is on performance bounds for the greedy strategy in submodular optimization problems. Specifically, we review performance bounds for the greedy strategy, more general and improved bounds in terms of curvature, performance bounds for the batched greedy strategy, and performance bounds for Nash equilibria.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Optimization and Search Problems · Advanced Graph Theory Research
∎
11institutetext: 1 National Renewable Energy Laboratory (NREL), Golden, CO
2 Department of Electrical and Computer Engineering, and Department of Mathematics, Colorado State University, Fort Collins, CO
3 Alibaba iDST, Seattle, WA
Submodular Optimization Problems and Greedy Strategies: A Survey
Yajing Liu 1 Yajing Liu
Edwin K. P. Chong2 Edwin K. P. Chong
Ali Pezeshki2 Ali Pezeshki
Zhenliang Zhang3 Zhenliang Zhang
(Received: date / Accepted: date)
Abstract
The greedy strategy is an approximation algorithm to solve optimization problems arising in decision making with multiple actions. How good is the greedy strategy compared to the optimal solution? In this survey, we mainly consider two classes of optimization problems where the objective function is submodular. The first is set submodular optimization, which is to choose a set of actions to optimize a set submodular objective function, and the second is string submodular optimization, which is to choose an ordered set of actions to optimize a string submodular function. Our emphasis here is on performance bounds for the greedy strategy in submodular optimization problems. Specifically, we review performance bounds for the greedy strategy, more general and improved bounds in terms of curvature, performance bounds for the batched greedy strategy, and performance bounds for Nash equilibria.
Keywords:
Curvature greedy strategy Nash equilibrium optimization performance submodular
1 Introduction
We are often faced with choosing a set of actions from a ground set of actions to optimize an objective function. Such problems arise in a multitude of applications of interest to discrete-event dynamic system researchers. A specific example is the task assignment problem (Streeter and Golovin 2008; Zhang et al. 2016; Liu et al. 2018d), one of the fundamental combinatorial optimization problems in the study of optimization or operations research. This problem involves a number of agents and a number of tasks. Each agent successfully accomplishes a task with a certain probability and the aim is to assign the available tasks to a given number of agents such that the probability of accomplishing the tasks is maximized.
When the number of agents is relatively small, we can use brute-force search (Paar and Pelzl 2010) to enumerate all possible candidate solutions to find the optimal solution. However, when the number of agents is large, it is impractical to enumerate all the possible candidate solutions. At this point, we have to resort to approximation methods. One of the most well-studied approximation methods is the greedy strategy (Nemhauser et al. 1978), which starts with the empty set and iteratively adds to the current solution set an element that results in the largest gain in the objective function while satisfying the constraints. The greedy strategy yields an approximation to an optimal solution in a reasonable amount of time. The downside is that there is often no theoretical guarantee for the greedy strategy. But when the problem has a special property called submodularity, the greedy strategy is provably guaranteed to produce a solution with an objective value at least a constant scalar times the optimum value. Celebrated results by Fisher et al. (1978) and Nemhauser et al. (1978) prove that when the objective function is a monotone submodular set function with , the greedy strategy yields a -approximation111The term -approximation means that , where and denote a greedy solution and an optimal solution, respectively. for a general matroid and a -approximation for a uniform matroid.
For set optimization problems, the objective function is not influenced by the order of actions. However, a great number of problems in engineering and applied science aim to optimally choose a string (finite sequence) of actions over a finite horizon to maximize an objective function whose value depends on the order of actions. The problem arises in sequential decision making in engineering, economics, management science, and medicine. A motivating example is the problem of scheduling sensors to detect targets (Li et al. 2009). Suppose that a given number of sensors are distributed in a sensor field to detect a certain number of targets. The goal is to activate sensors sequentially to maximize the total coverage area. If the coverage region of each sensor remains constant over time, the total coverage area is not influenced by the order of the sensors activated, and the problem becomes a set optimization problem. However, if the sensors are moving, then the total coverage area depends on the order of the sensors activated, which makes the problem fall into the framework of string optimization problems. The optimal solution to a string optimization problem is characterized by dynamic programming via Bellman’s principle (Powell 2007). However, the approach suffers from the curse of dimensionality and is therefore impractical for many problems of interest. This motivates the study of approximation algorithms, among which the greedy strategy is easy to implement and has guaranteed performance bounds under certain conditions. For example, Streeter and Golovin (2008) prove that when the objective function is prefix and postfix monotone and has the diminishing-return property (as defined later in the paper), the greedy strategy yields a -approximation.
In this paper, we review the performance guarantees for greedy strategies in submodular maximization problems. The paper is organized as follows. In Section 2, we review results that are related to choosing sets of actions. This involves introducing set functions, set optimization problem, performance bounds for the greedy strategy, examples, curvature, improved bounds, batched actions, and noncooperative games. In Section 3, we review results related to choosing strings of actions. This involves introducing new notation and terminology, string optimization problem, performance of the greedy strategy, and applications. In Section 4, we conclude by listing a number of related papers that consider extensions and/or variation of greedy strategies and their performance bounds in combinatorial optimization problems.
2 Sets of Actions
In this section, we first introduce our notation for sets, properties of set functions, and set optimization problems. Then, we review various performance bounds for the greedy strategy.
2.1 Set Functions
Before we introduce functions defined on sets, we would like to introduce some similar and familiar properties for functions defined on real numbers. Consider a real function . The function is said to be monotone and submodular if it satisfies properties 1 and 2 below, respectively:
Monotone: , .
- 2.
Submodular: , ,
The ‘monotone’ property here simply means being ‘nondecreasing’. The function in Fig. 1 satisfies the monotone property. From Fig. 1, we can see that the function is a concave function – adding to gains more than adding to , which tells us that the additional value accrued by adding a number to a smaller number is larger than adding it to a bigger number. This is consistent with the inequality for , so we say that ‘submodularity’ here boils down to ‘concavity’ in some sense.
In this paper, we want to go beyond the real line to a more general setting. Specifically, we will consider objective functions with multiple decision “actions” as arguments. The first setting is sets of actions, and the second one is strings (ordered sets) of actions. We introduce functions defined on sets first.
Let denote a ground set, which includes all possible actions. Let denote the power set of , which includes all possible subsets of . The size or cardinality of a set is denoted by , and the empty set is denoted by . Define a set function : . The set function is said to be monotone and submodular if it satisfies properties i and ii below, respectively:
- i.
Monotone: , .
- ii.
Submodular: and , .
Notice the similarity between these properties and those involving functions on the real line introduced earlier.
For convenience, we denote the incremental value of adding a set to the set as (following the notation in Conforti and Cornuéjols (1984)).
A set function : is called a polymatroid set function (Boros et al. 2003) if it is monotone, submodular, and . Submodularity in property ii means that the additional value accruing from an extra action decreases as the size of the input set increases, and is also called the diminishing-return property in economics. Submodularity has many equivalent definitions; for example, is submodular if , . For more equivalent definitions, see Nemhauser et al. (1978).
The set function is called supermodular if is submodular. Moreover, is called modular if it is both submodular and supermodular, i.e., for any ,
[TABLE]
By induction, (1) implies that for any ,
[TABLE]
By (2), is additive when is modular. If , then , which implies that is additive. It is also easy to check that is modular iff for any subset ,
[TABLE]
for some weight function (Krause and Golovin 2012).
There are many non-trivial examples of submodular or supermodular set functions. We only consider submodular maximization problems in this paper, so we only give submodular function examples. For supermodular examples, see Lovász (1983). To easily understand submodularity, we provide a simple example as follows.
Example 1
Sensor Coverage. Let be a family of locations in space where we can place sensors. If a sensor is placed at a particular location in space, it covers a circular area around it as illustrated in Fig. 2. Let denote the total area covered if we place sensors at locations (see Fig. 2). The set function is submodular. An instance of submodularity is illustrated in the figure. As can be seen, the gain in adding sensor after placing sensor is larger than the gain in adding sensor after placing sensors . ∎
Submodular functions arise in many applications, such as the rank function of the matrix formed by its columns, weighted coverage functions, the rank function of a matroid, Shannon entropy, mutual information, cut capacity functions, some measurements on the graph, etc. (Lovász 1983; Krause and Golovin 2012)
2.2 Submodular Set Optimization Problem
Submodular set optimization plays an important role in combinatorial optimization. It has a wide range of applications, including generalized assignment (Shmoys and Tardos 1993; Cohen et al. 2006; Nauss 2003; Fleischer et al. 2006; Bator 1957; Korula et al. 2015; Vondrák 2008), matroid partition (Edmonds and Fulkerson 1965; Cunningham 1986; Knuth 1973), maximum cut (Goemans and Williamson 1995; Sahni and Gonzalez 1976), maximum coverage location (Church and Velle 1974; Khuller et al. 1999; Cornuéjols et al. 1977), multi-agent coverage problem (Sun et al. 2017), leader-selection problem in multi-agent systems (Clark and Poovendran 2011), welfare maximization (Korula et al. 2015; Vondrák 2008; Kapralov et al. 2013), and data summarization (Lin and Bilmes 2011; Badanidiyuru et al. 2014; Mirzasoleiman et al. 2017). The aim is to find a set of actions satisfying some constraints to maximize the objective function. The set optimization problem can be formulated as follows:
[TABLE]
where is a non-empty collection of subsets of a finite set , and is a real-valued submodular set function defined on the power set of . Before proceeding any further with discussing optimization problem (5), we will need to introduce some concepts related to the constraint set .
Let be a finite set, and be a non-empty collection of subsets of . The collection is said to be hereditary if it satisfies property i below and has the augmentation property if it satisfies property ii below:
- i.
Hereditary: For all , any set is also in .
- ii.
Augmentation: For any , if , then there exists such that .
The pair is called an independence system if it satisfies property i. In this case, the sets in are called independent sets. A maximal independent set is an independent set that is not a subset of any other independent set (Conforti and Cornuéjols 1984). The independence system is called a matroid if it satisfies property ii (Edmonds 1970). The pair is called a uniform matroid if for a given (Nemhauser et al. 1978). All maximal independent sets in a matroid have the same cardinality. We call this cardinality the rank of the matroid. In the uniform matroid above, the rank is .
Example 2
We now give three example collections to illustrate the notions of independence systems and matriods. Let , , , and . It is easy to check that satisfies the hereditary property but not augmentation, satisfies augmentation but not the hereditary property, and satisfies both hereditary and augmentation properties. Hence, is an independence system, is a matroid, and is neither an independence system nor a matroid. The maximal independent sets in are and , and only has one maximal independent set . ∎
Let be an independence system where is nonempty, and let be an arbitrary subset of . A basis of is a subset of that satisfies the following two conditions:
- It is an independent set; i.e., .
- It is maximal; i.e., is not a subset of any other independent subset of . The subset satisfying the above two conditions is also called a maximal independent subset of . Define
[TABLE]
[TABLE]
Note that lr() and ur() might not be well defined, depending on . Note also that in the definition above, is not necessarily in . The number
[TABLE]
is called the rank quotient of (Hausmann et al. 1980).
Example 3
To illustrate the concept of rank quotient, again consider the independence system given in Example 2. We now consider all the subsets of and calculate their lower and upper ranks. If is a singleton (i.e., , , or ), then has only one basis, which is itself. In this case, , which means that .
If , its bases are and . Again, , which means that . Note that is not a basis of because it does not belong to . If , it has only one basis, which is itself, and again . If , its bases are and , in which case again.
If , the bases are and . So, and , which implies that .
Because the rank quotient is the smallest among the ratios calculated above, we deduce that . ∎
Example 4
As in Example 3, we can similarly check that . In fact, the rank quotient of any matroid is equal to , because for any susbset , (Edmonds 1966). The rank quotient of an independence system can be regarded as a measure of how much differs from being a matroid. ∎
For any independence system , if there exist matroids () such that , then the pair is called the intersection of the matroids (Hausmann et al. 1980).
Finding the optimal solution to (5) in general is NP-hard. The greedy strategy provides a tractable way to approximately solve the problem, which starts with the empty set, and incrementally adds an element to the current solution set giving the largest gain in the objective function under the constraints. Although the greedy strategy yields an approximate solution, its performance might be arbitrarily poor. However, when the optimization problem has the further special structure of being polymatroid, the greedy strategy has provable guarantees. The celebrated results by Fisher et al. (1978) and Nemhauser et al. (1978) show that the greedy strategy provides a good approximation to the optimal solution when the objective function is a polymatroid set function under both general matroid constraints and uniform matroid constraints. We will review the performance of the greedy strategy for (5) under different constraints in the following section.
2.3 Performance Bounds for Greedy Strategy
First we introduce definitions of the optimal strategy and the greedy strategy. Optimal Set: Any set is called an optimal solution of Problem (5) if
[TABLE]
where argmax denotes the set of actions that maximize .
Greedy Algorithm:
Input: A pair , a set function
Output: A subset
For ,
. If , set ; otherwise, stop and set .
Any output of the above algorithm is called a greedy solution. Note that there may exist more than one optimal solution or more than one greedy solution. How good is a greedy solution compared to an optimal solution in terms of the objective function? In the following theorems, we review performance bounds for the greedy strategy under different constraints. These bounds are worst-case performance bounds, which means that the greedy strategy performs much better than those bounds in many cases.
Theorem 2.1
(Hausmann et al. 1980)*
Let be an independence system. If is additive on , i.e., for any subset , then any greedy solution satisfies*
[TABLE]
where is the rank quotient defined in Section 2.2. Furthermore, for some function , (7) holds with equality.
Remark 1
When is a matroid, . By Theorem 2.1, the greedy strategy is optimal when is a matroid and the objective function is additive.
Remark 2
When is the intersection of matroids, then (Hausmann et al. 1980). So when , i.e., is a matroid, the greedy strategy is optimal, which is consistent with Remark 1.
Example 5
We provide an example222We thank the anonymous reviewer for this example. to demonstrate the performance bound in Theorem 2.1. Let , and consider the collection of subsets
[TABLE]
Define a function such that . Let , and .
It is easy to check that is an independence system. If , it has bases and , which results in . Because the maximum cardinality of the maximal independent subsets of any is 4, for any set with . Therefore, . The greedy solution is with and the optimal solution is with , which satisfy the bound . In fact, the bound holds with equality if we lower to exactly . ∎
The following theorem bounds the performance of the greedy strategy when is the intersection of matroids and is a polymatroid set function.
Theorem 2.2
(Fisher et al. 1978)*
Let be the intersection of matroids and a polymatroid set function. Then any greedy solution satisfies*
[TABLE]
Remark 3
The condition that is additive in Theorem 2.1 is stronger than the condition that is a polymatroid set function in Theorem 2.2, so the bound in Theorem 2.1 is stronger than the bound in Theorem 2.2.
Remark 4
The bound can be achieved for any positive integer . When , is a matroid, and the bound becomes , which means that the greedy strategy yields -approximation for general matroid constraints.
Remark 5
Theorem 2.2 requires that . If , the following performance bound holds
[TABLE]
The following theorem provides a performance bound for the greedy strategy when is a uniform matroid and is a polymatroid set function.
Theorem 2.3
(Nemhauser et al. 1978)*
Let be a uniform matroid and a polymatroid set function. Then any greedy solution satisfies*
[TABLE]
where is the rank of the uniform matroid and is the base of the natural logarithm.
Remark 6
The bound is stronger than the bound when in Theorem 2.2, because the uniform matroid is a special matroid.
Remark 7
The bound is decreasing in and tends to when goes to infinity. When , the bound becomes 1, which is consistent with the fact that the greedy strategy chooses the best action at each stage.
Remark 8
The bound is tight, which means that it can be achieved for each (Nemhauser et al. 1978).
Remark 9
By Theorem 2.2, the greedy strategy only achieves a -approximation under general matroid constraints. However, Calinescu et al. (2011) proved that a variant of the greedy strategy yields a -approximation under general matroid constraints.
2.4 Examples
We introduce two examples – a task scheduling problem and an adaptive sensing problem – to illustrate polymatroid set functions. In both problems, is a uniform matroid and hence the greedy strategy satisfies a -approximation.
Task Assignment Problem: The task scheduling problem was posed by Streeter and Golovin (2008), and was also analyzed in Zhang et al. (2016) and Liu et al. (2018d). In this problem, there are subtasks and a set of agents. At each stage, a subtask is assigned to an agent , who accomplishes the task with probability . Let denote the Bernoulli random variable that signifies whether or not subtask has been accomplished after assigning the set of agents over stages. Then is the fraction of subtasks accomplished after stages by employing agents . The objective function for this problem is the expected value of this fraction, which can be written as
[TABLE]
The aim is to choose a set of agents to maximize this objective function.
Assume that for any . Then it is easy to check that is monotone, submodular, and , which implies that is a polymatroid set function.
Adaptive Sensing: As our second example application, we consider the adaptive sensing design problem posed in Zhang et al. (2016) and Liu et al. (2018d). Consider a signal of interest with normal prior distribution , where is the identity matrix; our analysis easily generalizes to dimensions larger than . Let , where for . At each stage , we make a measurement of the form
[TABLE]
where and represents i.i.d. Gaussian measurement noise with mean zero and covariance , independent of .
The objective function for this problem is the information gain, which can be written as
[TABLE]
Here, is the entropy of the prior distribution of and is the entropy of the posterior distribution of given ; that is,
[TABLE]
where for
[TABLE]
is the posterior covariance of given . The objective is to choose a set of measurements to maximize the information gain .
It is easy to check that is monotone, submodular, and ; i.e., is a polymatroid set function.
2.5 Curvature
As we saw in Section 2.1, submodularity is a second-order property by analogy to concavity. If we can quantify this second order property, then we can get tighter bounds. One way to quantify the second order property is to use the total curvature, defined by Conforti and Cornuéjols (1984):
[TABLE]
To see that this is a second-order property, rewrite it in terms of differences of differences:
[TABLE]
For convenience, we use to denote when there is no ambiguity. Note that when is a polymatroid set function, and when is modular. When is modular, is additive. If we consider as the objective function, then the greedy strategy achieves optimality. Therefore, in the rest of the paper, when we assume that is a polymatroid set function, we only consider .
Conforti and Cornuéjols (1984) provided performance bounds in terms of the total curvature for the greedy strategy under independence system, general matroid, and uniform matroid constraints, which will be reviewed as follows.
Theorem 2.4
(Conforti and Cornuéjols 1984)* If is an independence system with ur and lr, and is is a polymatroid set function with total curvature , then any greedy solution satisfies*
[TABLE]
and this bound is tight for all .
Theorem 2.5
(Conforti and Cornuéjols 1984)*
If is a matroid and is a polymatroid set function with total curvature , then any greedy solution satisfies*
[TABLE]
Moreover, if is a uniform matroid with rank , then any greedy solution satisfies
[TABLE]
Remark 10
When is a matroid, the bound is stronger than the bound in Theorem 2.2 because when is a polymatroid set function and is nonincreasing in .
Remark 11
The function is nonincreasing in , and therefore when is a polymatroid set function. Also it is easy to check that for , which implies that the bound for the uniform matroid constraints is stronger than the bound for the general matroid constraints.
Remark 12
The two bounds in terms of the total curvature are both tight; for proofs, see Conforti and Cornuéjols (1984).
Remark 13
There are other notions of curvatures that can be used to characterize the second-order property of the set function , such as the greedy curvature defined by Conforti and Cornuéjols (1984) and the elemental curvature defined by Wang et al. (2014). Performance bounds in terms of the corresponding curvatures were also derived by Conforti and Cornuéjols (1984) and Wang et al. (2014) under different constraints.
Example: Consider again the task assignment example from Section 2.4. For convenience, we only consider the special case ; our analysis can be generalized to any . For , we have
[TABLE]
where .
Let us order the elements of as such that
[TABLE]
Then by the definition of the total curvature , we have
[TABLE]
which is consistent with our conclusion that .
2.6 Improved Bounds
The performance bounds of Conforti and Cornuéjols (1984) reviewed in Section 2.5, are the best bounds in terms of the total curvature for general matroid constraints and uniform matroid constraints, respectively. However, the total curvature depends on function values on sets outside the constraint matroid. If we are given a function defined only on the matroid, problem (5) still makes sense, but the bounds involving do not apply. Liu et al. (2018a, 2019) investigated modified bounds that overcome this drawback. The idea is first to extend a polymatroid set function defined on the matroid to one defined on the entire power set, and then apply the results from Conforti and Cornuéjols (1984). However, not every polymatroid function defined on the matroid can be extended to one defined on the entire power set.
In Liu et al. (2019), they first provide necessary and sufficient conditions for the existence of an incremental extension of a polymatroid set function defined on the uniform matroid of rank to one defined on the uniform matroid of rank . Whenever a polymatroid objective function defined on a matroid can be extended to the entire power set, the greedy approximation bounds involving the total curvature of the extension apply. However, the bounds still depend on sets outside the matroid. Motivated by this, Liu et al. (2019) defined a new notion of curvature called partial curvature, involving only sets in the matroid. They derived necessary and sufficient conditions for an extension of the function to have a total curvature that is equal to the partial curvature. Moreover, they proved that the bounds in terms of the partial curvature are in general improved over the previous ones.
The following theorems state the necessary and sufficient conditions for the existence of an extension of a polymatroid set function defined on the uniform matroid of rank to one defined on the uniform matroid of rank .
Theorem 2.6
(Liu et al. 2019)*
Let be a polymatroid function defined on the uniform matroid of rank . Then can be extended to a polymatroid function defined on the uniform matroid of rank if and only if for any with , any with , and any ,*
[TABLE]
where .
Construction: If is extendable, then an extension can be constructed as follows: For any with , ; For any with ,
[TABLE]
where satisfies
[TABLE]
Note that is itself an extension of from to the entire , and the extended is a polymatroid function on . Therefore, we have that , where is the set of all polymatroid functions on that agree with on . So if a polymatroid set function defined on the matroid can be extended to one defined on the whole power set, applying the performance bounds in Theorem 2.5 results in the following theorem.
Theorem 2.7
(Liu et al. 2019)*
Let be a matroid of rank and a polymatroid function. If there exists an extension of to the entire power set, then any greedy solution to problem satisfies*
[TABLE]
where . In particular, when is a uniform matroid, any greedy solution to problem satisfies
[TABLE]
Remark 14
The bounds and apply to problems where the objective function is a polymatroid function defined only for sets in the matroid and can be extended to one defined on the entire power set. However, these bounds still depend on sets not in the matroid, because of the way is defined.
Then Liu et al. (2019) defined a new curvature called the partial curvature as follows:
[TABLE]
and the partial curvature satisfies that when is an extension of from to . The following theorem provides necessary and sufficient conditions for the existence of an extension to have .
Theorem 2.8
(Liu et al. 2019)*
Let be a matroid and a polymatroid function. Let be a polymatroid function that agrees with on . Then if and only if*
[TABLE]
for any , and equality holds for some .
Liu et al. (2019) provided the following improved bounds for the greedy strategy if there exists an extension of such that .
Theorem 2.9
(Liu et al. 2019)*
Let be a matroid of rank . Let be a polymatroid function that agrees with on such that for any with equality holding for some . Then, any greedy solution to problem satisfies*
[TABLE]
In particular, when is a uniform matroid, any greedy solution to problem satisfies
[TABLE]
Remark 15
The bounds and do not depend on sets outside the matroid, so they apply to problems where the objective function is only defined on the matroid, provided that an extension that satisfies the assumptions in Theorem 2.8 exists. When is defined on the entire power set, , which implies that the bounds are stronger than those from Conforti and Cornuéjols (1984).
Next consider again the task assignment problem from Section 2.4. Liu et al. (2019) gave an extension of defined on the uniform matroid of rank to the whole power set with , which is reviewed as follows.
Example: Let , , , , and . Then, is defined as in (10) for any . Let , then . It is easy to show that is a polymatroid function.
The polymatroid function constructed using (12) while satisfying (13) and (17) from Liu et al. (2019) is of the following form:
.
The total curvature of is
[TABLE]
By Theorem 2.9, the greedy strategy for the task scheduling problem satisfies the bound , which is better than the previous bound .
2.7 Batch Actions
Suppose we batch the selected actions into batches of size . What results is the -batch greedy strategy, which starts with the empty set and iteratively adds to the current solution set a batch of elements with the largest gain in the objective function under the constraints. The greedy strategy we considered in Sections 2.3–2.5 is a special case of the batched greedy with batch size equal to . Intuitively, larger should result in better performance, albeit at the expense of increasing computational complexity. But how do the previous bounds improve as a function of ? In this section, we review performance bounds for the -batch greedy strategy.
We start by introducing the -batch greedy strategy as follows. Consider again problem (5) and write the maximal cardinality of the sets in as , where are nonnegative integers and . Note that is not necessarily the remainder of , because could be equal to . This happens when divides . The -batch greedy strategy is as follows (Liu et al. 2018c, d):
Step 1: Let and .
Step 2: Select such that , , and
[TABLE]
then set .
Step 3: If , set , and repeat Step 2.
Step 4: If , select such that , , and
[TABLE]
Step 5: Return the set and terminate.
Any set generated by the above procedure is called a -batch greedy solution. For the above strategy, there are steps in total, and exactly actions are selected at each of the first steps but the final step may select fewer than actions. A similar batched greedy strategy is investigated by Hausmann et al. (1980) called the -greedy strategy, where at most actions are selected at each stage.
The performance of the -batch greedy strategy under uniform matroid constraints was first investigated by Nemhauser et al. (1978), stated as follows.
Theorem 2.10
(Nemhauser et al. 1978)* If is a uniform matroid of rank and is a polymatroid set function, then any -batch greedy solution satisfies*
[TABLE]
Remark 16
When , i.e., the batch size divides the rank , the bound is tight; see Nemhauser et al. (1978) for proof.
By introducing the total -batch curvature
[TABLE]
where , Liu et al. (2018d) derived performance bounds in terms of for the -batch greedy strategy under both general matroid and uniform matroid constraints, and investigated the monotoneity of the performance bounds with respect to the batch size .
Theorem 2.11
(Liu et al. 2018d)* Assume that is a polymatroid set function. When is a general matroid, then any -batch greedy solution satisfies*
[TABLE]
When is a uniform matroid, then any -batch greedy solution satisfies
[TABLE]
Remark 17
When , the bound for general matroid constraint becomes and the bound for uniform matroid constraints becomes , which is consistent with the results in Theorem 2.5.
Remark 18
The total -batch curvature is nonincreasing in , i.e., whenever (Liu et al. 2018d).
Remark 19
Based on Remark 18, we can discuss the monotoneity of the bounds for both general matroid and uniform matroid constraints. The bound for general matroid constraints is monotone in . For uniform matroid constraints, when the batch size divides , the bound becomes
[TABLE]
which is monotone in . Moreover,
[TABLE]
which means that the bound for uniform matroid constraints is better than the bound for general matroid constraints. However, if does not divide , the exponential bound might be worse than the harmonic bound. For example, when , and , the exponential bound is , which is worse than the harmonic bound (Liu et al. 2018d).
Examples: Now consider again the task assignment and adaptive sensing problems from Section 2.4 to demonstrate that the total curvature decreases in and the performance bound for a uniform matroid increases in under the condition that the batch size divides the rank .
Task Assignment Problem: We still order the elements of as such that
[TABLE]
Then by the definition of the total curvature , we have
[TABLE]
From the expression of , we can see that is nonincreasing in , but when is large, is close to 1 for each .
To numerically evaluate the relevant quantities here, Liu et al. (2018d) randomly generated a set . In Fig. 3, they considered , and batch sizes . Fig. 3 shows that the exponential bound for is worse than that for , which illustrates our earlier remark that the exponential bound for the uniform matroid case is not necessarily monotone in even though is monotone in . Fig. 3 also shows that the exponential bound coincides with for and it is nondecreasing in , which illustrates our remark that the exponential bound is nondecreasing in under the condition that divides . Owing to the nature of the total curvature for this example, it is not easy to see that is nonincreasing in (all values here are very close to 1).
Adaptive Sensing: For convenience, set . Then, we have
[TABLE]
where , , and .
We already saw that the exponential bound for the uniform matroid case is not necessarily monotone in from the task assignment problem, so we will only consider the case when the batch size divides . Liu et al. (2018d) considered for in Fig. 4. The figure shows that the curvature decreases in and the exponential bound increases in since divides for , which again demonstrates the claim that decreases in and the exponential bound increases in under the condition that divides .
2.8 Noncooperative Games
In the previous sections, we reviewed performance bounds for greedy-type strategies in set submodular optimization problems. It turns out that similar techniques can be used to bound the performance of Nash equilibria in noncooperative games–utility maximization problems. The connection to the game setting is easy to imagine by associating the objective function in set optimization with a social utility function in games, greedy strategies with Nash equilibria, and batching with cooperation of subgroups in games. We first introduce some background on utility maximization problems and Nash equilibria.
A great number of interesting practical problems can be posed as utility maximization problems: these include facility location (Ahmed and Atamtürk 2011), traffic routing and congestion management (Arslan et al. 2007; He et al. 2007), sensor selection (Rowaihy et al. 2007; Liu et al. 2014), and network resource allocation (La and Anantharam 2002; Palomar and Chiang 2007). In a utility maximization problem, a set of users make decisions according to their own set of feasible strategies, resulting in an overall social utility value, such as profit, coverage, achieved data rate, and quality of service. The goal is to maximize the social utility function. Often, the users do not cooperate in selecting their strategies.
In general, it is impractical to find the optimal strategy maximizing the social utility function. However, it is feasible to consider scenarios where individual users or groups of users separately maximize their own private objective functions. The usual framework for studying such scenarios is game theory together with its celebrated notion of Nash equilibria. A Nash equilibrium is a set of strategies (deterministic or randomized) for which no user can improve its own private utility by changing its strategy unilaterally. Nash (1951) proved that any finite and non-cooperative game has at least one Nash equilibrium.
The performance of Nash equilibria compared with the optimal solution in submodular utility maximization problems was investigated by Vetta (2002). Based on the existing results, Liu et al. (2018b) established bounds for Nash equilibria when there is “grouping” among users, which is useful in understanding the role of cooperation and social ties in games. Before we review these results, we introduce some notation and terminology from Vetta (2002) and Liu et al. (2018b).
Suppose we have a set of users. Each element in represents an act that user can take. We call a set of acts an action, and if an action is available to user we call it a feasible action. We denote by the set of all feasible actions for user , i.e., is a feasible action, with the cardinality of . We call the action space for user . A pure strategy is one in which the user takes a specific action. A mixed strategy is one in which the user takes actions according to some probability distribution. The set of mixed strategies is called the strategy space. We represent the strategy space for user by , where is called a strategy taken by user and is the probability with which user takes action . When for some and for all , user is said to take a pure strategy. Otherwise, user takes a mixed strategy. Write . The indexed set , with and , is called a strategy set of size in .
Given a strategy set , the set is the subset of that contains strategies taken by all users except user , and is the strategy set that results from when user changes its strategy from to .
The expected social utility function and expected private utility function for user from strategies in to real numbers are denoted by and , respectively. Define for any set and .
Now we introduce the definition of a Nash equilibrium and a valid system, then review performance bounds for Nash equilibria under some conditions from Vetta (2002).
Definition 1
A strategy set is a Nash equilibrium if no user has an incentive to unilaterally change its strategy, i.e., for any user ,
[TABLE]
Assumption 1
(Vetta 2002)* The private utility of user () is at least as large as the loss in the social utility resulting from user dropping out of the game. That is, the system () has the property that for any strategy set ,*
[TABLE]
Assumption 2
(Vetta 2002)* The sum of the private utilities of the system is not larger than the social utility, i.e., for any strategy set ,*
[TABLE]
A utility system satisfying Assumptions 1 and 2 is called a valid system. We denote by the optimal strategy set in maximizing an expected utility function , and assume that is composed of pure strategies , . For convenience, we also use to denote the optimal action that user takes. Consider a strategy set where . Suppose that user () uses a mixed strategy that takes actions with probabilities . We use the notation to represent the strategy in which user takes the actions with probabilities , and user takes the action , so is well defined.
Theorem 2.12
(Vetta 2002)* For a valid utility system , if the expected social utility function is submodular, then for any Nash equilibrium we have*
[TABLE]
Remark 20
If is monotone, then and the above inequality shows that any Nash equilibrium achieves at least of the optimal social utility function value.
By defining the curvature of the expected social utility function ,
[TABLE]
Vetta (2002) derived the following tighter performance bound in terms of the curvature for Nash equilibria.
Theorem 2.13
(Vetta 2002)* For a valid utility system , if the expected social utility function is monotone and submodular, then for any Nash equilibrium we have*
[TABLE]
Remark 21
When the expected social utility function is monotone and submodular, we have , which implies that .
Next we review performance bounds for group Nash equilibria defined by Liu et al. (2018b). They considered the case where the set of all users in the utility maximization system are divided into disjoint groups, and the users in the same group choose their strategies by maximizing their group utility function jointly.
Assume that the set of users is divided into disjoint groups, in which group () has users , where , is the number of users in group , and . Let denote the group strategy for group , where is the strategy for user . This includes the strategies taken by all the users in group (). Let denote the set of group strategies taken by all groups except for group and denote the group strategy set obtained when group changes its group strategy from to . Let denote the expected group utility function for group . Define for any and ().
Definition 2
A strategy set is a group Nash equilibrium of a utility system if no group can improve its group utility by unilaterally changing its group strategy, i.e., for any ,
[TABLE]
where for .
The utility system is valid if it satisfies the following two assumptions (Liu et al. 2018b).
Assumption 3
The group utility of group is at least as large as the loss in the social utility resulting from all the users in group dropping out of the game. That is, the system has the property that for any strategy set ,
[TABLE]
Assumption 4
The sum of the group utilities of the system is not larger than the social utility, i.e., for any strategy set ,
[TABLE]
Theorem 2.14
(Liu et al. 2018b)*
For a valid utility system , if the expected social utility function is submodular, then any group Nash equilibrium satisfies*
[TABLE]
To better characterize the relation of the social utility value of any group Nash equilibrium and that of the optimal solution , Liu et al. (2018b) defined the group curvature of the social utility function for group as
[TABLE]
Theorem 2.15
(Liu et al. 2018b)*
For a valid utility system , if the expected social utility function is monotone and submodular, then any group Nash equilibrium satisfies*
[TABLE]
In particular, if , we have
[TABLE]
where .
Remark 22
When the expected group utility function is monotone and submodular, it is easy to check that , which implies that .
Remark 23
When the expected group utility function is monotone and submodular, we have . This shows that the bound for the case with grouping is tighter than that for the case without grouping. Of course, this is unsurprising, because grouping entails cooperation. Moreover, under the condition that each user has the same strategy space, the larger the value of , the higher the degree of cooperation, and the tighter the lower bound.
3 Strings of Actions
In Section 2, we considered the optimization problem where the argument of the objective function is a set of actions. Suppose the objective function depends not only on the set of actions but also on the order of actions. We call the argument of the objective function a string of actions. In this section, we introduce notation and terminology for strings and string functions, string optimization problem, performance bounds for the greedy strategy, and applications.
3.1 Notation and Terminology
Let be a set of all possible actions. We use () to denote a string of actions taken over consecutive stages. We define its length as , denoted by . Note that corresponds to the empty string, denoted by .
Let denote the set of all possible strings of actions. If two strings in are expressed by and , we write iff and for each . Moreover, we define string concatenation as .
We write if we have for some . In this case, we also say that is a prefix of . We write if there exists a set of strings such that . Note that is weaker than , which means implies , but the converse is not necessarily true.
Similar to the definition of a polymatroid set function in Section 2.1, we define a function from strings to real numbers, , a polymatroid string function if
- i.
.
- ii.
has the prefix-monotone property: .
- iii.
has the diminishing-return property: , .
A function is postfix monotone if
[TABLE]
Notice the difference between the prefix-monotone property and postfix-monotone property.
Let denote a collection of strings from . The pair is called a string matroid (Zhang et al. 2016) if satisfies the following properties:
- i.
is non-empty;
- ii.
Hereditary: implies that ;
- iii.
Augmentation: and , there exists an element in the string such that .
The length of the longest string in is called the rank of . When , the pair is called a uniform string matroid of rank .
3.2 String Optimization Problem
In this section, we first formulate the string optimization problem and define the greedy strategy. Then we review performance bounds for the greedy strategy under uniform string matroid constraints and general string matroid constraints.
In a variety of problems in engineering and applied science such as sequential decision making (Littman 1996; Roijers et al. 2013), adaptive sensing (Liu et al. 2014; Krause et al. 2008), and adaptive control (Jarvis 1975; Schlegel et al. 2005), we are faced with optimally choosing a string (ordered set) of actions over a finite horizon to maximize an objective function under some constraints. We call this class of optimization problems string optimization. For set optimization problems, the objective function is not influenced by the order of actions. However, for string optimization problems, the objective function depends on the order of actions. Let be an objective function. The goal is to find a string , with the constraint , to maximize the objective function:
[TABLE]
where denotes the set of all possible strings of actions and is a collection of strings from .
The solution to the string optimization problems can be characterized using dynamic programming via Bellman’s principle (Bertsekas 2005; Powell 2007). However, dynamic programming suffers from the curse of dimensionality and is therefore impractical for many problems of interest. Hence, we often turn to approximation techniques. One approximation technique is the greedy strategy, which is to find an action at each stage to maximize the step-wise gain in the objective function. The performance for the greedy strategy in string optimization problems has been investigated by Streeter and Golovin (2008), Zhang et al. (2016), and Liu et al. (2015). And these specific results will be reviewed in this section.
Assume that the rank of is . We now define optimal and greedy strategies for problem (30) and some related notation.
Optimal String: Any string is called an optimal solution of Problem (30) if
[TABLE]
If is prefix monotone, then there exists at least one optimal string of length , denoted by .
Greedy Algorithm:
Input: A string matroid of rank , a set function , collection , size
Output: A string
For ,
Any output of the above algorithm is called a greedy solution. There may exist more than one greedy solution.
3.3 Performance Bounds for Greedy Strategy
Streeter and Golovin (2008) first derived performance bounds for the greedy strategy under uniform string matroid constraints, stated as follows.
Theorem 3.1
(Streeter and Golovin 2008)*
Let be a uniform string matroid. If is a polymatroid string function and postfix monotone, then any greedy string satisfies*
[TABLE]
Remark 24
The same bound holds if satisfies for , which is weaker than being postfix monotone.
Zhang et al. (2016) investigated performance bounds for the greedy strategy under both uniform string matroid and general string matroid constraints by defining the following curvatures.
The total backward curvature of is defined as (Zhang et al. 2016)
[TABLE]
When is postfix monotone and diminishing return, we have . The total backward curvature is an upper bound on the second-order difference, over all possible actions and strings . Next, Zhang et al. (2016) defined the total backward curvature of with respect to the optimal string by
[TABLE]
When is postfix monotone and string submodular, it is easy to prove that .
Theorem 3.2
(Zhang et al. 2016)*
Let be a uniform string matroid of rank . If is a polymatroid string function, then any greedy string satisfies*
[TABLE]
Moreover, if is postfix monotone, then any greedy string satisfies
[TABLE]
Remark 25
When is polymatroid and postfix monotone, we have by (32). So we have and , which implies that Theorem 3.2 provides better bounds than Theorem 3.1.
Theorem 3.3
(Zhang et al. 2016)*
Let be a string matroid. If is a polymatroid string function, then any greedy string satisfies*
[TABLE]
Moreover, if is postfix monotone, then any greedy string satisfies
[TABLE]
From Theorems 3.1 and 3.2, we can see that all the sufficient conditions obtained so far involve strings of length greater than , even though (30) involves only strings up to length . Liu et al. (2015) derived sufficient conditions, which only involve strings of length at most , to have the same bounds hold for uniform string matroid constraints, by defining the following conditions.
A function is -polymatroid if
- i.
.
- ii.
is -monotone: and , .
- iii.
is -diminishing: and , , .
Let and for . Then, is -GO-concave (Liu et al. 2015) if for ,
[TABLE]
Theorem 3.4
(Liu et al. 2015)*
Let be a uniform string matroid. If is -polymatroid, then any greedy string satisfies*
[TABLE]
By defining the curvature ,
[TABLE]
Liu et al. (2015) derived more general performance bounds in terms of the curvature.
Theorem 3.5
(Liu et al. 2015)*
Let be a uniform string matroid. If is -polymatroid and -GO-concave, then any greedy string satisfies*
[TABLE]
Remark 26
If is -GO-concave, then we have .
Examples: We again consider the task assignment problem and adaptive sensing problem from Section 2.4 to give some sufficient conditions on the parameters of the problems to achieve the performance bound .
Task Assignment Problem: We use to denote the probability of accomplishing subtask at stage when it is assigned to agent . Let be the index of the agent selected at stage , the objective function becomes
[TABLE]
For simplicity, we consider the case of (our results can easily be generalized to the case where ). For , the objective function reduces to
[TABLE]
and from here on we simply use in place of .
Note that the value of depends on the order of the agents selected over time when the probabilities vary from stage to stage. In other words, suppose that we have two agents, Alice and Bob. Then, in general, , , , and . Therefore, .
It is easy to check that is -monotone and .
Assume that , where and . By Zhang et al. (2016), a sufficient condition for to be diminishing return is
[TABLE]
where
[TABLE]
Let and . By Liu et al. (2015), a sufficient condition for to be -diminishing is
[TABLE]
and a sufficient condition for -GO-concavity is
[TABLE]
When all , then (41) and (40) automatically hold, but (39) is not necessarily satisfied. In that sense, the -monotone, -diminishing, and -Go concavity conditions of Theorem 3.4 are weaker sufficient conditions for achieving the bound than the prefix monotone, diminishing-return, and postfix monotone conditions of Theorem 3.1.
Adaptive Sensing: Consider the situation where the additive noise set is independent but not identically distributed. Assume that is a Gaussian vector with mean zero and covariance , where denotes the identity matrix. Recall the problem formulation in Section 2.4. The objective function for this problem is as follows:
[TABLE]
where and for ,
[TABLE]
From the expression above, it is easy to check that the order of influences the objective function value under the assumption that take different values. For example,
[TABLE]
and
[TABLE]
If , then .
By Liu et al. (2015), some sufficient conditions for to be -polymatroid and -GO-concave are
[TABLE]
for .
By Zhang et al. (2016), to achieve the bound , it requires both (42) and
[TABLE]
where is the interval that contains all the ’s.
Comparing the sufficient conditions for achieving the same bound from Liu et al. (2015) and Zhang et al. (2016), we see that the conditions from Liu et al. (2015) are weaker.
4 Final Remarks
In this survey, we considered two classes of submodular maximization problems: set submodular maximization and string submodular maximization. For set submodular optimization, we reviewed performance bounds for the greedy strategy under matroid constraints, improved performance bounds, and performance bounds for the batched greedy strategy. There are many important results about performance of the greedy strategy under some other constraints and conditions. Wolsey (1982), Sviridenko (2004), and Kulik et al. (2009) derived performance bounds for the greedy strategy in submodular maximization problems subject to a knapsack constraint and multiple linear constraints. Bian et al. (2017) established performance bounds for the greedy strategy in monotone but nonsubmodular maximization problems under uniform matroid constraints. People also investigated performance bounds for some variations of greedy strategies. Calinescu et al. (2011) and Feldman et al. (2011) derived performance bounds for a randomized continuous greedy algorithm and a unified continuous greedy algorithm in monotone submodular maximization problems, respectively. Buchbinder et al. (2012) established performance bounds for an adaptive greedy algorithm in unconstrained submodular maximization problems. They also derived performance bounds for randomized greedy algorithms in nonmonotone submodular maximization problems (Buchbinder et al. 2014). Mirzasoleiman et al. (2016) considered submodular maximization problems in a distributed fashion, and they derived performance bounds for a two-stage greedy algorithm under matroid or knapsack constraints. Qu et al. (2015) proposed a distributed greedy strategy and showed that it has the same guarantee as the centralized greedy strategy.
For string submodular optimization problems, we reviewed performance bounds for the greedy strategy under matroid constraints. There are some related results on performance bounds for greedy strategies in string submodular maximization problems that were not reviewed in this paper. For example, Golovin and Krause (2001) considered a particular class of partially observable adaptive stochastic optimization problems, and established performance bounds for the greedy strategy by introducing the notion of adaptive submodularity. Tschiatschek et al. (2017) derived performance bounds for a modified greedy strategy in submodular string optimization problems under uniform string matroid constraints.
The scope of this study is limited to the performance of the greedy strategies in deterministic optimization problems where the objective function only involves actions. Potentially fruitful areas for further research include performance bounds for the greedy strategy in stochastic optimization problems, where the objective function involves states and control actions, and real-world applications of the performance bounds in the deterministic and stochastic settings.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ahmed and Atamtürk (2011) Ahmed, S., and A. Atamtürk, 2011: Maximizing a class of submodular utility functions. Math Program , 128 , 149–169.
- 2Arslan et al. (2007) Arslan, G., J. R. Marden, and J. S. Shamma, 2007: Autonomous vehicle-target assignment: a game-theoretical formulation. J Dyn Syst Meas Control , 129 , 584–596.
- 3Badanidiyuru et al. (2014) Badanidiyuru, A., B. Mirzasoleiman, A. Karbasi, and A. Krause, 2014: Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , 671–680.
- 4Bator (1957) Bator, F. M., 1957: The simple analytics of welfare maximization. Am Econ Rev , 47 , 22–59.
- 5Bertsekas (2005) Bertsekas, D. P., 2005: Dynamic programming and optimal control . 3rd ed., Athena Scientific.
- 6Bian et al. (2017) Bian, A. A., J. M. Buhmann, A. Krause, and S. Tschiatschek, 2017: Guarantee for greedy maximization of non-submodular functions with applications. In: Proceedings of the 34th International Conference on Machine Learning , 498–507.
- 7Boros et al. (2003) Boros, E., K. Elbassioni, and L. Khachiyan, 2003: An inequality for polymatroid functions and its applications. Discrete Appl Math , 131 , 255–281.
- 8Buchbinder et al. (2012) Buchbinder, N., M. Feldman, J. Naor, and R. Schwartz, 2012: A tight linear time (1/2)-approximation for unconstrained submodular maximization. SIAM J Comput , 44 , 255–281.
