An Exact Method for Constrained Maximization of the Conditional Value-at-Risk of a Class of Stochastic Submodular Functions
Hao-Hsiang Wu, Simge Kucukyavuz

TL;DR
This paper introduces an exact method for maximizing the CVaR of stochastic submodular functions under constraints, providing a new approach for risk-averse optimization with practical applications.
Contribution
It develops valid inequalities and an exact solution method for constrained CVaR maximization of stochastic submodular functions, assuming efficient CVaR oracle access.
Findings
Effective solution method demonstrated on stochastic set covering problem.
Valid inequalities improve computational efficiency.
Method applicable to problems with efficient CVaR oracles.
Abstract
We consider a class of risk-averse submodular maximization problems (RASM) where the objective is the conditional value-at-risk (CVaR) of a random nondecreasing submodular function at a given risk level. We propose valid inequalities and an exact general method for solving RASM under the assumption that we have an efficient oracle that computes the CVaR of the random function. We demonstrate the proposed method on a stochastic set covering problem that admits an efficient CVaR oracle for the random coverage function.
| Oracle-LShape | Oracle-Ineq (7) | Oracle-Ineq (12) | Oracle-Ineq (7) and (12) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | Cuts | Nodes | Time | Cuts | Nodes | Time | Cuts | Nodes | Time | Cuts | Nodes | |||
| 50 | 0.025 | 3 | 9 | 2315 | 2363 | 3 | 801 | 1915 | 224 | 336 | 398 | 667 | ||
| 50 | 0.025 | 5 | 33988 | 36405 | 681 | 16683 | 52677 | 3 | 726 | 1960 | 5 | 1462 | 2264 | |
| 50 | 0.05 | 3 | 10 | 2319 | 2363 | 2 | 799 | 1734 | 225 | 366 | 397 | 787 | ||
| 50 | 0.05 | 5 | 32548 | 36569 | 492 | 14978 | 43320 | 3 | 713 | 1859 | 5 | 1484 | 2095 | |
| 100 | 0.025 | 3 | 1152 | 19661 | 20300 | 11 | 1845 | 4287 | 24 | 1511 | 2321 | 15 | 1816 | 1252 |
| 100 | 0.025 | 5 | 21834 | 53054 | 1154 | 17741 | 97510 | 825 | 13156 | 46789 | 560 | 15648 | 45091 | |
| 100 | 0.05 | 3 | 1205 | 19634 | 20685 | 9 | 1486 | 4476 | 24 | 1542 | 2400 | 15 | 1808 | 1224 |
| 100 | 0.05 | 5 | 21980 | 52479 | 505 | 11408 | 74808 | 826 | 12911 | 47823 | 238 | 10684 | 21191 | |
| 150 | 0.025 | 3 | 18922 | 22045 | 25 | 1832 | 12753 | 457 | 6775 | 10868 | 91 | 3046 | 1389 | |
| 150 | 0.025 | 5 | 19364 | 43796 | 1640 | 17781 | 109795 | 13468 | 72593 | 1603 | 20278 | 92923 | ||
| 150 | 0.05 | 3 | 20394 | 21649 | 6 | 1021 | 1115 | 445 | 6705 | 10815 | 59 | 1802 | 8489 | |
| 150 | 0.05 | 5 | 20609 | 49523 | 10832 | 150285 | 14759 | 68158 | 1582 | 19978 | 75957 | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Probabilistic and Robust Engineering Design · Complexity and Algorithms in Graphs
An Exact Method for Constrained Maximization of the Conditional Value-at-Risk of a Class of Stochastic Submodular Functions
@addressi
@addressii
@addressiii
@addressiv
@addressv
Abstract
We consider a class of risk-averse submodular maximization problems (RASM) where the objective is the conditional value-at-risk (CVaR) of a random nondecreasing submodular function at a given risk level. We propose valid inequalities and an exact general method for solving RASM under the assumption that we have an efficient oracle that computes the CVaR of the random function. We demonstrate the proposed method on a stochastic set covering problem that admits an efficient CVaR oracle for the random coverage function.
1 Introduction
11footnotetext: Corresponding author
We consider a class of risk-averse submodular maximization problems (RASM) recently formulated by Maehara [13]. Formally, let be a finite set, and be a probability space. We define an outcome mapping . The random outcome is defined by for all and . We assume that is a nondecreasing submodular set function. With a slight abuse of notation, we refer to the submodular function for as for interchangeably, where is the characteristic vector of , and the usage will be clear from the context. We measure the risk associated with this function using conditional value-at-risk (CVaR), where larger function values correspond to less risky random outcomes. In this context, risk measures are referred to as acceptability functionals. CVaR was first introduced by Artzner et al. [3] and has been widely used, especially in finance, due to its desirable properties (e.g., coherence and tractability).
Formally, let be the positive part of a number . Let be a variable that measures the value-at-risk (VaR) at a given risk level. For a given , the value-at-risk at a risk level is defined as
[TABLE]
For a given , the conditional value-at-risk at a risk level is defined as
[TABLE]
and it provides the conditional expected value of that is no larger than the value-at-risk at the risk level [17]. In general, the risk level is small, such as or . Let be the deterministic constraints on the variables . Given a risk level , a class of risk-averse submodular maximization problems (RASM) is defined as
[TABLE]
Let represent the expectation of with respect to . In the next observation, we review properties of CVaR that will be useful in developing valid inequalities for RASM.
Observation 1.1
From the definition of CVaR in (2), for a given , we have
- (i)
, and because is submodular (see, e.g., **[24]**), so is .
- (ii)
* for and .*
Due to Observation 1.1(i), when , problem (3) is equivalent to the -hard risk-neutral submodular maximization problem (see, e.g., [10, 11, 20] for the associated applications).
Note that for a nondecreasing submodular function , Proposition 2 of [14] shows that the submodular inequality
[TABLE]
is valid. Ahmed and Atamtürk [1] and Yu and Ahmed [26] strengthen the submodular inequalities by lifting, under the assumption that the submodular utility function is strictly concave, increasing, and differentiable, assumptions we do not make in this paper. However, Maehara [13] shows that CVaR of a stochastic submodular function is no longer submodular for any risk level . The author proves that unless , there is no polynomial-time approximation algorithm with a multiplicative error for RASM under some reasonable assumptions on the given risk level. Hence it is theoretically intractable to find a solution that is close to optimal in polynomial time without any assumption on the feasible region or the probability distribution. Along this line of work, Zhou and Tokekar [28] propose a sequential greedy algorithm for the CVaR maximization problem in [13]. The authors show that the proposed algorithm gives a solution that is within a constant factor of optimal with an additional additive term that depends on the optimal value and a parameter related to the curvature of the submodular set function. However, the running time of this algorithm is exponential as it depends quadratically on the cardinality of the feasible set.
Instead of solving the generic CVaR maximization problem in [13] directly, Ohsaka and Yoshida [16] consider a specific risk-averse submodular optimization problem arising in social networks. The authors relax the problem by replacing the combinatorial decisions that determine a single set of a predetermined size with a choice of a portfolio (convex combination) of sets of the given size. They also give a polynomial-time algorithm that obtains a portfolio with a CVaR value that has a provable guarantee with a specified probability. Wilder [22] generalizes these results and proposes an approximation algorithm for maximizing the CVaR of a generic monotone continuous submodular function (or a portfolio of dicrete sets) with a worst-case guarantee within factor of the optimal solution. That is, a function is continuous submodular if and only if for any , where is a compact subset of , , and notations and denote the coordinate-wise minimum and maximum operations, respectively (see, e.g., [6] and [7]). In contrast to this line of work that considers a portfolio of discrete sets as decision variables, we consider the discrete case, where we must choose one set of decisions for implementability purposes. In another line of work, Wu and Küçükyavuz [25] propose an exact method for the minimal cost selection of that satisfies the chance constraint for a given , where the authors assume that there exists a probability oracle for evaluating the chance constraint for a given solution. However, the exact method of [25] cannot be applied to the CVaR maximization problem (or its VaR maximization counterpart) directly.
Note that for a given and a finite probability space with a set of realizations (scenarios) , for , CVaR is given by [17]
[TABLE]
where represents the set of the first positive integers, , and for at least one . Therefore, when an oracle for evaluating CVaR is not available, we can take a sampling-based approach. To this end, in [23], we apply the stochastic submodular optimization methods of [24] to problem (3) based on the CVaR definition (5). The resulting solutions are then tested out-of-sample and statistical performance guarantees are provided.
In this paper, our goal is to give (near-)optimal solutions for RASM without sampling. We start by considering high-quality (ideally optimal) feasible solutions to RASM under a true (non-trivial) distribution of the uncertain parameters assuming that there is an efficient oracle to evaluate the true value of CVaR of the random function at a given risk level. To solve RASM, we propose a decomposition method with various classes of valid inequalities. We demonstrate our proposed methods on a risk-averse set covering problem (RASC) that admits an efficient CVaR oracle in our computational study.
2 RASM with an Exact CVaR Oracle
In this section, we assume that for a given incumbent solution , we have an efficient oracle that computes exactly. Under this assumption, we solve problem (3) without sampling by using a two-stage optimization model and an exact algorithm with various valid inequalities. In the proposed method, we solve a relaxed master problem (RMP) at any iteration in the form
[TABLE]
where is a variable that is an upper bounding approximation of for a solution , and is a set of optimality cuts to be defined later. It is an approximation in that not all necessary optimality cuts may have been generated at an intermediate iteration. A full master problem includes all optimality cuts in such that for any , we have . Therefore, solving the full master problem is equivalent to solving the original problem (3). In Algorithm 1, we propose a delayed constraint generation algorithm for solving problem (3) using RMP (6). The algorithm starts with a set of optimality cuts (could be empty). In the while loop, we solve RMP (6) and obtain an incumbent solution (Line 1). Based on the incumbent solution, we add an optimality cut to RMP (6) (Line 1). In this algorithm, is a user-defined optimality tolerance. Let UB be the upper bound obtained from the optimal objective value of RMP (6) at each iteration. Let LB be the lower bound equal to , obtained from calling the CVaR oracle with the given incumbent solution as input. If the optimality gap is below , then we terminate the algorithm and return the near-optimal solution.
As we mentioned earlier, is not submodular in even if is submodular [13]. Hence, there is no direct way use the submodular inequality (4) as a class of optimality cuts in . Therefore, in this section, we propose new optimality cuts that are valid for (6). Throughout, we let be a unit vector of dimension whose th component is 1, and let be a -dimensional vector with all entries equal to 1. For a given , we first propose an optimality cut given by
[TABLE]
Before formally proving the validity of inequality (7), a few remarks are in order. It may be tempting to think that inequality (7) is in the form of a submodular inequality (4). However, we highlight that the coefficients \big{(}\text{CVaR}_{1}(\sigma(\bar{x}+\mathbf{e}_{j}))-\text{CVaR}_{\alpha}(\sigma(\bar{x}))\big{)} for are different from their submodular counterparts \big{(}\text{CVaR}_{\alpha}(\sigma(\bar{x}+\mathbf{e}_{j}))-\text{CVaR}_{\alpha}(\sigma(\bar{x}))\big{)} for and because is not submodular, the latter coefficients are not valid. To provide some intuition for the validity of the coefficients of the variables in inequality (7), we consider the upper bound on provided by the right-hand side of the inequality (recall that for all ). First, observe that for , inequality (7) holds trivially, because for . Now consider the point for some . Then the right-hand side of inequality (7) is , which is a valid upper bound on (because from Observation 1.1 (ii)). The validity for other is proven by exploiting the properties of given in Observation 1.1 as we show next.
Proposition 2.1
Inequality (7) for a given is valid for RMP (6).
Proof.
Consider a feasible point to the full master problem, in other words, let such that . We show that satisfies inequality (7) written for any . Let . From the definition of CVaR in (2), it follows that because is nondecreasing in for all , is also nondecreasing in . For the case that , since is a monotonically nondecreasing function in and for all , we have , which shows that inequality (7) is valid for . For the case that , we select an arbitrary , where . Then
[TABLE]
Inequality (8a) follows from Observation 1.1 (ii) that is nondecreasing in . Inequality (8b) follows from the submodular inequality (4) written for the monotone submodular function (see Observation 1.1 (i)) and for evaluated at the point . Arranging terms in inequality (8b) and recalling that , we obtain equality (8c). Inequality (8d) follows from Observation 1.1 (ii), this time observing that is nonincreasing in . To obtain equality (8e), we add to inequality (8d) and reorganize the terms. Finally, equality (8f) follows from . This completes the proof. ∎
Next, we introduce a class of valid inequalities obtained by a sequential lifting procedure [15, Proposition 1.1 in Section II.2.1]. Given , which is a characteristic vector of the set , consider a restriction with for . For this restriction, we know that the base inequality is valid, because for any satisfying this restriction, we have , and due the property that is monotonically nondecreasing in . However, this base inequality is not valid when the restriction is lifted. To obtain a valid inequality, we sequentially lift the base inequality with the variables . Let be an ordering of the elements in , where . Sequential lifting following this order produces a valid inequality
[TABLE]
where for is an exact lifting function given by the -th lifting problem
[TABLE]
In (10a), we use the convention that for , the term . Note that in the -th lifting problem, to obtain the coefficient of the variable in inequality (9), we let in constraint (10b), we remove the restriction on the variables preceding this variable in the lifting sequence, while keeping the restriction that the variables following in the lifting sequence are fixed to zero in constraint (10c). The exact lifting problem is hard to solve since it is related to the submodular maximization problem (3). Instead of solving problem (10) exactly, we propose to solve a relaxation that provides an upper bound for the lifting coefficients. Then, we can obtain another valid inequality by using the upper bounds on the coefficients instead of the exact coefficients. We describe this approach next.
Given , we define as an upper bound of for , where is given by solving the following relaxation of the exact lifting problem (10):
[TABLE]
here we relax constraints (10d) and remove the term from the objective function in problem (10). This is a valid relaxation, because when is monotonically nondecreasing in . Furthermore, because and for , the feasible solution with for has the largest in the objective function of (11). Thus, the optimal solution of the relaxed lifting problem (11) is given by , where we can use an efficient oracle for to obtain the optimal value of efficiently.
Proposition 2.2
Given , its support and an ordering of given by , inequality
[TABLE]
where , is valid for RMP (6).
Proof.
Consider a feasible point . We show that satisfies inequality (12) written for .
[TABLE]
Inequality (13a) follows from the valid inequality (9). Inequality (13b) follows from for since problem (11) is a relaxation of problem (10). This completes the proof. ∎
While inequality (12) is derived by approximate lifting, next we provide some intuition on the validity of the coefficients of the variables in inequality (12). Consider the upper bound on for some provided by the right-hand side of the inequality. First, observe that for , inequality (12) holds trivially, because for . Now consider the point , for some . Then the right-hand side of inequality (12) is , which is a valid upper bound on from Observation 1.1 (ii). The approximate lifting argument establishes that the inequality is valid for other choices of as well, because the coefficients of the variables are no smaller than those obtained from an exact lifting problem.
In Algorithm 2, we propose a greedy method that generates an inequality (12) given an incumbent . In the for loop, Line 2 determines the entry by choosing the candidate index for which attains its smallest value, where the candidate has not been chosen previously.
Finally, we show the correctness of Algorithm 1 based on the inequalities (7) or (12).
Proposition 2.3
Algorithm 1 with optimality cuts (7) and/or (12) finitely converges to an optimal solution.
Proof.
From Proposition 2.1 and 2.2, for all and , we have
[TABLE]
and
[TABLE]
Hence, the following two linear integer programs,
[TABLE]
and
[TABLE]
are equivalent to Problem (3). Since the number of feasible solutions and the number of inequalities (7) and (12) are finite, the result follows. ∎
Next we report our computational experience with the proposed method.
3 An Application: the Risk-Averse Set Covering Problem (RASC)
We demonstrate the proposed methods on a variant of a risk-averse set covering problem (RASC). We represent RASC on a bipartite graph . There are two groups of nodes and in , where all arcs in are from to . Let be a set of items. Let be a collection of subsets, where . There exists an arc if for representing the covering relationship. In RASC, there is uncertainty on whether an arc appears in the graph. To formulate RASC, first consider a deterministic set covering problem with a feasibility set , where for all , and if ; otherwise, for all . We say that an item is covered, if there exists for and . Now suppose that there is uncertainty on whether a chosen subset can cover an item, where constraint has random constraint coefficients or random right-hand side for . A related class of risk-averse problems, referred to as the probabilistic set covering problems, consider a feasible set , where is a given risk level and and/or are random variables for . Here the objective is to select a minimum cost selection of subsets such that the probability of covering all items is at least the risk threshold (see, e.g., [5, 19, 8, 2]). In contrast, in this paper, we assume that for all , and that there is uncertainty on for all and and consider a different type of risk aversion, where we aim to choose at most subsets from the collection so that the CVaR of the number of covered items at a risk level is maximized. In this section, we consider an independent probability coverage model, where each node has an independent probability of being covered by node for , i.e., .
Let be a random variable representing the number of covered items in for a given , i.e., . It is known that is submodular for [25]. Given an integer and , problem (3) for RASC is
[TABLE]
which is in the form of (3), where is given by . Next we propose an efficient CVaR oracle to evaluate the objective function in problem (14) for a given under the probability distribution of interest.
Proposition 3.1
There exists a polyniamial-time oracle that computes the function for for RASC under the independent probability coverage model.
Proof.
We follow the notation described in [25] to describe the CVaR oracle. From [9, 18, 21], we know that function is equal to the probability mass function of a Poisson binomial distribution and use a dynamic program (DP) to evaluate , which is defined as the probability that the selection covers nodes among the first nodes of for . Barlow and Heidtmann [4], Zhang et al. [27], and Wu and Küçükyavuz [25] use the DP to calculate . Next, we show that we can use the same recursion to evaluate . From the definition of in (1), we have The function is given by
[TABLE]
For a given , the running time of the DP is , because obtaining for all is , and computing the recursion is . ∎
We study the computational performance of our proposed methods on RASC. All instances were executed on a Windows 8.1 operating system with an Intel Core i5-4200U 1.60 GHz CPU, 8 GB DRAM, and x64 based processor using C++ with IBM ILOG CPLEX 12.7. We set up the mixed-integer programming search method as traditional branch-and-cut with the lazycallback function of CPLEX, where the presolve process is turned off. The number of threads is equal to one. All other CPLEX options are set to their default values. The time limit is set to 1800 seconds. For RASC, we generate a complete bipartite graph with arcs from all nodes to all . Let . (Note that for RASC, because we only select nodes from .) We follow the data generation scheme of [25] to set , , and for each arc . We let and . The size of the bipartite graphs is .
In Table 1, we demonstrate the performance of Algorithm 1, referred to as “Oracle”, using three types of valid inequalities, the L-shaped cut of [12], inequality (7) and lifted inequality (12). Note that for a given incumbent solution, , which is a characteristic vector of the set , we consider the L-shaped cut
[TABLE]
For for any , inequality (15) gives a valid upper bound for \psi=\text{CVaR}_{\alpha}(\sigma(\bar{x}))\big{)}, given by \text{CVaR}_{1}(\sigma(\bar{x}))\big{)} for any , from Observation 1.1 (ii). Clearly, inequalities (7) are stronger than inequalities (15), because , for . Column “Oracle-LShape” denotes Algorithm 1 with the L-shaped cut (15). Column “Oracle-Ineq (7)” denotes Algorithm 1 with inequality (7). Column “Oracle-Ineq (12)” denotes Algorithm 1 with inequality (12). Column “Oracle-Ineq (7) and (12)” denotes Algorithm 1 with both inequalities (7) and (12). Column “Time” denotes the total solution time for each instance for RMP, in seconds. Column “Cuts” denotes the total number of user cuts added to RMP. Column “Nodes” denotes the number of branch-and-bound nodes traced in RMP. From Table 1, we observe that the solution time increases as and increase for all methods. We observe that Oracle-LShape cannot solve most instances within the time limit. Oracle-Ineq (7) and Oracle-Ineq (12) are faster than Oracle-LShape for the instances that are solvable by Oracle-LShape within the time limit. In addition, Oracle-Ineq (7) or Oracle-Ineq (12) generates a fewer number of optimality cuts and traces a fewer number of nodes compared to Oracle-LShape.
For most of the instances with , Oracle-Ineq (12) generates a fewer number of optimality cuts and traces a fewer number of nodes compared to Oracle-Ineq (7). However, for the instances with , the solution time of Oracle-Ineq (12) is more than Oracle-Ineq (7). Recall that for a given incumbent solution , inequality (12) is generated by Algorithm 2. In Algorithm 2, we observe that for a given and , we have From the above relation, if the size of is large, then there may exist a large number of nodes with a high value of the lifting function , which is close to the coefficients in the L-shaped cut (15), i.e. . This explains why Oracle-Ineq (12) does not perform well in instances with large . To benefit from the complementary strengths of inequalities (7) and (12), we add both classes of inequalities at each iteration in Algorithm 2. In Table 1, we observe that for the instances that are not solvable by either Oracle-Ineq (12) or Oracle-Ineq (7) within 1800 seconds, the solution time of Oracle-Ineq (7) and (12) is shorter. Furthermore, for a hard instance , only Oracle-Ineq (7) and (12) provides an optimal solution within the time limit. In conclusion, our proposed inequalities enable the effective solution of difficult instances of the RASC problem that cannot be solved with existing methods within the set time limit.
Acknowledgments
We thank the AE and the reviewer for their comments that improved the presentation. This work is supported, in part, by the National Science Foundation Grant #1907463.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ahmed and Atamtürk [2011] S. Ahmed and A. Atamtürk. Maximizing a class of submodular utility functions. Mathematical Programming , 128(1):149–169, 2011.
- 2Ahmed and Papageorgiou [2013] S. Ahmed and D. J. Papageorgiou. Probabilistic set covering with correlations. Operations Research , 61(2):438–452, 2013.
- 3Artzner et al. [1999] P. Artzner, F. Delbaen, J. Eber, and D. Heath. Coherent measures of risk. Mathematical Finance , 9(3):203–228, 1999.
- 4Barlow and Heidtmann [1984] R. E. Barlow and K. D. Heidtmann. Computing k 𝑘 k -out-of- n 𝑛 n system reliability. IEEE Transactions on Reliability , 33(4):322–323, 1984.
- 5Beraldi and Ruszczyński [2002] P. Beraldi and A. Ruszczyński. The probabilistic set-covering problem. Operations Research , 50(6):956–967, 2002.
- 6Bian et al. [2017] A. A. Bian, B. Mirzasoleiman, J. Buhmann, and A. Krause. Guaranteed non-convex optimization: Submodular maximization over continuous domains. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics , AISTATS, pages 111–120, 2017.
- 7Chen et al. [2018] L. Chen, H. Hassani, and A. Karbasi. Online continuous submodular maximization. In Proceedings of the 21th International Conference on Artificial Intelligence and Statistics , AISTATS, 2018.
- 8Fischetti and Monaci [2012] M. Fischetti and M. Monaci. Cutting plane versus compact formulations for uncertain (integer) linear programs. Mathematical Programming Computation , 4(3):239–273, 2012.
