On the Complexity and Approximation of the Maximum Expected Value All-or-Nothing Subset
Noam Goldberg, Gabor Rudolf

TL;DR
This paper studies a complex nonlinear optimization problem involving selecting items with probabilities and profits to maximize expected value, proves its NP-hardness, and develops an efficient approximation scheme.
Contribution
It introduces the first FPTAS for the maximum expected value subset problem, addressing its computational complexity.
Findings
The problem is NP-hard via reduction from subset sum.
An FPTAS is developed for the problem.
The approximation scheme provides near-optimal solutions efficiently.
Abstract
An unconstrained nonlinear binary optimization problem of selecting a maximum expected value subset of items is considered. Each item is associated with a profit and probability. Each of the items succeeds or fails independently with the given probabilities, and the profit is obtained in the event that all selected items succeed. The objective is to select a subset that maximizes the total value times the product of probabilities of the chosen items. The problem is proven NP-hard by a nontrivial reduction from subset sum. Then we develop a fully polynomial time approximation scheme (FPTAS) for this problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Complexity and Approximation of the Maximum Expected Value All-or-Nothing Subset
Noam Goldberg
Bar-Ilan University, Ramat Gan, Israel
Gabor Rudolf
Koç University, Istanbul, Turkey
Abstract
An unconstrained nonlinear binary optimization problem of selecting a maximum expected value subset of items is considered. Each item is associated with a profit and probability. Each of the items succeeds or fails independently with the given probabilities, and the profit is obtained in the event that all selected items succeed. The objective is to select a subset that maximizes the total value times the product of probabilities of the chosen items. The problem is proven NP-hard by a nontrivial reduction from subset sum. Then we develop a fully polynomial time approximation scheme (FPTAS) for this problem.
1 Introduction
In the maximum expected value all-or-nothing subset problem, a decision maker seeks to maximize the expected value of a subset of activities , where each activity is associated with a positive profit and probability of success . The profits are earned in an all-or-nothing fashion – the overall success of a subset of activities depends on the individual success of all of its independent member activities. Accordingly, the problem is
[TABLE]
The problem arises in the design of serial reliability (or 1-out-of-) systems in which each component may have a different value and reliability but deriving value from the system depends on all of the selected components being operational. For example, this objective function arises in failure-aware barter exchanges such as kidney-exchange cycles in which the failure of a single pair to barter may cause the entire chain or cycle of transactions to fail [4]. In that setting arcs of a directed graph represent possible transplants and donations; would be the value of transplant (that connects some donor-patient pair), and is the probability of the transplant taking place. The current all-subset setting corresponds to a special case of a complete directed graph of possible transplants, where a particular node (patient-donor pair) is connected to all other graph nodes through certain (probability-one) arcs. Another setting is a utility-maximizing evader who may select a subset of elicit activities that are under inspection, and the evader does not receive any value if one or more of the selected covert activities are exposed. This problem is proposed as an extension of the basic model studied in [6]. The problem may arise in network settings and a special case is that of disjoint edges that is the subject of the current paper, which is next shown to be NP-hard.
We find it convenient to formulate the problem as the (unconstrained) nonlinear mathematical program with binary decision variables ,
[TABLE]
The complexity of several related, yet different, problems has been investigated in the literature. The minimization of a (continuous) positive bilinear objective function of two variables subject to linear inequality constraints has been shown to be (strongly) NP-hard in [12]. The maximization of a product of linear functions of binary decision variables has been shown to be NP-hard in [7]. Half-product pseudo-Boolean function minimization, a special case of unconstrained quadratic binary minimization, has been shown to be NP-hard in [1]. For an extensive survey of pseudo-Boolean optimization including these special cases the reader may refer to [2].
Related cost-reliability problems, with a different objective function than (1), include variants that have been shown to be solvable in polynomial time. Let denote a collection of feasible item subsets. Then a general cost-reliability ratio minimization problems takes the form
[TABLE]
Such problems include the minimization of the spanning-tree cost to reliability ratio [3] when is the set of all spanning trees of a given graph. Katoh [10] considers a general cost-reliability ratio minimization problem under the assumption that given , the problem of determining with and minimum can be solved in polynomial time in . In [10] a fully polynomial time approximation scheme (FPTAS) is developed for this problem, but the computational complexity was unresolved and appears to remain open. Also [11] develops an FPTAS for a general quasiconcave minimization problem. Note that in contrast, the maximization variant of the ratio problem (2) with can be solved in polynomial time via the Dinkelbach algorithm; see [8] and references therein.
We first establish the NP-hardness of the maximum expected value all-or-nothing subset. Then we develop an FPTAS for this problem.
2 Maximum Expected Value All-Or-Nothing Subset - Complexity
First, observe that the objective function of problem (1) can be equivalently replaced (maintaining all optimal solutions) by the concave objective
[TABLE]
Note that if for some , , then evidently in every is optimal for (1), while similarly implies that . Therefore, the following assumption is without loss of generality:
Assumption 1**.**
For , the probabilities satisfy the inequalities .
For fixed and let
[TABLE]
The following lemma establishes the optimal value of and also that to determine a maximizer of over the integers it suffices to be able to approximately evaluate with precision that is bounded by a function of .
Lemma 1**.**
Let be an integer. Then, the function is concave, with a unique maximum at . Furthermore, for any positive integer we have .
Proof.
Since has a unique zero at , and holds for all , the first part of our claim immediately follows. Keeping in mind that is concave with a unique maximum at , for any positive integer we have
[TABLE]
Similarly, for any integer we have
[TABLE]
As holds for any integer , the proposition follows. ∎
In order to prove NP-hardness, we first show that a given instance of the subset sum problem can be decided by solving a maximum expected value all-or-nothing subset problem with the logarithmically transformed objective (3) and the values as the input parameters.
Lemma 2**.**
Let and be positive integers. Then, there exists an such that if and only if the optimal objective value of the following maximization problem equals .
[TABLE]
Proof.
First, by Lemma 1 has a unique maximum at , . Also note that the objective function in (4) can be written as .
Now assume that holds for some . Since is a solution of (4) with objective value , it is also an optimal solution according to our observation.
Similarly, if is an optimal solution of (4) with objective value then by our observation we have . ∎
The equivalence of the optimization problems following the log transformation of the objective (3) and the fact that the values are integer together imply the following corollary of Lemma 2.
Corollary 3**.**
Let and be positive integers. There exists a subset such that if and only if the optimal objective value of the problem
[TABLE]
is greater than . Equivalently, (5) has an optimal objective value of if and only if (4) has an optimal objective
[TABLE]
To prove that (1) is NP-hard it has to be shown that the reduction is polynomial time. However, as the input parameters for each cannot be exactly represented using a polynomial number of bits we employ a simple rounding argument.
For a given let
[TABLE]
Observe that . The following Lemma establishes the existence of a that is polynomial in the input size for which the maximizers of and coincide.
Lemma 4**.**
For , there exists a positive that satisfies for all that are optimal for (1),
[TABLE]
In particular, this inequality holds for any .
Proof.
Consider an . Then,
[TABLE]
Since this upper bound holds for every it also applies to the maxima of (1), and by Lemma 1 it suffices to choose so that
[TABLE]
By a Taylor series expansion of the denominator it follows that any is sufficiently large. ∎
Proposition 5**.**
The all-or nothing subset problem (1) is NP-hard.
Proof.
We prove our claim by providing a reduction of the subset sum problem with positive integer inputs, which is known to be NP-hard. Consider an instance where the goal is to decide whether there exists a subset of that sums to . Without losing of generality it is assumed that . For each , let . Let be an integer satisfying the condition of Lemma 4 (which also states that it suffices to choose ). Set for each . Following Corollary 3 the subset sum problem has a feasible solution if and only if
[TABLE]
By the choice of and Lemma 4 it follows that
[TABLE]
Then it follows there exists an such that if an only if
[TABLE]
Since it follows that the reduction of subset sum is polynomial in , and . ∎
3 Approximation of Maximum Expected Value All-or-Nothing Subset
We now develop an FPTAS for our nonlinear unconstrained problem (1). To this end we first consider a pseudo-polynomial time algorithm. This analysis is similar to that of a related constrained linear problem, namely the knapsack problem; see [9, 5, 13]. A fundamental difference is that (1) unconstrained.
3.1 A Pseudo-polynomial Dynamic Program
For let denote the maximum probability of a subset of with a profit of exactly . Consider the dynamic program (DP) given by the equations
[TABLE]
Let denote an upper bound on the sum of profits of an item set that is optimal for (1). A straightforward upper bound is .
Then, the problem of determining that maximizes (1) is solved by determining
[TABLE]
The total running time of this algorithm that determines an optimum of (1) through (7) is . In the following let be an optimal solution for (1) with support , and let denote the corresponding maximizer of (7).
The next lemma establishes a lower bound on the probabilities of items that are included in an optimal solution.
Lemma 6**.**
Suppose is (the support of a solution that is) optimal for (1) with , and . If then .
Proof.
Assume for the sake of deriving a contradiction that there exists an with and . Let . Then,
[TABLE]
thereby establishing a contradiction with the optimality of . ∎
In particular Lemma 6 implies the following corollary.
Corollary 7**.**
Suppose is (the support of a solution that is) optimal for (1). Then
[TABLE]
The result of this corollary is instrumental for developing an FPTAS that is the subject of the next section.
3.2 A Fully Polynomial Time Approximation Scheme
In order to approximately solve DP (7) and with a polynomial run-time complexity bound we consider scaling down (and rounding) the profit coefficients whose magnitude determines the running time of (7). In particular consider scaling the profit coefficients using some factor . Accordingly, for each , is the scaled profit coefficient. In the following let . Further, for convenience assume that for some ( when ). Accordingly, . For let denote the DP equations (6) with the values in place of the ’s. Also let and . Then, for and the scaled DP problem is defined as
[TABLE]
where .
Note that is an optimal objective value of (1) with replaced by and the additional constraints (fixing the decision variable values) for
[TABLE]
Let and let be the corresponding support of that maximizes (1) with replaced by (for which ). Following Corollary 7, it can observed that it suffices to evaluate with to determine and .
The following lemma establishes an upper bound on that is sufficient to bound the relative error to within a given .
Lemma 8**.**
For a given , and all ,
[TABLE]
where is a maximizer of (7).
Proof.
First note that
[TABLE]
Then, it follows that
[TABLE]
where the last inequality also followed from the optimality of with the scaled profit values. Then, given an , (9) implies that must satisfy
[TABLE]
and so it suffices to choose
[TABLE]
∎
Algorithm 1 is now considered as an approximation scheme for (7) and (the equivalent) (1).
The following proposition establishes that Algorithm 1 is an FPTAS for (1).
Proposition 9**.**
Algorithm 1 is an FPTAS for (1).
Proof.
The following cases need to be considered.
Case :
It is straightforward that Algorithm 1 outputs an optimal solution determined in step 3.
Case :
It follows from Corollary 7 that if then . Then, consider the following collectively exhaustive subcases:
Case :
For each given , satisfies the supposition of Lemma 8. So, following Lemma 8 with ,
[TABLE]
Case :
Then for each , the choice of by Lemma 8 satisfies for some
[TABLE]
and the algorithm must determine since it enumerates all elements of in the main loop (in lines 2-7).
The complexity of the algorithm is determined by at most invocations of (8). Hence, it is
[TABLE]
4 Conclusion
We have established the NP-hardness of all-or-nothing maximum expected value subset. It also implies the hardness of constrained all-or-nothing subset problems in different graph settings. In particular one may consider an all-or-nothing maximum expected value matching, a similar problem, with activities and feasible subsets corresponding to edges and matchings in a graph, respectively. In ongoing work we develop an approximation scheme for this problem.
Acknowledgement
Noam Goldberg thanks Naoki Katoh for discussing [10] and referring him to [12] and [7], John Dickerson for a discussion of kidney exchange and referring to [4], and also Martin Milanič for comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. Badics and E. Boros. Minimization of half-products. Mathematics of Operations Research , 23(3):649–660, 1998.
- 2[2] E. Boros and P.L Hammer. Pseudo-boolean optimization. Discrete applied mathematics , 123(1):155–225, 2002.
- 3[3] R. Chandrasekaran and A. Tamir. Polynomial testing of the query “Is a b ≥ c d superscript 𝑎 𝑏 superscript 𝑐 𝑑 a^{b}\geq c^{d} ?” with application to finding a minimal cost reliability ratio spanning tree. Discrete Applied Mathematics , 9(2):117 – 123, 1984.
- 4[4] J.P. Dickerson, A.D. Procaccia, and T. Sandholm. Failure-aware kidney exchange. In Proceedings of the fourteenth ACM conference on Electronic commerce , pages 323–340. ACM, 2013.
- 5[5] G. V. Gens and E. V. Levner. Optimization Techniques: Proceedings of the 9th IFIP Conference on Optimization Techniques Warsaw, September 4–8, 1979 , chapter Fast approximation algorithms for knapsack type problems, pages 185–194. Springer Berlin Heidelberg, Berlin, Heidelberg, 1980.
- 6[6] N. Goldberg. Nonzero-sum nonlinear network path interdiction with an application to inspection in terror networks. Naval Research Logistics , Accepted, 2017.
- 7[7] P. L. Hammer, P. Hansen, P.M. Pardalos, and D.J. Rader Jr. Maximizing the product of two linear functions in 0-1 variables. Optimization , 51(3):511–537, 2002.
- 8[8] P. Hansen and C. Meyer. A polynomial algorithm for a class of 0–1 fractional programming problems involving composite functions, with an application to additive clustering. In Clusters, Orders, and Trees: Methods and Applications , pages 13–50. Springer, 2014.
