On the Complexity and Approximation of the Maximum Expected Value All-or-Nothing Subset

Noam Goldberg; Gabor Rudolf

arXiv:1706.07406·cs.CC·June 23, 2017

TL;DR

This paper studies a complex nonlinear optimization problem involving selecting items with probabilities and profits to maximize expected value, proves its NP-hardness, and develops an efficient approximation scheme.

Contribution

It proves that the maximum expected value all-or-nothing subset problem is NP-hard and develops an FPTAS for it.

Findings

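1. The problem is NP-hard, via a reduction from subset sum.

2. An FPTAS is developed for the problem.

3. For any $\epsilon>0$, the scheme returns a subset whose expected value is within a factor $(1-\epsilon)$ of the optimum, in time polynomial in the input size and $1/\epsilon$.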

Abstract

An unconstrained nonlinear binary optimization problem of selecting a maximum expected value subset of items is considered. Each item is associated with a profit and probability. Each of the items succeeds or fails independently with the given probabilities, and the profit is obtained in the event that all selected items succeed. The objective is to select a subset that maximizes the total value times the product of probabilities of the chosen items. The problem is proven NP-hard by a nontrivial reduction from subset sum. Then we develop a fully polynomial time approximation scheme (FPTAS) for this problem.

Equations

$$\max_{S\in 2^{[n]}}\ \sum_{i\in S}c_{i}\prod_{j\in S}p_{j}.$$

$$\max_{x\in\{0,1\}^{n}}\ \sum_{i=1}^{n}c_{i}x_{i}\prod_{j=1}^{n}p_{j}^{x_{j}}.$$

$$\min_{S\in\mathcal{S}}\ \frac{\sum_{i\in S}c_{i}}{\prod_{i\in S}p_{i}}.$$

$$z(x)=\ln\left(\sum_{i=1}^{n}c_{i}x_{i}\right)+\sum_{i=1}^{n}(\ln p_{i})\,x_{i}.$$

$$f(y)=\ln y-\frac{y}{M}.$$

$$f(M)-f(N)\geq f\left(M-\tfrac{1}{2}\right)-f(M-1)\geq\frac{1}{2}f^{\prime}\left(M-\tfrac{1}{2}\right)=\frac{1}{2}\left(\frac{1}{M-\frac{1}{2}}-\frac{1}{M}\right)=\frac{1}{4M^{2}-2M}.$$

$$f(M)-f(N)\geq f\left(M+\tfrac{1}{2}\right)-f(M+1)\geq\frac{-1}{2}f^{\prime}\left(M+\tfrac{1}{2}\right)=\frac{-1}{2}\left(\frac{1}{M+\frac{1}{2}}-\frac{1}{M}\right)=\frac{1}{4M^{2}+2M}.$$

$$\max_{x\in\{0,1\}^{n}}\ \ln\left(\sum_{i=1}^{n}x_{i}c_{i}\right)-\frac{1}{M}\sum_{i=1}^{n}x_{i}c_{i}$$

$$\max_{x\in\{0,1\}^{n}}\ \left(\sum_{i=1}^{n}c_{i}x_{i}\right)\prod_{i=1}^{n}e^{-\frac{c_{i}}{M}x_{i}}$$

$$\max_{x\in\{0,1\}^{n}}z(x)>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$

$$\hat{p}_{i}=\frac{\lfloor Ke^{-\frac{c_{i}}{M}}\rfloor}{K},$$

$$z(x^{*})-\max_{x\in\{0,1\}^{n}}\hat{z}(x)<\ln M-\max\left\{\ln(M-1)+\frac{1}{M},\ \ln(M+1)-\frac{1}{M}\right\}.$$

$$\begin{aligned}z(x)-\hat{z}(x)&\leq\sum_{i=1}^{n}\left(\ln p_{i}-\ln(p_{i}-1/K)\right)=\sum_{i=1}^{n}\ln\left(\frac{p_{i}}{p_{i}-1/K}\right)\\&=\sum_{i=1}^{n}\ln\left(\frac{1}{1-1/(Kp_{i})}\right)\leq-n\ln\left(1-\frac{1}{Ke^{-c_{\text{max}}/M}}\right).\end{aligned}$$

$$-n\ln\left(1-\frac{1}{Ke^{-c_{\text{max}}/M}}\right)<\frac{1}{5M^{2}}\quad\Leftrightarrow\quad 1-\frac{e^{c_{\text{max}}/M}}{K}>e^{-\frac{1}{5nM^{2}}}.$$

$$\max_{x\in\{0,1\}^{n}}z(x)=\ln M-1>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$

$$\max_{x\in\{0,1\}^{n}}z(x)-\max_{x\in\{0,1\}^{n}}\hat{z}(x)<\ln M-\max\left\{\ln(M-1)+\frac{1}{M},\ \ln(M+1)-\frac{1}{M}\right\}$$

$$\max_{x\in\{0,1\}^{n}}\hat{z}(x)>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$

$$P(i,C)=\begin{cases}\max\left\{P(i-1,C),\ p_{i}\cdot P(i-1,C-c_{i})\right\}&i\geq 2,\ c_{i}<C\\P(i-1,C)&i\geq 2,\ c_{i}\geq C\\p_{1}&i=1\text{ and }c_{1}=C\\1&C=0\\0&\text{otherwise.}\end{cases}$$

$$\max_{C}\left\{C\cdot P(n,C)\ \middle|\ C=\min_{i\in[n]}\{c_{i}\},\ \min_{i\in[n]}\{c_{i}\}+1,\ \dots,\ \bar{C}\right\}.$$

$$\prod_{i\in S^{*}}p_{i}\sum_{i\in S^{*}}c_{i}=p_{l}X+p_{l}c_{l}\prod_{i\in S^{*}\setminus\{l\}}p_{i}<\max\{X,\ p_{l}c_{l}\},$$

$$\left|\left\{i\in S^{*}\ \middle|\ p_{i}<\tfrac{1}{2}\right\}\right|\leq 1.$$

$$\hat{z}(i,j)=\max_{C}\left\{(C+\hat{c}_{j})\cdot\hat{P}(i,C)\cdot p_{j}\ \middle|\ C=\min_{k\in[i]}\{\hat{c}_{k}\},\ \min_{k\in[i]}\{\hat{c}_{k}\}+1,\ \dots,\ \bar{C}(i)\right\},$$

$$x_{k}=\begin{cases}0&k\in\{i+1,\dots,n\}\setminus\{j\}\\1&k=j.\end{cases}$$

$$\kappa\cdot\hat{z}(n,n+1)\geq(1-\epsilon)\cdot C^{*}\cdot P(n,C^{*}),$$

$$\sum_{i\in S^{*}}c_{i}-\kappa\sum_{i\in S^{*}}\hat{c}_{i}\leq n\kappa.$$

$$\kappa\cdot\hat{C}\cdot\hat{P}(n,\hat{C})=\kappa\sum_{i\in\hat{S}}\hat{c}_{i}\prod_{j\in\hat{S}}p_{j}$$

$$\frac{n\kappa}{\sum_{i\in S^{*}}c_{i}}\leq\epsilon\quad\Leftrightarrow\quad\kappa\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}}{n},$$

$$\kappa\leq\frac{\epsilon\max_{i\in[n]}\{p_{i}c_{i}\}}{n}\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}\prod_{j\in S^{*}}p_{j}}{n}\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}}{n}.$$

$$\kappa\cdot\hat{z}(h,n+1)=\kappa\cdot\hat{C}\cdot\hat{P}(h,\bar{C})\geq(1-\epsilon)\cdot C^{*}\cdot P(h,C^{*})=(1-\epsilon)\cdot C^{*}\cdot P(n,C^{*}).$$

Full text

Noam Goldberg

Bar-Ilan University, Ramat Gan, Israel

Gabor Rudolf

Koç University, Istanbul, Turkey

1 Introduction

In the maximum expected value all-or-nothing subset problem, a decision maker seeks to maximize the expected value of a subset of activities $[n]=\{1,\ldots,n\}$ , where each activity $i\in[n]$ is associated with a positive profit $c_{i}$ and probability of success $p_{i}$ . The profits are earned in an all-or-nothing fashion – the overall success of a subset of activities depends on the individual success of all of its independent member activities. Accordingly, the problem is

$$\max_{S\in 2^{[n]}}\ \sum_{i\in S}c_{i}\prod_{j\in S}p_{j}.$$

The problem arises in the design of serial reliability (or 1-out-of-$n$) systems in which each component may have a different value and reliability, but deriving value from the system depends on all of the selected components being operational. For example, this objective function arises in failure-aware barter exchanges such as kidney-exchange cycles, in which the failure of a single pair to barter may cause the entire chain or cycle of transactions to fail [4]. In that setting the arcs of a directed graph represent possible transplants and donations; $c_{i}$ is the value of transplant $i$ (which connects some donor-patient pair), and $p_{i}$ is the probability of transplant $i$ taking place. The current all-subset setting corresponds to a special case of a complete directed graph of possible transplants, where a particular node (patient-donor pair) is connected to all other graph nodes through certain (probability-one) arcs. Another setting is that of a utility-maximizing evader who may select a subset of illicit activities that are under inspection, and who does not receive any value if one or more of the selected covert activities are exposed. This problem is proposed as an extension of the basic model studied in [6]. The problem may arise in network settings; the special case of disjoint edges is the subject of the current paper and is shown next to be NP-hard.

We find it convenient to formulate the problem as the (unconstrained) nonlinear mathematical program with binary decision variables $x_{1},\ldots,x_{n}$ ,

$$\max_{x\in\{0,1\}^{n}}\ \sum_{i=1}^{n}c_{i}x_{i}\prod_{j=1}^{n}p_{j}^{x_{j}}.\tag{1}$$
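To make the objective of (1) concrete, the following minimal Python sketch evaluates it for a given subset and finds the best subset by exhaustive enumeration. The function names `expected_value` and `brute_force` and the small instance in the note below are illustrative assumptions, not part of the paper; the enumeration takes exponential time and is meant only for tiny instances.

```python
from itertools import combinations
from math import prod


def expected_value(subset, c, p):
    """Total profit of `subset` times the probability that all of its items succeed."""
    return sum(c[i] for i in subset) * prod(p[i] for i in subset)


def brute_force(c, p):
    """Return (best value, best subset) over all nonempty subsets; exponential in len(c)."""
    n = len(c)
    best_value, best_subset = 0.0, ()
    for size in range(1, n + 1):
        for subset in combinations(range(n), size):
            v = expected_value(subset, c, p)
            if v > best_value:
                best_value, best_subset = v, subset
    return best_value, best_subset
```

For instance, with hypothetical profits `c = [10, 6, 4]` and probabilities `p = [0.6, 0.9, 0.9]`, selecting all three items gives $20\cdot 0.486=9.72$, which exceeds the value of every smaller subset (the best of which is $16\cdot 0.54=8.64$).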

The complexity of several related, yet different, problems has been investigated in the literature. The minimization of a (continuous) positive bilinear objective function of two variables subject to linear inequality constraints has been shown to be (strongly) NP-hard in [12]. The maximization of a product of linear functions of binary decision variables has been shown to be NP-hard in [7]. Half-product pseudo-Boolean function minimization, a special case of unconstrained quadratic binary minimization, has been shown to be NP-hard in [1]. For an extensive survey of pseudo-Boolean optimization including these special cases the reader may refer to [2].

Related cost-reliability problems, with a different objective function than (1), include variants that have been shown to be solvable in polynomial time. Let $\mathcal{S}\subseteq 2^{[n]}$ denote a collection of feasible item subsets. Then a general cost-reliability ratio minimization problem takes the form

$$\min_{S\in\mathcal{S}}\ \frac{\sum_{i\in S}c_{i}}{\prod_{i\in S}p_{i}}.\tag{2}$$

Such problems include the minimization of the spanning-tree cost to reliability ratio [3] when $\mathcal{S}$ is the set of all spanning trees of a given graph. Katoh [10] considers a general cost-reliability ratio minimization problem under the assumption that given $E\subseteq[n]$ , the problem of determining $S\subseteq E$ with $S\in\mathcal{S}$ and minimum $\sum_{i\in S}c_{i}$ can be solved in polynomial time in $\left|E\right|$ . In [10] a fully polynomial time approximation scheme (FPTAS) is developed for this problem, but the computational complexity was unresolved and appears to remain open. Also [11] develops an FPTAS for a general quasiconcave minimization problem. Note that in contrast, the maximization variant of the ratio problem (2) with $\mathcal{S}=2^{[n]}$ can be solved in polynomial time via the Dinkelbach algorithm; see [8] and references therein.

We first establish the NP-hardness of the maximum expected value all-or-nothing subset. Then we develop an FPTAS for this problem.

2 Maximum Expected Value All-Or-Nothing Subset - Complexity

First, observe that the objective function of problem (1) can be equivalently replaced (maintaining all optimal solutions) by the concave objective

$$z(x)=\ln\left(\sum_{i=1}^{n}c_{i}x_{i}\right)+\sum_{i=1}^{n}(\ln p_{i})\,x_{i}.\tag{3}$$

Note that if $p_{i}=1$ for some $i\in[n]$, then evidently $x^{*}_{i}=1$ in every $x^{*}$ that is optimal for (1), while similarly $p_{i}=0$ implies that $x^{*}_{i}=0$. Therefore, the following assumption is without loss of generality:

Assumption 1.

For $i\in[n]$ , the probabilities satisfy the inequalities $0<p_{i}<1$ .

For fixed $M>1$ and $y>0$ let

$$f(y)=\ln y-\frac{y}{M}.$$

The following lemma establishes the optimal value of $f$ and also that to determine a maximizer of $f$ over the integers it suffices to be able to approximately evaluate $f$ with precision that is bounded by a function of $M$ .

Lemma 1.

Let $M>1$ be an integer. Then the function $f$ is concave, with a unique maximum at $y=M$, where $f(M)=\ln(M)-1$. Furthermore, for any positive integer $N\neq M$ we have $f(M)-f(N)\geq\frac{1}{5M^{2}}$.

Proof.

Since $f^{\prime}(y)=\frac{1}{y}-\frac{1}{M}$ has a unique zero at $y=M$ , and $f^{\prime\prime}(y)=\frac{-1}{y^{2}}<0$ holds for all $y>0$ , the first part of our claim immediately follows. Keeping in mind that $f(y)$ is concave with a unique maximum at $y=M$ , for any positive integer $N<M$ we have

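$$f(M)-f(N)\geq f\left(M-\tfrac{1}{2}\right)-f(M-1)\geq\frac{1}{2}f^{\prime}\left(M-\tfrac{1}{2}\right)=\frac{1}{2}\left(\frac{1}{M-\frac{1}{2}}-\frac{1}{M}\right)=\frac{1}{4M^{2}-2M}.$$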

Similarly, for any integer $N>M$ we have

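$$f(M)-f(N)\geq f\left(M+\tfrac{1}{2}\right)-f(M+1)\geq\frac{-1}{2}f^{\prime}\left(M+\tfrac{1}{2}\right)=\frac{-1}{2}\left(\frac{1}{M+\frac{1}{2}}-\frac{1}{M}\right)=\frac{1}{4M^{2}+2M}.$$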

As $\frac{1}{4M^{2}-2M}\geq\frac{1}{4M^{2}+2M}\geq\frac{1}{5M^{2}}$ holds for any integer $M>1$, the lemma follows. ∎
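The gap bound of Lemma 1 can be checked numerically. The sketch below is only an illustration under the lemma's stated assumptions (integer $M>1$); the helper names `f` and `min_gap` and the search range `N_max` are assumptions introduced here.

```python
from math import log


def f(y, M):
    """The function f(y) = ln y - y / M used in Lemma 1."""
    return log(y) - y / M


def min_gap(M, N_max):
    """Smallest value of f(M) - f(N) over positive integers N != M with N <= N_max."""
    return min(f(M, M) - f(N, M) for N in range(1, N_max + 1) if N != M)
```

Because $f$ is concave with its maximum at $y=M$, the smallest gap is attained at $N=M-1$ or $N=M+1$, so a search range slightly beyond $M+1$ already captures the minimum.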

In order to prove NP-hardness, we first show that a given instance of the subset sum problem can be decided by solving a maximum expected value all-or-nothing subset problem with the logarithmically transformed objective (3) and the $\ln p_{i}$ values as the input parameters.

Lemma 2.

Let $c_{1},\dots,c_{n}$ and $M$ be positive integers. Then, there exists an $x\in\{0,1\}^{n}$ such that $\sum_{i=1}^{n}c_{i}x_{i}=M$ if and only if the optimal objective value of the following maximization problem equals $\ln M-1$ .

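$$\max_{x\in\{0,1\}^{n}}\ \ln\left(\sum_{i=1}^{n}x_{i}c_{i}\right)-\frac{1}{M}\sum_{i=1}^{n}x_{i}c_{i}\tag{4}$$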

Proof.

First, by Lemma 1 $f$ has a unique maximum at $y=M$ , $f(M)=\ln(M)-1$ . Also note that the objective function in (4) can be written as $f\left(\sum\limits_{i=1}^{n}x_{i}c_{i}\right)$ .

Now assume that $\sum_{i=1}^{n}c_{i}x_{i}=M$ holds for some $x\in\{0,1\}^{n}$. Since $x$ is a feasible solution of (4) with objective value $\ln(M)-1$, which by our observation is the largest value that $f$ can attain, it is also an optimal solution.

Similarly, if $x\in\{0,1\}^{n}$ is an optimal solution of (4) with objective value $\ln(M)-1$, then by our observation (the maximizer of $f$ is unique) we have $\sum\limits_{i=1}^{n}x_{i}c_{i}=M$. ∎

The equivalence of the optimization problems following the log transformation of the objective (3) and the fact that the $c_{i}$ values are integer together imply the following corollary of Lemma 2.

Corollary 3.

Let $c_{1},\dots,c_{n}$ and $M$ be positive integers. There exists a subset $I\subseteq[n]$ such that $\sum_{i\in I}c_{i}=M$ if and only if the optimal objective value of the problem

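$$\max_{x\in\{0,1\}^{n}}\ \left(\sum_{i=1}^{n}c_{i}x_{i}\right)\prod_{i=1}^{n}e^{-\frac{c_{i}}{M}x_{i}}\tag{5}$$

is greater than $\max\{(M-1)e^{-1+1/M},(M+1)e^{-1-1/M}\}$. Equivalently, (5) has an optimal objective value of $Me^{-1}$ if and only if (4) has an optimal objective value satisfying

$$\max_{x\in\{0,1\}^{n}}z(x)>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$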
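The threshold test of Corollary 3 can be illustrated on small instances. The sketch below is an assumption-laden illustration rather than the paper's procedure: it solves (5) by brute force (exponential time) using floating-point exponentials, and the names `best_value_problem5` and `has_subset_sum` are hypothetical.

```python
from itertools import combinations
from math import exp


def best_value_problem5(c, M):
    """Brute-force optimum of (5) with p_i = exp(-c_i / M).

    For a subset with total profit s, the product of the p_i equals exp(-s / M),
    so the objective value is s * exp(-s / M).
    """
    best = 0.0
    for size in range(1, len(c) + 1):
        for subset in combinations(c, size):
            s = sum(subset)
            best = max(best, s * exp(-s / M))
    return best


def has_subset_sum(c, M):
    """Decide subset sum by comparing the optimum of (5) with the Corollary 3 threshold."""
    threshold = max((M - 1) * exp(-1 + 1 / M), (M + 1) * exp(-1 - 1 / M))
    return best_value_problem5(c, M) > threshold
```

With the hypothetical instance `c = [3, 5, 7]` and `M = 12`, the subset $\{5,7\}$ attains $12e^{-1}\approx 4.415$, above the threshold of about $4.400$; with `c = [4, 8]` and `M = 10` no subset sums to 10 and the optimum of about $3.614$ stays below the threshold of about $3.662$.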

To prove that (1) is NP-hard it has to be shown that the reduction is polynomial time. However, as the input parameters $e^{-c_{i}/M}$ for each $i\in[n]$ cannot be exactly represented using a polynomial number of bits we employ a simple rounding argument.

For a given $K>0$ let

$$\hat{p}_{i}=\frac{\lfloor Ke^{-\frac{c_{i}}{M}}\rfloor}{K}.$$

Observe that $p_{i}-\frac{1}{K}\leq\hat{p}_{i}\leq p_{i}$ . The following Lemma establishes the existence of a $K$ that is polynomial in the input size for which the maximizers of $\hat{z}$ and $z$ coincide.

Lemma 4.

For $c_{\text{max}}<M$ , there exists a positive $K\in O(nM^{2})$ that satisfies for all $x^{*}$ that are optimal for (1),

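$$z(x^{*})-\max_{x\in\{0,1\}^{n}}\hat{z}(x)<\ln M-\max\left\{\ln(M-1)+\frac{1}{M},\ \ln(M+1)-\frac{1}{M}\right\}.$$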

In particular, this inequality holds for any $K>\frac{5nM^{2}}{1-1/(10nM^{2})}$ .

Proof.

Consider an $x\in\{0,1\}^{n}$ . Then,

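$$\begin{aligned}z(x)-\hat{z}(x)&\leq\sum_{i=1}^{n}\left(\ln p_{i}-\ln(p_{i}-1/K)\right)=\sum_{i=1}^{n}\ln\left(\frac{p_{i}}{p_{i}-1/K}\right)\\&=\sum_{i=1}^{n}\ln\left(\frac{1}{1-1/(Kp_{i})}\right)\leq-n\ln\left(1-\frac{1}{Ke^{-c_{\text{max}}/M}}\right).\end{aligned}$$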

Since this upper bound holds for every $x\in\{0,1\}^{n}$ it also applies to the maxima of (1), and by Lemma 1 it suffices to choose $K$ so that

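$$-n\ln\left(1-\frac{1}{Ke^{-c_{\text{max}}/M}}\right)<\frac{1}{5M^{2}}\quad\Leftrightarrow\quad 1-\frac{e^{c_{\text{max}}/M}}{K}>e^{-\frac{1}{5nM^{2}}}.$$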

By a Taylor series expansion of the denominator it follows that any $K>\frac{5nM^{2}}{1-1/(10nM^{2})}$ is sufficiently large. ∎

Proposition 5.

The all-or-nothing subset problem (1) is NP-hard.

Proof.

We prove our claim by providing a reduction from the subset sum problem with positive integer inputs, which is known to be NP-hard. Consider an instance where the goal is to decide whether there exists a subset of $\{c_{1},\dots,c_{n}\}\subset\mathbb{N}$ that sums to $M\in\mathbb{N}$. Without loss of generality it is assumed that $c_{\text{max}}\leq M$. For each $i\in[n]$, let $p_{i}=e^{-\frac{c_{i}}{M}}$. Let $K\in O(nM^{2})$ be an integer satisfying the condition of Lemma 4 (which also implies that it suffices to choose $K=6nM^{2}$). Set $\hat{p}_{i}=\lfloor p_{i}K\rfloor/K\geq e^{-\frac{c_{i}}{M}}-\frac{1}{K}$ for each $i\in[n]$. Following Corollary 3, the subset sum problem has a feasible solution if and only if

$$\max_{x\in\{0,1\}^{n}}z(x)=\ln M-1>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$

By the choice of $K$ and Lemma 4 it follows that

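$$\max_{x\in\{0,1\}^{n}}z(x)-\max_{x\in\{0,1\}^{n}}\hat{z}(x)<\ln M-\max\left\{\ln(M-1)+\frac{1}{M},\ \ln(M+1)-\frac{1}{M}\right\}.$$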

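Then it follows that there exists an $x\in\{0,1\}^{n}$ such that $\sum_{i=1}^{n}c_{i}x_{i}=M$ if and only if

$$\max_{x\in\{0,1\}^{n}}\hat{z}(x)>\max\left\{\ln(M-1)-\frac{M-1}{M},\ \ln(M+1)-\frac{M+1}{M}\right\}.$$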

Since $K\in O(nM^{2})$ it follows that the reduction of subset sum is polynomial in $n$ , $\ln M$ and $\ln c_{max}$ . ∎
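The reduction can be traced end to end on a toy instance. The sketch below is a hedged illustration, not the paper's construction verbatim: it rounds $p_{i}=e^{-c_{i}/M}$ down to multiples of $1/K$, maximizes the log objective with the rounded probabilities by brute force (exponential time, for illustration only), and compares the result with the threshold from Corollary 3. The function names are hypothetical, and `conservative_K` is an assumption of this sketch: it picks $K$ directly from the sufficient condition $-n\ln(1-e/K)<\frac{1}{5M^{2}}$, using $e^{c_{\text{max}}/M}\leq e$, rather than the constant quoted in the text. It assumes $M>1$ and $c_{\text{max}}\leq M$.

```python
from itertools import combinations
from math import ceil, e, exp, expm1, floor, log


def conservative_K(n, M):
    """An integer K with -n * ln(1 - e / K) < 1 / (5 M^2); assumes c_max <= M."""
    t = 1.0 / (5 * n * M * M)
    return ceil(e / -expm1(-t)) + 1  # -expm1(-t) equals 1 - exp(-t)


def rounded_probabilities(c, M, K):
    """Round each p_i = exp(-c_i / M) down to a multiple of 1 / K."""
    return [floor(K * exp(-ci / M)) / K for ci in c]


def decides_subset_sum(c, M):
    """True iff the rounded log objective exceeds the Corollary 3 threshold."""
    n = len(c)
    p_hat = rounded_probabilities(c, M, conservative_K(n, M))
    best = max(
        log(sum(c[i] for i in S)) + sum(log(p_hat[i]) for i in S)
        for size in range(1, n + 1)
        for S in combinations(range(n), size)
    )
    threshold = max(log(M - 1) - (M - 1) / M, log(M + 1) - (M + 1) / M)
    return best > threshold
```

The rounded probabilities need only $O(\log K)$ bits each, which is the point of the rounding step; the brute-force maximization stands in for an exact solver of (1) and is not part of the reduction itself.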

3 Approximation of Maximum Expected Value All-or-Nothing Subset

We now develop an FPTAS for our nonlinear unconstrained problem (1). To this end we first consider a pseudo-polynomial time algorithm. This analysis is similar to that of a related constrained linear problem, namely the knapsack problem; see [9, 5, 13]. A fundamental difference is that (1) is unconstrained.

3.1 A Pseudo-polynomial Dynamic Program

For $i\in[n]$ let $P(i,C)$ denote the maximum probability of a subset of $[i]$ with a profit of exactly $C$ . Consider the dynamic program (DP) given by the equations

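$$P(i,C)=\begin{cases}\max\left\{P(i-1,C),\ p_{i}\cdot P(i-1,C-c_{i})\right\}&i\geq 2,\ c_{i}<C\\P(i-1,C)&i\geq 2,\ c_{i}\geq C\\p_{1}&i=1\text{ and }c_{1}=C\\1&C=0\\0&\text{otherwise.}\end{cases}\tag{6}$$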

Let $\bar{C}$ denote an upper bound on the sum of profits of an item set that is optimal for (1). A straightforward upper bound is $\bar{C}=\sum_{i=1}^{n}c_{i}$ .

Then, the problem of determining $x\in\{0,1\}^{n}$ that maximizes (1) is solved by determining

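$$\max_{C}\left\{C\cdot P(n,C)\ \middle|\ C=\min_{i\in[n]}\{c_{i}\},\ \min_{i\in[n]}\{c_{i}\}+1,\ \dots,\ \bar{C}\right\}.\tag{7}$$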

The total running time of this algorithm that determines an optimum of (1) through (7) is $O(n\bar{C})$ . In the following let $x^{*}\in\{0,1\}^{n}$ be an optimal solution for (1) with support $S^{*}=\left\{i\in[n]\;\left|\;\;x^{*}_{i}=1\right.\right\}$ , and let $C^{*}=\sum_{i\in S^{*}}c_{i}$ denote the corresponding maximizer of (7).
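A minimal Python sketch of this pseudo-polynomial dynamic program follows, assuming positive integer profits. It keeps a single array indexed by $C$ and updates it item by item, which is a standard space-saving variant and not necessarily how the authors implement it. The names `max_prob_table` and `solve_dp` are hypothetical. As an assumption of this sketch, item $i$ may be added whenever $c_{i}\leq C$, so that a single item with $c_{i}=C$ is counted through the empty-set entry $P(\cdot,0)=1$.

```python
def max_prob_table(c, p):
    """P[C] = largest product of probabilities over subsets with total profit exactly C.

    P[0] = 1.0 corresponds to the empty set; unreachable totals keep the value 0.0.
    """
    C_bar = sum(c)  # straightforward upper bound on the profit of any subset
    P = [0.0] * (C_bar + 1)
    P[0] = 1.0
    for ci, pi in zip(c, p):
        # Descending order ensures each item is used at most once.
        for C in range(C_bar, ci - 1, -1):
            candidate = pi * P[C - ci]
            if candidate > P[C]:
                P[C] = candidate
    return P


def solve_dp(c, p):
    """Return (C*, C* * P[C*]), the profit level and value maximizing C * P[C]."""
    P = max_prob_table(c, p)
    best_C = max(range(1, len(P)), key=lambda C: C * P[C])
    return best_C, best_C * P[best_C]
```

The two nested loops perform $O(n\bar{C})$ updates, matching the running time stated above. On the hypothetical instance `c = [10, 6, 4]`, `p = [0.6, 0.9, 0.9]` the maximum is attained at $C=20$ with value $20\cdot 0.486=9.72$.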

The next lemma establishes a lower bound on the probabilities of items that are included in an optimal solution.

Lemma 6.

Suppose $S^{*}$ is (the support of a solution that is) optimal for (1) with $\left|S^{*}\right|\geq 2$ , and $l\in\operatorname*{argmin}_{i\in S^{*}}\{p_{i}\}$ . If $p_{l}<\frac{1}{2}$ then $\prod_{i\in S^{*}\setminus\{l\}}p_{i}\geq\frac{1}{2}$ .

Proof.

Assume for the sake of deriving a contradiction that there exists an $l\in S^{*}$ with $p_{l}<\frac{1}{2}$ and $\prod_{i\in S^{*}\setminus\{l\}}p_{i}<\frac{1}{2}$. Let $X=\prod_{i\in S^{*}\setminus\{l\}}p_{i}\sum_{i\in S^{*}\setminus\{l\}}c_{i}$. Then,

$$\prod_{i\in S^{*}}p_{i}\sum_{i\in S^{*}}c_{i}=p_{l}X+p_{l}c_{l}\prod_{i\in S^{*}\setminus\{l\}}p_{i}<\max\{X,\ p_{l}c_{l}\},$$

thereby establishing a contradiction with the optimality of $S^{*}$ . ∎

In particular Lemma 6 implies the following corollary.

Corollary 7.

Suppose $S^{*}$ is (the support of a solution that is) optimal for (1). Then

$$\left|\left\{i\in S^{*}\ \middle|\ p_{i}<\tfrac{1}{2}\right\}\right|\leq 1.$$

The result of this corollary is instrumental for developing an FPTAS that is the subject of the next section.
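Corollary 7 can be sanity-checked empirically on small random instances. The sketch below is only an illustration: the instance generator (six items, integer profits in $1,\dots,20$, probabilities uniform on $[0.05,0.95]$, fixed seed) and the function names are assumptions introduced here, and the optimum is found by exponential-time enumeration.

```python
import random
from itertools import combinations
from math import prod


def best_subset(c, p):
    """Optimal nonempty subset for (1), found by exhaustive enumeration."""
    n = len(c)
    subsets = (S for size in range(1, n + 1) for S in combinations(range(n), size))
    return max(subsets, key=lambda S: sum(c[i] for i in S) * prod(p[i] for i in S))


def count_low_probability(S, p):
    """Number of items in S whose success probability is below 1/2."""
    return sum(1 for i in S if p[i] < 0.5)


def check_corollary(trials=200, n=6, seed=0):
    """True if every sampled optimum contains at most one item with p_i < 1/2."""
    rng = random.Random(seed)
    for _ in range(trials):
        c = [rng.randint(1, 20) for _ in range(n)]
        p = [rng.uniform(0.05, 0.95) for _ in range(n)]
        if count_low_probability(best_subset(c, p), p) > 1:
            return False
    return True
```

Such a check does not replace the proof; it only illustrates why the approximation scheme of the next section needs to consider at most one low-probability item per candidate solution.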

3.2 A Fully Polynomial Time Approximation Scheme

In order to approximately solve DP (7) with a polynomial run-time complexity bound, we consider scaling down (and rounding) the profit coefficients, whose magnitude determines the running time of (7). In particular, consider scaling the profit coefficients using some factor $\kappa>0$. Accordingly, for each $i\in[n]$, $\hat{c}_{i}=\lfloor\frac{c_{i}}{\kappa}\rfloor$ is the scaled profit coefficient. In the following let $N_{1/2}=\left\{i\in[n]\;\left|\;\;p_{i}\geq\frac{1}{2}\right.\right\}$. Further, for convenience assume that $N_{1/2}=[h]$ for some $h\in[n]\cup\{0\}$ ($h=0$ when $N_{1/2}=\emptyset$). Accordingly, $[n]\setminus N_{1/2}=\{h+1,\ldots,n\}$. For $i\in[n]$ let $\hat{P}(i,C)$ denote the DP equations (6) with the $\hat{c}_{i}$ values in place of the $c_{i}$'s. Also let $\hat{c}_{n+1}=0$ and $p_{n+1}=1$. Then, for $i\in[n]$ and $j>i$ the scaled DP problem is defined as

$$\hat{z}(i,j)=\max_{C}\left\{(C+\hat{c}_{j})\cdot\hat{P}(i,C)\cdot p_{j}\ \middle|\ C=\min_{k\in[i]}\{\hat{c}_{k}\},\ \min_{k\in[i]}\{\hat{c}_{k}\}+1,\ \dots,\ \bar{C}(i)\right\},\tag{8}$$

where $\bar{C}(i)=\sum_{k=1}^{i}\hat{c}_{k}$ .

Note that $\hat{z}(i,j)$ is an optimal objective value of (1) with $c$ replaced by $\hat{c}$ and the additional constraints (fixing the decision variable values) for $k\in\{i+1,\ldots,n\}$

$$x_{k}=\begin{cases}0&k\in\{i+1,\dots,n\}\setminus\{j\}\\1&k=j.\end{cases}$$

Let $\hat{C}\equiv\hat{z}(n,n+1)=\max_{C}\left\{C\cdot\hat{P}(n,C)\;\left|\;\;C=1,\ldots,\bar{C}(n)\right.\right\}$ and let $\hat{S}$ be the corresponding support of $x$ that maximizes (1) with $c$ replaced by $\hat{c}$ (for which $\sum_{i\in\hat{S}}\hat{c}_{i}=\hat{C}$). Following Corollary 7, it can be observed that it suffices to evaluate $\hat{z}(i,j)$ with $h=i<j=h+1,\ldots,n+1$ to determine $\hat{z}(n,n+1)$ and $\hat{C}\in\operatorname*{argmax}_{C}\left\{C\cdot\hat{P}(n,C)\;\left|\;\;C=1,\ldots,\bar{C}(n)\right.\right\}$.

The following lemma establishes an upper bound on $\kappa$ that is sufficient to bound the relative error to within a given $\epsilon>0$ .

Lemma 8.

For a given $\epsilon>0$ , and all $\kappa\leq\frac{\epsilon\max_{i\in S^{*}}p_{i}c_{i}}{n}$ ,

$$\kappa\cdot\hat{z}(n,n+1)\geq(1-\epsilon)\cdot C^{*}\cdot P(n,C^{*}),$$

where $C^{*}$ is a maximizer of (7).

Proof.

First note that

$$\sum_{i\in S^{*}}c_{i}-\kappa\sum_{i\in S^{*}}\hat{c}_{i}\leq n\kappa.$$

Then, it follows that

[TABLE]

where the last inequality also followed from the optimality of $\hat{S}$ with the scaled profit $\hat{c}_{i}$ values. Then, given an $\epsilon>0$ , (9) implies that $\kappa$ must satisfy

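$$\frac{n\kappa}{\sum_{i\in S^{*}}c_{i}}\leq\epsilon\quad\Leftrightarrow\quad\kappa\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}}{n},$$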

and so it suffices to choose

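$$\kappa\leq\frac{\epsilon\max_{i\in[n]}\{p_{i}c_{i}\}}{n}\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}\prod_{j\in S^{*}}p_{j}}{n}\leq\frac{\epsilon\sum_{i\in S^{*}}c_{i}}{n}.$$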

∎

Algorithm 1 is now considered as an approximation scheme for (7) and (the equivalent) (1).

The following proposition establishes that Algorithm 1 is an FPTAS for (1).

Proposition 9.

Algorithm 1 is an FPTAS for (1).

Proof.

The following cases need to be considered.

Case $\left|S^{*}\right|=1$ :

It is straightforward that Algorithm 1 outputs an optimal solution determined in step 3.

Case $\left|S^{*}\right|\geq 2$ :

It follows from Corollary 7 that if $\left|S^{*}\right|\geq 2$ then $\left|S^{*}\setminus N_{1/2}\right|\leq 1$ . Then, consider the following collectively exhaustive subcases:

Case $S^{*}\setminus N_{1/2}=\emptyset$ :

For each given $\epsilon>0$ , $\kappa$ satisfies the supposition of Lemma 8. So, following Lemma 8 with $\bar{C}=\bar{C}(h)=\sum_{i\in N_{1/2}}c_{i}\geq\sum_{i\in S^{*}}c_{i}$ ,

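$$\kappa\cdot\hat{z}(h,n+1)=\kappa\cdot\hat{C}\cdot\hat{P}(h,\bar{C})\geq(1-\epsilon)\cdot C^{*}\cdot P(h,C^{*})=(1-\epsilon)\cdot C^{*}\cdot P(n,C^{*}).$$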

Case $\left|S^{*}\setminus N_{1/2}\right|=1$ :

Then for each $\epsilon>0$ , the choice of $\kappa$ by Lemma 8 satisfies for some $j\in[n]\setminus N_{1/2}=\{h+1,\ldots,n\}$

[TABLE]

and the algorithm must determine $j$ since it enumerates all elements of $[n]\setminus N_{1/2}$ in the main loop (in lines 2-7).

The complexity of the algorithm is determined by at most $\left|[n]\setminus N_{1/2}\right|\leq n$ invocations of (8). Hence, it is

[TABLE]
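The ingredients of this section (the best single item, profit scaling by $\kappa$, a dynamic program restricted to items with $p_{i}\geq\frac{1}{2}$, and enumeration of at most one item with $p_{i}<\frac{1}{2}$) can be combined as in the following Python sketch. It is a hedged illustration under the stated assumptions (positive profits, $0<p_{i}<1$, $\epsilon>0$) and should not be read as a transcription of Algorithm 1. The names `value` and `fptas_sketch`, the choice $\kappa=\epsilon\max_{i}p_{i}c_{i}/n$, and the bookkeeping of chosen subsets are assumptions introduced here.

```python
from math import floor, prod


def value(S, c, p):
    """Objective of (1) for the index tuple S, using the original (unscaled) profits."""
    return sum(c[i] for i in S) * prod(p[i] for i in S)


def fptas_sketch(c, p, eps):
    n = len(c)
    # Best single item: optimal when |S*| = 1, and it also sets the scale for kappa.
    best = (max(range(n), key=lambda i: p[i] * c[i]),)
    kappa = eps * value(best, c, p) / n
    c_hat = [floor(ci / kappa) for ci in c]

    high = [i for i in range(n) if p[i] >= 0.5]
    low = [i for i in range(n) if p[i] < 0.5]

    # Scaled DP over high-probability items: P[C] is the largest product of
    # probabilities among subsets of `high` with scaled profit exactly C.
    C_bar = sum(c_hat[i] for i in high)
    P = [0.0] * (C_bar + 1)
    chosen = [()] * (C_bar + 1)
    P[0] = 1.0
    for i in high:
        for C in range(C_bar, c_hat[i] - 1, -1):
            candidate = p[i] * P[C - c_hat[i]]
            if candidate > P[C]:
                P[C], chosen[C] = candidate, chosen[C - c_hat[i]] + (i,)

    # Add at most one low-probability item j (None means no such item is added).
    for j in [None] + low:
        cj, pj = (0, 1.0) if j is None else (c_hat[j], p[j])
        C = max(range(C_bar + 1), key=lambda C: (C + cj) * P[C] * pj)
        S = chosen[C] + (() if j is None else (j,))
        if S and P[C] > 0 and value(S, c, p) > value(best, c, p):
            best = S
    return best
```

In this sketch every item with $p_{i}\geq\frac{1}{2}$ satisfies $c_{i}\leq 2\max_{k}p_{k}c_{k}$, so the scaled table has at most $2n^{2}/\epsilon+1$ entries and the sketch runs in $O(n^{3}/\epsilon)$ time; this bound is derived for the sketch itself and is not quoted from the paper.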

4 Conclusion

We have established the NP-hardness of all-or-nothing maximum expected value subset. It also implies the hardness of constrained all-or-nothing subset problems in different graph settings. In particular one may consider an all-or-nothing maximum expected value matching, a similar problem, with activities and feasible subsets corresponding to edges and matchings in a graph, respectively. In ongoing work we develop an approximation scheme for this problem.

Acknowledgement

Noam Goldberg thanks Naoki Katoh for discussing [10] and referring him to [12] and [7], John Dickerson for a discussion of kidney exchange and referring to [4], and also Martin Milanič for comments.

Bibliography

[1] T. Badics and E. Boros. Minimization of half-products. Mathematics of Operations Research, 23(3):649–660, 1998.
[2] E. Boros and P.L. Hammer. Pseudo-Boolean optimization. Discrete Applied Mathematics, 123(1):155–225, 2002.
[3] R. Chandrasekaran and A. Tamir. Polynomial testing of the query "Is $a^{b}\geq c^{d}$?" with application to finding a minimal cost reliability ratio spanning tree. Discrete Applied Mathematics, 9(2):117–123, 1984.
[4] J.P. Dickerson, A.D. Procaccia, and T. Sandholm. Failure-aware kidney exchange. In Proceedings of the Fourteenth ACM Conference on Electronic Commerce, pages 323–340. ACM, 2013.
[5] G.V. Gens and E.V. Levner. Fast approximation algorithms for knapsack type problems. In Optimization Techniques: Proceedings of the 9th IFIP Conference on Optimization Techniques, Warsaw, September 4–8, 1979, pages 185–194. Springer Berlin Heidelberg, Berlin, Heidelberg, 1980.
[6] N. Goldberg. Nonzero-sum nonlinear network path interdiction with an application to inspection in terror networks. Naval Research Logistics, accepted, 2017.
[7] P.L. Hammer, P. Hansen, P.M. Pardalos, and D.J. Rader Jr. Maximizing the product of two linear functions in 0-1 variables. Optimization, 51(3):511–537, 2002.
[8] P. Hansen and C. Meyer. A polynomial algorithm for a class of 0–1 fractional programming problems involving composite functions, with an application to additive clustering. In Clusters, Orders, and Trees: Methods and Applications, pages 13–50. Springer, 2014.
