Rare-Event Simulation for Distribution Networks
Jose Blanchet, Juan Li, Marvin K. Nakayama

TL;DR
This paper develops importance sampling and Monte Carlo algorithms to efficiently estimate the probability of rare high-cost events in equilibrium allocations of distribution networks with Gaussian demands.
Contribution
It introduces novel algorithms with proven asymptotic efficiency for rare-event probability estimation in network equilibrium models.
Findings
Algorithms demonstrate strong numerical performance.
Asymptotic efficiency of the proposed methods is established.
Effective estimation of rare-event probabilities in network models.
Abstract
We model equilibrium allocations in a distribution network as the solution of a linear program (LP) which minimizes the cost of unserved demands across nodes in the network. The constraints in the LP dictate that once a given node's supply is exhausted, its unserved demand is distributed among neighboring nodes. All nodes do the same and the resulting solution is the equilibrium allocation. Assuming that the demands are random (following a jointly Gaussian law), our goal is to study the probability that the optimal cost (i.e. the cost of unserved demands in equilibrium) exceeds a large threshold, which is a rare event. Our contribution is the development of importance sampling and conditional Monte Carlo algorithms for estimating this probability. We establish the asymptotic efficiency of our algorithms and also present numerical results which illustrate strong performance of our…
| Naive Simulation | Importance Sampling | Conditional MC | ||||
|---|---|---|---|---|---|---|
| 1.5 | 6.77 | 5.04 | 6.76 | 1.59 | 6.69 | 4.35 |
| 2.5 | 6.44 | 5.34 | 6.19 | 4.40 | 6.21 | 7.74 |
| 3.2 | 6.10 | 5.63 | 6.92 | 8.82 | 6.88 | 1.14 |
| 3.9 | 8.00 | 4.27 | 4.82 | 4.68 | 4.83 | 1.43 |
| 4.5 | 0 | NaN | 3.39 | 1.62 | 3.30 | 1.84 |
| 4.9 | 0 | NaN | 4.80 | 7.08 | 4.89 | 2.03 |
| Naive Simulation | Importance Sampling | Conditional MC | ||||
|---|---|---|---|---|---|---|
| 1.0 | 3.64 | 1.21 | 3.67 | 9.57 | 3.66 | 2.00 |
| 1.3 | 3.05 | 1.39 | 3.38 | 2.09 | 3.38 | 6.85 |
| 1.5 | 2.10 | 2.00 | 2.70 | 6.14 | 2.73 | 2.28 |
| 1.6 | 4.00 | 1.04 | 3.20 | 2.19 | 3.23 | 3.79 |
| 1.7 | 0 | NaN | 4.13 | 1.09 | 4.02 | 6.07 |
| 1.8 | 0 | NaN | 7.34 | 5.24 | 7.26 | 6.87 |
| Naive Simulation | Importance Sampling | Conditional MC | ||||
|---|---|---|---|---|---|---|
| 1.20 | 3.29 | 2.09 | 3.22 | 2.94 | 3.23 | 5.96 |
| 1.50 | 2.72 | 2.16 | 2.58 | 1.06 | 2.61 | 2.96 |
| 1.70 | 2.80 | 2.03 | 3.03 | 3.33 | 3.03 | 1.20 |
| 1.95 | 1.00 | 5.78 | 1.18 | 2.34 | 1.17 | 4.47 |
| 2.05 | 0 | NaN | 2.92 | 6.39 | 3.02 | 9.92 |
| 2.16 | 0 | NaN | 3.83 | 3.07 | 3.84 | 2.15 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Rare-Event Simulation for Distribution Networks
Jose Blanchet
Department of Industrial Engineering and Operations Research, Department of Statistics
Columbia University, New York, NY 10027, [email protected]
Juan Li
Department of Industrial Engineering and Operations Research
Columbia University, New York, NY 10027, [email protected]
Marvin K. Nakayama
Department of Computer Science, New Jersey Institute of Technology
University Heights Newwark, NJ 07102, [email protected]
Abstract
We model equilibrium allocations in a distribution network as the solution of a linear program (LP) which minimizes the cost of unserved demands across nodes in the network. The constraints in the LP dictate that once a given node’s supply is exhausted, its unserved demand is distributed among neighboring nodes. All nodes do the same and the resulting solution is the equilibrium allocation. Assuming that the demands are random (following a jointly Gaussian law), our goal is to study the probability that the optimal cost (i.e. the cost of unserved demands in equilibrium) exceeds a large threshold, which is a rare event. Our contribution is the development of importance sampling and conditional Monte Carlo algorithms for estimating this probability. We establish the asymptotic efficiency of our algorithms and also present numerical results which illustrate strong performance of our procedures.
Key words: distribution network; linear program; rare event simulation; importance sampling; conditional Monte Carlo
1 Introduction
Consider the following model of a distribution network. We assume that there is a commodity to be distributed among various nodes in a network. Each node is endowed with a given supply of the commodity and at the same time it experiences a random demand. We assume that the commodity is infinitely divisible. If the demand at a given node exceeds its supply, then the excess demand is distributed according to some proportions to each of its neighbors, which in turn do the same. In order to obtain the distribution amounts in equilibrium, we solve a linear program (LP), where the objective function to minimize is the sum across nodes of the unserved demands.
One possible practical example where such a problem might arise is an electric power grid. Each node represents a geographic region, and there is an edge between two nodes if transmission lines directly connect them. Each region has generators, which provide the region’s supply of electricity. Also, each region has a random load (i.e., demand for electricity) from consumers. If a region’s load exceeds its supply, then the network tries to serve a node’s excess load by sending it to neighboring regions. One of the most important issues in operating a power grid is to keep the stability of the network and make sure demands can be satisfied. If the total amount of load not served at their originating regions exceeds a threshold , then we consider the network to have failed. To better operate this power transmission system, it is essential to estimate the probability that this network fails.
Another application involves load distribution for internet services, such as web servers, cloud-computing services, and domain name servers (DNS). A company may have a number of fixed-capacity servers situated in different geographic regions. As the requests to servers (i.e. the demand) arrive, a specific server tries to fulfill its own local requests, but if the demand exceeds its capacity, then the server may offload its excess to a neighboring server. Since this shifting may incur additional delays for the user, we want to minimize the amount of distributed demand. This is similar to load balancing; e.g., see [13].
Let be the probability that the sum of unserved demands, in equilibrium, exceeds threshold . Our goal is to estimate the probability , with , where is a rarity parameter and we scale the supply as a function of and we let increase reflecting a situation in which the supply is large. The parameter is allowed to grow with or can be constant. Assuming jointly distributed multivariate Gaussian demands, we provide asymptotically optimal estimators, together with numerical experiments showing their performance, and associated large deviations results. We recall that an unbiased estimator for is asymptotically optimal when goes to infinity if the logarithm of its second moment is asymptotically equivalent to the logarithm of (see [4], for notions of efficiency in rare-event simulation).
As far as we know, this paper provides the first type of large deviations analysis and efficient Monte Carlo for solutions of linear programs with random input. More precisely, our contributions are as follows:
-
For our model formulation, we show that our optimal allocation is invariant if one replaces the objective function by any other criterion that is increasing as a function of the unserved demands (see Theorem 3).
-
We establish large deviations analysis for our class of linear programs with random input (see Theorem 4).
-
We develop an importance sampling (IS) algorithm for estimating , and we show that the algorithm is asymptotically optimal as the supply gets large, and the threshold is a constant or increases with (see Section 5.2).
-
We develop a conditional Monte Carlo (CMC) algorithm for the evaluation of , and we prove the asymptotic optimality of this procedure as the supply gets large, and the threshold is a constant or increases with (see Section 5.3).
5) We provide several numerical examples in Section 6 that validate the performance of our algorithm.
Some of the results regarding CMC previously appeared in a conference version of this paper ([8]). Our conference paper restricted the LP’s objective function to be the sum of the unserved demands, and we now prove its invariance, as described in contribution 1), which greatly expands the applicability of our approach. Regarding contribution 2), we study the asymptotic behaviors of this network which is not discussed in the conference version. Regarding contribution 3), we develop an importance sampling algorithm which is not studied in the conference version, and we provide a proof of asymptotic optimality and algorithm implementation. As for contribution 4), although in the conference version, we have studied the CMC algorithm and its implementation (see [8], Section 4.3), no mathematical proof is provided regarding the asymptotic optimality of this algorithm. Here, in the journal version, we prove it rigorously. Finally, regarding contribution 5), instead of only comparing the naive simulation and CMC, we compare IS as well. In addition, to show the asymptotic optimality of our algorithms, we include numerical examples in which the rarity parameter changes.
We now explain how our paper relates to prior work. First, regarding 1), we note that similar results, with different types of networks and other applications, have been obtained in the literature (see [10]). We only learned about these applications after we obtained our model formulation, but we believe the connections are relevant. For the IS algorithm (contribution 2), we introduce a probability measure that is obtained by connecting the event of interest (i.e. total unserved demands in equilibrium exceeding a threshold) with a simple union event involving the demands. Then we use an IS distribution inspired by an approach developed by [1]. IS algorithms have also been used in [18], and [16] to solve a network operation problem with random inputs. While those two papers focus on the assessment of electrical constraints violation, we make use of IS to assess the solution of LP which involves optimization. Regarding the CMC estimator, we express the Gaussian demands in polar coordinates. Given the angle, the conditional probability of the LP’s optimal objective function value exceeding can be expressed as the probability of the radial component of the Gaussian lying in an interval or union of intervals, and this conditional probability can be computed analytically. The use of polar transformations for CMC and rare event simulation has been used in the past, see for example, [3]. [4], Chapters V and VI, provide additional background material on importance sampling and conditional Monte Carlo.
Our work also has other potential applications, in particular to cascading failures, which has been an interesting and important research topic. For example, [20] studies cascades in a sparse, random network of interacting agents whose decisions are determined by the actions of their neighbors according to a simple threshold rule. [9] consider a branching process model of cascading failures in an electric power grid. [12] analyze a continuous-time Markov chain of a dependability model with cascading failures. For another example, [6] study the temperature evolution of transmission lines. While that paper provides control algorithms to limit the probability that cascading failures happen. Our IS strategy (discussed in Section 5.2 ) can be modified to estimate the probability of observing a cascading failure under the policy advocated by that paper. The modification consists of defining, for instance, the objective function to be minimized as the worst case temperature over the lines in the network. Additional constraints need to be modified or approximated by linear constraints. [19] also study rare-event simulation for analyzing blackouts.
We would like to point out that although we assume multivariate Gaussian demands in this paper, the CMC algorithm can be applied to the case when the demands follow an elliptical distribution (see [14]). Furthermore, while an elliptical copula exhibits symmetric tail dependence, the well known Archimedean copula allows asymmetric tail dependence ([7]). Making use of the results in [15], we can see that CMC algorithm is also applicable to Archimedean copula, which makes this algorithm very powerful in solving a wide range of problems. also study rare-event simulation for analyzing blackouts.
The rest of the paper develops as follows. Section 2 presents the model of the distribution network, and it also defines the LP problem and its dual. We establish some properties of the primal and dual LPs in Section 3. The asymptotic behavior of the model is discussed in Section 4. We describe the asymptotic optimality and implementations of importance sampling and conditional Monte Carlo methods for estimating in Section 5. Section 6 contains the experimental results from some examples, and we give some final comments in Section 7.
2 Model Description
As we introduce our model and discuss its properties we will follow closely the discussion in [8]. Suppose there is a directed graph , where is the set of vertices and is the set of edges. The incidence matrix of the graph is denoted by , where if , and otherwise, and we assume for any . The network model we consider is induced by this graph, and we also assume the following:
1 The network is irreducible in the sense that the matrix is irreducible.
2 Each node has a given fixed supply .
3 Each node is subjected to a random demand . The demand vector is jointly Gaussian , where prime denotes transpose, is the mean vector, and is the covariance matrix.
4 The expectation of is less than or equal to for each node .
Each node tries to serve its realized demand. However, if a given node’s supply is exhausted, it distributes the unserved demand to its neighbors, which, in turn, do the same with their respective neighbors. Nevertheless, there is a cost associated with transferring unserved demands which should be minimized. We construct a linear program to describe this problem. The demands achieve an equilibrium point at each feasible solution, and the objective function is to minimize the sum of the excess demands across the nodes. Let , and the LP is:
[TABLE]
The quantity represents the shedded demand from node in equilibrium, which is distributed among its neighbors using a fixed distribution scheme, which we describe shortly. The quantity represents the unused supply at node in equilibrium. Therefore, in equilibrium, if , then node sheds demand; if , then node has unused supply. When node has excess demand, denotes the proportion of unserved demand at node distributed to node . We assume that if , then ; if , then . In addition, . The solution moves around excess demands and supplies to neighbors but does so in such a way that the sum of ’s, which are the equilibrium shedded demands, is minimized. The problem can be expressed in matrix notation as follows. Define (note that ). Let denote the -dimensional column vector with all components equal to . Then the previous linear programming problem (2) can be written as:
[TABLE]
where is the -dimensional column vector with all components equal to [math], , is the identity matrix, , and . The goal is that the sum of shedded demands is as small as possible because, e.g., the cost of distributing demands is high. If the cost is too high, for example, larger than a given number, say , or the LP is infeasible, we consider the network to have failed.
Note that while in [8], we assume that the unserved demands are equally distributed to neighbors, here we make a small but important extension. We allow the proportions to be any non-negative numbers.
Now, we also introduce the dual linear program:
[TABLE]
where and .
We are interested in computing the probability that the network fails, for different values of . Let represent this failure probability, and denote the optimal value of the dual when the demand vector is . As discussed in [8],
[TABLE]
where is the probability that the primal is infeasible, and is the probability that the primal is feasible, but the cost is larger than .
Since the discussion in Section 3 is valid for all , we do not define as a function of the rarity parameter until Section 4 .
3 Properties of Our Primal and Dual Linear Programs
3.1 Feasibility of the Solutions to the Primal and Dual
Our previous conference paper proves two theorems on properties of the primal and dual LPs for the special case when . We claim that both theorems are still valid for our more general , and the proofs are exactly the same. Here we only list the property regarding feasibility which will be used later, but omit the proof.
Theorem 1**.**
1
(a)
The dual problem (2) is always feasible.
(b)
The primal problem (2) is feasible if and only if .
3.2 Uniqueness and Positivity of the Solution to the Primal
Theorem 2**.**
When the primal problem (2) is feasible, it has the following properties:
(a)
It has a unique optimal solution.
(b)
At the optimal solution, at most one element in the pair is strictly positive, .
To emphasize the main results of the paper, we postpone the formal proof to Appendix A, and only give a brief explanation here. For (a), assuming there are two optimal solutions and making use of duality theorem of linear programing, we can prove that these two solutions are the same. Part (b) can be proved by contradiction.
3.3 Insensitivity of the Solution to the Primal
Theorem 3**.**
Suppose is the optimal solution to the problem
[TABLE]
where is differentiable and increasing with respect to . Let be another differentiable and increasing function. Then is also the optimal solution to the problem
[TABLE]
To prove it, we construct the solution of the dual problem and make use of Karush-Kuhn-Tucker (KKT) conditions. See [5] for more information about KKT conditions. A detailed proof appears in Appendix B.
Although Theorem 3 establishes the insensitivity of the optimal solution to a large class of nonlinear objective functions, for the rest of the paper, our discussion is based on the primal problem (2) and the dual problem (2) with linear objective functions.
4 Asymptotic Behavior
Now we discuss the asymptotic behavior of the failure probability of this distribution network, which will be useful when we develop efficient simulation algorithms for estimating the failure probability in the next section. We will now assume fixed number of vertices in the network. We next specify the vertices’ supplies and the distribution for the demands.
Let , represent these locations in this network, and . Suppose we have positive functions on , and on . For each node with location , there is a deterministic supply , where , is a rarity parameter, and a random demand , where the covariance between the demands at two vertices with locations and is . Also note that only the supply function involves , not the demand function. Let be the covariance matrix of , which we require to be symmetric positive definite.
We first introduce the little notion, which is used in the theorem that will be discussed momentarily.
Definition 1**.**
Let and be two functions defined one some subset of the real numbers. Then if for every , there exists a real number such that for all , we have .
We now establish a theorem that describes the asymptotic behavior of this network. More specifically, it tells what is the most likely way in which this network fails. This result is crucial in designing an efficient importance-sampling algorithm.
Theorem 4**.**
Let denote the optimal value of the dual (2), when the demand vector is and the rarity parameter is . Then for all with ,
[TABLE]
where .
To prove this result, we derive upper and lower bounds with the same limit . The details appear in Appendix C.
5 Efficient Algorithms: Importance Sampling and Conditional Monte Carlo
5.1 Asymptotic Optimality
Suppose are locations of vertices. When is large, the failure of this network is a rare event. To estimate this failure probability, we develop two efficient simulation algorithms: one based on importance sampling (IS) and the other using conditional Monte Carlo (CMC). To evaluate the efficiency of these two algorithms, we need to introduce a definition.
Definition 2**.**
A collection of estimators for is said to be asymptotically optimal if and if
[TABLE]
Asymptotic optimality also amounts to showing that
[TABLE]
5.2 Importance Sampling
We now develop an IS estimator making use of a new probability measure :
[TABLE]
where is a Borel set, and
[TABLE]
Note that is a mixture of measures, where the -th measure in the mixture is the conditional distribution given that the -th node’s demand exceeds its supply. In other words, we force the demand to be larger than the supply for at least one node such that this network fails more often under the new measure. Since
[TABLE]
it is easy to see that
[TABLE]
5.2.1 Asymptotic Optimality
We next establish the asymptotic optimality of the IS approach based on .
Theorem 5**.**
[TABLE]
is an asymptotically optimal estimator for , where .
To prove this result, we find an upper bound of with limit 2, and make use of Theorem 4. The proof appears in Appendix D.
5.2.2 Algorithm Implementation
We now explain how to implement the IS algorithm.
Set and let be the total number of replications to simulate. 2. 2.
Generate demand vector from distribution as in (7). To do this, we choose a node with probability , and begin by generating untruncated normal variables and reject those if the demand of node does not exceed its supply. If the acceptance rate becomes too small after some iterations with escalating sample sizes, we switch to use a Gibbs sampler algorithm described in [17] to sample truncated normal variables. 3. 3.
Calculate . 4. 4.
If , set and go to step 2; otherwise, go to step 5. 5. 5.
Compute as our importance-sampling estimator of , and a confidence interval for is , where \widehat{S}^{2}=\big{(}\sum_{i=1}^{N}(Z_{n}(\boldsymbol{D}^{(i)})-\widehat{\alpha}_{n}(k_{n}))^{2}\big{)}/(N-1), and is the distribution function of a standard normal.
5.3 Conditional Monte Carlo
We first briefly introduce the Conditional Monte Carlo (CMC) approach, which is a variance-reduction technique. Suppose we are interested in estimating , and is an unbiased estimator. According to the conditional variance formula: , we have . Therefore, using as an estimator may help to reduce variance.
Now we explain how CMC is applied to our problem to estimate . Note that the multivariate-normal random demand has polar-coordinate representation (see [14])
[TABLE]
where the radius satisfies , i.e., its density function , is the gamma function, , the angle , is uniformly distributed over the unit sphere, , and . In addition, the radius and angle are independent.
Making use of this representation, [8] developed a conditional Monte Carlo approach for estimating , along with algorithmic details on how to implement the method. However, we did not discuss the optimality of the CMC algorithm in the conference paper. We now provide such an analysis.
5.3.1 Asymptotic Optimality
Recall that we defined in Section 4 the deterministic supply of node at location as , where is a constant, is the rarity parameter, and is a fixed positive function.
Theorem 6**.**
1 For , there exist , , , , such that when ,
[TABLE]
[TABLE]
Also, the conditional Monte Carlo estimator is asymptotically optimal.
To prove (9), since the dual problem is an LP, we only need to consider the extreme points of the feasible region. Making use of the polar-coordinate representation of the random demand, we show that is equal to the conditional probability that the radius is larger than a function of , which has minimum value when is large enough.
To prove (10), we show that is equal to the probability that radius is larger than a function of . We then find a lower bound by considering a small ball when is large enough.
The asymptotical optimality follows since we have found an upper found of , and a lower bound of , which is less than or equal to 2 when is large enough. The complete proof appears in Appendix E.
6 Numerical Examples
Here we use the same basis for comparing the estimators using different simulation algorithms as in [8]. Suppose we want to estimate , and are independent replications of . Then is an unbiased estimator of , and is an unbiased estimator of , which we assume is finite. We then define the * (relative standard error)* as . To consider both the accuracy and computational efficiency when comparing different unbiased estimators, as suggested in [11], we use the relative measure (Computing Time) as the criterion.
In our experiments we apply naive simulation, importance sampling, and conditional Monte Carlo methods to different networks, and compare . For each example, assume locations have been chosen, we give incidence matrix , supply parameter , and demand parameters . We have proven the asymptotic optimality of the IS and CMC estimators when the threshold is a constant or increases with the rarity parameter . Examples 1 and 2 show how failure probability changes with for constant . Example 3 shows how failure probability changes when is a function of , with and . We set the sample size for all of the three examples.
We choose parameters based on the following considerations:
- •
Network size : we did three experiments with networks of three different sizes , , and . We believe that a network with 30 nodes represents a sufficiently large example for actual applications. In addition, these experiments are used to compare the relative efficiency among different simulation algorithms. While larger networks take more time to simulate, we expect that the results across the methods would be similar.
- •
Incidence matrix : it was chosen so that the network is irreducible.
- •
Supply and demand related parameters : it is not easy to obtain this information from real-life examples, so we constructed them so that failure rarely happens.
- •
Scale parameters , rarity parameter and threshold : they were chosen so that failure probability exhibits different orders of magnitude. Although our results establish asymptotic optimality of the IS and CMC estimators, the experiments consider a range of parameter values to study when is not too small so we can assess the performance.
6.1 Example 1: , fixed
The first example is a 3-dimensional network with the following parameters:
[TABLE]
6.2 Example 2: , fixed
The second example is a 10-dimensional network with the following parameters:
for , , , , , ,, , , , , , . All other elements of are equal to [math].
, , .
[TABLE]
6.3 Example 3: , changes with
The third example is a 30-dimensional network with the following parameters:
. . All other elements of are equal to 0.
. , .
. All other elements of are equal to 0.4.
6.4 Discussion of Results and Comparisons Between Algorithms
When increases, the performance of both the naive simulation and IS deteriorates quickly in terms of . Because we fix the number of simulations , as in Example 1, 2, and 3, when is very large, we do not get even one observation of the event . However, although the performance of CMC becomes worse as well, it does not deteriorate as quickly as the other two. No matter how large is, we can obtain a non-zero estimate of . 2. 2.
Although both IS and CMC are asymptotically optimal, when is small, IS performs better than CMC, as we now explain. The IS method only needs to solve a single optimization problem to determine (see Section 5.2.2) in each replication . In contrast, our conditional Monte Carlo method needs to solve several optimization problems to find the roots which equate the optimal value of the primal and the threshold for a fixed angle (see equation (8) in [8]) in each replication . Thus, the added computational effort required by CMC can lead to it performing worse than IS. However, as increases, conditional Monte Carlo method works much better. The larger is, the bigger the advantage CMC has compared to naive simulation. The advantage arises because of the significant variance reduction obtained for large overwhelms the additional computational effort. In conclusion, for a given network, IS performs best when is small, and CMC is better when is large. 3. 3.
We have established the asymptotic optimality of our methods as the rarity parameter . But as with any technique for which an asymptotic property has been proven, the performance for fixed when the asymptotics are not yet in effect may differ from that for large , and may not outperform naive simulation. We explore this by varying in our experiments.
7 Final Comments
We discuss a distribution network model with each node subjected to given fixed supply and Gaussian random demand. The unserved demand at a node is distributed proportionally to its neighbors. The equilibrium point is determined by a linear program whose objective is to minimizing the sum of excess demands across all nodes in this network. We developed IS and CMC approaches to efficiently estimate the failure probability. Numerical results show that these two algorithms greatly outperform naive simulation, especially when the threshold is large.
We can make several extensions.
- •
Cost Structure: We assume unit cost associated with pushing unit demand from one node to another. In other words, let be the cost by distributing unit demand from node to node . Currently, for all . We can generalize this setting by using a path dependent cost structure, which means can be different for different . At the same time, the objective function of the primal problem (2) now becomes
[TABLE]
Here, we claim that, all theorems in the paper are still valid for the generalized structure as long as for all . To see this, Theorem 3 has generalized the cost structure for Theorems 1 and 2. We can also prove Theorems 4, 5, and 6 with straightforward modifications.
- •
Elliptical Copula: For CMC algorithm, note that the algorithm requires that the radial component, , is a positive continuous random variable and that we are able to calculate the root for the optimal value of the primal as a function of conditional on the angular part, . Therefore the conditional Monte Carlo algorithm applies as long as the demand vector is an elliptical copula.
- •
Growing Number of Nodes: In this paper, all of our discussion focuses on a given graph with a fixed number of nodes. We can also consider the asymptotic behavior of a graph when the number of nodes grows large. Similar properties and simulation algorithms can be developed by embedding the Gaussian vector of demands in a continuous Gaussian random field, so that Borell-TIS inequality ([2], p. 50) can be applied in the proof of Theorem 4.
ACKNOWLEDGMENTS
Support from NSF grants CMMI-1069064 and CMMI-1436700 is gratefully acknowledged by the first author.
The work of the third author has been supported in part by the NSF under Grants No. CMMI-0926949, and CMMI-1200065. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.
Appendix A Proof of Theorem 2
Proof.
Proof: Suppose both and are optimal solutions. Let , which is of dimension . We want to prove that . Consider the following linear program:
[TABLE]
where is a -dimensional vector with the th element equal to 1 and other elements equal to 0. Equivalently, we write the LP as
[TABLE]
where . Then we only need to prove the above LP is infeasible for all . Consider the corresponding dual problem:
[TABLE]
Then, for all , is a feasible solution to since . The value of the objective function is . Due to the arbitrariness of , we see that the optimal value of the dual is unbounded. Therefore, for all , the primal is infeasible. Hence, each element of d must be 0, which means that , proving part (a). Note that the objective function of the LP can be of multiple forms since we only aim to prove the infeasibility, and different choice of the objective function only leads to different construction of and .
To establish (b), suppose is the optimal solution of the primal (2). Suppose for some , both and are strictly positive, i.e., and for some . Let , , and define a new vector as follows:
[TABLE]
Then it is not hard to show that is a feasible solution to the problem (2). In addition, the value of the objective function at is strictly less than the value at , which conflicts with the optimality of . Therefore, at least one element in the pair is zero, . ∎
Appendix B Proof of Theorem 3
Proof.
Proof: Consider the problem
[TABLE]
Suppose is the optimal solutions to , and the Lagrange function is
[TABLE]
Then and satisfy the Karush-Kuhn-Tucher (KKT) conditions when , i.e.
[TABLE]
where represents the gradient of with respect to . Now we would like to construct the dual solution vector , such that when , and satisfy the above KKT conditions. Then we can claim that is also the optimal solution when . Define , and . For each , set ; and for each , set . Without loss of generality we assume that . Let , , and . Let be a diagonal matrix with the first diagonal elements equal to and the remaining elements equal to 0. Considering the second KKT condition, the first KKT condition becomes
[TABLE]
Notice that the matrix is irreducible and stochastic. Also we claim that cannot be the identity matrix with probability 1. To see this, suppose is the identity matrix, in other words, . Note that the conclusion of Theorem 2(b) is still valid when the objective function is , and the proof is exactly the same. Then . Adding all constraints in the primal problem (2) gives us . But this equality holds with probability 0. Therefore, is invertible with probability 1, and . Because is increasing in and , we have that . It is obvious that and satisfy the above KKT conditions when . ∎
Appendix C Proof of Theorem 4
Proof.
Proof: We will prove this result by establishing upper and lower bounds on . We start with deriving an upper bound. Note that follows standard Gaussian distribution. We first claim that
[TABLE]
To see this, if we assume , then . According to Theorem 1(b), the primal problem (2) is feasible, and it is easy to see that is an optimal solution to the primal problem. In this case . Thus , where “” represents the complement of a set, and Equation (11) is valid. Therefore,
[TABLE]
Set . Note that when is large enough, . Then
[TABLE]
where is some positive constant, and the last step makes use of the fact that if a random variable follows standard Gaussian distribution, then for any , . This establishes the desired upper bound on .
To obtain a lower bound on the probability, define , , where is some constant. We now claim that
[TABLE]
To see this, note that if , then there exists some such that . Let be the vector with the -th element equal to 1 and the rest of the elements equal to 0. It is easy to see that is a feasible solution to the dual problem (2) and . Therefore, . Then,
[TABLE]
where is some positive constant, and the second-to-last step applied the fact that if a random variable , where , then for all ,
[TABLE]
giving us the desired lower bound on .
Therefore, (C), (13), and (14) imply for sufficiently large,
[TABLE]
Taking logarithms, we have
[TABLE]
Because
[TABLE]
it follows that
[TABLE]
thereby verifying (5) and (6).
∎
Appendix D Proof of Theorem 5
Proof.
Proof: Let denote the expectation under , so by (11), we have
[TABLE]
Since implies , and under measure , ,
[TABLE]
Thus
[TABLE]
Since
[TABLE]
we have
[TABLE]
Therefore,
[TABLE]
where the last equation follows from Theorem 4. ∎
Appendix E Proof of Theorem 6
Proof.
Proof: We first prove (9). Let denote the feasible region of the dual problem (2). Then , where as defined in Section 4. We are interested in the failure probability, which includes two cases as we noted previously in Section 2. One case is that the primal problem is infeasible, which, according to Theorem 1(b), occurs if and only if when . The other case is that the primal problem is feasible but the optimal value is greater than . Since the dual problem is an LP, for the second case, we can focus on the extreme points of the feasible region . Since , when , the optimal value is 0, so we do not have a failure. Therefore, we do not need to consider the solution when calculating the failure probability.
Suppose are the extreme points of , excluding , and we have
[TABLE]
where , and
[TABLE]
Let .Then when , we have . Recall that is a positive random variable, so
[TABLE]
Define
[TABLE]
[TABLE]
For , define
[TABLE]
[TABLE]
It is easy to see that when ,
[TABLE]
In the non-trivial case when , there exists some . Let . Define
[TABLE]
Let us consider inequality (9) first. We have
[TABLE]
and
[TABLE]
Note that both and are continuous with respect to on the compact set . Then there exist and such that
[TABLE]
[TABLE]
Therefore,
[TABLE]
Then we have
[TABLE]
Let , then (9) is established.
Now we consider the inequality (10). We claim that for any in , there exists such that when ,
[TABLE]
where is the corresponding to . To see why this is true, observe that for any ,
[TABLE]
We know that . Define
[TABLE]
Choose
[TABLE]
then . For , note that both and are bounded on . Then there exist , such that
[TABLE]
[TABLE]
Since , there exists , such that when , . Therefore, when , it follows that , so
[TABLE]
We also claim that there exist , , such that if , then on . To see this, for any and , define . Note that there exists , such that when , and , for any , we have that the index corresponding to is in , and
[TABLE]
Since is continuous on , there exists such that when , we have
[TABLE]
where is some positive constant.
Define , . Since , there exists , such that when , we have . Therefore,
[TABLE]
So for any ,
[TABLE]
Since is uniformly distributed over the unit sphere, which is a -dimensional manifold, there exists some constant such that
[TABLE]
Let . By equations (16) and (20), it follows that
[TABLE]
Hence, we have proven (10).
We now establish the last part of the theorem. By (9) and (21), we have
[TABLE]
Recall that when , so (17) implies
[TABLE]
Therefore,
[TABLE]
and
[TABLE]
When , the second term inside the parentheses in (22) is non-negative. Then when , it follows that (22) is bounded above by 2, thereby concluding the result. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Adler, R.J., J. H. Blanchet, J. Liu. 2012. Efficient monte carlo for high excursions of gaussian random fields. Annals of Applied Probability 22 1167-1214.
- 2[2] Adler, R.J., J. E. Taylor. 2007. Random Fields and Geometry. Springer, New York.
- 3[3] Asmussen, S., J. Blanchent, S. Juneja, L. Rojas-Nandayapa. 2011. Efficient simulation of tail probabilities of sums of correlated lognormals. Annals of Operations Research 189 5-23.
- 4[4] Asmussen, S., P. Glynn. 2007. Stochastic Simulation: Algorithms and Analysis. Springer, New York.
- 5[5] Bertsimas, D. and N. Tsitsiklis. 1997. Introduction to Linear Optimization. Athena Scientific, Massachusetts.
- 6[6] Bienstock, D., J. Blanchet, J. Li. 2016. Stochastic models and control for electrical power line temperature. Energy Systems 7 1 173-192.
- 7[7] Brechmann, E. C., Hendrich, K., and Czado, C. 2013. Conditional copula simulation for systemic risk stress testing. Insurance: Mathematics and Economics 53 3 722-732.
- 8[8] Blanchet, J., J. Li, M.K. Nakayama. 2011. A conditional monte carlo method for estimating the failure probability of a distribution network with random demands. Proceedings of the 2011 Winter Simulation Conference (WSC), 3832-3843.
