A discontinuous Galerkin method for approximating the stationary distribution of stochastic fluid-fluid processes
Nigel Bean, Giang T. Nguyen, Malgorzata M. O'Reilly, Vikram Sunkara

TL;DR
This paper develops a discontinuous Galerkin numerical method to approximate the stationary distribution of stochastic fluid-fluid processes, enabling practical computation where previous formulas were analytically explicit but numerically intractable.
Contribution
It introduces a novel discontinuous Galerkin approach for numerical approximation of the stationary distribution in stochastic fluid-fluid processes.
Findings
Successfully applied to an on-off bandwidth sharing system
Demonstrates accurate approximation of stationary distributions
Provides a practical computational tool for complex stochastic processes
Abstract
Introduced by Bean and O'Reilly (2014), a stochastic fluid-fluid process is a Markov processes , where the first fluid is driven by the Markov chain , and the second fluid is driven by as well as by . That paper derived a closed-form expression for the joint stationary distribution, given in terms of operators acting on measures, which does not lend itself easily to numerical computations. Here, we construct a discontinuous Galerkin method for approximating this stationary distribution, and illustrate the methodology using an on-off bandwidth sharing system, which is a special case of a stochastic fluid-fluid process.
| Buffer | Buffer and Phase | ||
| , | |||
| or | |||
| or | 0 | ||
| Buffer | Buffer and Phase | ||
| , | |||
| or | |||
| or | 0 | ||
| 0.0 | 0.184 | 0.312 | ||
| 1.0 | 0.816 | 0.688 |
| Basis | Error | Comp. Times | # of Elements | Overall Storage | |
|---|---|---|---|---|---|
| Piecewise Constant | 1.5 | 0.58 | 0.21 sec | 88 | 0.5 MB |
| 0.5 | 0.25 | 0.31 sec | 248 | 4.1 MB | |
| 0.05 | 0.03 | 23 sec | 2408 | 380 MB | |
| Piecewise Linear | 1.5 | 0.01 | 0.21 sec | 168 | 2.1 MB |
| 0.5 | 0.78 sec | 488 | 16 MB | ||
| 0.05 | 130 sec | 4816 | 1.5 GB |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Transportation Planning and Optimization · Simulation Techniques and Applications
A Discontinuous Galerkin Method for
Approximating the Stationary Distribution of Stochastic Fluid-Fluid Processes
Nigel Bean
The University of Adelaide, School of Mathematical Sciences
ARC Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS)
Giang T. Nguyen
Małgorzata M. O’Reilly
The University of Tasmania, Faculty of Science, Engineering, and Technology
Vikram Sunkara
Freie Universität Berlin, Department of Mathematics and Computer Science
Konrad-Zuse-Zentrum for Informationstechnik, Department of Numerical Mathematics
Abstract
Introduced by Bean and O’Reilly (2014), a stochastic fluid-fluid process is a Markov processes , where the first fluid is driven by the Markov chain , and the second fluid is driven by as well as by . That paper derived a closed-form expression for the joint stationary distribution, given in terms of operators acting on measures, which does not lend itself easily to numerical computations.
Here, we construct a discontinuous Galerkin method for approximating this stationary distribution, and illustrate the methodology using an on-off bandwidth sharing system, which is a special case of a stochastic fluid-fluid process.
keywords:
stochastic fluid–fluid processes; stationary distribution; discontinuous Galerkin method
1 Introduction
A stochastic fluid process is a two-dimensional Markov process, where the phase is a continuous-time Markov chain on a finite state space , and the fluid varies linearly at rate . A subset of Markov additive processes, stochastic fluids have been well-analysed in the past two decades. There have been two recent generalisations of stochastic fluid processes to a higher dimension: Miyazawa and Zwart [1] analysed discrete-time multidimensional Markov additive processes, and Bean and O’Reilly [2] studied the so-called stochastic fluid-fluid process, the latter is our focus in this paper.
A stochastic fluid-fluid is a Markov process , where the phase is still a Markov chain on a finite state space ; is the first fluid, which varies linearly at rate
[TABLE]
and is the second fluid, which varies linearly at rate :
[TABLE]
As the classic fluid process is used extensively in many areas, such as insurance and environmental modelling, it is clear that stochastic fluid-fluid models have even a wider range of applicability.
An example of application for a stochastic fluid fluid is the modelling of growth and bleaching of coral reefs, as described in [2]. In this process, we can model the density of symbiotic zooxanthellae at time by , with the positive rates corresponding to the growth of the zooxanthellae, the negative rates to the bleaching. If the density is below a certain threshold , the coral cannot store lipids until the density increases again past this level. During the time , the coral relies on stored lipids, modelled as , and dies when the latter runs out, that is, .
While the analyses in [1, 2] are markedly different, both papers drew inspiration from Neuts’ matrix-analytic approach [3, 4] to obtain the limiting behaviour of these processes, working with operators on function spaces instead of matrices. Thus, their closed-form expressions for the limiting distributions ([1, Theorem 4.1], [2, Theorem 2]) are given in terms of operators acting on measures, which are not immediately amenable to numerical computations for real-life applications. One way to numerically handle operators on function spaces is to construct approximations of the operators. To this end, there exist numerical procedures such as finite difference, finite volume, finite element, and discontinuous Galerkin (DG) methods [5]. The operators arising from fluid-fluid processes are assumed to be acting on a function space of smooth probability densities. The choice of an approximation method should reflect these properties in its solutions. In the DG method, the conservation of probability mass and local smoothness can be captured in the approximations [5].
In this paper, we construct a discontinuous Galerkin method to approximate the joint stationary distribution of a stochastic fluid-fluid process. We numerically illustrate the effectiveness of the methodology using an on-off bandwidth-sharing system of two processors [6]. In this example, inputs into the processors, and , are turned on and off by a Markov chain, ; the combined output capacity is fixed and allocated according to the workload of the first, high-priority, processor . Latouche et al. [6] evaluated the marginal limiting distribution of the first processor , and provided bounds for the marginal limiting distribution of the workload of the second processor . We verify our DG approximations by comparing them against Monte Carlo simulations of the system, against analytical results obtained in [6], and against our intuitive understanding of the system dynamics. In all considered cases, we find the approximations to be accurate.
The paper is organised as follows. In Section 2, we give relevant background to present the joint stationary distribution of a stochastic fluid-fluid process. We construct in Section 3 a discontinuous Galerkin method to approximate the stationary distribution, and include numerical experiments in Section 4.
2 Preliminaries
Consider a stochastic fluid-fluid process . We assume that and that there is a regulated boundary at level [math] for both buffers:
[TABLE]
for . Let be the irreducible generator for the finite Markov chain . We denote by the diagonal fluid-rate matrix for , and the diagonal fluid-rate matrix for . For the remainder of this section, we summarize the findings of [2] on the joint stationary distribution of .
Let be the state space of , so . For each Markovian state , we partition according to the rates of change for the second fluid : where
[TABLE]
For all , the functions are assumed to be sufficiently well-behaved that , , is a finite union of intervals and isolated points. Moreover, define , , and .
We assume that the process is positive recurrent, in order to guarantee the existence of the joint stationary density operator and the joint stationary mass operator , where for
[TABLE]
The determination of involves two important matrices of operators, and . Intuitively, for a set and a measure vector , gives the conditional probability of , and the conditional probability of returning to level zero and doing so when , given that the initial distribution is .
2.1 Matrix of Operators
Let be the set of integrable complex-valued Borel measures on the Borel -algebra . For , we can write
[TABLE]
where , the set of integrable complex-valued Borel measures on , and
[TABLE]
We denote by the matrix of operators , and , which are defined for as follows:
[TABLE]
the probability of the process being in the destination set at time , given that it starts in according to the measure . We can write in terms of its infinitesimal generator as
[TABLE]
For and , as well as other operators of the same dimensions to be introduced later in the paper, the operators are partitioned according to ; for example,
[TABLE]
where each block is the matrix of operators . We note that forms a strongly continuous semigroup on the set of measures that are absolutely continuous with respect to Lebesgue measure and have analytic densities as well as possibly a point mass at zero in certain phases [2]. Thus, we restrict the domain of to the set of such measures, and write
[TABLE]
where is the associated density and is the probability mass at the boundary [math] when , for and .
For simplicity, from here on we assume a set is an interval, which might or might not include its end points, that is, . By [2, Lemma 3], the operators , and , are given as follows. We include also brief probabilistic interpretations of the terms (see remarks in [2] for more details). Note that in all of the following cases, we assume ; this is without loss of generality, as for any set and a measure in our chosen domain,
[TABLE]
- Case 1.
When ,
[TABLE]
Case represents when there is a stochastic jump from state to state , which happens with rate . The integral represents the probability mass of the intersection of the initiating domain and the destination set . If , then the point mass is preserved after the change of phase. If , then the point mass disperses into the density in state , and is captured only if has an upper bound strictly greater than [math]. Note that in this case [math] does not have to be in , it only has to be in the closure of .
- Case 2.
When and ,
[TABLE]
where denotes the left boundary point that mustn’t also be the right boundary point of the closure of the set , and similarly for .
Case represents a drift from to . When neither nor is an isolated point, there is a transfer of density from one set to another through the relevant endpoints. When , , and [math] is the left endpoint of the closure of (which can’t be an isolated point itself), then there is also an accumulation of point mass.
- Case 3.
When and ,
[TABLE]
where denotes the right boundary point of the closure of , and similarly for . Case represents stochastic jumps out of state and drift across .
2.2 Matrix of Operators
We denote by the matrix of operators recording the Laplace-Stieltjes transforms of the time for to return, for the first time, to the initial level of zero. Define the stopping time to be the first time hits level , then each component and , is given by
[TABLE]
Let be the total unregulated amount of fluid that has flowed into or out of the second buffer during , and let be the first time this accumulated in-out amount hits level . We denote by the matrix of operators recording the Laplace-Stieltjes transforms of :
[TABLE]
where is the matrix of operators , for , , and Re. Each operator , for and , is given by
[TABLE]
We can write
[TABLE]
where is the infinitesimal generator of the strongly continuous semigroup . Lemma of [2] gives the following expression for .
Lemma 2.1
For , with Re, , and ,
[TABLE]
where is a diagonal matrix of operators given by
[TABLE]
By [2, Theorem 1], has the following characterisation.
Theorem 2.2
For Re, satisfies the equation:
[TABLE]
Furthermore, if is real then is the minimal nonnegative solution.
2.3 Stationary Distribution
Let . We define , for , to be the sequence of hitting times to level [math] of , with . Consider a discrete-time Markov process , and for define the measure as follows
[TABLE]
By [2], the vector of measures satisfies the following set of equations
[TABLE]
We reproduce Theorem 2 of [2] below, which gives the joint stationary distribution of . Recall that the joint stationary density operator for and the joint stationary mass operator are defined by (4) and (5), respectively. We can write
[TABLE]
where
[TABLE]
Theorem 2.3
The density , for and , and the probability mass , for , satisfy the following set of equations:
[TABLE]
where and is a normalizing constant.
3 Discontinuous Galerkin Approximations
Discontinuous Galerkin methods are used to approximate the solution to a system of partial differential equations. A brief description of these methods is as follows [5]. On the domain of the approximation, consider a finite sequence of so-called nodal points. We refer to each interval between two consecutive nodal points as a mesh, and the combination of meshes and nodal points as a stencil. Within each mesh, we have a finite element approximation, which constructs a finite-dimensional smooth Sobolev space by choosing appropriate piecewise polynomial basis functions, and then projects the partial differential equations onto this space. This projection leads to a new system of equations, referred to as the weak form of the original PDEs.
There is a flux operator moving probability from one mesh to another, in a manner similar to the underlying principle of a finite volume approximation: integrating the PDEs over each mesh and then constructing a new system of ordinary differential equations, which describe the change in the integral over the mesh. This method conserves probability, and can handle discontinuities, such as jumps and point masses.
Discontinuous Galerkin methods lead to global approximations in the space of piecewise functions. Intuitively, we sacrifice the continuity between meshes to gain the conservation of probability.
3.1 Application to a Stochastic Fluid-Fluid Model
Here, we construct a discontinuous Galerkin method to approximate the operator matrix and subsequently the operator matrix , the two key ingredients of the joint stationary distribution for . We begin with approximating the joint density of :
[TABLE]
which satisfies the system of partial differential equations
[TABLE]
subject to suitable boundary conditions [2].
While , any numerical approximation by necessity has to take place on a finite interval. Clearly, the state space truncation results in a point mass at the upper bound, which we have to address properly. It is important to choose an interval sufficiently large in order to control the error induced by the artificial upper bound for . We shall further comment on this in Section 4, where we report our numerical experiments.
Let be the domain of the approximation, where . We denote by a finite sequence of nodal points on , with and , and by the sequence of corresponding meshes, , for . As there are two point masses, one at zero where there is a regulated boundary for and another at where there is an artificial upper bound, the two end meshes, and , each of which contains both a point mass and density.
For , we choose functions , , to be the basis functions, the span of which forms our approximation space, . Then, a function has the form:
[TABLE]
for some coefficient functions . To construct an approximation for we need to determine these functions or, equivalently, the -dimensional row vector
[TABLE]
where
[TABLE]
and is the total number of basis functions on .
To that end, there are three important matrices we need to introduce. The first two matrices, and , are block-diagonal and detail the dynamics within each mesh:
[TABLE]
where, for ,
[TABLE]
The third matrix, , , is related to the dynamics between adjacent meshes: it is the flux operator moving probability from one mesh to another. Let be the projection of onto the mesh . A central idea of the discontinuous Galerkin method is that the values of on different meshes are linked to each other only through a numerical flux and that one can consider the approximation
[TABLE]
where and denote the right and left endpoints of the th mesh, respectively, and for any function .
There are many options for . Here, we choose a first-order up-winding scheme [7], that is,
[TABLE]
for and , where is an adjustment parameter for when we have a stencil with meshes of different structures; note that the numerical flux is defined at nodal points only. Suppose we are considering the th mesh, . Then, we define the function to be
[TABLE]
Here, the function , , is the ratio of the integrals of the basis functions transferring the probability from the th mesh to the th mesh, where
[TABLE]
for one and then all and . In other words, is not dependent on the particular choice of basis functions, and , but on their meshes, and , only.
This choice of requires the flux information only from the left if the fluid rate of is positive, and only from the right if that rate is negative. Intuitively, when , the numerical flux of probability going from into is the probability density accumulated on the right-hand edge of the approximation on . Similarly, when , the numerical flux going from into is the probability density accumulated on the left-hand edge of the approximation on .
Thus, on nodal points we have
[TABLE]
where for any function
[TABLE]
and is used to allow us access to the density on the left-hand edge of and the right-hand edge of , respectively.
We are now ready to introduce the block-tridiagonal matrix ,
[TABLE]
where each sub-block is of dimension and thus
[TABLE]
is an -dimensional row vector. Let be the vector containing all basis functions on the th mesh. We define for
[TABLE]
Lemma 3.1
For and ,
[TABLE]
**Proof ** We begin with the RHS of (41). For ,
[TABLE]
As each basis function , for , is trivially zero outside its th mesh, we have
[TABLE]
Substituting (43) and (44) into (42) leads to
[TABLE]
The argument for follows analogously. \qed
Theorem 3.2
The weak formulation of the PDEs (26) is the following system of ordinary differential equations
[TABLE]
**Proof ** Consider Equation (26), for , which we restate below:
[TABLE]
For details on the steps of discontinuous Galerkin methods, see [5]. Here, we start by replacing the density function in (46) by its approximation to obtain
[TABLE]
Multiplying both sides of (47) by a basis function and , and then integrating over the approximation domain gives
[TABLE]
Expanding and then integrating the third term by parts leads to
[TABLE]
where the first part can be approximated by using the numerical flux , that is,
[TABLE]
Substituting (49) into (48) and expanding gives
[TABLE]
Finally, we switch the order of integrals and summations to arrive at
[TABLE]
Equivalently, this can be written in matrix form as
[TABLE]
which completes the proof. \qed
Remark 3.3
Let be the approximation space as defined in (27). If
the weights satisfy (45), 2. 2.
the eigenvalues of are in the negative real half of the complex plane including zero, and 3. 3.
only one basis function is non-zero on each boundary of each mesh,
then
[TABLE]
Intuitively, the second and third conditions are to ensure stability in the solution and conservation of the transfer of probability across meshes, respectively.
Defining the discontinuous Galerkin infinitesimal operator
[TABLE]
we construct a DG approximation for the operator matrix as follows. For and , define to be an index set: given a region of (defined in (1–3)), is the set of meshes included in . For example, if , then . Note that the nodal points should be chosen such that for all , and .
We choose each approximation matrix to be . Similar to the definition for the operator matrix , introduced in Section 2.1, there are three cases.
- Case 1.
When , each sub-block is given by
[TABLE] 2. Case 2.
When and ,
[TABLE] 3. Case 3.
When and ,
[TABLE]
Next, we approximate the operators by an block-diagonal matrix , with diagonal sub-blocks given by
[TABLE]
where is an approximation of the fluid rate of , given , , and the basis function . These functions can sensibly be chosen to be:
[TABLE]
Define for , and for . Putting things together, a DG approximation of the operator is given by the matrix
[TABLE]
for with Re and for .
Let , then a DG approximation of the operator , the latter describing the probability of the fluid starting at level zero and returning there for the first time, is an matrix solution to the equation
[TABLE]
The matrix equation (52) can be solved using one of the many algorithms suggested by Bean et al. [8]. We then use this approximation to evaluate the limiting density, according to Theorem 2.3.
3.2 An Example
Consider a process where there is only one phase in , in other words, there is no Markov modulation. Consequently, the PDE (26) has the drift component only:
[TABLE]
Suppose we would like to develop a DG approximation for the density function over an interval . Consider the four meshes
[TABLE]
We choose the following basis functions:
[TABLE]
as depicted in Figure 1.
Then, we can verify that the matrices and are given by
[TABLE]
Assume that . Then, the non-zero upper-diagonal blocks are given by
[TABLE]
with all other sub-blocks of being identically zero. Thus,
[TABLE]
where
[TABLE]
Consequently, the DG generator is given by
[TABLE]
where note that all the row sums are zero.
Now, to illustrate the approximation of the operator matrix , suppose the phase process has a generator matrix . Let the DG approximation for this fluid-fluid process have the same stencil as described in Figure 1.
When , we assume has a positive fluid rate, , when , and a negative rate, , when . Thus, and When , we assume the opposite: and . Thus, and
- Case 1.
When : Suppose , and . Then,
[TABLE] 2. Case 2.
When and : suppose , and . Then,
[TABLE]
If we are to pre-multiply with a row vector, we can see the matrix picks up the value of the right-most basis function with support in , and then distributes this to the first two basis functions with support in . This concurs with our physical interpretation (see Section 2.1) that there is a drift from to . 3. Case 3.
When and : Suppose and . Then,
[TABLE]
4 Numerical Experiments
To illustrate the validity of our discontinuous Galerkin approximation, we perform numerical experiments on a stochastic fluid-fluid model, in a three-pronged approach.
First, we run Monte Carlo simulations, in order to compare the simulated joint density of evaluated at the time first returns to the initial level [math] against that which is obtained via the return-probability matrix . This numerically verifies the accuracy of our proposed approximation for the operator matrix . Second, using we evaluate the limiting joint density of , which we compare against the same density analytically derived in [6]. Third, we vary the parameters of the second fluid to confirm that the approximating joint density for does not change, while the marginal limiting distribution for does, both of which are consistent with our intuitive understanding of the chosen example. In all three procedures, we find the approximations to be accurate.
We also analyse different choices for the level of spatial discretisation and the degree of polynomial basis functions, with respect to the order of convergence in relevant error terms.
4.1 An on-off bandwith-sharing model
The example we choose for our experiments is as follows. Consider a stochastic fluid-fluid where and represent the workloads in Buffer 1 and Buffer 2 at time , both driven by the phase which is a Markov chain on the state space . Here, the state indicates inputs to both buffers being on, the state indicates both being off, the state is when only the first input is on, and the state is when only the second is on. The input of Buffer is switched from on to off with rate , and from off to on with rate , for . Thus, the infinitesimal generator for is given by
[TABLE]
We denote by the input rate of Buffer during an on period, and by the output rate, and assume that , for . This example imitates the model considered in [6], where the two buffers share a fixed total output capacity .
In particular, the output rate is constantly , except for when the buffer is empty, that is, when . On the other hand, varies depending on the values of the buffers. More specifically, as Buffer 1 is considered high-priority, this buffer is allocated the entire output capacity whenever exceeding a pre-determined threshold , leaving and . However, when Buffer 1 is empty, Buffer 2 is given the whole output capacity , meaning . For , and , for such that . Clearly, if Buffer is empty, its output rate would be zero. Table 1 summarizes the output rates for all different scenarios.
Given its dynamics as currently defined based on [6], Buffer 1 is an example of what is known in the literature as a level-dependent fluid, because its net rates depend on the value of relative to the threshold . As the existing theoretical analysis for stochastic fluid-fluid processes (developed in [2] and briefly summarized in Section 2) does not allow for level dependency in the first buffer, here we modify the bandwith-sharing model in [6] slightly. We let the output rate of Buffer 1 remain for , effectively eliminating the threshold effect on the first buffer but keeping the effect on the second. Table 2 represents the modified rates.
Consequently, the net rates of change for , , are given by
[TABLE]
and the net rates of change for , , are as follows
[TABLE]
For our numerical experiments, we use the parameter choices given in [6]:
[TABLE]
As mentioned previously, while the true problem has an unbounded domain , the discontinuous Garlekin method requires the domain of approximation to be a finite interval. Hence, for all approximations we consider a finite interval but large enough so that the boundary-induced dynamics do not significantly affect the results.
To specify the stencil for our numerical approximation, we define a vector \bm{\omega}_{{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}K},h,\Delta_{h}} of nodal points as
[TABLE]
for and for , both sufficiently small. In this stencil, there are meshes, of which are interior meshes of length , the left and right boundary meshes are of length , and the second-to-left and second-to-right ones are of length . The boundary meshes always have piecewise-constant approximations, because this is sufficient to approximate the point masses accumulated at either boundary.
4.2 The return-probability matrix via Monte Carlo simulations
We choose our initial distribution to be a point mass of for , and zero everywhere else. The choice of an initial point being a point mass instead of a non-degenerate distribution is purely for the convenience of numerical simulation, because it eliminates the need of simulating multiple initial starting points. Using this initial condition and the parameters specified in (57, 58), we simulate trajectories, terminating each path either when Buffer returns to zero or at time , Overall, 2.3% of the trajectories did not reach zero in buffer by time , and so are rejected.
Note that as we assume positive recurrence for our system, the probability of Buffer 2 returning to its initial level zero is . However, the time this takes might be longer than the time window of our simulation, , which means the trajectories that we have terminated at time must eventually return to zero with probability .
Let be the first time returns to zero. For each retained simulated trajectory, we record the values of and . For the states (the first input being on, the second being off) and (off-off), we present in Figure 2 the cumulative distributions of , , with , as determined by the simulations as well as by a piecewise linear DG approximation constructed from the approximating matrix of operator . In this piecewise linear DG approximation, we consider the approximation interval [0,\mathcal{I}]={\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}[0,16]}, and the parameters of the stencil \bm{\omega}_{{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}K},h,\Delta_{h}}, defined in (59), take the following values: and . The boundary meshes each have a constant basis function of value ; each th interior mesh \mathcal{D}_{k}:={\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}[x_{k},x_{k+1}]} has two piecewise linear basis functions
[TABLE]
for and {\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}k=2,\ldots,K-2}. As can be seen in Figure 2, these two distributions are closely matched.
4.3 The marginal density of
Since Buffer 1, , is independent of Buffer 2, , we can use results from the existing literature on stochastic fluid flows to obtain the marginal limiting density of :
[TABLE]
On the other hand, we can use the operator to compute, based on Theorem 2.3, the joint limiting density , where
[TABLE]
Thus, we can approximate the joint limiting density , via a discontinuous Galerkin approximation of , and consequently form an approximation of the marginal limiting density . In particular, we evaluate an approximation of , and then integrate the approximating function over the domain of , and finally differentiate with respect to ; note that
[TABLE]
Let two vectors and denote respectively the piecewise constant and piecewise linear DG approximations of , obtained via the corresponding approximations . We use \bm{\omega}_{{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}43},0.4,0.001} as our stencil for the DG approximation and the nodal points at which we evaluate the analytical density function . Define
[TABLE]
We present in Figure 3 the analytical density at given nodal points, a piecewise constant DG approximation, and a piecewise linear DG approximation.
The approximations reconstruct the general shape of the density reasonably well: for both piecewise constant and piecewise linear, we see most of the probability being concentrated in the point mass at , and then decaying as increases. We observe from the subplot inside the left plot of Figure 3 that the piecewise constant approximation underestimates the point mass, and redistributes the difference over the rest of the state space of . Hence, there is more mass in the tails of the densities. On the other hand, the piecewise linear approximation appears to be very close to the analytical solution.
4.4 Sensitivity analysis of the dynamics of
To further confirm that the discontinuous Galerkin approximation of the operator accurately captures the dynamics of , we vary the rates at which the input to this buffer switches on and off (denoted by and , respectively). As we modify these rates, we should see a change in the distribution of probability between
[TABLE]
the limiting marginal density of when , and
[TABLE]
that of when . On the other hand, the sum of these two densities should be identical in all the different meaningful scenarios of and ; that is, the sum should remain fixed and be equal to the sum , for all .
To that end, we keep our stencil and basis functions fixed, and compute the marginal limiting density for different values of and . The results coincide with what we expect from the dynamics in .
From Table 3, we observe that as (the rate at which the input for switches off) increases, so does the probability of being empty. Furthermore, even though there are different amounts of probabilities in the two marginal densities, and , their sum remains the same as the sum of the marginal limiting densities and for all calculated values of (data not shown here). These numerical results indicate that the dynamics of are captured well by the DG approximations.
4.5 Errors of approximation
In a discontinuous Galerkin approximation, we choose the smoothness of the basis functions and the level of spatial discretisation. Once a particular selection is made, we project our operators into a finite-dimensional linear operator space corresponding to these choices (see Section 3). It has been shown that operators such as (defined in Section 2.1) under a DG approximation have an error which converges at the order of where is the discretisation and is the degree of the basis [9].
However, this result cannot be easily translated across to the operator The DG approximation of the operator is constructed by taking the DG approximation of the operators and then solving the Riccati equation (52) using the approximate operators. Further, we then use this approximation of to derive an approximation for the limiting density . With such a construction, it is not trivial to determine how the error propagates through the process of solving the Riccati equation, and then through further calculations to determine . Determining bounds for the approximation errors of and , as functions of the discretisation and basis selection, is a topic for future research.
As a preliminary step in this direction, we empirically investigate how the approximation error of the marginal limiting density of the fluid (see Section 4.3) changes with respect to the choices of basis functions and the levels of discretisation. We begin by introducing our normed vector space in which we compare the different levels of discretisation and smoothness. Recall, from (59), that the left boundary mesh of our approximations is of length , and all but two interior meshes are of length , with two compensating meshes of width . Let be the interval on which we approximate our solution, then both the approximations and the analytical solution belong to the space , where is the set of functions with countably many discontinuities. We choose the right boundary mesh to be a piecewise-constant function. Then, for any function , where for , we define the star seminorm as follows:
[TABLE]
Essentially, the star seminorm is an extension of the norm which incorporates our interpretation: that the left and right boundary meshes are treated as point masses. That is, we only study the total mass in the intervals and and not the distribution over them.
We conduct two numerical experiments. The first is to understand the effects of choosing piecewise-linear basis functions over piecewise-constant, the second experiment is to understand the effects of treating a point mass as a density on a short interval.
In the first experiment, we choose and . We then consider the error between the approximation and the reference solution in the star seminorm. The notations and denote, respectively, a piecewise-linear approximation with stencil and a piecewise-constant approximation with stencil .
In the left plot in Figure 4, we observe that the piecewise-constant approximation has an error that scales approximately with respective to the mesh size , and the piecewise linear approximation has an error that scales approximately . We observe that the coarsest piecewise-linear basis approximation has an error similar to that of the finest piecewise-constant approximation, which is two orders of magnitude finer.
In the second experiment, we fix , and the basis functions to be piecewise-linear, and then we scale and observe the trend in the star seminorm. The approximation error will plateau past the length where the error caused by and dominate over the error gains of reducing the width of the boundary mesh. Hence, we subtract from this approximation error the approximation error of a finer approximation, . The right plot in Figure 4 shows that the error scales approximately .
The numerical experiments above indicate that an increase in the degree of the basis functions would result in an increase in the order of convergence of the error in the star seminorm. However, this experiment is not taking into consideration that by increasing the degree of basis functions we need to increase the number of basis (elements) in the mesh. For example, the piecewise-constant has one basis element in each mesh, while the piecewise-linear has two. With respect to storage, the overall number of elements in the entire stencil is increasing linearly with the order of the elements.
In Table 4, we give some computational statistics of the approximations used in Figure 4 to aid in the understanding of the trade-offs between memory, computation time, mesh size, and error. We observe for a mesh size that the piecewise-constant approximation uses roughly half the number of elements and is twice as fast, compared to the piecewise linear approximation. However, the piecewise-linear approximation is two hundred times more accurate than the piecewise-constant approximation.
In our particular examples, if we were given a restriction to use no more than a prescribed amount of elements, then choosing a larger mesh size with piecewise-linear elements would be a better strategy than choosing a smaller mesh size and using piecewise-constant elements. The generalisation of these observations and the exploration of higher order basis functions is the focus for future research.
5 Conclusions
Finite Differences and Finite Volume methods have been used in the past to approximate operators that arise in stochastic processes. In principle, these methods approximate the operators by higher dimensional linear operators. In these methods, intuitive notions of mass conservation and positivity are captured; however, regularity is lost, making highly regular probability distributions computationally intensive.
We proposed the application of the discontinuous Galerkin method to approximate stochastic operators, with the intent that its ability to incorporate local regularity and maintain mass conservation, will lead to more accurate approximations and a reduction in computational effort. To demonstrate this, we applied the discontinuous Galerkin method to approximate all the operators needed to construct the joint stationary distribution of a stochastic fluid-fluid process.
The numerical results showed that the approximation of the stationary distribution arising from DG approximations of the operators is accurate and effective. We also verified that the operators and their dynamics were captured accurately. Furthermore, in our example, we observed that adding more regularity in the basis functions led to a significant decrease in computational effort. The DG method also enabled us to obtain other performance measures of stochastic fluid-fluid processes that are also analytically presented by operators.
Future work includes determining error bounds for the approximations of the operator as well as of the stationary distribution in general, and a more thorough investigation of the computational effort as higher-order basis functions are used.
Acknowledgements
The authors acknowledge the financial support of the Australian Research Council (ARC) through the Discovery Grants DP110101663 and DP180103106. Bean, Nguyen, and O’Reilly also acknowledge the support of ACEMS (ARC Centre of Excellence for Mathematical and Statistical Frontiers).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Miyazawa, B. Zwart, Wiener-Hopf factorizations for a multidimensional Markov additive process and their applications to reflected processes, Stochastic Systems 2 (2012) 67–114.
- 2[2] N. G. Bean, M. M. O’Reilly, The stochastic fluid-fluid model: A stochastic fluid model driven by an uncountable-state process, which is a stochastic fluid itself, Stochastic Processes and their Applications 124 (2014) 1741–1772.
- 3[3] M. Neuts, Introduction to Matrix Analytic Methods in Stochastic Modeling, The John Hopkins University Press, 1981.
- 4[4] G. Latouche, V. Ramaswami, Introduction to matrix analytic methods in stochastic modeling, ASA-SIAM Series on Statistics and Applied Probability, SIAM, Philadelphia PA, 1999.
- 5[5] B. Cockburn, Discontinous Garlekin methods for convection-dominated problem, in: Higher-Order Methods for Computational Physics, Vol. 9 of Lecture Notes in Computational Science and Engineering, Springer Verlag, 1999.
- 6[6] G. Latouche, G. T. Nguyen, Z. Palmowski, Two-dimensional fluid queues with temporary assistance, Vol. 27 of Springer Proceedings in Mathematics & Statistics, Springer Science, New York, NY, 2013, Ch. 9, pp. 187–207.
- 7[7] B. Cockburn, Discontinous galerkin methods, Z. Angew. Math. Mech. 83 (2003) 731–754.
- 8[8] N. G. Bean, M. M. O’Reilly, P. G. Taylor, Algorithms for the Laplace-Stieltjes transforms of first return times for stochastic fluid flows, Methodology and Computing in Applied Probability 10 (2009) 381–408.
