A new matrix-infinite-product-form solution for upper block-Hessenberg Markov chains and its quasi-algorithmic constructibility
Hiroyuki Masuyama

TL;DR
This paper introduces a novel matrix-infinite-product-form solution for upper block-Hessenberg Markov chains that does not require parameter sets or conditions, and features a quasi-algorithmic construction method.
Contribution
It proposes a new MIP-form solution that simplifies computation and removes the need for certain parameter conditions, with a quasi-algorithmic constructibility feature.
Findings
The new MIP-form solution requires no parameter set or convergence conditions.
It is constructed via an iterative recursive procedure with finite complexity per iteration.
The solution is applicable to stationary distribution computation in UBH-MCs.
Abstract
This paper presents a new matrix-infinite-product-form (MIP-form) solution for the stationary distribution in upper block-Hessenberg Markov chains (UBH-MCs). The existing MIP-form solution (Masuyama, Queueing Syst., Vol. 92, 2019, pp. 173--200) requires a certain parameter set that satisfies both a Foster-Lyapunov drift condition and a convergence condition. In contrast, the new MIP-form solution requires no such parameter sets and no other conditions. The new MIP-form solution also has "quasi-algorithmic constructibility", which is a newly introduced feature of being constructed by iterating infinitely many times a recursive procedure of finite complexity per iteration. This feature is not found in the other solutions for the stationary distribution in UBH-MCs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Reliability and Maintenance Optimization · Railway Systems and Energy Efficiency
A new matrix-infinite-product-form solution for upper block-Hessenberg Markov chains and its quasi-algorithmic constructibility111To appear in Advances in Applied Probability, vol. 55, no. 1, March 2023
Hiroyuki Masuyama222E-mail: [email protected]
Graduate School of Management, Tokyo Metropolitan University, Tokyo 192–0364, Japan.
Abstract
[TABLE]
[TABLE]
1 Introduction
This paper studies the stationary distribution in continuous-time upper block-Hessenberg Markov chains (UBH-MCs). This study focuses on establishing a theoretical procedure for constructing the exact stationary distribution in an algorithmic way, rather than establishing a practical procedure for computing an approximate stationary distribution (of course, such practicality is a significant issue, too). Additionally, this study would be a partial answer to the question: “What is a class of Markov chains whose exact stationary distributions are theoretically constructible by an algorithmic procedure?”
The class of UBH-MCs is characterized by the upper block-Hessenberg (UBH) structure of the infinitesimal generator matrix (called generator for short), and this class includes M/G/1-type Markov chains and level-dependent quasi-birth-and-death processes (LD-QBDs). Remarkably, even a multi-dimensional random walk on the nonnegative lattice is expressed as a UBH-MC such that its level variable is the sum of coordinate components and its phase variable is the set of all the coordinate components but one. Thus, the class of UBH-MCs is a powerful tool for modeling information and communication systems, inventory systems, transportation systems, etc. Moreover, the time-average performance of these systems is evaluated through the stationary distribution. That is why this study focuses on the stationary distribution in UBH-MCs.
We now introduce the definition of UBH-MCs in order to describe the background and purpose of this study. Let denote a countable set such that
[TABLE]
where and . Let denote an essential -matrix (see Definition B.1), and thus
[TABLE]
Assume that is of UBH form:
[TABLE]
where is called level and an element of is called phase (of level ). A Markov chain having this -matrix as its generator is said to be an upper block-Hessenberg Markov chain (UBH-MC). Note that a UBH-MC may be called a level-dependent M/G/1-type Markov chain. Note also that if for all and then the generator is of block-tridiagonal form and thus the UBH-MC is reduced to an LD-QBD.
We provide a basic assumption and some definitions associated with the stationary distribution vector of the UBH generator given in (1.1). Assume that is ergodic (i.e., irreducible and positive recurrent) throughout the paper, unless otherwise stated. Let denote the unique and positive stationary distribution vector of the ergodic generator (see, e.g., [1, Chapter 5, Theorems 4.4 and 4.5]). By definition,
[TABLE]
where denotes a column vector of ones with an appropriate dimension. For later reference, let for , which leads to the level-wise partition of :
[TABLE]
Furthermore, let , , denote
[TABLE]
which is referred to as the (finite-level) conditional stationary distribution vector. For any sufficiently large , can be considered an approximation to .
The main purpose of this paper is to present a quasi-algorithmically constructible solution for the stationary distribution vector of the ergodic UBH generator given in (1.1). We here provide our notion (not strict definition) of quasi-algorithmic constructions for the stationary distribution (the stationary distribution vector) in ergodic countable-state Markov chains, which are not, of course, restricted to UBH-MCs.
Notion 1.1
(Quasi-algorithmic constructions for the stationary distribution in countable-state Markov chains.) Consider an ergodic countable-state Markov chain with generator (essential -matrix) . Let denote the stationary distribution of the ergodic Markov chain. Let denote a procedure for sequentially generating tentative solutions for the stationary distribution . The procedure is said to be a quasi-algorithmic construction of and furthermore is said to be quasi-algorithmic constructible (or have quasi-algorithmic constructibility), if has the following features:
- (i)
The procedure generates each tentative solution by using at most a finite (not necessarily bounded) number of elements of together with (if required) some or all the components of the previous tentative solutions , where for . 2. (ii)
It takes at most finite complexity for to generate each tentative solution . 3. (iii)
The sequence converges to in some mathematical sense (e.g., in the -distance).
Remark 1.2
To the best of our knowledge, there is no universal formal definition of “algorithm”. Indeed, the term of “algorithm” is widely used with some ambiguity in various contexts. Nevertheless, it would be generally accepted, as an informal definition, that an “algorithm” is a finite set of operations (of finite computational complexity) to accomplish a particular task. Considering this situation, we introduce the notion of quasi-algorithmic constructions in order to distinguish our new solution presented in this paper from the existing ones for the stationary distribution in UBH-MCs.
Remark 1.3
Any quasi-algorithmic construction can be implemented as an “algorithm” if the construction is equipped with an appropriate stopping criterion so that it stops after finitely many iterations, and if it is possible to store the elements required to compute each tentative solution.
As far as we know, there have been no studies that achieve a quasi-algorithmic construction of the stationary distribution vector in UBH-MCs including LD-QBDs. There are, however, many related studies on the computation of the stationary distribution vector in these Markov chains.
Bright and Taylor [5] presented a nice matrix-product form of the stationary distribution vector of the LD-QBD generator:
[TABLE]
The set of matrices is the minimal nonnegative solution for the system of matrix equations:
[TABLE]
The matrix-product form (1.7) appears easy to compute. Thus, based on it, Bright and Taylor [5] proposed a procedure for approximately computing the stationary distribution vector of the LD-QBD generator. Besides, some researchers use the matrix-product form (1.7) as the foundation of their respective algorithms (see, e.g., [3, 17]).
However, the matrix-product form (1.7) does not lead to any quasi-algorithmic construction of the stationary distribution vector in LD-QBDs. Equation (1.8) yields
[TABLE]
This recursive formula shows that each can be computed provided that is given. Therefore, to implement the algorithms based on (1.7), we have to truncate the infinite sequence at a sufficiently large by letting . Moreover, owing to this implementation, computing better approximations (to the stationary distribution) requires setting -matrices of higher levels to be zero matrices, which implies that all the components of such an approximation are computed anew from scratch.
Several researchers have studied the approximate computation of the stationary distribution in UBH-MCs. Takine [20] presented an algorithm for the conditional stationary distribution vector under some additional conditions, which are removed by Kimura and Takine [6]. As mentioned in Section 1 of [20], the conditional stationary distribution vector can be a good approximation to the stationary distribution vector for all sufficiently large . Klimenok and Dudin [10], Li et al. [13], and Shin and Pearce [19] proposed respective algorithms for approximately computing the stationary distribution vector in UBH-MCs by making transition rates (or transition probabilities) eventually level independent.
In summary, all the existing algorithms mentioned above are originally designed for approximately computing the stationary distribution vector in UBH-MCs (including LD-QBDs). Thus, although those algorithms fulfill practical purposes, they do not make any theoretical contribution to the quasi-algorithmic construction of the stationary distribution vector .
Unlike those previous studies, Masuyama [16] has taken the first step to the quasi-algorithmic construction of in UBH-MCs. To describe the numerical procedure proposed in [16] (and to facilitate later discussion), we introduce the notation: A finite dimensional matrix (resp. vector) is extended (if necessary) to an infinite dimensional matrix (resp. vector) by appending zeros to it with its original elements fixed in their original positions. This notation enables us to define addition, subtraction, and product over vectors and matrices with different sizes. Furthermore, for , the notation “” denotes a product operator of matrices (including scalars) such that
[TABLE]
Masuyama [16] proposed a sequential update algorithm (Algorithm 1 therein) that generates a matrix-infinite-product form (MIP-form) solution for :
[TABLE]
where (i) ; (ii) each , is computed in a finite number of operations starting from ; and (iii) is a probability vector that is an optimal solution for a certain linear fractional programing (LFP) problem, Problem 2.3 in Section 2.3. For the reader’s convenience, we summarize the sequential update algorithm [16, Algorithm 1] in Algorithm 1 in Section 4.
The existing MIP-form solution (1.9) (and thus the sequential update algorithm in [16]) requires Conditions 1 and 2 below (see [16, Conditions 1 and 2]).
Condition 1** **(Foster-Lyapunov drift condition)
There exist a constant , a finite set , and a positive column vector such that and
[TABLE]
where , , denotes a column vector given by
[TABLE]
The space of parameter sets satisfying (1.10) is denoted by , for convenience.
Condition 2** **(Convergence condition for the sequential update algorithm in [16])
[TABLE]
where, for any matrix (resp. vector ), the symbol (resp. ) denotes the matrix (resp. vector) obtained by taking the absolute value of each element of (resp. ).
Conditions 1 and 2 diminish the usefulness of the MIP-form solution (1.9). A parameter set of Condition 1 is required to describe the objective function of Problem 2.3 with optimal solution . In fact, Condition 1 holds if and only if the irreducible generator is ergodic (see [11, Theorem 1.1]). However, even though we know that is ergodic, it may not be easy to find in some cases. In addition, Condition 2 is required to prove the convergence of the MIP-form solution (1.9) (see [16, Theorem 3.2]), though this condition does not necessarily hold for all UBH-MCs (such an example is presented in Appendix A).
We note that the existing MIP-form solution (1.9) is not quasi-algorithmically constructible. For each , the MIP-form solution (1.9) requires an optimal solution for Problem 2.3, and its objective function includes the infinite sums:
[TABLE]
where is a subvector of . Clearly, the infinite sum has infinite computational complexity, in general. Therefore, the existing MIP-form solution (1.9) is not quasi-algorithmically constructible, and computing it requires the truncation of the infinite sums (1.11), which causes truncation error.
This paper presents a new MIP-form solution for quasi-algorithmically constructible:
[TABLE]
where is a free parameter, and where , , is a probability vector that is an optimal solution for the LFP problem (Problem 3.1 in Section 3) different from Problem 2.3 of the existing MIP-form solution (1.9). The objective function of Problem 3.1 does not include such a parameter set as and is computed in a finite number of operations without any infinite sum like (1.11). Thus, the new MIP-form solution (1.12) is quasi-algorithmically constructible. In addition, the new solution holds without any additional conditions, such as Condition 2 for convergence, except for the ergodicity of .
The rest of this paper is divided into five sections. Section 2 provides preliminary results including the existing MIP-form solution presented in [16]. Section 3 presents our new MIP-form solution together with its theoretical foundations, which is the main result of this paper. Section 4 considers the advantages of our MIP-form solution over the existing one. Section 5 discusses special cases where our solution can be established more effectively. Finally, Section 6 contains concluding remarks.
2 The existing MIP-form solution
This section reviews the existing MIP-form solution (1.9) to facilitate describing our new MIP-form solution (established in Section 3) and understanding its advantages over the existing one. First, we introduce the foundation of the existing MIP-form solution, the last-block-column linearly augmented (LBC-LA) truncation approximation to the stationary distribution vector . We then provide a matrix-product form of the LBC-LA truncation approximation. Finally, as a certain limit of the matrix-product form, we express the existing MIP-form solution (1.9) for .
For later reference, we define the -distance for vectors of different dimensions. Let be a countable (possibly infinite) set. Let and be real-valued vectors, where and are subsets of . For vectors and , let
[TABLE]
which is referred to as the -distance of the difference between and .
2.1 Definition of the LBC-LA truncation approximation
We begin with some definitions to describe the LBC-LA truncation approximation. Let for and for convenience. For any set , let denote the cardinality of . For any , let denote a submatrix of consisting of the first rows and columns, that is,
[TABLE]
Furthermore, let , , denote a probability vector that has its probability masses on the last block (corresponding to ), that is,
[TABLE]
where is a probability vector.
We define the LBC-LA truncation approximation to as a stationary distribution vector of the finite essential -matrix given by
[TABLE]
The -matrix is referred to as the last-block-column linearly augmented (LBC-LA) truncation of generator . Clearly, the LBC-LA truncation has at least one stationary distribution vector. One of them is given by
[TABLE]
This stationary distribution vector is referred to as the LBC-LA truncation approximation to , and is partitioned level-wise:
[TABLE]
2.2 A matrix-product form of the LBC-LA truncation approximation
This subsection presents a matrix-product form of the LBC-LA truncation approximation . We first introduce some vectors and matrices needed to express the matrix-product form of . We then present the matrix-product form together with recursive formulas for the components of it. Finally, we mention a connection of the matrix-product form to the existing MIP-form solution (1.9) for .
We define the component matrices of the matrix-product form of . Let , , denote an matrix such that its -th element represents the expected total sojourn time in state before the first visit to (i.e., to any state in levels ) starting from state . The matrices , are determined recursively:
[TABLE]
and, for ,
[TABLE]
Furthermore, let , , denote
[TABLE]
It then follows from (2.3) that
[TABLE]
For simplicity of notation, we define , as
[TABLE]
Using (2.6), we rewrite (2.4) as
[TABLE]
Finally, , , is defined as the row sum vector of the matrix :
[TABLE]
where follows from [16, Remark 2.3].
Remark 2.1
The probabilistic interpretation of leads to that of : Its -th element represents the expected total sojourn time in state before the first visit to starting from state . This interpretation implies that the matrix is identified with the rows in level of because this inverse matrix consists of the expected total sojourn times in the respective states in before the first visit to starting from the states in . We thus have (see [12, Theorem 5.5.1] and [16, Remark 2.2])
[TABLE]
where the second equality is due to (2.6) and (2.7).
We are now ready to present the matrix-product form of .
Proposition 2.2** **([16, Equation (2.21) and Lemma 2.2])
For and ,
[TABLE]
The components and of the matrix-product form (2.10) are the following recursive formulas (see [16, Equations (3.23) and (3.24)]):
[TABLE]
and
[TABLE]
where is given in (2.5).
Proposition 2.2 implies that an MIP-form solution for can be obtained as a limit of the matrix-product form (2.10), that is,
[TABLE]
However, as mentioned in [16, Section 2.3], achieving such a limit convergent to requires choosing the probability vectors , appropriately. To do this, Masuyama [16] formulated a series of certain linear fractional programing (LFP) problems. The details are given in the next subsection.
2.3 The existing MIP-form solution for the stationary distribution vector
This subsection provides a brief summary of the existing MIP-form solution (1.9) for . We first show the LFP problem whose optimal solution is the key component of the existing MIP-form solution. We then present the proposition that summarizes the theoretical results on the existing MIP-form solution.
To obtain an MIP-form solution for , Masuyama [16] formulates the following LFP problem indexed with :
Problem 2.3
[TABLE]
where is given by
[TABLE]
Remark 2.4
Equation (2.8) implies that for any feasible solution . Therefore, the objective function of Problem 2.3 is well-defined.
Remark 2.5
If the UBH generator satisfies for ( is an LD-QBD generator), then (2.14) is reduced to
[TABLE]
Proposition 2.6 below shows how to find an optimal solution for Problem 2.3 and how to construct, with such a solution, the existing MIP-form solution (1.9), originally presented in [16].
Proposition 2.6** **([16, Theorems 3.1 and 3.2, and Corollary 3.1])
If Condition 1 is satisfied, then the following hold.
- (i)
For , let denote a probability vector such that
[TABLE]
where
[TABLE]
The probability vector is an optimal solution for Problem 2.3. 2. (ii)
For , let denote a probability vector such that
[TABLE]
where denotes the -th row of the matrix in the parentheses. Furthermore, if a parameter set of Condition 1 satisfies Condition 2, then
[TABLE]
and
[TABLE]
Remark 2.7
The symbols , , and correspond to , and , respectively, in the original notation of [16].
Remark 2.8
Constructing the MIP-form solution (2.17) (and thus (1.9)) requires the sequence . Each is an optimal solution for Problem 2.3 with index . Therefore, we only have to solve a single optimization problem for each .
3 A new MIP-form solution
This section presents a new MIP-form solution for . To construct the new MIP-form solution, we first formulate an LFP problem different from Problem 2.3. We then provide a way to find its optimal solution. Finally, we show the new MIP-form solution for .
We introduce some definitions to describe our LFP problem for the new MIP-form solution. Let for any integer . We then define , , as
[TABLE]
where the second equality is due to (2.7). We also define , , as
[TABLE]
where denotes the -th element of the (row or column) vector in the parentheses. Since the generator is irreducible, for at least one and thus . Furthermore, since and ,
[TABLE]
which is used later.
The following is our LFP problem, which is formulated for each .
Problem 3.1
[TABLE]
Remark 3.2
The objective function of Problem 3.1 is well-defined, as with Problem 2.3 (see Remark 2.4).
An optimal solution for Problem 3.1 is given by Lemma 3.3 below.
Lemma 3.3
For , fix such that
[TABLE]
and let denote a unit row vector such that
[TABLE]
The vector is then an optimal solution for Problem 3.1, and the optimal (and thus maximum) value is positive.
Proof.
First, we prove that is an optimal solution for Problem 3.1. It follows from (3.10) and (3.11) that
[TABLE]
which leads to for all . Thus, for any feasible solution satisfying (3.7)–(3.9), we have
[TABLE]
Combining this result with (3.6) and (3.12) yields
[TABLE]
which shows that is an optimal solution for Problem 3.1.
Next, we prove by contradiction. To this end, suppose that , or equivalently, that for any satisfying (3.7)–(3.9). We then have
[TABLE]
Therefore, using (3.5), (3.1), and (2.9), we obtain
[TABLE]
which contradicts . The proof has been completed. ∎
The next theorem shows that the optimal solution for Problem 3.1 leads to an MIP-form solution for , which is different from the existing one (2.17) based on Problem 2.3.
Theorem 3.4
For each , let , where , , is a vector obtained by replacing in (2.10) with the optimal solution for Problem 3.1, i.e.,
[TABLE]
where defined in (3.10). We then have
[TABLE]
and therefore
[TABLE]
Remark 3.5
We only have to solve a single optimization problem (Problem 3.1) for each to construct the new MIP-form solution (3.14), as with the existing one (2.17) (see Remark 2.8).
Proof of Theorem 3.4. According to Theorem B.10, it suffices to show that
[TABLE]
To do this, we define as
[TABLE]
Equations (3.5) and (3.16) imply that
[TABLE]
and thus is a feasible solution for Problem 3.1. Recall here that maximizes the objective function (see Lemma 3.3). Therefore, it follows from (3.6), (3.1), and (3.13) that
[TABLE]
It also follows from (3.16), (3.1), and (2.8) that
[TABLE]
where the second equality is due to (2.9). Combining (3.17) and (3.18) leads to
[TABLE]
Consequently, (3.15) holds. The proof has been completed.
Theorem 3.4 shows that the new MIP-form solution (3.14) is obtained by finding the optimal solution of Problem 3.1, that is, by finding an element in . This task can be lightened by the following theorem.
Theorem 3.6
The set is given by
[TABLE]
where is defined as
[TABLE]
Proof.
Lemma 3.3 states that, for any , . This implies that does not include such that . It thus suffices to show that
[TABLE]
It follows from (3.1) and (2.11d) that
[TABLE]
It also follows from (3.20) that
[TABLE]
These two equations show that (3.21) holds, which completes the proof. ∎
4 Advantages of the new MIP-form solution
This section explains the advantages of our new MIP-form solution (3.14) over the existing one (2.17). We first point out the drawbacks of the existing MIP-form solution. We then show that our new solution (3.14) is quasi-algorithmically constructible (see Notion 1.1) unlike the existing one.
The existing MIP-form solution (2.17) has three drawbacks, compared with our new one. First, the existing solution (2.17) requires a parameter set of Condition 1 such that satisfies Condition 2 (see Proposition 2.6). In some cases, Condition 2 does not hold, and such an unfavorable case is provided in Appendix A. Second, finding a parameter set may not be easy. Finally, it is of infinite computational complexity to obtain the optimal solution for Problem 2.3. This is because the objective function of Problem 2.3 includes the infinite sums , in (2.14).
Owing to these drawbacks, especially, to the last one, the existing MIP-form solution (2.17) for is not quasi-algorithmically constructible. Nevertheless, constructing via the existing solution (2.17) is formally expressed in an algorithm style equipped with a stopping criterion, as the sequential update algorithm [16, Algorithm 1]. This is described in Algorithm 1 below.
Remark 4.1
In the original description of Algorithm 1, the following operation is inserted as the first step: “Find , , and such that Conditions 1 and 2 hold” (see [16, Algorithm 1]). However, in general, this cannot be performed algorithmically. Rather, , , and are input parameters of the algorithm, as described in Algorithm 1 above.
Remark 4.2
In general, we have to truncate the infinite sums , in order to compute (see Remark 2.5 for an exceptional case).
In contrast, our new MIP-form solution (3.14) is quasi-algorithmically constructible. The new solution uses the sequence of probability vectors , and each of them is an optimal solution for Problem 3.1. This problem is formulated without finding such a parameter set satisfying Condition 2. Moreover, the objective function of Problem 3.1 consists of only and of finite computational complexity. Indeed, the finite computational complexity of is guaranteed by the recursion (2.12), and that of is guaranteed by the following recursion:
[TABLE]
which follows from (3.1) and (3.22). Therefore, it is of finite computational complexity to obtain the optimal solution for Problem 3.1. In summary, constructing our new MIP-form solution (3.14) requires , , and , which are computed by iterating the recursive procedure consisting of an increasing but finite number of operations per iteration. Consequently, our new solution (3.14) for is quasi-algorithmically constructible.
Shown below is an algorithm-style description of computing our new MIP-form solution (3.14) with a stopping criterion. This is the sequential update algorithm for our new solution.
Remark 4.3
To compute the probability vector , Step (3.d.i) performs finding an element of . Finding an element of is equivalent to solving Problem 3.1, and it requires constructing the sets and . This construction can be easily performed based on the probabilistic interpretation: The set consists of the phases of which accept direct (incoming) transitions from ; the set consists of the phases of which are the starting points of (outgoing) paths leaving eventually and reaching while avoiding . Note that the set is obtained by identifying the positive rows of , which is a crucial component of the recursion (2.11d) of .
Remark 4.4
The choice of finite set can impact on the convergence speed of Algorithm 2. However, it would be difficult to discuss theoretically an optimal set . A reasonable choice of would be .
Remark 4.5
A single run of Algorithm 2 generates the probability vector with being equal to the value fixed on completion of all the operations. Recall that is an LBC-LA truncation approximation to the stationary distribution vector . Thus, , can be considered an approximation to the moment vector of the stationary distribution, though, as the order is larger, a smaller would need to be chosen (as the parameter of the stopping criterion) in order to achieve a satisfactory accuracy of the approximation.
Finally, in the two subsequent paragraphs, we mention the advantages of Algorithm 2 based on our new MIP-form solution (3.14), comparing with the existing algorithms [6, 10, 13, 19, 20] for UBH-MCs. These existing algorithms are classified into two types: (i) approximation by conditional stationary distribution [6, 20]; (ii) approximation by level-homogenizing [10, 13, 19]. Both types are originally designed to compute a single approximation to the stationary distribution, but performing them iteratively can generate a convergent sequence of approximations to the stationary distribution. Considering such applications of the existing algorithms, we compare them with our algorithm in the following.
The algorithms [6, 20] compute a single conditional stationary distribution , as an approximation to , in a single run. To generate each approximation , the algorithms [6, 20] have to perform a large (theoretically infinite) amount of preliminary computations involving the levels above . The preliminary numerical results can be used to construct better approximations with , but to do this actually with satisfactory accuracy requires further preliminary computations involving higher levels. In contrast, Algorithm 2 computes each tentative solution without any information on the higher levels above except .
The other algorithms [10, 13, 19] conduct level-homogenizing the transition law above some level, say level , in order to compute an approximation to , denoted by . More specifically, the approximation requires, as the input for computing itself, a (level-independent) -matrix obtained by level-homogenizing the transition law above level . Thus, each approximation is computed from scratch, which is time-consuming. In contrast, Algorithm 2 sequentially constructs tentative solutions convergent to the stationary distribution vector , making the most of the components of the “old” tentative solutions.
5 Special cases free from solving Problem 3.1
This section discusses special cases where an MIP-form solution for is constructed without solving Problem 3.1. We first show a basic theorem on this matter, and then derive its corollary that implies what cases are free from solving Problem 3.1. Finally, we mention representative examples of such favorable cases.
The following theorem provides the basis for our discussion here.
Theorem 5.1
Let be a nonnegative integer, and let , denote a probability vector such that
[TABLE]
where is a sequence such that . Furthermore, for , let denote a probability vector such that
[TABLE]
We then have
[TABLE]
Proof.
Fix arbitrarily, and note that (5.3) holds if
[TABLE]
Indeed, the triangle inequality and (1.3) yield
[TABLE]
and then, combining this and (5.4) results in (5.3).
To complete the proof, we show that (5.4) holds. Equation (5.1) implies that
[TABLE]
Applying (5.5) to (5.2) yields
[TABLE]
Furthermore, substituting (2.9) into the above inequality leads to
[TABLE]
and thus
[TABLE]
which shows that (5.4) holds. The proof has been completed. ∎
Remark 5.2
Substituting
[TABLE]
into the matrix product form (2.10) of the LBC-LA truncation approximation , and using (2.6), we obtain . Therefore, is an LBC-LA truncation approximation to .
Theorem 5.1 implies in what cases we do not have to solve Problem 3.1. However, Theorem 5.1 requires us to identify to some extent, which is not easy in general. We thus provide a more effective (but more restrictive) result that can actually contribute to the construction of MIP-form solutions without solving Problem 3.1.
Corollary 5.3
Suppose that there exists some such that for all . Furthermore, there exists some probability vector such that
[TABLE]
We then have
[TABLE]
Proof.
According to (5.6) and Theorem 5.1, it suffices to show that (5.1) holds with for all . Equation (5.6) implies that there exists some such that and
[TABLE]
This inequality is equivalent to
[TABLE]
We now fix
[TABLE]
We then have and
[TABLE]
Thus, (5.7) yields
[TABLE]
which shows that (5.1) holds with for all . ∎
A typical example covered by Corollary 5.3 is a UBH-MC with block monotonicity (see [15, Definition 3.2]). If the generator is block monotone, then, for all , is constant and thus is interpreted as the generator of the underlying Markov chain on the phase set , which controls the level process (see [15, Lemma 3.1]). In this case, is equal to the stationary distribution vector of the underlying Markov chain.
Another example is the (level-independent) M/G/1-type Markov chain. For this chain, several sufficient conditions are established for the tail asymptotics of (see [7, 8, 9, 14]) such that, for some , , and random variable on ,
[TABLE]
Moreover, (assuming additional conditions, if required), we can obtain (see, e.g., [8, Sections 4 and 5])
[TABLE]
which implies that (5.6) holds.
6 Concluding Remarks
We have established a quasi-algorithmic construction of the exact (not approximate) and whole (not partial) stationary distribution vector in upper block-Hessenberg Markov chains (UBH-MCs). The core of this theoretical construction is that all we have to do is to solve just one linear fractional programing (LFP) problem (Problem 3.1) in each iteration for the construction. We have also presented some special cases free from solving the LFP problem. To find other such special cases is an interesting future task. A possible case would be the class of UBH-MCs with asymptotically block-Toeplitz structure. The generator of such a chain satisfies (1.1) and the following: for all sufficiently large and
[TABLE]
where is an essential -matrix. In [10], this special UBH-MC is referred to as a multi-dimensional asymptotically quasi-Toeplitz Markov chain.
Appendix A An example violating Condition 2
This section presents an example of UBH -matrices for which the existing MIP-form solution in [16] does not hold. To describe that example, we assume that is the generator of a specific M/G/1-type Markov chain with power-like level increments. For such a generator , we show that Condition 2 does not hold for any parameter set of Condition 1 and therefore the existing MIP-form solution is not established.
Let , and suppose that is an M/G/1-type generator given by
[TABLE]
where , , is an matrix. We then assume the following.
Assumption A.1
(i) The generator in (A.6) is irreducible; (ii) is an essential -matrix with stationary distribution vector ; (iii) ; and (iv) for some nonnegative matrix .
We confirm that there exists some of Condition 1 under Assumption A.1. Uniformizing the generator (see, e.g., [2, Problem II.4.1]) yields an M/G/1 type stochastic matrix that satisfies the stability condition [2, Chapter XI, Proposition 3.1] of GI/G/1-type Markov chains (including M/G/1-type ones) due to the conditions (i)–(iii) of Assumption A.1. Therefore, Condition 1 holds for the generator satisfying Assumption A.1 (see, e.g., [11, Theorem 1.1]).
We confirm that Condition 2 does not hold for any of Condition 1 under Assumption A.1. Fix arbitrarily. Since is a finite subset of the state space , there exists some such that . Thus, fix as such, and let denote an M/G/1-type Markov chain on state space with the generator satisfying Assumption A.1. It then follows from [11, Theorem 1.1] that there exists some such that
[TABLE]
where and . It also follows from and the M/G/1-type structure of that there exists some such that
[TABLE]
Combining (A.7) and (A.8), we have
[TABLE]
Furthermore, from [8, Theorem 4.1.1] and Assumption A.1 (especially, the condition (iv)), we have
[TABLE]
for some constant . Using (A.6), (A.9), and (A.10), we obtain
[TABLE]
Therefore, Condition 2 does not hold for any in the present example.
Appendix B Convergent approximations to the stationary distribution vector of an essential -matrix
The purpose of this section is to provide the fundamental results needed in the proof of the main theorem (Theorem 3.4). This section consists of two subsections. Section B.1 describes several notions associated with essential -matrices. Section B.2 presents a necessary and sufficient condition for the convergence of a sequence of approximations to the stationary distribution vector of an essential -matrix.
B.1 Definition of basic notions
This subsection introduces the notions used in the next subsection. We first redefine the symbols and introduced in the body of this paper. We next define several notions, such as essential -matrices, subinvariant measures, irreducibility, and recurrence. We then present a proposition on the recurrence and subinvariant measures of an irreducible essential -matrix.
Let denote an arbitrary countable set, and let denote an arbitrary -matrix (see, e.g., [1, page 64]), that is, a diagonally dominant matrix such that
[TABLE]
The following is the definition of essential -matrices.
Definition B.1
The -matrix is said to be essential if and only if the following hold for all : (i) is finite; (ii) ; (iii) . Note that any essential -matrix can be interpreted as the infinitesimal generator of a continuous-time Markov chain. Thus, an essential -matrix may be referred to as an infinitesimal generator or generator, depending on the context.
Remark B.2
The conditions (i) and (ii), respectively, imply that is stable and conservative (see, e.g., [4, Definition 13.3.10]).
Remark B.3
An essential -matrix is not necessarily regular (or equivalently, non-explosive). Indeed, provided that is an essential -matrix, is regular if and only if, for any , the system of equations
[TABLE]
has no bounded solution other than (see, e.g., [1, Chapter 2, Theorem 2.7] and [4, Theorem 13.3.11]).
To proceed further, we assume that is an essential -matrix, and then define as a Markov chain on having this essential as its generator. For later use, we also define as a stochastic matrix such that
[TABLE]
where is a diagonal matrix whose diagonal elements are identical to those of . Note that is interpreted as the transition probability matrix of an embedded discrete-time Markov chain for with generator (see [4, Section 13.3.2]). Hence, we call the embedded transition probability matrix of .
For the essential -matrix , we define subinvariant measures and stationary distribution vectors.
Definition B.4
Let denote an arbitrary nonnegative and nonzero vector.
- (i)
If , is said to be a subinvariant measure of or the Markov chain . Furthermore, if a subinvariant measure satisfies , is said to be invariant or stationary. 2. (ii)
If an invariant measure satisfies , is said to be a stationary distribution vector of or the Markov chain .
Finally, we provide the definition of irreducibility, recurrence, and their related notions, and present a proposition that summarizes basic results on the recurrence and subinvariant measures of an irreducible essential -matrix.
Definition B.5
- (i)
An essential -matrix and its Markov chain are irreducible (resp. transient, recurrent) if and only if the embedded transition probability matrix in (B.1) is irreducible (resp. transient, recurrent) (see [4, Definitions 13.4.1 and 13.4.2]). 2. (ii)
An irreducible essential -matrix and its Markov chain are ergodic (i.e., positive recurrent) if and only if there exists a stationary distribution vector, or equivalently, a summable invariant measure unique up to constant multiples (see [4, Definition 13.4.8 and Theorem 13.4.10]).
Proposition B.6
Suppose that is an irreducible essential -matrix. The following statements hold:
- (i)
There always exists a subinvariant measure of . 2. (ii)
Any subinvariant measure of is positive, that is, its elements are all positive. 3. (iii)
If is recurrent, then it has an invariant measure, which is unique up to constant multiples. 4. (iv)
Any subinvariant measure of recurrent is, in fact, an invariant measure. 5. (v)
The matrix has a subinvariant measure that is not invariant if and only if it is transient.
Proof.
Let be a nonnegative and nonzero vector, and let
[TABLE]
It then follows from (B.1) and (B.2) that
[TABLE]
Thus, is a subinvariant (resp. an invariant) measure of the embedded transition probability matrix if and only if (resp. ) or equivalently, is a subinvariant (resp. an invariant) measure of the -matrix (see [18, Definition 5.3]). Furthermore, since the -matrix is essential (that is, for all ),
[TABLE]
Therefore, the present proposition is an immediate consequence of [18, Theorem 5.4] (together with its corollary) and [18, Lemmas 5.5 and 5.6]. That implication is as follows:
[TABLE]
The proof has been completed. ∎
B.2 Convergence condition of approximations
This subsection proves a single theorem (Theorem B.10), which provides a necessary and sufficient condition for the convergence of a sequence of approximations to the stationary distribution of an essential -matrix. That necessary and sufficient condition is crucial to the proof of Theorem 3.4, the main theorem of this paper.
We begin with the following assumption.
Assumption B.7
- (i)
The essential -matrix is ergodic with the unique stationary distribution vector (the uniqueness and positivity of is due to Proposition B.6). 2. (ii)
For all , is a stable and conservative -matrix (see Remark B.2) such that for all . 3. (iii)
Each has at least one stationary distribution vector, denoted by , which can be considered an approximation to .
Remark B.8
The vector in Assumption B.7 is redefined here and thus is completely different from “” used as the subvector of in the body of this paper.
Under Assumption B.7, we have the next lemma, which is used to prove the theorem of this subsection.
Lemma B.9
Suppose that Assumption B.7 holds. Fix arbitrarily, and let denote an infinite subset of such that converges to some , that is,
[TABLE]
Under these conditions, we have the statements (i) and (ii) below:
- (i)
It holds that
[TABLE] 2. (ii)
Furthermore, if , then
[TABLE]
where, for any vector , denotes a vector such that
[TABLE]
Proof.
We begin with the proof of the statement (i). To prove (B.4) and (B.5), it suffices to show that, for all infinite ,
[TABLE]
Indeed, for any , there exists some infinite such that
[TABLE]
Note that and (any set is a subset of itself). Therefore, if (B.7) holds for all infinite , then
[TABLE]
Combining (B.8) and (B.9) yields
[TABLE]
which shows that (B.4) and (B.5) hold.
To complete the proof of (B.4) and (B.5), we prove that (B.7) holds for all infinite . Let be an arbitrary infinite subset of , and let
[TABLE]
From (B.3) and (B.10), we then have
[TABLE]
Furthermore, using (B.10), (due to Assumption B.7 (ii)), , and Fatou’s lemma, we obtain
[TABLE]
which leads to
[TABLE]
It thus follows from Proposition B.6 (iii) and (iv) that if then is a unique (up to constant multiples) invariant measure of the ergodic -matrix . Therefore, unifying both cases that and , we can see that there exists some such that
[TABLE]
Substituting (B.12) into (B.11) yields
[TABLE]
Combining this with (B.10) and (B.12) results in
[TABLE]
In addition, it follows from , Fatou’s lemma, and (B.13) that
[TABLE]
This result and (B.13) imply (B.7). Consequently, (B.4) and (B.5) have been proved.
We move on to the proof of the statement (ii). Suppose , and let be a nonempty and finite . Since is finite, it follows from (B.4) that
[TABLE]
where the positivity of the second limit is due to , , and . The above two equations yield
[TABLE]
which implies that (B.6) holds. The proof has been completed. ∎
The following theorem presents a necessary and sufficient condition that the sequence of approximations converges to the stationary distribution vector .
Theorem B.10
Under Assumption B.7, if and only if
[TABLE]
Proof.
We note that if holds then
[TABLE]
which implies (B.14). Therefore, to complete the proof, we prove that “(B.14) (B.15)”, and then prove that “(B.15) ”.
We prove that “(B.14) (B.15)”. To this end, suppose that (B.14) holds while (B.15) does not hold, that is,
[TABLE]
Let be an arbitrary infinite subset such that
[TABLE]
It then follows from Lemma B.9 (i) that
[TABLE]
It also follows from (B.17) and the finiteness of , and that
[TABLE]
which contradicts (B.14). Therefore, (B.14) implies (B.15).
We prove that “(B.15) ”. For any fixed , there exist some infinite subset such that
[TABLE]
Thus, applying Lemma B.9 (i) to the sequence yields
[TABLE]
Combining this and (B.15) leads to
[TABLE]
Note here that and are probability vectors. Therefore, due to the dominated convergence theorem, (B.18) implies that . It has been proved that “(B.15) ”, which completes the proof. ∎
Remark B.11
Lemma B.9 (i) is the -matrix-version of [21, Lemma 2.1] for stochastic matrices. Furthermore, Lemma B.9 (ii) and Theorem B.10 are the -matrix-versions of Corollary 2.2 (i) and (ii), respectively, in [21].
Acknowledgments
The author thanks Dr. Masakiyo Miyazawa for valuable comments on the presentation of the notion of quasi-algorithmic constructions. The research of the author was supported in part by JSPS KAKENHI Grant Number JP21K11770.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. J. Anderson. Continuous-Time Markov Chains: An Applications-Oriented Approach . Springer, New York, 1991.
- 2[2] S. Asmussen. Applied Probability and Queues . Springer, New York, Second edition, 2003.
- 3[3] H. Baumann and W. Sandmann. Numerical solution of level dependent quasi-birth-and-death processes. Procedia Computer Science , 1(1):1561–1569, 2012.
- 4[4] P. Brémaud. Markov Chains: Gibbs Fields, Monte Carlo Simulation and Queues . Springer, New York, 2nd. edition, 2020.
- 5[5] L. Bright and P. G. Taylor. Calculating the equilibrium distribution in level dependent quasi-birth-and-death processes. Stochastic Models , 11(3):497–525, 1995.
- 6[6] M. Kimura and T. Takine. Computing the conditional stationary distribution in Markov chains of level-dependent M/G/1-type. Stochastic Models , 34(2):207–238, 2018.
- 7[7] T. Kimura, K. Daikoku, H. Masuyama, and Y. Takahashi. Light-tailed asymptotics of stationary tail probability vectors of Markov chains of M/G/1 type. Stochastic Models , 26(4):505–548, 2010.
- 8[8] T. Kimura, H. Masuyama, and Y. Takahashi. Subexponential asymptotics of the stationary distributions of GI/G/1-type Markov chains. Stochastic Models , 29(2):190–239, 2013.
