Reachability in Augmented Interval Markov Chains
Ventsislav Chonev

TL;DR
This paper introduces augmented interval Markov chains (AIMCs), a flexible model for stochastic systems with uncertain and dependent transition probabilities, and analyzes the computational complexity of reachability problems within this framework.
Contribution
It extends interval Markov chains by allowing dependent transition uncertainties and provides complexity bounds for various reachability problems in AIMCs.
Findings
Exact reachability is at least as hard as the square-root sum problem.
Approximate reachability is in NP when the graph is known.
Qualitative subproblems are NP-complete with unknown graph structure.
Abstract
In this paper we propose augmented interval Markov chains (AIMCs): a generalisation of the familiar interval Markov chains (IMCs) where uncertain transition probabilities are in addition allowed to depend on one another. This new model preserves the flexibility afforded by IMCs for describing stochastic systems where the parameters are unclear, for example due to measurement error, but also allows us to specify transitions with probabilities known to be identical, thereby lending further expressivity. The focus of this paper is reachability in AIMCs. We study the qualitative, exact quantitative and approximate reachability problem, as well as natural subproblems thereof, and establish several upper and lower bounds for their complexity. We prove the exact reachability problem is at least as hard as the famous square-root sum problem, but, encouragingly, the approximate version lies in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Reachability in
Augmented Interval Markov Chains
Ventsislav Chonev
IST Austria
Abstract
In this paper we propose augmented interval Markov chains (AIMCs): a generalisation of the familiar interval Markov chains (IMCs) where uncertain transition probabilities are in addition allowed to depend on one another. This new model preserves the flexibility afforded by IMCs for describing stochastic systems where the parameters are unclear, for example due to measurement error, but also allows us to specify transitions with probabilities known to be identical, thereby lending further expressivity.
The focus of this paper is reachability in AIMCs. We study the qualitative, exact quantitative and approximate reachability problem, as well as natural subproblems thereof, and establish several upper and lower bounds for their complexity. We prove the exact reachability problem is at least as hard as the famous square-root sum problem, but, encouragingly, the approximate version lies in if the underlying graph is known, whilst the restriction of the exact problem to a constant number of uncertain edges is in . Finally, we show that uncertainty in the graph structure affects complexity by proving -completeness for the qualitative subproblem, in contrast with an easily-obtained upper bound of for the same subproblem with known graph structure.
I Introduction
Discrete-time Markov chains are a well-known stochastic model, one which has been used extensively to reason about software systems [CY95, HJ94, RKNP04]. They comprise a finite set of states and a set of transitions labelled with probabilities in such a way that the outgoing transitions from each state form a distribution. They are useful for modelling systems with inherently probabilistic behaviour, as well as for abstracting complexity away from deterministic ones. Thus, it is a long-standing interest of the verification community to develop logics for describing properties concerning realiability of software systems and to devise verification algorithms for these properties on Markov chains and their related generalisations, such as Markov decision processes [Bel57, Put14].
One well-known such generalisation is motivated by how the assumption of precise knowledge of a Markov chain’s transition relation often fails to hold. Indeed, a real-world system’s dynamics are rarely known exactly, due to incomplete information or measurement error. The need to model this uncertainty and to reason about robustness under perturbations in stochastic systems naturally gives rise to interval Markov chains (IMCs). In this model, uncertain transition probabilities are constrained to intervals, with two different semantic interpretations. Under the once-and-for-all interpretation, the given interval Markov chain is seen as representing an uncountably infinite collection of Markov chains refining it, and the goal is to determine whether some (or alternatively, all) refinements satisfy a given property. In contrast, the at-every-step interpretation exhibits a more game-theoretic flavour by allowing a choice over the outgoing transition probabilities prior to every move. The goal is then to determine strategies which optimise the probability of some property being satisfied. Originally introduced in [JL91], interval Markov chains have recently elicited considerable attention: see for example references [SVA06], [CHS08] and [BLW13], which study the complexity of model checking branching- and linear-time properties, as well as [DLL*+*11], where the focus is on consistency and refinement.
While IMCs are very natural for modelling uncertainty in stochastic dynamics, they lack the expressivity necessary to capture dependencies between transition probabilities arising out of domain-specific knowledge of the underlying real-world system. Such a dependency could state for example that, although the probabilities of some set of transitions are only known to lie within a given interval, they are all identical. Disregarding this information and studying only a dependence-free IMC is impractical, as allowing these transitions to vary independently of one another results in a vastly over-approximated space of possible behaviours.
Therefore, in the present paper we propose augmented interval Markov chains (AIMCs), a generalisation of IMCs which allows for dependencies of this type to be described. We study the effect of this added expressivity through the prism of the (existentially quantified) reachability problem under the once-and-for-all interpretation. Our results are the following. First, we show that the full problem is hard for both the famous square-root sum problem (Theorem 6) and for the class (Theorem 3). The former hardness is present even when the underlying graph structure is known and acyclic, whilst the latter arises even in the qualitative subproblem when transition intervals are allowed to include zero, rendering the structure uncertain. Second, assuming known structure, we show the approximate reachability problem to be in (Theorem 11). Third, we show that the restriction of the reachability problem to a constant number of uncertain (i.e. interval-valued) transitions is in (Theorem 4).
II Preliminaries
II-A Markov chains
A discrete-time Markov chain or simply Markov chain (MC) is a tuple which consists of a finite set of vertices or states and a one-step transition function such that for all , we have . For the purposes of specifying Markov chains as inputs to decision problems, we will assume is given by a square matrix of rational numbers. The transition function gives rise to a probability measure on in the usual way. We denote the probability of reaching a vertex starting from a vertex in by . The structure of is its underlying directed graph, with vertex set and edge set . Two Markov chains with the same vertex set are said to be structurally equivalent if their edge sets are identical.
An interval Markov chain (IMC) generalises the notion of a Markov chain. Formally, it is a pair comprising a vertex set and a transition function from to the set of intervals contained in . For the purposes of representing an input IMC, we will assume that each transition is given by a lower and an upper bound, together with two boolean flags indicating the strictness of the inequalities. A Markov chain is said to refine and interval Markov chain with the same vertex set if for all . We denote by the set of Markov chains which refine . An IMC’s structure is said to be known if all elements of are structurally equivalent. Moreover, if there exists some such that for all and all , either or , then the IMC’s structure is -known. An IMC can have known structure but not -known structure for example by having an edge labelled with an open interval whose lower bound is [math].
An augmented interval Markov chain (AIMC) generalises the notion of an IMC further by equipping it with pairs of edges whose transition probabilities are required to be identical. Formally, an AIMC is a tuple , where is an IMC and is a set of edge equality constraints. A Markov chain is said to refine an AIMC if it refines the IMC and for each , we have . We extend the notation to AIMCs for the set of Markov chains refining .
The reachability problem for AIMCs is the problem of deciding, given an AIMC , an initial vertex , a target vertex , a threshold and a relation , whether there exists such that \mathbb{P}^{M}(\mbox{s\twoheadrightarrow t})\sim\tau. The qualitative subproblem is the restriction of the reachability problem to inputs where .
Finally, in the approximate reachability problem, we are given a (small) rational number and a reachability problem instance. If is , our procedure is required to accept if there exists some refining Markov chain with reachability probability greater than , it is required to reject if all refining Markov chains have reachability probability less than , and otherwise it is allowed to do anything. Similarly if is . Intuitively, this is a promise problem: in the given instance the optimal reachability probability is guaranteed to be outside the interval .
II-B First-order theory of the reals
We denote by the first-order language . Atomic formulas in this language are of the form and for a polynomial with integer coefficients. We denote by the first-order theory of the reals, that is, the set of all valid sentences in the language . Let be the existential first-order theory of the reals, that is, the set of all valid sentences in the existential fragment of . A celebrated result [Tar51] is that admits quantifier elimination: each formula in is equivalent to some effectively computable formula which uses no quantifiers. This immediately entails the decidability of . Tarski’s original result had non-elementary complexity, but improvements followed, culminating in the detailed analysis of [Ren92]:
Theorem 1**.**
* is complete for .* 2. 2.
* is decidable in .* 3. 3.
If is a fixed constant and we consider only existential sentences where the number of variables is bounded above by , then validity is decidable in .
We denote by the class, introduced in [SŠ11], which lies between and and comprises all problems reducible in polynomial time to the problem of deciding membership in .
II-C Square-root sum problem
The square-root sum problem is the decision problem where, given , one must determine whether . Originally posed in [O’R81], this problem arises naturally in computational geometry and other contexts involving Euclidean distance. Its exact complexity is open. Membership in is straightforward via a reduction to the existential theory of the reals. Later this was sharpened in [ABKPM09] to , the complexity class whose complete problem is deciding whether a division-free arithmetic circuit represents a positive number. This class was introduced and bounded above by the fourth level of the counting hierarchy in the same paper. However, containment of the square-root sum problem in is a long-standing open question, originally posed in [GGJ76], and the only obstacle to proving membership in for the exact Euclidean travelling salesman problem. This highlights a difference between the familiar integer model of computation and the Blum-Shub-Smale Real RAM model [BSS89], under which the square-root sum is decidable in polynomial time [Tiw92]. See also [EY09] for more background.
III Qualitative case
In this section, we will focus on the qualitative reachability problem for AIMCs. We show that, whilst membership in is straightforward when the underlying graph is known, uncertainty in the structure renders the qualitative problem -complete.
Theorem 2**.**
The qualitative reachability problem for AIMCs with known structure is in .
Proof.
Let the given AIMC be and the initial and target vertices, respectively. Since the structure of is known, the qualitative reachability problem can be solved simply using standard graph analysis techniques on . More precisely, for any , if and only if there is no path in which starts in , does not enter and ends in a bottom strongly connected component which does not contain . Similarly, if and only if there is no path from to in . ∎
Theorem 3**.**
The qualitative reachability problem for AIMCs is -complete.
Proof.
Membership in is straightforward. The equivalence classes of under structure equivalence are at most , where is the number of vertices, since for each pair of vertices, either an edge is present in the structure or not. This upper bound is exponential in the size of the input. Thus, we can guess the structure of the Markov chain in nondeterministic polynomial time and then proceed to solve an instance of the qualitative reachability problem on an AIMC with known structure in polynomial time by Theorem 2.
We now proceed to show -hardness using a reduction from 3-SAT. Suppose we are given a propositional formula in 3-CNF:
[TABLE]
where each clause is a disjunction of three literals:
[TABLE]
Let the variables in be .
Let be the following AIMC, also depicted in Figure 1. The vertex set has vertices:
[TABLE]
that is, one vertex for each possible literal over the given variables, one vertex for each clause, two special sink vertices (success and failure) and auxiliary vertices. Through a slight abuse of notation, we use to refer both to the literals over the variable and to their corresponding vertices in , and similarly, denotes both the clause in the formula and its corresponding vertex.
The transitions are the following. For all , we have:
[TABLE]
For all and , we have:
[TABLE]
For all ,
[TABLE]
Finally, . For all other pairs of vertices , we have .
The edge equality constraints are:
[TABLE]
Intuitively, the sequence of ‘diamonds’ comprised by and the vertices corresponding to literals is a variable setting gadget. Choosing transition probabilities , and hence necessarily , corresponds to setting to true, whereas and corresponds to setting to false. On the other hand, the branching from into and the edges from clauses to their literals makes up the assignment testing gadget. Assigning non-zero probability to the edge corresponds to selecting the literal as witness that the clause is satisfied.
Formally, we claim that there exists a Markov chain such that if and only if is satisfiable.
Suppose first that is satisfiable and choose some satisfying assignment . Let be the refining Markov chain which assigns the following transition probabilities to the interval-valued edges of . First, let
[TABLE]
for all . Second, for each clause , choose some literal which is true under and set and consequently for the other literals . Now we can observe that the structure of has two bottom strongly-connected components, namely and , and moreover, is unreachable from . Therefore, .
Conversely, suppose there exists some such that . We will prove that has a satisfying assignment. For each , write
[TABLE]
Notice that
[TABLE]
so we can conclude , otherwise , a contradiction. If , then
[TABLE]
whereas if , then
[TABLE]
Either way, we must have to ensure . Unrolling this argument further shows for all . In particular, there is exactly one path from to and it has probability . Let be the truth assignment , we show that satisfies . Indeed, if some clause is unsatisfied under , then its three literals are all unsatisfied, so for all . Moreover, for at least one of these three literals, say , we will have , so the path will have non-zero probability:
[TABLE]
which contradicts . Therefore, satisfies , which completes the proof of -hardness and of the Theorem.
∎
IV Constant number of uncertain edges
We now shift our attention to the subproblem of AIMC reachability which arises when the number of interval-valued transitions is fixed, that is, bounded above by some absolute constant. Our result is the following.
Theorem 4**.**
Fix a constant . The restriction of the reachability problem for AIMCs to inputs with at most interval-valued transitions lies in . Hence, the approximate reachability problem under the same restriction is also in .
Proof.
Let be the given AIMC and suppose we wish to decide whether there exists such that . Let be the set of vertices which have at least one interval-valued outgoing transition, together with and :
[TABLE]
Notice that . Write , so that is a partition of .
Let be a vector of variables, one for each interval-valued transition of . For vertices , let denote the corresponding variable in if the transition is interval-valued, and the only element of the singleton set otherwise. Let be the following propositional formula over the variables which captures the set of ‘sensible’ assignments:
[TABLE]
There is clearly a bijection between and assignments of which satisfy .
For vertices , use the notation to denote the event ‘ is reached from along a path consisting only of vertices in , with the possible exception of the endpoints ’. Notice that for all and , is independent of the choice of . Denote these probabilities by . They satisfy the system
[TABLE]
which is linear and therefore easy to solve with Gaussian elimination. Thus, assume that we have computed for all and .
Similarly, for all , write for the probability of . Notice that is a polynomial of degree at most over the variables , given by
[TABLE]
Thus, assume we have computed symbolically for all .
Finally, for each , let be a variable and write for the vector of variables in some order. Consider the following formula in the existential first-order language of the real field:
[TABLE]
where
[TABLE]
and is as above. Intuitively, states that the variables descibe a Markov chain in , states that gives the reachability probabilities from to , and states that the reachability probability from to meets the required threshold . The problem instance is positive if and only if is a valid sentence in the existential theory of the reals, which is decidable. Moreover, the formula uses exactly variables, so by Theorem 1, the problem is decidable in polynomial time, as required. ∎
Notice that removing the assumption of a constant number of interval-valued transitions only degrades the complexity upper bound, but not the described reduction to the problem of checking membership in . As an immediate corollary, we have:
Theorem 5**.**
The reachability problem and the approximate reachability problem for AIMCs are in .
Note that Theorem 5 can be shown much more easily, without the need to consider separately -vertices and -vertices as in the proof of Theorem 4. It is sufficient to use one variable per interval-valued transition to capture its transition probability as above and one variable per vertex to express its reachability probability to the target. Then write down an existentially quantified formula with the the usual system of equations for reachability in a Markov chain obtained by conditioning on the first step from each vertex. While this easily gives the upper bound, it uses at least variables, so it is insufficient for showing membership in for the restriction to a constant number of interval-valued transitions.
V Hardness for square-root sum problem
In this section, we give a lower bound for the AIMC reachability problem. This bound remains in place even when the structure of the AIMC is -known and acyclic, except for the self-loops on two sink vertices.
Theorem 6**.**
The AIMC reachability problem is hard for the square-root sum problem, even when the structure of the AIMC is -known and is acyclic, except for the self-loops on two sink vertices.
Proof.
The reduction is based on the gadget depicted in Figure 2. It is an AIMC with two sinks, and (success and failure), each with a self-loop with probability , and vertices: . The structure is acyclic and comprises four chains leading to , namely, , , and . From each vertex other than and there is also a transition to .
The probabilities are as follows. The transition has probability , whilst , , have probability , for rationals to be specified later. Consequently, the remaining outgoing transition to out of each has probability or . The transitions for all have probability . Finally, the transitions , , , , , and are interval-valued and must all have equal probability in any refining Markov chain. Assign the variable to the probability of these transitions. The interval to which these transition probabilities are restricted (i.e. the range of ) is to be specified later. Consequently, the remaining transitions ,,,, ,, are also interval-valued, with probability .
Let be a positive integer large enough to ensure
[TABLE]
Then choose a positive integer large enough, so that
[TABLE]
Now, a straightforward calculation shows
[TABLE]
Analysing the derivative of this cubic, we see that increases on , has its maximum at and then decreases on . This maximum is
[TABLE]
Thus, if we choose some closed interval which contains but not [math] and to be the range of , then the gadget described thus far will have -known structure and maximum reachability probability from to given by scaled by a constant and offset by another constant.
Now, suppose we wish to decide whether for given positive integers and . Construct a gadget as above for each . The constants are shared across the gadgets, as are the sinks , but each gadget has its own constant in place of , and its own copy of each non-sink vertex. The edge equality constraints are the same as above within each gadget, and there are no equality constraints across gadgets. Assign a variable to those edges in the -th gadget which in the description above were labelled , and choose a range for as described above for . Finally, add a new initial vertex , with equiprobable outgoing transitions to the -vertices of the gadgets.
In this AIMC, the probability of is given by the multivariate polynomial
[TABLE]
whose maximum value on is
[TABLE]
Therefore, if and only if there exists a refining Markov chain of this AIMC with
[TABLE]
so the reduction is complete. ∎
Remark 7*.*
It is easy to see that if we are given an acyclic AIMC with the interval-valued edges labelled with variables, the reachability probabilities from all vertices to a single target vertex are multivariate polynomials and can be computed symbollically with a backwards breadth-first search from the target. Then optimising reachability probabilities reduces to optimising the value of a polynomial over given ranges for its variables.
It is interesting to observe that a reduction holds in the other direction as well. Suppose we wish to decide whether there exist values of such that for a given multivariate polynomial , intervals and . Notice that can easily be written in the form , where , are constants such that , each is a non-empty product of terms drawn from , and is a (possibly negative) constant term. For example, the monomial has a negative coefficient, so rewrite it as . Do this to all monomials with a negative coefficient, then choose an appropriately large to obtain the desired form.
Now it is easy to construct an AIMC with two sinks and a designated initial vertex where the probability of is . We use a chain to represent each , and then branch from into the first vertices of the chains with distribution given by the . There exist values of the in their appropriate intervals such that if and only if there exists a refining Markov chain such that .
VI Approximate case
In this section, we focus on the approximate reachability problem for AIMCs. To obtain our upper bound, we will use a result from [Cha12].
Definition 8**.**
If and are Markov chains with the same vertex set, then their absolute distance is
[TABLE]
Lemma 9**.**
(Appears in [Cha12].) Let and be structurally equivalent Markov chains, where and for all , we have either or . Let also and fix two vertices . Then
[TABLE]
We will also need the following well-known inequality:
Lemma 10**.**
For all and , we have
[TABLE]
Now we proceed to prove our upper bound.
Theorem 11**.**
The approximate reachability problem for AIMCs with -known structure is in .
Proof.
Let be the given AIMC and let be a lower bound on all non-zero transitions across all . Suppose we are solving the maximisation version of the problem: we are given vertices and a rational , we must accept if for some and we must reject if for all .
Let be the number of vertices and let
[TABLE]
For each interval-valued transition, split its interval into at most intervals of length at most each. For example, partitions into , where is the largest natural number such that . Call the endpoints defining these subintervals grid points. Let be the set of Markov chains refining such that the probabilities of all interval-valued transitions are chosen from among the grid points. Observe that for all , there exists such that .
Our algorithm showing membership in will be the following. We will choose nondeterministically and compute using Gaussian elimination. Then if , we will accept, and otherwise we will reject.
To complete the proof, we need to argue two points. First, that is at most exponentially large in the size of the input, so that can indeed be guessed in nondeterministic polynomial time. Second, that if for all we have , then it is safe to reject, that is, there is no with . (Note that the procedure is obviously correct when it accepts.)
To the first point, we apply Lemma 10 with and :
[TABLE]
and hence,
[TABLE]
This upper bound is a polynomial in , and , and hence at most exponential in the length of the input data. Therefore, for each interval-valued transition, we can write down using only polynomially many bits which grid point we wish to use for the probability of that transition. Since the number of transitions is polynomial in the length of the input, it follows that an element of may be specified using only polynomially many bits, as required.
To the second point, consider such that . Then by Lemma 9, we have
[TABLE]
In other words, changing the transition probabilities by at most does not alter the reachability probability from to by more than . However, recall that we chose in such a way that for all , there is some with . In particular, if for all , then certainly for all , so it is safe to reject. This completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[ABKPM 09] Eric Allender, Peter Bürgisser, Johan Kjeldgaard-Pedersen, and Peter Bro Miltersen. On the complexity of numerical analysis. SIAM Journal on Computing , 38(5):1987–2006, 2009.
- 2[Bel 57] Richard Bellman. A Markovian decision process. Technical report, DTIC Document, 1957.
- 3[BLW 13] Michael Benedikt, Rastislav Lenhardt, and James Worrell. LTL model checking of interval Markov chains. In Tools and Algorithms for the Construction and Analysis of Systems (TACAS) , pages 32–46. Springer, 2013.
- 4[BSS 89] Lenore Blum, Mike Shub, and Steve Smale. On a theory of computation and complexity over the real numbers: 𝑁𝑃 𝑁𝑃 \mathit{NP} -completeness, recursive functions and universal machines. Bulletin (New Series) of the American Mathematical Society , 21(1):1–46, 1989.
- 5[Cha 12] Krishnendu Chatterjee. Robustness of structurally equivalent concurrent parity games. In Proceedings of the 15th International Conference on Foundations of Software Science and Computational Structures , FOSSACS’12, pages 270–285, Berlin, Heidelberg, 2012. Springer-Verlag.
- 6[CHS 08] Krishnendu Chatterjee, Tom Henzinger, and Koushik Sen. Model-checking omega-regular properties of interval Markov chains. In Roberto M. Amadio, editor, Foundations of Software Science and Computation Structure (Fo S Sa CS) , pages 302–317, March 2008.
- 7[CY 95] Costas Courcoubetis and Mihalis Yannakakis. The complexity of probabilistic verification. Journal of the ACM (JACM) , 42(4):857–907, 1995.
- 8[DLL + 11] Benoît Delahaye, Kim G. Larsen, Axel Legay, Mikkel L. Pedersen, and Andrzej Wąsowski. Decision problems for interval Markov chains. In International Conference on Language and Automata Theory and Applications , pages 274–285. Springer, 2011.
