Large deviations of the long term distribution of a non Markov process

Anatolii A. Puhalskii

arXiv:1812.10163·math.PR·May 16, 2019

Large deviations of the long term distribution of a non Markov process

Anatolii A. Puhalskii

PDF

TL;DR

This paper establishes a large deviation principle for the long-term queue length distribution in ergodic generalized Jackson networks, linking it to the quasipotential and idempotent probability theory.

Contribution

It introduces a novel connection between large deviations, quasipotential, and idempotent distributions in queueing networks.

Findings

01

Long-term queue distribution obeys the Large Deviation Principle.

02

The deviation function is given by the quasipotential.

03

The quasipotential relates to the unique long-term idempotent distribution.

Abstract

We prove that the long term distribution of the queue length process in an ergodic generalised Jackson network obeys the Large Deviation Principle with a deviation function given by the quasipotential. The latter is related to the unique long term idempotent distribution, which is also a stationary idempotent distribution, of the large deviation limit of the queue length processes. The proof draws on developments in queueing network stability and idempotent probability.

Equations94

Q_{k}(t)=Q_{k}(0)+A_{k}(t)+\sum_{l=1}^{K}R_{lk}\bigl{(}D_{l}(t)\bigr{)}-D_{k}(t),

Q_{k}(t)=Q_{k}(0)+A_{k}(t)+\sum_{l=1}^{K}R_{lk}\bigl{(}D_{l}(t)\bigr{)}-D_{k}(t),

D_{k}(t)=S_{k}\bigl{(}B_{k}(t)\bigr{)}

D_{k}(t)=S_{k}\bigl{(}B_{k}(t)\bigr{)}

B_{k} (t) = \int_{0}^{t} 1_{{Q_{k} (u) > 0}} d u

B_{k} (t) = \int_{0}^{t} 1_{{Q_{k} (u) > 0}} d u

ψ_{k}^{A} (α_{k})

ψ_{k}^{A} (α_{k})

ψ_{k}^{S} (δ_{k})

ψ_{k}^{R} (ϱ_{k})

ψ_{J} (α, δ, ϱ) = k = 1 \sum K ψ_{k}^{A} (α_{k}) + k \in J^{c} \sum ψ_{k}^{S} (δ_{k}) + k \in J \sum ψ_{k}^{S} (δ_{k}) 1_{{δ_{k} > μ_{k}}} + k = 1 \sum K δ_{k} ψ_{k}^{R} (ϱ_{k}),

ψ_{J} (α, δ, ϱ) = k = 1 \sum K ψ_{k}^{A} (α_{k}) + k \in J^{c} \sum ψ_{k}^{S} (δ_{k}) + k \in J \sum ψ_{k}^{S} (δ_{k}) 1_{{δ_{k} > μ_{k}}} + k = 1 \sum K δ_{k} ψ_{k}^{R} (ϱ_{k}),

Ψ_{J} (y) = (α, δ, ϱ) \in R_{+}^{K} \times R_{+}^{K} \times S_{+}^{K \times K} : y = α + (ϱ^{T} - I) δ in f ψ_{J} (α, δ, ϱ) .

Ψ_{J} (y) = (α, δ, ϱ) \in R_{+}^{K} \times R_{+}^{K} \times S_{+}^{K \times K} : y = α + (ϱ^{T} - I) δ in f ψ_{J} (α, δ, ϱ) .

L (x, y) = J \subset K \sum 1_{F_{J}} (x) Ψ_{J} (y) .

L (x, y) = J \subset K \sum 1_{F_{J}} (x) Ψ_{J} (y) .

I_{x} (q) = \int_{0}^{\infty} L (q (t), \overset{q}{˙} (t)) d t,

I_{x} (q) = \int_{0}^{\infty} L (q (t), \overset{q}{˙} (t)) d t,

V (x) = t \in R_{+} in f q \in D (R_{+}, R_{+}^{K}) : q (t) = x in f I_{0} (q) .

V (x) = t \in R_{+} in f q \in D (R_{+}, R_{+}^{K}) : q (t) = x in f I_{0} (q) .

μ > (I - P^{T})^{- 1} λ

μ > (I - P^{T})^{- 1} λ

\displaystyle\mathbf{\Pi}^{A}(a)=\prod_{k=1}^{K}\mathbf{\Pi}^{A}_{k}(a_{k})\,,\;\mathbf{\Pi}^{A}_{k}(a_{k})=\exp\Bigl{(}-\int_{0}^{\infty}\psi^{A}_{k}(\dot{a}_{k}(t))\,dt\Bigr{)}\,,

\displaystyle\mathbf{\Pi}^{A}(a)=\prod_{k=1}^{K}\mathbf{\Pi}^{A}_{k}(a_{k})\,,\;\mathbf{\Pi}^{A}_{k}(a_{k})=\exp\Bigl{(}-\int_{0}^{\infty}\psi^{A}_{k}(\dot{a}_{k}(t))\,dt\Bigr{)}\,,

\displaystyle\mathbf{\Pi}^{S}(s)=\prod_{k=1}^{K}\mathbf{\Pi}^{S}_{k}(s_{k})\,,\;\mathbf{\Pi}^{S}_{k}(s_{k})=\exp\Bigl{(}-\int_{0}^{\infty}\psi^{S}_{k}(\dot{s}_{k}(t))\,dt\Bigr{)}\,,

\displaystyle\mathbf{\Pi}^{R}(r)=\prod_{k=1}^{K}\mathbf{\Pi}^{R}_{k}(r_{k})\,,\;\mathbf{\Pi}^{R}_{k}(r_{k})=\exp\Bigl{(}-\int_{0}^{\infty}\psi^{R}_{k}(\dot{r}_{k}(t))\,dt\Bigr{)}\,,

q_{k} (t)

q_{k} (t)

d_{k} (t)

\int_{0}^{t} q_{k} (u) d b_{k} (u)

Π (υ) = x \in R_{+}^{K} sup Π_{x} (υ) \tilde{Π}^{Q_{0}} (x),

Π (υ) = x \in R_{+}^{K} sup Π_{x} (υ) \tilde{Π}^{Q_{0}} (x),

Π^{Q} (q) = x \in R_{+}^{K} sup Π_{x}^{Q} (q) \tilde{Π}^{Q_{0}} (x) .

Π^{Q} (q) = x \in R_{+}^{K} sup Π_{x}^{Q} (q) \tilde{Π}^{Q_{0}} (x) .

q_{k} (u) = q_{k} (0) + a_{k} (u) + l = 1 \sum K r_{l k} (d_{l} (u)) - d_{k} (u) .

q_{k} (u) = q_{k} (0) + a_{k} (u) + l = 1 \sum K r_{l k} (d_{l} (u)) - d_{k} (u) .

Π_{x, t} (y) = q : q (t) = y sup Π_{x}^{Q} (q), Π_{x, t} (Γ) = y \in Γ sup Π_{x, t}^{Q} (y), where Γ \subset R_{+}^{K} .

Π_{x, t} (y) = q : q (t) = y sup Π_{x}^{Q} (q), Π_{x, t} (Γ) = y \in Γ sup Π_{x, t}^{Q} (y), where Γ \subset R_{+}^{K} .

Π_{x, u + v} (y) = z \in R_{+}^{K} sup Π_{x, u} (z) Π_{z, v} (y) .

Π_{x, u + v} (y) = z \in R_{+}^{K} sup Π_{x, u} (z) Π_{z, v} (y) .

\mathbf{\Pi}\bigl{(}\cup_{u\geq T}\{\{a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u]\}\cup\{s(u)\notin[(\mu-\epsilon\mathbf{1})u,(\mu+\epsilon\mathbf{1})u]\}\\ \cup\{r(u)\notin[(P-\epsilon I)u,(P+\epsilon I)u]\}\}\bigr{)}<\kappa\,.

\mathbf{\Pi}\bigl{(}\cup_{u\geq T}\{\{a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u]\}\cup\{s(u)\notin[(\mu-\epsilon\mathbf{1})u,(\mu+\epsilon\mathbf{1})u]\}\\ \cup\{r(u)\notin[(P-\epsilon I)u,(P+\epsilon I)u]\}\}\bigr{)}<\kappa\,.

\mathbf{\Pi}(a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u])=\mathbf{\Pi}^{A}(a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u])\\ \leq\liminf_{n\to\infty}\mathbf{P}\bigl{(}\lvert\frac{A(nu)}{n}-\lambda u\rvert>\epsilon u\bigr{)}^{1/n}\leq\sigma^{u}\,.

\mathbf{\Pi}(a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u])=\mathbf{\Pi}^{A}(a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u])\\ \leq\liminf_{n\to\infty}\mathbf{P}\bigl{(}\lvert\frac{A(nu)}{n}-\lambda u\rvert>\epsilon u\bigr{)}^{1/n}\leq\sigma^{u}\,.

Γ_{κ} = {υ : (λ - ϵ 1) u \leq a (u) \leq (λ + ϵ 1) u, (P - ϵ I) u \leq r (u) \leq (P + ϵ I) u, and (μ - ϵ 1) u \leq s (u) \leq (μ + ϵ 1) u, for u \geq T} .

Γ_{κ} = {υ : (λ - ϵ 1) u \leq a (u) \leq (λ + ϵ 1) u, (P - ϵ I) u \leq r (u) \leq (P + ϵ I) u, and (μ - ϵ 1) u \leq s (u) \leq (μ + ϵ 1) u, for u \geq T} .

q (0) + (λ - ϵ 1) u + (P^{T} - ϵ I) d (u) - d (u) \leq q (u) \leq q (0) + (λ + ϵ 1) u + (P^{T} + ϵ I) d (u) - d (u) .

q (0) + (λ - ϵ 1) u + (P^{T} - ϵ I) d (u) - d (u) \leq q (u) \leq q (0) + (λ + ϵ 1) u + (P^{T} + ϵ I) d (u) - d (u) .

d(u)\geq(I-(P^{T}+\epsilon I))^{-1}(\lambda+\epsilon\mathbf{1})(u-S)=\bigl{(}\nu+\epsilon(I-(P^{T}+\epsilon I))^{-1}(\nu+\mathbf{1})\bigr{)}(u-S)\,,

d(u)\geq(I-(P^{T}+\epsilon I))^{-1}(\lambda+\epsilon\mathbf{1})(u-S)=\bigl{(}\nu+\epsilon(I-(P^{T}+\epsilon I))^{-1}(\nu+\mathbf{1})\bigr{)}(u-S)\,,

ν = (I - P^{T})^{- 1} λ .

ν = (I - P^{T})^{- 1} λ .

d (u) \geq (ν + ρ 1) (u - S) .

d (u) \geq (ν + ρ 1) (u - S) .

1 \cdot (I - P^{T} - ϵ I)^{- 1} q (u) \leq 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + 1 \cdot (I - P^{T} - ϵ I)^{- 1} (λ + ϵ 1) u - 1 \cdot d (u) = 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + 1 \cdot (ν u - d (u)) + ϵ 1 \cdot (I - P^{T})^{- 1} 1 u + ϵ 1 \cdot (I - P^{T} - ϵ I)^{- 1} (I - P^{T})^{- 1} (λ + ϵ 1) u .

1 \cdot (I - P^{T} - ϵ I)^{- 1} q (u) \leq 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + 1 \cdot (I - P^{T} - ϵ I)^{- 1} (λ + ϵ 1) u - 1 \cdot d (u) = 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + 1 \cdot (ν u - d (u)) + ϵ 1 \cdot (I - P^{T})^{- 1} 1 u + ϵ 1 \cdot (I - P^{T} - ϵ I)^{- 1} (I - P^{T})^{- 1} (λ + ϵ 1) u .

1 \cdot (I - P^{T} - ϵ I)^{- 1} q (u) \leq 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + γ S - \overset{ρ}{^} u .

1 \cdot (I - P^{T} - ϵ I)^{- 1} q (u) \leq 1 \cdot (I - P^{T} - ϵ I)^{- 1} q (0) + γ S - \overset{ρ}{^} u .

q_{k} (0) + (λ_{k} - ϵ) u + ((P^{T} - ϵ I) d (u))_{k} - d_{k} (u) \leq 0, for k \in O, u \in [v, v + η] .

q_{k} (0) + (λ_{k} - ϵ) u + ((P^{T} - ϵ I) d (u))_{k} - d_{k} (u) \leq 0, for k \in O, u \in [v, v + η] .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Large deviations of the long term distribution of

a non Markov process

Anatolii A. Puhalskii 111Email: [email protected]

Institute for Problems in Information Transmission

Abstract

We prove that the long term distribution of the queue length process in an ergodic generalised Jackson network obeys the Large Deviation Principle with a deviation function given by the quasipotential. The latter is related to the unique long term idempotent distribution, which is also a stationary idempotent distribution, of the large deviation limit of the queue length process. The proof draws on developments in queueing network stability and idempotent probability.

1 Introduction and summary

In a seminal contribution, Freidlin and Wentzell [5] obtained the Large Deviation Principle (LDP) for the stationary distribution of a diffusion process and showed that the deviation function, which is often referred to as the action functional or the (tight) rate function, is given by the quasipotential. Their ingenious analysis relied heavily on the strong Markov property and involved an intricate study of attainment times. Shwartz and Weiss [10] adapted the methods of Freidlin and Wentzell [5] to the setting of jump Markov processes. In Puhalskii [8], we suggested a different, arguably, more direct and, as we hope, more robust approach. It was prompted by the analogy between large deviations and weak convergence and sought to identify the deviation function in terms of the stationary idempotent distribution of a large deviation limit. In this paper, the approach is applied to establishing the LDP for the long term distribution of the non Markov process of queue lengths in a generalised Jackson network. It is noteworthy that, in addition to being non Markovian, generalised Jackson networks fall into the category of stochastic systems with discontinuous dynamics, whose analysis is generally more difficult. We show that the deviation function is still given by the quasipotential which is related to the stationary idempotent distribution of the limit idempotent process. That stationary idempotent distribution is also a unique long term idempotent distribution, the uniqueness being proved by a coupling argument. Geometric ergodicity of the queue length process enables us to conclude that the long term idempotent distribution is the large deviation limit of the long term queue length distributions.

2 The setup and main result

We consider a queueing network with a homogeneous customer population which comprises $K$ single server stations. Customers arrive exogenously at the stations and are served there in the order of arrival, one customer at a time. Upon being served, they either join a queue at another station or leave the network. Let $A_{k}(t)$ denote the cumulative number of exogenous arrivals at station $k$ by time $t$ , let $S_{k}(t)$ denote the cumulative number of customers that are served at station $k$ for the first $t$ units of busy time of that station, and let $R_{kl}(m)$ denote the cumulative number of customers among the first $m$ customers departing station $k$ that go directly to station $l$ . Let $A_{k}=(A_{k}(t),\,t\in\mathbb{R}_{+})$ , $S_{k}=(S_{k}(t),\,t\in\mathbb{R}_{+})$ , and $R_{k}=(R_{k}(m),\,m\in\mathbb{Z}_{+})$ , where $R_{k}(m)=(R_{kl}(m),\,l\in\mathcal{K})$ and $\mathcal{K}=\{1,2,\ldots,K\}$ . It is assumed that the $A_{k}$ and $S_{k}$ are nonzero renewal processes and $R_{kl}(m)=\sum_{i=1}^{m}1_{\{\zeta_{k}^{(i)}=l\}}$ , where $\{\zeta_{k}^{(1)},\zeta_{k}^{(2)},\ldots\}$ is a sequence of i.i.d. random variables assuming values in $\mathcal{K}$ , $1_{\Gamma}$ standing for the indicator function of set $\Gamma$ . The random entities $A_{k}$ , $S_{l}$ , $R_{i}$ and $Q(0)$ are assumed to be defined on common probability space $(\Omega,\mathcal{F},\mathbf{P})$ and be mutually independent, where $k,l,i\in\mathcal{K}$ . We denote $p_{kl}=\mathbf{P}(\zeta_{k}^{(1)}=l)$ and let $P=(p_{kl})_{k,l=1}^{K}$ . The matrix $P$ is assumed to be of spectral radius less than unity so that every arriving customer eventually leaves. Let $Q=(Q(t),\,t\in\mathbb{R}_{+})$ represent the queue length process, where $Q(t)=(Q_{k}(t),\,k\in\mathcal{K})$ and $Q_{k}(t)$ represents the number of customers at station $k$ at time $t$ . All the stochastic processes are assumed to have piecewise constant right–continuous with left–hand limits trajectories. Accordingly, they are considered as random elements of the associated Skorohod spaces.

For $k\in\mathcal{K}$ and $t\in\mathbb{R}_{+}$ , the following equations are satisfied:

[TABLE]

where

[TABLE]

represents the number of departures from station $k$ by time $t$ and

[TABLE]

represents the cumulative busy time of station $k$ by time $t$ . For given realisations of $Q_{k}(0)$ , $A_{k}$ , $S_{k}$ , and $R_{k}$ , there exist unique $Q_{k}=(Q_{k}(t),\,t\in\mathbb{R}_{+})$ , $D_{k}=(D_{k}(t),\,t\in\mathbb{R}_{+})$ and $B_{k}=(B_{k}(t),\,t\in\mathbb{R}_{+})$ that satisfy (2.1), (2.2) and (2.3), see, e.g., Chen and Mandelbaum [4]. The process $Q$ is non Markov unless all $A_{k}$ and $S_{k}$ are Poisson processes.

Let, for $k\in\mathcal{K}$ , nonnegative random variables $\xi_{k}$ and $\eta_{k}$ represent generic times between exogenous arrivals and service times at station $k$ , respectively. We assume that $\mathbf{P}(\xi_{k}=0)=\mathbf{P}(\eta_{k}=0)=0,$ $\mathbf{E}\exp(\theta\xi_{k})<\infty$ and $\mathbf{E}\exp(\theta\eta_{k})<\infty$ for some $\theta>0$ , and the cumulative distribution functions of the $\xi_{k}$ and $\eta_{k}$ are right–differentiable at [math] with positive derivatives. Let $\beta_{k}=\sup\{\theta\in\mathbb{R}_{+}:\,\mathbf{E}\exp(\theta\xi_{k})<\infty\}$ and $\gamma_{k}=\sup\{\theta\in\mathbb{R}_{+}:\,\mathbf{E}\exp(\theta\eta_{k})<\infty\}$ . Let also $\pi(u)=u\ln u-u+1$ if $u>0$ , $\pi(0)=1$ , $\pi(\infty)=\infty$ , $0/0=0$ , and $\infty\cdot 0=0$ . Let $\mathbb{S}_{+}^{K\times K}$ represent the set of row–substochastic $K\times K$ –matrices and $I$ represent the $K\times K$ –identity matrix. Given vectors $\alpha=(\alpha_{1},\ldots,\alpha_{K})^{T}\in\mathbb{R}_{+}^{K}$ and $\delta=(\delta_{1},\ldots,\delta_{K})^{T}\in\mathbb{R}_{+}^{K}$ , matrix $\varrho\in\mathbb{S}_{+}^{K\times K}$ with rows $\varrho_{k},\,k\in\mathcal{K}$ , and $J\subset\mathcal{K}$ , we define

[TABLE]

and

[TABLE]

where $J^{c}=\mathcal{K}\setminus J\,$ . Also, for $y\in\mathbb{R}^{K}$ , we let

[TABLE]

If $\,J\,$ is a nonempty subset of $\mathcal{K}$ , we denote $F_{J}=\{x=(x_{1},\ldots,x_{K})\in\mathbb{R}_{+}^{K}:x_{k}=0,k\in J,x_{k}>0,k\not\in J\}\,$ , $F_{\emptyset}$ is defined to be the interior of $\mathbb{R}_{+}^{K}$ . Let, for $x\in\mathbb{R}_{+}^{K}$ and $y\in\mathbb{R}^{K}$ ,

[TABLE]

The function $L(x,y)$ is seen to be nonnegative.

Let

[TABLE]

provided $q=(q(t)\,,t\in\mathbb{R}_{+})\in\mathbb{D}(\mathbb{R}_{+},\mathbb{R}_{+}^{K})$ is absolutely continuous with $q(0)=x\in\mathbb{R}_{+}^{K}$ and $\mathbf{I}_{x}(q)=\infty$ , otherwise, where $q(t)=(q_{1}(t),\ldots,q_{K}(t))^{T}$ .

With large deviations in mind, we will assume in the next theorem that the initial queue length depends on large parameter $n$ , so, superscript ” $n$ ” will be used to denote the associated random quantities, e.g., $Q^{n}(t)$ is the queue length vector at time $t$ . Theorem 2.2 in Puhalskii [9] proves the following result.

Theorem 2.1.

If, in addition, $\mathbf{P}(\lvert Q^{n}(0)/n-x\rvert>\epsilon)^{1/n}\to 0$ as $n\to\infty$ , for all $\epsilon>0$ , then the queue length processes $\{(Q^{n}(nt)/n\,,t\in\mathbb{R}_{+})\,,n\in\mathbb{N}\}$ obey the LDP in $\mathbb{D}(\mathbb{R}_{+},\mathbb{R}^{K}_{+})$ for rate $n$ with the deviation function $\mathbf{I}_{x}(q)$ .

For $x\in\mathbb{R}_{+}^{K}$ , we define the quasipotential by

[TABLE]

In order to address an LDP for the stationary queue length distribution, we assume that the network is subcritical:

[TABLE]

where $\mu=(\mu_{1},\ldots,\mu_{K})^{T}$ , $\lambda=(\lambda_{1},\ldots,\lambda_{K})^{T}$ , $\mu_{k}=1/\mathbf{E}\eta_{k}$ and $\lambda_{k}=1/\mathbf{E}\xi_{k}$ . (Inequalities between vectors or matrices are understood to hold entrywise.) In addition, we assume that

there exists number $\overline{\eta}>0$ such that $\mathbf{E}(\eta_{k}-u|\eta_{k}>u)\leq\overline{\eta}$ , for $k\in\mathcal{K}$ and $u>0$ , 2. 2.

$\mathbf{P}(\xi_{k}>u)>0$ , for $k\in\mathcal{K}$ and $u>0$ , 3. 3.

for $k\in\mathcal{K}$ , there exist nonnegative function $f_{k}(u)$ on $\mathbb{R}_{+}$ with $\int_{0}^{\infty}f_{k}(u)\,du>0$ and $m_{k}\in\mathbb{N}$ such that $\mathbf{P}(\sum_{i=1}^{m_{k}}\xi_{k,i}\in[v,w])\geq\int_{v}^{w}f_{k}(u)\,du$ , provided $0\leq v\leq w$ , where $\xi_{k,1},\ldots,\xi_{k,m_{k}}$ are i.i.d. and are distributed as $\xi_{k}$ .

Under these hypotheses, the $Q(t)$ converge in distribution to random variable $\hat{Q}$ , as $t\to\infty$ , see Down and Meyn [6]. The convergence holds for arbitrary initial vector $Q(0)$ and the convergence rate is geometric for the metric of total variation. In addition, if the random variables $Q(t)$ are augmented with residual service and interarrival times to produce a Markov process, then that Markov process has a unique stationary distribution, the distribution of $\hat{Q}$ being a marginal distribution. Our main result is the following theorem.

Theorem 2.2.

The sequence $\{\hat{Q}/n\,,n\in\mathbb{N}\}$ obeys the LDP in $\mathbb{R}_{+}^{K}$ for rate $n$ with the deviation function $\mathbf{V}(x)$ .

*Remark 2.1**.*

Under (2.8), there is no ”large deviation cost” for staying at the origin. On taking in (2.5) $J=\mathcal{K}$ , $y=0$ , $\alpha=\lambda$ , $\varrho=P$ , and $\delta=(I-P^{T})^{-1}\lambda$ and noting that $\delta<\mu$ by (2.8), one can see by (2.4) that $\psi_{\mathcal{K}}(\alpha,\delta,\varrho)=0$ , so $\Psi_{\mathcal{K}}(0)=0$ and $L(0,0)=0$ . More generally, $L(q(t),\dot{q}(t))=0$ when $q$ is ”a fluid limit queue length” or the trajectory of the law of large numbers, i.e., $\dot{q}(t)=\lambda+(P^{T}-I)\mu+(I-P^{T})\phi(t)$ , where $\phi(t)\in\mathbb{R}_{+}^{K}$ and $\phi_{k}(t)q_{k}(t)=0$ , for $k\in\mathcal{K}$ , cf. Puhalskii [9]. The converse is also true: if $\Psi_{J}(y)=0$ , then the infimum in (2.5) is attained at $\alpha=\lambda$ , $\delta_{k}=\mu_{k}$ when $k\not\in J$ and $\varrho=P$ . (For a proof, one notes that $\psi^{A}_{k}(\alpha_{k})=0$ , $\psi^{S}_{k}(\delta_{k})=0$ , and $\psi^{R}_{k}(\varrho_{k})=0$ , if and only if $\alpha_{k}=\lambda_{k}$ , $\delta_{k}=\mu_{k}$ , and $\varrho_{k}=(p_{k1},\ldots,p_{kK})$ , respectively.) As a byproduct, in (2.7) $\mathbf{I}_{0}(q)$ can be replaced with $\int_{0}^{t}L(q(s),\dot{q}(s))\,ds$ .

3 Idempotent probability and the proof of Theorem 2.2

Let us recap some notions of idempotent probability, see, e.g., Puhalskii [7]. Let $\Upsilon$ be a set. Function $\mathbf{\Pi}$ from the power set of $\Upsilon$ to $[0,1]$ is called an idempotent probability if $\mathbf{\Pi}(\Gamma)=\sup_{\upsilon\in\Gamma}\mathbf{\Pi}(\{\upsilon\}),\,\Gamma\subset\Upsilon$ and $\mathbf{\Pi}(\Upsilon)=1$ . The pair $(\Upsilon,\mathbf{\Pi})$ is called an idempotent probability space. For economy of notation, we denote $\mathbf{\Pi}(\upsilon)=\mathbf{\Pi}(\{\upsilon\})$ . Property $\mathcal{P}(\upsilon),\,\upsilon\in\Upsilon,$ pertaining to the elements of $\Upsilon$ is said to hold $\mathbf{\Pi}$ -a.e. if $\mathbf{\Pi}(\mathcal{P}(\upsilon)\text{ does not hold})=0$ , where, in accordance with a tradition of probability theory, we define $\mathbf{\Pi}(\mathcal{P}(\upsilon)\text{ does not hold})=\mathbf{\Pi}(\{\upsilon\in\Upsilon:\,\mathcal{P}(\upsilon)\text{ does not hold}\})$ . Function $f$ from set $\Upsilon$ equipped with idempotent probability $\mathbf{\Pi}$ to set $\Upsilon^{\prime}$ is called an idempotent variable. The idempotent distribution of the idempotent variable $f$ is defined as the set function $\mathbf{\Pi}\circ f^{-1}(\Gamma)=\mathbf{\Pi}(f\in\Gamma),\,\Gamma\subset\Upsilon^{\prime}$ . If $f$ is the canonical idempotent variable defined by $f(\upsilon)=\upsilon$ , then it has $\mathbf{\Pi}$ as the idempotent distribution. If $f=(f_{1},f_{2})$ , with $f_{i}$ assuming values in $\Upsilon^{\prime}_{i}$ , then the (marginal) distribution of $f_{1}$ is defined by $\mathbf{\Pi}^{f_{1}}(\upsilon^{\prime}_{1})=\mathbf{\Pi}(f_{1}=\upsilon^{\prime}_{1})=\sup_{\upsilon:\,f_{1}(\upsilon)=\upsilon^{\prime}_{1}}\mathbf{\Pi}(\upsilon)$ . The idempotent variables $f_{1}$ and $f_{2}$ are said to be independent if $\mathbf{\Pi}(f_{1}=\upsilon^{\prime}_{1},\,f_{2}=\upsilon^{\prime}_{2})=\mathbf{\Pi}(f_{1}=\upsilon^{\prime}_{1})\mathbf{\Pi}(f_{2}=\upsilon^{\prime}_{2})$ for all $(\upsilon^{\prime}_{1},\upsilon^{\prime}_{2})\in\Upsilon^{\prime}_{1}\times\Upsilon^{\prime}_{2}$ , so, the joint distribution is the product of the marginal ones. Independence of finite collections of idempotent variables is defined similarly. Collection $(X_{t},\,t\in\mathbb{R}_{+})$ of idempotent variables on $\Upsilon$ is called an idempotent process. The functions $(X_{t}(\upsilon),\,t\in\mathbb{R}_{+})$ for various $\upsilon\in\Upsilon$ are called trajectories (or paths) of $X$ . Idempotent processes are said to be independent if they are independent as idempotent variables with values in the associated function spaces. The concepts of idempotent processes with independent and (or) stationary increments mimic those for stochatic processes.

If $\Upsilon$ is, in addition, a metric space and the sets $\{\upsilon\in\Upsilon:\,\mathbf{\Pi}(\upsilon)\geq\kappa\}$ are compact for all $\kappa\in(0,1]$ , then $\mathbf{\Pi}$ is called a deviability. Obviously, $\mathbf{\Pi}$ is a deviability if and only if $\mathbf{I}(\upsilon)=-\log\mathbf{\Pi}(\upsilon)$ is a deviation function. If $f$ is a continuous mapping from $\Upsilon$ to another metric space $\Upsilon^{\prime}$ , then $\mathbf{\Pi}\circ f^{-1}$ is a deviability on $\Upsilon^{\prime}$ . As a matter of fact, for the latter property to hold, one can only require that $f$ be continuous on the sets $\{\upsilon\in\Upsilon:\,\mathbf{\Pi}(\upsilon)\geq\kappa\}$ for $\kappa\in(0,1]$ . In general, an idempotent variable is said to be Luzin if its idempotent distribution is a deviability.

Let $\{\mathbf{P}_{n},\,n\in\mathbb{N}\}$ be a sequence of probability measures on metric space $\Upsilon$ endowed with the Borel $\sigma$ -algebra and let $\mathbf{\Pi}$ be a deviability on $\Upsilon$ . The sequence $\{\mathbf{P}_{n},\,n\in\mathbb{N}\}$ is said to large deviation converge (LD converge) at rate $n$ to $\mathbf{\Pi}$ as $n\to\infty$ if $\lim_{n\to\infty}\Bigl{(}\int_{\Upsilon}f(\upsilon)^{n}\,\mathbf{P}_{n}(d\upsilon)\Bigr{)}^{1/n}=\sup_{\upsilon\in\Upsilon}f(\upsilon)\mathbf{\Pi}(\upsilon)$ for every bounded continuous $\mathbb{R}_{+}$ -valued function $f$ on $\Upsilon$ . Equivalently, one may require that $\lim_{n\to\infty}\mathbf{P}_{n}(H)^{1/n}=\mathbf{\Pi}(H)$ for every $\mathbf{\Pi}$ –continuity set $H$ , which is defined by the requirement that the values of $\mathbf{\Pi}$ on the interior and closure of $H$ are equal to each other. Obviously, the sequence $\{\mathbf{P}_{n},\,n\in\mathbb{N}\}$ LD converges at rate $n$ to $\mathbf{\Pi}$ if and only if this sequence obeys the LDP for rate $n$ with deviation function $\mathbf{I}(\upsilon)=-\log\mathbf{\Pi}(\upsilon)$ . Similarly, sequence $\mathbf{\Pi}_{n}$ of deviabilities on $\Upsilon$ is said to converge weakly to deviability $\mathbf{\Pi}$ , as $n\to\infty$ , if $\lim_{n\to\infty}\sup_{\upsilon\in\Upsilon}f(\upsilon)\mathbf{\Pi}_{n}(\upsilon)=\sup_{\upsilon\in\Upsilon}f(\upsilon)\mathbf{\Pi}(\upsilon)$ for every bounded continuous $\mathbb{R}_{+}$ -valued function $f$ on $\Upsilon$ . The analogue of Prohorov’s theorem holds: if the sequence $\mathbf{\Pi}_{n}$ is tight meaning that $\inf_{\Gamma\in\Xi}\limsup_{n\to\infty}\mathbf{\Pi}_{n}(\Upsilon\setminus\Gamma)=0$ , where $\Xi$ represents the collection of compact subsets of $\Upsilon$ , then the $\mathbf{\Pi}_{n}$ converge to a deviability along a subsequence.

LD convergence of probability measures can be also expressed as LD convergence in distribution of the associated random variables to idempotent variables. We say that sequence $\{X_{n},\,n\in\mathbb{N}\}$ of random variables defined on respective probability spaces $(\Omega_{n},\mathcal{F}_{n},\mathbf{P}_{n})$ and assuming values in $\Upsilon^{\prime}$ LD converges in distribution at rate $n$ as $n\to\infty$ to idempotent variable $X$ defined on idempotent probability space $(\Upsilon,\mathbf{\Pi})$ and assuming values in $\Upsilon^{\prime}$ if the sequence of the probability laws of the $X_{n}$ LD converges to the idempotent distribution of $X$ at rate $n$ . If sequence $\{\mathbf{P}_{n},\,n\in\mathbb{N}\}$ of probability measures on $\Upsilon$ LD converges to deviability $\mathbf{\Pi}$ on $\Upsilon$ , then one has LD convergence in distribution for the canonical setting.

We now return to the setting of generalised Jackson networks and let $\mathbf{\Pi}_{x}^{Q}(q)=e^{-\mathbf{I}_{x}(q)}$ . It is proved in Puhalskii [9] that under the hypotheses of Theorem 2.1 there exists unique deviability $\mathbf{\Pi}_{x}$ on $\Upsilon=\mathbb{D}(\mathbb{R}_{+},\mathbb{R}_{+}^{K}\times\mathbb{R}_{+}^{K}\times\mathbb{R}_{+}^{K}\times\mathbb{R}_{+}^{K}\times\mathbb{R}_{+}^{K}\times\mathbb{R}_{+}^{K\times K})$ such that the processes $\bigl{(}(Q^{n}(nt)/n\,,t\in\mathbb{R}_{+}),(B^{n}(nt)/n\,,t\in\mathbb{R}_{+}),(D^{n}(nt)/n\,,t\in\mathbb{R}_{+}),(A^{n}(nt)/n\,,t\in\mathbb{R}_{+}),(S^{n}(nt)/n,\,t\in\mathbb{R}_{+}),(R^{n}(nt)/n\,,t\in\mathbb{R}_{+})\bigr{)}$ LD converge at rate $n$ to the canonical idempotent process $(q,b,d,a,s,r)$ on $\Upsilon$ . The component idempotent processes of $b$ , $d$ , $a$ , $s$ , and $r$ have $\mathbf{\Pi}_{x}$ -a.e. absolutely continuous nondecreasing trajectories starting at [math], the component idempotent processes of $b$ grow not faster than at rate $1$ , and the component idempotent processes of $q$ have $\mathbf{\Pi}_{x}$ -a.e. absolutely continuous trajectories, the idempotent process $q$ has idempotent distribution $\mathbf{\Pi}^{Q}_{x}$ , the idempotent processes $a$ , $s$ and $r$ are independent with respective idempotent distributions $\mathbf{\Pi}^{A}$ , $\mathbf{\Pi}^{S}$ and $\mathbf{\Pi}^{R}$ defined as follows, where, by virtue of our working in a canonical setting, identical pieces of notation are used for denoting idempotent processes and their sample trajectories:

[TABLE]

where $a=(a(t)\,,t\in\mathbb{R}_{+})=(a_{1},\ldots,a_{K})^{T}$ , $a_{k}=(a_{k}(t)\,,t\in\mathbb{R}_{+})$ , $a(t)=(a_{1}(t),\ldots,a_{K}(t))^{T}$ , $s=(s(t)\,,t\in\mathbb{R}_{+})=(s_{1},\ldots,s_{K})^{T}$ , $s_{k}=(s_{k}(t)\,,t\in\mathbb{R}_{+})$ , $s(t)=(s_{1}(t),\ldots,s_{K}(t))^{T}$ , $r=(r(t)\,,t\in\mathbb{R}_{+})=(r_{1},\ldots,r_{K})^{T}$ , $r_{k}=(r_{k}(t)\,,t\in\mathbb{R}_{+})=(r_{k1},\ldots,r_{kK})$ , $r_{kl}=(r_{kl}(t)\,,t\in\mathbb{R}_{+})$ , $r_{k}(t)=(r_{k1}(t)\,,\ldots,r_{kK}(t))$ , $r(t)=(r_{1}(t),\ldots,r_{K}(t))^{T}$ , the functions $a_{k}$ , $s_{k}$ , and $r_{kl}$ being absolutely continuous with $a_{k}(0)=0$ , $s_{k}(0)=0$ , $r_{kl}(0)=0$ , $\dot{a}_{k}(t)\in\mathbb{R}_{+}$ a.e., $\dot{s}_{k}(t)\in\mathbb{R}_{+}$ a.e., and $\dot{r}(t)\in\mathbb{S}^{K\times K}_{+}$ a.e.

Also $\mathbf{\Pi}_{x}$ -a.e. the following equations hold for $t\in\mathbb{R}_{+}$ and $\,k\in\mathcal{K}$ :

[TABLE]

where $b=(b(t),\,t\in\mathbb{R}_{+})$ , $b(t)=(b_{1}(t),\ldots,b_{K}(t))^{T}$ , $d=(d(t),\,t\in\mathbb{R}_{+})$ , and $d(t)=(d_{1}(t),\ldots,d_{K}(t))^{T}$ . Equations (3.4) and (3.5) are obtained by taking large deviation limits in (2.1) and (2.2), respectively. It is noteworthy that since in (3.1), (3.2) and (3.3) the sample trajectories enter the deviabilities only through their derivatives, the idempotent processes $a$ , $s$ and $r$ have independent and stationary increments.

By (3.4), $q(0)=x$ $\mathbf{\Pi}_{x}$ -a.e. In order to allow the initial value $q(0)$ to have a nondegenerate idempotent distribution, we introduce

[TABLE]

where $\tilde{\mathbf{\Pi}}^{Q_{0}}$ is a deviabiltiy on $\mathbb{R}_{+}^{K}$ . One can see that $\mathbf{\Pi}$ is a deviability on $\Upsilon$ . Obviously, $\mathbf{\Pi}(q(0)=x)=\tilde{\mathbf{\Pi}}^{Q_{0}}(x)$ and $q(0)$ , $a$ , $s$ and $r$ are independent under $\mathbf{\Pi}$ . Also, the marginal idempotent distribution of $q$ is given by

[TABLE]

By (3.4), $\mathbf{\Pi}$ –a.e.,

[TABLE]

Let

[TABLE]

The definition implies the semigroup property that

[TABLE]

For $\Delta\subset\mathcal{K}$ , we will denote by $\mathbf{1}_{\Delta}$ the vector with unity entries whose dimension equals the number of elements in $\Delta$ . For compactness of notation, we let $\mathbf{1}=\mathbf{1}_{\mathcal{K}}$ .

Lemma 3.1.

Given $\kappa>0$ and $\epsilon>0$ , there exists $T>0$ such that

[TABLE]

Proof.

By the maxitivity property that $\mathbf{\Pi}(\cup_{i}\Gamma_{i})=\sup_{i}\mathbf{\Pi}(\Gamma_{i})$ , for arbitrary collection of sets $\Gamma_{i}$ , it suffices to work with $\mathbf{\Pi}(a(u)\notin[(\lambda-\epsilon\mathbf{1})u,(\lambda+\epsilon\mathbf{1})u])$ only. By the LD convergence in distribution of $(A(nt)/n\,,t\in\mathbb{R}_{+})$ to $a$ and Lemma A.1 relegated to the appendix, whose assertion can be found in Appendix A of Bell and Williams [1], for some $\sigma\in(0,1)$ ,

[TABLE]

∎

Lemma 3.2.

Given bounded set $B\subset\mathbb{R}_{+}^{K}$ and $\kappa\in(0,1)$ , there exists $\hat{T}>0$ such that if $\mathbf{\Pi}^{Q}(q)>\kappa$ and $q(0)\in B$ , then $\min_{u\in[0,\hat{T}]}\lvert q(u)\rvert=0\,.$

Proof.

The proof proceeds by establishing, initially, that in the long run the idempotent processes $a(t)$ , $s(t)$ and $r(t)$ ”with great deviability” stay close to the corresponding fluid trajectories $\lambda t$ , $\mu t$ and $Pt$ , respectively. Then, drawing on the proof of the stability of fluid models of queueing networks in Bramson [2, 3], it is shown that owing to condition (2.8) the function $\mathbf{1}\cdot(I-P^{T}-\epsilon I)^{-1}q(u)$ decreases linearly with $u$ , provided $\epsilon$ is small enough, which implies that the function must attain [math] .

By (2.8), there exists $\epsilon>0$ such that $(I-P^{T}-\epsilon I)^{-1}(\lambda+\epsilon\mathbf{1})\leq\mu-\epsilon\mathbf{1}$ and $(I-P^{T}+\epsilon I)^{-1}(\lambda-\epsilon\mathbf{1})\leq\mu-\epsilon\mathbf{1}$ . (In the course of the proof, potentially smaller $\epsilon$ will be needed. Yet, there exists $\epsilon$ that satisfies all the requirements. Importantly, it depends neither on $\kappa$ nor on $B$ .) By Lemma 3.1, there exists $T>0$ such that $\mathbf{\Pi}(a(u)\geq(\lambda+\epsilon\mathbf{1})u\text{ for some }u\geq T)<\kappa$ , $\mathbf{\Pi}(a(u)\leq(\lambda-\epsilon\mathbf{1})u\text{ for some }u\geq T)<\kappa$ , $\mathbf{\Pi}(s(u)\leq(\mu-\epsilon\mathbf{1})u\text{ for some }u\geq T)<\kappa$ , $\mathbf{\Pi}(s(u)\geq(\mu+\epsilon\mathbf{1})u\text{ for some }u\geq T)<\kappa$ , $\mathbf{\Pi}(r(u)\geq(P+\epsilon I)u\text{ for some }u\geq T)<\kappa$ , and $\mathbf{\Pi}(r(u)\leq(P-\epsilon I)u\text{ for some }u\geq T)<\kappa$ .

Let

[TABLE]

We have that $\mathbf{\Pi}(\Gamma_{\kappa}^{c})<\kappa$ and that on $\Gamma_{\kappa}$ , provided $u\geq T$ , by (3.4),

[TABLE]

Let us show that there exists $S\geq 2T$ such that $b_{k}(S)\geq T$ for all $k$ on $\Gamma_{\kappa}$ . Intuitively, this is the case because otherwise some $s_{k}(b_{k}(u))$ would be ”bounded” whereas $a_{k}(u)$ can be arbitrarily great for great $u$ pushing $b_{k}(u)$ past $T$ . Formally, assuming that $\epsilon<\lambda_{k}$ , for all $k$ , let $S\geq 2T$ be such that $(\lambda_{k}-\epsilon)(S-T)-(\mu_{k}+\epsilon)T>0$ , for all $k$ . If $b_{k}(S)<T$ , for some $k$ , then, by (3.5), $d_{k}(u)\leq s_{k}(T)$ for $u\in[0,S]$ . By (3.8), on $\Gamma_{\kappa}$ , for $u\in[S-T,S]$ , $q_{k}(u)\geq a_{k}(u)-s_{k}(T)\geq(\lambda_{k}-\epsilon)u-(\mu_{k}+\epsilon)T>0$ . Therefore, by (3.6), $\dot{b}_{k}(u)=1$ a.e. when $u\in[S-T,S]$ , so $b_{k}(S)\geq T$ , which contradicts the assumption that $b_{k}(S)<T$ . It is worth noting that whereas both $T$ and $S$ may depend on either $\epsilon$ or $\kappa$ , neither of them depends on $q(0)$ .

We now assume that $q$ is piecewise linear, which assumption is to be disposed of later. Let us suppose that $q_{k}(u)>0$ for $u$ in a right neighborhood of $S$ for some $k$ on $\Gamma_{\kappa}$ . Then, $b_{k}(u)=b_{k}(S)+u-S\geq T+u-S$ , for $u\geq S$ , until $q_{k}(u)$ hits zero. Accordingly, $d_{k}(u)=s_{k}(b_{k}(u))\geq(\mu_{k}-\epsilon)(u-S)$ . Hence, if $q(u)>0$ entrywise in a right neighborhood of $S$ , then

[TABLE]

where we denote

[TABLE]

As a consequence, for some $\rho>0$ , which is dependent on $\epsilon$ only, while $q(u)>0$ entrywise,

[TABLE]

By the righthand inequality in (3.10) and (3.11),

[TABLE]

By (3.12), there exist $\hat{\rho}>0$ and $\gamma>0$ such that, provided $\epsilon$ is small enough, if $u\geq S$ , then, while $q(u)$ stays entrywise positive,

[TABLE]

Let us show that similar inequalities hold on $\Gamma_{\kappa}$ for all $u\geq S$ . Given $v\geq S$ , let $O$ denote a possibly empty set of indices $k$ such that $q_{k}(u)=0$ on some interval $[v,v+\eta]$ and $q_{k}(u)>0$ if $k\notin O$ and $u\in(v,v+\eta)$ . Such $\eta$ exists because $q(u)$ is piecewise linear. We assume that $q(u)\not=0$ on $[v,v+\eta]$ , so $O$ is a proper subset of $\mathcal{K}$ . By the lefthand inequality in (3.10), on $\Gamma_{\kappa}$ ,

[TABLE]

Therefore, using subscript $O$ and $O^{c}$ to denote restrictions of vectors to indices in $O$ and $O^{c}$ respectively, and using subscripts $OO$ and $OO^{c}$ to denote restrictions of matrices to entries with both indices in $OO$ and $OO^{c}$ , respectively, we have that

[TABLE]

so, assuming $\epsilon$ is small enough,

[TABLE]

On the other hand, by (3.11), $\lambda_{O}=(I-P^{T})_{OO}\nu_{O}-P^{T}_{OO^{c}}\nu_{O^{c}}\,.$ Substitution in (3.16) and rearranging yield

[TABLE]

In analogy with the derivation of (3.12), one obtains that, for some $\rho_{O}>0$ ,

[TABLE]

Therefore, for $u\in[v,v+\eta]$ ,

[TABLE]

Since $O^{c}\not=\emptyset$ , by (3.17), (3.18) and the bound $d(u)\leq(\mu+\epsilon)u$ when $u\geq S$ , there exist $\tilde{\rho}>0$ and $\gamma>0$ which do not depend on $O$ such that, assuming $\epsilon$ is small enough, for $u\in[v,v+\eta]$ ,

[TABLE]

By (3.13), we obtain that (3.14) still holds, for suitable $\hat{\rho}>0$ which does not depend on $O$ and $u\in[v,v+\eta]$ , provided $\epsilon$ is small enough. We can repeat the same argument over and over again, so, (3.14) holds until $q(u)=0$ . Hence, one can take

[TABLE]

as the time by which $q$ is bound to hit the origin.

Suppose now that $q$ is not necessarily piecewise linear and $\mathbf{\Pi}^{Q}(q)>\kappa$ . By Lemmas 4.1–4.4 in Puhalskii [9], there exist piecewise linear $q^{n}$ which converge to $q$ as $n\to\infty$ such that $\mathbf{\Pi}^{Q}(q^{n})\to\mathbf{\Pi}^{Q}(q)$ . By what’s been proved, there exist $t^{n}$ from $[0,\hat{T}]$ such that $\lvert q^{n}(t^{n})\rvert=0$ . Since $\lvert q^{n}(t^{n})-q(t^{n})\rvert\to 0$ , it follows that $\lvert q(t^{\prime})\rvert=0$ where $t^{\prime}$ represents a subsequential limit of the $t^{n}$ . ∎

Theorem 3.1.

There exists deviability $\hat{\mathbf{\Pi}}$ on $\mathbb{R}_{+}^{K}$ such that, for every bounded set $B\subset\mathbb{R}_{+}^{K}$ ,

[TABLE]

Furthermore, given $y$ , $\mathbf{\Pi}_{x,t}(y)=\hat{\mathbf{\Pi}}(y)$ for all $t$ great enough and all $x\in B$ . The deviability $\hat{\mathbf{\Pi}}$ is a unique stationary deviability for the semigroup $\mathbf{\Pi}_{x,t}$ meaning that, for all $y\in\mathbb{R}_{+}^{K}$ and $t\in\mathbb{R}_{+}$ ,

[TABLE]

Proof.

One can see that $\mathbf{\Pi}_{0,t}(y)$ is a nondecreasing function of $t$ . Indeed, let $u\leq t$ . Given function $q$ such that $q(0)=0$ and $q(u)=y$ , we can associate with it function $\tilde{q}$ such that $\tilde{q}(v)=0$ for $v\in[0,t-u]$ and $\tilde{q}(v)=q(v-(t-u))$ for $v\in[u,t]$ . It follows that $\int_{0}^{t-s}L(\tilde{q}(r),\dot{\tilde{q}}(r))\,dr=0$ , so $\mathbf{I}_{0}(\tilde{q})=\mathbf{I}_{0}(q)$ yielding the desired monotonicity. We let

[TABLE]

Let us show that $\mathbf{\Pi}_{0,t}(y)$ levels off eventually as a function of $t$ . Let $\kappa>0$ . We define $t^{\prime}$ as $\hat{T}$ in the statement of Lemma 3.2 with $\{x\in\mathbb{R}_{+}^{K}:\,\lvert x\rvert\leq 1\}\,$ as set $B$ . Suppose that $\mathbf{\Pi}_{0,t}(y)>\kappa$ , where $t\geq t^{\prime}+1$ . Let $q$ be such that $q(0)=0$ , $q(t)=y$ and $\mathbf{\Pi}_{0}^{Q}(q)=\mathbf{\Pi}_{0,t}(y)$ . Let $\tilde{t}=\inf_{t:\,q(t)\geq 1}\wedge 1$ . Then $q(\tilde{t})\leq 1$ and $0<\tilde{t}\leq 1$ . By Lemma 3.2, there exists $\breve{t}\in[\tilde{t},t^{\prime}+1]$ such that $q(\breve{t})=0$ . On defining $\tilde{q}(s)=q(s+\breve{t})$ , we have that $\mathbf{\Pi}_{0}^{Q}(\tilde{q})\geq\mathbf{\Pi}^{Q}_{0}(q)$ . On the other hand, since $\breve{t}\leq t$ , we have that $\tilde{q}(t-\breve{t})=y$ which implies that $\mathbf{\Pi}_{0}^{Q}(\tilde{q})\leq\mathbf{\Pi}_{0,t-\breve{t}}(y)\leq\mathbf{\Pi}_{0,t}(y)=\mathbf{\Pi}_{0}^{Q}(q)$ , so, $q(u)=0$ on $[0,\breve{t}]$ , for Remark 2.1 implies that if $q(0)=0$ and $q(u)=0$ for some $u>0$ , then $\int_{0}^{u}L(q(v),\dot{q}(v))\,dv=0$ if and only if $q(v)=0$ on $[0,u]$ . Hence, $\breve{t}=\tilde{t}=1$ , so, $\mathbf{\Pi}_{0,t-1}(y)=\mathbf{\Pi}_{0,t}(y)$ . This proves that if $\mathbf{\Pi}_{0,t}(y)>\kappa$ and $t\geq t^{\prime}+1$ , then $\mathbf{\Pi}_{0,t^{\prime}+1}(y)=\mathbf{\Pi}_{0,t}(y)$ . We also have that $\mathbf{\Pi}_{0,t}(y)\leq\kappa\vee\mathbf{\Pi}_{0,t^{\prime}+1}(y)$ , for all $t$ and $y$ . Hence, the net of deviabilities $\mathbf{\Pi}_{0,t}$ is tight, so, $\hat{\mathbf{\Pi}}$ is a deviability too.

Let us prove that

[TABLE]

A coupling argument is employed. We prove, at first, that, for arbitrary $\kappa>0$ ,

[TABLE]

By Lemma 3.2, there exists $\hat{T}$ such that if $q(0)\in B$ and $\mathbf{\Pi}^{Q}(q)>\kappa$ , then $q(u)=0$ for some $u\in[0,\hat{T}]$ . Let us fix $x\in B$ and $y\in\mathbb{R}_{+}^{K}$ . One can assume that $t\geq\hat{T}$ and that $\mathbf{\Pi}_{x,t}(y)>\kappa$ . Let trajectory $\hat{q}$ be such that $\hat{q}(0)=x$ , $\hat{q}(t)=y$ and $\mathbf{\Pi}_{x,t}(y)=\mathbf{\Pi}^{Q}(\hat{q})\,$ . By Lemma 3.2, there exists $\hat{T}_{1}\in[0,\hat{T}]$ such that $\hat{q}(\hat{T}_{1})=0$ . We define $\tilde{q}$ by letting $\tilde{q}(u)=0$ when $u\leq\hat{T}_{1}$ and $\tilde{q}(u)=\hat{q}(u)$ when $u\geq\hat{T}_{1}$ . By Remark 2.1, $\mathbf{\Pi}^{Q}(\hat{q})\leq\mathbf{\Pi}^{Q}(\tilde{q})\leq\mathbf{\Pi}_{0,t}(y)\,,$ proving (3.22).

On the other hand, given $t\geq\hat{T}+1$ , $x\in B$ , and $q$ such that $q(0)=0$ , $q(t)=y$ , $\mathbf{\Pi}^{Q}(q)=\mathbf{\Pi}_{0,t}(y)>\kappa$ , and $q(u)=0$ for all $u\in[0,\hat{T}]$ (the latter can be always assumed as we have seen), we define $\hat{q}$ with $\hat{q}(0)=x$ by letting it follow the law of large numbers until it hits zero at some $\hat{T}_{1}\in[0,\hat{T}]$ and by letting $\hat{q}(u)=q(u-\hat{T}_{1})$ , for $u\geq\hat{T}_{1}$ . Since $\mathbf{\Pi}^{Q}(\hat{q})=\mathbf{\Pi}^{Q}(q)$ by Remark 2.1 , we obtain that $\mathbf{\Pi}_{0,t}(y)=\mathbf{\Pi}^{Q}(\hat{q})\leq\mathbf{\Pi}_{x,t}(y)$ , which concludes the proof of (3.21).

We have shown that $\mathbf{\Pi}_{x,t}(y)\to\hat{\mathbf{\Pi}}(y)$ , as $t\to\infty$ , uniformly over $y\in\mathbb{R}_{+}^{K}$ and over $x$ from bounded sets. It follows that, for arbitrary initial deviability $\tilde{\mathbf{\Pi}}^{Q_{0}}$ ,

[TABLE]

Letting $u\to\infty$ in (3.9) implies that $\hat{\mathbf{\Pi}}$ is a unique stationary initial deviability. (For, if $\mathbf{\Pi}^{\prime}$ is another stationary deviability, then $\lvert\hat{\mathbf{\Pi}}(y)-\mathbf{\Pi}^{\prime}(y)\rvert=\lvert\hat{\mathbf{\Pi}}(y)\sup_{x\in\mathbb{R}_{+}^{K}}\mathbf{\Pi}^{\prime}(x)-\sup_{x\in\mathbb{R}_{+}^{K}}\mathbf{\Pi}^{\prime}(x)\mathbf{\Pi}_{x,t}(y)\rvert\leq\sup_{x\in\mathbb{R}_{+}^{K}}\lvert\mathbf{\Pi}^{\prime}(x)\hat{\mathbf{\Pi}}(y)-\mathbf{\Pi}^{\prime}(x)\mathbf{\Pi}_{x,t}(y)\rvert\leq\sup_{\begin{subarray}{c}x\in\mathbb{R}_{+}^{K}\,:\\ \mathbf{\Pi}^{\prime}(x)\geq\kappa\end{subarray}}\lvert\hat{\mathbf{\Pi}}(y)-\mathbf{\Pi}_{x,t}(y)\rvert\vee\kappa$ , where $\kappa\in(0,1]$ , and one can let $t\to\infty$ .)

∎

*Remark 3.1**.*

The proof shows that the value of $t$ where the $\mathbf{\Pi}_{0,t}(y)$ level off can be chosen uniformly over $y$ such that $\hat{\mathbf{\Pi}}(y)\geq\kappa$ .

Proof of Theorem 2.2.

Let $\mathbf{Q}^{n}$ denote the distribution of $\hat{Q}/n$ and let $\mathbf{Q}^{n}_{0,t}$ denote the distribution of $Q(nt)/n$ for $Q(0)=0$ . Let $H\subset\mathbb{R}^{K}$ be a $\hat{\mathbf{\Pi}}$ –continuity set. We have that

[TABLE]

By Theorem 4.1 in Down and Meyn [6], there exist $A>1$ and $\rho\in(0,1)$ such that $\lvert\mathbf{Q}^{n}(H)-\mathbf{Q}^{n}_{0,t}(H)\rvert\leq A\rho^{nt}\,.$ Given $\epsilon>0$ , let $t$ be such that $A\rho^{t}<\epsilon$ and $\lvert\mathbf{\Pi}_{0,t}(H)-\hat{\mathbf{\Pi}}(H)\rvert<\epsilon$ . Since, by Theorem 2.1, for all $n$ great enough, $\lvert\mathbf{Q}_{0,t}^{n}(H)^{1/n}-\mathbf{\Pi}_{0,t}(H)\rvert<\epsilon$ , it follows that $\lvert\mathbf{Q}^{n}(H)^{1/n}-\hat{\mathbf{\Pi}}(H)\rvert<3\epsilon$ , for all $n$ great enough. (Alternatively, one may let $n\to\infty$ and then let $t\to\infty$ in (3.23).) Finally, $\hat{\mathbf{\Pi}}(x)=e^{-\mathbf{V}(x)}$ by (2.7) and (3.20). ∎

*Remark 3.2**.*

Since $\mathbf{\Pi}_{0,t}(H)\uparrow\hat{\mathbf{\Pi}}(H)$ , as $t\to\infty$ , one can see by (3.23), that, more generally, geometric ergodicity of $\mathbf{Q}_{0,t}$ , as $t\to\infty$ , for the metric of total variation and a sample path LDP for $(Q_{nt}/n\,,t\geq 0)$ with $Q_{0}=0$ , imply an LDP for $\mathbf{Q}^{n}$ .

Appendix A Appendix

Lemma A.1.

Let $(N(t)\,,t\in\mathbb{R}_{+})$ be a renewal process with rate $\lambda$ . Suppose that certain exponential moments of the inter-renewal times are finite. Then, given arbitrary $\epsilon>0$ , there exists $\sigma\in(0,1)$ such that, for all $t\in\mathbb{R}_{+}$ ,

[TABLE]

Proof.

Let $\vartheta_{1},\vartheta_{2},\ldots$ denote the successive inter-renewal times. For suitable $\alpha>0$ ,

[TABLE]

Hence,

[TABLE]

Since $\mathbf{E}(\vartheta_{1}-1/\lambda)=0$ , the latter righthand side is less than unity for $\alpha$ small enough. ∎

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S.L. Bell and R.J. Williams. Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy. Ann. Appl. Probab. , 11(3):608–649, 2001.
2[2] M. Bramson. Stability of queueing networks , volume 1950 of Lecture Notes in Mathematics . Springer, Berlin, 2008. Lectures from the 36th Probability Summer School held in Saint-Flour, July 2–15, 2006.
3[3] M. Bramson. Stability of queueing networks. Probab. Surv. , 5:169–345, 2008.
4[4] H. Chen and A. Mandelbaum. Discrete flow networks: bottleneck analysis and fluid approximations. Math. Oper. Res. , 16(2):408–446, 1991.
5[5] M.I. Freidlin and A.D. Wentzell. Random Perturbations of Dynamical Systems . Nauka, 1979. In Russian, English translation: Springer, 1984.
6[6] S. P. Meyn and D. Down. Stability of generalized Jackson networks. Ann. Appl. Probab. , 4(1):124–148, 1994.
7[7] A. Puhalskii. Large Deviations and Idempotent Probability . Chapman & Hall/CRC, 2001.
8[8] A. Puhalskii. On large deviation convergence of invariant measures. J. Theoret. Probab. , 16(3):689–724, 2003.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Large deviations of the long term distribution of

Abstract

1 Introduction and summary

2 The setup and main result

Theorem 2.1**.**

Theorem 2.2**.**

Remark 2.1*.*

3 Idempotent probability and the proof of Theorem 2.2

Lemma 3.1**.**

Proof.

Lemma 3.2**.**

Proof.

Theorem 3.1**.**

Proof.

Remark 3.1*.*

Proof of Theorem 2.2.

Remark 3.2*.*

Appendix A Appendix

Lemma A.1**.**

Proof.

Theorem 2.1.

Theorem 2.2.

*Remark 2.1**.*

Lemma 3.1.

Lemma 3.2.

Theorem 3.1.

*Remark 3.1**.*

*Remark 3.2**.*

Lemma A.1.