Moderate deviations and extinction of an epidemic

Etienne Pardoux

arXiv:1905.08986·math.PR·March 6, 2020


TL;DR

This paper investigates how stochastic fluctuations influence the extinction time of an epidemic in large populations, using probabilistic theories like the Central Limit Theorem and Moderate Deviations to estimate extinction times.

Contribution

It introduces a novel approach applying Moderate and Large Deviations principles to estimate epidemic extinction times in stochastic models near deterministic equilibria.

Findings

01 Estimates of epidemic extinction times depend on population size.

02 Moderate deviations provide precise asymptotic estimates.

03 Large deviations help understand rare extinction events.

Abstract

Consider an epidemic model with a constant flux of susceptibles, in a situation where the corresponding deterministic epidemic model has a unique stable endemic equilibrium. For the associated stochastic model, whose law of large numbers limit is the deterministic model, the disease free equilibrium is an absorbing state, which is reached sooner or later by the process. However, for a large population size, i.e. when the stochastic model is close to its deterministic limit, the time needed for the stochastic perturbations to stop the epidemic may be enormous. In this paper, we discuss how the Central Limit Theorem, Moderate and Large Deviations allow us to give estimates of the extinction time of the epidemic, depending upon the size of the population.

Equations

$$\begin{cases} s'(t) = -\lambda s(t)i(t)+\gamma i(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t). \end{cases}$$

$$z'(t)=\lambda z(t)(1-z(t))-\gamma z(t).$$

$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right). \end{cases}$$

$$\lambda\int_{0}^{t}\frac{\mathcal{S}^{N}_{r}}{N}\,\mathcal{I}^{N}_{r}\,dr.$$

$$\begin{cases} s'(t) = -\lambda s(t)i(t)+\rho r(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t), \\ r'(t) = \gamma i(t)-\rho r(t), \end{cases}$$

$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{loim}\left(\rho N\int_{0}^{t}R^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ R^{N}_{t} = R^{N}_{0}+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{loim}\left(\rho N\int_{0}^{t}R^{N}_{r}\,dr\right). \end{cases}$$

$$\begin{cases} s'(t) = \mu-\lambda s(t)i(t)-\mu s(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t)-\mu i(t), \\ r'(t) = \gamma i(t)-\mu r(t), \end{cases}$$

$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{birth}(\mu Nt)-\frac{1}{N}P_{ds}\left(\mu N\int_{0}^{t}S^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{di}\left(\mu N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ R^{N}_{t} = R^{N}_{0}+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{dr}\left(\mu N\int_{0}^{t}R^{N}_{r}\,dr\right). \end{cases}$$

$$Z^{N}_{t}=z_{N}+\frac{1}{N}\sum_{j=1}^{k}h_{j}P_{j}\left(N\int_{0}^{t}\beta_{j}(Z^{N}_{s})\,ds\right)=z_{N}+\int_{0}^{t}b(Z^{N}_{s})\,ds+\frac{1}{N}\sum_{j=1}^{k}h_{j}M_{j}\left(N\int_{0}^{t}\beta_{j}(Z^{N}_{s})\,ds\right),$$

$$Z^{N}_{t}=z_{N}+\frac{1}{N}\sum_{j=1}^{k}h_{j}\int_{0}^{t}\int_{0}^{N\beta_{j}(Z^{N}_{s})}\mathcal{M}_{j}(ds,du)=z_{N}+\int_{0}^{t}b(Z^{N}_{s})\,ds+\frac{1}{N}\sum_{j=1}^{k}h_{j}\int_{0}^{t}\int_{0}^{N\beta_{j}(Z^{N}_{s})}\overline{\mathcal{M}}_{j}(ds,du),$$

(H.1)

(H.2)

$$\frac{dz_{t}}{dt}=b(t,z_{t}),\quad z_{0}=x.$$

$$\frac{P(Nt)}{N}\to t\quad\text{a.s. as }N\to\infty.$$

$$U_{t}=\int_{0}^{t}\nabla_{x}b(s,z_{s})U_{s}\,ds+\sum_{j=1}^{k}h_{j}\int_{0}^{t}\sqrt{\beta_{j}(s,z_{s})}\,dB_{j}(s),\quad t\geq 0,$$

$$\frac{d\phi_{t}}{dt}=\sum_{j=1}^{k}c_{j}(t)h_{j},\quad t\ \text{a.e.}$$

$$I_{T}(\phi):=\begin{cases}\inf_{c\in\mathcal{A}_{k}(\phi)}I_{T}(\phi\mid c), & \text{if }\phi\in\mathcal{AC}_{T,d};\\ \infty, & \text{otherwise.}\end{cases}$$

$$I_{T}(\phi\mid c)=\int_{0}^{T}\sum_{j=1}^{k}g(c_{j}(t),\beta_{j}(\phi_{t}))\,dt$$

$$\liminf_{N\to\infty}\frac{1}{N}\log\mathbb{P}(Z^{N,z_{N}}\in O)\geq-I_{T,z}(O).$$

$$\limsup_{N\to\infty}\frac{1}{N}\log\mathbb{P}(Z^{N,z_{N}}\in F)\leq-I_{T,z}(F).$$

$$\overline{V}:=\inf_{T>0}\ \inf_{\phi\in\mathcal{AC}_{T,d},\ \phi(0)=z^{\ast},\ \phi_{1}(T)=0}I_{T}(\phi).$$

$$T^{N,z}_{\text{Ext}}=\inf\{t>0,\ Z^{N}_{1}(t)=0,\ \text{if }Z^{N}(0)=z_{N}\}.$$

$$\lim_{N\to\infty}\mathbb{P}\big(\exp\{N(\overline{V}-\eta)\}<T^{N,z}_{\text{Ext}}<\exp\{N(\overline{V}+\eta)\}\big)=1.$$

$$\exp\{N(\overline{V}-\eta)\}\leq\mathbb{E}(T^{N,z}_{\text{Ext}})\leq\exp\{N(\overline{V}+\eta)\}.$$

$i'(t)$

$s'(t)$

$$R_{0}=\frac{\lambda}{\gamma+\mu},\qquad\varepsilon=\frac{1/(\gamma+\mu)}{1/\mu}=\frac{\mu}{\gamma+\mu}.$$

$$N_{c}\sim\frac{1}{(i^{\ast})^{2}R_{0}}=\frac{1}{\varepsilon^{2}(1-R_{0}^{-1})^{2}R_{0}},$$

$$z(t)=z+\int_{0}^{t}b(z(s))\,ds$$

$$z_{N}(t)=N^{-\alpha}z+\int_{0}^{t}b(z_{N}(s))\,ds,$$

$\overline{z}_{N}(t)$


Full text

Moderate Deviations and Extinction of an Epidemic

É. Pardoux


1 Introduction

We consider epidemic models where there is a constant flux of susceptible individuals, either because the infected individuals become susceptible immediately after healing, or after some time during which the individual is immune to the illness, or because there is a constant flux of newborn or immigrant susceptibles.

In the above three cases, for certain values of the parameters, there is an endemic equilibrium, which is a stable equilibrium of the associated deterministic epidemic model. The deterministic model can be considered as the Law of Large Numbers limit (as the size of the population tends to $\infty$ ) of a stochastic model, where infections, healings, births and deaths happen according to Poisson processes whose rates depend upon the numbers of individuals in each compartment.

Since the disease free states are absorbing, it follows from an irreducibility property, which is clearly valid in our models, that the epidemic will stop sooner or later in the more realistic stochastic model. However, the time which the stochastic perturbations will need to stop the epidemic may be enormous when the size $N$ of the population is large. The aim of this paper is to describe, based upon the Central Limit Theorem, Large and Moderate Deviations, the time it takes for the epidemic to stop in the stochastic model.

The law of large numbers and central limit theorems are rather old. They can be found e.g. in chapter 11 of Ethier and Kurtz [3]. They are also presented, in the framework of epidemic models, in Britton and Pardoux [1]. The Large Deviations results are close to those presented in Shwartz and Weiss [9], [10], although their assumptions are not quite satisfied in our models. Derivations adapted to our setup can be found in Kratz and Pardoux [5], Pardoux and Samegni–Kepgnou [6], and Britton and Pardoux [1]. The results concerning moderate deviations are new and constitute the core of this paper. Our derivation is essentially based upon an infinite dimensional generalization of the Gärtner–Ellis Theorem, Corollary 4.6.14 from Dembo and Zeitouni [2]. Our main results are Theorem 4.10 and Theorem 4.13. We also give expressions for the rate function in our three models of interest, and in the case of the simplest model we give an explicit formula for the quasi–potential. We also compare in that case the upper bounds on the fluctuations given respectively by the central limit theorem, moderate deviations, and large deviations.

The paper is organized as follows. In section 2, we describe the three deterministic and stochastic models which we have in mind, namely the SIS, SIRS and SIR model with demography. In section 3, we give the general formulation of the stochastic models, and recall the Law of Large Numbers, the Central Limit Theorem and the Large Deviations, and their application to the time of extinction of an epidemic. In section 4, we establish the moderate deviations result and explain how it can be used to predict the time taken for an epidemic to cease, depending upon the size of the population. Finally an Appendix establishes an estimate of exponential moments of the integral with respect to a compensated Poisson random measure. This estimate is used several times in our proofs.

In this paper, the same letter $C$ denotes an arbitrary constant, whose value may change from line to line.

2 The three models

2.1 The SIS model

The deterministic SIS model is the following. Let $s(t)$ (resp. $i(t)$ ) denote the proportion of susceptible (resp. infectious) individuals in the population. Given an infection parameter $\lambda$ , and a recovery parameter $\gamma$ , the deterministic SIS model reads

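$$\begin{cases} s'(t) = -\lambda s(t)i(t)+\gamma i(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t). \end{cases}$$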

Since clearly $s(t)+i(t)\equiv 1$, the system can be reduced to a one dimensional ODE. If we let $z(t)=i(t)$, we have $s(t)=1-z(t)$, and we obtain the ODE

$$z'(t)=\lambda z(t)(1-z(t))-\gamma z(t).$$

It is easy to verify that this ODE has a so–called “disease free equilibrium”, which is $z=0$ . If $\lambda>\gamma$ , this equilibrium is unstable, and there is an endemic stable equilibrium $z^{\ast}=1-\gamma/\lambda$ .
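As an illustration of the stability claim above, the following minimal Python sketch integrates the reduced SIS ODE with a classical fourth-order Runge-Kutta scheme and compares the terminal value with $z^{\ast}=1-\gamma/\lambda$. The parameter values, the step size and the function names are illustrative assumptions and do not come from the paper.

```python
def sis_rhs(z, lam, gamma):
    """Right-hand side of the reduced SIS ODE z' = lam*z*(1-z) - gamma*z."""
    return lam * z * (1.0 - z) - gamma * z


def integrate_sis(z0, lam, gamma, t_end=50.0, dt=1e-3):
    """Integrate the SIS ODE on [0, t_end] with the classical RK4 scheme."""
    z = z0
    for _ in range(int(round(t_end / dt))):
        k1 = sis_rhs(z, lam, gamma)
        k2 = sis_rhs(z + 0.5 * dt * k1, lam, gamma)
        k3 = sis_rhs(z + 0.5 * dt * k2, lam, gamma)
        k4 = sis_rhs(z + dt * k3, lam, gamma)
        z += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return z


# Illustrative (hypothetical) parameters with lam > gamma.
lam, gamma = 2.0, 1.0
z_star = 1.0 - gamma / lam                 # endemic equilibrium
z_end = integrate_sis(0.01, lam, gamma)    # start with 1% infectious
print(f"z(50) = {z_end:.6f}, z* = {z_star:.6f}")

# With lam < gamma the disease free equilibrium z = 0 attracts instead.
z_sub = integrate_sis(0.5, 1.0, 2.0)
print(f"subcritical case: z(50) = {z_sub:.2e}")
```

Any small positive initial proportion leads to $z^{\ast}$ when $\lambda>\gamma$, while the solution decays to $0$ when $\lambda<\gamma$.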

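The corresponding stochastic model is as follows. Let $S^{N}_{t}$ (resp. $I^{N}_{t}$) denote the proportion of susceptible (resp. infectious) individuals in a population of total size $N$.

$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right). \end{cases}$$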

Here $P_{inf}(t)$ and $P_{rec}(t)$ are two mutually independent standard (i.e. rate $1$ ) Poisson processes. Let us give some explanations, first concerning the modeling, then concerning the mathematical formulation.

Let $\mathcal{S}^{N}_{t}$ (resp. $\mathcal{I}^{N}_{t}$ ) denote the number of susceptible (resp. infectious) individuals in the population. The equations for those quantities are the above equations, multiplied by $N$ . The argument of $P_{inf}(t)$ reads

$$\lambda\int_{0}^{t}\frac{\mathcal{S}^{N}_{r}}{N}\,\mathcal{I}^{N}_{r}\,dr.$$

The formulation of such a rate of infections can be explained as follows. Each infectious individual meets other individuals in the population at some rate $\beta$. The encounter results in a new infection with probability $p$ if the partner of the encounter is susceptible, which happens with probability $\mathcal{S}^{N}_{t}/N$, since we assume that each individual in the population has the same probability of being that partner, and with probability $0$ if the partner is an infectious individual. Letting $\lambda=\beta p$ and summing over the infectious individuals at time $t$ gives the above rate. Concerning recovery, it is assumed that each infectious individual recovers at rate $\gamma$, independently of the others.
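The rates just described can be turned into a simulation. The sketch below is a minimal Gillespie-type simulation of the stochastic SIS model in terms of counts, assuming a total infection rate $\lambda(N-\mathcal{I})\mathcal{I}/N$ and a total recovery rate $\gamma\mathcal{I}$. The function name `simulate_sis`, the seed and all parameter values are hypothetical choices made for illustration only.

```python
import random


def simulate_sis(N, lam, gamma, i0, t_max, seed=0):
    """Gillespie-type simulation of the stochastic SIS model (counts).

    The state is the number I of infectious individuals; N - I are
    susceptible. Infections occur at total rate lam * (N - I) * I / N and
    recoveries at total rate gamma * I. Returns (t, I), where t is either
    the extinction time (then I == 0) or t_max.
    """
    rng = random.Random(seed)
    t, infectious = 0.0, i0
    while infectious > 0:
        rate_inf = lam * (N - infectious) * infectious / N
        rate_rec = gamma * infectious
        total = rate_inf + rate_rec
        t_next = t + rng.expovariate(total)   # waiting time to next event
        if t_next > t_max:
            return t_max, infectious
        t = t_next
        if rng.random() * total < rate_inf:
            infectious += 1                   # infection event
        else:
            infectious -= 1                   # recovery event
    return t, infectious


# Supercritical example (lam > gamma): the count hovers near N * (1 - gamma/lam).
t_sup, i_sup = simulate_sis(N=1000, lam=2.0, gamma=1.0, i0=500, t_max=5.0, seed=1)
print(t_sup, i_sup)

# Subcritical example (lam < gamma): extinction occurs quickly.
t_sub, i_sub = simulate_sis(N=200, lam=0.5, gamma=1.0, i0=10, t_max=1e4, seed=1)
print(t_sub, i_sub)
```

Dividing the count by $N$ gives a trajectory of the proportion $I^{N}_{t}$ appearing in the equations above.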

2.2 The SIRS model

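In the SIRS model, contrary to the SIS model, an infectious individual who heals is first immune to the illness (he is “recovered”), and only after some time does he lose his immunity and become susceptible again. The deterministic SIRS model reads

$$\begin{cases} s'(t) = -\lambda s(t)i(t)+\rho r(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t), \\ r'(t) = \gamma i(t)-\rho r(t), \end{cases}$$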

while the stochastic SIRS model reads

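$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{loim}\left(\rho N\int_{0}^{t}R^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ R^{N}_{t} = R^{N}_{0}+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{loim}\left(\rho N\int_{0}^{t}R^{N}_{r}\,dr\right). \end{cases}$$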

These two models could be reduced to two–dimensional models for $z(t)=(i(t),s(t))$ (resp. for $Z^{N}_{t}=(I^{N}_{t},S^{N}_{t})$ ).

2.3 The SIR model with demography

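In this model, recovered individuals remain immune forever, but there is a flux of susceptibles by births at a given rate multiplied by $N$, while individuals from each of the three compartments die at rate $\mu$. Thus the deterministic model reads

$$\begin{cases} s'(t) = \mu-\lambda s(t)i(t)-\mu s(t), \\ i'(t) = \lambda s(t)i(t)-\gamma i(t)-\mu i(t), \\ r'(t) = \gamma i(t)-\mu r(t), \end{cases}$$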

whose stochastic variant reads

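$$\begin{cases} S^{N}_{t} = S^{N}_{0}-\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)+\frac{1}{N}P_{birth}(\mu Nt)-\frac{1}{N}P_{ds}\left(\mu N\int_{0}^{t}S^{N}_{r}\,dr\right), \\ I^{N}_{t} = I^{N}_{0}+\frac{1}{N}P_{inf}\left(\lambda N\int_{0}^{t}S^{N}_{r}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{di}\left(\mu N\int_{0}^{t}I^{N}_{r}\,dr\right), \\ R^{N}_{t} = R^{N}_{0}+\frac{1}{N}P_{rec}\left(\gamma N\int_{0}^{t}I^{N}_{r}\,dr\right)-\frac{1}{N}P_{dr}\left(\mu N\int_{0}^{t}R^{N}_{r}\,dr\right). \end{cases}$$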

Remark 2.1.

One may think that it would be more natural to decide that births happen at rate $\mu$ times the total population. The total population process would then be a critical branching process, which would go extinct in finite time a.s., which we do not want. Next it might seem more natural to replace in the infection rate the ratio $\mathcal{S}^{N}_{t}/N$ by $\mathcal{S}^{N}_{t}/(\mathcal{S}^{N}_{t}+\mathcal{I}^{N}_{t}+\mathcal{R}^{N}_{t})$, which is the actual ratio of susceptibles in the population at time $t$ (calligraphic letters denoting numbers of individuals, as in section 2.1). It is easy to show that $\mathcal{S}^{N}_{t}+\mathcal{I}^{N}_{t}+\mathcal{R}^{N}_{t}$ is close to $N$, so we choose the simplest formulation.

Again, we can reduce these models to two–dimensional models for $z(t)=(i(t),s(t))$ (resp. for $Z^{N}_{t}=(I^{N}_{t},S^{N}_{t})$ ), by deleting the $r$ (resp. $R^{N}$ ) component.

3 The stochastic model, LLN, CLT and LD

3.1 The stochastic model

The three above stochastic models are of the following form.

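$$Z^{N}_{t}=z_{N}+\frac{1}{N}\sum_{j=1}^{k}h_{j}P_{j}\left(N\int_{0}^{t}\beta_{j}(Z^{N}_{s})\,ds\right)=z_{N}+\int_{0}^{t}b(Z^{N}_{s})\,ds+\frac{1}{N}\sum_{j=1}^{k}h_{j}M_{j}\left(N\int_{0}^{t}\beta_{j}(Z^{N}_{s})\,ds\right),\tag{3.1}$$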

where $\{P_{j}(t),\,t\geq 0\}_{1\leq j\leq k}$ are mutually independent standard Poisson processes, $M_{j}(t)=P_{j}(t)-t$, and $b(z)=\sum_{j=1}^{k}\beta_{j}(z)h_{j}$. $Z^{N}_{t}$ takes its values in ${\rm I\hskip-2.0ptR}^{d}$.

In the case of the SIS model, $d=1$ , $k=2$ , $h_{1}=1$ , $\beta_{1}(z)=\lambda z(1-z)$ , $h_{2}=-1$ and $\beta_{2}(z)=\gamma z$ .

In the case of the SIRS model, $d=2$ , $k=3$ , $h_{1}=\begin{pmatrix}1\\ -1\end{pmatrix}$ , $\beta_{1}(z)=\lambda z_{1}z_{2}$ , $h_{2}=\begin{pmatrix}-1\\ 0\end{pmatrix}$ , $\beta_{2}(z)=\gamma z_{1}$ and $h_{3}=\begin{pmatrix}0\\ 1\end{pmatrix}$ , $\beta_{3}(z)=\rho(1-z_{1}-z_{2})$ .

In the case of the SIR model with demography, we can restrict ourselves to $d=2$ , while $k=4$ , $h_{1}=\begin{pmatrix}1\\ -1\end{pmatrix}$ , $\beta_{1}(z)=\lambda z_{1}z_{2}$ , $h_{2}=\begin{pmatrix}-1\\ 0\end{pmatrix}$ , $\beta_{2}(z)=(\gamma+\mu)z_{1}$ , $h_{3}=\begin{pmatrix}0\\ 1\end{pmatrix}$ , $\beta_{3}(z)=\mu$ , $h_{4}=\begin{pmatrix}0\\ -1\end{pmatrix}$ , $\beta_{4}(z)=\mu z_{2}$ .
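To make the general formulation concrete, the following Python sketch encodes the jump directions $h_{j}$ and rates $\beta_{j}$ listed above for the three models and evaluates the drift $b(z)=\sum_{j}\beta_{j}(z)h_{j}$. The helper names (`drift`, `sis_jumps`, `sirs_jumps`, `sir_demography_jumps`) and the numerical parameter values are assumptions introduced for illustration.

```python
def drift(z, jumps):
    """Return b(z) = sum_j beta_j(z) * h_j for a list of (h_j, beta_j) pairs."""
    dim = len(jumps[0][0])
    b = [0.0] * dim
    for h, beta in jumps:
        rate = beta(z)
        for i in range(dim):
            b[i] += rate * h[i]
    return b


def sis_jumps(lam, gamma):
    # d = 1, k = 2, z = (i,)
    return [((1,), lambda z: lam * z[0] * (1.0 - z[0])),
            ((-1,), lambda z: gamma * z[0])]


def sirs_jumps(lam, gamma, rho):
    # d = 2, k = 3, z = (i, s)
    return [((1, -1), lambda z: lam * z[0] * z[1]),
            ((-1, 0), lambda z: gamma * z[0]),
            ((0, 1), lambda z: rho * (1.0 - z[0] - z[1]))]


def sir_demography_jumps(lam, gamma, mu):
    # d = 2, k = 4, z = (i, s)
    return [((1, -1), lambda z: lam * z[0] * z[1]),
            ((-1, 0), lambda z: (gamma + mu) * z[0]),
            ((0, 1), lambda z: mu),
            ((0, -1), lambda z: mu * z[1])]


# Illustrative (hypothetical) parameters.
lam, gamma, mu, rho = 3.0, 1.0, 0.5, 0.2

# The drift vanishes at the SIS endemic equilibrium z* = 1 - gamma/lam.
b_sis = drift((1.0 - gamma / lam,), sis_jumps(lam, gamma))

# Same check at the endemic equilibrium of the SIR model with demography.
z_sir = (mu / (gamma + mu) - mu / lam, (gamma + mu) / lam)
b_sir = drift(z_sir, sir_demography_jumps(lam, gamma, mu))
print(b_sis, b_sir)
```

Checking that the drift vanishes at the endemic equilibrium is a quick way to detect a sign error in the list of pairs $(h_{j},\beta_{j})$.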

While the above expression has the advantage of being concise, we shall rather use the following equivalent formulation of (3.1). Let $\{{\mathcal{M}}_{j},\,1\leq j\leq k\}$ be mutually independent Poisson random measures on ${\rm I\hskip-2.0ptR}_{+}^{2}$ with mean measure the Lebesgue measure, and let $\overline{\mathcal{M}}_{j}(ds,du)={\mathcal{M}}_{j}(ds,du)-ds\,du$, $1\leq j\leq k$. We can rewrite (3.1) in the form

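$$Z^{N}_{t}=z_{N}+\frac{1}{N}\sum_{j=1}^{k}h_{j}\int_{0}^{t}\int_{0}^{N\beta_{j}(Z^{N}_{s})}\mathcal{M}_{j}(ds,du)=z_{N}+\int_{0}^{t}b(Z^{N}_{s})\,ds+\frac{1}{N}\sum_{j=1}^{k}h_{j}\int_{0}^{t}\int_{0}^{N\beta_{j}(Z^{N}_{s})}\overline{\mathcal{M}}_{j}(ds,du),\tag{3.2}$$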

The joint law of $\{Z^{N},\,N\geq 1\}$, viewed as a sequence of random elements of the Skorohod space $D([0,T];{\rm I\hskip-2.0ptR}^{d})$, is the same whether we use (3.1) or (3.2) for its definition.

Let us state the assumptions which we will need in section 4 below. Those are more than necessary for the results of the present section to hold, see [1] for the proofs.

[TABLE]

Remark 3.1.

In practice, in our models, either the process $Z^{N}_{t}$ takes its values in a compact subset of ${\rm I\hskip-2.0ptR}^{d}$ (this is the case for all models with a constant population size), or else we restrict ourselves to such a situation, by stopping the process when the total population exceeds a given large value, see section 4.2.7 in [1].

Concerning the initial condition, we assume that for some $z\in[0,1]^{d}$ , $z_{N}=[Nz]/N$ , where $[Nz]\in{\mathbb{Z}}^{d}_{+}$ is the vector whose $i$ –th component is the integer part of the real number $Nz^{i}$ .

3.2 Law of Large Numbers

We have a Law of Large Numbers

Theorem 3.2.

Let $Z^{N}_{t}$ denote the solution of the SDE (3.1). Then $Z^{N}_{t}\to z_{t}$ a.s. locally uniformly in $t$ , where $\{z_{t},\,t\geq 0\}$ is the unique solution of the ODE

$$\frac{dz_{t}}{dt}=b(t,z_{t}),\quad z_{0}=x.$$

The main argument in the proof of the above theorem is the fact that, locally uniformly in $t$ ,

$$\frac{P(Nt)}{N}\to t\quad\text{a.s. as }N\to\infty.$$
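The convergence of $P(Nt)/N$ to $t$ can be observed numerically. The sketch below simulates a standard Poisson process by summing independent exponential waiting times and prints the ratio for increasing $N$. The helper `poisson_count`, the seed and the chosen values of $N$ and $t$ are illustrative assumptions.

```python
import random


def poisson_count(horizon, rng):
    """Number of points of a rate-1 Poisson process in [0, horizon]."""
    count = 0
    t = rng.expovariate(1.0)       # first arrival time
    while t <= horizon:
        count += 1
        t += rng.expovariate(1.0)  # next inter-arrival time
    return count


rng = random.Random(0)
t = 2.0
ratios = {N: poisson_count(N * t, rng) / N for N in (100, 10_000, 200_000)}
for N, ratio in ratios.items():
    print(f"N = {N:>7}: P(Nt)/N = {ratio:.4f}  (t = {t})")
```

The standard deviation of $P(Nt)/N$ is $\sqrt{t/N}$, so the error shrinks like $N^{-1/2}$, which is the scale of the Central Limit Theorem of the next subsection.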

3.3 Central Limit Theorem

We also have a Central Limit Theorem. Let $U^{N}_{t}:=\sqrt{N}(Z^{N}_{t}-z(t))$ .

Theorem 3.3.

As $N\to\infty$ , $\{U^{N}_{t},\,t\geq 0\}\Rightarrow\{U_{t},\,t\geq 0\}$ for the topology of locally uniform convergence, where $\{U_{t},\,t\geq 0\}$ is a Gaussian process of the form

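$$U_{t}=\int_{0}^{t}\nabla_{x}b(s,z_{s})U_{s}\,ds+\sum_{j=1}^{k}h_{j}\int_{0}^{t}\sqrt{\beta_{j}(s,z_{s})}\,dB_{j}(s),\quad t\geq 0,$$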

where $\{(B_{1}(t),B_{2}(t),\ldots,B_{k}(t)),\,t\geq 0\}$ are mutually independent standard Brownian motions.
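As a worked example, consider the SIS model started at the endemic equilibrium, so that $z_{t}\equiv z^{\ast}=1-\gamma/\lambda$. Assuming, as in the standard Central Limit Theorem for such Poisson driven models, that the diffusion coefficients of the limit are $\sqrt{\beta_{j}}$, the limit $U$ is an Ornstein–Uhlenbeck process whose stationary variance can be computed by hand. This computation is an illustration added here and is not taken from the paper.

```latex
% SIS model with z_t \equiv z^{\ast} = 1-\gamma/\lambda, b(z) = \lambda z(1-z) - \gamma z.
\begin{align*}
b'(z^{\ast}) &= \lambda(1-2z^{\ast})-\gamma = -(\lambda-\gamma), \qquad
\beta_{1}(z^{\ast}) = \beta_{2}(z^{\ast}) = \gamma z^{\ast},\\
dU_{t} &= -(\lambda-\gamma)\,U_{t}\,dt+\sqrt{\gamma z^{\ast}}\,\big(dB_{1}(t)-dB_{2}(t)\big),\\
\lim_{t\to\infty}\operatorname{Var}(U_{t}) &= \frac{2\gamma z^{\ast}}{2(\lambda-\gamma)} = \frac{\gamma}{\lambda}.
\end{align*}
```

For large $N$ and $t$, the proportion of infectious individuals therefore fluctuates around $z^{\ast}$ with a standard deviation of order $\sqrt{\gamma/(\lambda N)}$.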

3.4 Large Deviations, and extinction of an epidemic

We denote by $\mathcal{AC}_{T,d}$ the set of absolutely continuous functions from $[0,T]$ into ${\rm I\hskip-2.0ptR}^{d}$ . For any $\phi\in\mathcal{AC}_{T,d}$ , let $\mathcal{A}_{k}(\phi)$ denote the (possibly empty) set of functions $c\in L^{1}(0,T;{\rm I\hskip-2.0ptR}^{k}_{+})$ such that $c_{j}(t)=0$ a.e. on the set $\{t,\,\beta_{j}(\phi_{t})=0\}$ and

$$\frac{d\phi_{t}}{dt}=\sum_{j=1}^{k}c_{j}(t)h_{j},\quad t\ \text{a.e.}$$

We define the rate function

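$$I_{T}(\phi):=\begin{cases}\inf_{c\in\mathcal{A}_{k}(\phi)}I_{T}(\phi\mid c), & \text{if }\phi\in\mathcal{AC}_{T,d};\\ \infty, & \text{otherwise,}\end{cases}$$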

where as usual the infimum over an empty set is $+\infty$ , and

$$I_{T}(\phi\mid c)=\int_{0}^{T}\sum_{j=1}^{k}g(c_{j}(t),\beta_{j}(\phi_{t}))\,dt$$

with $g(\nu,\omega)=\nu\log(\nu/\omega)-\nu+\omega$ . We assume in the definition of $g(\nu,\omega)$ that for all $\nu>0$ , $\log(\nu/0)=\infty$ and $0\log(0/0)=0\log(0)=0$ . The collection $Z^{N}$ obeys a Large Deviations Principle, in the sense that
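The conventions in the definition of $g$ are easy to get wrong in an implementation. The following Python sketch implements $g(\nu,\omega)=\nu\log(\nu/\omega)-\nu+\omega$ with those conventions, together with a Riemann-sum approximation of $I_{T}(\phi\mid c)$ on a uniform time grid. The function names and the discretization are assumptions made for illustration, not the paper's numerical method.

```python
import math


def g(nu, omega):
    """g(nu, omega) = nu*log(nu/omega) - nu + omega, with the conventions
    0*log(0/0) = 0*log(0) = 0 and log(nu/0) = +infinity for nu > 0."""
    if nu < 0.0 or omega < 0.0:
        raise ValueError("nu and omega must be nonnegative")
    if nu == 0.0:
        return omega                 # the term nu*log(nu/omega) vanishes
    if omega == 0.0:
        return math.inf              # a jump that has zero rate is impossible
    return nu * math.log(nu / omega) - nu + omega


def path_cost(phi, c, betas, dt):
    """Riemann-sum approximation of I_T(phi | c).

    phi   : list of states phi_t on a uniform time grid with step dt
    c     : list of k-tuples (c_1(t), ..., c_k(t)) on the same grid
    betas : list of the k rate functions beta_j
    """
    total = 0.0
    for state, controls in zip(phi, c):
        for c_j, beta_j in zip(controls, betas):
            total += g(c_j, beta_j(state))
    return dt * total


# SIS example (hypothetical parameters): staying at the equilibrium z* with
# c_j = beta_j(z*) follows the law of large numbers path and costs nothing.
lam, gamma = 2.0, 1.0
z_star = 1.0 - gamma / lam
betas = [lambda z: lam * z * (1.0 - z), lambda z: gamma * z]
steps, dt = 100, 0.01
free = path_cost([z_star] * steps, [(0.5, 0.5)] * steps, betas, dt)
forced = path_cost([z_star] * steps, [(1.0, 1.0)] * steps, betas, dt)
print(free, forced)
```

Doubling both jump intensities leaves the path unchanged, since $c_{1}h_{1}+c_{2}h_{2}=0$ in both cases, but it has a strictly positive cost. This is why the rate function takes an infimum over all admissible $c$.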

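Theorem 3.4.

For any open subset $O\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$,

$$\liminf_{N\to\infty}\frac{1}{N}\log\mathbb{P}(Z^{N,z_{N}}\in O)\geq-I_{T,z}(O).$$

For any closed subset $F\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$,

$$\limsup_{N\to\infty}\frac{1}{N}\log\mathbb{P}(Z^{N,z_{N}}\in F)\leq-I_{T,z}(F).$$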

A slight reinforcement of this theorem allows us to conclude a Wentzell–Freidlin type of result. In what follows, we assume that the first component of $Z^{N}_{t}$ (resp. of $z(t)$ ) is $I^{N}_{t}$ (resp. $i(t)$ ). Assume that the deterministic ODE which appears in Theorem 3.2 has a unique stable equilibrium $z^{\ast}$ whose first component satisfies $z^{\ast}_{1}>0$ . We define

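$$\overline{V}:=\inf_{T>0}\ \inf_{\phi\in\mathcal{AC}_{T,d},\ \phi(0)=z^{\ast},\ \phi_{1}(T)=0}I_{T}(\phi).$$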

Let now

$$T^{N,z}_{\text{Ext}}=\inf\{t>0,\ Z^{N}_{1}(t)=0,\ \text{if }Z^{N}(0)=z_{N}\}.$$

We have the

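Theorem 3.5.

Given any $\eta>0$, for any $z$ with $z_{1}>0$,

$$\lim_{N\to\infty}\mathbb{P}\big(\exp\{N(\overline{V}-\eta)\}<T^{N,z}_{\text{Ext}}<\exp\{N(\overline{V}+\eta)\}\big)=1.$$

Moreover, for all $\eta>0$ and $N$ large enough,

$$\exp\{N(\overline{V}-\eta)\}\leq\mathbb{E}(T^{N,z}_{\text{Ext}})\leq\exp\{N(\overline{V}+\eta)\}.$$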

We refer for the proof of this Theorem to [5] and [1].

It is important to evaluate the quantity $\overline{V}$ . Note that it is the value function of an optimal control problem. In case of the SIS model, which is one dimensional, one can solve this control problem explicitly with the help of Pontryagin’s maximum principle, see [8], and deduce in that case that $\overline{V}=\log\frac{\lambda}{\gamma}-1+\frac{\gamma}{\lambda}$ . For other models, one can compute numerically a good approximation of the value of $\overline{V}$ for each given value of the parameters.
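The closed form for the SIS model can be cross-checked numerically. The sketch below compares $\log\frac{\lambda}{\gamma}-1+\frac{\gamma}{\lambda}$ with a midpoint-rule evaluation of $\int_{0}^{z^{\ast}}\log\big(\beta_{1}(z)/\beta_{2}(z)\big)\,dz$. Using that integral as an expression for $\overline{V}$ is an assumption on our part (it is the classical form for one dimensional birth and death processes and is not stated in the text above); the function names and parameter values are also illustrative.

```python
import math


def v_bar_sis(lam, gamma):
    """Closed form quoted in the text: log(lam/gamma) - 1 + gamma/lam."""
    return math.log(lam / gamma) - 1.0 + gamma / lam


def v_bar_sis_quadrature(lam, gamma, n=100_000):
    """Midpoint rule for the integral over [0, z*] of
    log(beta_1(z) / beta_2(z)) = log(lam * (1 - z) / gamma).
    Assumption: classical one dimensional birth-death expression."""
    z_star = 1.0 - gamma / lam
    h = z_star / n
    return h * sum(math.log(lam * (1.0 - (m + 0.5) * h) / gamma)
                   for m in range(n))


lam, gamma = 2.0, 1.0                     # hypothetical values, lam > gamma
v_closed = v_bar_sis(lam, gamma)
v_numeric = v_bar_sis_quadrature(lam, gamma)
print(f"closed form: {v_closed:.8f}, quadrature: {v_numeric:.8f}")

# Order of magnitude exp(N * V_bar) of the extinction time (Theorem 3.5).
for N in (50, 100, 200):
    print(N, f"{math.exp(N * v_closed):.3e}")
```

The exponential growth in $N$ of the last column is what makes extinction from the endemic state practically unobservable in large populations.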

3.5 CLT and extinction of an epidemic

The discussion of this subsection, which motivates the moderate deviations approach of this paper, is taken from section 4.1 in [1]. Consider the SIR with demography.

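$$\begin{cases} i'(t) = \lambda s(t)i(t)-\gamma i(t)-\mu i(t), \\ s'(t) = \mu-\lambda s(t)i(t)-\mu s(t). \end{cases}$$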

We assume that $\lambda>\gamma+\mu$ , in which case there is a unique stable endemic equilibrium, namely $z^{\ast}=(i^{\ast},s^{\ast})=(\frac{\mu}{\gamma+\mu}-\frac{\mu}{\lambda},\frac{\gamma+\mu}{\lambda})$ . We can study the extinction of an epidemic in the above model using the CLT. We note that the basic reproduction number $R_{0}$ and the expected relative time of a life an individual is infected, $\varepsilon$ , are given by

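$$R_{0}=\frac{\lambda}{\gamma+\mu},\qquad\varepsilon=\frac{1/(\gamma+\mu)}{1/\mu}=\frac{\mu}{\gamma+\mu}.$$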

The rate of recovery $\gamma$ is much larger than the death rate $\mu$ (52 compared to 1/75 per year, for a one week infectious period and 75 year life length) so we use the approximations $R_{0}\approx\lambda/\gamma$ and $\varepsilon\approx\mu/\gamma$. Denote again by $I^{N}_{t}$ the fraction of the population which is infectious in a population of size $N$. The law of large numbers tells us that for $N$ and $t$ large, $I^{N}_{t}$ is close to $i^{\ast}$. The central limit theorem tells us that $\sqrt{N}(I^{N}_{t}-i^{\ast})$ converges to a Gaussian process, whose asymptotic variance can be shown to be well approximated by $R_{0}^{-1}$. This suggests that for large $t$, the number of infectious individuals in the population is approximately Gaussian, with mean $Ni^{\ast}$ and standard deviation $\sqrt{N/R_{0}}$. If $Ni^{\ast}$ and $\sqrt{N/R_{0}}$ are of the same order, i.e. $N$ is of the same order as $\frac{1}{(i^{\ast})^{2}R_{0}}$, it is likely that the fluctuations described by the central limit theorem explain that the epidemic might cease in time of order one. This gives a critical population size roughly of the order of

$$N_{c}\sim\frac{1}{(i^{\ast})^{2}R_{0}}=\frac{1}{\varepsilon^{2}(1-R_{0}^{-1})^{2}R_{0}},$$

in fact probably a bit larger than that.

Consider measles prior to vaccination. In that case it is known that $R_{0}\approx 15$ and $\varepsilon\approx\frac{1/75}{1/(1/52)+1/75}\approx 1/3750$, so we arrive at $N_{c}\sim(3750)^{2}/15$, which is almost $10^{6}$. So, if the population is at most a million (or perhaps a couple of millions), we expect that the disease will go extinct quickly, whereas the disease will become endemic (for a rather long time) in a significantly larger population. This confirms the empirical observation that measles was continuously endemic in the UK whereas it died out quickly in Iceland (and was later reintroduced by infectious people visiting the country).
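The arithmetic behind this estimate can be reproduced with a few lines of Python. The sketch below evaluates $\varepsilon$ and $N_{c}$ from the parameter values quoted in the text, with and without the factor $(1-R_{0}^{-1})^{2}$. The function name `critical_size` and the choice of which variants to print are assumptions made for illustration.

```python
gamma = 52.0        # recovery rate per year (one week infectious period)
mu = 1.0 / 75.0     # death rate per year (75 year life length)
R0 = 15.0           # basic reproduction number of measles quoted in the text

eps_exact = mu / (gamma + mu)    # epsilon = mu / (gamma + mu)
eps_approx = mu / gamma          # approximation valid since gamma >> mu


def critical_size(eps, R0, keep_correction=True):
    """N_c ~ 1 / ((i*)^2 R0), with i* = eps * (1 - 1/R0).

    With keep_correction=False the factor (1 - 1/R0) is dropped, which is
    the simplification N_c ~ 1 / (eps^2 R0) used in the text."""
    i_star = eps * (1.0 - 1.0 / R0) if keep_correction else eps
    return 1.0 / (i_star ** 2 * R0)


n_text = critical_size(1.0 / 3750.0, R0, keep_correction=False)
n_full = critical_size(eps_exact, R0, keep_correction=True)
print(f"1/eps (exact)  = {1.0 / eps_exact:.1f}")
print(f"1/eps (approx) = {1.0 / eps_approx:.1f}")
print(f"N_c, rounded eps and no correction: {n_text:,.0f}")
print(f"N_c, exact eps with correction:     {n_full:,.0f}")
```

Both variants give a critical size of the order of one million, so the conclusion does not depend on the rounding of $\varepsilon$.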

4 Moderate deviations

While the CLT allows us to predict extinction of an endemic disease for population sizes under a given threshold $N_{c}$, and Large Deviations give predictions for arbitrarily large population sizes, it is natural to look at Moderate Deviations, which describe ranges of fluctuations between those of the CLT and those of the LD.

The assumptions $(H.1)$ and $(H.2)$ are assumed to hold throughout this section.

4.1 The set–up and preliminary estimates

We shall use the general model written in the form (3.2). We assume that the limiting law of large numbers ODE

$$z(t)=z+\int_{0}^{t}b(z(s))\,ds$$

has a unique stable equilibrium point $z^{\ast}$ such that $z^{\ast}_{1}>0$ , called the endemic equilibrium, which is such that, provided $z_{1}(0)>0$ , $z(t)\to z^{\ast}$ as $t\to\infty$ .

For the sake of simplifying many formulas below, we change our coordinates, and let $z^{\ast}=0$. The reader should be aware of the fact that there is a price to pay for that translation of the origin. Indeed, since in the original coordinate system, the process $Z^{N}_{t}$ was living on the set of vectors whose coordinates are integer multiples of $N^{-1}$ (this is essential for the process to remain in the set where it makes sense, i.e. for proportions to remain between $0$ and $1$), the new origin generically does not belong to the set of points in ${\rm I\hskip-2.0ptR}^{d}$ which our process $Z^{N}_{t}$ may visit. The grid on which $Z^{N}_{t}$ lives is translated by the vector $z^{\ast}-\{z^{\ast}\}_{N}$, where here and below $\{z\}_{N}:=[Nz]/N$, $[Nz]$ denoting the vector whose $i$–th component is the integer part of the $i$–th component of $Nz$. However, this minor complexity will appear only in the formula for the initial condition of the SDE. Once the SDE starts on the correct grid, the solution remains there.

From now on $0$ will be the endemic equilibrium (of course in the translated coordinate system), while $z^{\ast}\not=0$ will denote that endemic equilibrium in the original coordinates (we shall need it for the formula of the initial condition of the SDE).

We want to study the moderate deviations at scale $\alpha$ of $Z^{N}_{t}$ , where $0<\alpha<1/2$ . Note that $\alpha=0$ would correspond to the large deviations, and $\alpha=1/2$ to the central limit theorem. We shall need below to consider the ODE starting from a point close to $z^{\ast}=0$ , namely we shall consider the function $\{z_{N}(t),\ 0\leq t\leq T\}$ , solution of the ODE

$$z_{N}(t)=N^{-\alpha}z+\int_{0}^{t}b(z_{N}(s))\,ds,$$

where $z\in{\rm I\hskip-2.0ptR}^{d}$ is arbitrary. In fact, we shall be more interested in $\overline{z}_{N}(t):=N^{\alpha}z_{N}(t)$ , which solves (below we exploit the fact that $b(0)=0$ )

[TABLE]

It is not hard to prove that, under our standing assumption $(H.2)$ that $b$ is of class $C^{1}$ and $\nabla b$ is bounded, as $N\to\infty$, $\overline{z}_{N}(t)\to\overline{z}(t)$ uniformly for $0\leq t\leq T$, where $\overline{z}(t)$ solves the linearized ODE near the endemic equilibrium $0$:

[TABLE]
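The passage from the ODE for $z_{N}$ to its rescaled version and to the linearized limit can be sketched as follows. This is a reconstruction from the definitions $\overline{z}_{N}(t)=N^{\alpha}z_{N}(t)$ and $b(0)=0$ given above, and it need not coincide line by line with the displays of the paper.

```latex
% Rescaling, using b(0) = 0 and b(x) = \int_0^1 \nabla b(\theta x)\,d\theta\; x:
\begin{align*}
\overline{z}_{N}(t)
 &= z + N^{\alpha}\int_{0}^{t} b\big(N^{-\alpha}\overline{z}_{N}(s)\big)\,ds \\
 &= z + \int_{0}^{t}\Big(\int_{0}^{1}\nabla b\big(\theta N^{-\alpha}\overline{z}_{N}(s)\big)\,d\theta\Big)\,\overline{z}_{N}(s)\,ds .
\end{align*}
% Letting N \to \infty, \nabla b(\theta N^{-\alpha}\overline{z}_N(s)) \to \nabla b(0), which gives the linearized ODE
\begin{equation*}
\overline{z}(t) = z + \int_{0}^{t}\nabla b(0)\,\overline{z}(s)\,ds .
\end{equation*}
```

The boundedness of $\nabla b$ gives, via Gronwall's Lemma, a bound on $\overline{z}_{N}$ which is uniform in $N$ on $[0,T]$, and the continuity of $\nabla b$ then yields the uniform convergence.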

We want to study the moderate deviations of the process $Z^{N}_{t}$ solution of the SDE (3.1) with the initial condition $z_{N}:=\{z^{\ast}+N^{-\alpha}z\}_{N}-z^{\ast}$. This amounts to studying the large deviations of $Z^{N,\alpha}_{t}:=N^{\alpha}Z^{N}_{t}$ at speed $a_{N}=N^{2\alpha-1}$. We define

[TABLE]

With these notations, the SDE for $Z^{N,\alpha}_{t}$ reads

[TABLE]

If we let $K:=\sup_{z}\|\nabla b(z)\|$ , we have

[TABLE]

This combined with Gronwall’s Lemma yields

[TABLE]

From the boundedness and Lipschitz property of $\nabla b$ , and the formula for $V^{N,\alpha}$ , we deduce that

[TABLE]

We deduce from the last three inequalities

[TABLE]

We now define

[TABLE]

so that

[TABLE]

We will see below that the large deviations of $Z^{N,\alpha}$ will follow from those of $\widetilde{Y}^{N,\alpha}$ by a variant of the contraction principle. We first consider the simpler processes

[TABLE]

which are similar to $Y^{N}$ and $Y^{N,\alpha}$, but with $Z^{N}_{s}$ replaced by $0$.

4.2 The limiting logarithmic moment generating function of $\overline{Y}^{N,\alpha}$

We note that writing the integral over $[0,N\beta_{j}(0)]$ as the sum from $\ell=1$ to $\ell=N$ of integrals over $((\ell-1)\beta_{j}(0),\ell\beta_{j}(0)]$ , we can rewrite $\overline{Y}^{N,\alpha}$ as follows.

[TABLE]

The processes $Q_{1},Q_{2},\ldots,Q_{N}$ are i.i.d., and their law is that of

[TABLE]

Now let $\nu=(\nu_{1},\ldots,\nu_{d})$ be a vector of signed measures on $[0,T]$ .

Lemma 4.1.

As $N\to\infty$ , (recall that $a_{N}=N^{2\alpha-1}$ )

[TABLE]

Proof We use in an essential way the above decomposition of $\overline{Y}^{N,\alpha}$ .

[TABLE]

provided

[TABLE]

which we will check below. From this it follows that the argument of the logarithm on the second-to-last line is greater than or equal to $1$, at least for $N$ large enough, and the final conclusion follows easily from the fact that for any $x\geq 0$, $x-x^{2}/2\leq\log(1+x)\leq x$. Let us now check (4.4). It follows from an exact Taylor formula that

[TABLE]

But $\nu(Q)$ is an affine combination of mutually independent Poisson random variables, so that (4.4) follows easily by an explicit computation. $\square$

4.3 The limiting logarithmic moment generating function of $\widetilde{Y}^{N,\alpha}$

We want to study the large deviations of $\widetilde{Y}^{N,\alpha}$ . The main step will be to prove that Lemma 4.1 remains valid if we replace $\overline{Y}^{N,\alpha}$ by $\widetilde{Y}^{N,\alpha}$ , which will follow from the next Proposition.

Proposition 4.2.

For any $C>0$ , $\nu=(\nu_{1},\ldots,\nu_{d})$ a vector of signed measures, as $N\to\infty$ ,

[TABLE]

Before we establish that Proposition, let us first prove that it yields the desired result.

Proposition 4.3.

Given Lemma 4.1, if Proposition 4.2 holds true, then for any signed measure $\nu$ on $[0,T]$ , as $N\to\infty$ ,

[TABLE]

Proof For any $\delta>0$ , we deduce from Hölder’s inequality

[TABLE]

so that, if we combine Lemma 4.1 and Proposition 4.2, we deduce that

[TABLE]

and letting $\delta\to 0$ , we conclude that

[TABLE]

For the inequality in the other direction, we note that, by similar arguments,

[TABLE]

with $\mu=-\nu$ , which implies that

[TABLE]

hence, letting $\delta\to 0$ we conclude that

[TABLE]

$\square$

The remainder of this subsection will be devoted to the proof of Proposition 4.2.

We note that Proposition 4.2 is a consequence of the following two Propositions.

Proposition 4.4.

For any $C>0$ , as $N\to\infty$ ,

[TABLE]

Proposition 4.5.

For any $C>0$ , as $N\to\infty$ ,

[TABLE]

We start with the

Proof of Proposition 4.4 The exponents in the expressions entering (4.5) are sums over the indices $1\leq i\leq d$ and $1\leq j\leq k$. Using Schwarz’s inequality repeatedly, it is sufficient to prove the results with the sum replaced by each of the summands. Therefore in this proof we proceed as if $d=1$, we fix $1\leq j\leq k$ and for the sake of simplifying the notations, we drop the index $j$. We note that

[TABLE]

It is not hard to see that one can treat each of the two terms on the right separately, and we treat only the first term, the treatment of the second one being quite similar. We note that there exists a compensated standard Poisson process $M(t)$ on ${\rm I\hskip-2.0ptR}_{+}$ such that the factor of $N^{-\alpha}$ in this first term can be rewritten as

[TABLE]

We need to estimate ${{\rm I\hskip-2.0ptE}}\exp[CN^{-\alpha}\nu(W^{N})]$ . If we decompose the signed measure $\nu$ as the difference of two measures as follows $\nu=\nu_{+}-\nu_{-}$ , we again have two terms, and it suffices to treat one of them, say $\nu_{+}$ . Of course it suffices to treat the case where $\nu_{+}\not=0$ . Since the positive constant $C$ is arbitrary, we can w.l.o.g. assume that $\nu_{+}$ is a probability measure on $[0,T]$ . It is then clear that

[TABLE]

We choose a new parameter $0<\gamma<\alpha$ , and we write the expression whose expectation needs to be estimated as a sum of two terms as follows.

[TABLE]

We now estimate the first term on the right hand side of (4.6). For that sake, we define the stopping time

[TABLE]

and note that

[TABLE]

Consequently the expectation of the first term on the right of (4.6) is bounded from above by

[TABLE]

where the first inequality follows from Proposition 5.1 in the Appendix below, and the second one exploits the Lipschitz property of $\beta$ . Consider now the second term on the right hand side of (4.6).

[TABLE]

for some $c,C>0$ , where the second inequality follows from Proposition 5.1 and the boundedness of $\beta$ . Estimating the second factor in the last expression amounts to estimating the two probabilities (with another $c>0$ )

[TABLE]

We estimate the first probability. For any $a>0$ ,

[TABLE]

where the second inequality follows from Proposition 5.1 and the last inequality by optimizing over $a>0$ . One can easily check that a similar result holds for the second line of (4.7), making use of Proposition 5.1 with a negative $a$ . Note also for further use that the same result holds in case $\gamma=0$ . In that case, the probability on the second line of (4.7) is zero for large enough $c$ , so that the announced estimate is of course true.

The expectation of the second term of the right hand side of (4.6) is thus dominated by (with $c_{1}$ and $c_{2}$ two positive constants)

[TABLE]

Finally

[TABLE]

It follows readily from the inequality $\log(a+b)\leq\log(2)+\log(a\vee b)$ that for $N$ large enough

[TABLE]

which establishes (4.5). $\square$

We now turn to the second proof.

Proof of Proposition 4.5 Recalling assumption $(H.1)$ , we now define, with $\overline{\beta}_{j}:=\sup_{z\in{\rm I\hskip-2.0ptR}^{d}}\beta_{j}(z)$ ,

[TABLE]

the event

[TABLE]

and the stopping time

[TABLE]

where the constant $b>0$ will be chosen below, and the constant $b^{\prime}>0$ is arbitrary. From the estimate (4.1),

[TABLE]

We take the limit successively in the two terms of the above right hand side.

Step 1: Estimate of (4.9). We have

[TABLE]

We first note that the arguments used in the proof of (4.8), in the particular case $\gamma=0$ , yield

[TABLE]

for some constant $C>0$ . We next estimate the product

[TABLE]

For the same reason as in the previous proof, we need only consider the case $d=k=1$ . It follows from Proposition 5.1 that the first factor satisfies

[TABLE]

Finally there exist two positive constants $C_{1}$ and $C_{2}$ such that

[TABLE]

for $N$ large enough. So $a_{N}\log$ of the above tends to $0$ as $N\to\infty$ .

Step 2: Estimate of (4.10). We first note that

[TABLE]

The first term on the right tends to $0$ as $N\to\infty$ . It remains to take care of the second term. Since $Y^{N}_{t}$ is a martingale, it is clear that the process

[TABLE]

is a submartingale. Consequently, from Doob’s $L^{2}$ submartingale inequality,

[TABLE]

Consider first the first factor on the right hand side of (4.3). We deduce from the definition of $\bar{\tau}_{b}$ that

[TABLE]

with $c_{T}=\sum_{j=1}^{k}\|h_{j}\|(1+b^{\prime})\overline{\beta}_{j}T$ . Consequently the square of the first factor on the right of (4.3) is bounded from above by

[TABLE]

where we have used Doob’s optional sampling theorem for submartingales. By the same argument as above, we proceed as if $d=1$ , note that

[TABLE]

and exploit Proposition 4.4 in order to conclude concerning $a_{N}\log$ of the first factor on the right of (4.3).

We next note that

[TABLE]

Hence the square of the second term on the right of (4.3) satisfies

[TABLE]

Consider first the second factor on the right of (4.13). We have

[TABLE]

Using the Cauchy–Schwarz inequality several times, we see that it is sufficient to proceed as if we had (dropping the index $j$ )

[TABLE]

with $a=\beta(0)\,T$ and $\xi_{N}=\frac{\theta_{N}-aN}{\sqrt{aN}}$ , where $\theta_{N}\sim\text{Poi}(aN)$ . We now choose $b=a/3$ . We have

[TABLE]

We have proved that the second factor on the right of (4.13) remains bounded, as $N\to\infty$ . We next consider the first factor on the right of (4.13). We first note that

[TABLE]

But from (4.3), ${{\rm I\hskip-2.0ptP}}\left(\bar{\tau}_{b}<T\right)\lesssim e^{-CN}$ .

It follows that the left hand side of (4.13) is bounded from above by a constant times

[TABLE]

where $C_{1}$ and $C_{2}$ are two positive constants. This last expression is bounded by $2$ , as soon as $N$ is large enough. Finally $a_{N}\log$ of the left-hand side of (4.13) tends to $0$ as $N\to\infty$ . $\square$

Remark 4.6.

We note that the full strength of (4.1) is necessary for the proof of Proposition 4.5. Indeed, while $a_{N}\log{{\rm I\hskip-2.0ptE}}\exp\{CN^{1-\alpha}\sup_{0\leq t\leq T}\|Y^{N}_{t}\|\}$ certainly does not converge to $0$ as $N\to\infty$ , clearly with high probability $\|Y^{N}_{t}\|^{2}$ is smaller than $\|Y^{N}_{t}\|$ , but ${{\rm I\hskip-2.0ptE}}\exp\{CN^{1-\alpha}\|Y^{N}_{t}\|^{2}\}=\infty$ .

4.4 Large deviations of $\widetilde{Y}^{N,\alpha}$

We first define the Fenchel–Legendre transform of

[TABLE]

where $Q$ has been defined by (4.3), $\nu=(\nu_{1},\ldots,\nu_{d})$ is a vector of signed measures and $\langle h_{j},\nu\rangle(dt)=\sum_{i=1}^{d}h_{j}^{i}\nu_{i}(dt)$ , $h^{i}_{j}$ being the $i$-th coordinate of the vector $h_{j}$ . We have exploited the fact that $\nu(Q)$ is the sum over $j$ of zero mean mutually independent random variables. For each $\phi\in D([0,T];{\rm I\hskip-2.0ptR}^{d})$ , we define

[TABLE]

The next step will consist in proving that the sequence of processes $\{\widetilde{Y}^{N,\alpha}\}_{N\geq 1}$ satisfies a Large Deviation Principle.

Theorem 4.7.

The sequence $\{\widetilde{Y}^{N,\alpha},\,N\geq 1\}$ satisfies the Large Deviation Principle in $D([0,T];{\rm I\hskip-2.0ptR}^{d})$ equipped with the supnorm topology, with the convex, good rate function $\Lambda^{\ast}$ and with speed $a_{N}$ , in the sense that for any Borel subset $\Gamma\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ ,

[TABLE]

Since there is a difficulty with having a topology on $D([0,T];{\rm I\hskip-2.0ptR}^{d})$ which makes it a topological vector space, and allows for a simple characterization of the class of compact sets, we shall use a small detour for the proof of the above Theorem. Recall that

[TABLE]

where $Y^{N,\alpha}_{t}$ is piecewise constant, with jumps of size $h_{j}N^{\alpha-1}$ . Let $Y^{N,\alpha,c}_{t}$ denote the continuous piecewise linear approximation of $Y^{N,\alpha}_{t}$ , which is defined as follows. Let $0=\tau^{N}_{0}<\tau^{N}_{1}<\tau^{N}_{2}<\cdots$ denote the successive jump times of the process $Y^{N,\alpha}_{t}$ . For $i\geq 0$ , on the interval $[\tau^{N}_{i},\tau^{N}_{i+1}]$ ,

[TABLE]
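The interpolation step can be sketched in a few lines of Python. This is only an illustration, assuming that the continuous approximation joins the values of the path at successive jump times by straight segments; the function names and the toy path are hypothetical and not taken from the paper. The sketch checks that the interpolant stays within one jump size of the original step path, which is the property that makes the two processes close in supnorm.

```python
from bisect import bisect_right

def piecewise_constant(times, values, t):
    """Value at time t of the step path equal to values[i] on [times[i], times[i+1])."""
    i = bisect_right(times, t) - 1
    return values[i]

def piecewise_linear(times, values, t):
    """Linear interpolation between (times[i], values[i]) and (times[i+1], values[i+1])."""
    i = bisect_right(times, t) - 1
    if i >= len(times) - 1:          # at or beyond the last jump: stay constant
        return values[-1]
    w = (t - times[i]) / (times[i + 1] - times[i])
    return (1 - w) * values[i] + w * values[i + 1]

# Toy path (hypothetical): jumps of size +/- h, where h plays the role of N^(alpha-1).
h = 0.01
times = [0.0, 0.3, 0.45, 0.9]
values = [0.0, h, 2 * h, h]

# Largest gap between the step path and its interpolant on a fine grid.
grid = [k / 1000 for k in range(901)]
gap = max(abs(piecewise_linear(times, values, t) - piecewise_constant(times, values, t))
          for t in grid)
```

Since every jump has size `h`, the gap is at most `h`, and it shrinks to zero as the jump size does.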

Next we define

[TABLE]

We note that

[TABLE]

hence for any $\delta>0$ , for $N$ large enough,

[TABLE]

This implies clearly

Lemma 4.8.

The two sequences $\{\widetilde{Y}^{N,\alpha}\}_{N\geq 1}$ and $\Big{\{}\widetilde{\widetilde{Y}}^{N,\alpha}\Big{\}}_{N\geq 1}$ are exponentially equivalent in $D([0,T];{\rm I\hskip-2.0ptR}^{d})$ , equipped with the supnorm topology, in the sense that for each $\delta>0$ ,

[TABLE]

We shall prove below the following.

Proposition 4.9.

The sequence $\Big{\{}\widetilde{\widetilde{Y}}^{N,\alpha}\Big{\}}_{N\geq 1}$ is exponentially tight in $C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ , the space of continuous functions from $[0,T]$ into ${\rm I\hskip-2.0ptR}^{d}$ , which start from $0$ at $t=0$ , in the sense that for any $R>0$ , there exists a compact subset $K_{R}\subset\subset C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ such that

[TABLE]

Let us now turn to the proof of the above Theorem.

Proof of Theorem 4.7 From (4.14), we deduce that

[TABLE]

as $N\to\infty$ . Consequently, again by the argument of Proposition 4.3, we deduce from that same Proposition that for any signed measure $\nu$ on $[0,T]$ , as $N\to\infty$ ,

[TABLE]

This, together with Proposition 4.9, allows us to apply Corollary 4.6.14 from [2], to conclude that the sequence $\Big{\{}\widetilde{\widetilde{Y}}^{N,\alpha}\Big{\}}_{N\geq 1}$ satisfies a LDP in $C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ with the good rate function $\Lambda^{\ast}$ , and speed $a_{N}$ . Since $C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ is closed in $D([0,T];{\rm I\hskip-2.0ptR}^{d})$ equipped with the supnorm topology, it follows from Lemma 4.1.5 in [2] that the same LDP holds in the latter space, with the same rate function $\Lambda^{\ast}$ , extended to that space by $\Lambda^{\ast}(\phi)=+\infty$ for $\phi\in D([0,T];{\rm I\hskip-2.0ptR}^{d})\backslash C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ . The result now follows from Lemma 4.8, in view of Theorem 4.2.13 from [2]. $\square$

We now turn to the

Proof of Proposition 4.9 Clearly it suffices to prove both that

[TABLE]

and that the sequence $\{Y^{N,\alpha,c}\}_{N\geq 1}$ is exponentially tight in $C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ . Let us first establish (4.15). It follows from (4.1) that

[TABLE]

Consequently, if $R>2C(\|z\|+1)$ , with $R^{\prime}=(2C)^{-1}R$ ,

[TABLE]

It follows from Doob’s submartingale inequality and a combination of Lemma 4.1 and Proposition 4.4 that the $\limsup$ as $N\to\infty$ of the second term of the last right hand side is finite. (4.15) clearly follows.

It remains to consider $Y^{N,\alpha,c}$ . Define the modulus of continuity of an element $x\in C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ as $w_{x}(\delta)=\sup_{0\leq s,t\leq T,|s-t|\leq\delta}\|x(t)-x(s)\|$ . It follows from Ascoli’s theorem that for any sequence $\{\delta_{\ell},\ell\geq 1\}$ of positive numbers, the following is a compact subset of $C_{0}([0,T];{\rm I\hskip-2.0ptR}^{d})$ :

[TABLE]
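As a concrete illustration of the modulus of continuity $w_{x}(\delta)$ defined above, the following Python sketch evaluates it for a scalar path sampled on a uniform grid. The restriction to $d=1$ and to grid points, and the function name, are simplifying assumptions made here for illustration only.

```python
def modulus_of_continuity(ts, xs, delta):
    """w_x(delta) = sup over |s - t| <= delta of |x(t) - x(s)|, restricted to grid points."""
    w = 0.0
    n = len(ts)
    for i in range(n):
        j = i
        # Scan forward over all grid points within distance delta of ts[i].
        while j < n and ts[j] - ts[i] <= delta + 1e-12:
            w = max(w, abs(xs[j] - xs[i]))
            j += 1
    return w

ts = [k / 100 for k in range(101)]
xs = [t * t for t in ts]            # x(t) = t^2 on [0, 1]

w_small = modulus_of_continuity(ts, xs, 0.05)
w_large = modulus_of_continuity(ts, xs, 0.10)
```

For $x(t)=t^{2}$ on $[0,1]$ one has $w_{x}(\delta)=2\delta-\delta^{2}$, so the equicontinuity constraints defining the compact set amount to requiring $w_{x}(\delta_{\ell})$ to be small along the chosen sequence $\delta_{\ell}$.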

Suppose that for each $\ell\geq 1$ , $R>0$ , we can find $\delta_{R,\ell}>0$ such that for all $N\geq 1$ ,

[TABLE]

From this we deduce that

[TABLE]

so that

[TABLE]

from which the result follows. A sufficient condition for (4.16) to be true is that for any $b>0$ ,

[TABLE]

In turn a sufficient condition for this is that

[TABLE]

which we now prove. It is not hard to see that

[TABLE]

where we have used Doob’s submartingale inequality at the last step. Clearly

[TABLE]

Using the Cauchy–Schwarz inequality repeatedly, we see that it suffices to estimate for each $j$

[TABLE]

where $\bar{\beta}_{j}=\sup_{z}\beta_{j}(z)$ , we have used Proposition 5.1 and the inequality $e^{x}-1-x\leq x^{2}$ , valid for $x\leq\log(2)$ , which we have applied with $x=2CN^{-\alpha}\delta^{-1/2}$ and $x=-2CN^{-\alpha}\delta^{-1/2}$ (recall that we will first let $N\to\infty$ ). Putting together the last estimates yields

[TABLE]

(4.17) follows, and the Proposition is proved. $\square$

4.5 Computation of the rate function $\Lambda^{\ast}$

Let us compute $\Lambda^{\ast}$ in the three examples which we discussed above in section 2. Here we do not translate $z^{\ast}$ to the origin.

4.5.1 Computation of $\Lambda^{\ast}$ for the SIS model

Recall that in this case $d=1$ , $k=2$ , $h_{1}=1$ , $\beta_{1}(z)=\lambda z(1-z)$ , $h_{2}=-1$ , $\beta_{2}(z)=\gamma z$ . If $\lambda>\gamma$ , there is a unique stable endemic equilibrium $z^{\ast}=1-\gamma/\lambda$ . We first compute

[TABLE]

where

[TABLE]

It is easy to check that ${{\rm I\hskip-2.0ptE}}[Q(t)Q(s)]=\sigma^{2}(z^{\ast})\,{s\wedge t}$ , where

[TABLE]

Consequently

[TABLE]

We now need to compute $\Lambda^{\ast}(\phi)$ in case $\phi\in C^{2}([0,T])$ . We should take the supremum over the signed measures $\nu$ on $[0,T]$ of the quantity

[TABLE]

The supremum is achieved at the signed measure $\nu$ which makes the gradient with respect to $\nu$ of the above zero, if any. We first note that for such a $\nu$ to exist, we need that $\phi(0)=0$ , unless $\Lambda^{\ast}(\phi)=+\infty$ . Now the optimal $\nu$ must satisfy

[TABLE]

So necessarily

[TABLE]

Substituting this signed measure $\nu$ in the above formula, we obtain that

[TABLE]

Consequently

[TABLE]
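The constants of the SIS computation can be checked numerically. The Python sketch below uses illustrative rates $\lambda=2$ , $\gamma=1$ (not from the paper) and assumes that $\sigma^{2}(z^{\ast})=\beta_{1}(z^{\ast})+\beta_{2}(z^{\ast})$ and that, for smooth $\phi$ with $\phi(0)=0$ , the rate function has the usual quadratic form $\frac{1}{2\sigma^{2}(z^{\ast})}\int_{0}^{T}|\phi^{\prime}(t)|^{2}dt$ associated with the covariance $\sigma^{2}(z^{\ast})\,s\wedge t$ . Both are assumptions of this sketch, to be compared with the displayed formulas.

```python
lam, gam = 2.0, 1.0                      # illustrative rates with lam > gam

z_star = 1.0 - gam / lam                 # endemic equilibrium of the SIS model
beta1 = lam * z_star * (1.0 - z_star)    # infection rate at z_star
beta2 = gam * z_star                     # recovery rate at z_star
sigma2 = beta1 + beta2                   # assumed: sum over j of h_j^2 * beta_j(z_star)

def rate(phi, T, n=10_000):
    """Riemann-sum approximation of (1 / (2 sigma2)) * int_0^T phi'(t)^2 dt."""
    dt = T / n
    total = 0.0
    for k in range(n):
        dphi = (phi((k + 1) * dt) - phi(k * dt)) / dt
        total += dphi * dphi * dt
    return total / (2.0 * sigma2)

cost_linear = rate(lambda t: 0.3 * t, T=1.0)   # straight line from 0 to 0.3
```

At the equilibrium the two rates coincide, so under this assumption $\sigma^{2}(z^{\ast})=2\gamma(\lambda-\gamma)/\lambda$ .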

4.5.2 Computation of $\Lambda^{\ast}$ for the SIRS model

In this model, $d=2$ and $k=3$ . We have $h_{1}=\binom{1}{-1}$ , $\beta_{1}(z)=\lambda z_{1}z_{2}$ , $h_{2}=\binom{-1}{0}$ , $\beta_{2}(z)=\gamma z_{1}$ , and $h_{3}=\binom{0}{1}$ , $\beta_{3}(z)=\rho(1-z_{1}-z_{2})$ . In the case $\lambda>\gamma$ , there is a unique stable endemic equilibrium, namely $z^{\ast}=\binom{\frac{\rho}{\gamma+\rho}\left(1-\frac{\gamma}{\lambda}\right)}{\frac{\gamma}{\lambda}}$ . In order to simplify the notations, we shall write $a=\beta_{1}(z^{\ast})$ , $b=\beta_{2}(z^{\ast})$ , $c=\beta_{3}(z^{\ast})$ and $A=ab+ac+bc$ . We have

[TABLE]

The functional to be maximized with respect to $\nu$ is

[TABLE]

Writing that the gradient w.r.t. $\nu_{1}$ and $\nu_{2}$ of this functional is zero leads to the identities

[TABLE]

This implies the identities

[TABLE]

Finally we deduce that $\Lambda^{\ast}(\phi)$ is $+\infty$ unless $\phi$ is absolutely continuous and $\phi(0)=0$ , in which case

[TABLE]

4.5.3 Computation of $\Lambda^{\ast}$ for the SIR model with demography

In this case, $d=2$ , $k=4$ , $h_{1}=\binom{1}{-1}$ , $\beta_{1}(z)=\lambda z_{1}z_{2}$ , $h_{2}=\binom{-1}{0}$ , $\beta_{2}(z)=(\gamma+\mu)z_{1}$ , $h_{3}=\binom{0}{1}$ , $\beta_{3}(z)=\mu$ and $h_{4}=\binom{0}{-1}$ , $\beta_{4}(z)=\mu z_{2}$ . In the case $\lambda>\gamma+\mu$ , there is a unique stable endemic equilibrium, namely $z^{\ast}=\binom{\mu\left(\frac{1}{\gamma+\mu}-\frac{1}{\lambda}\right)}{\frac{\gamma+\mu}{\lambda}}$ . We shall use the notations $a=\beta_{1}(z^{\ast})$ , $b=\beta_{2}(z^{\ast})$ , $c=\beta_{3}(z^{\ast})+\beta_{4}(z^{\ast})$ and $A=ab+ac+bc$ . We have

[TABLE]

Formally the functional $\Lambda(\nu)$ has exactly the same form as in the case of the SIRS model, only the constants have different values. The same computations as in the previous subsection lead to the same result, namely that $\Lambda^{\ast}(\phi)$ is $+\infty$ unless $\phi$ is absolutely continuous and $\phi(0)=0$ , in which case

[TABLE]
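The constant $A=ab+ac+bc$ appearing in the two-dimensional models has a simple algebraic interpretation that can be checked numerically: it equals the determinant of the matrix $\sum_{j}\beta_{j}(z^{\ast})h_{j}h_{j}^{T}$ built from the jump directions and rates listed above, which is the natural candidate for the covariance matrix of $Q$ . The Python sketch below, with illustrative parameter values chosen here, verifies this identity for the SIRS model and the SIR model with demography, and also checks that the stated $z^{\ast}$ is a zero of the drift $\sum_{j}h_{j}\beta_{j}(z)$ .

```python
def drift_and_cov(hs, betas, z):
    """Return b(z) = sum_j h_j beta_j(z) and S = sum_j beta_j(z) h_j h_j^T for d = 2."""
    b = [0.0, 0.0]
    S = [[0.0, 0.0], [0.0, 0.0]]
    for h, beta in zip(hs, betas):
        r = beta(z)
        for i in range(2):
            b[i] += h[i] * r
            for l in range(2):
                S[i][l] += r * h[i] * h[l]
    return b, S

def det2(S):
    """Determinant of a 2 x 2 matrix given as nested lists."""
    return S[0][0] * S[1][1] - S[0][1] * S[1][0]

lam, gam, rho, mu = 3.0, 1.0, 0.5, 0.2   # illustrative values, not from the paper

# SIRS model, with z = (z1, z2) as in the text.
hs_sirs = [(1, -1), (-1, 0), (0, 1)]
betas_sirs = [lambda z: lam * z[0] * z[1],
              lambda z: gam * z[0],
              lambda z: rho * (1 - z[0] - z[1])]
z_sirs = (rho / (gam + rho) * (1 - gam / lam), gam / lam)
b_sirs, S_sirs = drift_and_cov(hs_sirs, betas_sirs, z_sirs)
a, b, c = (beta(z_sirs) for beta in betas_sirs)
A_sirs = a * b + a * c + b * c

# SIR model with demography; here c = beta_3 + beta_4.
hs_sir = [(1, -1), (-1, 0), (0, 1), (0, -1)]
betas_sir = [lambda z: lam * z[0] * z[1],
             lambda z: (gam + mu) * z[0],
             lambda z: mu,
             lambda z: mu * z[1]]
z_sir = (mu * (1 / (gam + mu) - 1 / lam), (gam + mu) / lam)
b_sir, S_sir = drift_and_cov(hs_sir, betas_sir, z_sir)
r = [beta(z_sir) for beta in betas_sir]
A_sir = r[0] * r[1] + r[0] * (r[2] + r[3]) + r[1] * (r[2] + r[3])
```

In both models the matrix has the form $\begin{pmatrix}a+b&-a\\ -a&a+c\end{pmatrix}$ , whose determinant is $(a+b)(a+c)-a^{2}=A$ , which explains why the two computations are formally identical.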

4.6 Moderate deviations of $Z^{N}$

We again equip $D([0,T];{\rm I\hskip-2.0ptR}^{d})$ with the supnorm topology. For $z\in{\rm I\hskip-2.0ptR}^{d}$ , let $F_{z}:D([0,T];{\rm I\hskip-2.0ptR}^{d})\to D([0,T];{\rm I\hskip-2.0ptR}^{d})$ be the continuous map which to $x$ associates the solution $y$ of the ODE

[TABLE]

and, for each $N\geq 1$ , let $F_{z,N}:D([0,T];{\rm I\hskip-2.0ptR}^{d})\to D([0,T];{\rm I\hskip-2.0ptR}^{d})$ be the continuous map which to $x$ associates the solution $y_{N}$ of the ODE

[TABLE]

We have

[TABLE]

which converges to $0$ as $N\to\infty$ , uniformly in $t\in[0,T]$ and $x\in D([0,T];{\rm I\hskip-2.0ptR}^{d})$ . We want to study the moderate deviations of $Z^{N}$ , or in other words the large deviations of $Z^{N,\alpha}=N^{\alpha}Z^{N}$ . In what follows, we shall denote by $Z^{N,\alpha}_{z}$ the process $Z^{N,\alpha}$ starting from $Z^{N,\alpha}(0)=N^{\alpha}(\{z^{\ast}+N^{-\alpha}z\}_{N}-z^{\ast})$ . From (4.2), $Z^{N,\alpha}_{z}=F_{z,N}(\widetilde{Y}^{N,\alpha})$ , hence the following statement is a consequence of Theorem 4.7, (4.18) and Corollary 4.2.21 from [2].

Theorem 4.10.

Assume that $(H.1)$ and $(H.2)$ hold. The collection of processes $\{Z^{N,\alpha}_{z}(t),\ 0\leq t\leq T\}_{N\geq 1}$ satisfies a large deviations principle with speed $a_{N}=N^{2\alpha-1}$ and the good rate function

[TABLE]

More precisely, for any Borel subset $\Gamma\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ ,

[TABLE]

Since the mapping $F_{z}$ has the nice property that $F_{z}(x)(t)-F_{z^{\prime}}(x)(t)=\exp[\nabla b(0)t](z-z^{\prime})$ , it follows readily again from Corollary 4.2.21 in [2] that the above result can be extended to the following statement.

Theorem 4.11.

Assume that $(H.1)$ and $(H.2)$ hold. For any closed set $F\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ , for any sequence $z_{N}\to z$ ,

[TABLE]

For any open set $G\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ , for any sequence $z_{N}\to z$ ,

[TABLE]
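The property of $F_{z}$ used to pass from Theorem 4.10 to Theorem 4.11 can be illustrated numerically in dimension $d=1$ . The Python sketch below assumes, as a reading of the definition of $F_{z}$ , that $y=F_{z}(x)$ solves $y(t)=z+\int_{0}^{t}B\,y(s)\,ds+x(t)$ with $B$ standing for $\nabla b(0)$ ; the explicit Euler scheme, the value of $B$ and the driving path are illustrative choices. It checks that two solutions driven by the same $x$ but started from $z$ and $z^{\prime}$ differ by $e^{Bt}(z-z^{\prime})$ , so that the dependence on the starting point is continuous, uniformly in $x$ .

```python
import math

def F(z, x, B, T=1.0, n=20_000):
    """Euler approximation of y(t) = z + int_0^t B*y(s) ds + x(t) on a uniform grid."""
    dt = T / n
    integral = 0.0
    ys = []
    for k in range(n + 1):
        y = z + integral + x(k * dt)
        ys.append(y)
        integral += B * y * dt       # left-endpoint rule for the integral term
    return ys

B = -1.0                                  # stands for nabla b(0) < 0 (stable equilibrium)
x = lambda t: 0.2 * math.sin(5.0 * t)     # arbitrary driving path with x(0) = 0
y1 = F(0.5, x, B)
y2 = F(-0.3, x, B)
diff_T = y1[-1] - y2[-1]                  # difference of the two solutions at time T = 1
```

The driving path cancels in the difference, which therefore solves the homogeneous linear equation started from $z-z^{\prime}$ .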

From this last Theorem, we can deduce, with the same proof as that of Corollary 5.6.15 in [2], the following Corollary.

Corollary 4.12.

Assume that $(H.1)$ and $(H.2)$ hold. Let $K$ denote an arbitrary compact subset of ${\rm I\hskip-2.0ptR}^{d}$ .

For any closed set $F\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ ,

[TABLE]

For any open set $G\subset D([0,T];{\rm I\hskip-2.0ptR}^{d})$ ,

[TABLE]

4.7 Wentzell–Freidlin theory and extinction of an epidemic

We now define

[TABLE]

where $a>0$ , and we recall that we have translated the endemic equilibrium $z^{\ast}$ to the origin.

We can now state our main result.

Theorem 4.13.

Assume that $(H.1)$ and $(H.2)$ hold. For some $a>0$ , let $T^{N}_{z,a}:=\inf\{t>0,\ Z^{N,\alpha}_{z,1}(t)\leq-a\}$ , where $Z^{N,\alpha}_{z,1}(t)$ denotes the first coordinate of the process $Z^{N,\alpha}_{z}(t)$ . The following hold.

For any $z\in{\rm I\hskip-2.0ptR}^{d}$ such that $z_{1}>-a$ , and any $\eta>0$ ,

[TABLE]

and

[TABLE]

Given Corollary 4.12, the proof of the above result follows the exact same steps as that of Theorem 5.7.11 in [2], with some minor modifications, to adapt to the fact that our processes have discontinuous trajectories, see the proof of Theorem 7.14 in [5], or of Theorem 4.2.17 in [1].

Recall that $a_{N}^{-1}=N^{1-2\alpha}$ . In the CLT regime, $\alpha=1/2$ , $a_{N}^{-1}=1$ , while in the LD regime, $\alpha=0$ , $a_{N}^{-1}=N$ .

4.7.1 Interpretation. The critical population size

Going back to the original coordinates, i.e. $z^{\ast}\not=0$ , we should interpret $Z^{N,\alpha}(t)$ as $Z^{N,\alpha}(t)=N^{\alpha}(Z^{N}(t)-z^{\ast})$ . So (dropping the index for the starting point in order to simplify our notations), $T^{N}_{a}$ is the first time when $Z^{N}_{1}(t)\leq z^{\ast}_{1}-aN^{-\alpha}$ . For $T^{N}_{a}$ to be finite, we need to have $z^{\ast}_{1}-aN^{-\alpha}\geq 0$ , since $Z^{N}_{1}(t)$ cannot become negative. This is of course no problem for the limit theorem, since $aN^{-\alpha}\to 0$ as $N\to\infty$ , while $z^{\ast}_{1}$ is fixed. However, a deviation of the order of $-aN^{-\alpha}$ is enough for $Z^{N}_{1}(t)$ to hit zero, if $z^{\ast}_{1}$ is of the order of $N^{-\alpha}$ , which means that $N$ is of the order of $(z^{\ast}_{1})^{-1/\alpha}$ . The quantity $e^{N^{1-2\alpha}\overline{V}_{a}}$ is the order of magnitude of the time needed for $Z^{N}_{t}-z_{t}$ to make a deviation of size $aN^{-\alpha}$ . Such a deviation is sufficient to extinguish an epidemic, provided $z_{1}^{\ast}$ is of the same order, so that the corresponding critical size is $N_{c,\alpha}\sim(1/z_{1}^{\ast})^{1/\alpha}$ , which is roughly the CLT critical population size raised to the power $1/(2\alpha)$ .
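The orders of magnitude discussed above are easy to tabulate. The following Python sketch is purely illustrative: the function names and the value $z_{1}^{\ast}=0.01$ are assumptions made here, and the formulas only capture orders of magnitude, not sharp constants.

```python
def critical_size(z1_star, alpha):
    """Order of magnitude N_c ~ (1 / z1_star) ** (1 / alpha), for 0 < alpha <= 1/2."""
    return (1.0 / z1_star) ** (1.0 / alpha)

def log_extinction_time(N, alpha, V_a):
    """Exponent N^(1 - 2 alpha) * V_a of the time scale exp(N^(1 - 2 alpha) * V_a)."""
    return N ** (1.0 - 2.0 * alpha) * V_a

z1_star = 0.01                        # hypothetical endemic proportion of infectious
sizes = {alpha: critical_size(z1_star, alpha) for alpha in (0.5, 0.4, 0.25)}

# Raising the CLT critical size (alpha = 1/2) to the power 1/(2 alpha)
# recovers the critical size for a general alpha.
from_clt = sizes[0.5] ** (1.0 / (2.0 * 0.25))
```

With these values the critical size grows from about $10^{4}$ at $\alpha=1/2$ to about $10^{8}$ at $\alpha=1/4$ , while for $\alpha=1/2$ the exponent $N^{1-2\alpha}\overline{V}_{a}$ no longer depends on $N$ .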

4.7.2 The value of $\overline{V}_{a}$ in the SIS model

In the particular case of the SIS model, we can compute explicitly the value of the quasi–potential $\overline{V}_{a}$ . In this case $d=1$ , and the linearized ODE around the endemic equilibrium, translated to $0$ , reads

[TABLE]

and the cost functional to minimize is

[TABLE]

We are looking for the minimal cost for driving $x$ from $0$ to $-a$ . We now exploit the Pontryagin maximum principle, see [8]. The Hamiltonian reads

[TABLE]

The optimal control $\hat{u}$ must maximize the Hamiltonian, so it satisfies $\hat{u}=\frac{2\gamma(\lambda-\gamma)}{\lambda}p$ . Since the final time is free and the system is autonomous, the Hamiltonian vanishes along the optimal trajectory, so that along such a trajectory, either $p=0$ , in which case $\hat{u}=0$ , or else $x=\frac{\gamma}{\lambda}p$ , hence $\hat{u}=2(\lambda-\gamma)x$ . Finally the pieces of optimal trajectory which move towards the origin correspond to $u\equiv 0$ , while those which move away from the origin (this is the case we are interested in) satisfy the time reversed ODE $\dot{x}=(\lambda-\gamma)x$ . There is no optimal trajectory from $x=0$ to $x=-a$ . However, if we start from $x=-{\varepsilon}$ , the optimal trajectory is $x(t)=-e^{(\lambda-\gamma)t}{\varepsilon}$ , so $\hat{u}(t)=-2(\lambda-\gamma)e^{(\lambda-\gamma)t}{\varepsilon}$ , the final state $-a$ is reached at time $(\lambda-\gamma)^{-1}\log(a/{\varepsilon})$ , and the optimal cost is $\frac{\lambda}{2\gamma}(a^{2}-{\varepsilon}^{2})$ . A possible sub–optimal control starting from $0$ is as follows. Choose $u=-1$ for a time of order ${\varepsilon}$ , until $x(t)$ reaches $-{\varepsilon}$ , whose cost is of the order of ${\varepsilon}$ , and then choose the optimal feedback, until $-a$ is reached. Letting ${\varepsilon}\to 0$ , the total cost converges to

$$\overline{V}_{a}=\frac{\lambda}{2\gamma}\,a^{2}.$$
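The optimal cost $\frac{\lambda}{2\gamma}(a^{2}-{\varepsilon}^{2})$ quoted above can be recovered by direct numerical integration along the optimal trajectory. The Python sketch below assumes that the running cost is $\frac{\lambda}{4\gamma(\lambda-\gamma)}u^{2}$ , i.e. $u^{2}/(2\sigma^{2}(z^{\ast}))$ with $\sigma^{2}(z^{\ast})=2\gamma(\lambda-\gamma)/\lambda$ ; this form is inferred here from the relations $x=\frac{\gamma}{\lambda}p$ and $\hat{u}=2(\lambda-\gamma)x$ and should be compared with the displayed cost functional. The numerical values of $\lambda$ , $\gamma$ , $a$ and ${\varepsilon}$ are illustrative.

```python
import math

lam, gam = 2.0, 1.0
a, eps = 0.4, 0.01
k = lam - gam                                  # growth rate of the time reversed ODE
c = lam / (4.0 * gam * (lam - gam))            # assumed weight of u^2 in the running cost

t_end = math.log(a / eps) / k                  # time at which x(t) = -eps*exp(k t) reaches -a
n = 200_000
dt = t_end / n
cost = 0.0
for i in range(n):
    t = (i + 0.5) * dt                         # midpoint rule
    x = -eps * math.exp(k * t)                 # optimal trajectory started from -eps
    u = 2.0 * k * x                            # optimal feedback u = 2 (lam - gam) x
    cost += c * u * u * dt

closed_form = lam / (2.0 * gam) * (a * a - eps * eps)
V_bar = lam / (2.0 * gam) * a * a              # limit of the cost as eps -> 0
```

The numerical integral agrees with the closed form, and the gap to `V_bar` is of order ${\varepsilon}^{2}$ , consistent with the limit ${\varepsilon}\to 0$ taken in the text.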

4.8 Comparison between the CLT, MD and LD

We carry out that comparison in the case of the SIS model, for which we have explicit expressions for the rate functions and the quasi–potentials. We still translate $z^{\ast}$ to the origin, and start our process at the origin: $Z^{N}_{0}=0$ . In contrast with the above, we fix $a>0$ and want to compare (for $t$ large) the upper bounds for ${{\rm I\hskip-2.0ptP}}(N^{\alpha}Z^{N}_{t}\geq a)$ in the three cases $\alpha=1/2$ (the central limit theorem), $0<\alpha<1/2$ (moderate deviations) and $\alpha=0$ (large deviations).

We start with the central limit theorem. It is easy to see that $U_{t}=\lim_{N\to\infty}\sqrt{N}Z^{N}_{t}$ solves the SDE

[TABLE]

so that the asymptotic variance of $U_{t}$ is $\gamma/\lambda$ . Consequently for $a>0$ fixed and any $\eta>0$ , there exist $t$ and $N$ large enough such that we have the following upper bound for the probability of a positive deviation of $\sqrt{N}Z^{N}_{t}$

[TABLE]

This bound follows from the following estimate, valid for a $\mathcal{N}(0,\sigma^{2})$ random variable $\xi$ : ${{\rm I\hskip-2.0ptP}}(\xi>a)\leq e^{-\lambda a}e^{\frac{\lambda^{2}\sigma^{2}}{2}}$ after optimizing over $\lambda>0$ .
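The optimization behind this Gaussian estimate is short enough to check numerically. In the Python sketch below the optimization parameter is called `theta` (rather than $\lambda$ ) to avoid a clash with the infection rate; the numerical values are illustrative. Minimizing $e^{-\theta a+\theta^{2}\sigma^{2}/2}$ over $\theta>0$ gives $\theta=a/\sigma^{2}$ and the bound $e^{-a^{2}/(2\sigma^{2})}$ , which for $\sigma^{2}=\gamma/\lambda$ equals $e^{-\lambda a^{2}/(2\gamma)}$ .

```python
import math

def gaussian_tail(a, sigma2):
    """Exact P(xi > a) for xi ~ N(0, sigma2)."""
    return 0.5 * math.erfc(a / math.sqrt(2.0 * sigma2))

def chernoff_bound(a, sigma2):
    """min over theta > 0 of exp(-theta a + theta^2 sigma2 / 2), attained at theta = a / sigma2."""
    theta = a / sigma2
    return math.exp(-theta * a + 0.5 * theta * theta * sigma2)

lam, gam = 2.0, 1.0
sigma2 = gam / lam            # asymptotic variance of U_t in the SIS model
a = 1.5

exact = gaussian_tail(a, sigma2)
bound = chernoff_bound(a, sigma2)
```

The bound is not sharp for a fixed $a$ , but it captures the correct exponential rate in $a^{2}$ , which is what matters for the comparison with the moderate and large deviations estimates.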

Consider next the moderate deviations. Theorem 4.10 combined with the computation from the last subsection indicates that for $0<\alpha<1/2$ , any $\eta>0$ , there exist $t$ and $N$ large enough such that

[TABLE]

We finally consider the large deviations. Here we need to assume that $a<\gamma/\lambda$ . We exploit the computations from sections 4.2.6 and A.6 in [1]. The optimal trajectory to go from ${\varepsilon}$ to $a$ is the original ODE, but time reversed, i.e. it follows the ODE $\dot{x}_{t}=\beta_{2}(x_{t})-\beta_{1}(x_{t})$ . The running cost is $(\beta_{2}(x_{t})-\beta_{1}(x_{t}))\log\left(\frac{\beta_{2}(x_{t})}{\beta_{1}(x_{t})}\right)$ , so the total cost is

[TABLE]

Consequently, from Theorem 3.4, for any $\eta>0$ , there exist $t$ and $N$ large enough such that

[TABLE]

We note that Moderate Deviations resemble the Central Limit Theorem much more than Large Deviations. The fact that the discontinuity in the form of the rate function is exactly at $\alpha=0$ is typical of random variables with light tails. The situation would be quite different with heavy tails, see e.g. section VIII.4 in Petrov [7].

Note however that for small $a$ ,

[TABLE]

which is not too surprising, and in a sense reconciles Large Deviations and Moderate Deviations. Were our driving noises Brownian, then the LD rate function would be quadratic as that of MD, but the LD quasi–potential is the minimal cost when controlling the LLN ODE, while the MD quasi–potential is the minimal cost when controlling the linearized ODE around the endemic equilibrium.
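The agreement of the two costs for small $a$ can be observed numerically. The Python sketch below integrates the running cost described above, which along the time reversed ODE reduces to $\int_{0}^{a}\log\big(\beta_{2}/\beta_{1}\big)\,dx$ , and compares it with the quadratic cost $\frac{\lambda}{2\gamma}a^{2}$ . It assumes that in the translated coordinates the rates are evaluated at $z=z^{\ast}+x$ , and it starts the integral at $0$ rather than at ${\varepsilon}$ ; both are simplifications made here, and the rates $\lambda=2$ , $\gamma=1$ are illustrative.

```python
import math

lam, gam = 2.0, 1.0

def ld_cost(a, n=100_000):
    """Midpoint-rule value of int_0^a log(beta2/beta1) dx, with rates taken at z = z_star + x."""
    z_star = 1.0 - gam / lam
    dx = a / n
    total = 0.0
    for i in range(n):
        z = z_star + (i + 0.5) * dx
        total += math.log(gam * z / (lam * z * (1.0 - z))) * dx
    return total

def md_cost(a):
    """Quadratic cost lam * a^2 / (2 gam)."""
    return lam * a * a / (2.0 * gam)

# Ratio of the two costs for decreasing deviation sizes a < gam / lam.
ratios = {a: ld_cost(a) / md_cost(a) for a in (0.2, 0.05, 0.01)}
```

The ratio decreases towards $1$ as $a\to 0$ , with a relative correction of order $\lambda a/(3\gamma)$ coming from the next term in the expansion of the logarithm.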

5 Appendix

In this Appendix, we establish the following technical result.

Proposition 5.1.

Let ${\mathcal{M}}$ be a standard Poisson random measure on ${\rm I\hskip-2.0ptR}^{2}_{+}$ , and $\overline{\mathcal{M}}(dt,du)={\mathcal{M}}(dt,du)-dt\,du$ the associated compensated measure. If $\varphi$ is an ${\rm I\hskip-2.0ptR}_{+}$ –valued predictable process such that $\int_{0}^{T}\varphi_{t}dt$ has exponential moments of any order, and $a\in{\rm I\hskip-2.0ptR}$ , then for any $0\leq t\leq T$ ,

[TABLE]

Proof Consider with $b\geq 0$ the process

[TABLE]

It follows from Itô’s formula that

[TABLE]

From Lemma 5.2 below, $M_{t}=\int_{0}^{t}\int_{0}^{\varphi_{s}}e^{X_{s-}}\overline{\mathcal{M}}(ds,du)$ is a martingale. Hence $e^{X}$ is a martingale if $b=(e^{a}-1-a)$ , a submartingale if we replace $=$ by $<$ , and a supermartingale if we replace $=$ by $>$ . Consequently if $b\geq(e^{a}-1-a)$ , ${{\rm I\hskip-2.0ptE}}e^{X_{t}}\leq 1$ . Now, using first Doob’s $L^{2}$ inequality for submartingales, and then the Cauchy–Schwarz inequality, we have

[TABLE]

If $2b=e^{2a}-1-2a$ , it follows from the previous argument that the first factor on the second right hand side is less than or equal to $1$ , hence the result follows. $\square$
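The role of the critical value $b=e^{a}-1-a$ can be checked exactly in the simplest case of a constant $\varphi$ . The Python sketch below assumes that $X_{t}=a\int_{0}^{t}\int_{0}^{\varphi_{s}}\overline{\mathcal{M}}(ds,du)-b\int_{0}^{t}\varphi_{s}ds$ , which is how the proof above is read here; with $\varphi$ constant, $X_{t}=a(N-\mu)-b\mu$ where $N$ is a Poisson random variable with mean $\mu=\varphi t$ . The function name and the numerical values are illustrative.

```python
import math

def exp_moment(a, b, mu, terms=400):
    """E exp(a (N - mu) - b mu) for N ~ Poisson(mu), via the truncated series."""
    total = 0.0
    log_p = -mu                                   # log P(N = 0)
    for n in range(terms):
        total += math.exp(log_p + a * (n - mu) - b * mu)
        log_p += math.log(mu) - math.log(n + 1)   # update to log P(N = n + 1)
    return total

a, mu = 0.7, 5.0
b_crit = math.exp(a) - 1.0 - a                    # value of b making exp(X) a martingale

at_crit = exp_moment(a, b_crit, mu)               # expectation at the critical b
above = exp_moment(a, 2.0 * b_crit, mu)           # larger b: supermartingale regime
below = exp_moment(a, 0.5 * b_crit, mu)           # smaller b: submartingale regime
```

In closed form the expectation equals $\exp\{\mu(e^{a}-1-a-b)\}$ , which is $1$ at the critical $b$ , below $1$ for larger $b$ and above $1$ for smaller $b$ .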

In order to complete the proof of Proposition 5.1, we still need to establish

Lemma 5.2.

The process $\varphi$ satisfying the same assumptions as in Proposition 5.1, and $X_{t}$ being given by (5.1), $M_{t}=\int_{0}^{t}\int_{0}^{\varphi_{s}}e^{X_{s-}}\overline{\mathcal{M}}(ds,du)$ is a martingale.

Proof It is plain that $M_{t}$ is a local martingale, whose predictable quadratic variation is given as

[TABLE]

All we need to show is that the above quantity is integrable. It is clearly a consequence of the assumption in case $a<0$ . In case $a>0$ , the second factor of the right hand side has finite exponential moments, so is square integrable, and all we need to show is that

[TABLE]

Using Itô’s formula we have

[TABLE]

The same computation with $\varphi_{s}$ replaced by $\varphi^{n}_{s}=\varphi_{s}\wedge n$ , and then $Y_{s}$ replaced by $Y^{n}_{s}$ would show that $Y_{t}^{n}$ is a martingale satisfying ${{\rm I\hskip-2.0ptE}}Y^{n}_{t}=1$ . But $0\leq Y^{n}_{t}\to Y_{t}$ a.s., hence Fatou’s Lemma implies that ${{\rm I\hskip-2.0ptE}}Y_{t}\leq 1$ . Since

[TABLE]

it follows from the Cauchy–Schwarz inequality that

[TABLE]

and the result follows from our assumption on $\varphi$ . $\square$

Acknowledgement

It is a pleasure to thank Pierre Petit for an inspiring discussion on moderate deviations.

Bibliography


[1] Tom Britton and Etienne Pardoux, Stochastic Epidemics in a Homogeneous Community, arXiv:1808.0535, submitted.
[2] Amir Dembo and Ofer Zeitouni, Large Deviations, Techniques and Applications, 2nd ed., Applications of Mathematics 38, Springer, New York, 1998.
[3] Stewart N. Ethier and Thomas G. Kurtz, Markov processes. Characterization and convergence, J. Wiley, 1986.
[4] Mark I. Freidlin and Alexander D. Wentzell, Random perturbations of dynamical systems, 3rd ed., Grundlehren der Mathematischen Wissenschaften 260, Springer, New York, 2012.
[5] Peter Kratz and Etienne Pardoux, Large deviations for infectious diseases models, in Séminaire de Probabilités XLIX, C. Donati-Martin, A. Lejay, A. Rouault eds., Lecture Notes in Math. 2215, pp. 221–327, 2018.
[6] Etienne Pardoux and Brice Samegni–Kepgnou, Large deviation principle for epidemic models, Journal of Applied Probability 54, 905–920, 2017.
[7] Vasily V. Petrov, Sums of Independent Random Variables, Ergebnisse der Mathematik und ihrer Grenzgebiete 82, Springer Verlag, 1975.
[8] Lev S. Pontryagin, Vladimir G. Boltyanskii, Revaz V. Gamkrelidze and Evgenii F. Mishchenko, The mathematical theory of optimal processes, transl. by K. N. Trirogoff, ed. by L. W. Neustadt, John Wiley & Sons, 1962.
