On a family of stochastic SVIR influenza epidemic models and maximum likelihood estimation
Divine Wanduku

TL;DR
This paper introduces a family of stochastic SVIR influenza models incorporating vaccination effects, and develops maximum likelihood and EM methods for parameter estimation, including the basic reproduction number, supported by numerical simulations.
Contribution
It presents a novel stochastic SVIR model family with trinomial transitions and derives maximum likelihood and EM estimation methods for key epidemiological parameters.
Findings
Models effectively incorporate vaccination and immunity dynamics.
Maximum likelihood and EM estimators are derived for model parameters.
Numerical simulations demonstrate model applicability and estimation accuracy.
Abstract
This study presents a family of stochastic models for the dynamics of influenza in a closed human population. We consider treatment for the disease in the form of vaccination, and incorporate the periods of effectiveness of the vaccine and infectiousness for the individuals in the population. Our model is a SVIR model, with trinomial transition probabilities, where all individuals who recover from the disease acquire permanent natural immunity against the strain of the disease. Special SVIR models in the family are presented, based on the structure of the probability of getting infection and vaccination at any instant. The methods of maximum likelihood, and expectation maximization are derived for the parameters of the chain. Moreover, estimators for some epidemiological assessment parameters, such as the basic reproduction number are computed. Numerical simulation examples are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On a family of stochastic SVIR influenza epidemic models and maximum likelihood estimation
Divine Wanduku
Department of Mathematical Sciences, Georgia Southern University, 65 Georgia Ave, Room 3042, Statesboro, Georgia, 30460, U.S.A. E-mail:[email protected];[email protected] author. Tel: +14073009605.
Abstract
This study presents a family of stochastic models for the dynamics of influenza in a closed human population. We consider treatment for the disease in the form of vaccination, and incorporate the periods of effectiveness of the vaccine and infectiousness for the individuals in the population. Our model is a SVIR model, with trinomial transition probabilities, where all individuals who recover from the disease acquire permanent natural immunity against the strain of the disease. Special SVIR models in the family are presented, based on the structure of the probability of getting infection and vaccination at any instant. The methods of maximum likelihood, and expectation maximization are derived for the parameters of the chain. Moreover, estimators for some epidemiological assessment parameters, such as the basic reproduction number are computed. Numerical simulation examples are presented for the model.
keywords:
Influenza epidemics , chain-trinomial model, MLE method, EM algorithm, basic reproduction number
††journal: Journal of ***
1 Introduction
Influenza ranks amongst the top ten most important diseases in the USA. In fact, the CDC estimates that from 2010-2011 to 2013-2014, influenza-associated deaths in the United States ranged from a low of 12,000 (during 2011-2012) to a high of 56,000 (during 2012-2013) annually [32]. Globally, WHO estimates that seasonal influenza results in approximately 290-650 thousand deaths each year from just influenza related respiratory diseases alone[35, 21]. These statistics necessitate more investigation and understanding about the disease, in order to control or ameliorate the burdens of the disease.
For hundreds of years, influenza, more commonly know as the flu, has plagued mankind. The flu is caused by a virus, and spreads from person to person through direct contact and through particulates in the air. Symptoms include- fever, cough, sore throat, runny or stuffy nose, muscle or body aches, headaches, fatigue, and even vomiting and diarrhea [28].
There are several strains of the influenza virus, all causing similar symptoms with varying severity. Influenza viruses are generally categorized into four subgroups: A, B, C and D. Human influenza A and B viruses are the most common and cause seasonal epidemics of disease almost every winter in the United States. The emergence of a new and very different influenza A virus to infect people can cause an influenza pandemic. For instance, the recent ”swine flu” (officially called H1N1), and ”bird flu” (H5N1). See [30].
An individual infected with the flu can spread it to others up to about 6 feet away through droplets from their mouth from coughing, sneezing, or merely talking. Once an individual has been infected with the flu virus, it usually takes about 2 days incubation period before symptoms start to show. From the onset of symptoms, it can take 7 to 10 days for an individual to recover. In general, adults who are infected with the influenza virus are capable of spreading it to others a day before symptoms showup as well as throughout the duration of the illness (cf.[31]).
In an attempt to slow the spread of seasonal influenza, the CDC recommends annual influenza vaccination for people 6 months of age and older. The most common vaccination is the seasonal flu injection, standard-dose trivalent shot (IIV3), and there are several other options available. One setback is that the flu virus constantly mutates and vaccination against one strain may not protect against other strains. Once vaccinated, it may take up to two weeks before the vaccine is completely effective. See [33].
Various types of mathematical compartmental models have been used to investigate the dynamics of infectious diseases of humans[1, 22, 23, 24, 25, 27] . For instance, the authors in [27] present a deterministic model for influenza. The author in [22] presents an epidemic dynamic model for malaria. The authors in [23, 24, 25] present stochastic epidemic dynamic models with white noise perturbations for influenza.
The compartmental mathematical epidemic dynamic models are largely classified as SVIS, SVIRS, SIS, SIR, SIRS, SEIRS, and SEIR etc. epidemic dynamic models depending on the compartments of the disease states directly involved in the general disease dynamics [1, 22, 23, 24, 25, 27]. For instance, SIR model, represents transitions from the susceptible state to the infectious, and then to the removed state. An SEIR model adds the incubation period via an exposed class to the previous SIR model. SVIR models incorporate the vaccination class , and represent the dynamics of diseases that grant permanent immunity after recovery from disease, for example, influenza, where naturally acquired immunity against a strain of the virus is conferred after bacterial infection [1, 10, 11, 12].
Stochastic epidemic dynamic models more realistically represent epidemic dynamic processes because they include the randomness which inevitably occurs during a disease outbreak, owing to the presence of constant random environmental fluctuations in the disease dynamics. For stochastic epidemic models, the state of the system over time, which usually represents the number of people susceptible (S), vaccinated (V), infected but not infectious (E), infected and infectious (I), or removed with natural immunity (R) is a stochastic process. Furthermore, the stochastic process for the disease dynamics can be a continuous-time-continuous state, continuous-state-discrete-time, or discrete-time and discrete state stochastic process. The choice on which type of stochastic process to use to represent the dynamics of the disease depends on the finiteness or infiniteness of the epidemiological features of the disease that are represented mathematically. Some examples of continuous-time-continuous state stochastic epidemiological studies include [24, 25, 26, 23], and discrete-time-discrete-state models include [1, 8, 10, 20].
In our study, we derive an SVIR discrete-time and discrete state stochastic chain model, wherein temporary artificial immunity through vaccination is considered, in addition to the lifelong naturally acquired immunity for influenza in a closed community of human beings.
Chain binomial models are an important class of Markov chain models, and they have been used to represent and study the infectious diseases epidemic over time [1, 2, 3, 6, 7, 8]. This family of stochastic models represent the disease dynamics as a random process, where transition between some states such as from susceptible to infectious, or from vaccinated to susceptible etc. are characterized by binomial transition probabilities.
Some pioneer chain binomial models for infectious diseases include- the Greenwood and Reed-Frost models[2, 3]. Many other authors such as [1, 8, 20] have extended and expanded on the ideas of Greenwood and Reed-Frost, creating more realistic epidemic dynamic models. Greenwood[2] in the 1931 study presented a SIS chain binomial chain model for the spread of diseases in human populations, where a breakdown of the population into successive generations indexed by is used. Furthermore, the susceptible individuals are infected in a given generation, and are only capable of infecting others only within that generation. And after which, they remain infected, but isolated from subsequent disease spread. Greenwood assumed that a susceptible person meets only one infectious person at any instance, and interaction with the infectious person leads to transmission of disease with probability .
Reed-Frost[3] built upon most of the ideas of the Greenwood model, and presented a SIR epidemic model. Moreover, Reed-Frost made a more realistic assumption that a susceptible person at any instance has a chance to interact independently with any of the infectious individuals present at that instance. Furthermore, the probability of getting infection at any instance is given by , where is the number of infectious people present at the instance .
The previous Greenwood and Reed-Frost models consider generations of infections, and the infectious individuals no longer participate in subsequent transmission of disease to susceptible persons. This assumption is suitable for generalizations of disease dynamics, where the disease suddenly outbreaks in a given time generation, dies out, and reoccur in another time generation.
Tuckwell and Williams [1] utilized some ideas from Reed-Frost[3], and proposed a more realistic SIR epidemic dynamic model. In their model, infection takes place over discrete time intervals, and the infectious population at any instance comprises of all infectious persons at different ages of their infectiousness over the constant duration of the infectious period . The age represents how long an individual has been infectious since initial infection from the susceptible state. In their model, the total infectious individuals at any instance participates in the disease transmission process, and it is assumed that at the end of the infectious period , all infectious persons are treated, fully recovered, and join the removed class .
Tuckwell and Williams [1] also assume that the individual in the population at any instant encounter a fixed number of people (family, office mates, employees) denoted , and also a random number of people , where the random variables are mutually independent and independent on the state of the population. This assumption leads to a more realistic random process description for the total number of people the individual encounters at any instance . Moreover, the probability of getting infection at any instance becomes dependent on the number of infectious people the susceptible individual meets among the people.
Using ideas from Tuckwell-Williams [1], we propose a susceptible-vaccinated-infected-removed (SVIR) epidemic discrete-time and discrete state family of chain binomial models for influenza. Treatment in the form of vaccination is available, where the vaccine confers temporary effective artificial immunity against the disease, but ultimately wanes over time. In line with [1], the infectious population involves in the disease transmission process over all generations of the population over time, and over all ages of infectiousness. However, unlike [1], the vaccination class is incorporated, and the total infectious and vaccinated populations are characterized based on the ages of infection and vaccination since initial infection and vaccination, over the constant and finite infectious and effective vaccination periods, respectively. Moreover, as far as we know, no other discrete-time, discrete state chain binomial study in the line of thinking of Tuckwell-Williams[1] with multiple time delays, and with the vaccination class exists in the literature. This study provides a suitable and more realistic extension of the chain binomial models- Greenwood, Reed-frost and Tuckwell-William models above.
The rest of this paper is organized as follows: in Section 2, we present an adequate description of the SVIR human population over discrete time. In Section 3, we derive the transition probabilities of the SVIR Markov chain family of models, and present special cases of the general SVIR markov chain model. In Section 5, we present the method of maximum likelihood estimation to find important parameters of the SVIR chain models. In Section 6, we estimate some important epidemiological parameters for the influenza epidemic model. Finally, in Section LABEL:ch7, we present numerical examples of the SVIR influenza model, and make general concluding remarks.
2 Description of the SVIR Influenza Epidemic Process
In this section, we describe and represent the influenza epidemic in the human population. We present the procedure to discretize time, and decompose the human population into the different states of the disease involved in the influenza epidemic
We consider a human population of size living in a closed natural environment over a period of time consistent with the duration of an epidemic outbreak, where it can be assumed that human movement into and out of the population is negligible or nonexistent. Furthermore, it is assumed that an infectious disease such as a flu, and other similar respiratory infectious diseases etc. breaks out in the closed population, and the disease causes major suffering in the human being, but with no mortal consequences due to disease related causes. In addition, the population is relatively safe, and no natural deaths occur during the duration of the epidemic outbreak. Also, it is assumed no births occur during the duration of the epidemic or very strong measures are taken medically to protect all newborns, such that they can be ignored from the effective diseased population.
It is also assumed that the infectious disease has an artificial vaccine which provides temporary effective immunity against the infectious agent that lasts over a constant immunity period . At the end of the immunity period , the vaccinated person becomes susceptible again to the disease. The disease also confers natural immunity after recovery from the infection. The naturally acquired immunity also provides effective natural protection against the disease agent, that lasts over a period of time longer than the duration of the epidemic in the population. Unlike the temporary artificial immunity period , it is assumed that the naturally acquired immunity period is constant infinite. It is also assumed that the infectious agent has strong infectious abilities, such that all susceptible individuals who contract the infectious agent exhibit symptoms of illness after a relatively small time interval, and become infectious to other susceptible persons. Thus, the incubation period of the disease is considered small, and about one time unit, and consequently, the exposure class can be ignored.
It is further assumed that the infectious period of all individuals who have contracted the disease is constant and finite, and it is denoted . At the end of that period , it is assumed that the immune system of the infected individual has established sufficient natural immunity against the particular strain of the disease, which lasts longer than the overall duration of the epidemic, and as a result individuals with naturally acquired immunity are removed from the epidemic process.
From the above description, the human population of size is subdivided into four major human subclasses namely: Susceptible (), vaccinated (), infectious (), and removed (). The susceptible class () does not have the disease, but are vulnerable to infection from the infectious class (). A portion of the susceptible individuals is vaccinated artificially against the disease, and become the vaccinated class (). The vaccinated class () can no longer contract the disease during the effective artificially acquired temporary immunity period of the vaccine. At the end of the period , the vaccine wanes, and the vaccinated individuals become susceptible again to the disease. When the infectious class () recovers from the disease at the end of the constant infectious period of time , the individuals acquire natural immunity against the disease that can be considered to be constant and infinite. The individuals with naturally acquired immunity form the removed class denoted . It is further assumed that the naturally acquired immunity is very effective such that the removed class never becomes susceptible again to the disease over the overall duration of the disease.
The general susceptible population at any time over the duration of the epidemic, can be further broken down into two subclasses and , where represents all susceptible individuals present at time , who have never contracted the disease nor been vaccinated, and represents the number of susceptible individuals present at time , who were previously vaccinated, and have lost the artificial immunity against the disease after the effective period . That is,
[TABLE]
Note that over the duration of the epidemic, when it is assumed that all individuals are vaccinated only once against that strain of influenza virus, regardless whether the vaccine wanes over time, it follows that, only the class will be liable to be vaccinated. When vaccination occurs multiple times, then all susceptible individuals will be liable to be vaccinate at any time . Also, at any time , the general susceptible class is vulnerable to infection. A compartmental framework exhibiting the transitions between the different states in the population is shown in Figure 1
In the following, we characterize the different disease subclasses namely:- susceptible, vaccinated, infectious and removed individuals over discrete time intervals of unit length, for example, days, weeks, months etc. The description of the discretization of time is presented in the following.
Definition 2.1
Time discretization process:
The different disease subclasses namely:- susceptible, vaccinated, infectious and removal individuals shall be counted over discrete time intervals of unit length. That is, over the time subintervals , where is any nonnegative real number. Furthermore, the number of individuals in any subclass of the population at any time , is taken to be the number of people in that state present in the subinterval of time , counted up to and including the point , where is a positive integer, that is, . Moreover, all events that occur at the initial time , refer explicitly to every event that occurred in the initial subinterval , which forms an initial state for the epidemic. In addition, when , then the initial interval reduces to the point .
For example, the number of susceptible people present at time , denoted , refers to the number of people in the susceptible state , who are present in the subinterval of time . More generally, is the number of susceptible people present in the subinterval .
Another discrete measure of time denoted by , represents how long individuals in the different disease states and , have been in the given states since their initial conversion into the states. We let and denote the number of susceptible, vaccinated, infectious and removal individuals, respectively, present at time , who have been in their different states for discrete time units, where . Note that according to the description of the time measure , then and signify the number of people present at time , who have just become susceptible, vaccinated, infectious and removed at time .
The decomposition of the susceptible population is presented in the following.
Definition 2.2
Decomposition and Aggregation of the Susceptible Population:
We decompose the susceptible population over the discrete times into subcategories based on (1) when a person first becomes susceptible to the disease, given that the person was previously vaccinated against the disease, and the artificial vaccine has worn out after the effective vaccination period , and (2) how long a person has been susceptible to the disease, since the initial time .
For , let denote the number of susceptible individuals present at time , who have been susceptible for discrete time units, where . That is, the number of susceptible individuals present at time can be written in the two equivalent ways
[TABLE]
where from (2.1),
[TABLE]
and
[TABLE]
From (2.3)-(2.4), since infection and vaccination occur over every time , it follows that , and .
Furthermore, for any time at least as large as the maximum of the artificial immunity and the infectious periods and , respectively, that is, for it is easy to see that the relationship between susceptibility and vaccination is given as follows:
[TABLE]
It follows from (2.2)-(2.5) that the total susceptible population at any time , or , can be written as follows:
[TABLE]
Also, for any time , or then
[TABLE]
*Also, under the assumption that the population is closed and there is no immigration of susceptible, vaccinated, infectious and removal individuals, then it follows from above that , , and . *
The decomposition of the vaccinated class at time is presented in the following.
Definition 2.3
Decomposition and Aggregation of the Vaccinated Population:
We also decompose the vaccinated population class into subcategories based on how long individuals have been vaccinated before returning to the susceptible class of the population, whenever the artificial immunity wanes. As with the previous definition, we decompose the vaccinated population over the discrete times into subcategories based on (1) when a person first becomes vaccinated against the disease, and (2) how long a person has been vaccinated against the disease, since the initial time , or simply how long a person has been vaccinated before the artificial immunity wears off completely after the period of effectiveness of the vaccines . For , let denote the number of vaccinated individuals present at time , who have been vaccinated for discrete time units, where . Letting be the number vaccinated people present at time t, then it can be seen that for any , then V(t) can be written in two different equivalent ways as follows: For or ,
[TABLE]
and since , then is equivalently expressed as follows:
[TABLE]
Also for any time at least as large as , or at least as large as the maximum of the artificial immunity and the infectious periods and , respectively, that is, for , or it is easy to see that some of the subclasses ’s in in the expression (2.8) are no longer in the vaccinated state, but have been converted into susceptible individuals. That is,
[TABLE]
Therefore, for , or , the number of people in the vaccinated class at time , is given by
[TABLE]
The decomposition of the infectious class at time is presented in the following.
Definition 2.4
Decomposition and Aggregation of the Infectious Population:
We also decompose the infectious population class into subcategories based on how long individuals have been infectious before transitioning into the removed class of the population. As with the previous definition, we decompose the infectious population over the discrete times into subcategories based on (1) when a person first becomes infectious and (2) how long a person has been infectious, since the initial time , and until the infectious period is lapsed.
For , let denote the number of infectious individuals present at time , who have been infectious for discrete time units, where . It follows that for any , the number of infectious persons present at time , denoted is written as follows: For For or ,
[TABLE]
where .
In addition, for any time at least as large as the infectious period , or at least as large as the maximum of the artificial immunity and the infectious periods and respectively, that is, for , or , it is easy to see that some of the subcategories in the expression (2.4) are no longer in the infectious class, but have been converted into the removed state. That is, for , or , it is easy to see that
[TABLE]
*or
[TABLE]
and the following infectious classes are removed:
[TABLE]
The decomposition of the removed class at time is presented in the following:
Definition 2.5
Decomposition and Aggregation of the Removed Population:
Finally, we decompose the removed population class into subcategories based on how long individuals have been removed from the infectious state. As with the previous definition, we decompose the removed population over the discrete times into subcategories based on (1) when a person first becomes removed, and (2) how long a person has been removed from the infectious state since the initial time of disease outbreak, or simply how long the person has been removed from the time when the person transitioned from the infectious state to the removal state. For , let denote the number of removed individuals present at time , who have been removed for discrete time units, where . It follows that for any time , the number of removed individuals at time denoted , can be written in two different equivalent ways as follows: For or ,
[TABLE]
That is, in (2.15) is the removed population at the initial time.
Furthermore, for any time at least as large as the infectious period , or at least as large as the maximum of the artificial immunity and the infectious periods and respectively, that is, for or ,
[TABLE]
where .
It is also easy to see that some of the subcategories in (2.5) are equivalent to some infectious subclasses as shown below:
[TABLE]
Therefore, for or , the number of removed individuals at time , is given by:
[TABLE]
Definition 2.6
Decomposition and Aggregation of the total human population:
*It can be seen from Definitions 2.2 - 2.5 that the total human population of size can be written as follows:
For all = 0,1,2,3,…*
[TABLE]
So for , the vector
[TABLE]
*is sufficient to define all states of the process. Also,
[TABLE]
where .
For , the vector
[TABLE]
*is sufficient to define all states of the process. Also,
[TABLE]
Also, for , the vector
[TABLE]
*is sufficient to define all states of the process. Also,
[TABLE]
where .
*In addition, for , the vector
[TABLE]
*is sufficient to define all states of the process. Also,
[TABLE]
3 Derivation of the SVIR Influenza Markov Chain Model and Transition Probabilities
In this section we derive the SVIR chain-binomial epidemic dynamic model for the influenza epidemic. The epidemic model presented is an extension of the studies by [1, 2, 9]. That is, we derive a random process that characterizes the dynamics of influenza in the human population structured over time as described in above, and derive the general formula for the transition probabilities of the Markov chain.
Define the following non-negative integer valued random vector function:
[TABLE]
where for each , and , the random vector is given by
[TABLE]
Also, for each , an observed vector value for the random vector is defined as follows:
[TABLE]
Note that for each , the observed values are non-negative integers. Furthermore,
[TABLE]
if and only if
[TABLE]
The random process , describes the evolution of the influenza epidemic in the population characterized above.
Furthermore, for
[TABLE]
is sufficient to define all the states of the process. For
[TABLE]
is sufficient to define all the states of the process. For
[TABLE]
is sufficient to define all states of the process. And for
[TABLE]
is sufficient to define all states of the process.
It should be noted from (3.5) - (3.8) that the random process takes different simplifications for different sub-intervals of the time interval, . Without loss of generality, we present the general form of the transition probability of the random process in the case where . In the following theorem, we show that the random process is a Markov chain.
Theorem 3.1
For , the random process , where the non-negative integer value random vector function , defines a Markov chain for all . That is, Furthermore, the transition probabilities are completely specified by the conditional distribution of the states and . That is,
[TABLE]
where the terms , , , and are related as follows
[TABLE]
That is,
If , then
[TABLE]
If , then
[TABLE]
If , then
[TABLE]
Proof:
As mentioned earlier, we show that is a Markov chain without loss of generality for . Moreover, we assume further without loss of generality that . Thus, . To show that is a Markov chain, it suffices to show that the Markov property holds in }. That is, we show that for all t
[TABLE]
where and are defined in (3.2) and (3.3). Indeed, from (3.8),
[TABLE]
Observe that at time , is expressed as follows:
[TABLE]
The right-hand-side (RHS) of (3.15) can be expressed explicitly as follows:
[TABLE]
By applying (3.16) to (3.17), it is easy to see that (3.17) reduces to
[TABLE]
Since most of the terms in the argument of the conditional probability in (3.18) are already given as the conditions of the probability, and and have no relationship with and , the Markov property in (3) follows immediately. Moreover, (3.18) further reduces to:
[TABLE]
or to
[TABLE]
Thus, the second part (3.9) is proved.
Remark 3.1
In reality, the relationship between the events of getting vaccinated and getting infected exhibit various mathematical forms depending on the properties of the particular disease scenario. That is, the joint conditional distributions of the random variables and , for any time can be be expressed in various forms depending on the disease scenario.
For instance, in more organized societies, the decision to be vaccinated, in some cases, is not influenced by the outbreak of influenza, since influenza seasons are predictable and periodic vaccination against influenza is encouraged. In other disease scenarios especially when the influenza outbreak is unpredictable, more people tend to get vaccinated whenever the influenza epidemic breaks-out, and more influenza cases have been reported. Also, in other influenza scenarios, the vaccine may not be sufficiently strong to prevent infection, but mainly to reduce the severity of infection. These scenarios can also create independence between vaccination and infection. One of such special influenza disease scenarios is described in the next section.
Observe Theorem 3.1[1.] for signifies that at the time step, all susceptible susceptible individuals are either vaccinated or infected by the virus. Theorem 3.1[2.] for signifies that no infection or vaccination occurs at the time step, and Theorem 3.1[3.] for signifies that either number of people are vaccinated, are infected, and remain susceptible at time , or number of people are infected, are vaccinated, and remain susceptible at time . Thus, the transition probabilities (LABEL:ch2.sec3.thm1.eq4)-(LABEL:ch2.sec3.thm1.eq6) represent a tri-variate distributions distributions for and .
4 Special SVIR Models
As remarked in Theorem 3.1, the transition probabilities of the chain are completely defined by the joint conditional distribution of the discrete random variables and . There are several possible discrete tri-variate distributions that can be used to specify the transition probabilities for the chain. In this section, we consider two different cases based on the definition of the probability of getting vaccination, infection or remaining susceptible in the next time step, for each susceptible individual in the population. The trinomial distribution will be used to characterize the transition probabilities for the chain .
4.1 The SVIR Influenza Model with vaccination dependent on infectious encounters
The following assumptions are made for the influenza epidemic in this subsection:- (a) it is assumed that the influenza epidemic is very severe and highly contagious, and the population is also well sensitized about the disease, and there are available vaccines for those seeking vaccination. Furthermore, because of the high prevalence of the disease, every susceptible person in the population at the end of any time interval , that is, is either infected, vaccinated or remains susceptible. Moreover, the probability of the susceptible individual from the generation escaping infection, and avoiding vaccination at time is denoted .
(b) Using ideas from [3, 1], we assume further that is the probability of becoming infected after one interaction with an infected individual, and all interactions between a susceptible person and infectious individuals are independent. Thus, at time , the probability that the susceptible individual from the generation becomes infectious after interaction with infectious individuals is denoted , and is given by
[TABLE]
(c) Let represent the number of people the susceptible individual interacts with at time . Note that is a random variable for each . In fact, is a random process. For the purpose of illustrating the method of expectation-maximization (EM) algorithm to find the maximum likelihood estimators (MLE’s) for the parameters of the influenza epidemic model, we shall assume that is a constant , that is . Moreover, we assume that is chosen in such a way that the binomial approximation for the number of infectious people a susceptible person meets can be used. In the next subsection, we characterize the random process as a poisson process.
(d) Furthermore, it is assumed that there is homogenous mixing in the population, and as a result, the probability that the susceptible person at the beginning of the time interval meets an infectious person also from the generation present at time is given as . Thus, the probability that the individual meets exactly infectious individuals is defined as follows:
[TABLE]
It follows from the assumptions (a)-(d) above that the probability of the susceptible individual of meeting infectious individuals and becoming infected at time is given by
[TABLE]
which simplifies to
[TABLE]
(e) It is assumed further that susceptible individuals are motivated to seek vaccination due to their encounter with infectious individuals, and such encounters within any one time unit which do not lead to infection, give rise to immediate desire in the susceptible person to get vaccinated within the one time unit. Furthermore, it is assumed that the disease has an incubation period at most one time unit, and the vaccine is relatively strong to reverse any infection that occurs within the span of one time unit, such that all susceptible people who get infected and are also vaccinated within the one time unit, obtain artificial immunity, and join the vaccinated class. Suppose a susceptible person eludes infection within a one time unit, let be the probability that the susceptible individual develops desire and gets vaccinated within the time interval.
Also, from assumption (a), since three outcomes are possible for any susceptible person at time , therefore, the probability that the susceptible person at the beginning of the time interval meets an infectious person also from the generation present at time , and does not become infected, and becomes vaccinated at time , denoted , satisfies , and is derived as follows:
[TABLE]
which simplifies to
[TABLE]
we now present the transition probabilities for the SVIR stochastic process , whenever .
Theorem 4.2
*Let the conditions of Theorem 3.1 be satisfied, and let the probability that the susceptible gets vaccinated at time , denoted be as defined in (4.6), the probability that the susceptible gets infected at time , denoted be as defined in (4.4), where . It follows that for , the transition probabilities for the stochastic process , are given as follows. For , and with only every susceptible individual vaccinated only once, it follows that *
[TABLE]
where .
For , and with multiple vaccination, it follows that
[TABLE]
where . In addition, the conditional marginal distributions of and are given as follows:
[TABLE]
and
[TABLE]
Proof:
Let be the random variable representing the number of people who remain susceptible at time . It follows form basic probability rules that
[TABLE]
From the assumptions (a)-(d) above, it is easy to see that since , then
[TABLE]
Note that (4.12) reduces to (4.7).
The result for (4.9)-(4.10) follows immediately from the assumptions (a)-(d) above.
4.2 The SVIR Influenza Model with exponential time until vaccination
In this subsection we consider a less aggressive influenza scenario, where the state of every susceptible person in the next time step is either just infected , just vaccinated , or the person remains susceptible . Due to limited space, we shall present only the assumptions of this special SVIR model, so as to enlighten about more realistic special cases of the SVIR model in Theorem 3.1. The analysis of this special case will appear in the second part of this study.
We let be the probability that in the next time step the susceptible person becomes infected, and be the probability that in the next time step the susceptible person becomes vaccinated, and be the probability that in the next time step the susceptible person remains susceptible, where , we shall use the multinomial distribution to characterize the transition probabilities for the SVIR stochastic process defined in Theorem 3.1. The following are slight modifications of the assumptions (a)-(d) made for the special SVIR model in Subsection 4.1.
Define be an indicator random variable for the event that the susceptible person becomes infected at time . Also let be a discrete random variable denoting the number of infectious people that the susceptible person meets, or interacts with at time .
(f) It is assumed for the influenza epidemic that the population is also well sensitized and educated about the disease, and there are available vaccines for those seeking vaccination, as well as some people in the population practice natural preventive techniques against the disease. Thus, because of availability of vaccination and natural control measures, every susceptible person in the population in the next time step , that is in , is either infected, vaccinated or remains susceptible to the disease.
(g) Similarly to (b) in Section 4.1, at time , the probability that the susceptible individual from the generation becomes infectious after interaction with infectious individuals is denoted , and is given by
[TABLE]
(h) Using ideas from [20], we let represent the number of people the susceptible individual interacts with at time , where . We assume that the random process is a poisson process with rate per unit time.
(i) Furthermore, it is assumed that there is homogenous mixing in the population, and as a result, the probability that the susceptible person at the beginning of the time interval meets an infectious person also from the generation present at time is given as . Thus, the probability that the susceptible individual meets exactly infectious individuals given that (g) holds is defined by
[TABLE]
Using (e)-(h), we find in the following lemma.
Lemma 4.1
Suppose the assumptions in (f)-(i) hold, then it follows that the probability that the susceptible individual meets infectious people at time , and becomes infected is given by
[TABLE]
Proof:
The probability that the susceptible individual meets infectious people at time , and becomes infected is given by applying multiplication rule in the following
[TABLE]
Applying the binomial theorem and Taylor series expansion of the exponential function and further algebraic modifications and simplifications, the right-hand-side of (4.16) reduces to (4.15), where
[TABLE]
The following assumptions are made to derive - the probability that in the next time step the susceptible person becomes vaccinated. Note that in some influenza scenarios, people get vaccinated independently, and at a constant rate per unit time. In other words, the number of people getting vaccinated over time follows a poisson process with rate . Thus, the time until the susceptible person gets vaccinated follows the exponential distribution with mean . Very easily, the probability that the susceptible person gets vaccinated in any interval , can be calculated using the poisson process as follows;
[TABLE]
We now use the trinomial distribution to characterize the transition probabilities of the SVIR stochastic process in the following:
Theorem 4.3
*Let the conditions of Theorem 3.1 be satisfied, and let the probability that the susceptible gets vaccinated at time , denoted be as defined in (4.18), the probability that the susceptible gets infected at time , denoted be as defined in (4.15), where , and is the probability that in the next time step the susceptible person remains susceptible. It follows that for , the transition probabilities for the stochastic process , are given as follows: *
[TABLE]
where and are defined above. Moreover, it follows from (4.19) that the conditional marginal distributions of and are binomials and given as follows:
[TABLE]
[TABLE]
where and .
Proof:
The result follows easily by applying the trinomial distribution to the conditional distribution .
The analysis of the special SVIR model in Theorem 4.3 appears in the part 2 of this study.
5 Parameter estimation
In this section, we find estimators for the true parameters of the SVIR Markov chain model using observed data for the state of the process over time. In Subsection 4.1, observe that both and depend on the probability of getting infection from one interaction with an infectious person, . Also, depends on - the probability that a susceptible person who has eluded infection develops desire and become vaccinated at time . Therefore, utilizing ideas from [20], we find a maximum likelihood estimator (MLE’s) for the probability of getting infection from one interaction with an infectious person, , and also for , for the chain in the case defined in Subsection 4.1, that is, when the transition probabilities are defined in (4.7).
Furthermore, we consider the random process , whenever without loss of generality. Also, note that the parameter represent fixed measurements in the population at each time , that is, and represent fixed measurements for events occurring in the population during the time interval , where the population at any time is defined by the random vector
[TABLE]
whenever .
Note that due to the size of this paper, investigations and MLE’s for the parameters for the SVIR chain-binomial model in Subsection 4.2 will appear in the Part-2 of this work.
Let be the observed value of the random vector at time , , defined in (3.3). That is,
[TABLE]
where are non-negative observed constant values for each component of , at time .
The population is observed over the time units, , where the initial state is assumed to be known. That is, is deterministic, and the observed data consists of the measurements
[TABLE]
We define the collection of random variables representing the population over times as follows:
[TABLE]
and from (5.2), the observed values of are given as
[TABLE]
We use the observed sample path of the process to find maximum likelihood estimators for the parameters . The generation of the sample path in (5.5) from the population over times is exhibited in Figure 2.
Remark 5.1
*It must be noted that inferences for the parameters such as confidence intervals and tests of significance, and consistency of estimators for the parameters are beyond the scope of this work, and will appear elsewhere. *
We assume that we have data for influenza over time units denoted , where is defined in (5.5), and is one realization of the human population over time denoted , defined in (5.4). From (5.2), (5.4), and (5.5), the likelihood function of is defined as follows:
[TABLE]
From (5), applying the multiplication rule, it is easy to see that
[TABLE]
But, since is a Markov Chain, and since it is assumed is known, it is easy to see that (5.7) reduces to
[TABLE]
It follows from (5.8) and (3.9) that for ,
[TABLE]
Observe that substituting (4.7) into (5.9) leads to the likelihood function with respect to the parameters . We should note that applying the maximization technique to find the MLE for using the likelihood function defined in (5.9) leads to an intractable equation for the derivative of the log-likelihood of . Thus, we apply the expectation maximization (EM) algorithm to find an appropriate MLE for - the probability of passing the infection from one independent infectious contact.
5.1 The EM Algorithm and Jensen’s Inequality
In this section, we consider the EM algorithm, where missing information is incorporated into the incomplete likelihood function, at random, and Jensen’s inequality is used to find a lower-bound for the complete log-likelihood function.
Recall, the Expectation Maximization (EM) algorithm is an iterative algorithm used to find the MLE of a parameter of a given distribution[13, 16]. There are two cases where the algorithm is most useful: (1) when the data available for the maximum likelihood estimation technique has missing components, and (2) when maximizing the likelihood function leads to an intractable equation, but adding missing data can simplify the process. It is for the second case in our problem that we utilize this EM algorithm.
Suppose we have observed data and likelihood function , and suppose the vector is missing data or a missing component, so that is the complete data. The complete log-likelihood function can be maximized to find the MLE of in two basic EM algorithm steps, namely - the expectation (E) step, and the maximization (M)-step.
The E-step consists of finding the expected value of the complete log-likelihood function with respect to the conditional mass of given and . That is, we find
[TABLE]
The M-step consists of maximizing to find an estimate for . This process is summarized in the following steps:
- i.
Let and be an initial guess for .
- ii.
Given the observed data , and assuming that the guess is correct, calculate the conditional probability distribution for the missing data .
- iii.
Find the conditional expected log-likelihood referred to as , that is,
[TABLE]
where .
- iv.
Find the that maximizes . The result will be the new . That is,
[TABLE]
- v.
Update and repeat step (iv) until stops noticeably changing.
The E-step can be obtained by applying Jensen’s inequality. We recall Jensen’s inequality [14] in the following:
Lemma 5.1
Suppose is a convex function, and is a random variable, then
[TABLE]
Conversely, if you have a concave function (e.g. a logarithmic function), then
[TABLE]
From (5.10), let represent the observed data defined in (5.5). The following random missing information are incorporated to make the log-likelihood function more tractable, where is given in (5.9).
(i.)
the collection , where for each and , is a discrete random variable representing the number of infectious individuals that the susceptible person of the generation meets at time (i.e. in the interval . Since from (4.3), a susceptible person only meets the fixed number of people, , at any time , therefore
[TABLE]
(ii.)
the collection , where for each , , and , is a categorical random variable indicating the event that the infectious individual passes the infection with probability , where , given that the susceptible person meets infectious people among the fixed number of people , at time . Thus,
[TABLE]
We consider a step-by-step approach to add the missing data and into the incomplete likelihood function .
Lemma 5.2
Let Theorem 4.2 hold. Given the missing information and defined in (5.15) and (5.16), then it follows from (4.7) and (5.9), that the log-likelihood function satisfies the following inequality:
[TABLE]
where , and , and also are probability terms that depends on and .
Proof:
From (5.9), denote the log-likelihood . It follows from (5.9) that adding the missing data , we obtain
[TABLE]
The equation (LABEL:sec4.sec6.lemma1.proof.eq1) can be expressed further as follows
[TABLE]
Applying Jensen’s inequity to (LABEL:sec4.sec6.lemma1.proof.eq2), leads to the following
[TABLE]
where (5.20) is given as follows
[TABLE]
We add the missing data in (5.16) into the partially complete log likelihood function ,
and
, ; , and apply the same technique in (LABEL:sec4.sec6.lemma1.proof.eq1)-(LABEL:sec4.sec6.lemma1.proof.eq4), as follows. From (5.20)
[TABLE]
Applying Jensen’s inequality on (5.22) we obtain the following
[TABLE]
where
[TABLE]
Remark 5.2
We note from (5.2) that the E-step of the EM algorithm consists of finding the conditional expectation term
[TABLE]
where , and is the estimate of in the step of the EM algorithm.
We specify an explicit expression for components of the E-step (5.25) in the following result.
Lemma 5.3
For each , , and , the following holds:
[TABLE]
[TABLE]
and
[TABLE]
Also,
[TABLE]
[TABLE]
and
[TABLE]
Furthermore,
[TABLE]
[TABLE]
and
[TABLE]
where .
Proof:
The equations (5.26)-(5.31) follow immediately from assumptions (a)-(d) in Subsection 4.1. For (5.32) and (5.34), we apply the multiplication rule and also apply assumptions (a)-(d) in Subsection 4.1. That is,
[TABLE]
Observe from assumptions (a)-(d) in Subsection 4.1 that, given infection is passed across only by the infectious person among the infectious persons encountered for that given instant, and also given that is the probability that a susceptible person changes the mind, and becomes vaccinated within that time instant after escaping infection from the infectious person, it follows that is the probability that a susceptible person gets vaccinated at any instant, and the conditional distribution of the random variable is binomial with parameters and . Thus,
[TABLE]
Also, the probability that the infectious person passes infection is given by
[TABLE]
and the probability that the susceptible person meets infectious people at time is given by
[TABLE]
Substituting (5.36)-(5.38) into (5.35) gives (5.32).
Similarly,
[TABLE]
From assumptions (a)-(d) in Subsection 4.1, it is easy to see that
[TABLE]
Furthermore, all the other components of (5.39) are obtained similarly as in (5.37)-(5.38).
Finally,
[TABLE]
From assumptions (a)-(d) in Subsection 4.1, it is easy to see that for ,
[TABLE]
Furthermore, all the other components of (5.41) are obtained similarly as in (5.37)-(5.38). Thus, from (5.39)-(5.42), the result in (5.34) follows immediately.
The following result presents an expression for the E-step of the EM algorithm.
Theorem 5.4
Assume that the results in Lemma 5.3 hold. For , the E-step of the EM algorithm in (5.25) in Remark 5.2 is expressed as follows for
[TABLE]
where , denotes a constant term, and is an estimate of at the step. Also
[TABLE]
Proof:
From Lemma 5.2 and (5.25), it is easy to see that
[TABLE]
where . Observe that
[TABLE]
Thus, (5.43) follows immediately from (5.45), where
[TABLE]
Also, (5.44) follows from Definition 2.4.
Remark 5.3
It follows from Theorem 5.4 that the M-step of the EM algorithm consists of maximizing with respect to . This is equivalent to maximizing the non-constant term of (5.43).
In the next result, we present the EM algorithm MLE estimator for .
Theorem 5.5
Let the E step of the EM algorithm be as defined in Theorem 5.4. It follows that the MLE for is given as follows:
[TABLE]
Proof:
From (5.43), it follows that the derivative
[TABLE]
It follows from (5.49) that since , then setting the right-hand-side to zero and solving for leads to
[TABLE]
Since for all the solution of from (5.49) remains the same and is given by (5.50), therefore the EM-algorithm converges for the value of in (5.48).
6 Some epidemiological parameters for evaluating the occurrence of epidemics
In this section, we calculate some epidemiological parameters to evaluate the prevalence of influenza. We consider two disease control parameters, namely: the basic reproduction number, and the probability of no spread. These parameters are used in [19, 15, 14, 17, 18].
Furthermore, these epidemiological parameters are calculated for the random process whenever the transition probabilities are defined in Theorem 4.2.
In the following, we characterize the prevalence of influenza from the initial infected population. That is, we calculate the expected number of infected individuals that occur over time, given an initial infected population. This information is useful to determine whether an epidemic will occur from the initial infected population.
6.1 Expected number of infected individuals
Recall Definition 2.4, for , and ,
[TABLE]
Furthermore, for and ,
[TABLE]
In the following result, we show that the expected infectious population over time, given the initial outbreak of influenza depends only the state of the process at one time lag.
Lemma 6.1
For any ,
[TABLE]
and
[TABLE]
Proof:
It is easy to see from Theorem 3.1 and Theorem 4.2 that
[TABLE]
Also, by the properties of conditional expectations, for any ,
[TABLE]
From (6.5), it follows that (6.6) reduces to,
[TABLE]
definitely.
Using Lemma 6.1, we present in general form the expected number of infectious people present at any time , given the population at the initial outbreak.
Theorem 6.6
Let the assumptions of Theorem 3.1 and Theorem 4.2 hold. For and , it follows that
[TABLE]
For and ,
[TABLE]
Proof:
For ,
[TABLE]
[TABLE]
For each , applying Lemma 6.1
[TABLE]
where is defined in (4.4). Substituting (6.1) into (6.11), we obtain the result in (6.8). Observe that the result for is obtained similarly as above.
For and
[TABLE]
[TABLE]
For each applying Lemma 6.1
[TABLE]
where is defined in (4.4). Substituting (6.1) into (6.14), we obtain (6.9).
Remark 6.1
It should be observed from Theorem 6.6 that an explicit form for the conditional expectation (6.8) and (6.9) can only be obtained provided the joint distribution of is known. However, since (6.8) and (6.9) represent population parameters at time (conditional population means), which are a sum of random variables that represent observations over time until the time , these parameters can be estimated point-wise using sample paths of the process , and the MLE of obtained in Theorem 5.5.
For example, for , (6.9) can be estimated using the sample path of
[TABLE]
The next result presents the estimates for the conditional population means at any time ,
Theorem 6.7
Assume that the conditions of Theorem 6.6 are satisfied. For , the conditional expected value in (6.9) denoted can be estimated using the sample path of the process namely:-
[TABLE]
In fact, for and
[TABLE]
estimates defined in (6.9).
Also, for and
[TABLE]
estimates defined in (6.8), where is the MLE of defined in (5.48).
Proof:
The results follow simply from (5.44) and the definition of defined in (6.9) and (6.8).
6.2 The basic reproduction number for the SVIR influenza epidemic
The basic reproduction number, generally denoted , is the expected number of secondary cases of infection from one infectious person or from an initial number of infectious people, , placed in a completely susceptible population, and it is the most widely used predictor of an epidemic outbreak. If , the disease is expected to die out. If , the disease is expected to spread out of control.
The basic reproduction number is highly dependent on the initial infectiousness of the disease and the duration of the disease in the initial infectious population. The basic reproduction number is important to determine disease control factors for an epidemic [15, 14, 17, 18]. In the absence of an explicit formula for the basic reproduction number , it can be estimated empirically using the methods in this section.
Observe from (6.8) that while is the initial infectious population, the second term represents the expected number of secondary infectious cases present at time , given that the initial disease outbreak had only number of infectious cases. Therefore, must be the basic reproduction number given that , where is the constant infectious period of every infectious person in the population. Using the results of Theorem 6.7, the basic reproduction number can be estimated the following.
Corollary 6.1
Let the assumptions of Theorem 6.6 and Theorem 6.7 be satisfied. The basic reproduction number is given implicitly as follows:
[TABLE]
For , (6.20) can be estimated using the following sample path of the process
[TABLE]
Furthermore, the estimate for is given as follows
[TABLE]
Proof:
The result follows very easily from the definition of the basic reproduction number, (6.19), and setting , since is the infectious period of individuals in the population.
Remark 6.2
It should be observed from Corollary 6.1 that an explicit form for the basic reproduction number in (6.20) can only be obtained provided the joint distribution of is known. However, since (6.20) represents a population parameter at time , which is a sum of random variables that represent observations over time until the time , this parameter is easily estimated-point using the sample path of the process given in (6.21), and the MLE of obtained in (5.50).
The advantage of in (6.22) as an initial point estimate for the actual value is its dependence primarily on empirical data for influenza obtained over time, and also the dependence on the feasible estimated value of the probability of passing infection from one infectious contact . Furthermore, with limited real data over several possible sample realizations for the stochastic process over a given time interval, the statistic can be studied numerically and the approximate sampling distribution generated and studied.
*More interval estimates for the parameter in (6.20), require explicit sampling distribution for the point estimate , which are beyond the scope of this work. *
7 Conclusion
In this study, we have sufficiently defined an SVIR Markov chain model for influenza which effectively shows the progression of the disease and effectiveness of the vaccine to control the disease over time for an individual in the population. Moreover, we defined the transition probabilities for the model. We presented two special cases of our model-(1) based on the assumption that the event of getting vaccinated at any instant depends on encounter with infectious people, and (2) the vaccination occurring over time with as a poisson process. We present detailed derivations of the probabilities of the susceptible individual getting vaccinated or infected at any instant, and further define the transition probabilities for each special case of the model.
We further used the maximum likelihood estimation technique and the expectation maximization (EM) algorithm to approximate the fixed parameters of the model. To evaluate the occurrence and prevalence of the epidemic, we derived estimators for the basic reproduction number , and the expected number of infected individuals at any time .
Finally, we presented numerical simulation examples for the influenza epidemic, and approximate the distribution of the total number of people in the population who ever get infected. Given these scenarios, we see how vaccination can curb the spread of influenza, and that the most limiting factor when trying to control the spread of influenza is the number of infectious people one interacts with per unit time.
8 References
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H.C. Tuckwell, R,J. Williams: Some properties of a simple stochastic epidemic model of SIR type. Math. Biosci. 208(2007), 76-97
- 2[2] M. Greenwood, On the statistical measure of infectiousness, J. Hyg. Camb. 31 (1931) 336
- 3[3] H. Abbey, An examination of the Reed-Frost theory of epidemics, Hum. Biol. 24 (1952) 201
- 4[4] M. Halloran, Ira Longini Jr., Claudio Struchiner: Design and analysis of Vaccine studies. Springer-Verlag New York (2010)
- 5[5] J. Reynolds, M. Torremorell, M. Craft. Mathematical Modeling of Influenza A virus Dynamics within Swine Farms and the Effects of Vaccination. P Lo S ONE 9(8): e 106177. (2014)
- 6[6] J. Jacquez, A note on chain binomial models of epidemic spread: What is wrong with the Reed-Frost Model. Mathematical Biosciences 87(1): 73-82 (1987)
- 7[7] J. Gani, D. Jerwood, Markov chain methods in chain binomial epidemic models. Biometrics 27, (1971)
- 8[8] M. Ferrante, E. Ferraris, C. Rovira, On a stochastic epidemic SEIHR model and its diffusion approximation, TEST (2016) 25: 482. https://doi.org/10.1007/s 11749-015-0465-z
