The Effect of Recombination on the Speed of Evolution
Nantawat Udomchatpitak

TL;DR
This paper models how recombination affects the speed of beneficial allele spread in a population, confirming that high recombination rates accelerate evolution, supporting Fisher and Muller's hypothesis.
Contribution
It provides an asymptotic analysis of how different recombination probabilities influence the time for beneficial alleles to fix in a population.
Findings
High recombination speeds up beneficial allele fixation.
Low recombination does not significantly affect fixation time.
Results support the advantage of sexual reproduction in evolution.
Abstract
It has been a puzzling question why some organisms reproduce sexually. Fisher and Muller hypothesized that reproducing by sex can speed up the evolution. They explained that in the sexual reproduction, recombination can combine beneficial alleles that lie on different chromosomes, which speeds up the time that those beneficial alleles spread to the entire population. We consider a population model of fixed size , in which we will focus on two loci on a chromosome. Each allele at each locus can mutate into a beneficial allele at rate . The individuals with 0, 1, and 2 beneficial alleles die at rates and respectively. When an individual dies, with probability , the new individual inherits both alleles from one parent, chosen at random from the population, while with probability , recombination occurs, and the new individual receives its twoβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Effect of Recombination on the Speed of Evolution
Nantawat Udomchatpitak111The author is supported in part by NSF grant DMS-1707953.
Abstract
It has been a puzzling question why some organisms reproduce sexually. Fisher and Muller hypothesized that reproducing by sex can speed up the evolution. They explained that in the sexual reproduction, recombination can combine beneficial alleles that lie on different chromosomes, which speeds up the time that those beneficial alleles spread to the entire population. We consider a population model of fixed size , in which we will focus on two loci on a chromosome. Each allele at each locus can mutate into a beneficial allele at rate . The individuals with 0, 1, and 2 beneficial alleles die at rates and respectively. When an individual dies, with probability , the new individual inherits both alleles from one parent, chosen at random from the population, while with probability , recombination occurs, and the new individual receives its two alleles from different parents. Under certain assumptions on the parameters and , we obtain an asymptotic approximation for the time that both beneficial alleles spread to the entire population. When the recombination probability is small, we show that recombination does not speed up the time that the two beneficial alleles spread to the entire population, while when the recombination probability is large, we show that recombination decreases the time, which agrees with Fisher-Muller hypothesis, and confirms the advantage of reproducing by sex.
Keywords. beneficial mutations, evolution, fixation time, recombination, selection
AMS 2010 subject classifications. Primary 92D15; Secondary 60J27, 60J75, 60J85
1 Introduction
It has been a puzzle in evolutionary biology why many organisms reproduce sexually. Sexually reproducing parents transmit just half of their genes to the offspring, which means that all beneficial alleles that the parent has might not be fully transmitted to the offspring. This does not happen to parents who reproduce asexually, since they transmit all their genes to the offspring. An advantage of sexual reproduction might come from recombination, which can combine portions of different chromosomes together. Fisher [8] and Muller [10] hypothesized that sexual reproduction can speed up the evolution. They explained that in an asexual population, for two beneficial mutations to survive, the second beneficial mutation has to occur in an individual that already has the first beneficial mutation, while in a sexually reproducing population, both beneficial mutations might occur on different individuals and recombination can later combine both mutations, which leads to an evolutionary advantage over asexual reproduction.
1.1 The model
We consider a population of fixed size consisting of chromosomes, which come from organisms of the same species. We are interested in two loci on the chromosome. One of the two loci contains either an or allele, and another locus contains either a or allele. Both the and alleles are beneficial. At time 0, all individuals have and alleles. Independently, each allele mutates to at exponential rate , and each allele mutates to at exponential rate . Individuals with 0, 1 and 2 beneficial alleles will die independently at exponential rates and , respectively. A new individual is created immediately to replace the individual who dies, in order to keep the population size fixed. With probability , no recombination occurs, in which case the new individual receives both alleles from a randomly chosen individual in the population at that time. With probability , recombination occurs, in which case the new individual receives the allele from a randomly chosen individual, and receives the allele from another independently randomly chosen individual. We will give an approximation for the first time that all individuals in the population have both beneficial alleles, when the population size is large. The result shows that this time is shorter when is large, consistent with the Fisher-Muller hypothesis.
1.2 Previous works
Takahata [14] considered a model of a population of finite size, where each individual consists of one chromosome. This model focuses on two loci on the chromosome. One locus contains either an or allele, and another locus contains either a and allele. The fitnesses of individuals of types and are assumed to be and respectively. The model also assumed recurrent mutations from to and from to , which means that mutations will never be exhausted. In the beginning, the frequency of type is assumed to be 1. Via simulation, the numerical fixation time of both and is given for some values of and in the following parameter regimes: 1) , 2) , 3) , 4) , and 5) .
Some non-rigourous works discuss the benefits of recombination. Crow and Kimura [4] argued that in large populations, sexual reproduction can incorporate more mutations due to recombination than asexual reproduction can. Several works pursued finding the relation between the speed of adaptation and the recombination rate. Neher, Kessinger, and Shraiman [11] considered a linear chromosome model assuming a large mutation rate and a weak selective effect. They obtained that the rate of adaptation is proportional to the square root of the recombination rate. Weissman and Barton [15] considered the regime where the mutation rate is small, and they obtained that the rate of adaptation is proportional to the recombination rate. Weissman and Hallatschek [16] considered the intermediate mutation rate regime and obtained that the rate of adaptation is proportional to the recombination rate. Lastly, Neher, Shraiman, and Fisher [12] considered a population model, where a large number of loci was considered. The recombination mechanism in this model is different from the other works mentions before. Under the assumptions that the selective advantage is weak and the recombination rate is much larger than the selective advantage, they obtained that in large populations, the rate of adaptation increases as the square of recombination rate.
We will now discuss some rigourous results. Cuthbertson, Etheridge, and Yu [5] considered a two loci model with finite population size . Each individual can be one of the four possible types: and . Both and are considered to be beneficial, and they increase the fitness by and respectively, with the assumption that . The mutation from to randomly occurs during the the time interval that is spreading in the population, and it appears as a type . For both and to spread to the entire population, there are three requirements. First, the number of type should become significant. Second, recombination between and must occur. Lastly, the number of type should become significant, after which is almost certain to fixate. The result shows that the fixation probability of can be approximated by the solution to a specific system of ODEs.
Bossert and Pfaffelhuber [3] considered a diffusion model with 4 types: and , where the fitnesses of and are in increasing order. The frequencies of these four types evolve according to a system of SDEs. In the beginning, the frequencies of types and are assumed to be small, and there is no type yet. They obtain approximate formulas for the fixation probability and fixation time of type .
Both Cuthbertson, Etheridge, and Yu [5] and Bossert and Pfaffelhuber [3] assume that at least one beneficial mutation is present at the beginning, and they do not allow an unlimited supply of new mutations. In the model studied in this paper, we assume that all individuals in the beginning do not have any beneficial mutations, and both beneficial mutations occur according to a Poisson process. This model is similar to the model given by Takahata in the case , but with finite population size.
Lastly, we mention another work by Berestycki and Zhao [2]. In their model, which involves branching Brownian motion in two dimensions, they showed that the fitnesses on two loci are negative correlated. They explained that recombination can reduce this negative correlation, and leads to a fitter population.
1.3 Conditions of the parameters
There are four parameters in our model: and . We assume that and . For any two sequences and , we say that if
[TABLE]
We will assume that and satisfy the following conditions:
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
where is defined to be if , and 0 if . Note that (2) and (3) imply that
[TABLE]
1.4 Main theorem
Theorem 1**.**
Let be the first time that all individuals in the population are type , which we also call the fixation time of . For every positive integer , and , we define
[TABLE]
Then, for every , we have that
[TABLE]
This theorem suggests that the time that both beneficial alleles spread to the entire population is approximately , when is large. From (5), when there is no recombination,
[TABLE]
When , we observe that . This means that when is large enough, it decreases the fixation time of , compared with when there is no recombination. From (3) and (4), for sufficiently large , we have that , and
[TABLE]
This implies that under our assumptions, which assume small recombination rates, in large populations, recombination can decrease the fixation time of by no more than a factor of one-third.
Lastly, we will show that these assumptions on the parameters are attainable. We consider when and for some positive numbers and . One can check that (1), (2), (3) and (4) are equivalent to and .
2 Overview of the proof
From now on, we will refer to an individual with , , , and as type 0, 1, 2, and 3 respectively, and we will omit writing the subscript in and . For and , we define as the number of type individuals at time and define , which is the fraction of type individuals in the population at time .
Before we consider the behavior of the process , we will first look at the condition . Intuitively, we donβt want the mutations to occur too slowly, so that we see one beneficial mutation spread to the entire population, before any other mutations take hold. The process by which a beneficial allele spreads to the entire population is also known as a selective sweep. Suppose that a mutation from to is the first to occur, and assume that it doesnβt go extinct. It will take time about to complete its selctive sweep (see section 6.1 of [7]). During this time, a mutation from to occurs at total rate of . The number of descendants of one of these new mutations can be approximated by an asymmetric random walk. So, the chance that each of these mutations survives is about . Hence, the number of mutations to that survive during the selective sweep of is approximately
[TABLE]
So, if , then there is no that survives during the sweep of . Hence, we will see spread to the entire population first, before appears and spreads. In this case, recombination does not speed up the time needed for the type to take hold in the population. So, we should consider when . Here, we make a slightly stronger assumption that .
Now, we will consider our process . The behavior of our process is essentially reduced to two cases. For the first case, which we will call the recombination dominating case, we assume that
[TABLE]
For the second case, which we will call the mutation dominating case, we assume that there is a positive constant such that for sufficiently large ,
[TABLE]
The reason for these names is that in the recombination dominating case, type 3 individuals start to appear from recombination between alleles from type 1 individuals and alleles from type 2 individuals, while in the mutation dominating case, the type 3 individuals start to appear from mutations from type 1 and type 2 individuals.
In the following table, we define times when we see significant changes in the behavior of the process.
{TAB}
[5pt]βcβlβlββcβcβcβcβcβcβTime & recombination dominating mutation dominating
\displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{\mu\sqrt{Nr}}\bigg{)}-\frac{C_{0,r}}{s}} \displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{N\mu^{2}}\bigg{)}-\frac{C_{0,m}}{s}}
\displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{\mu}\bigg{)}-\frac{C_{1}}{s}} \displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{\mu}\bigg{)}-\frac{C_{1}}{s}}
\displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{\mu}\bigg{)}+\frac{C_{2}}{s}} \displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s}{\mu}\bigg{)}+\frac{C_{2}}{s}}
\displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s^{2}}{\mu r\ln(Nr)}\bigg{)}+\frac{C_{3}}{s}} \displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s^{2}}{N\mu^{3}}\bigg{)}+\frac{C_{3}}{s}}
\displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s^{2}}{\mu r\ln(Nr)}\bigg{)}+\frac{C_{4}}{s}} \displaystyle{\frac{1}{s}\ln\bigg{(}\frac{s^{2}}{N\mu^{3}}\bigg{)}+\frac{C_{4}}{s}}
The constants , and are defined in (64), (62), (61), (145), (167), and (200). The reader does not need to know what these constants are exactly at this point, but should notice that is the lower order term in the definition of the . From now on, all statements are assumed to be true in both the recombination dominating case and the mutation dominating case, unless specified otherwise.
Overall, the behavior of the numbers of type 1, 2 and 3 are similar in the sense that they first grow exponentially, then grow logistically. Both types 1 and 2 grow simultaneously, but type 3 will start to grow later, due to the late appearance of type 3 individuals. The behavior of the process is split into five time intervals, which will be discussed below. During the time interval , which we will call phase 1, most individuals are type 0. The type 1 and type 2 individuals appear from mutations from type 0 individuals. Since type 1 and type 2 individuals die at rate , while the majority of the population, which is type 0, dies at rate 1, the numbers of descendants of these type 1 and 2 ancestors grow exponentially at rate approximately . Since the total rate of mutation from type 0 to type 1 is approximately , we have
[TABLE]
The type 3 individuals appear around time . From this time, the number of type 3 individuals will grow exponentially at rate about , due to the fact that each type 3 individual dies at rate , while most individuals in the population die at rate 1. The following proposition describes the process at time .
Proposition 2**.**
For and , there is an event , such that for sufficiently large , we have that , and the following statements hold:
On the event , when is sufficiently large, for ,
[TABLE] 2. 2.
In the recombination dominating case, on the event , there are positive constants and such that for sufficiently large ,
[TABLE] 3. 3.
In the mutation dominating case, on the event , there are positive constants and such that for sufficiently large ,
[TABLE]
This proposition says that when is sufficiently large, at time , both type 1 and type 2 have established themselves in the population by having their numbers reaching the level of order . However, is only of order in the recombination dominating case, and is only of order in the mutation dominating case, which from (3) and (4), means that number of type 3 at time is not yet comparable to those of type 1 and 2.
During the time interval , which we will call phase 2, the numbers of type 1 and 2 now grow logistically, or more precisely,
[TABLE]
for , where is some positive constant. The following proposition describes the process at time .
Proposition 3**.**
For and , there is an event , such that for sufficiently large , we have that , and the following statements hold:
On the event , for sufficiently large , for ,
[TABLE] 2. 2.
In the recombination dominating case, on the event , there are positive constants and such that for sufficiently large ,
[TABLE] 3. 3.
In the mutation dominating case, on the event , there are positive constants and such that for sufficiently large ,
[TABLE]
This proposition says that at time , almost half of the population becomes type 1, and almost the other half becomes type 2, while the number of type 3 individuals doesnβt change much from time .
During the time interval , which we will call phase 3, the majority of the population has become type 1 or type 2. The number of type 3 individuals continues to grow exponentially from time . However, since the majority of the population dies at rate , and a type 3 individual dies at rate , the type 3 population grows exponentially at approximately rate . The following proposition describes the behavior of the process at time .
Proposition 4**.**
For and , there is an event , such that for sufficiently large , we have that , and the following statements hold:
For sufficiently large , on the event , we have
[TABLE] 2. 2.
In both cases, there is a positive constant such that for sufficiently large , on the event , we have
[TABLE]
This proposition says that by the time , the number of type 3 individuals has reached order . Moreover, from (3) and (4), there are almost no type 0 individuals left by time .
During the time interval , which we will call phase 4, the number of type 3 individuals grows logistically. The following proposition describes the behavior of the process at time .
Proposition 5**.**
For and , there is an event , such that for sufficiently large , we have that , and on the event ,
[TABLE]
and
[TABLE]
This proposition implies that by time , almost all individuals have become type 3, and only small fractions of type 1 and 2 individuals remain in the population.
After time , which we will call phase 5, the number of individuals that are not type 3 can be approximated by a subcritical branching process. The non-type 3 population is heading toward extinction, and type 3 becomes fixated in the population. The fixation of type 3 will occur around time .
In section 3, we will discuss about transition rates of the process. In section 4, we construct martingales and submartigales, and give expectation and variance formulas. They will be used in the proofs of phases 1, 2, and 3 in sections 5, 6, and 7. In section 5, we will prove several lemmas on the process during phase 1, and at the end of the section, we give the proof of Proposition 2. Proposition 3, 4 and 5 will be proved in sections 6, 7, and 8 respectively. Finally, the proof of Theorem 1 will be given at the end of section 9.
3 On parameters and transition rates of the process
3.1 More inequalities on the parameters
Lemma 6**.**
The following statements hold.
In the recombination dominating case,
[TABLE] 2. 2.
In the mutation dominating case,
[TABLE] 3. 3.
In both cases,
[TABLE]
[TABLE]
and
[TABLE]
Proof.
We will first prove statement 1. In the recombination dominating case, from conditions (2) and (6),
[TABLE]
which implies that .
Now, we will prove statement 2 by contradiction. Suppose there is a and an increasing sequence of natural numbers such that for all , we have
[TABLE]
From (7), we have that for all ,
[TABLE]
This leads to a contradiction, since implies that
[TABLE]
as .
Lastly, we will prove statement 3. First, we will consider the recombination dominating case. By (4) and (11),
[TABLE]
From (6) and (12), it follows that
[TABLE]
and because of (2), for sufficiently large ,
[TABLE]
which implies (14). For the mutation dominating case, we define such that is the solution of
[TABLE]
It follows that . Therefore, by the same argument above,
[TABLE]
[TABLE]
and
[TABLE]
Also, from (7) and the fact that , for sufficiently large , we have . This fact along with (15), (16) and (17) imply (12), (13) and (14). β
3.2 Transition rates of the process
For the proof, we need to separate type 1 individuals into two groups: one that comes from mutation from type 0 individuals and another that comes from recombination between type 0 and type 3 individuals. We need to do the same for the other three types. The precise definitions are given below.
A type 1 (or 2) individual is called a type 1m (or 2m) ancestor, if it appears by mutation from a type 0 individual. 2. 2.
A type 1 (or 2) individual is called a type 1r (or 2r) ancestor, if it appears by recombination between a (or an ) allele from a type 0 individual and an (or a ) allele from a type 3 individual. 3. 3.
A type 1 individual is called an offspring of another type 1 individual if
- β’
receives the allele from , or
- β’
receives the allele from and receives the allele from a type 3. 4. 4.
A type 2 individual is called an offspring of another type 2 individual if a
- β’
receives the allele from , or
- β’
receives the allele from and receives the allele from a type 3. 5. 5.
A type 1 (or 2) individual is called type 1m (or 2m), if it descends from a type 1m (or 2m) ancestor. A type 1 (or 2) individual is called type 1r (or 2r), if it descends from a type 1r (or 2r) ancestor. 6. 6.
A type 3 individual is called a type 3m ancestor, if it appears from mutation from a type 1 individual or a type 2 individual. 7. 7.
A type 3 individual is called a type 3r ancestor, if it appears by recombination between an allele from a type 1 individual and a allele from a type 2 individual. 8. 8.
A type 3 individual is called an offspring of another type 3 individual if
- β’
receives the allele from , or
- β’
receives the allele from and receives the allele from a type 1 individual. 9. 9.
A type 3 individual is called type 3m, if it descends from a type 3m ancestor. A type 3 individual is called type 3r, if it descends from a type 3r ancestor. 10. 10.
A type 0 individual is called a type 0r ancestor, if it appears from recombination between an allele from a type 1 individual and a allele from a type 2 individual. 11. 11.
A type 0 individual is called an offspring of another type 0 individual if
- β’
receives the allele from , or
- β’
receives the allele from and receives the allele from a type 2. 12. 12.
A type 0 individual is called a type 0r if it descends from a type 0r ancestor.
For , we define as the number of type at time , and for , we define as the number of type at time . Note that for and , we have . Next, we define and to be the number of type and individuals at time , whose ancestor appears in the time interval . It follows that if , for , we have that , and for , we have that . We will call an individual type im(a,b] (or ir(a,b]), if it is of type im (or type ir) and its ancestor appears in the time interval . Lastly, we define , and to be the fractions of type im, ir, im(a,b] and ir(ab] in the population at time respectively.
Now, consider the process , First, we consider the rate that increases by 1. There are two ways to increase . First, a type 0 individual can mutate to a type 1 individual during the time interval , creating a type 1m(a,b] ancestor, which occurs at total rate
[TABLE]
Second, an individual that is not of type 1m(a,b] can die, which occurs at total rate
[TABLE]
and the new individual must be a type 1m(a,b]. The probability that recombination doesnβt occur and the new individual has type 1m(a,b] is . If recombination occurs, the new individual can come from combining an allele from a type 1m(a,b] individual with a allele from a type 0 or 1 individual, or combining an allele from a type 3 individual with a allele from a type 1m(a,b] individual. (Note that recombination between an allele from a type 3 individual and a alelle from a type 0 individual creates an ancestor of type 1r.) So, the probability that recombination occurs and the new individual has type 1m(a,b] is
[TABLE]
Hence, the total rate that the number of descendants of type 1m(a,b] increases by 1 is
[TABLE]
Let us define
[TABLE]
and note that increases by 1 at rate .
Similarly, the rate that the number of type 1m(a,b] individuals decreases by 1 is given by
[TABLE]
where is the total rate that type 1m(a,b] individuals die at time ,
[TABLE]
is the probability that we donβt create a type 1m(a,b] individual, and corresponds to the total rate that type 1m(a,b] mutates to type 3. We define
[TABLE]
and note that the number of type 1m(a,b] individuals decreases by 1 at rate .
We will now consider the process . We will first consider the rate that increases by 1. There are two ways to increase by 1. First, an individual that is not of type 1r(a,b] dies, and the recombination between an allele from a type 3 individual and a allele from a type 0 individual occurs during the time interval , which creates a type 1r(a,b] ancestor. This occurs at total rate of
[TABLE]
Second, an individuals that is not of type 1r(a,b] dies, and a new type 1r(a,b] individual is born from the type 1r(a,b] individuals at that time. Similar to the way we obtain (19) and (20), by defining
[TABLE]
one can see that the rate that increases by 1 is .
We will now consider the rate that decreases by 1. One way that decreases by 1 is when a type 1r(a,b] individual dies and the new individual is not of type 1r(a,b] (i.e, the new individual is not born from a type 1r(a,b] individual, and it is not a type 1r(a,b] ancestor). Another way is when a type individual mutates to a type 3 individual. By the same reason we used to obtain (22), the rate that decreases by 1 is
[TABLE]
and note that the term is precisely the probability that a type 1r(a,b] ancestor is created. By defining
[TABLE]
one can see that the rate that decreases by 1 is .
Now, we define
[TABLE]
By analogy, one can check that for , we have that increases by 1 at rate and decreases by 1 at rate . Also, for and 3, increases by 1 at rate and decreases by 1 at rate .
For , and , we define , which is the growth rate of the type im(a,b] population at time . For , and , we define . This is the growth rate of the type ir(a,b] population at time . Note that does not depended on the interval , because from (21), (23), and the fact that ,
[TABLE]
Similarly, we have
[TABLE]
Also, by similar calculation, we have
[TABLE]
From the fact that , and , it follows that for sufficiently large ,
[TABLE]
Lastly, for and , we define to be the number of type individuals at time that descend from one of the type individuals at time . It follows that for and ,
[TABLE]
and
[TABLE]
Following the argument we used to obtain and , for , we define
[TABLE]
and note that for , the process \big{(}X_{i}^{[a]}(t),t\geq a) increases by 1 at rate , and decreases by 1 at rate . Also, for all and , we can check that
[TABLE]
Lastly, we define for all . It follows that
[TABLE]
and note that from (37),
[TABLE]
4 Important Martingales and Submartingales
In this section, we will define several martingales and submartingales that will be used frequently in the proof. First, for and for , when , we define , and when , we define
[TABLE]
Also, for and for , when , we define , and when , we define
[TABLE]
It follows that for ,
[TABLE]
Let be the natural filtration of the process .
Proposition 7**.**
For , the process is a mean-zero martingale, and for ,
[TABLE]
Also, For the process is a mean-zero martingale, and for ,
[TABLE]
Moreover, if is a stopping time and , then for , the process is a mean-zero martingale, and for ,
[TABLE]
Also, for , the process is a mean-zero martingale, and for ,
[TABLE]
Proof.
The technique used in this proof was previously used in section 5.1 of [13]. We will prove the result for the process . The results for the other processes can be proved in the same manner.
For , let be the number of times in that the number of type individuals increases, and let be the number of times in that the number of type individuals decreases. Then, . Next, we define
[TABLE]
and , for all . Because and are exactly the rates that the process increases and decreases by 1 at time , and both and are 0, both the process and the process are mean-zero martingales. It follows that the processes and are also mean-zero martingales. Since is locally of bounded variation, its quadratic variation is
[TABLE]
Now, consider the process . The process is mean-zero martingale, by the definition of the sharp bracket. From equations (49), (50), and the fact that \big{(}W_{+}(t)+W_{-}(t),t\geq a\big{)} is a mean-zero martingale, we have that
[TABLE]
Now, for , we define
[TABLE]
Because both and are semimartingales, such that has continuous paths and the process is locally of bounded variation,
[TABLE]
Also, because
[TABLE]
for all , we have
[TABLE]
Using the Integration by Parts formula, we have
[TABLE]
Therefore, from (45) and (51),
[TABLE]
From (21) and (23), we have and for all . So, for all . Thus,
[TABLE]
for all . Hence, for each , we have . Therefore, from (52), the process \big{(}Z^{(a,b]}_{1m}(t),t\geq 0\big{)} is a square integrable martingale with
[TABLE]
This process has mean zero, because . By Corollary 8.25 of [9], \textup{Var}\big{(}Z^{(a,b]}_{1m}(t)\big{)}=E\big{[}\big{(}Z^{(a,b]}_{1m}(t)\big{)}^{2}\big{]}=E\big{[}\big{\langle}Z^{(a,b]}_{1m}\big{\rangle}(t)\big{]}, and this proves the variance formula by using (53) and (54). Lastly, because a stopped martingale is a martingale, the process is a mean-zero martingale, and by the same argument above, we can get the variance formula for the process . β
Since the process is a continuous-time Markov chain, combining Proposition 7 and Markov property yields the following result.
Corollary 8**.**
If is a stopping time and , then for and ,
[TABLE]
and for and for ,
[TABLE]
Now, for and , we define
[TABLE]
By a similar argument to the one used in proving Proposition 7 and Corollary 8, we get the following result.
Proposition 9**.**
If is a stopping time with , then for , the process is a martingale, and for all ,
[TABLE]
Lastly, for , for and , let us define
[TABLE]
and for , for and , we define
[TABLE]
Proposition 10**.**
If is a stopping time and , for , the process is a submartingale, and for ,
[TABLE]
For the process is a submartingale, and for ,
[TABLE]
Proof.
Consider the process \big{(}W_{im}^{(a,b]}(t\wedge T),t\geq a\big{)}. Because and , we have that . Thus,
[TABLE]
for all . So, E\big{[}W^{(a,b]}_{im}(t\wedge T)\big{]}<\infty for all .
[TABLE]
For , by Proposition 7, we have
[TABLE]
Thus, the process is a submartingale. From (57) and from the fact that the process is a mean-zero martingale by Proposition 7,
[TABLE]
The proof for the process can be done by a similar argument. β
5 Phase 1 and the proof of Proposition 2
5.1 Notations
First, note that to prove Propositions 2, 3, 4 and 5, it is enough to prove that they hold for all small values of and . We choose and as follow:
[TABLE]
and
[TABLE]
We will now define several constants, fixed times, and stopping times. In both the recombination dominating case and the mutation dominating case, we pick the following constants:
[TABLE]
Next, we define several fixed times as follows:
[TABLE]
and in both cases, we define
[TABLE]
It follows from these definitions, the fact that , and the fact that in the recombination dominating case that for sufficiently large , we have and . Now, in both cases, we define the following stopping times:
[TABLE]
Lastly, we define the following events:
[TABLE]
Also, we define
[TABLE]
We will show that these events occur with high probability. Here, we will prove some inequalities involving and , which will be used quite often in this section.
Lemma 11**.**
For sufficiently large , and , the following statements hold:
, for . 2. 2.
, , and . 3. 3.
, , and . 4. 4.
For , we have , , and . 5. 5.
For , we have , , and .
Proof.
By the definition of , and in (65), (69) and (73), for every , and for ,
[TABLE]
For statement 2, since for all , and , it follows that for sufficiently large , we have for all . Thus, by the definition of in (32), for sufficiently large , we have for all . Also, by part 1, if , then . Again, by using the definition of in (32), we get the lower bound of in statement 3. Both the upper and lower bounds for can be shown by similar arguments. Lastly, we can prove statements 4 and 5 by using (34), (35) and (36) along with statements 1, 2 and 3 of this lemma. β
5.2 Upper bounds for expectations
In this section, we are going to prove some results on the upper bounds for the expectations of and .
Lemma 12**.**
For sufficiently large , for and , we have
[TABLE]
and
[TABLE]
Proof.
The proof is similar to Lemma 5.1 in [13]. We will show the proof for , since the argument is similar for . We will first show that for sufficiently large , for , and for , we have
[TABLE]
If , this inequality is trivial, since by the definition of , we have . Assume that . By Proposition 7 and (45), we have E\big{[}Z_{1m}^{(a,b]}(t\wedge T_{(1)})\big{]}=0, and
[TABLE]
Note that in the event that , we interpret the integral from to as 0. Also, from the definition of , in the event , we have . Now, using the upper bound for in Lemma 11, we know that for sufficiently large , for , and ,
[TABLE]
Next, we use the lower bound for in Lemma 11. From (18), for sufficiently large , for , and ,
[TABLE]
From (86), (87) and (88), we have the inequality (85).
In the second part of the proof, for each , let , for . It follows from
[TABLE]
By letting , we have that for sufficiently large , , and ,
[TABLE]
The inequality (84) follows from the fact that . From (89), it follows that
[TABLE]
and the proof is completed. β
Lemma 13**.**
For sufficiently large , for , and , we have
[TABLE]
and
[TABLE]
Proof.
The proof is similar to the proof of Lemma 12. We will show the proof for , and the same argument can be used when . In the first part of this proof, we will show that for sufficiently large , for , and for , we have
[TABLE]
If , this inequality is trivial, since by the definition of , we have . Assume that . By Proposition 7 and (46), we have E\big{[}Z_{1r}^{(a,b]}(t\wedge T_{(1)})\big{]}=0, and
[TABLE]
Using the upper bound for in Lemma 11, we know that for sufficiently large , for , and ,
[TABLE]
Then, using the lower bound for in Lemma 11, along with the upper bound for in (38) and the definition of in (72), we have that for sufficiently large , for , and ,
[TABLE]
From (93), (94) and (95), we have the inequality (92). Lastly, by using (92) and following the argument in the second part of the proof of Lemma 12, we can prove (90) and (91). β
Lemma 14**.**
For sufficiently large and for , we have
[TABLE]
[TABLE]
and
[TABLE]
Proof.
The argument in this proof is similar to that of Lemma 12. We will first show that for sufficiently large , for , and for , we have
[TABLE]
If , this inequality is trivial, since by the definition of , we have . Let assume that . By Proposition 7 and (45), we have E\big{[}Z_{3m}^{(a,b]}(t\wedge T_{(1)})\big{]}=0, and
[TABLE]
Using the upper bound for in Lemma 11, we know that for sufficiently large , for , and ,
[TABLE]
Now, we use the formula for in (28), the lower bound for in Lemma 11, and the definition of and in (70) and (71). It follows that for sufficiently large , , and ,
[TABLE]
From (100), (101) and (102), we have the inequality (99). By following the argument in the second part of the proof of Lemma 12, it follows that for sufficiently large , when and ,
[TABLE]
and
[TABLE]
The inequality (96) follows from (103) and the fact that , and the inequalities (97) and (98) follow from (104) and the definitions of and in (67) and (68). β
Lemma 15**.**
For sufficiently large and , if , we have
[TABLE]
and if ,
[TABLE]
Proof.
The proof is similar to the proof of Lemma 12. We will first show that for sufficiently large , for and , we have
[TABLE]
If , this inequality is trivial, since by the definition of , we have . Assume that . By Proposition 7 and (46), we have E\big{[}Z_{3r}^{(a,b]}(t\wedge T_{(1)})\big{]}=0, and
[TABLE]
Using the upper bound for in Lemma 11, we know that for sufficiently large , for , and ,
[TABLE]
Then, we use the lower bound for in Lemma 11, along with the upper bound for in (38) and the definitions of and in (70) and (71), we have that for sufficiently large , for , and ,
[TABLE]
From (108), (109) and (110), we have the inequality (107). By similar argument to the second part of the proof of Lemma 12, we can show that for sufficiently large , for , and ,
[TABLE]
and
[TABLE]
The inequality (105) follows from the inequality (111) and the fact that . The inequality (106) is a special case of the inequality (112) when . β
Using these upper bounds on expectations, we can prove that when is sufficiently large, the event occurs with probability close to 1, and the proof is shown below.
Lemma 16**.**
For sufficiently large , we have .
Proof.
Recall the definition of in (74). First, note that
[TABLE]
Now, consider the term , for . Using Markovβs inequality, Lemmas 12 and 13, the definition of in (69), and (14), for sufficiently large ,
[TABLE]
Next, consider the term . By Markovβs inequality, Lemma 14, Lemma 15, and using (14), for sufficiently large , we have
[TABLE]
Thus, from (113), (114), (115) and the way we choose and in (60) and (61), for sufficiently large , we have . β
5.3 The variance bounds
By using the upper bounds for expectations, the variance formulas in Proposition 7, and the -maximal inequality, we can show that the probability that each of the events occurs is at least .
Lemma 17**.**
The following statements hold:
For sufficiently large , and for , we have . 2. 2.
In the recombination dominating case, for sufficiently large , we have .
Proof.
Recall the definitions of the events in (75) - (78). We will first prove that , when is sufficiently large. From (21), (23) and the facts that , and , for sufficiently large and for ,
[TABLE]
and
[TABLE]
From Proposition 7, Lemma 11, and Lemma 12, for sufficiently large ,
[TABLE]
From the definition of in (69) along with (14), and the facts that and , we have that
[TABLE]
By the way we choose and in (58), (60) and (65),
[TABLE]
By (116), (117) and the fact that , for sufficiently large ,
[TABLE]
By the -maximal inequality, for sufficiently large ,
[TABLE]
Hence, we have shown that . The proof for is in fact the same as that for .
Now, we will prove that . From (24), (25) and the facts that , and , for sufficiently large , for all , we have and . From Proposition 7, Lemma 11, and inequality (38), for sufficiently large ,
[TABLE]
From Lemma 13 and the definition of in (69), for sufficiently large ,
[TABLE]
Therefore, from (117), (118), and the fact that , for sufficiently large ,
[TABLE]
By the -maximal inequality, for sufficiently large ,
[TABLE]
We have proved that . The proof for is the same as the proof for .
Next, We will give a proof that . From (26), (27) and the facts that , and . for sufficiently large , for all , we have and . From Proposition 7, Lemma 11, and the definitions of and in (70) and (71), for sufficiently large ,
[TABLE]
By Lemma 14, and the definition of in (68), for sufficiently large ,
[TABLE]
From (117) and (118) along with the fact that , for sufficiently large ,
[TABLE]
By the -maximal inequality, we have that for sufficiently large ,
[TABLE]
Lastly, we will prove part 2. From (29), (30) and the fact that , and , for sufficiently large , for all , we have and . From Proposition 7, Lemma 11, inequality (39), and the definition of and in (70) and (71), for sufficiently large ,
[TABLE]
By Lemma 15 and the definitions of and in (66) and (69), for sufficiently large ,
[TABLE]
Because in the recombination dominating case, , by using the fact that along with (117) and (118), we have that for sufficiently large ,
[TABLE]
By the -maximal inequality, for sufficiently large ,
[TABLE]
which proves part 2. β
5.4 Results on type 3 individuals
In this section, we will show that the events and as defined in (79) and (80) occur with high probability. That is with probability close to 1, there are no type 3m (or 3r) individuals at time that are descended from type 3m (or 3r) ancestors that appear before time (or ). The proof consists of two main ideas.
With probability close to 1, the number of type 3m (or 3r) ancestors that appear before time (or ) is small. 2. 2.
With probability close to 1, each of these early ancestors will not have alive descendant by time .
At the end of this subsection, we will show that the events and as defined in (81) and (82) also occur with high probability.
Lemma 18**.**
Define and to be the number of type 3m ancestors and 3r ancestors respectively that appear in the time interval . For sufficiently large , the following statements hold:
P\bigg{(}m(t_{0,m}\wedge T_{(1)})\geq\frac{e^{-C_{0,m}/2}}{s}\bigg{)}\leq\epsilon.** 2. 2.
P\bigg{(}\rho(t_{0,r}\wedge T_{(1)})\geq\frac{e^{-C_{0,r}+1}}{s}\bigg{)}\leq\epsilon.**
Proof.
The process is a pure birth process with total birth rate as defined in (28). Then, there is a mean-zero martingale such that for all ,
[TABLE]
By Doobβs stopping theorem, is a mean-zero martingale. Thus,
[TABLE]
So, by Markovβs inequality and by the way we choose in (62),
[TABLE]
Now, consider the process . By (31), the process is a pure birth process, and the birth rate at time is given by as defined in (31). Then, there is a mean-zero martingale such that for all ,
[TABLE]
By Doobβs stopping theorem, is a mean-zero martingale. Thus,
[TABLE]
From the definition of in (66), if we are in the recombination dominating case or in the mutation dominating case with ,
[TABLE]
and in the mutation dominating case when , we have
[TABLE]
Hence, from (119),
[TABLE]
Lastly, by Markovβs inequality and the definition of in (64),
[TABLE]
β
Lemma 19**.**
For , define to be the time that the ith type 3m ancestor appears, where we set if the ith type 3m ancestor never appears. Let be the number of descendants of the ith type 3m ancestor alive at time . Then, for sufficiently large N, for all ,
[TABLE]
Proof.
First, define for all and . By following the same reasoning that led us to get the rates in (26) and (27), we have that on the event , the process is a birth-death process with , where each individual gives birth at rate
[TABLE]
and dies at rate
[TABLE]
Note that for ,
[TABLE]
and
[TABLE]
For , define . Define for . The process is a birth-death process with , where each individual gives birth at rate
[TABLE]
and dies at rate
[TABLE]
Let be a birth-death process where , where each individual gives birth at rate 1 and dies at rate . From the generating function of birth and death process (in the section 5 of Chapter III of [1]), for ,
[TABLE]
Since , we have that for sufficiently large ,
[TABLE]
By Lemma 11 and (118), on the event , we have for all , which implies that
[TABLE]
It is possible to couple the process with the population process, such that 1) on the event , for any time , if , then , and 2) the process is independent of . It follows that
[TABLE]
Lastly, using (122) and (121), we have
[TABLE]
β
Lemma 20**.**
For , define to be the time that the ith type 3r ancestor appears, where we set , if the ith type 3r ancestor never appears. Let be the number of descendants of the ith type 3r ancestor alive at time . Then, for sufficiently large N, for all ,
[TABLE]
Proof.
The proof is similar to that of Lemma 19. First, define for all and . By following the same reasoning that led us to get the rates in (29) and (30), we have that on the event , the process is a birth-death process with , where each individual gives birth at rate
[TABLE]
and dies at rate
[TABLE]
Note that when , we have .
For , let . Define for . The process is a birth-death process with , where each individual gives birth at rate
[TABLE]
and dies at rate
[TABLE]
Since the function is strictly increasing on the interval , we have that if , then Hence, from Lemma 11, for every and and 3, we have , and . Now, because , by (124), for sufficiently large , for ,
[TABLE]
Let be a birth-death process where , where each individual gives birth at rate 1 and dies at rate . By the same argument we used to get (120), for ,
[TABLE]
We claim that for sufficiently large ,
[TABLE]
From (125) and the definition of in (64), in the recombination dominating case and the mutation dominating case with , we have that for sufficiently large ,
[TABLE]
and in the mutation dominating case with , we also have
[TABLE]
On the event , using (118), we have for all . By following the same reasoning in (122),
[TABLE]
It is possible to couple the process with the population process, such that 1) on the event , for any time , if , then , and 2) the process is independent of . By the same reasoning we used to get (123), it follows that for sufficiently large ,
[TABLE]
β
Now, we are ready to show that the events and occur with probability close to 1.
Lemma 21**.**
For sufficiently large , we have , and .
Proof.
Recall the definitions of and in (79) and (80). We will only show that . The same reasoning can be used to prove that .
Let . By Lemma 19, we have that for sufficiently large ,
[TABLE]
Hence, by Lemma 16 and Lemma 18 along with the way we choose and in (58), (60) and (62), for sufficiently large ,
[TABLE]
So, this prove that . β
Lemma 22**.**
For sufficiently large , we have , and .
Proof.
Recall the definition of in (81). From Lemma 14 and the definition of in (69), for sufficiently large ,
[TABLE]
and from the Markovβs inequality, we get that .
Now, recall the definition of in (82). We will first consider the recombination dominating case and the mutation dominating case with . Recall that in the recombination dominating case. From Lemma 15 and the definition of in (66), for sufficiently large ,
[TABLE]
In the mutation dominating case with , from Lemma 15 and the definition of in (66), for sufficiently large ,
[TABLE]
Thus, in both cases, for sufficiently large ,
[TABLE]
and is followed from the Markovβs inequality. β
Before we prove Proposition 2, we will give both upper and lower bounds of the numbers of type 1 and 2 individuals on the event .
Lemma 23**.**
The following statements hold:
On the event , for , for sufficiently large and for ,
[TABLE] 2. 2.
In the recombination dominating case, on the event , for , for sufficiently large , and for every , we have
[TABLE] 3. 3.
In the mutation dominating case, on the event , for , for sufficiently large , and for , we have
[TABLE]
Proof.
In this proof, we assume that we are on the event . From (47), we have that for all ,
[TABLE]
From Lemma 11, definitions of and in (74) and (75), and the fact that , for sufficiently large and for ,
[TABLE]
Next, from (48), we have that for all ,
[TABLE]
From (38), Lemma 11, and definitions of and in (74) and (76), for sufficiently large and for ,
[TABLE]
By the definition of in (72), inequalities (14), (117) and the fact that , it follows that for sufficiently large and for ,
[TABLE]
Therefore, for sufficiently large , for all , we have
[TABLE]
Note that by similar argument, we can also prove the upper bound for .
To prove the lower bound for in the recombination dominating case, we first need to consider the term . By using (32), part 1 of this lemma and the definition of in (72), we have that when is sufficiently large, for ,
[TABLE]
Now, using the fact that , the definition of in (69) along with (117), we have that when is sufficiently large, for ,
[TABLE]
Also, using part 1 of this lemma, the definition of in (72) and the fact that , for sufficiently large , and for ,
[TABLE]
Thus, from (126), (127), (128), along with the definition of in (76) for sufficiently large , for all ,
[TABLE]
In the recombination dominating case, we have and . So, by using the definition of in (66), we have that
[TABLE]
Thus, from (129), (117), and the way we choose as in (61), for sufficiently large , and for all ,
[TABLE]
The proof for the mutation dominating case is almost exactly the same as that of the recombination dominating case by replacing by , and using that because , we have
[TABLE]
which completes the proof. β
5.5 The proof of Proposition 2
Proof.
By the definition of in (83) and Lemmas 16, 17, 21, and 22, for sufficiently large , we have that . From now on, we will assume that we are working on the event . The statement 1 follows from Lemma 23 by inserting .
Now consider . From the definitions of and , in (79), (80), (81) and (82), it follows that
[TABLE]
and
[TABLE]
In the recombination dominating case, and satisfy (6). It follows from (130) and (131) that if is sufficiently large, then
[TABLE]
So, we choose the positive constant
[TABLE]
Next, consider the mutation dominating case. In this case, satisfies (7), and we also have that . It follows from (130) and (131) that if is sufficiently large, then
[TABLE]
and
[TABLE]
Thus, we choose the positive constant
[TABLE]
Now, we will show the lower bound of . First, consider the recombination dominating case. To prove the lower bound, we will need to consider the term . Similar to the way we get (127) by using (36) instead of (34), for ,
[TABLE]
By (117), when is sufficiently large, for
[TABLE]
By (31) and Lemma 23, for sufficiently large , and for ,
[TABLE]
Using (48), (132), (133), Lemma 11 and the definitions of in (78), for sufficiently large ,
[TABLE]
It follows from the definitions of and in (69) and (66) that for sufficiently large ,
[TABLE]
By (117) and the fact that , we have that for sufficiently large
[TABLE]
and we choose the positive constant
[TABLE]
Lastly, consider the mutation dominating case. By the same argument we used to obtain (132), we have that for sufficiently large and for ,
[TABLE]
From (47), Lemma 11, Lemma 23, and the definition of in (77), for sufficiently large ,
[TABLE]
Using the definitions of and in (69) and (67), and the fact that , for sufficiently large ,
[TABLE]
Note that the way we define in (63) is precisely to make
[TABLE]
Hence, we choose the positive constant
[TABLE]
This completes the proof. β
6 Phase 2 and the proof of Proposition 3
6.1 Comparing the Markov chain with a differential equation
Theorem 24 below is a special case of Theorem 4.1 of [6]. Let be a continuous time Markov chain with finite state space . Let be the jump rate from the state to the state . For each state , define the function by
[TABLE]
where is the Euclidean norm, and define the function by
[TABLE]
It follows that
[TABLE]
for some martingale .
Let be a Lipschitz function with Lipschitz constant . Let be the function that satisfies
[TABLE]
The goal is to compare with .
Fix , , , and let . Define the events
[TABLE]
Theorem 24**.**
Under all the assumptions above,
[TABLE]
Now, we will apply this theorem to our process . First, for , we define
[TABLE]
and . We are thinking of and as the fractions of type 1, 2 and 3 individuals in the population. For better understanding in the following formulas, we will define , which represents the fraction of type 0 individuals in the population. Now, for each , we define
[TABLE]
Note that for each , the quantity represents the probability that a new individual born is of type . Next, for and , the transition rate is given by
[TABLE]
The reasons behind the formulas for these rates are similar to the ones we used to obtain the birth and death rates in section 3.2. Let us consider the first rate. It is the rate that the number of type 0 individuals decreases by 1 and the number of type 1 individuals increases by 1. There are two ways for this to occur: 1) a type 0 individual mutates to type 1, which occurs at total rate of , and 2) a type 0 individual dies and is replaced by a type 1 individual. The total rate that a type 0 individual dies is , and the probability that the replacement is of type 1 is .
We define the functions and as in (134) and (135). For such that , we have , since it is equal to or . Because for each and , we have , and because , it follows that for sufficiently large , for all , we have . By the definition of in (134), for sufficiently large ,
[TABLE]
For each , we define
[TABLE]
A tedious calculation gives
[TABLE]
Note that for , the row of is exactly the rate at which the number of type individuals increases by 1 minus the rate at which the number of type individuals decreases by 1.
Here, we define the functions and by
[TABLE]
and
[TABLE]
Since all first partial derivatives of are bounded, the function is Lipschitz. Hence, is also Lipschitz with Lipschitz constant , where and does not depend on .
Now, we define a random variable such that on the event that , we have
[TABLE]
The value of on the event that is not of interest, as we will work only on the event when is sufficiently large. By Proposition 2, we know that on the event . Next, for , we define
[TABLE]
and for , we let
[TABLE]
Note that for , we have , and for all , we have . From (143), for ,
[TABLE]
and it follows that
[TABLE]
From (140), (144), (143), and the fact that for all , we have that for ,
[TABLE]
Therefore, for , we have , and
[TABLE]
We pick the constant
[TABLE]
and we define
[TABLE]
We will use Theorem 24 to show that with probability almost 1, both and are close to and for . We define the event
[TABLE]
Lemma 25**.**
For sufficiently large , we have on the event .
Proof.
Let . We will first prove that for sufficiently large , on the event ,
[TABLE]
[TABLE]
Because for all , and , we have
[TABLE]
for some positive constants and . Thus,
[TABLE]
In the recombination dominating case, since , , , and , if is sufficiently large, then
[TABLE]
[TABLE]
and
[TABLE]
In the mutation dominating case, since , , an , if is sufficiently large, then (150) holds and
[TABLE]
In this proof, we assume that is large enough so that in the recombination dominating case, (149), (150) and (151) hold, and in the mutation dominating case, (149), (150) and (152) hold.
Now, let us consider the process . By Markov property of the process, if we condition on , the process after time behaves as if we start the whole process again with as the initial condition. Now, let us fix the value of , and consider the process starting at time with this initial condition. Note that by starting the process from this fixed start point, the function and defined in (143) and (144) are no longer random, which allows us to use Theorem 24.
We define , and note that , which is in the form required in order to use Theorem 24. We let and define the events
[TABLE]
First, we consider . In the recombination dominating case, if satisfies (8) and (9), then by (151), we have
[TABLE]
Similarly, in the mutation dominating case, if satisfies (8) and (10), then by (152), we have
[TABLE]
Next, because of (148) and (150), we have that . Lastly, by (138), it follows that
[TABLE]
So, .
Therefore, if satisfies (8) and (9) in the recombination dominating case, or satisfies (8) and (10) in the mutation dominating case, by Theorem 24 and (149), we have that
[TABLE]
Note that the upper bound does not depend on the value of . By Proposition 2, on the event , we know that satisfies (8) and (9) in the recombination dominating case, and satisfies (8) and (10) in the mutation dominating case for sufficiently large . Using the Markov property of the process, we have that on the event ,
[TABLE]
Thus, from the definition of the event in (147), on the event ,
[TABLE]
which completes the proof. β
6.2 Results on type 3 individuals
We will now show that for sufficiently large , with probability close to 1, has the same order as in the recombination dominating case, and has the same order as in the mutation dominating case. The proof mainly has two parts. In the first part, we will show that , which was defined to be the number of type 3 individuals at time that descend from the type 3 individuals at time , has order in the recombination dominating case, and in the mutation dominating case. In the second part, we show that and are much smaller than .
Lemma 26**.**
For sufficiently large , for all
[TABLE]
Proof.
From (55) and Proposition 9, we have that for ,
[TABLE]
Because of Lemma 11, for sufficiently large ,
[TABLE]
Thus, for sufficiently large ,
[TABLE]
which proves this lemma. β
Lemma 27**.**
The following statements hold:
In the recombination dominating case, there is a positive constant , such that for sufficiently large , on the event , we have
[TABLE] 2. 2.
In the mutation dominating case, there is a positive constant , such that for sufficiently large , on the event , we have
[TABLE]
Proof.
First, consider the recombination dominating case. From (41) and (42), for all , we have that and . Also, from (33) and the fact that , for sufficiently large , for all ,
[TABLE]
By Proposition 9, (153), and Lemma 26, for sufficiently large ,
[TABLE]
By Proposition 2 and the definitions of and in (69) and (146), for sufficiently large , on the event ,
[TABLE]
We define
[TABLE]
Since the process \big{(}Z_{3}^{[t_{1}]}(t),t\geq 0) is a martingale, we have that . Hence, by Chebyshevβs inequality, we have that for sufficiently large , on the event ,
[TABLE]
For the mutation dominating case, the proof is almost exactly the same. The only difference is the inequality (154), for which Proposition 2 gives that
[TABLE]
In this case, we pick
[TABLE]
This completes the proof. β
Lemma 28**.**
There exist positive constants and such that for sufficiently large , we have
\displaystyle P\bigg{(}X_{3m}^{(t_{1},t_{2}]}(t_{2})\geq\frac{K^{\prime}_{1}}{\epsilon}\cdot\frac{N\mu}{s}\hskip 2.84544pt\Big{|}\hskip 2.84544pt\mathcal{F}_{t_{1}}\bigg{)}\leq\epsilon.** 2. 2.
\displaystyle P\bigg{(}X_{3r}^{(t_{1},t_{2}]}(t_{2})\geq\frac{K^{\prime}_{2}}{\epsilon}\cdot\frac{Nr}{s}\hskip 2.84544pt\Big{|}\hskip 2.84544pt\mathcal{F}_{t_{1}}\bigg{)}\leq\epsilon.**
Proof.
We will first prove part 1. Let and be the numbers of times that the number of type individuals increases and decreases respectively during the time interval . Then, for , we define
[TABLE]
Then, both processes and are mean-zero martingales, and so is the process . We also have that
[TABLE]
Thus, from Lemma 11, for sufficiently large , if , then
[TABLE]
Here, we define
[TABLE]
From Gronwallβs inequality, we have
[TABLE]
and by Markovβs inequality, we have that
[TABLE]
Now, we will prove part 2. The proof is similar to the the proof for part 1. First, we have that there is a mean-zero martingale such that
[TABLE]
for all . From (31), Lemma 11 and , for sufficiently large , and for ,
[TABLE]
We define
[TABLE]
From Gronwallβs inequality, we have
[TABLE]
and the result follows from Markovβs inequality. β
Recall the constants and defined in (155), (156), (157) and (158). Now, we define the following events in both cases:
[TABLE]
In the recombination dominating case, we define
[TABLE]
while in the mutation dominating case, we define
[TABLE]
Lastly, in both cases, we define
[TABLE]
Lemma 29**.**
On the event , for sufficiently large , and for , we have
[TABLE]
and
[TABLE]
Proof.
First note that if , then the function is increasing on the interval . Then, from Proposition 2, on the event , for sufficiently large ,
[TABLE]
and
[TABLE]
By the same argument, we get the same bounds for .
Now, recall the definitions of and in (142) and (143). By Proposition 2, (164) and the definitions of and in (69), (145) and (146), for sufficiently large , on the event ,
[TABLE]
and
[TABLE]
From the way we define and in (59) and (61), we have , and
[TABLE]
Thus, we have
[TABLE]
This completes the proof of this lemma. β
6.3 The proof of Proposition 3
Proof.
Recall the definition of in (163). From Proposition 25, Lemma 27 and Lemma 28, for sufficiently large , on the event
[TABLE]
Thus, from Proposition 2, we have
[TABLE]
From now on, we will work on the event . By the definition of the event in (147), the definition of the function in (144), and Lemma 29, for sufficiently large , on the event ,
[TABLE]
and
[TABLE]
Both the upper and lower bounds for follow from the same argument.
Now, we prove statement 2. Assume that we are in the recombination dominating case. By the definition of in (55), the definition of in (161), the inequality (153) and Proposition 2, on the event ,
[TABLE]
We define
[TABLE]
Because and , for sufficiently large ,
[TABLE]
By the definitions of and in (159), (160) and (161), and by Proposition 2, we have that for sufficiently large , on the event ,
[TABLE]
We define the constant
[TABLE]
Because and , for sufficiently large ,
[TABLE]
Lastly, consider the mutation dominating case, where we will prove statement 3. The proof is similar to the proof of part 3. First, by using (162) instead of (161), for sufficiently large , on the event ,
[TABLE]
We define
[TABLE]
Since , for sufficiently large , on the event ,
[TABLE]
By the definitions of and in (159), (160) and (162), and by Proposition 2, we have that for sufficiently large , on the event ,
[TABLE]
We define the constant
[TABLE]
Because and , for sufficiently large ,
[TABLE]
This completes the proof. β
7 Phase 3 and the proof of Proposition 4
In this phase, we will use martingales and submartingales to approximate the number of type 0 and type 3 individuals. The ideas of the proof are similar to those used in phase 1. At the end of this section, we will give a proof for Proposition 4.
We define the constant
[TABLE]
where the constants and are defined in the equations (165) and (166). We define the time
[TABLE]
Next, we define the following stopping times:
[TABLE]
In both cases we define
[TABLE]
In the recombination dominating case, we define the following events:
[TABLE]
In contrast, in the mutation dominating case, we define
[TABLE]
Lastly, in both cases, we define
[TABLE]
We will first give bounds on the growth rates of type 0 and type 3 populations.
Lemma 30**.**
The following statements are true.
If , then . 2. 2.
If , then . 3. 3.
If , then s(1-\tilde{X}_{3}(t))-r\leq G_{3}(t)\leq s\big{(}1+\delta e^{-s(1-3\delta)(t-t_{2})}\big{)}. 4. 4.
If , then s(1-\tilde{X}_{3}(t))-r\leq G^{(t_{2},t_{3}]}_{3r}(t)\leq s\big{(}1+\delta e^{-s(1-3\delta)(t-t_{2})}\big{)}+r.
Proof.
By the definition of in (169), if , then , and from (43), we have that . From (44), if , then , and by using the fact that for all , we also have that .
Now, from the definition of in (171), if , then the equation (33) implies that G_{3}(t)\leq s(1+\tilde{X}_{0}(t))<s\big{(}1+\delta e^{-s(1-3\delta)(t-t_{2})}\big{)}, and . Part 4 follows directly from part 3 and (36). β
7.1 Results on type 0 individuals
Lemma 31**.**
For sufficiently large , on the event , we have .
Proof.
First, from part 2 of Proposition 3, on the event , we have that . From Proposition 9, the process is a martingale. Hence, by Lemma 30 and Doobβs maximal inequality, for sufficiently large , on the event ,
[TABLE]
which proves the lemma. β
Lemma 32**.**
For sufficiently large , we have .
Proof.
We will first prove this result in the recombination dominating case. By Proposition 10, the process \big{(}W_{0r}^{(t_{2},t_{3}]}(t\wedge T_{(3)}),t\geq t_{2}\big{)} is a submartingale. Also, note that from the definitions of and in (146) and (168), we have that
[TABLE]
From Proposition 10, Lemma 30 part 2, (40), and the definition of in (170), we have
[TABLE]
Because and , for sufficiently large ,
[TABLE]
and it follows that
[TABLE]
Also, since , we have
[TABLE]
Hence, from (184), for sufficiently large , we have
[TABLE]
Thus, from (185), Lemma 30 part 2 and Doobβs maximal inequality, for sufficiently large ,
[TABLE]
Now, for the mutation dominating case, we observe that from the definitions of and in (146) and (168), we have
[TABLE]
From the fact that and , we have
[TABLE]
Also, from and (14), we get
[TABLE]
which show that (185) and (186) also hold in this case. By following the same argument as in the recombination dominating case, we obtain that for sufficiently large ,
[TABLE]
and . β
7.2 Results on type 3 individuals
Lemma 33**.**
For sufficiently large , we have that for ,
[TABLE]
and .
Proof.
The proof is similar to that of Lemma 14. First, recall that the process \big{(}Z_{3m}^{(t_{2},t_{3}]}(t\wedge T_{(3)}),t\geq t_{2}\big{)} is a mean-zero martingale by Proposition 7. By (45), for all , we have
[TABLE]
From (28), Lemma 30 part 3, and the definition of in (170), we have that for every ,
[TABLE]
From (185), for sufficiently large and for all ,
[TABLE]
Also, by Lemma 30 part 3, we have that for all ,
[TABLE]
Therefore, using that , for sufficiently large , we have that if , then
[TABLE]
It follows from this inequality, along with (183), (187) and the definition of in (167) that in the recombination dominating case, for sufficiently large ,
[TABLE]
while in the mutation dominating, for sufficiently large ,
[TABLE]
Thus, by Markovβs inequality, in both cases, we have that . β
Lemma 34**.**
For sufficiently large , we have that for ,
[TABLE]
and .
Proof.
The proof is almost exactly the same as that of Lemma 33. Recall from Proposition 7 that the process \big{(}Z_{3r}^{(t_{2},t_{3,r}]}(t\wedge T_{(3)}),t\geq t_{2}\big{)} is a mean-zero martingale. By (46), for all , we have
[TABLE]
From (39), we have that for all . Using the same reason as in (188), for every ,
[TABLE]
Also, by Lemma 30 part 4, we have that for all ,
[TABLE]
Therefore, using that , from (185), we have that if , then
[TABLE]
It follows from this inequality, along with (183), (187) and the definition of in (167) that in the recombination dominating case, for sufficiently large ,
[TABLE]
while in the mutation dominating case, for sufficiently large ,
[TABLE]
Thus, by Markovβs inequality, in both cases, we have that . β
Next, we will bound the probabilities of the events and , but we will need an upper bound for the term E\big{[}X_{3}^{[t_{2}]}(t\wedge T_{(3)})|\mathcal{F}_{t_{2}}\big{]} first.
Lemma 35**.**
For sufficiently large , for all , on the event , we have
[TABLE]
Proof.
From Proposition 9, we know that \big{(}Z_{3}^{[t_{2}]}(t\wedge T_{(3)}),t\geq t_{2}\big{)} is a martingale. So, from (55), Lemma 30 part 3, and the fact that , for all ,
[TABLE]
Therefore, for all ,
[TABLE]
and from the upper bound of on the event in Proposition 3, the result follows. β
Lemma 36**.**
For sufficiently large , on the event , we have .
Proof.
In the recombination dominating case, from Lemmas 33, 34 and 35, we have
[TABLE]
Because and , along with the definition of in (167), for sufficiently large , on the event , we have
[TABLE]
Thus, by Markovβs inequality, we have .
For the mutation dominating case, we can follow the same argument. Note that in this case, instead of getting (189), Lemma 35 gives that
[TABLE]
Because and , for sufficiently large , on the event , we have
[TABLE]
and by following the previous argument, we prove the result. β
Lemma 37**.**
For sufficiently large , on the event , we have .
Proof.
We first consider the recombination dominating case. From Proposition 9, part 3 of Lemma 30, Lemma 35, and (185), for sufficiently large , on the event , we have that
[TABLE]
It follows from this inequality and the maximal inequality that
[TABLE]
For the mutation dominating case, the argument is exactly the same except at (190), the upper bound from Lemma 35 gives
[TABLE]
and the result follows by applying the maximal inequality. β
Lemma 38**.**
For sufficiently large , on the event , we have .
Proof.
First, by the definition of in (171), we have that
[TABLE]
It follows from this inequality, Markovβs inequality, Lemma 33, and Lemma 34 that for sufficiently large , on the event , we have
[TABLE]
At this point, the calculation splits between the two cases. In the recombination dominating case, by (191), (183), Lemma 35, and the definition of in (167), we have
[TABLE]
Because and , when is sufficiently large, on the event , we have that .
The proof for the mutation dominating case is almost the same, except at (191), where Lemma 35 gives
[TABLE]
The result follows from the facts that and . β
We have just finished showing that each of the events to conditioned on occurs with probability close to 1 on the event . In the next step, before we eventually prove Proposition 4, we are going to show that on the event , we have that .
Lemma 39**.**
For sufficiently large , on the event , we have that .
Proof.
In this proof, we are working on the event . By the definition of event in (173), we know that , and from the ways we define and as in (170) and (175), we have that . So, by the definition of in (172), it is left to show that .
In the recombination dominating case, by the definitions of the events and in (174) and (176), if , then
[TABLE]
Since , for sufficiently large , we have that X_{0}\big{(}t\wedge T_{(3)}\big{)}<\delta Ne^{-s(1-3\delta)(t\wedge T_{(3)}-t_{2})}, for all . Therefore, by the way we define as in (171), we have that .
For the mutation dominating case, by following the same argument, we have that for all ,
[TABLE]
and the result follows because . β
7.3 The proof of Proposition 4
Proof.
Recall the definition of in (182). From Lemmas 31, 32, 33, 34, 36, 37, and 38, for sufficiently large , on the event , we have
[TABLE]
Thus, by Proposition 3, for sufficiently large ,
[TABLE]
Next, assume that we are on the event . It follows from Lemma 39 that when is sufficiently large. So, by the definition of as in (171), we have , and by using the definition of in (168), we prove the first part of the proposition.
For the proof of the second part of the proposition, we define
[TABLE]
We will first prove the recombination dominating case. From (55), the definition of the event in (179), and Proposition 3, we have
[TABLE]
Since, , for sufficiently large ,
[TABLE]
Hence, from Lemma 30, the definition of in (170), inequality (185), and the definition of in (192), for sufficiently large , we have that
[TABLE]
For the upper bound for , from (55), the definition of the event in (179), Proposition 3, the fact that , and the definitions of in (167), we have
[TABLE]
Since , for sufficiently large , we have . It follows from the definitions of the events and as defined in (177) and (178), along with the facts that and , that for sufficiently large , we have , and . Therefore, for sufficiently large , we have .
We will now consider the mutation dominating case. Following the same argument as in the previous case, due to the differences in the definition of and the lower bound of from Proposition 3, instead of having inequality (193), we will have
[TABLE]
Because , for sufficiently large , we have
[TABLE]
and by using the same argument as in the previous case, we have that . For the upper bound for , due to the differences in the definition of and the lower bound of , instead of having inequality (194), we will have
[TABLE]
and because , for sufficiently large , we have . Lastly, it follows from the definitions of the events and as defined in (180) and (181), along with the facts that and , that for sufficiently large , we have , and . Thus, for sufficiently large , we have . β
8 Phase 4 and the proof of Proposition 5
The main result in this phase can be proved using Theorem 24 as we did in phase 2. First, we define and as in (136), (137), (134), (139), (140) and (141), respectively. Next, we define a random variable such that on the event that , we have
[TABLE]
The definition of when is not of interest, as we will work only on the event , on which from Proposition 4, we know that . Next, for , we define
[TABLE]
and define
[TABLE]
One can check that
[TABLE]
and, for all , we have
[TABLE]
By computation, we obtain that
[TABLE]
and
[TABLE]
which along with (197) imply that
[TABLE]
From (140), (196), (197) and (199), for ,
[TABLE]
Therefore, for , we have , and
[TABLE]
Lastly, we define
[TABLE]
where is a positive constant that was defined in (192).
Lemma 40**.**
For sufficiently large , on the event , we have .
Proof.
The proof is almost exactly the same as the proof of 25. Recall from section 6 that is a constant not depending on such that is a Lipschitz constant of the function . We define
[TABLE]
and . We also define
[TABLE]
First, we consider the event . From Proposition 4, for sufficiently large , on the event , we have , which means is well-defined. So, by (198), for sufficiently large , on the event , we have
[TABLE]
From the upper bound of in Proposition 4, along with the facts that in the recombination dominating case and in the mutation dominating case, for sufficiently large , on the event we have . So, for sufficiently large , we have .
Next, by similar arguments to those used to prove that and in Proposition 4, for sufficiently large , we have that and . Therefore, by Theorem 24, the definitions of and in (168) and (201), along with the fact that , for sufficiently large , on the event , we have
[TABLE]
which proves the result. β
Here, we will give a proof for Proposition 5.
Proof of Proposition 5.
First, from the definition of in (203), and Propositions 4 and 40, for sufficiently large , we have
[TABLE]
From this point, we will work on the event . From the definition of in (195) and Proposition (4), we have
[TABLE]
By the definitions of and in (196), (168), (201), (167) and (200), respectively, along with the inequality (204), we obtain that
[TABLE]
and
[TABLE]
Note that from Proposition 4, it is clear that . Using this fact, the definitions of in (202), the definition of in (197), along with (205) and (206) , we have
[TABLE]
and
[TABLE]
Lastly, using that , the definitions and in (197) and (202), along with (199) and (205), we obtain that
[TABLE]
This completes the proof of this lemma. β
9 Phase 5 and the proof of Theorem 1
The technique used in the proof involves coupling with a branching process, similar to the proof of Lemma 19. We begin by defining
[TABLE]
First, we will show that with probability close to 1, and .
Lemma 41**.**
The following statements hold:
For sufficiently large , on the event , we have . 2. 2.
For sufficiently large , on the event , we have .
Proof.
We are going to consider the process . For , let and be the rates the this process increases and decreases by 1 at time . This process increases by 1 when a type 3 individual dies and is replaced by an individual that is not type 3. Type 3 individuals die at total rate of , and the probability that the replacement is a type 3 individual is
[TABLE]
Hence, this process increases by 1 at rate
[TABLE]
The process decreases by 1 when an individual that is not of type 3 dies and is replaced by a type 3, or a mutation occurs on a type 1 or 2 individual. This occurs at rate
[TABLE]
Then, for all , we have
[TABLE]
and
[TABLE]
Hence, we can think of the process as a birth-death process in which each individual gives birth at rate bounded above by and dies at rate bounded below by .
Let be a birth-death process in which each individual gives birth at rate , and dies at rate , and . It is possible to couple the process with the process such that for any time , we have . This implies that if the process reaches 0 before , then the process will also reach 0 before , which means that .
Here, since we are only interested in the probability that the process reaches 0 before , we will consider the induced discrete-time jump process of . It is an asymmetric random walk process that jumps up by 1 with probability
[TABLE]
and jumps down by 1 with probability
[TABLE]
On the event , we have from Proposition 5 that . Let , and note that because , for sufficiently large , we have
[TABLE]
For sufficiently large , on the event , conditioning on the event , the probability that this asymmetric random walk reaches 0 before is
[TABLE]
and note that this upper bound is no longer depends on . Since , when , we have
[TABLE]
Also, because , it follows that when , we have
[TABLE]
Thus, on the event , for sufficiently large , the probability that the asymmetric random walk reaches 0 before is bounded below by . Therefore, through the coupling, for sufficiently large , on the event , we have .
We will now prove part 2 of this lemma. It follows from part 1 that, for sufficiently large , on the event ,
[TABLE]
So, we only need to show that for sufficiently large , on the event ,
[TABLE]
Now, for , we define
[TABLE]
and for , we define . The process is a birth-death process satisfying , where each individual gives birth at rate
[TABLE]
and each individual dies at rate
[TABLE]
For sufficiently large , on the event that , we have
[TABLE]
It follows that,
[TABLE]
By the same reason we obtain (120) which gives the probability that the birth and death process survives until time , if the process starts with one individual, we can generalize to the process that starts with any finite number of individuals. If , then
[TABLE]
and note that this upper bound does not depend on . Now, by using the facts that and along with (13), when is sufficiently large, on the event , on which we know from Proposition 5 that , we have
[TABLE]
Note that when , by using that , we have
[TABLE]
This fact along with (212) prove the inequality (210). β
Next, we are going to show that with probability close to 1.
Lemma 42**.**
The following statements hold:
For sufficiently large , on the event , we have . 2. 2.
For sufficiently large , we have .
Proof.
The proof is similar to the proof of Lemma 41. In this proof, we are going to consider the process . For , let and be the rates at which the process increases or decreases by 1. We will now give a lower bound for and an upper bound for . For the increasing rate, one way to increase is by having a type 0 or type 3 individual die, which occurs at the total rate , and the new individual is type 1 or 2 that is created without recombination, which occurs with probability . Then,
[TABLE]
To decrease , one way is by having a type 1 or type 2 die, and this occurs at total rate , and the new individual cannot be type 1 or 2, which occurs with probability bounded above by . Another way to decrease by having a type 1 or 2 mutate to type 3, which occurs at rate . So,
[TABLE]
When , we have
[TABLE]
and
[TABLE]
Hence, when ,
[TABLE]
Let be a birth-death process such that , in which each individual gives birth at rate and each individual dies at rate . We can couple this process with such that for any , we have , which means that if , the . Now, we consider the induced discrete time jump process of . It is an asymmetric walk that jumps up with probability
[TABLE]
and jumps down with probability
[TABLE]
Next, for , we define
[TABLE]
Since for all , it follows that for , we have . Now, we define . It follows that the process is a birth-death process such that , in which each individual gives birth at rate
[TABLE]
and each individual dies at rate
[TABLE]
With these birth and death rates, we can extend the process to be the birth-death process that is defined for all times , where the rates at which each individual gives birth and dies are given in (213) and (214), respectively.
We will first show that for sufficiently large , on the event ,
[TABLE]
Similar to the way we get (211), if , then
[TABLE]
and note that this lower bound does not depend on . Note that from Proposition 5, we know that on the event , we have . Using the facts that , , and using (13), when is sufficiently large, on the event ,
[TABLE]
Note that because , when ,
[TABLE]
This fact along with (216) proves (215).
Lastly, by using the couplings and from part 1, the fact that , part 1 of Lemma 41, and the definition of in (209), for sufficiently large , on the event ,
[TABLE]
Therefore, for sufficiently large , on the event , we have .
Lastly, to prove part 2, by using part 2 of Lemma 41 and part 1 of this lemma, for sufficiently large , on the event , we have that . With this fact and Proposition 5, for sufficiently large , we have . β
Proof of Theorem 1.
First, for every subsequence , there is a further subsequence that satisfies (6), or there is a further subsequence that satisfies (7). By a subsequence argument, it is enough to prove Theorem 1 in the recombination dominating case and the mutation dominating case. Now, recall that the stopping time defined in Theorem 1 is the first time that type 3 individuals have fixated in the population. We will show that if , then for sufficiently large , we have
[TABLE]
We choose to be small enough so that 1) , 2) and 3) . From part 2 of Lemma 42, for sufficiently large , we have . Note that from the definition of in (209), we have . Also, by the definition of and the fact that , for sufficiently large , we have . Thus, for sufficiently large , we have
[TABLE]
It is enough to show that and .
Recall the definition of in (5). Because of (6), in the recombination dominating case, for sufficiently large ,
[TABLE]
Next, in the mutation dominating case,
[TABLE]
and because of (7), we have
[TABLE]
From the definitions of and in (201) and (207), we have that
[TABLE]
[TABLE]
Because and , along with , we have
[TABLE]
From (220) and the way we choose , for sufficiently large ,
[TABLE]
By a similar argument, from the definitions of and in (201) and (208), we have that
[TABLE]
From (217), (218), and (221), for sufficiently large , we have
[TABLE]
which completes the proof. β
Acknowledgement
I am grateful to Professor Jason Schweinsberg for teaching me several techniques, especially the one from one of his previous papers, which was used intensively in my proof. I also want to thank him for spending time reviewing this manuscript several times, which improved this manuscript significantly.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. B. Athreya, P. E. Ney (1972). Branching Processes . Springer-Verlag, Berlin.
- 2[2] N. Berestycki, L. Z. Zhao (2013). The shape of multidimensional Brunet-Derrida particle systems. ar Xiv:1305.0254 .
- 3[3] S. Bossert, P. Pfaffelhuber (2016). The fixation probability and time for a doubly beneficial mutant. ar Xiv:1610.06613 .
- 4[4] J. F. Crow, M. Kimura (1965). Evolution in sexual and asexual populations. Am. Nat. 99 , 439-450.
- 5[5] C. Cuthbertson, A. Etheridge, F. Yu (2012). Fixation probability for competing selective sweeps. Electron. J. Probab. 17 , 1-36.
- 6[6] R. W. R. Darling, J. R. Norris (2008). Differential equations approximation for Markov chains. Probability Surveys 5 , 37-79.
- 7[7] R. Durrett (2008). Probability Models for DNA Sequence Evolution . Springer, New York.
- 8[8] R. A. Fisher (1930). The Genetical Theory of Natural Selection . Clarendon Press, Oxford.
