Some asymptotic results for nonlinear Hawkes processes
Fuqing Gao, Lingjiong Zhu

TL;DR
This paper investigates the asymptotic behavior of nonlinear Hawkes processes under a regime of large baseline intensity and small excitation, providing new insights into their fluctuations and deviations.
Contribution
It introduces a novel asymptotic regime for nonlinear Hawkes processes, analyzing fluctuations, large deviations, and moderate deviations in this context.
Findings
Derived asymptotic results for nonlinear Hawkes processes
Characterized fluctuations and deviations in the large intensity regime
Applicable to large networks and mean process analysis
Abstract
Hawkes process is a class of simple point processes with self-exciting and clustering properties. Hawkes process has been widely applied in finance, neuroscience, social networks, criminology, seismology, and many other fields. In this paper, we study fluctuations, large deviations and moderate deviations nonlinear Hawkes processes in a new asymptotic regime, the large intensity function and the small exciting function regime. It corresponds to the large baseline intensity asymptotics for the linear case, and can also be interpreted as the asymptotics for the mean process of Hawkes processes on a large network.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Diffusion and Search Dynamics
Abstract
Hawkes process is a class of simple point processes with self-exciting and clustering properties. Hawkes process has been widely applied in finance, neuroscience, social networks, criminology, seismology, and many other fields. In this paper, we study fluctuations, large deviations and moderate deviations nonlinear Hawkes processes in a new asymptotic regime, the large intensity function and the small exciting function regime. It corresponds to the large baseline intensity asymptotics for the linear case, and can also be interpreted as the asymptotics for the mean process of Hawkes processes on a large network.
Some asymptotic results for nonlinear Hawkes processes
Fuqing Gao 111School of Mathematics and Statistics, Wuhan University, Wuhan 430072, People’s Republic of China; [email protected], Lingjiong Zhu 222Department of Mathematics, Florida State University, 1017 Academic Way, Tallahassee, FL-32306, United States of America; [email protected].
1 Introduction
Let be a simple point process on and let be an increasing family of -algebras. Any nonnegative -progressively measurable process with
[TABLE]
a.s. for all intervals is called an -intensity of . We use the notation to denote the number of points in the interval .
A Hawkes process is a simple point process admitting an -intensity
[TABLE]
where is locally integrable, left continuous, and locally integrable. In (1.1), stands for , where are the occurrences of the points before time . In the literature, and are usually referred to as exciting function (or sometimes kernel function or self-interaction function) and intensity function respectively, see e.g. [8]. A Hawkes process is linear if the intensity function is linear and it is nonlinear otherwise.
The Hawkes process when is linear was first proposed by Alan Hawkes in 1971 to model earthquakes and their aftershocks [24]. The nonlinear Hawkes process was first introduced by Brémaud and Massoulié [4]. The Hawkes process naturally generalizes the Poisson process and it captures both the self-exciting property and the clustering effect, and it is a very versatile model for statistical analysis. These explain why it has wide applications in neuroscience, genome analysis, criminology, social networks, healthcare, seismology, insurance, finance and many other fields. For a list of references, we refer to [41].
Most of the asymptotic results for Hawkes processes in the literature are the large time limit theorems. For the linear Hawkes process, the functional law of large numbers and functional central limit theorems were studied in Bacry et al. [1]; the large deviations principle was studied in Bordenave and Torrisi [3]; and the moderate deviation principle was obtained in Zhu [43]. The precise large and moderate deviations are recently studied in Gao and Zhu [19]. For the nonlinear Hawkes process, Zhu [42] studied the functional central limit theorems by using Poisson embeddings and a careful analysis of the decay of the correlations over time. In [44], Zhu obtained a process-level, i.e. level-3 large deviation principle and the rate function is expressed as a variational problem optimizing over a certain entropy function of any simple point process against the underlying nonlinear Hawkes process. When the exciting function is exponential and the process is Markovian, an alternative expression for the rate function for the large deviations was obtained in Zhu [45]. Very recently, using the techniques as a combination of Poisson embeddings, Stein’s method and Malliavin calculus, the quantitative Gaussian and Poisson approximations were studied in Torrisi [38, 39]. The Malliavin calculus for Hawkes processes has also appeared in [37]. In the case of linear Hawkes process, the limit theorems for nearly unstable, also known as, nearly critical case, that is, when and are studied in Jaisson and Rosenbaum [26] when the exciting function has light tail and in Jaisson and Rosenbaum [27] when the exciting function has heavy tail.
There have been some progress made in the direction of asymptotic results other than the large time limits. For instance, when the exciting function is exponential, the intensity process and the pair are Markovian. In Gao and Zhu [20], they studied the functional central limit theorems for the linear Hawkes process when the initial intensity is large, and they further studied the large deviations and applied their results to insurance and queueing systems in [21]. For the more general linear and non-Markovian case, Gao and Zhu [22] considered the large baseline intensity asymptotic results and studied the applications to queueing systems.
In recent years, the mean-field limit for high dimensional Hawkes processes has also been studied, and it first appeared in Delattre et al. [12]. They showed that under a certain setting, the mean-field limit is an inhomogeneous Poisson process. Other mean-field limit works include Chevallier [8] who studied a generalized Hawkes process model with an inclusion of the dependence on the age of the process, and Delattre and Fournier [11] who studied the mean-field limit for Hawkes processes on a graph with two nodes whether or not influence each other modeled by i.i.d. Bernoulli random variables.
In this paper, we are interested in studying a new asymptotic regime for the nonlinear Hawkes process starting from empty past history, in which the intensity function is large and the exciting function is small. More precisely, we introduce the small parameter and consider the nonlinear Hawkes process with intensity:
[TABLE]
In this asymptotic regime, the pair of the intensity function and the exciting function has the transformation .
Now, let us explain why this asymptotic regime is natural and also point out that such a regime has been studied extensively in many similar settings in the literature.
When the Hawkes process is linear, say , , where is the baseline intensity, we have
[TABLE]
This gives the intensity of a linear Hawkes process with exciting function , and a large baseline intensity . Therefore, the asymptotic regime considered in this paper corresponds to the large baseline intensity regime that is studied in Gao and Zhu [22].
The asymptotic regime studied in this paper for the univariate Hawkes process is also equivalent for the asymptotics for the mean process for the high-dimensional multivariate Hawkes process. Our work is related to the mean-field limit for high-dimensional Hawkes processes in [12, 8, 11]. To see the connection of our work with the mean-field limit literature of Hawkes processes, let us first define a multivariate Hawkes process as follows. An -dimensional Hawkes process is an -dimensional point process admitting an -intensity such that
[TABLE]
where is locally integrable, left continuous, and we always assume that . For the multivariate Hawkes process, a jump in one component will not only increase the intensity of future jumps of its own component, known as the self-exciting property, but also increase the intensity of the future jumps of or the other components that are connected to its own component, which is known as the mutually-exciting property. By using the Poisson embeddings, see e.g [4, 12], we can express the Hawkes process as the solution of a Poisson driven SDE:
[TABLE]
where are a sequence of i.i.d. Poisson measures with common intensity measure on . As a special case, for each , we let and and we consider the Hawkes process which can be expressed as
[TABLE]
The mean process of the Hawkes processes is defined by :
[TABLE]
It follows from (1.6) that
[TABLE]
where is a Poisson measure on with intensity .
On the other hand, let us recall that the nonlinear Hawkes process with the intensity function and the exciting function can be expressed via Poisson embedding as the unique strong solution to the following equation:
[TABLE]
where is a Poisson random measure on with intensity . In this paper, we are interested in the asymptotics for , which satisfies the dynamics:
[TABLE]
where is a Poisson random measure on with intensity .
By comparing (1.8) with (1.10), it becomes clear that the mean process of an -dimensional Hawkes process defined in (1.6) has the same dynamics as a univariate Hawkes process with . All the asymptotic results we are going to derive in this paper for the process automatically hold for the mean process . We will go back to this in Section 3.
The asymptotic results for the mean process for a high-dimensional Hawkes process in Section 3 can shed some lights for the applications of high-dimensional Hawkes processes in various context. Hawkes processes have been applied to the study of neuroscience, see e.g. neuroscience, see e.g. [31, 32, 33, 35]. More recently, mean-field limits for extended Hawkes processes have been used to model the neural networks in e.g. [8, 15, 9]. The large deviations results in Section 3 can be used to estimate the probability of rare events in a neural network. The moderate deviations results in Section 3 can be used to fill in the gap between the second-order fluctuations and the large deviations regime. We can also use the multivariate Hawkes process of dimension to represent the loss process for firms in a large portfolio. The results in Section 3 can be used to provide estimates for the tail probabilities for the loss of a large portfolio. We refer to [10, 13, 23] for the works of large portfolio losses in finance. Note that the results we obtained in Section 3 are for the standard multivariate nonlinear Hawkes processes. In order to apply our results to neural networks in neuroscience, large portfolio losses in finance, and many other contexts, one needs to extend our results for the generalized Hawkes processes suitable for the applications in various contexts. Since there are many different ways to generalize the standard multivariate nonlinear Hawkes processes for the purpose of applications, we restrict the study in this paper to the most standard nonlinear Hawkes processes. Nevertheless, the methodology presented in this paper should be applicable for various extensions.
The scalings in (1.10) for stochastic equations with Poisson noise have been widely studied in the literature, see e.g. Budhiraja et al. [6], Budhiraja et al. [7], Budhiraja et al. [5]. The large deviations and moderate deviations for stochastic equations with jumps can be established usually using the variational representation in [6]. However, in our case, the coefficient of the dynamics (1.10) is a indicator function with path-dependency, which is not continuous. As a result, we cannot apply the results from [6, 7] directly. Instead of pursuing a modification of the variational representation approach in [6, 7], we will adopt a more direct approach to establish large and moderate deviations in our paper.
We organize this paper as follows. In Section 2, we introduce the main results of the paper. We will study fluctuations in Section 2.1, large deviations in Section 2.2 and moderate deviations in Section 2.3. The asymptotic results for the mean process for a high-dimensional Hawkes process are presented in Section 3. Finally, all the proofs will be given in Section 4.
2 Main Results
Before we proceed, let us summarize here a list of key assumptions that will be used throughout the paper.
Assumption 1**.**
* is -Lipschitz for some . is locally integrable and locally bounded.*
Assumption 2**.**
* is differentiable and is locally integrable.*
Assumption 3**.**
* is -Lipschitz and .*
Assumption 4**.**
* is twice differentiable and .*
Assumption 5**.**
, is differentiable and .
We collect here a set of notations that will be used throughout the paper.
- •
is the space of real-valued continuous functions on ;
- •
is the space of real-valued càdlàg functions on equipped with Skorokhod topology;
- •
is the space of functions that are absolutely continuous with ;
- •
is the space of non-decreasing functions that are absolutely continuous with .
2.1 Fluctuations
In this section, we are interested to study the fluctuations of around its limit . We will obtain a functional central limit theorem for .
As , will converge on to a deterministic function that satisfies the equation:
[TABLE]
Indeed, this result will follow from the fluctuation result for , that is, we will show that converges in distribution on to a non-trivial stochastic limit, which turns out to be a continuous Gaussian process. Let us notice that the equation (2.1) has a unique locally bounded and non-negative solution under certain assumptions, see Delattre [12]. It is interesting that the mean of the inhomogeneous Poisson process as the mean-field limit for high dimensional Hawkes processes leads to the same limiting equation as in (2.1).
Let us define:
[TABLE]
Theorem 6**.**
Suppose Assumption 1, Assumption 2 and Assumption 4 hold. converges in distribution on to a continuous Gaussian process defined by
[TABLE]
where is a standard Brownian motion.
Remark 7**.**
The Gaussian process defined by (2.3) is also a semimartingale and
[TABLE]
Thus, the Gaussian process has the following equivalent characterization:
[TABLE]
A key component of the proof of Theorem 6 is the tightness of the sequence on that we will establish in the following lemma.
Lemma 8**.**
Suppose Assumption 1 and Assumption 2 hold, is tight on and the all limits are in .
The proof of the tightness of , relies on two auxiliary lemmas. The first lemma, i.e. Lemma 9 gives a uniform bound on the first moment of , uniformly in , and the second lemma, i.e. Lemma 10, gives us a uniform bound on the second moment of the running maximum of process, uniformly in .
Lemma 9**.**
Suppose Assumption 1 holds.
[TABLE]
Lemma 10**.**
Suppose Assumption 1 and Assumption 2 hold,
[TABLE]
The proofs of Theorem 6, Lemma 8, Lemma 9 and Lemma 10 will all be given in Section 4.
Remark 11**.**
Note that [42] studied the large time fluctuations for stationary nonlinear Hawkes processes and more precisely, as a special case for the linear Hawkes process , we have in distribution on as , where and , and is a standard Brownian motion. Note that for the large time functional central limit theorem, the limiting variance depends on only, while in our Theorem 6, it depends on the entire exciting function for . Moreover, in our limit, we obtain a Gaussian process that in general is not a Brownian motion.
2.2 Large deviations
We have already seen that converges to the limit on and have studied the fluctuations around this limit. It is natural to ask about the probability of the rare events that the process deviates away from its deterministic limit. That is the question of large deviations in probability theory.
We start by giving a formal definition of the large deviation principle. We refer to Dembo and Zeitouni [14] and Varadhan [40] for general background of large deviations and the applications.
A sequence of probability measures on a topological space satisfies the large deviation principle with rate function and speed if is non-negative, lower semicontinuous and for any Borel set , we have
[TABLE]
Here, is the interior of and is its closure.
Now, we are ready to state the main results of large deviations for on .
Theorem 12**.**
Suppose Assumption 1, Assumption 3 and Assumption 5 hold. Then, satisfies a large deviation principle on equipped with Skorokhod topology with the speed and the rate function
[TABLE]
if and otherwise, where
[TABLE]
Instead of establishing a full large deviation principle in Theorem 12 directly, our strategy is to first prove a local large deviation principle in Theorem 13, with the main tool being the change of measure technique for simple point processes. We then establish the exponential tightness in order to obtain a full large deviation principle.
We have the following local large deviation principle.
Theorem 13**.**
Suppose Assumption 1 and Assumption 5 hold. For any ,
[TABLE]
where is defined in (2.8).
Next, let us establish the exponential tightness of the sequence on . The following Lemma 14 and Lemma 15, together with the local large deviation principle will provide us the full large deviation principle that is desired.
Lemma 14**.**
Suppose Assumption 1, Assumption 3 hold. Then,
[TABLE]
Lemma 15**.**
Suppose Assumption 1, Assumption 3 hold. For any ,
[TABLE]
Remark 16**.**
Theorem 13, Lemma 14 and Lemma 15 provide actually the large deviation principle for with respect to the uniform topology on , see e.g. Lemma A.1 in [16], or Theorem 4.14 [17].
Remark 17**.**
In [3], they obtained a sample path large deviation principle for the large time scaling for Poisson cluster processes. More precisely, the linear Hawkes process with , as a special case of the Poisson cluster process, has the sample path large deviation principle that satisfies a large deviation principle on equipped with the topology of point-wise convergence with the speed and the rate function if and otherwise, where
[TABLE]
for and otherwise. Note that since the assumption (37) in [3] is not satisfied for the linear Hawkes process, their large deviations results apply to the topology of point-wise convergence, but not the uniform topology. Our results in Theorem 12 differ in two ways. First, our rate function depends on the entire function , , rather than as in (2.13). Second, we allow uniform topology for the sample path large deviation principle.
2.3 Moderate Deviations
In this section, we are interested in the moderate deviations for . The moderate deviation principle fills in the gap between the central limit theorem and the large deviation principle. For a brief introduction to moderate deviations, we refer to Chap. 3.7. in Dembo and Zeitouni [14].
Our approach to the proof of the moderate deviations is similar to that of the large deviations. That is, we first establish a local moderate deviation principle by using the change of measure technique, i.e. Theorem 19, and then establish the appropriate exponential tightness estimates, i.e. Lemma 20 and Lemma 21.
Our main result is the following:
Theorem 18**.**
Suppose Assumption 1, Assumption 2, Assumption 3, Assumption 4 and Assumption 5 hold. Let be a positive sequence such that as . Then, satisfies a large deviation principle on equipped with Skorokhod topology with speed and the rate function
[TABLE]
if and otherwise.
We first establish a local moderate deviation principle:
Theorem 19**.**
Suppose Assumption 1, Assumption 4 and Assumption 5 hold. For any ,
[TABLE]
where is given in (2.14).
Next, we establish the exponential tightness of the sequence on in the following lemmas.
Lemma 20**.**
Suppose Assumption 1, Assumption 2, Assumption 3 hold.
[TABLE]
Lemma 21**.**
Suppose Assumption 1, Assumption 2, Assumption 3 hold. For any ,
[TABLE]
Remark 22**.**
(i). Theorem 19, Lemma 20 and Lemma 21 provide actually the moderate deviation principle for with respect to the uniform topology on , see e.g. Lemma A.1 in [16], or Theorem 4.14 [17].
(ii). For stochastic dynamics driven by Brownian motion, the limit of its standardization is an Ornstein-Uhlenbeck process driven by the same Brownian motion, and so, the fluctuations and the moderate deviations can be established by estimating deviation inequality of the standardization with the Ornstein-Uhlenbeck process (see,e.g. [18]). That approach cannot be applied to stochastic dynamics with jumps in our paper.
3 Asymptotics for the mean process for high-dimensional Hawkes processes
All the previous results that we derived in Theorem 6, Theorem 12 and Theorem 18 for the univariate Hawkes process can be transferred to the mean process of a multivariate Hawkes process. Consider the -dimensional multivariate Hawkes process using the Poisson embeddings representation: :
[TABLE]
and its mean process , which satisfies
[TABLE]
Theorem 6, Theorem 12 and Theorem 18 for the univariate Hawkes process can be transferred to the following Theorem 23, Theorem 24 and Theorem 25 respectively for the mean process of a multivariate Hawkes process.
Theorem 23**.**
Suppose Assumption 1, Assumption 2 and Assumption 4 hold. Set . Then converges in distribution on to a continuous Gaussian process defined in Theorem 6.
Theorem 24**.**
Suppose Assumption 1, Assumption 3 and Assumption 5 hold. satisfies a large deviation principle on equipped with Skorokhod topology with the speed and the rate function given in Theorem 12.
Theorem 25**.**
Suppose Assumption 1, Assumption 2, Assumption 3, Assumption 4 and Assumption 5 hold. Let be a positive sequence such that as . Then, satisfies a large deviation principle on equipped with Skorokhod topology with speed and the rate function given in Theorem 18.
4 Proofs
4.1 Proofs for Section 2.1
Before we prove Theorem 6, let us first give the proofs of Lemma 9, Lemma 10, and Lemma 8.
Firstly, For any , is a positive local martingale, hence a supermartingale, and thus for any , we have
[TABLE]
Since we assumed that , for sufficiently small , we have . It follows that
[TABLE]
In particular, for any ,
[TABLE]
Proof of Lemma 9.
Notice that for any ,
[TABLE]
The result follows from the Gronwall’s inequality. ∎
Proof of Lemma 10.
Notice that
[TABLE]
where
[TABLE]
is a martingale. For any ,
[TABLE]
By Gronwall’s inequality,
[TABLE]
Finally, by Doob’s inequality
[TABLE]
where we have proved in Lemma 9 that is uniformly bounded in . ∎
Proof of Lemma 8.
Let us recall that
[TABLE]
where is a martingale and we can show that for any ,
[TABLE]
To show (4.10), w.l.o.g., assume and by using Doob’s martingale inequality, Chebychev’s inequality, and Burkholder-Davis-Gundy inequality, we have
[TABLE]
Note that for every ,
[TABLE]
Recall that we have proved in Lemma 9 that is uniformly bounded in . By Gronwall’s inequality, is uniformly bounded in . Hence, we conclude that (4.10) follows from (4.11) since it goes to zero as uniformly in .
Moreover, for any ,
[TABLE]
It follows from Lemma 10 that the sequence
[TABLE]
is tight on . Hence, for any ,
[TABLE]
which implies that the sequence is tight on and the all limits are in by Theorem 15.5 [2]. ∎
We are now finally ready to give the proof of the fluctuations results in Theorem 6.
Proof of Theorem 6.
We can write
[TABLE]
and
[TABLE]
where is defined in (4.6) and
[TABLE]
Then, by Doob’s martingale inequality, and Burkholder-Davis-Gundy inequality, there exists a constant
[TABLE]
where we have proved in Lemma 10 that is uniformly bounded in . Thus, is uniformly bounded in , and by (4.7), is also uniformly bounded in .
By Taylor expansion,
[TABLE]
Therefore, we have that
[TABLE]
are uniformly integrable martingales, and
[TABLE]
These yield that as , in probability,
[TABLE]
and there exists a square integrable martingale such that
[TABLE]
In Lemma 8, we showed that the sequence is tight on . Let be a limit point of , and is continuous in . We conclude that,
[TABLE]
and
[TABLE]
are martingales. Since is continuous in time , by Lévy’s characterization of Brownian motion and martingale representation theorem, see e.g. Chapter IV Theorem 3.6. and Chapter V Proposition 3.8. [34], there exists a standard Brownian motion , such that (2.3) holds.
By Gronwall’s inequality, the stochastic differential equation (2.3) only has a unique solution which implies that as , the set of limit points of is a singleton. Thus, converges in distribution on to the solution of the equation (2.3).
Finally, let us show that the limit is a Gaussian process. Set and
[TABLE]
and for every ,
[TABLE]
Then is a sequence of Gaussian processes. Moreover, we can compute that
[TABLE]
where we used the integration by parts and . Set . Then for every ,
[TABLE]
which implies that
[TABLE]
which yields that
[TABLE]
Thus, almost surely, . Thus, by Proposition 6.1 (Chapter 0) in [34], is a continuous Gaussian process such that
[TABLE]
Therefore,
[TABLE]
By the uniqueness of the solution of the equation (2.3), we have . Therefore, is a Gaussian process. ∎
4.2 Proofs for Section 2.2
Proof of Theorem 12.
Theorem 12 follows from the local large deviation principle in Theorem 13 and the super-exponential estimates in Lemma 14 and Lemma 15. ∎
Proof of Theorem 13.
Set
[TABLE]
Then is a closed subset in and , Thus, for any ,
[TABLE]
Next, we assume that .
Let be the probability measure under which is a standard Poisson process with intensity . Since is -Lipschitz, we have
[TABLE]
That is, the intensity has at most the linear growth in . Moreover, under our assumption, we have . Thus, and are equivalent, and the Radon-Nikodym is given by, see e.g. [36],
[TABLE]
By changing of the probability measure to ,
[TABLE]
For any with , we have
[TABLE]
Set . Then is a finite variation function on . The total variation of on denotes by . Then
[TABLE]
and
[TABLE]
It follows from integration by parts that
[TABLE]
On the other hand, we can estimate that
[TABLE]
Finally, we can estimate that
[TABLE]
Since is a standard Poisson process with intensity under the probability measure , it is well known that, see, e.g. [29, 30], satisfies a large deviation principle on with the rate function
[TABLE]
where is defined in (2.9).
Hence, we conclude that
[TABLE]
∎
Proof of Lemma 14.
By (4.2), and Chebychev’s inequality, for sufficiently small ,
[TABLE]
which yields the desired result. ∎
Proof of Lemma 15.
Without loss of generality, let us assume that . For any ,
[TABLE]
For any ,
[TABLE]
Therefore, by Cauchy-Schwarz inequality,
[TABLE]
which is uniform in . By Chebychev’s inequality,
[TABLE]
It follows from (4.2) that for any sufficiently small ,
[TABLE]
where is a positive constant that depends only on , , , and .
Let be a sufficiently small fixed constant, independent of all the other parameters. We define , and thus
[TABLE]
is sufficiently small since is sufficiently small and we can apply (4.36) and the Chebychev’s inequality and get
[TABLE]
Since , we get
[TABLE]
which yields the desired result by letting . ∎
4.3 Proofs for Section 2.3
Proof of Theorem 18.
Theorem 18 follows from the local large deviation principle in Theorem 19 and the super-exponential estimates in Lemma 20 and Lemma 21. ∎
Proof of Theorem 19.
Set
[TABLE]
Then is a closed subset in and , Thus, for any ,
[TABLE]
Next, we assume that .
Let be the probability measure under which is an inhomogeneous Poisson process with intensity . Denote by
[TABLE]
By changing the probability measure to (see the discussions about the change of measure and the Radon-Nikodym derivative in the proof of Theorem 13 and [36]),
[TABLE]
Replacing in the proof of Theorem 13 by , we have firstly,
[TABLE]
and secondly,
[TABLE]
and thirdly,
[TABLE]
and finally,
[TABLE]
Thus, by (4.40), (4.41), (4.42), and (4.43), there exists a constant such that for any ,
[TABLE]
Moreover, by a deterministic time change, we have
[TABLE]
where the first equality above holds in distribution, where , and is a standard Poisson process with constant intensity under the probability measure .
It is well known that, see e.g. [30], that satisfies a large deviation principle on , see e.g. [30], with the speed and the rate function
[TABLE]
Therefore,
[TABLE]
where is defined via , for every , so that if , then , and if , then and
[TABLE]
Thus, we have
[TABLE]
Finally, we notice that
[TABLE]
Hence, we conclude that
[TABLE]
Hence, the conclusion of the Theorem 19 holds. ∎
Proof of Lemma 20.
Let us recall that satisfies the dynamics:
[TABLE]
where is a martingale. Therefore, for any ,
[TABLE]
It follows from Gronwall’s inequality that
[TABLE]
For any , by Doob’s martingale inequality,
[TABLE]
Let us define
[TABLE]
Then is a martingale and . Moreover, and
[TABLE]
By Lemma 26.19 in Kallenberg [28], if is a local martingale starting at [math] with then is a supermartingale where . Hence,
[TABLE]
Similarly,
[TABLE]
As , it is easy to see that and . Therefore, for sufficiently small ,
[TABLE]
where for small is defined in the proof of the exponential tightness for the large deviation principle under the assumption that and the fact that is sufficiently small as . It is easy to check that for some
[TABLE]
Therefore, we conclude that
[TABLE]
The desired result follows by letting . ∎
Proof of Lemma 21.
Without loss of generality, let us assume that . For any ,
[TABLE]
We can estimate that
[TABLE]
Thus,
[TABLE]
We can compute that for any , for sufficiently small ,
[TABLE]
where the last line uses (4.54). From here, we can further estimate that
[TABLE]
which is uniform in . Moreover,
[TABLE]
The choice of is arbitrary. Let us choose , then
[TABLE]
Finally, by Lemma 20,
[TABLE]
Hence, we have proved the desired result. ∎
Acknowledgements
We are very grateful to the Associate Editor and two anonymous referees for their helpful comments and suggestions. Fuqing Gao acknowledges support from NSFC Grant 11571262 and the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No. 20130141110076). Lingjiong Zhu is grateful to the support from NSF Grant DMS-1613164.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bacry, E, Delattre, S., Hoffmann, M. and Muzy, J. F. (2013). Scaling limits for Hawkes processes and application to financial statistics. Stochastic Processes and their Applications 123 , 2475-2499.
- 2[2] Billingsley, P. (1999). Convergence of Probability Measures , 2nd edition. Wiley-Interscience, New York.
- 3[3] Bordenave, C. and Torrisi, G. L. (2007). Large deviations of Poisson cluster processes. Stochastic Models , 23 , 593-625.
- 4[4] Brémaud, P. and Massoulié, L. (1996). Stability of nonlinear Hawkes processes. Ann. Probab. , 24 , 1563-1588.
- 5[5] Budhiraja, A., Chen, J. and Dupuis, P. (2013). Large deviations for stochastic partial differential equations driven by a Poisson random measure. Stochastic Processes and their Applications . 123 , 523-560.
- 6[6] Budhiraja, A., Dupuis, P. and Maroulas, V. (2011). Variational representations for continuous time processes. Annales de I’Institut Henri Poincaré-Probabilités et Statistiques . 47 , 725-747.
- 7[7] Budhiraja, A., Dupuis, P. and Ganguly, A.(2016). Moderate deviation principle for stochastic differential equations with jumps. Annals of Probability . 44 , 1723-1775.
- 8[8] Chevallier, J. (2017). Mean-field limit of generalized Hawkes processes. to appear in Stochastic Processes and their Applications .
