Some asymptotic results for nonlinear Hawkes processes

Fuqing Gao; Lingjiong Zhu

arXiv:1702.05852·math.PR·November 5, 2018

Some asymptotic results for nonlinear Hawkes processes

Fuqing Gao, Lingjiong Zhu

PDF

Open Access

TL;DR

This paper investigates the asymptotic behavior of nonlinear Hawkes processes under a regime of large baseline intensity and small excitation, providing new insights into their fluctuations and deviations.

Contribution

It introduces a novel asymptotic regime for nonlinear Hawkes processes, analyzing fluctuations, large deviations, and moderate deviations in this context.

Findings

01

Derived asymptotic results for nonlinear Hawkes processes

02

Characterized fluctuations and deviations in the large intensity regime

03

Applicable to large networks and mean process analysis

Abstract

Hawkes process is a class of simple point processes with self-exciting and clustering properties. Hawkes process has been widely applied in finance, neuroscience, social networks, criminology, seismology, and many other fields. In this paper, we study fluctuations, large deviations and moderate deviations nonlinear Hawkes processes in a new asymptotic regime, the large intensity function and the small exciting function regime. It corresponds to the large baseline intensity asymptotics for the linear case, and can also be interpreted as the asymptotics for the mean process of Hawkes processes on a large network.

Equations357

\mathbb{E}\left[N(a,b]|\mathcal{F}^{-\infty}_{a}\right]=\mathbb{E}\left[\int_{a}^{b}\lambda_{s}ds\big{|}\mathcal{F}^{-\infty}_{a}\right],

\mathbb{E}\left[N(a,b]|\mathcal{F}^{-\infty}_{a}\right]=\mathbb{E}\left[\int_{a}^{b}\lambda_{s}ds\big{|}\mathcal{F}^{-\infty}_{a}\right],

λ_{t} := ϕ (\int_{- \infty}^{t -} h (t - s) N (d s)),

λ_{t} := ϕ (\int_{- \infty}^{t -} h (t - s) N (d s)),

λ_{t}^{ϵ} = \frac{1}{ϵ} ϕ (\int_{0}^{t -} ϵ h (t - s) d N_{s}^{ϵ}) .

λ_{t}^{ϵ} = \frac{1}{ϵ} ϕ (\int_{0}^{t -} ϵ h (t - s) d N_{s}^{ϵ}) .

λ_{t}^{ϵ} = \frac{ν}{ϵ} + \int_{0}^{t -} h (t - s) d N_{s}^{ϵ} .

λ_{t}^{ϵ} = \frac{ν}{ϵ} + \int_{0}^{t -} h (t - s) d N_{s}^{ϵ} .

λ_{t}^{i} := ϕ_{i} (j = 1 \sum N \int_{0}^{t -} h_{ij} (t - s) d Z_{s}^{j}),

λ_{t}^{i} := ϕ_{i} (j = 1 \sum N \int_{0}^{t -} h_{ij} (t - s) d Z_{s}^{j}),

Z_{t}^{i} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ_{i} (\sum_{j = 1}^{N} \int_{0}^{s -} h_{ij} (s - u) d Z_{u}^{j})}} π^{i} (d s d z), 1 \leq i \leq N,

Z_{t}^{i} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ_{i} (\sum_{j = 1}^{N} \int_{0}^{s -} h_{ij} (s - u) d Z_{u}^{j})}} π^{i} (d s d z), 1 \leq i \leq N,

Z_{t}^{N, i} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ (N^{- 1} \sum_{j = 1}^{N} \int_{0}^{s -} h (s - u) d Z_{u}^{N, j})}} π^{i} (d s d z) .

Z_{t}^{N, i} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ (N^{- 1} \sum_{j = 1}^{N} \int_{0}^{s -} h (s - u) d Z_{u}^{N, j})}} π^{i} (d s d z) .

\overline{Z}_{t}^{N} = \frac{1}{N} i = 1 \sum N Z_{t}^{N, i}, t \geq 0.

\overline{Z}_{t}^{N} = \frac{1}{N} i = 1 \sum N Z_{t}^{N, i}, t \geq 0.

\overline{Z}_{t}^{N} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ (\int_{0}^{s -} h (s - u) d \overline{Z}_{u}^{N})}} \frac{1}{N} i = 1 \sum N π^{i} (d s d z),

\overline{Z}_{t}^{N} = \int_{0}^{t} \int_{0}^{\infty} 1_{{z \leq ϕ (\int_{0}^{s -} h (s - u) d \overline{Z}_{u}^{N})}} \frac{1}{N} i = 1 \sum N π^{i} (d s d z),

N_{t}^{ϵ} = \int_{0}^{t} \int_{0}^{\infty} 1_{[0, \frac{1}{ϵ} ϕ (\int_{0}^{s -} ϵ h (s - u) d N_{u}^{ϵ})]} (z) π (d z d s),

N_{t}^{ϵ} = \int_{0}^{t} \int_{0}^{\infty} 1_{[0, \frac{1}{ϵ} ϕ (\int_{0}^{s -} ϵ h (s - u) d N_{u}^{ϵ})]} (z) π (d z d s),

Z_{t}^{ϵ}

Z_{t}^{ϵ}

= \int_{0}^{t} \int_{0}^{\infty} 1_{[0, ϕ (\int_{0}^{s -} h (s - u) d Z_{u}^{ϵ})]} (z) ϵ π^{ϵ^{- 1}} (d z d s) .

Z_{t}^{0} = \int_{0}^{t} ϕ (\int_{0}^{s} h (s - u) d Z_{u}^{0}) d s .

Z_{t}^{0} = \int_{0}^{t} ϕ (\int_{0}^{s} h (s - u) d Z_{u}^{0}) d s .

X_{t}^{ϵ} = \frac{Z _{t}^{ϵ} - Z _{t}^{0}}{ϵ} .

X_{t}^{ϵ} = \frac{Z _{t}^{ϵ} - Z _{t}^{0}}{ϵ} .

X_{t} =

X_{t} =

+ \int_{0}^{t} ϕ (\int_{0}^{s} h (s - u) d Z_{u}^{0}) d W_{s},

h (0) X_{s} + \int_{0}^{s} X_{u} h^{'} (s - u) d u = \int_{0}^{s} h (s - u) d X_{u} .

h (0) X_{s} + \int_{0}^{s} X_{u} h^{'} (s - u) d u = \int_{0}^{s} h (s - u) d X_{u} .

X_{t} =

X_{t} =

ϵ > 0 sup E [Z_{T}^{ϵ}] \leq ϕ (0) T e^{α ∥ h ∥_{L^{\infty} [0, T]} T} .

ϵ > 0 sup E [Z_{T}^{ϵ}] \leq ϕ (0) T e^{α ∥ h ∥_{L^{\infty} [0, T]} T} .

ϵ > 0 sup E [0 \leq t \leq T sup (X_{t}^{ϵ})^{2}] < \infty.

ϵ > 0 sup E [0 \leq t \leq T sup (X_{t}^{ϵ})^{2}] < \infty.

- x \in A^{o} in f I (x) \leq ϵ \to 0 lim inf \frac{1}{b ( ϵ )} lo g P_{ϵ} (A) \leq ϵ \to 0 lim sup \frac{1}{b ( ϵ )} lo g P_{ϵ} (A) \leq - x \in \overline{A} in f I (x) .

- x \in A^{o} in f I (x) \leq ϵ \to 0 lim inf \frac{1}{b ( ϵ )} lo g P_{ϵ} (A) \leq ϵ \to 0 lim sup \frac{1}{b ( ϵ )} lo g P_{ϵ} (A) \leq - x \in \overline{A} in f I (x) .

I (η) := \int_{0}^{T} ℓ (η^{'} (t); ϕ (\int_{0}^{t} h (t - s) d η (s))) d t,

I (η) := \int_{0}^{T} ℓ (η^{'} (t); ϕ (\int_{0}^{t} h (t - s) d η (s))) d t,

ℓ (x; y) := x lo g (\frac{x}{y}) - x + y .

ℓ (x; y) := x lo g (\frac{x}{y}) - x + y .

δ \to 0 lim ϵ \to 0 lim ϵ lo g P (0 \leq t \leq T sup ∣ Z_{t}^{ϵ} - η (t) ∣ \leq δ) = - I (η),

δ \to 0 lim ϵ \to 0 lim ϵ lo g P (0 \leq t \leq T sup ∣ Z_{t}^{ϵ} - η (t) ∣ \leq δ) = - I (η),

K \to \infty lim sup ϵ \to 0 lim sup ϵ lo g P (Z_{T}^{ϵ} \geq K) = - \infty.

K \to \infty lim sup ϵ \to 0 lim sup ϵ lo g P (Z_{T}^{ϵ} \geq K) = - \infty.

M \to \infty lim sup ϵ \to 0 lim sup ϵ lo g P (0 \leq s \leq t \leq T, ∣ t - s ∣ \leq \frac{1}{M} sup ∣ Z_{t}^{ϵ} - Z_{s}^{ϵ} ∣ \geq δ) = - \infty.

M \to \infty lim sup ϵ \to 0 lim sup ϵ lo g P (0 \leq s \leq t \leq T, ∣ t - s ∣ \leq \frac{1}{M} sup ∣ Z_{t}^{ϵ} - Z_{s}^{ϵ} ∣ \geq δ) = - \infty.

I (x) := x lo g (\frac{x}{ν + x ∥ h ∥ _{L^{1}}}) - x + x ∥ h ∥_{L^{1}} + ν,

I (x) := x lo g (\frac{x}{ν + x ∥ h ∥ _{L^{1}}}) - x + x ∥ h ∥_{L^{1}} + ν,

J (η) := \frac{1}{2} \int_{0}^{T} \frac{( η ^{'} ( t ) - ϕ ^{'} ( \int _{0}^{t} h ( t - u ) d Z _{u}^{0} ) \int _{0}^{t} h ( t - u ) d η _{u} ) ^{2}}{ϕ ( \int _{0}^{t} h ( t - u ) d Z _{u}^{0} )} d t,

J (η) := \frac{1}{2} \int_{0}^{T} \frac{( η ^{'} ( t ) - ϕ ^{'} ( \int _{0}^{t} h ( t - u ) d Z _{u}^{0} ) \int _{0}^{t} h ( t - u ) d η _{u} ) ^{2}}{ϕ ( \int _{0}^{t} h ( t - u ) d Z _{u}^{0} )} d t,

δ \to 0 lim ϵ \to 0 lim \frac{ϵ}{a ^{2} ( ϵ )} lo g P (0 \leq t \leq T sup \frac{Z _{t}^{ϵ} - Z _{t}^{0}}{a ( ϵ )} - η_{t} \leq δ) = - J (η),

δ \to 0 lim ϵ \to 0 lim \frac{ϵ}{a ^{2} ( ϵ )} lo g P (0 \leq t \leq T sup \frac{Z _{t}^{ϵ} - Z _{t}^{0}}{a ( ϵ )} - η_{t} \leq δ) = - J (η),

K \to \infty lim sup ϵ \to 0 lim sup \frac{ϵ}{a ( ϵ ) ^{2}} lo g P (0 \leq t \leq T sup ∣ Z_{t}^{ϵ} - Z_{t}^{0} ∣ \geq K a (ϵ)) = - \infty.

K \to \infty lim sup ϵ \to 0 lim sup \frac{ϵ}{a ( ϵ ) ^{2}} lo g P (0 \leq t \leq T sup ∣ Z_{t}^{ϵ} - Z_{t}^{0} ∣ \geq K a (ϵ)) = - \infty.

M \to \infty lim sup ϵ \to 0 lim sup \frac{ϵ}{a ( ϵ ) ^{2}} lo g P (0 \leq s \leq t \leq T, ∣ t - s ∣ \leq \frac{1}{M} sup ∣ Z_{t}^{ϵ} - Z_{s}^{ϵ} - Z_{t}^{0} + Z_{s}^{0} ∣ \geq δ a (ϵ)) = - \infty.

M \to \infty lim sup ϵ \to 0 lim sup \frac{ϵ}{a ( ϵ ) ^{2}} lo g P (0 \leq s \leq t \leq T, ∣ t - s ∣ \leq \frac{1}{M} sup ∣ Z_{t}^{ϵ} - Z_{s}^{ϵ} - Z_{t}^{0} + Z_{s}^{0} ∣ \geq δ a (ϵ)) = - \infty.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPoint processes and geometric inequalities · Diffusion and Search Dynamics

Full text

Abstract

Hawkes process is a class of simple point processes with self-exciting and clustering properties. Hawkes process has been widely applied in finance, neuroscience, social networks, criminology, seismology, and many other fields. In this paper, we study fluctuations, large deviations and moderate deviations nonlinear Hawkes processes in a new asymptotic regime, the large intensity function and the small exciting function regime. It corresponds to the large baseline intensity asymptotics for the linear case, and can also be interpreted as the asymptotics for the mean process of Hawkes processes on a large network.

Some asymptotic results for nonlinear Hawkes processes

Fuqing Gao 111School of Mathematics and Statistics, Wuhan University, Wuhan 430072, People’s Republic of China; [email protected], Lingjiong Zhu 222Department of Mathematics, Florida State University, 1017 Academic Way, Tallahassee, FL-32306, United States of America; [email protected].

1 Introduction

Let $N$ be a simple point process on $\mathbb{R}$ and let $\mathcal{F}^{-\infty}_{t}:=\sigma(N(C),C\in\mathcal{B}(\mathbb{R}),C\subset(-\infty,t])$ be an increasing family of $\sigma$ -algebras. Any nonnegative $\mathcal{F}^{-\infty}_{t}$ -progressively measurable process $\lambda_{t}$ with

[TABLE]

a.s. for all intervals $(a,b]$ is called an $\mathcal{F}^{-\infty}_{t}$ -intensity of $N$ . We use the notation $N_{t}:=N(0,t]$ to denote the number of points in the interval $(0,t]$ .

A Hawkes process is a simple point process $N$ admitting an $\mathcal{F}^{-\infty}_{t}$ -intensity

[TABLE]

where $\phi(\cdot):\mathbb{R}\rightarrow\mathbb{R}^{+}$ is locally integrable, left continuous, $h(\cdot):\mathbb{R}^{+}\rightarrow\mathbb{R}$ and locally integrable. In (1.1), $\int_{-\infty}^{t-}h(t-s)N(ds)$ stands for $\sum_{\tau<t}h(t-\tau)$ , where $\tau$ are the occurrences of the points before time $t$ . In the literature, $h(\cdot)$ and $\phi(\cdot)$ are usually referred to as exciting function (or sometimes kernel function or self-interaction function) and intensity function respectively, see e.g. [8]. A Hawkes process is linear if the intensity function $\phi(\cdot)$ is linear and it is nonlinear otherwise.

The Hawkes process when $\phi(\cdot)$ is linear was first proposed by Alan Hawkes in 1971 to model earthquakes and their aftershocks [24]. The nonlinear Hawkes process was first introduced by Brémaud and Massoulié [4]. The Hawkes process naturally generalizes the Poisson process and it captures both the self-exciting property and the clustering effect, and it is a very versatile model for statistical analysis. These explain why it has wide applications in neuroscience, genome analysis, criminology, social networks, healthcare, seismology, insurance, finance and many other fields. For a list of references, we refer to [41].

Most of the asymptotic results for Hawkes processes in the literature are the large time limit theorems. For the linear Hawkes process, the functional law of large numbers and functional central limit theorems were studied in Bacry et al. [1]; the large deviations principle was studied in Bordenave and Torrisi [3]; and the moderate deviation principle was obtained in Zhu [43]. The precise large and moderate deviations are recently studied in Gao and Zhu [19]. For the nonlinear Hawkes process, Zhu [42] studied the functional central limit theorems by using Poisson embeddings and a careful analysis of the decay of the correlations over time. In [44], Zhu obtained a process-level, i.e. level-3 large deviation principle and the rate function is expressed as a variational problem optimizing over a certain entropy function of any simple point process against the underlying nonlinear Hawkes process. When the exciting function is exponential and the process is Markovian, an alternative expression for the rate function for the large deviations was obtained in Zhu [45]. Very recently, using the techniques as a combination of Poisson embeddings, Stein’s method and Malliavin calculus, the quantitative Gaussian and Poisson approximations were studied in Torrisi [38, 39]. The Malliavin calculus for Hawkes processes has also appeared in [37]. In the case of linear Hawkes process, the limit theorems for nearly unstable, also known as, nearly critical case, that is, when $\phi(z)=\nu+z$ and $\|h\|_{L^{1}}\approx 1$ are studied in Jaisson and Rosenbaum [26] when the exciting function has light tail and in Jaisson and Rosenbaum [27] when the exciting function has heavy tail.

There have been some progress made in the direction of asymptotic results other than the large time limits. For instance, when the exciting function is exponential, the intensity process and the pair $(N_{t},\lambda_{t})$ are Markovian. In Gao and Zhu [20], they studied the functional central limit theorems for the linear Hawkes process when the initial intensity is large, and they further studied the large deviations and applied their results to insurance and queueing systems in [21]. For the more general linear and non-Markovian case, Gao and Zhu [22] considered the large baseline intensity asymptotic results and studied the applications to queueing systems.

In recent years, the mean-field limit for high dimensional Hawkes processes has also been studied, and it first appeared in Delattre et al. [12]. They showed that under a certain setting, the mean-field limit is an inhomogeneous Poisson process. Other mean-field limit works include Chevallier [8] who studied a generalized Hawkes process model with an inclusion of the dependence on the age of the process, and Delattre and Fournier [11] who studied the mean-field limit for Hawkes processes on a graph with two nodes whether or not influence each other modeled by i.i.d. Bernoulli random variables.

In this paper, we are interested in studying a new asymptotic regime for the nonlinear Hawkes process starting from empty past history, in which the intensity function is large and the exciting function is small. More precisely, we introduce the small parameter $\epsilon>0$ and consider the nonlinear Hawkes process $N_{t}^{\epsilon}$ with intensity:

[TABLE]

In this asymptotic regime, the pair of the intensity function and the exciting function has the transformation $(\phi,h)\mapsto(\frac{1}{\epsilon}\phi,\epsilon h)$ .

Now, let us explain why this asymptotic regime is natural and also point out that such a regime has been studied extensively in many similar settings in the literature.

When the Hawkes process is linear, say $\phi(z)=\nu+z$ , $h(\cdot):\mathbb{R}^{+}\rightarrow\mathbb{R}^{+}$ , where $\nu$ is the baseline intensity, we have

[TABLE]

This gives the intensity of a linear Hawkes process with exciting function $h$ , and a large baseline intensity $\frac{\nu}{\epsilon}$ . Therefore, the asymptotic regime considered in this paper corresponds to the large baseline intensity regime that is studied in Gao and Zhu [22].

The asymptotic regime studied in this paper for the univariate Hawkes process is also equivalent for the asymptotics for the mean process for the high-dimensional multivariate Hawkes process. Our work is related to the mean-field limit for high-dimensional Hawkes processes in [12, 8, 11]. To see the connection of our work with the mean-field limit literature of Hawkes processes, let us first define a multivariate Hawkes process as follows. An $N$ -dimensional Hawkes process $(Z_{t}^{1},\ldots,Z_{t}^{n})$ is an $N$ -dimensional point process admitting an $\mathcal{F}_{t}$ -intensity $(\lambda_{t}^{1},\ldots,\lambda_{t}^{N})$ such that

[TABLE]

where $\phi_{i}(\cdot):\mathbb{R}\rightarrow\mathbb{R}^{+}$ is locally integrable, left continuous, $h_{ij}(\cdot):\mathbb{R}^{+}\rightarrow\mathbb{R}$ and we always assume that $\|h_{ij}\|_{L^{1}}=\int_{0}^{\infty}h_{ij}(t)dt<\infty$ . For the multivariate Hawkes process, a jump in one component will not only increase the intensity of future jumps of its own component, known as the self-exciting property, but also increase the intensity of the future jumps of or the other components that are connected to its own component, which is known as the mutually-exciting property. By using the Poisson embeddings, see e.g [4, 12], we can express the Hawkes process $(Z_{t}^{1},\ldots,Z_{t}^{n})$ as the solution of a Poisson driven SDE:

[TABLE]

where $\{\pi^{i}(ds\,dz),i\geq 1\}$ are a sequence of i.i.d. Poisson measures with common intensity measure $dsdz$ on $[0,\infty)\times[0,\infty)$ . As a special case, for each $N\geq 1$ , we let $h_{ij}=\frac{1}{N}h$ and $\phi_{i}=\phi$ and we consider the Hawkes process $(Z^{N,1}_{t},\dots,Z^{N,N}_{t})_{t\geq 0}$ which can be expressed as

[TABLE]

The mean process of the Hawkes processes is defined by $(Z^{N,1}_{t},\dots,Z^{N,N}_{t})_{t\geq 0}$ :

[TABLE]

It follows from (1.6) that

[TABLE]

where $\sum_{i=1}^{N}\pi^{i}(dsdz)$ is a Poisson measure on $[0,\infty)\times[0,\infty)$ with intensity $N$ .

On the other hand, let us recall that the nonlinear Hawkes process $N_{t}^{\epsilon}$ with the intensity function $\frac{\phi}{\epsilon}$ and the exciting function $\epsilon h$ can be expressed via Poisson embedding as the unique strong solution to the following equation:

[TABLE]

where $\pi(dzds)$ is a Poisson random measure on $[0,\infty)\times[0,\infty)$ with intensity $1$ . In this paper, we are interested in the asymptotics for $Z_{t}^{\epsilon}:=\epsilon N_{t}^{\epsilon}$ , which satisfies the dynamics:

[TABLE]

where $\pi^{\epsilon^{-1}}(dzds)$ is a Poisson random measure on $[0,\infty)\times[0,\infty)$ with intensity $\epsilon^{-1}$ .

By comparing (1.8) with (1.10), it becomes clear that the mean process of an $N$ -dimensional Hawkes process defined in (1.6) has the same dynamics as a univariate Hawkes process with $N=\frac{1}{\epsilon}$ . All the asymptotic results we are going to derive in this paper for the $Z_{t}^{\epsilon}$ process automatically hold for the mean process $\overline{Z}_{t}^{N}$ . We will go back to this in Section 3.

The asymptotic results for the mean process for a high-dimensional Hawkes process in Section 3 can shed some lights for the applications of high-dimensional Hawkes processes in various context. Hawkes processes have been applied to the study of neuroscience, see e.g. neuroscience, see e.g. [31, 32, 33, 35]. More recently, mean-field limits for extended Hawkes processes have been used to model the neural networks in e.g. [8, 15, 9]. The large deviations results in Section 3 can be used to estimate the probability of rare events in a neural network. The moderate deviations results in Section 3 can be used to fill in the gap between the second-order fluctuations and the large deviations regime. We can also use the multivariate Hawkes process of dimension $N$ to represent the loss process for $N$ firms in a large portfolio. The results in Section 3 can be used to provide estimates for the tail probabilities for the loss of a large portfolio. We refer to [10, 13, 23] for the works of large portfolio losses in finance. Note that the results we obtained in Section 3 are for the standard multivariate nonlinear Hawkes processes. In order to apply our results to neural networks in neuroscience, large portfolio losses in finance, and many other contexts, one needs to extend our results for the generalized Hawkes processes suitable for the applications in various contexts. Since there are many different ways to generalize the standard multivariate nonlinear Hawkes processes for the purpose of applications, we restrict the study in this paper to the most standard nonlinear Hawkes processes. Nevertheless, the methodology presented in this paper should be applicable for various extensions.

The scalings in (1.10) for stochastic equations with Poisson noise have been widely studied in the literature, see e.g. Budhiraja et al. [6], Budhiraja et al. [7], Budhiraja et al. [5]. The large deviations and moderate deviations for stochastic equations with jumps can be established usually using the variational representation in [6]. However, in our case, the coefficient of the dynamics (1.10) is a indicator function with path-dependency, which is not continuous. As a result, we cannot apply the results from [6, 7] directly. Instead of pursuing a modification of the variational representation approach in [6, 7], we will adopt a more direct approach to establish large and moderate deviations in our paper.

We organize this paper as follows. In Section 2, we introduce the main results of the paper. We will study fluctuations in Section 2.1, large deviations in Section 2.2 and moderate deviations in Section 2.3. The asymptotic results for the mean process for a high-dimensional Hawkes process are presented in Section 3. Finally, all the proofs will be given in Section 4.

2 Main Results

Before we proceed, let us summarize here a list of key assumptions that will be used throughout the paper.

Assumption 1.

$\phi(\cdot):\mathbb{R}\rightarrow\mathbb{R}^{+}$ * is $\alpha$ -Lipschitz for some $0<\alpha<\infty$ . $h(\cdot):\mathbb{R}_{\geq 0}\rightarrow\mathbb{R}$ is locally integrable and locally bounded.*

Assumption 2.

$h$ * is differentiable and $|h^{\prime}|$ is locally integrable.*

Assumption 3.

$\phi(\cdot)$ * is $\alpha$ -Lipschitz and $\alpha\|h\|_{L^{1}[0,T]}=\alpha\int_{0}^{T}|h(t)|dt<1$ .*

Assumption 4.

$\phi(\cdot)$ * is twice differentiable and $\|\phi^{\prime\prime}\|_{L^{\infty}}=\sup_{x\geq 0}|\phi^{\prime\prime}(x)|<\infty$ .*

Assumption 5.

$\inf_{x\geq 0}\phi(x)>0$ , $h$ is differentiable and $\|h^{\prime}\|_{L^{\infty}[0,T]}=\sup_{t\in[0,T]}|h^{\prime}(t)|<\infty$ .

We collect here a set of notations that will be used throughout the paper.

•

$C[0,T]$ is the space of real-valued continuous functions on $[0,T]$ ;

•

$D[0,T]$ is the space of real-valued càdlàg functions on $[0,T]$ equipped with Skorokhod topology;

•

$\mathcal{AC}_{0}[0,T]$ is the space of functions $f:[0,T]\rightarrow\mathbb{R}$ that are absolutely continuous with $f(0)=0$ ;

•

$\mathcal{AC}_{0}^{+}[0,T]$ is the space of non-decreasing functions $f:[0,T]\rightarrow\mathbb{R}$ that are absolutely continuous with $f(0)=0$ .

2.1 Fluctuations

In this section, we are interested to study the fluctuations of $Z^{\epsilon}$ around its limit $Z^{0}$ . We will obtain a functional central limit theorem for $Z^{\epsilon}$ .

As $\epsilon\rightarrow 0$ , $Z_{t}^{\epsilon}$ will converge on $D[0,T]$ to a deterministic function $Z_{t}^{0}$ that satisfies the equation:

[TABLE]

Indeed, this result will follow from the fluctuation result for $Z_{t}^{\epsilon}$ , that is, we will show that $\frac{Z_{t}^{\epsilon}-Z_{t}^{0}}{\sqrt{\epsilon}}$ converges in distribution on $D[0,T]$ to a non-trivial stochastic limit, which turns out to be a continuous Gaussian process. Let us notice that the equation (2.1) has a unique locally bounded and non-negative solution under certain assumptions, see Delattre [12]. It is interesting that the mean of the inhomogeneous Poisson process as the mean-field limit for high dimensional Hawkes processes leads to the same limiting equation as in (2.1).

Let us define:

[TABLE]

Theorem 6.

Suppose Assumption 1, Assumption 2 and Assumption 4 hold. $X^{\epsilon}$ converges in distribution on $D[0,T]$ to a continuous Gaussian process $X_{t}$ defined by

[TABLE]

where $W_{t}$ is a standard Brownian motion.

Remark 7.

The Gaussian process defined by (2.3) is also a semimartingale and

[TABLE]

Thus, the Gaussian process $X_{t}$ has the following equivalent characterization:

[TABLE]

A key component of the proof of Theorem 6 is the tightness of the sequence $X^{\epsilon}$ on $D[0,T]$ that we will establish in the following lemma.

Lemma 8.

Suppose Assumption 1 and Assumption 2 hold, $X_{t}^{\epsilon}$ is tight on $D[0,T]$ and the all limits are in $C[0,T]$ .

The proof of the tightness of $X^{\epsilon}$ , relies on two auxiliary lemmas. The first lemma, i.e. Lemma 9 gives a uniform bound on the first moment of $Z_{T}^{\epsilon}$ , uniformly in $\epsilon$ , and the second lemma, i.e. Lemma 10, gives us a uniform bound on the second moment of the running maximum of $X^{\epsilon}$ process, uniformly in $\epsilon$ .

Lemma 9.

Suppose Assumption 1 holds.

[TABLE]

Lemma 10.

Suppose Assumption 1 and Assumption 2 hold,

[TABLE]

The proofs of Theorem 6, Lemma 8, Lemma 9 and Lemma 10 will all be given in Section 4.

Remark 11.

Note that [42] studied the large time fluctuations for stationary nonlinear Hawkes processes and more precisely, as a special case for the linear Hawkes process $\phi(x)=\nu+x$ , we have $\frac{N_{nt}-\mu t}{\sqrt{n}}\rightarrow\sigma B(t)$ in distribution on $D[0,T]$ as $n\rightarrow\infty$ , where $\mu=\frac{\nu}{1-\|h\|_{L^{1}}}$ and $\sigma^{2}=\frac{\nu}{(1-\|h\|_{L^{1}})^{3}}$ , and $B(t)$ is a standard Brownian motion. Note that for the large time functional central limit theorem, the limiting variance depends on $\|h\|_{L^{1}}$ only, while in our Theorem 6, it depends on the entire exciting function $h(t)$ for $t\in[0,T]$ . Moreover, in our limit, we obtain a Gaussian process that in general is not a Brownian motion.

2.2 Large deviations

We have already seen that $Z_{t}^{\epsilon}$ converges to the limit $Z_{t}^{0}$ on $D[0,T]$ and have studied the fluctuations around this limit. It is natural to ask about the probability of the rare events that the process $Z_{t}^{\epsilon}$ deviates away from its deterministic limit. That is the question of large deviations in probability theory.

We start by giving a formal definition of the large deviation principle. We refer to Dembo and Zeitouni [14] and Varadhan [40] for general background of large deviations and the applications.

A sequence $(P_{\epsilon})_{\epsilon\in\mathbb{R}^{+}}$ of probability measures on a topological space $X$ satisfies the large deviation principle with rate function $I:X\rightarrow\mathbb{R}$ and speed $b(\epsilon)$ if $I$ is non-negative, lower semicontinuous and for any Borel set $A$ , we have

[TABLE]

Here, $A^{o}$ is the interior of $A$ and $\overline{A}$ is its closure.

Now, we are ready to state the main results of large deviations for $Z_{t}^{\epsilon}$ on $D[0,T]$ .

Theorem 12.

Suppose Assumption 1, Assumption 3 and Assumption 5 hold. Then, $\mathbb{P}(Z_{t}^{\epsilon}\in\cdot)$ satisfies a large deviation principle on $D[0,T]$ equipped with Skorokhod topology with the speed $\epsilon^{-1}$ and the rate function

[TABLE]

if $\eta\in\mathcal{AC}_{0}^{+}[0,T]$ and $+\infty$ otherwise, where

[TABLE]

Instead of establishing a full large deviation principle in Theorem 12 directly, our strategy is to first prove a local large deviation principle in Theorem 13, with the main tool being the change of measure technique for simple point processes. We then establish the exponential tightness in order to obtain a full large deviation principle.

We have the following local large deviation principle.

Theorem 13.

Suppose Assumption 1 and Assumption 5 hold. For any $\eta\in D[0,T]$ ,

[TABLE]

where $I(\eta)$ is defined in (2.8).

Next, let us establish the exponential tightness of the sequence $Z_{t}^{\epsilon}$ on $D[0,T]$ . The following Lemma 14 and Lemma 15, together with the local large deviation principle will provide us the full large deviation principle that is desired.

Lemma 14.

Suppose Assumption 1, Assumption 3 hold. Then,

[TABLE]

Lemma 15.

Suppose Assumption 1, Assumption 3 hold. For any $\delta>0$ ,

[TABLE]

Remark 16.

Theorem 13, Lemma 14 and Lemma 15 provide actually the large deviation principle for $Z_{t}^{\epsilon}$ with respect to the uniform topology on $D[0,T]$ , see e.g. Lemma A.1 in [16], or Theorem 4.14 [17].

Remark 17.

In [3], they obtained a sample path large deviation principle for the large time scaling for Poisson cluster processes. More precisely, the linear Hawkes process with $\phi(x)=\nu+x$ , as a special case of the Poisson cluster process, has the sample path large deviation principle that $\mathbb{P}(\frac{N_{n\cdot}}{n}\in\cdot)$ satisfies a large deviation principle on $D[0,T]$ equipped with the topology of point-wise convergence with the speed $n$ and the rate function $\int_{0}^{T}\mathcal{I}(f^{\prime}(t))dt$ if $f\in\mathcal{AC}_{0}[0,T]$ and $+\infty$ otherwise, where

[TABLE]

for $x\geq 0$ and $+\infty$ otherwise. Note that since the assumption (37) in [3] is not satisfied for the linear Hawkes process, their large deviations results apply to the topology of point-wise convergence, but not the uniform topology. Our results in Theorem 12 differ in two ways. First, our rate function depends on the entire function $h(t)$ , $0\leq t\leq T$ , rather than $\|h\|_{L^{1}}$ as in (2.13). Second, we allow uniform topology for the sample path large deviation principle.

2.3 Moderate Deviations

In this section, we are interested in the moderate deviations for $Z_{t}^{\epsilon}$ . The moderate deviation principle fills in the gap between the central limit theorem and the large deviation principle. For a brief introduction to moderate deviations, we refer to Chap. 3.7. in Dembo and Zeitouni [14].

Our approach to the proof of the moderate deviations is similar to that of the large deviations. That is, we first establish a local moderate deviation principle by using the change of measure technique, i.e. Theorem 19, and then establish the appropriate exponential tightness estimates, i.e. Lemma 20 and Lemma 21.

Our main result is the following:

Theorem 18.

Suppose Assumption 1, Assumption 2, Assumption 3, Assumption 4 and Assumption 5 hold. Let $a(\epsilon)$ be a positive sequence such that $a(\epsilon),\frac{\epsilon}{a(\epsilon)^{2}}\rightarrow 0$ as $\epsilon\rightarrow 0$ . Then, $\mathbb{P}(\frac{Z_{t}^{\epsilon}-Z_{t}^{0}}{a(\epsilon)}\in\cdot)$ satisfies a large deviation principle on $D[0,T]$ equipped with Skorokhod topology with speed $\frac{a(\epsilon)^{2}}{\epsilon}$ and the rate function

[TABLE]

if $\eta\in\mathcal{AC}_{0}[0,T]$ and $+\infty$ otherwise.

We first establish a local moderate deviation principle:

Theorem 19.

Suppose Assumption 1, Assumption 4 and Assumption 5 hold. For any $\eta\in D[0,T]$ ,

[TABLE]

where $J(\eta)$ is given in (2.14).

Next, we establish the exponential tightness of the sequence $\frac{Z_{t}^{\epsilon}-Z_{t}^{0}}{a(\epsilon)}$ on $D[0,T]$ in the following lemmas.

Lemma 20.

Suppose Assumption 1, Assumption 2, Assumption 3 hold.

[TABLE]

Lemma 21.

Suppose Assumption 1, Assumption 2, Assumption 3 hold. For any $\delta>0$ ,

[TABLE]

Remark 22.

(i). Theorem 19, Lemma 20 and Lemma 21 provide actually the moderate deviation principle for $Z_{t}^{\epsilon}$ with respect to the uniform topology on $D[0,T]$ , see e.g. Lemma A.1 in [16], or Theorem 4.14 [17].

(ii). For stochastic dynamics driven by Brownian motion, the limit of its standardization is an Ornstein-Uhlenbeck process driven by the same Brownian motion, and so, the fluctuations and the moderate deviations can be established by estimating deviation inequality of the standardization with the Ornstein-Uhlenbeck process (see,e.g. [18]). That approach cannot be applied to stochastic dynamics with jumps in our paper.

3 Asymptotics for the mean process for high-dimensional Hawkes processes

All the previous results that we derived in Theorem 6, Theorem 12 and Theorem 18 for the univariate Hawkes process can be transferred to the mean process of a multivariate Hawkes process. Consider the $N$ -dimensional multivariate Hawkes process using the Poisson embeddings representation: $(Z^{N,1}_{t},\dots,Z^{N,N}_{t})_{t\geq 0}$ :

[TABLE]

and its mean process $\overline{Z}^{N}_{t}=\frac{1}{N}\sum_{i=1}^{N}Z^{N,i}_{t}$ , which satisfies

[TABLE]

Theorem 6, Theorem 12 and Theorem 18 for the univariate Hawkes process can be transferred to the following Theorem 23, Theorem 24 and Theorem 25 respectively for the mean process of a multivariate Hawkes process.

Theorem 23.

Suppose Assumption 1, Assumption 2 and Assumption 4 hold. Set $X^{N}_{t}:=\sqrt{N}(\overline{Z}_{t}^{N}-m_{t})$ . Then $X^{N}_{t}$ converges in distribution on $D[0,T]$ to a continuous Gaussian process $X_{t}$ defined in Theorem 6.

Theorem 24.

Suppose Assumption 1, Assumption 3 and Assumption 5 hold. $\mathbb{P}(\overline{Z}_{t}^{N}\in\cdot)$ satisfies a large deviation principle on $D[0,T]$ equipped with Skorokhod topology with the speed $N$ and the rate function given in Theorem 12.

Theorem 25.

Suppose Assumption 1, Assumption 2, Assumption 3, Assumption 4 and Assumption 5 hold. Let $a(N)$ be a positive sequence such that $a(N),\frac{N}{a(N)^{2}}\rightarrow\infty$ as $N\rightarrow\infty$ . Then, $\mathbb{P}\left(\frac{\sqrt{N}(Z_{t}^{N}-m_{t})}{a(N)}\in\cdot\right)$ satisfies a large deviation principle on $D[0,T]$ equipped with Skorokhod topology with speed ${a(N)^{2}}$ and the rate function given in Theorem 18.

4 Proofs

4.1 Proofs for Section 2.1

Before we prove Theorem 6, let us first give the proofs of Lemma 9, Lemma 10, and Lemma 8.

Firstly, For any $\theta\in\mathbb{R}$ , $e^{\theta N_{T}^{\epsilon}-\int_{0}^{T}(e^{\theta}-1)\frac{1}{\epsilon}\phi(\int_{0}^{t}\epsilon h(t-s)dN_{s}^{\epsilon})dt}$ is a positive local martingale, hence a supermartingale, and thus for any $\theta>0$ , we have

[TABLE]

Since we assumed that $\alpha\|h\|_{L^{1}[0,T]}<1$ , for sufficiently small $\theta>0$ , we have $\theta-(e^{\theta}-1)\alpha\|h\|_{L^{1}[0,T]}>0$ . It follows that

[TABLE]

In particular, for any $k\geq 1$ ,

[TABLE]

Proof of Lemma 9.

Notice that for any $0\leq t\leq T$ ,

[TABLE]

The result follows from the Gronwall’s inequality. ∎

Proof of Lemma 10.

Notice that

[TABLE]

where

[TABLE]

is a martingale. For any $0\leq t\leq T$ ,

[TABLE]

By Gronwall’s inequality,

[TABLE]

Finally, by Doob’s inequality

[TABLE]

where we have proved in Lemma 9 that $\mathbb{E}[Z_{T}^{\epsilon}]$ is uniformly bounded in $\epsilon$ . ∎

Proof of Lemma 8.

Let us recall that

[TABLE]

where $M_{t}^{\epsilon}$ is a martingale and we can show that for any $\eta>0$ ,

[TABLE]

To show (4.10), w.l.o.g., assume $T/\delta\in\mathbb{N}$ and by using Doob’s martingale inequality, Chebychev’s inequality, and Burkholder-Davis-Gundy inequality, we have

[TABLE]

Note that for every $0\leq t\leq T$ ,

[TABLE]

Recall that we have proved in Lemma 9 that $\mathbb{E}[Z_{T}^{\epsilon}]$ is uniformly bounded in $\epsilon$ . By Gronwall’s inequality, $\mathbb{E}[(Z_{T}^{\epsilon})^{2}]$ is uniformly bounded in $\epsilon$ . Hence, we conclude that (4.10) follows from (4.11) since it goes to zero as $\delta\rightarrow 0$ uniformly in $\epsilon$ .

Moreover, for any $0\leq t\leq t+\delta\leq T$ ,

[TABLE]

It follows from Lemma 10 that the sequence

[TABLE]

is tight on $C[0,T]$ . Hence, for any $\eta>0$ ,

[TABLE]

which implies that the sequence $X_{t}^{\epsilon}$ is tight on $D[0,T]$ and the all limits are in $C[0,T]$ by Theorem 15.5 [2]. ∎

We are now finally ready to give the proof of the fluctuations results in Theorem 6.

Proof of Theorem 6.

We can write

[TABLE]

and

[TABLE]

where $M_{t}^{\epsilon}$ is defined in (4.6) and

[TABLE]

Then, by Doob’s martingale inequality, and Burkholder-Davis-Gundy inequality, there exists a constant $0<C_{T}<\infty$

[TABLE]

where we have proved in Lemma 10 that $\mathbb{E}[(Z_{T}^{\epsilon})^{2}]$ is uniformly bounded in $\epsilon$ . Thus, $\frac{1}{\epsilon^{2}}\mathbb{E}\left[\left(\sup_{0\leq t\leq T}|M_{t}^{\epsilon}|\right)^{4}\right]$ is uniformly bounded in $\epsilon$ , and by (4.7), $\frac{1}{\epsilon^{2}}\mathbb{E}\left[\left(\sup_{0\leq t\leq T}|X_{t}^{\epsilon}|\right)^{4}\right]$ is also uniformly bounded in $\epsilon$ .

By Taylor expansion,

[TABLE]

Therefore, we have that

[TABLE]

are uniformly integrable martingales, and

[TABLE]

These yield that as $\epsilon\to 0$ , in probability,

[TABLE]

and there exists a square integrable martingale $\tilde{M}_{t},t\in[0,T]$ such that

[TABLE]

In Lemma 8, we showed that the sequence $X_{t}^{\epsilon}$ is tight on $D[0,T]$ . Let $X$ be a limit point of $X^{\epsilon}$ , and $X$ is continuous in $t$ . We conclude that,

[TABLE]

and

[TABLE]

are martingales. Since $X_{t}$ is continuous in time $t$ , by Lévy’s characterization of Brownian motion and martingale representation theorem, see e.g. Chapter IV Theorem 3.6. and Chapter V Proposition 3.8. [34], there exists a standard Brownian motion $W_{t}$ , such that (2.3) holds.

By Gronwall’s inequality, the stochastic differential equation (2.3) only has a unique solution which implies that as $\epsilon\to 0$ , the set of limit points of $\{X^{\epsilon}\}$ is a singleton. Thus, $X^{\epsilon}$ converges in distribution on $D[0,T]$ to the solution of the equation (2.3).

Finally, let us show that the limit $X_{t}$ is a Gaussian process. Set $X_{t}^{(0)}:=0$ and

[TABLE]

and for every $n\geq 1$ ,

[TABLE]

Then $\{X_{t}^{(n)},t\in[0,T]\}_{n\geq 1}$ is a sequence of Gaussian processes. Moreover, we can compute that

[TABLE]

where we used the integration by parts and $X_{0}^{(n)}=X_{0}^{(n-1)}=0$ . Set $\Phi^{(n)}(t):=\sup_{0\leq s\leq t}|X_{s}^{(n)}-X_{s}^{(n-1)}|$ . Then for every $t\in[0,T]$ ,

[TABLE]

which implies that

[TABLE]

which yields that

[TABLE]

Thus, almost surely, $\sum_{n=1}^{\infty}\Phi^{(n)}(T)<\infty$ . Thus, by Proposition 6.1 (Chapter 0) in [34], $\tilde{X}_{t}=\sum_{n=0}^{\infty}(X_{t}^{(n+1)}-X_{t}^{(n)})$ is a continuous Gaussian process such that

[TABLE]

Therefore,

[TABLE]

By the uniqueness of the solution of the equation (2.3), we have $\tilde{X}=X$ . Therefore, $\{X_{t},t\in[0,T]\}$ is a Gaussian process. ∎

4.2 Proofs for Section 2.2

Proof of Theorem 12.

Theorem 12 follows from the local large deviation principle in Theorem 13 and the super-exponential estimates in Lemma 14 and Lemma 15. ∎

Proof of Theorem 13.

Set

[TABLE]

Then $\mathcal{M}_{0}[0,T]$ is a closed subset in $D[0,T]$ and $\mathbb{P}(Z^{\epsilon}\in\mathcal{M}_{0}[0,T]\mbox{ for all }\epsilon\in(0,1])=1$ , Thus, for any $\eta\not\in\mathcal{M}_{0}[0,T]$ ,

[TABLE]

Next, we assume that $\eta\in\mathcal{M}_{0}[0,T]$ .

Let $\tilde{\mathbb{P}}$ be the probability measure under which $N^{\epsilon}$ is a standard Poisson process with intensity $\frac{1}{\epsilon}$ . Since $\phi$ is $\alpha$ -Lipschitz, we have

[TABLE]

That is, the intensity has at most the linear growth in $N_{t-}$ . Moreover, under our assumption, we have $\inf_{x\geq 0}\phi(x)>0$ . Thus, $\mathbb{P}$ and $\tilde{\mathbb{P}}$ are equivalent, and the Radon-Nikodym is given by, see e.g. [36],

[TABLE]

By changing of the probability measure $\mathbb{P}$ to $\tilde{\mathbb{P}}$ ,

[TABLE]

For any $\{Z^{\epsilon}_{t},0\leq t\leq T\}$ with $\sup_{0\leq t\leq T}|Z_{t}^{\epsilon}-\eta(t)|\leq\delta$ , we have

[TABLE]

Set $\nu_{\eta}(t)=\phi\left(\int_{0}^{t-}h(t-s)d\eta(s)\right)$ . Then $\nu_{\eta}$ is a finite variation function on $[0,T]$ . The total variation of $\nu_{\eta}(t)$ on $[0,T]$ denotes by $\int_{0}^{T}|d\nu_{\eta}(t)|$ . Then

[TABLE]

and

[TABLE]

It follows from integration by parts that

[TABLE]

On the other hand, we can estimate that

[TABLE]

Finally, we can estimate that

[TABLE]

Since $N^{\epsilon}$ is a standard Poisson process with intensity $\frac{1}{\epsilon}$ under the probability measure $\tilde{\mathbb{P}}$ , it is well known that, see, e.g. [29, 30], $\tilde{\mathbb{P}}(Z_{t}^{\epsilon}\in\cdot)$ satisfies a large deviation principle on $D[0,T]$ with the rate function

[TABLE]

where $\ell(\cdot)$ is defined in (2.9).

Hence, we conclude that

[TABLE]

∎

Proof of Lemma 14.

By (4.2), and Chebychev’s inequality, for sufficiently small $\theta>0$ ,

[TABLE]

which yields the desired result. ∎

Proof of Lemma 15.

Without loss of generality, let us assume that $MT\in\mathbb{N}$ . For any $\delta>0$ ,

[TABLE]

For any $\theta>0$ ,

[TABLE]

Therefore, by Cauchy-Schwarz inequality,

[TABLE]

which is uniform in $1\leq j\leq TM$ . By Chebychev’s inequality,

[TABLE]

It follows from (4.2) that for any sufficiently small $\iota>0$ ,

[TABLE]

where $C(\iota)$ is a positive constant that depends only on $\iota$ , $\alpha$ , $\|h\|_{L^{1}[0,T]}$ , $\phi(0)$ and $T$ .

Let $\gamma$ be a sufficiently small fixed constant, independent of all the other parameters. We define $\theta:=\log(1+\gamma M)$ , and thus

[TABLE]

is sufficiently small since $\gamma$ is sufficiently small and we can apply (4.36) and the Chebychev’s inequality and get

[TABLE]

Since $\theta=\log(1+\gamma M)$ , we get

[TABLE]

which yields the desired result by letting $M\rightarrow\infty$ . ∎

4.3 Proofs for Section 2.3

Proof of Theorem 18.

Theorem 18 follows from the local large deviation principle in Theorem 19 and the super-exponential estimates in Lemma 20 and Lemma 21. ∎

Proof of Theorem 19.

Set

[TABLE]

Then $\mathcal{V}_{0}[0,T]$ is a closed subset in $D[0,T]$ and $\mathbb{P}(Z^{\epsilon}-Z^{0}\in\mathcal{V}_{0}[0,T]\mbox{ for all }\epsilon\in(0,1])=1$ , Thus, for any $\eta\not\in\mathcal{V}_{0}[0,T]$ ,

[TABLE]

Next, we assume that $\eta\in\mathcal{V}_{0}[0,T]$ .

Let $\tilde{\mathbb{P}}$ be the probability measure under which $N^{\epsilon}$ is an inhomogeneous Poisson process with intensity $\frac{1}{\epsilon}\phi\left(\int_{0}^{t}h(t-s)dZ_{s}^{0}\right)$ . Denote by

[TABLE]

By changing the probability measure $\mathbb{P}$ to $\tilde{\mathbb{P}}$ (see the discussions about the change of measure and the Radon-Nikodym derivative in the proof of Theorem 13 and [36]),

[TABLE]

Replacing $\eta$ in the proof of Theorem 13 by $\eta^{\epsilon}$ , we have firstly,

[TABLE]

and secondly,

[TABLE]

and thirdly,

[TABLE]

and finally,

[TABLE]

Thus, by (4.40), (4.41), (4.42), and (4.43), there exists a constant $C$ such that for any $\sup_{0\leq t\leq T}|Z_{t}^{\epsilon}-\eta^{\epsilon}(t)|\leq\delta(\epsilon)$ ,

[TABLE]

Moreover, by a deterministic time change, we have

[TABLE]

where the first equality above holds in distribution, where $Y^{\epsilon}(t):=\epsilon\bar{N}_{t}^{\epsilon}-t$ , and $\bar{N}_{t}^{\epsilon}$ is a standard Poisson process with constant intensity $\frac{1}{\epsilon}$ under the probability measure $\tilde{\mathbb{P}}$ .

It is well known that, see e.g. [30], that $\tilde{\mathbb{P}}(\frac{Y^{\epsilon}}{a(\epsilon)}\in\cdot)$ satisfies a large deviation principle on $D[0,Z_{T}^{0}]$ , see e.g. [30], with the speed $\frac{a^{2}(\epsilon)}{\epsilon}$ and the rate function

[TABLE]

Therefore,

[TABLE]

where $\xi(\cdot)$ is defined via $\xi(Z_{t}^{0})=\eta(t)$ , for every $0\leq t\leq T$ , so that if $\eta\not\in\mathcal{AC}_{0}[0,T]$ , then $J_{Pos}(\xi)=+\infty$ , and if $\eta\in\mathcal{AC}_{0}[0,T]$ , then $\eta^{\prime}(t)=(Z_{t}^{0})^{\prime}\xi^{\prime}(Z_{t}^{0})$ and

[TABLE]

Thus, we have

[TABLE]

Finally, we notice that

[TABLE]

Hence, we conclude that

[TABLE]

Hence, the conclusion of the Theorem 19 holds. ∎

Proof of Lemma 20.

Let us recall that $Z_{t}^{\epsilon}$ satisfies the dynamics:

[TABLE]

where $M_{t}^{\epsilon}$ is a martingale. Therefore, for any $0\leq t\leq T$ ,

[TABLE]

It follows from Gronwall’s inequality that

[TABLE]

For any $\theta>0$ , by Doob’s martingale inequality,

[TABLE]

Let us define

[TABLE]

Then $R_{t}^{\epsilon}$ is a martingale and $R_{0}^{\epsilon}=0$ . Moreover, $|\Delta M^{\epsilon}|\leq\epsilon$ and

[TABLE]

By Lemma 26.19 in Kallenberg [28], if $M$ is a local martingale starting at [math] with $|\Delta M|\leq c$ then $e^{M-b\langle M\rangle}$ is a supermartingale where $b=g(c):=(e^{c}-1-c)c^{-2}$ . Hence,

[TABLE]

Similarly,

[TABLE]

As $\epsilon\rightarrow 0$ , it is easy to see that $c\rightarrow 0$ and $g(2c)\rightarrow\frac{1}{2}$ . Therefore, for sufficiently small $\epsilon$ ,

[TABLE]

where $C(\theta)$ for small $\theta>0$ is defined in the proof of the exponential tightness for the large deviation principle under the assumption that $\alpha\|h\|_{L^{1}[0,T]}<1$ and the fact that $a(\epsilon)^{2}$ is sufficiently small as $\epsilon\rightarrow 0$ . It is easy to check that for some $\bar{C}>0$

[TABLE]

Therefore, we conclude that

[TABLE]

The desired result follows by letting $K\rightarrow\infty$ . ∎

Proof of Lemma 21.

Without loss of generality, let us assume that $MT\in\mathbb{N}$ . For any $\delta>0$ ,

[TABLE]

We can estimate that

[TABLE]

Thus,

[TABLE]

We can compute that for any $\theta>0$ , for sufficiently small $\epsilon>0$ ,

[TABLE]

where the last line uses (4.54). From here, we can further estimate that

[TABLE]

which is uniform in $j$ . Moreover,

[TABLE]

The choice of $\theta>0$ is arbitrary. Let us choose $\theta=\sqrt{M}$ , then

[TABLE]

Finally, by Lemma 20,

[TABLE]

Hence, we have proved the desired result. ∎

Acknowledgements

We are very grateful to the Associate Editor and two anonymous referees for their helpful comments and suggestions. Fuqing Gao acknowledges support from NSFC Grant 11571262 and the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No. 20130141110076). Lingjiong Zhu is grateful to the support from NSF Grant DMS-1613164.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Bacry, E, Delattre, S., Hoffmann, M. and Muzy, J. F. (2013). Scaling limits for Hawkes processes and application to financial statistics. Stochastic Processes and their Applications 123 , 2475-2499.
2[2] Billingsley, P. (1999). Convergence of Probability Measures , 2nd edition. Wiley-Interscience, New York.
3[3] Bordenave, C. and Torrisi, G. L. (2007). Large deviations of Poisson cluster processes. Stochastic Models , 23 , 593-625.
4[4] Brémaud, P. and Massoulié, L. (1996). Stability of nonlinear Hawkes processes. Ann. Probab. , 24 , 1563-1588.
5[5] Budhiraja, A., Chen, J. and Dupuis, P. (2013). Large deviations for stochastic partial differential equations driven by a Poisson random measure. Stochastic Processes and their Applications . 123 , 523-560.
6[6] Budhiraja, A., Dupuis, P. and Maroulas, V. (2011). Variational representations for continuous time processes. Annales de I’Institut Henri Poincaré-Probabilités et Statistiques . 47 , 725-747.
7[7] Budhiraja, A., Dupuis, P. and Ganguly, A.(2016). Moderate deviation principle for stochastic differential equations with jumps. Annals of Probability . 44 , 1723-1775.
8[8] Chevallier, J. (2017). Mean-field limit of generalized Hawkes processes. to appear in Stochastic Processes and their Applications .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Abstract

1 Introduction

2 Main Results

Assumption 1**.**

Assumption 2**.**

Assumption 3**.**

Assumption 4**.**

Assumption 5**.**

2.1 Fluctuations

Theorem 6**.**

Remark 7**.**

Lemma 8**.**

Lemma 9**.**

Lemma 10**.**

Remark 11**.**

2.2 Large deviations

Theorem 12**.**

Theorem 13**.**

Lemma 14**.**

Lemma 15**.**

Remark 16**.**

Remark 17**.**

2.3 Moderate Deviations

Theorem 18**.**

Theorem 19**.**

Lemma 20**.**

Lemma 21**.**

Remark 22**.**

3 Asymptotics for the mean process for high-dimensional Hawkes processes

Theorem 23**.**

Theorem 24**.**

Theorem 25**.**

4 Proofs

4.1 Proofs for Section 2.1

Proof of Lemma 9.

Proof of Lemma 10.

Proof of Lemma 8.

Proof of Theorem 6.

4.2 Proofs for Section 2.2

Proof of Theorem 12.

Proof of Theorem 13.

Proof of Lemma 14.

Proof of Lemma 15.

4.3 Proofs for Section 2.3

Proof of Theorem 18.

Proof of Theorem 19.

Proof of Lemma 20.

Proof of Lemma 21.

Acknowledgements

Assumption 1.

Assumption 2.

Assumption 3.

Assumption 4.

Assumption 5.

Theorem 6.

Remark 7.

Lemma 8.

Lemma 9.

Lemma 10.

Remark 11.

Theorem 12.

Theorem 13.

Lemma 14.

Lemma 15.

Remark 16.

Remark 17.

Theorem 18.

Theorem 19.

Lemma 20.

Lemma 21.

Remark 22.

Theorem 23.

Theorem 24.

Theorem 25.