Cram\'er Type Moderate Deviations for Random Fields

Aleksandr Beknazaryan; Hailin Sang; Yimin Xiao

arXiv:1902.02723·math.ST·July 22, 2019

Cram\'er Type Moderate Deviations for Random Fields

Aleksandr Beknazaryan, Hailin Sang, Yimin Xiao

PDF

TL;DR

This paper investigates Cramér type moderate deviations for partial sums of random fields, utilizing the conjugate method, with applications to linear random fields and nonparametric regression errors.

Contribution

It introduces new results on moderate deviations for random fields, extending classical theory to complex dependence structures and practical regression models.

Findings

01

Established Cramér type moderate deviation results for linear random fields.

02

Extended applicability to nonparametric regression with random field errors.

03

Demonstrated the effectiveness of the conjugate method in this context.

Abstract

We study the Cram\'er type moderate deviation for partial sums of random fields by applying the conjugate method. The results are applicable to the partial sums of linear random fields with short or long memory and to nonparametric regression with random field errors.

Equations272

n \to \infty lim x \in R sup P (S_{n} > x σ n) - (1 - Φ (x)) = 0,

n \to \infty lim x \in R sup P (S_{n} > x σ n) - (1 - Φ (x)) = 0,

\lim_{n\rightarrow\infty}\sup_{0\leq x\leq c_{n}}\bigg{|}\frac{\mathbb{P}(S_{n}>x\sigma\sqrt{n})}{1-\Phi(x)}-1\bigg{|}=0,

\lim_{n\rightarrow\infty}\sup_{0\leq x\leq c_{n}}\bigg{|}\frac{\mathbb{P}(S_{n}>x\sigma\sqrt{n})}{1-\Phi(x)}-1\bigg{|}=0,

\frac{\mathbb{P}(S_{n}>x\sigma\sqrt{n})}{1-\Phi(x)}=\exp\left\{\frac{x^{3}}{\sqrt{n}}\lambda\Big{(}\frac{x}{\sqrt{n}}\Big{)}\right\}\left[1+O\left(\frac{x+1}{\sqrt{n}}\right)\right].

\frac{\mathbb{P}(S_{n}>x\sigma\sqrt{n})}{1-\Phi(x)}=\exp\left\{\frac{x^{3}}{\sqrt{n}}\lambda\Big{(}\frac{x}{\sqrt{n}}\Big{)}\right\}\left[1+O\left(\frac{x+1}{\sqrt{n}}\right)\right].

L_{nj} (z) = lo g E e^{z X_{nj}} of X_{nj} is analytic in D_{n},

L_{nj} (z) = lo g E e^{z X_{nj}} of X_{nj} is analytic in D_{n},

L_{nj} (z) = k = 1 \sum \infty \frac{γ _{k nj}}{k !} z^{k},

L_{nj} (z) = k = 1 \sum \infty \frac{γ _{k nj}}{k !} z^{k},

∣ E X_{nj}^{m} ∣ \leq \frac{m !}{2} σ_{nj}^{2} H_{n}^{2 - m} for all m \geq 2.

∣ E X_{nj}^{m} ∣ \leq \frac{m !}{2} σ_{nj}^{2} H_{n}^{2 - m} for all m \geq 2.

S_{n} = j \in Z^{d} \sum X_{nj}, S_{m, n} = j \in Γ_{m}^{d} \sum X_{nj},

S_{n} = j \in Z^{d} \sum X_{nj}, S_{m, n} = j \in Γ_{m}^{d} \sum X_{nj},

B_{n} = j \in Z^{d} \sum σ_{nj}^{2}, F_{n} (x) = P (S_{n} < x B_{n})

B_{n} = j \in Z^{d} \sum σ_{nj}^{2}, F_{n} (x) = P (S_{n} < x B_{n})

∣ L_{nj} (z) ∣ \leq c_{nj}, \forall z \in C with ∣ z ∣ < H_{n},

∣ L_{nj} (z) ∣ \leq c_{nj}, \forall z \in C with ∣ z ∣ < H_{n},

C_{n} := j \in Z^{d} \sum c_{nj} = O (B_{n} H_{n}^{2}) .

C_{n} := j \in Z^{d} \sum c_{nj} = O (B_{n} H_{n}^{2}) .

\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\Bigg{\{}\frac{x^{3}}{H_{n}\sqrt{B_{n}}}\lambda_{n}\Big{(}\frac{x}{H_{n}\sqrt{B_{n}}}\Big{)}\Bigg{\}}\Bigg{(}1+O\Bigg{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Bigg{)}\Bigg{)},

\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\Bigg{\{}\frac{x^{3}}{H_{n}\sqrt{B_{n}}}\lambda_{n}\Big{(}\frac{x}{H_{n}\sqrt{B_{n}}}\Big{)}\Bigg{\}}\Bigg{(}1+O\Bigg{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Bigg{)}\Bigg{)},

\frac{F_{n}(-x)}{\Phi(-x)}=\exp\Bigg{\{}-\frac{x^{3}}{H_{n}\sqrt{B_{n}}}\lambda_{n}\Big{(}-\frac{x}{H_{n}\sqrt{B_{n}}}\Big{)}\Bigg{\}}\Bigg{(}1+O\Bigg{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Bigg{)}\Bigg{)},

\frac{F_{n}(-x)}{\Phi(-x)}=\exp\Bigg{\{}-\frac{x^{3}}{H_{n}\sqrt{B_{n}}}\lambda_{n}\Big{(}-\frac{x}{H_{n}\sqrt{B_{n}}}\Big{)}\Bigg{\}}\Bigg{(}1+O\Bigg{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Bigg{)}\Bigg{)},

λ_{n} (t) = k = 0 \sum \infty β_{k n} t^{k}

λ_{n} (t) = k = 0 \sum \infty β_{k n} t^{k}

\displaystyle\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{6B_{n}^{3/2}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Big{)}\bigg{)}.

\displaystyle\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{6B_{n}^{3/2}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{H_{n}\sqrt{B_{n}}}\Big{)}\bigg{)}.

1 - Φ (x) < \frac{e ^{- x^{2} /2}}{x 2 π},

1 - Φ (x) < \frac{e ^{- x^{2} /2}}{x 2 π},

\displaystyle 1-F_{n}(x)=\Big{(}1-\Phi(x)\Big{)}\exp\bigg{\{}\frac{x^{3}}{6B_{n}^{3/2}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}\bigg{\}}+O\bigg{(}\frac{e^{-x^{2}/2}}{H_{n}\sqrt{B_{n}}}\bigg{)}.

\displaystyle 1-F_{n}(x)=\Big{(}1-\Phi(x)\Big{)}\exp\bigg{\{}\frac{x^{3}}{6B_{n}^{3/2}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}\bigg{\}}+O\bigg{(}\frac{e^{-x^{2}/2}}{H_{n}\sqrt{B_{n}}}\bigg{)}.

\displaystyle F_{n}(x)-\Phi(x)=O\bigg{(}\frac{e^{-x^{2}/2}}{H_{n}\sqrt{B_{n}}}\bigg{)}.

\displaystyle F_{n}(x)-\Phi(x)=O\bigg{(}\frac{e^{-x^{2}/2}}{H_{n}\sqrt{B_{n}}}\bigg{)}.

\frac{F _{n} ( x + \frac{c}{x} ) - F _{n} ( x )}{1 - F _{n} ( x )} \to 1 - e^{- c}

\frac{F _{n} ( x + \frac{c}{x} ) - F _{n} ( x )}{1 - F _{n} ( x )} \to 1 - e^{- c}

X_{j} = i \in Z^{d} \sum a_{i} ε_{j - i}, j \in Z^{d},

X_{j} = i \in Z^{d} \sum a_{i} ε_{j - i}, j \in Z^{d},

L (z) = k = 1 \sum \infty \frac{γ _{k}}{k !} z^{k},

L (z) = k = 1 \sum \infty \frac{γ _{k}}{k !} z^{k},

S_{n} = j \in Γ_{n}^{d} \sum X_{j} = j \in Z^{d} \sum b_{nj} ε_{j},

S_{n} = j \in Γ_{n}^{d} \sum X_{j} = j \in Z^{d} \sum b_{nj} ε_{j},

B_{n} = σ^{2} j \in Z^{d} \sum b_{nj}^{2}, F_{n} (x) = P (S_{n} < x B_{n}) .

B_{n} = σ^{2} j \in Z^{d} \sum b_{nj}^{2}, F_{n} (x) = P (S_{n} < x B_{n}) .

A := i \in Z^{d} \sum ∣ a_{i} ∣ < \infty, a := i \in Z^{d} \sum a_{i} \neq = 0,

A := i \in Z^{d} \sum ∣ a_{i} ∣ < \infty, a := i \in Z^{d} \sum a_{i} \neq = 0,

a_{i} = l (∣ i ∣) b (i /∣ i ∣) ∣ i ∣^{- α}, i \in Z^{d}, ∣ i ∣ \neq = 0,

a_{i} = l (∣ i ∣) b (i /∣ i ∣) ∣ i ∣^{- α}, i \in Z^{d}, ∣ i ∣ \neq = 0,

∣ L (z) ∣ < C

∣ L (z) ∣ < C

\displaystyle\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{n^{d/2}}\lambda_{n}\Big{(}\frac{x}{n^{d/2}}\Big{)}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{n^{d/2}}\Big{)}\bigg{)},

\displaystyle\frac{1-F_{n}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{n^{d/2}}\lambda_{n}\Big{(}\frac{x}{n^{d/2}}\Big{)}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{n^{d/2}}\Big{)}\bigg{)},

λ_{n} (t) = k = 0 \sum \infty β_{k n} t^{k}

λ_{n} (t) = k = 0 \sum \infty β_{k n} t^{k}

B_{n}^{m} = σ^{2} j \in Z^{d} \sum (b_{nj}^{m})^{2}, F_{n}^{m} (x) = P (S_{n}^{m} < x B_{n}^{m}) .

B_{n}^{m} = σ^{2} j \in Z^{d} \sum (b_{nj}^{m})^{2}, F_{n}^{m} (x) = P (S_{n}^{m} < x B_{n}^{m}) .

\displaystyle\frac{1-F_{n}^{m}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{n^{d/2}}\lambda_{n}^{m}\Big{(}\frac{x}{n^{d/2}}\Big{)}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{n^{d/2}}\Big{)}\bigg{)},

\displaystyle\frac{1-F_{n}^{m}(x)}{1-\Phi(x)}=\exp\bigg{\{}\frac{x^{3}}{n^{d/2}}\lambda_{n}^{m}\Big{(}\frac{x}{n^{d/2}}\Big{)}\bigg{\}}\bigg{(}1+O\Big{(}\frac{x+1}{n^{d/2}}\Big{)}\bigg{)},

λ_{n}^{m} (t) = k = 0 \sum \infty β_{k n}^{m} t^{k},

λ_{n}^{m} (t) = k = 0 \sum \infty β_{k n}^{m} t^{k},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Cramér Type Moderate Deviations for Random Fields

Aleksandr Beknazaryana, Hailin Sanga and Yimin Xiaob

a Department of Mathematics, The University of Mississippi, University, MS 38677, USA. E-mail: [email protected], [email protected]

b Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA. E-mail: [email protected]

**Abbreviated Title: **Moderate deviations for random fields

Abstract

We study the Cramér type moderate deviation for partial sums of random fields by applying the conjugate method. The results are applicable to the partial sums of linear random fields with short or long memory and to nonparametric regression with random field errors.

Keywords: Cramér type moderate deviation, long range dependence, nonparametric regression, spacial linear process, random field.

MSC 2010 subject classification: 60F10, 60G60, 62E20

1 Introduction

In this paper we study the Cramér type moderate deviations for random fields, in particular linear random fields (often called spatial linear processes in statistics literature) with short or long memory (short or long range dependence). The study of moderate deviation probabilities in non-logarithmic form for independent random variables goes back to 1920s. The first theorem in this field was published by Khinchin (1929) who studied a particular case of the Bernoulli random variables. In his fundamental work, Cramér (1938) studied the estimation of the tail probability by the standard normal distribution under the condition that the random variable has moment generating function in a neighborhood of the origin (cf. (3) below). This condition has been referred to as the Cramér condition. Cramér’s work was improved by Petrov (1954) (see also Petrov (1975, 1995)). Their works have stimulated a large amount of research on moderate and large deviations; see below for a brief (and incomplete) review on literature related to this paper. Nowadays, the area of moderate and large deviation deviations is not only important in probability but also plays an important role in many applied fields, for instance, the premium calculation problem, risk management in insurance (cf. Asmussen and Albrecher (2010)), nonparametric estimation in statistics (see, e.g., Bahadur and Rao (1960), van der Vaart (1998), Joutard (2006, 2013)), and in network information theory (cf. Lee et al. (2016, 2017)).

Let $X,X_{1},X_{2},\cdots$ be a sequence of independent and identically distributed (i.i.d.) random variables with mean [math] and variance $\sigma^{2}$ . Let $S_{n}=\sum_{k=1}^{n}X_{k}$ ( $n\geq 1$ ) be the partial sums. By the central limit theorem,

[TABLE]

where $\Phi(x)$ is the probability distribution of the standard normal random variable. If for a suitable sequence $c_{n}$ , we have

[TABLE]

or $\mathbb{P}(S_{n}>x\sigma\sqrt{n})=(1-\Phi(x))(1+o(1))$ uniformly over $x\in[0,\,c_{n}]$ , then Eq. (1) is called moderate deviation probability or normal deviation probability for $S_{n}$ since it can be estimated by the standard normal distribution. We refer to $[0,\,c_{n}]$ as a range for the moderate deviation. The most famous result of this kind is the Cramér type moderate deviation. Under Cramér’s condition, one has the following Cramér’s theorem (Cramér (1938), Petrov (1954; 1975, p.218; or 1995, p.178)): If $x\geq 0$ and $x=o(\sqrt{n})$ then

[TABLE]

Here $\lambda(z)=\sum_{k=0}^{\infty}c_{k}z^{k}$ is a power series with coefficients depending on the cumulants of the random variable $X$ . Eq. (2) provides more precise approximation than (1) which holds uniformly on the range $[0,\,c_{n}]$ for any $c_{n}=o(\sqrt{n})$ . The moderate deviations under Cramér’s condition for independent non-identically distributed random variables were obtained by Feller (1943), Petrov (1954) and Statulevičius (1966). The Cramér type moderate deviation has also been established for the sum of independent random variables with $p$ -th moment, $p>2$ . To name a few, for example, see Rubin and Sethuraman (1965), Nagaev (1965, 1979), Michel (1976), Slastnikov (1978), Amosova (1979), and Frolov (2005). It should be pointed out that the ranges the moderate deviations in these references are smaller (e.g., $c_{n}=O(\sqrt{\log n})$ ).

The Cramér type moderate deviations for dependent random variables have also been studied in the literature. Ghosh (1974), Heinrich (1990) studied the moderate deviation for $m$ -dependent random variables. Ghosh and Babu (1977), Babu and Singh (1978a) studied moderate deviation for mixing processes. Grama (1997), Grama and Haeusler (2000, 2006) and Fan, Grama and Liu (2013) investigated the large and moderate deviations for martingales. Babu and Singh (1978b) established moderate deviation results for linear processes with coefficients satisfying $\sum_{i=1}^{\infty}i|a_{i}|<\infty$ . Wu and Zhao (2008) studied moderate deviations for stationary processes under certain conditions in terms of the physical dependence measure. But it can be verified that the results from Wu and Zhao (2008) can only be applied to linear processes with short memory and their transformations. Recently Peligrad et al. (2013) studied the exact moderate and large deviations for short or long memory linear processes. Sang and Xiao (2018) studied exact moderate and large deviations for linear random fields and applied the moderate result to prove a Davis-Gut law of the iterated logarithm. Nevertheless, in the aforementioned works, the moderate deviations are studied for dependent random variables with $p$ -th moment, $p>2$ . The exact moderate deviation for random fields under Cramér’s condition has not been well studied. For example, the optimal range $[0,c_{n}]$ and the exact rate of convergence in (1) had been unknown in the random field setting.

The main objective of this paper is to establish exact moderate deviation analogous to (2) for random fields under Cramér’s condition. Our main result is Theorem 2.1 below, whose proof is based on the conjugate method to change the probability measure as in the classical case (see, e.g., Petrov (1965, 1975)). The extension of this method to the random field setting reveals the deep relationship between the tail probabilities and the properties of the cumulant generating functions of the random variables such as the analytic radius and the bounds, for $x$ within some ranges related to the sum of the variances and the analytic radius of the cumulant generating functions of these random variables. Compared with the results in Sang and Xiao (2018) for linear random fields, Theorems 2.1 and 3.1 in this paper provide more precise convergence rate in the moderate deviations and explicit information on the range $[0,\,c_{n}]$ , which is much bigger than the range in Theorem 2.1 in Sang and Xiao (2018). In Section 3 we show that Theorem 2.1 is applicable to linear random fields with short or long memory and to nonparametric regression analysis. The results there can be applied to approximate the quantiles and tail conditional expectations for the partial sums of linear random fields.

In this paper we use the following notations. For two sequences $\{a_{n}\}$ and $\{b_{n}\}$ of real numbers, $a_{n}\mathbb{\sim}b_{n}$ means $a_{n}/b_{n}\rightarrow 1$ as $n\rightarrow\infty$ ; $a_{n}\propto b_{n}$ means that $a_{n}/b_{n}\rightarrow C$ as $n\rightarrow\infty$ for some constant $C>0$ ; for positive sequences, the notation $a_{n}\ll b_{n}$ or $b_{n}\gg a_{n}$ means that $a_{n}/b_{n}$ is bounded. For $d,m\in\mathbb{N}$ denote $\Gamma^{d}_{m}=[-m,m]^{d}\cap\mathbb{Z}^{d}$ . Section 2 gives the main results. In Section 3 we study the application of the main results in linear random fields and nonparametric regression. All the proofs go to Section 4.

Acknowledgement The authors are grateful to the referee and the Associate Editor for carefully reading the paper and for insightful suggestions that significantly improved the presentation of the paper. The research of Hailin Sang is supported by the Simons Foundation Grant 586789 and the College of Liberal Arts Faculty Grants for Research and Creative Achievement at the University of Mississippi. The research of Yimin Xiao is partially supported by NSF grants DMS-1612885 and DMS-1607089.

2 Main results

Let $\{X_{nj},\,n\in\mathbb{N},j\in\mathbb{Z}^{d}\}$ be a random field with zero means defined on a probability space $(\Omega,{\mathcal{F}},P)$ . Suppose that for each $n$ , the random variables $X_{nj},\,j\in{\mathbb{Z}}^{d}$ are independent and satisfy the following Cramér condition: There is a positive constant $H_{n}$ such that the cumulant generating function

[TABLE]

where $D_{n}=\{z\in\mathbb{C}:|z|<H_{n}\}$ is the disc of radius $H_{n}$ on the complex plane $\mathbb{C}$ , and $\log$ denotes the principal value of the logarithm so that $L_{nj}(0)=0$ . This setting is convenient for applications to linear random fields in Section 3.

Without loss of generality we assume in this section that $\limsup\limits_{n\to\infty}H_{n}<\infty$ . Within the disc $\{z\in\mathbb{C}:|z|<H_{n}\}$ , $L_{nj}$ can be expanded in a convergent power series

[TABLE]

where $\gamma_{knj}$ is the cumulant of order $k$ of the random variable $X_{nj}$ . We have that $\gamma_{1nj}=\operatorname{\mathbb{E}}X_{nj}=0$ and $\gamma_{2nj}=\operatorname{\mathbb{E}}X_{nj}^{2}=\sigma_{nj}^{2}$ . By Taylor’s expansion, one can verify that a sufficient condition for (3) is the following moment condition

[TABLE]

This condition has been used frequently in probability and statistics, see Petrov (1975, p.55), Johnstone (1999, p.64), Picard and Tribouley (2000, p.301), Zhang and Wong (2003, p.164), among others.

Denote

[TABLE]

and assume that $S_{n}$ is well-defined and $B_{n}<\infty$ for each $n\in\mathbb{N}$ . The following is the main result of this paper.

Theorem 2.1

Suppose that, for all $n\in\mathbb{N}$ and $j\in\mathbb{Z}^{d}$ , there exist non-negative constants $c_{nj}$ such that

[TABLE]

and suppose that $B_{n}H_{n}^{2}\to\infty$ as $n\to\infty$ , and

[TABLE]

If $x\geq 0$ and $x=o(H_{n}\sqrt{B_{n}})$ , then

[TABLE]

where

[TABLE]

is a power series that stays bounded uniformly in $n$ for sufficiently small values of $|t|$ and the coefficients $\beta_{kn}$ only depend on the cumulants of $X_{nj}$ $(n\in\mathbb{Z},j\in\mathbb{Z}^{d})$ .

For the rest of the paper, we only state the results for $x\geq 0$ . Since $\lambda_{n}(t)=\sum_{k=0}^{\infty}\beta_{kn}t^{k}$ stays bounded uniformly in $n$ for sufficiently small values of $|t|$ and $\beta_{0n}=\frac{H_{n}}{6B_{n}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}$ from the proof of Theorem 2.1, we have the following corollary:

Corollary 2.1

Assume the conditions of Theorem 2.1 hold. Then for $x\geq 0$ with $x=O\Big{(}(H_{n}\sqrt{B_{n}})^{1/3}\Big{)}$ we have

[TABLE]

Notice that $\frac{x^{3}}{6B_{n}^{3/2}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}=O(1)$ under the condition $x=O\Big{(}(H_{n}\sqrt{B_{n}})^{1/3}\Big{)}$ . Also taking into the account the fact that for $x>0$

[TABLE]

we obtain the following corollaries:

Corollary 2.2

Under the conditions of Theorem 2.1, we have that for $x\geq 0$ with $x=O\Big{(}(H_{n}\sqrt{B_{n}})^{1/3}\Big{)}$ ,

[TABLE]

Corollary 2.3

Assume the conditions of Theorem 2.1 and $\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}=0$ for all $n\in\mathbb{N}$ . Then for $x\geq 0$ with $x=O\Big{(}(H_{n}\sqrt{B_{n}})^{1/3}\Big{)}$ , we have

[TABLE]

Also as $1-\Phi(x)\sim\frac{1}{x\sqrt{2\pi}}e^{-x^{2}/2}$ , as $x\to\infty$ , we have

Corollary 2.4

Under the conditions of Theorem 2.1, if $x\to\infty$ , $x=o(H_{n}\sqrt{B_{n}})$ , then

[TABLE]

for every positive constant $c$ .

3 Applications

In this section, we provide some applications of the main result in Section 2. First, we derive a moderate deviation result for linear random fields with short or long memory; then we apply this result to risk measures and apply a same argument to study nonparametric regression.

3.1 Cramér type moderate deviation for linear random fields

Let $X=\{X_{j},j\in\mathbb{Z}^{d}\}$ be a linear random field defined on a probability space $(\Omega,{\mathcal{F}},P)$ by

[TABLE]

where the innovations $\varepsilon_{i},i\in\mathbb{Z}^{d}$ , are i.i.d. random variables with mean zero and finite variances $\sigma^{2}$ , and where $\{a_{i},i\in\mathbb{Z}^{d}\}$ is a sequence of real numbers that satisfy $\sum_{i\in\mathbb{Z}^{d}}a_{i}^{2}<\infty$ .

Linear random fields have been studied extensively in probability and statistics. We refer to Sang and Xiao (2018) for a brief review on studies in limit theorems, large and moderate deviations for linear random fields and to Koul et al. (2016), Lahiri and Robinson (2016) and the reference therein for recent developments in statistics.

By applying Theorem 2.1 in Section 2, we establish the following moderate deviation result for linear random fields with short or long memory, under Cramér’s condition on the innovations $\varepsilon_{i},i\in\mathbb{Z}^{d}$ . Compared with the moderate deviation results in Sang and Xiao (2018), our Theorem 3.1 below gives more precise convergence rate which holds on much wider range for $x$ .

Suppose that there is a disc centered at $z=0$ within which the cumulant generating function $L(z)=L_{\varepsilon_{i}}(z)=\log\operatorname{\mathbb{E}}e^{z\varepsilon_{i}}$ of $\varepsilon_{i}$ is analytic and can be expanded in a convergent power series

[TABLE]

where $\gamma_{k}$ is the cumulant of order $k$ of the random variables $\varepsilon_{i},\,i\in\mathbb{Z}^{d}$ . We have that $\gamma_{1}=\operatorname{\mathbb{E}}\varepsilon_{i}=0$ and $\gamma_{2}=\operatorname{\mathbb{E}}\varepsilon_{i}^{2}=\sigma^{2}$ , $i\in\mathbb{Z}^{d}$ .

We write

[TABLE]

where $b_{nj}=\sum_{i\in\Gamma^{d}_{n}}a_{i-j}$ . In the setting of Section 2, we have $X_{nj}=b_{nj}\varepsilon_{j}$ , $j\in{\mathbb{Z}}^{d}$ . Then it can be verified that for all $n\geq 1$ and $j\in{\mathbb{Z}}^{d}$ , $X_{nj}$ satisfy condition (3) for suitably chosen $H_{n}$ . In the notation of Section 2, we have

[TABLE]

Hence, we can apply Theorem 2.1 to prove the following theorem.

Theorem 3.1

Assume that the linear random field $X=\{X_{j},j\in\mathbb{Z}^{d}\}$ has short memory, i.e.,

[TABLE]

or long memory with coefficients

[TABLE]

where $\alpha\in(d/2,d)$ is a constant, $l(\cdot):[1,\infty)\to\mathbb{R}$ is a slowly varying function at infinity and $b(\cdot)$ is a continuous function defined on the unit sphere ${\mathbb{S}}_{d-1}$ . Suppose that there exist positive constants $H$ and $C$ such that

[TABLE]

in the disc $|z|<H$ . Then for all $x\geq 0$ with $x=o(n^{d/2})$ , we have

[TABLE]

where

[TABLE]

is a power series that stays bounded uniformly in $n$ for sufficiently small values of $|t|$ and the coefficients $\beta_{kn}$ only depend on the cumulants of $\varepsilon_{i}$ and on the coefficients $a_{i}$ of the linear random field.

To the best of our knowledge, Theorem 3.1 is the first result that gives the exact tail probability for partial sums of random fields with dependence structure under the Cramér condition.

Due to its preciseness, Theorem 3.1 can be applied to evaluate the performance of approximation of the distribution of linear random fields by truncation. We often use the random variable $X_{j}^{m}=\sum_{i\in\Gamma_{m}^{d}}a_{i}\varepsilon_{j-i}$ with finite terms to approximate the linear random field $X_{j}=\sum_{i\in\mathbb{Z}^{d}}a_{i}\varepsilon_{j-i}$ in practice. For example, the moving average with finite terms $MA(m)$ is applied to approximate the linear process (moving average with infinite terms). In this case, Theorem 3.1 also applies to the partial sum $S_{n}^{m}=\sum_{j\in\Gamma^{d}_{n}}X_{j}^{m}=\sum_{j\in\mathbb{Z}^{d}}b_{nj}^{m}\varepsilon_{j}$ . Here only finite terms $b_{nj}^{m}$ are non-zero. Denote

[TABLE]

Then for all $x\geq 0$ with $x=o(n^{d/2})$ , we have

[TABLE]

where

[TABLE]

and where the coefficients $\beta_{kn}^{m}$ have similar definition as $\beta_{kn}$ . To see the difference between the two tail probabilities of the partial sums, we have

[TABLE]

here as in the proof of Theorem 3.1, we take $M_{n}=\max_{j\in\mathbb{Z}^{d}}|b_{nj}|$ , $H_{n}=\frac{H}{2M_{n}}$ , $M_{n}^{m}=\max_{j\in\mathbb{Z}^{d}}|b_{nj}^{m}|$ , $H_{n}=\frac{H}{2M_{n}^{m}}$ ,

[TABLE]

If $\gamma_{3}\neq 0$ , $\frac{1-F_{n}(x)}{1-F_{n}^{m}(x)}$ is dominated by $\exp\big{\{}\frac{x^{3}}{n^{d/2}}(\beta_{0n}-\beta_{0n}^{m})\big{\}}$ . If $\gamma_{3}=0$ , then $\beta_{0n}=\beta_{0n}^{m}=0$ and $\frac{1-F_{n}(x)}{1-F_{n}^{m}(x)}$ can be dominated by $\exp\big{\{}\frac{x^{4}}{n^{d}}(\beta_{1n}-\beta_{1n}^{m})\big{\}}$ which depends on whether $\gamma_{4}=0$ . In general, Theorem 3.1 can be applied to evaluate whether the truncated version $X_{j}^{m}$ is a good approximation to $X_{j}$ in terms of the ratio $\frac{1-F_{n}(x)}{1-F_{n}^{m}(x)}$ for $x$ in different ranges which depends on the property of the innovation $\varepsilon$ and the sequence $\{a_{i},i\in\mathbb{Z}^{d}\}$ .

Theorem 3.1 can be applied to calculate the tail probability of the partial sum of some well-known dependent models. For example, the autoregressive fractionally integrated moving average FARIMA $(p,\beta,q)$ processes in one dimensional case introduced by Granger and Joyeux (1980) and Hosking (1981), which is defined as

[TABLE]

Here $p,q$ are nonnegative integers, $\phi(z)=1-\phi_{1}z-\cdots-\phi_{p}z^{p}$ is the AR polynomial and $\theta(z)=1+\theta_{1}z+\cdots\theta_{q}z^{q}$ is the MA polynomial. Under the conditions that $\phi(z)$ and $\theta(z)$ have no common zeros, the zeros of $\phi(\cdot)$ lie outside the closed unit disk and $-1/2<\beta<1/2$ , the FARIMA( $p,\beta,q$ ) process has linear process form $X_{n}=\sum_{i=0}^{\infty}{a_{i}\varepsilon_{n-i}},\;\;n\in\mathbb{N},$ with $a_{i}=\frac{\theta(1)}{\phi(1)}\frac{i^{\beta-1}}{\Gamma(\beta)}+O(i^{-1})$ . Here $\Gamma(\cdot)$ is the gamma function.

3.2 Approximation of risk measures

Theorem 3.1 can be applied to approximate the risk measures such as quantiles and tail conditional expectations for the partial sums $S_{n}$ in (8) of linear random field $X=\{X_{j},j\in\mathbb{Z}^{d}\}$ . Given the tail probability $\alpha\in(0,1)$ , let $Q_{\alpha,n}$ be the upper $\alpha$ -th quantile of $S_{n}$ . Namely $P(S_{n}\geq Q_{\alpha,n})=\alpha$ . By Theorem 3.1, for all $x\geq 0$ with $x=o(n^{d/2})$ ,

[TABLE]

We approximate $Q_{\alpha,n}$ by $x_{\alpha}\sqrt{B_{n}}$ , where $x=x_{\alpha}=o(n^{d/2})$ can be solved numerically from the equation

[TABLE]

The tail conditional expectation is computed as

[TABLE]

which can be solved numerically. The quantile and tail conditional expectation, which are also called value at risk (VaR) or expected shortfall (ES) in finance and risk theory, are important measures to model the extremal behavior of random variables in practice. The precise moderate deviation results in this article provide a vehicle in the computation of these two measures of time series or spacial random fields. See Peligrad et al. (2014a) for a brief review of VaR and ES in the literature and a study of them when a linear process has $p$ -th moment ( $p>2$ ) or has a regularly varying tail with exponent $t>2$ .

3.3 Nonparametric regression

Consider the following regression model

[TABLE]

where $g$ is a bounded continuous function on $\mathbb{R}^{m}$ , $z_{n,j}$ ’s are the fixed design points over $\Gamma_{n}^{d}\subseteq\mathbb{Z}^{d}$ with values in a compact subset of $\mathbb{R}^{m}$ , and $X_{n,j}=\sum_{i\in\mathbb{Z}^{d}}a_{i}\varepsilon_{n,j-i}$ is a linear random field over $\mathbb{Z}^{d}$ , where the i.i.d. innovations $\varepsilon_{n,i}$ satisfy the same conditions as in Subsection 3.1. The kernel regression estimation for the function $g$ on the basis of sample pairs $(z_{n,j},Y_{n,j})$ , $j\in\Gamma_{n}^{2}\subset\mathbb{Z}^{2}$ has been studied by Sang and Xiao (2018) under the condition that the i.i.d. innovations $\varepsilon_{n,i}$ satisfy $\|\varepsilon_{n,i}\|_{p}<\infty$ for some $p>2$ and (or) the innovations have regularly varying right tail with index $t>2$ . See Sang and Xiao (2018) for more references in the literature for regression models with independent or weakly dependent random field errors.

We study the kernel regression estimation for the function $g$ on the basis of sample pairs $(z_{n,j},Y_{n,j})$ , $j\in\Gamma_{n}^{d}$ , when the i.i.d. innovations $\varepsilon_{n,i}$ satisfy the conditions as in Subsection 3.1. Same as in Sang and Xiao (2018) and the other references in the literature, the estimator that we consider is given by

[TABLE]

where the weight functions $w_{n,j}(\cdot)$ ’s on $\mathbb{R}^{m}$ have form

[TABLE]

Here $K:\mathbb{R}^{m}\rightarrow\mathbb{R}^{+}$ is a kernel function and $h_{n}$ is a sequence of bandwidths which goes to zero as $n\rightarrow\infty$ . Notice that the weight functions satisfy the condition $\sum_{j\in\Gamma_{n}^{d}}w_{n,j}(z)=1$ .

For a fixed $z\in{\mathbb{R}}^{m}$ , let

[TABLE]

where $b_{n,j}(z)=\sum_{i\in\Gamma_{n}^{d}}w_{n,i}(z)a_{i-j}$ . Let $B_{n}(z)=\sigma^{2}\sum_{j\in\mathbb{Z}^{d}}b_{n,j}^{2}(z)$ , $M_{n}(z)=\max\limits_{j\in\mathbb{Z}^{d}}|b_{nj}(z)|$ . By the same analysis as in the proof of Theorem 3.1, we take $H_{n}\propto M_{n}(z)^{-1}$ and derive a moderate deviation result for $S_{n}(z)=g_{n}(z)-\mathbb{E}g_{n}(z)$ . That is, if $B_{n}(z)H_{n}^{2}\to\infty$ as $n\to\infty$ , $x\geq 0,\,x=o(H_{n}\sqrt{B_{n}(z)})$ , then

[TABLE]

A similar bound can be derived for $P\big{(}|S_{n}(z)|>x\sqrt{B_{n}(z)}\big{)}$ . Notice that these tail probability estimates are more precise than those obtained in Sang and Xiao (2018), where an upper bound for the law of the iterated logarithm of $g_{n}(z)-\mathbb{E}g_{n}(z)$ was derived. With the more precise bound on the tail probability in (14) and certain assumptions on $g$ and the fixed design points $\{z_{n,j}\}$ [cf. Gu and Tran (2009)], one can construct a confidence interval for $g(z)$ .

More interestingly, our method in this paper provides a way for constructing confidence bands for the function $g(z)$ when $z\in T$ , where $T\subset{\mathbb{R}}^{m}$ is a compact interval. Observe that for any $z,z^{\prime}\in T$ , we can write

[TABLE]

Under certain regularity assumption on $g$ and the fixed design points $\{z_{n,j}\}$ [cf. Gu and Tran (2009)], we can apply the argument in Subsection 3.1 to derive exponential upper bound for the tail probability $P\big{(}|S_{n}(z)-S_{n}(z^{\prime})|>x\sqrt{B_{n}(z,z^{\prime})}\big{)},$ where $B_{n}(z,z^{\prime})=\sigma^{2}\sum_{j\in\mathbb{Z}^{d}}\big{(}b_{n,j}(z)-b_{n,j}(z^{\prime})\big{)}^{2}$ . Such a sharp upper bound, combined with the chaining argument [cf. Talagrand (2014)] would allow us to derive an exponential upper bound for

[TABLE]

which can be applied to derive uniform convergence rate of $g_{n}(z)\rightarrow g(z)$ for all $z\in T$ and to construct confidence band for the function $g(z),\,z\in T$ . It is non-trivial to carry out this project rigorously and the verification of the details is a little lengthy. Hence we will have to consider it elsewhere.

4 Proofs

**Proof of Theorem 2.1

**

Since $\gamma_{1nj}=0$ , the cumulant generating function $L_{nj}(z)$ of $X_{nj}$ can be written as

[TABLE]

Cauchy’s inequality for the derivatives of analytic functions together with the condition (4) yields that

[TABLE]

By following the conjugate method (cf. Petrov (1965, 1975)), we now introduce an auxiliary sequence of independent random variables $\{\overline{X}_{nj}\}$ , $j\in\mathbb{Z}^{d}$ , with the distribution functions

[TABLE]

where $V_{nj}(y)=P(X_{nj}<y)$ and $z\in(-H_{n},H_{n})$ is a real number whose value will be specified later.

Denote

[TABLE]

and

[TABLE]

Note that, in the above and below, we have suppressed $z$ for simplicity of notations.

We shall see in the later analysis that the quantities $\overline{S}_{n},\overline{M}_{n}$ and $\overline{B}_{n}$ are well-defined for every $n$ and $z\in{\mathbb{R}}$ with $|z|<aH_{n},$ where $a<1$ is a positive constant which is independent of $n$ . Throughout the proof we will obtain some estimates holding for the values of $z$ satisfying $|z|<bH_{n},$ where the positive constant $b<1$ may vary but is always independent of $n$ . We will then take $a$ to be the smallest one among those constants $b$ . The selection of the constants does not affect the proof since the $z=z_{n}$ we need in the later analysis has property $z=o(H_{n})$ .

Also, the change of the order of summation of double series presented in the proof is justified by the absolute convergence of those series in the specified regions.

**Step 1: Representation of $P(S_{n}<x)$ in terms of the conjugate measure

**

First notice that by equation (2.11) on page 221 of Petrov (1975), for any $m\in\mathbb{N}$ , we have

[TABLE]

Note that the condition (5) implies that $C_{n}<\infty,n\in\mathbb{N}$ . From (15) it follows that for any $w$ with $|w|<\frac{2}{3}H_{n}$ and for any $m\in\mathbb{N}$ we have

[TABLE]

Therefore, for any $v$ with $|v|<\frac{1}{2}H_{n}$ and $z$ with $|z|<\frac{1}{6}H_{n}$ ,

[TABLE]

Hence, $\overline{S}_{n}$ is well-defined and $\overline{S}_{m,n}$ converges to $\overline{S}_{n}$ in distribution or equivalently in probability or almost surely as $m\rightarrow\infty$ .

For the $x$ in $P(S_{n}<x)$ , let $f(y)=\exp\{-zy\}\textbf{1}\{y<x\}$ and $M>0$ . By Markov’s inequality, we have

[TABLE]

Hence, by (17) we have that for $|z|<\frac{1}{6}H_{n}$ ,

[TABLE]

Applying Theorem 2.20 from van der Vaart (1998), we have

[TABLE]

as $m\rightarrow\infty$ . And taking into account that

[TABLE]

and

[TABLE]

as $m\rightarrow\infty$ we obtain from (16) that

[TABLE]

**Step 2: Properties of the conjugate measure

**

From the calculation of (18) it follows that the cumulant generating function $\overline{L}_{nj}(v)$ of the random variable $\overline{X}_{nj}$ exists when $|v|$ is sufficiently small and we have

[TABLE]

$j\in\mathbb{Z}^{d}$ . Denoting by $\overline{\gamma}_{knj}$ the cumulant of order $k$ of the random variable $\overline{X}_{nj}$ , we obtain

[TABLE]

Setting $k=1$ and $k=2$ we find that

[TABLE]

and

[TABLE]

Hence, for $|z|<\frac{1}{2}H_{n}$ , (21) imples

[TABLE]

which means that $\overline{M}_{n}$ is well-defined and, as a function of $z\in\mathbb{C}$ , is analytic in $|z|<\frac{1}{2}H_{n}$ .

Also, without loss of generality, we assume that

[TABLE]

By the definition of $\overline{M}_{n}$ and (21), we have

[TABLE]

It follows from (15) that

[TABLE]

for $|z|<b_{1}H_{n}$ and a suitable positive constant $b_{1}<1$ which is independent of $j$ and $n$ . This together with (25) implies that for $|z|<b_{1}H_{n}$

[TABLE]

Taking into account the condition (24), we get that

[TABLE]

Moreover, (25) implies that for $|z|<\frac{1}{2}H_{n}$ ,

[TABLE]

Also, by the definition of $\overline{B}_{n}$ and (22), we have

[TABLE]

It follows from (15) that

[TABLE]

for $|z|<b_{2}H_{n}$ and a suitable positive constant $b_{2}<1$ which is independent of $j$ and $n$ . This together with (28) implies that for $|z|<b_{2}H_{n}$ , $\overline{B}_{n}$ is well-defined and

[TABLE]

Condition (24) then implies that

[TABLE]

Furthermore, (28) and (15) imply that for $|z|<\frac{1}{2}H_{n}$ ,

[TABLE]

**Step 3: Selection of $z$

**

Let $z=z_{n}$ be the real solution of the equation

[TABLE]

and let

[TABLE]

Then

[TABLE]

By (23) we know that $\frac{\overline{M}_{n}}{H_{n}B_{n}}$ is analytic in a disc $|z|<\frac{1}{2}H_{n}$ and

[TABLE]

in that disc. It follows from Bloch’s theorem (see, e.g., Privalov (1984), page 256) that (33) has a real solution which can be written as

[TABLE]

for

[TABLE]

Moreover, the absolute value of that sum in (34) is less than $\frac{1}{2}H_{n}$ . Condition (5) implies that there exists a disc with center at $t=0$ and radius $R$ that does not depend on $n$ within which the series on the right side of (34) converges.

It can be checked from (33) and (34) that

[TABLE]

Cauchy’s inequality implies that for every $m\in\mathbb{N}$ ,

[TABLE]

Therefore, as $t\to 0$ , $a_{1n}t$ becomes the dominant term of the series in (34). Hence, for sufficiently large $n$ we have

[TABLE]

and taking into account (32) we get

[TABLE]

It follows from (17) and (23) that for $z<\frac{1}{2}H_{n}$ ,

[TABLE]

For the solution $z$ of the equation (31) we also have

[TABLE]

where $\lambda_{n}(t)=\sum_{k=0}^{\infty}\beta_{kn}t^{k}$ with $\beta_{kn}=b_{(k+3)n}(H_{n}^{2}B_{n})^{-1}$ .

Recall that the series $\sum_{m=1}^{\infty}a_{mn}t^{m}$ converges in the disc centered at $t=0$ with radius $R>0$ that does not depend on $n$ , and the absolute value of this sum is less than $\frac{1}{2}H_{n}$ . We see from (37) that the function $\lambda_{n}(t)$ is obtained by the substitution of $\sum_{m=1}^{\infty}a_{mn}t^{m}$ in a series that converges on the interval $(-\frac{1}{2}H_{n},\frac{1}{2}H_{n})$ . It follows from Cauchy’s inequality that

[TABLE]

which means that for $|t|<\frac{1}{2}R$ , $\lambda_{n}(t)$ stays bounded uniformly in $n$ . In particular, by (35) and (37), we have $\beta_{0n}=\frac{H_{n}}{6B_{n}}\sum_{j\in\mathbb{Z}^{d}}\gamma_{3nj}$ .

From now on we will assume that $z$ is the unique real solution of the equation (31).

**Step 4: The case $0\leq x\leq 1$

**

Now we prove the theorem for the case $0\leq x\leq 1$ using the method presented in Petrov and Robinson (2006). Throughout the proof, $C$ denotes a positive constant which may vary from line to line, but is independent of $j,n$ and $z$ . If $f_{n}(s)$ is the characteristic function of $S_{n}/\sqrt{B_{n}}$ we then have that for $|s|<H_{n}\sqrt{B_{n}}/2$

[TABLE]

Then

[TABLE]

Thus, using (15) we get that for $|s|<\delta H_{n}\sqrt{B_{n}}/2$ , with $0<\delta<1$ ,

[TABLE]

Then, for appropriate choice of $\delta$ we have that

[TABLE]

for $|s|<\delta H_{n}\sqrt{B_{n}}/2$ . Now applying Theorem 5.1 from Petrov (1995) with $b=1/\pi$ and $T=\delta H_{n}\sqrt{B_{n}}/2$ we get that

[TABLE]

Since $0\leq x\leq 1$ , $B_{n}H_{n}^{2}\rightarrow\infty$ as $n\rightarrow\infty$ , and $\lambda_{n}\Big{(}\frac{x}{H_{n}\sqrt{B_{n}}}\Big{)}$ is bounded uniformly in $n$ , we have

[TABLE]

Together with condition (5), to have (6) in the case $0\leq x\leq 1$ , it is sufficient to show

[TABLE]

which is given by (38), since $1/2\leq\Phi(x)\leq\Phi(1)$ for $0\leq x\leq 1$ .

So we will limit the proof of the theorem to the case $x>1,x=o(H_{n}\sqrt{B_{n}})$ .

**Step 5: The case $x>1,\;x=o(H_{n}\sqrt{B_{n}})$

**

Making a change of variables $y\rightsquigarrow\overline{M}_{n}+y\sqrt{\overline{B}_{n}}$ and applying (31), we can rewrite (19) as

[TABLE]

Denote $r_{n}(x)=\overline{F}_{n}(x)-\Phi(x)$ and we show that for sufficiently large $n$

[TABLE]

Let $\overline{f}_{n}(s)$ be the characteristic function of $(\overline{S}_{n}-\overline{M}_{n})/\sqrt{\overline{B}_{n}}$ . We then have that

[TABLE]

Then by (20) for $|z|<\frac{1}{2}H_{n}$ and $|s|<H_{n}\sqrt{\overline{B}_{n}}/6$ we have that

[TABLE]

where $0\leq|\theta|\leq 1$ . For $|z|<\frac{1}{2}H_{n}$ and $|s|<\delta H_{n}\sqrt{\overline{B}_{n}}/6$ , with $0<\delta<1$ , we have that

[TABLE]

Thus,

[TABLE]

Then, for appropriate choice of $\delta$ we have that

[TABLE]

for $|s|<\delta H_{n}\sqrt{\overline{B}_{n}}/6$ . Now applying (29) and Theorem 5.1 from Petrov (1995) with $b=1/\pi$ and $T=\delta H_{n}\sqrt{\overline{B}_{n}}/6$ , we have (40).

By (40) we have

[TABLE]

where $|\alpha_{n}|\leq\frac{C}{H_{n}\sqrt{B_{n}}}.$

Denote

[TABLE]

and

[TABLE]

where

[TABLE]

is the Mills ratio which is known to satisfy

[TABLE]

for all $x>0$ . Hence, by (36) and (29) we obtain

[TABLE]

Hence,

[TABLE]

For every $y_{1}<y_{2}$ we have that $\psi(y_{2})-\psi(y_{1})=(y_{2}-y_{1})\psi^{\prime}(u)$ , where $y_{1}<u<y_{2}$ . As for $u>0$ , $|\psi^{\prime}(u)|<u^{-2}$ , then using (5), (36), (26), (27), (29) and (30) we get that

[TABLE]

Hence,

[TABLE]

which means that

[TABLE]

Finally, combining (4), (31), (32), (37), (41) and (42) we get

[TABLE]

By (43) and the fact that $I_{2}=\psi(x)$ , we see that

[TABLE]

This proves (6). The proof of (7) follows a same pattern and is omitted.

Proof of Theorem 3.1

Since $\gamma_{1}=0$ , we see that the cumulant generating function $L_{nj}(z)$ of the random variable $b_{nj}\varepsilon_{j},j\in\mathbb{Z}^{d},$ is given by

[TABLE]

Cauchy’s inequality for the derivatives of analytic functions together with the condition (11) yields that

[TABLE]

Denote $M_{n}=\max\limits_{j\in\mathbb{Z}^{d}}|b_{nj}|$ . Then by (44), for any $H_{n}$ with $0<H_{n}\leq\frac{H}{2M_{n}}$ and for any $z$ with $|z|<H_{n}$ we have

[TABLE]

Hence,

[TABLE]

Then by Theorem 2.1, if $B_{n}H_{n}^{2}\to\infty$ as $n\to\infty$ , we have

[TABLE]

for $x\geq 0,x=o(H_{n}\sqrt{B_{n}})$ .

If the linear random field has long memory then we have that (see Surgailis (1982), Theorem 2) $B_{n}\propto n^{3d-2\alpha}l^{2}(n)$ . As the function $b(\cdot)$ is bounded, then for $j\in\Gamma^{d}_{n}$ we have

[TABLE]

where we have used the fact (see Bingham et al. (1987) or Seneta (1976)) that for a slowly varying function $l(x)$ defined on $[1,\infty)$ and for any $\theta>-1$ ,

[TABLE]

It follows from the definition of $a_{i}$ in (10) that (for sufficiently large $n$ ) $M_{n}=\max\limits_{j\in\mathbb{Z}^{d}}|b_{nj}|$ is attained at some $j\in\Gamma^{d}_{n}$ . Hence, $M_{n}=O(n^{d-\alpha}l(n))$ . We take $H_{n}\propto n^{-d+\alpha}l^{-1}(n)$ which yields

[TABLE]

Then the result follows from (45).

If the linear random field has short memory, i.e., $A:=\sum_{i\in\mathbb{Z}^{d}}|a_{i}|<\infty,\;\;a:=\sum_{i\in\mathbb{Z}^{d}}a_{i}\neq 0,$ we can take $M_{n}=A$ and $H_{n}=\frac{H}{2A}$ . Moreover, we also have

[TABLE]

and

[TABLE]

which means that $\sum_{j\in\mathbb{Z}^{d}}|b_{nj}|\propto n^{d}$ .

As for all $n\in\mathbb{N}$ we have that $|b_{nj}|\leq A$ by the definition of $A$ , then

[TABLE]

On the other hand, for $j\in\Gamma^{d}_{\left\lfloor{n/2}\right\rfloor}$ we have that $|b_{nj}|>|a|/2$ for sufficiently large $n$ . Hence,

[TABLE]

Thus, $\sum_{j\in\mathbb{Z}^{d}}b_{nj}^{2}\propto n^{d}$ and the result follows from (45).

Bibliography49

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Amosova, N. N., 1979. On probabilities of moderate deviations for sums of independent random variables. Teor. Veroyatn. Primen. 24 , 858–865.
2[2] Asmussen, S. and Albrecher, H., 2010. Ruin Probabilities. World Scientific, Hackensack, NJ.
3[3] Babu, G. J. and Singh, K., 1978 a. Probabilities of moderate deviations for some stationary strong-mixing processes. Sankhy a ¯ ¯ 𝑎 \bar{a} Ser. A 40 , 38–43.
4[4] Babu, G. J. and Singh, K., 1978 b. On probabilities of moderate deviations for dependent processes. Sankhy a ¯ ¯ 𝑎 \bar{a} Ser. A 40 , 28–37.
5[5] Bahadur, R. and Rao, R. R., 1960. On deviations of the sample mean. Ann. Math. Statist. 31 , 1015–1027.
6[6] Bingham, N. H., Goldie, C. M. and Teugels, J. L., 1987. Regular Variation . Cambridge University Press, Cambridge, UK.
7[7] Cramér, H., 1938. Sur un nouveau théorème-limite de la théorie des probabilités , Actual. Sci. et Ind., Paris, 736.
8[8] Fan, X., Grama, I. G. and Liu, Q., 2013. Cramér large deviation expansions for martingales under BernsteinÕs condition. Stoch. Process. Appl. 123 , 3919–3942.