Optimal Gamma Approximation on Wiener Space

Ehsan Azmoodeh; Peter Eichelsbacher; Lukas Knichel

arXiv:1902.02658·math.PR·February 8, 2019

Optimal Gamma Approximation on Wiener Space

Ehsan Azmoodeh, Peter Eichelsbacher, Lukas Knichel

PDF

TL;DR

This paper establishes an optimal rate of convergence for Gamma approximation on Wiener space using a novel operator approach to Stein's method, extending previous cumulant-based characterizations.

Contribution

It introduces a new operator theory approach to Stein's method for Gamma approximation, achieving optimal convergence rates in the $d_2$-distance.

Findings

01

Derived an optimal convergence rate in $d_2$-distance for Gamma approximation.

02

Extended cumulant-based characterization to include rate of convergence.

03

Applied the method to quadratic forms as illustrative examples.

Abstract

In \cite{n-p-noncentral}, Nourdin and Peccati established a neat characterization of Gamma approximation on a fixed Wiener chaos in terms of convergence of only the third and fourth cumulants. In this paper, we provide an optimal rate of convergence in the $d_{2}$ -distance in terms of the maximum of the third and fourth cumulants analogous to the result for normal approximation in \cite{n-p-optimal}. In order to achieve our goal, we introduce a novel operator theory approach to Stein's method. The recent development in Stein's method for the Gamma distribution of D\"obler and Peccati (\cite{d-p}) plays a pivotal role in our analysis. Several examples in the context of quadratic forms are considered to illustrate our optimal bound.

Equations304

d_{T V} (F_{n}, N) \leq 2 \frac{q - 1}{3 q} ∣ κ_{4} (F_{n}) ∣ .

d_{T V} (F_{n}, N) \leq 2 \frac{q - 1}{3 q} ∣ κ_{4} (F_{n}) ∣ .

C_{1} max {∣ κ_{3} (F_{n})∣, ∣ κ_{4} (F_{n})∣} ⩽ d_{T V} (F_{n}, N) ⩽ C_{2} max {∣ κ_{3} (F_{n})∣, ∣ κ_{4} (F_{n})∣} .

C_{1} max {∣ κ_{3} (F_{n})∣, ∣ κ_{4} (F_{n})∣} ⩽ d_{T V} (F_{n}, N) ⩽ C_{2} max {∣ κ_{3} (F_{n})∣, ∣ κ_{4} (F_{n})∣} .

\begin{split}d_{1}(F,G(\nu))&\leq C_{\nu,q}\,\sqrt{\Big{|}\left(\kappa_{4}(F)-\kappa_{4}(G(\nu))\right)-12\left(\kappa_{3}(F)-\kappa_{3}(G(\nu)\right)\Big{|}}\leq C^{\prime}_{\nu,q}\sqrt{\mathbf{M}(F)}\end{split}

\begin{split}d_{1}(F,G(\nu))&\leq C_{\nu,q}\,\sqrt{\Big{|}\left(\kappa_{4}(F)-\kappa_{4}(G(\nu))\right)-12\left(\kappa_{3}(F)-\kappa_{3}(G(\nu)\right)\Big{|}}\leq C^{\prime}_{\nu,q}\sqrt{\mathbf{M}(F)}\end{split}

\mathbf{M}(F):=\max\Big{\{}\Big{|}\kappa_{4}(F)-\kappa_{4}(G(\nu))\Big{|},\Big{|}\kappa_{3}(F)-\kappa_{3}(G(\nu))\Big{|}\Big{\}}.

\mathbf{M}(F):=\max\Big{\{}\Big{|}\kappa_{4}(F)-\kappa_{4}(G(\nu))\Big{|},\Big{|}\kappa_{3}(F)-\kappa_{3}(G(\nu))\Big{|}\Big{\}}.

d_{k}(X,Y):=\sup_{h\in\mathcal{H}_{k}}\Big{\lvert}\mathbb{E}[h(X)]-\mathbb{E}[h(Y)]\Big{\rvert}

d_{k}(X,Y):=\sup_{h\in\mathcal{H}_{k}}\Big{\lvert}\mathbb{E}[h(X)]-\mathbb{E}[h(Y)]\Big{\rvert}

C_{1} \leq \frac{d ( F _{n} , G ( ν ))}{ρ ( n )} \leq C_{2}, \forall n \in N .

C_{1} \leq \frac{d ( F _{n} , G ( ν ))}{ρ ( n )} \leq C_{2}, \forall n \in N .

C_{1} M (F) \leq d_{2} (F, G (ν)) \leq C_{2} M (F),

C_{1} M (F) \leq d_{2} (F, G (ν)) \leq C_{2} M (F),

F = q = 0 \sum \infty I_{q} (f_{q}),

F = q = 0 \sum \infty I_{q} (f_{q}),

E [I_{p} (f) I_{q} (g)] = {p! ⟨ f, g ⟩_{H^{\otimes p}} 0 if p = q otherwise .

E [I_{p} (f) I_{q} (g)] = {p! ⟨ f, g ⟩_{H^{\otimes p}} 0 if p = q otherwise .

DF=\sum_{i=1}^{\infty}\frac{\partial g}{\partial x_{i}}\big{(}X(\varphi_{1}),\ldots,X(\varphi_{n})\big{)}\,\varphi_{i}.

DF=\sum_{i=1}^{\infty}\frac{\partial g}{\partial x_{i}}\big{(}X(\varphi_{1}),\ldots,X(\varphi_{n})\big{)}\,\varphi_{i}.

D ϕ (F) = i = 1 \sum m \frac{\partial ϕ}{\partial x _{i}} (F) D F_{i} .

D ϕ (F) = i = 1 \sum m \frac{\partial ϕ}{\partial x _{i}} (F) D F_{i} .

L^{- 1} F = - p = 1 \sum \infty \frac{1}{p} I_{p} (f_{p}) .

L^{- 1} F = - p = 1 \sum \infty \frac{1}{p} I_{p} (f_{p}) .

E [F G] = E [F] E [G] + E [⟨ D G, - D L^{- 1} F ⟩_{H}] .

E [F G] = E [F] E [G] + E [⟨ D G, - D L^{- 1} F ⟩_{H}] .

\kappa_{n}(F)=\frac{1}{i^{n}}\frac{\partial^{n}}{\partial t^{n}}\log\phi_{F}(t)\Big{|}_{t=0}.

\kappa_{n}(F)=\frac{1}{i^{n}}\frac{\partial^{n}}{\partial t^{n}}\log\phi_{F}(t)\Big{|}_{t=0}.

Γ_{i + 1} (F) := ⟨ D Γ_{i} (F), - D L^{- 1} F ⟩_{H}, for i ⩾ 0.

Γ_{i + 1} (F) := ⟨ D Γ_{i} (F), - D L^{- 1} F ⟩_{H}, for i ⩾ 0.

Γ_{a l t, 0} (F) := F and Γ_{a l t, i + 1} (F) := ⟨ D F, - D L^{- 1} Γ_{a l t, i} (F) ⟩_{H}, for i ⩾ 0.

Γ_{a l t, 0} (F) := F and Γ_{a l t, i + 1} (F) := ⟨ D F, - D L^{- 1} Γ_{a l t, i} (F) ⟩_{H}, for i ⩾ 0.

E [Γ_{a l t, j} (F)] = \frac{1}{j !} κ_{j + 1} (F) .

E [Γ_{a l t, j} (F)] = \frac{1}{j !} κ_{j + 1} (F) .

Γ_{j} (F) = Γ_{a l t, j} (F) for all j ⩾ 1.

Γ_{j} (F) = Γ_{a l t, j} (F) for all j ⩾ 1.

F = i = 1 \sum \infty c_{f, i} (N_{i}^{2} - 1),

F = i = 1 \sum \infty c_{f, i} (N_{i}^{2} - 1),

κ_{p} (F) = 2^{p - 1} (p - 1)! i = 1 \sum \infty c_{f, i}^{p} = 2^{p - 1} (p - 1)! ⟨ f, f \otimes_{1}^{(p - 1)} f ⟩_{H} = 2^{p - 1} (p - 1)! Tr (A_{f}^{p})

κ_{p} (F) = 2^{p - 1} (p - 1)! i = 1 \sum \infty c_{f, i}^{p} = 2^{p - 1} (p - 1)! ⟨ f, f \otimes_{1}^{(p - 1)} f ⟩_{H} = 2^{p - 1} (p - 1)! Tr (A_{f}^{p})

κ_{p} (G (ν)) = {0 2^{p - 1} (p - 1)! ν, p = 1;, p ⩾ 2.

κ_{p} (G (ν)) = {0 2^{p - 1} (p - 1)! ν, p = 1;, p ⩾ 2.

\frac{d ^{p} K}{d t ^{p}} (t) = ⎩ ⎨ ⎧ - ν + \frac{ν}{1 - 2 t} \frac{ν}{2} \frac{2 ^{p} ( p - 1 )!}{( 1 - 2 t ) ^{p + 1}}, p = 1;, p ⩾ 2.

\frac{d ^{p} K}{d t ^{p}} (t) = ⎩ ⎨ ⎧ - ν + \frac{ν}{1 - 2 t} \frac{ν}{2} \frac{2 ^{p} ( p - 1 )!}{( 1 - 2 t ) ^{p + 1}}, p = 1;, p ⩾ 2.

\displaystyle\operatorname{Var}\Big{(}\Gamma_{r}(F)-2\Gamma_{r-1}(F)\Big{)}

\displaystyle\operatorname{Var}\Big{(}\Gamma_{r}(F)-2\Gamma_{r-1}(F)\Big{)}

= \frac{1}{( 2 r + 1 )!} κ_{2 r + 2} (F) - \frac{4}{( 2 r )!} κ_{2 r + 1} (F) + \frac{4}{( 2 r - 1 )!} κ_{2 r} (F) .

\operatorname{\overline{\Gamma}}_{r}(F)=2^{r}I_{2}\Big{(}f\mathbin{\otimes_{1}^{(r+1)}}f\Big{)}.

\operatorname{\overline{\Gamma}}_{r}(F)=2^{r}I_{2}\Big{(}f\mathbin{\otimes_{1}^{(r+1)}}f\Big{)}.

\displaystyle\operatorname{Var}\Big{(}\Gamma_{r}(F)-

\displaystyle\operatorname{Var}\Big{(}\Gamma_{r}(F)-

\displaystyle=2^{2r+1}\Big{(}\langle f,f\mathbin{\otimes_{1}^{(2r+1)}}f\rangle_{\mathfrak{H}^{\otimes 2}}-2\,\langle f,f\mathbin{\otimes_{1}^{(2r)}}f\rangle_{\mathfrak{H}^{\otimes 2}}+\langle f,f\mathbin{\otimes_{1}^{(2r-1)}}f\rangle_{\mathfrak{H}^{\otimes 2}}\Big{)}

\displaystyle=2^{2r+1}\operatorname{Tr}\Big{(}A_{f}^{2r+2}-2\,A_{f}^{2r+1}+A_{f}^{2r}\Big{)}.

p_{r} (x) = {\frac{1}{Γ ( r )} x^{r - 1} e^{- x}, 0, if x > 0, otherwise .

p_{r} (x) = {\frac{1}{Γ ( r )} x^{r - 1} e^{- x}, 0, if x > 0, otherwise .

2 (x + ν) f^{'} (x) - x f (x) = h (x) - E [h (G (ν))],

2 (x + ν) f^{'} (x) - x f (x) = h (x) - E [h (G (ν))],

∥ f^{'} ∥_{\infty} = x, y \in R x \neq = y sup \frac{∣ f ( x ) - f ( y )∣}{∣ x - y ∣} \in R \cup {+ \infty} .

∥ f^{'} ∥_{\infty} = x, y \in R x \neq = y sup \frac{∣ f ( x ) - f ( y )∣}{∣ x - y ∣} \in R \cup {+ \infty} .

\big{\|}S(h)\big{\|}_{\infty}\leq\|h^{\prime}\|_{\infty},\quad\text{and}\quad\big{\|}S(h)^{\prime}\big{\|}_{\infty}\leq c_{\nu}\|h^{\prime}\|_{\infty},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Optimal Gamma Approximation on Wiener Space

E. Azmoodeh Ruhr University Bochum, Faculty of Mathematics, IB 2/101, 44780 Bochum, Germany. E-mail: [email protected]

P. Eichelsbacher and L. Knichel Ruhr University Bochum, Faculty of Mathematics, IB 2/115, 44780 Bochum, Germany. E-mail: [email protected] University Bochum, Faculty of Mathematics, IB 2/95, 44780 Bochum, Germany. E-mail: [email protected]. Lukas Knichel has been supported by the German Research Foundation (DFG) via Research Training Group RTG 2131 High dimensional phenomena in probability – fluctuations and discontinuity

Abstract

In [NP09a], Nourdin and Peccati established a neat characterization of Gamma approximation on a fixed Wiener chaos in terms of convergence of only the third and fourth cumulants. In this paper, we provide an optimal rate of convergence in the $d_{2}$ -distance in terms of the maximum of the third and fourth cumulants analogous to the result for normal approximation in [NP15]. In order to achieve our goal, we introduce a novel operator theory approach to Stein’s method. The recent development in Stein’s method for the Gamma distribution of Döbler and Peccati ([DP18]) plays a pivotal role in our analysis. Several examples in the context of quadratic forms are considered to illustrate our optimal bound.

Keywords: Gamma approximation, Wiener chaos, Cumulants/Moments, Weak convergence, Malliavin Calculus, Berry–Esseen bounds, Stein’s method, Wasserstein distances, Quadratic form

MSC 2010: 60F05, 60G50, 60H07

1 Introduction and Main Result
2 Preliminaries: Gaussian Analysis and Malliavin Calculus
2.1 Isonormal Gaussian Processes and Wiener Chaos
2.2 The Malliavin Operators
2.3 Gamma Operators and Cumulants
2.4 Useful facts on Second Wiener Chaos
3 Stein’s Method for the centered Gamma distribution
3.1 Explicit Formula for the Solution of the Stein Equation
3.2 An Operator Theory Approach
4 Optimal Gamma Approximation
4.1 A General Stein-Malliavin Upper Bound
4.2 The Upper Bound: Second Wiener Chaos
4.3 The Lower Bound: Second Wiener Chaos
4.4 Main Result: Non Asymptotic Optimal Gamma Approximation
4.5 Examples
5 Appendix

1 Introduction and Main Result

Let $X=\{X(h):h\in\mathfrak{H}\}$ be an isonormal Gaussian process over a separable Hilbert space $\mathfrak{H}$ on a suitable probability space $(\Omega,\mathscr{F},P)$ . In the landmark article [NP05] Nualart and Peccati discovered an astonishing central limit theorem (CLT) known nowadays as the fourth moment theorem for a sequence of normalized random variables inside a fixed Wiener chaos associated to $X$ . It states that the convergence in distribution towards a standard Gaussian distribution is equivalent to the sole requirement that the fourth moments converge to $3$ . A few years later, their findings have created a fertile line of research, culminating in the popular article [NP09b], introducing the so called Malliavin-Stein approach, an elegant combination of two probabilistic techniques namely Stein method [Ste72, CGS11] and Malliavin calculus [Nua06, NN18] in order to quantify the probability distance between a square integrable Wiener functional and a normal distribution. The reader may consult the excellent monograph [NP12a], as well as the constantly updated web resource https://sites.google.com/site/malliavinstein/home for a huge amount of applications and generalizations of the aforementioned results. Our study is mainly inspired by the following discovery (item (b) of the forthcoming theorem), which presents an optimal version of the fourth moment theorem. For every real-valued random variable $F$ the quantity $\kappa_{r}(F)$ stands for the $r$ th cumulant of $F$ , see section 2.3.

Theorem 1.1 ((Optimal) fourth moment theorem [NP05, NP09b, NP15]).

Fix $q\geq 2$ . Let $\{F_{n}:n\geq 1\}$ be a sequence of random variables in the $q$ th Wiener chaos associated to $X$ such that $\mathbb{E}[F^{2}_{n}]=1$ for every $n\in\mathbb{N}$ . Then

(a)

$F_{n}\to N\sim\mathscr{N}(0,1)$ * in distribution if and only if $\mathbb{E}[F^{4}_{n}]\to 3$ . Also, the following quantitative estimate is in order: for $n\geq 1$ ,*

[TABLE]

(b)

Under the assumptions of item (a) there exist two constants $C_{1}$ and $C_{2}$ (independent of $n$ ) such that the following optimal rate of convergence in total variation distance holds:

[TABLE]

Fix a parameter $\nu>0$ . In this paper, the target distribution of interest is the so called centered Gamma distribution denoted by $G(\nu)\sim CenteredGamma(\nu)$ . This means that $G(\nu)=2\,\widehat{G}(\nu/2)-\nu$ , where $\widehat{G}(\nu/2)$ is a standard Gamma random variable with density $\widehat{g}(x)=x^{\frac{\nu}{2}-1}\,e^{-x}\,\Gamma(\frac{\nu}{2})^{-1}\,\mathds{1}_{(0,\infty)}(x)$ . Here $\Gamma(\nu):=\int_{0}^{+\infty}x^{\nu-1}e^{-x}dx$ denotes the Euler Gamma function. The centered Gamma distribution frequently appears as a natural limiting distribution in the context of the fourth moment theorems in several studies, see for example [ACP14, AMMP16, APP15, KT18, KT12, AS17, ET15, Led12, NR14, NP12b, AMPS17, EV15]. Our principal goal is to provide an optimal rate (analogous to that of item (b) Theorem 1.1) for the Gamma approximation on a fixed Wiener chaos. The statement of the next result is an up-to-date significant improvement over the years of the findings in [NP09a, NP09b, NPR10, DP18].

Theorem 1.2.

Let $\nu>0$ . Fix $q\geq 2$ an even number (see [NP09a, Remark 1.3, item 3] when $q$ is odd). Assume $F=I_{q}(f)$ is a random element in the $q$ th Wiener chaos such that $\mathbb{E}[F^{2}]=2\nu$ . Then there exists a constant $C_{\nu,q}$ (may depend on $\nu$ and $q$ ) such that

[TABLE]

where

[TABLE]

Here $d_{1}$ stands for the so called $1$ -Wasserstein metric (see below for definition). As a consequence, for a sequence $\{F_{n}:n\geq 1\}$ of random variables in the $q$ th Wiener chaos such that $\mathbb{E}[F^{2}_{n}]=2\nu$ for every $n\in\mathbb{N}$ , the following remarkable equivalence of asymptotic statements are in order:

(a)

$F_{n}\rightarrow G(\nu)$ * in distribution.*

(b)

$\kappa_{3}(F_{n})\to 8\nu$ , and $\kappa_{4}(F_{n})\to 48\nu$ .

The exact shape of the constant $C_{\nu,q}$ can be found in the aforementioned references. Note that $\kappa_{3}(G(\nu)),\kappa_{4}(G(\nu))\neq 0$ unlike the case of normal approximation. We also recall the following natural generalization of the $1$ -Wasserstein metric $d_{1}$ that we will make use of throughout the paper. Let $X$ and $Y$ be two real-valued random variables. For $k\geq 2$ , define

[TABLE]

where the class of the test functions is $\mathcal{H}_{k}:=\{h\in C^{k-1}(\mathbb{R}):h^{(k-1)}\in\operatorname{Lip}(\mathbb{R})\text{ and }\lVert h^{(1)}\rVert_{\infty}\leqslant 1,\ldots,\lVert h^{(k)}\rVert_{\infty}\leqslant 1\}$ . Here, $\lVert h^{(k)}\rVert_{\infty}$ denotes the smallest Lipschitz constant of $h^{(k-1)}$ , see (17). A significant and also very challenging question, which we will deal with in this paper, is whether one can either provide an optimal rate or improve the rate (2) available in Theorem 1.2. For a general sequence $\{F_{n}:n\geq 1\}$ and a suitable probability metric $d$ (often we assume that the topology induced by metric $d$ is stronger than convergence in distribution), following [NP12a, Definition 9.2.1], we say that a numerical sequence $\{\rho(n):n\in\mathbb{N}\}$ of strictly positive real numbers, decreasing to [math], yields an optimal rate with respect to the metric $d$ , if there exist two constants $C_{1}$ and $C_{2}$ (independent of $n$ ) such that

[TABLE]

Our main result is the following non asymptotic optimal Gamma approximation within the second Wiener chaos that improves upon the rate (2) by a square power.

Theorem 1.3 (Non asymptotic optimal Gamma approximation).

Let $\nu>0$ , and $G(\nu)\sim CenteredGamma(\nu)$ . Assume that $F$ is a random variable in the second Wiener chaos associated with $X$ , such that $\mathbb{E}[F^{2}]=2\nu$ . Then there exist two constants $0<C_{1}<C_{2}$ (possibly depending on the parameter $\nu$ ) such that

[TABLE]

where the quantity $\mathbf{M}(F)$ is given by (3).

Remark 1.4.

(a)

A significant feature of the optimal rate (4), unlike the one in item (b) of Theorem 1.1 in the normal approximation case, is that it is non asymptotic and a priori does not assume the law of the chaotic random variable $F$ to be close to that of $G(\nu)$ .

(b)

For the upper bound, the starting point is an adaption of the technique developed in [NP15]. However, in order to achieve the optimal upper bound we introduce a novel technique within Stein’s method to split test functions relying on tools from operator theory. This is the topic of section 3.

(c)

Our methodology to obtain the optimal lower bound is based on complex analysis and differs from that in [NP15]. Up to our knowledge this method is new.

(d)

Theorem 1.3 has to be seen as a full generalization of the main findings of [AEK18], where we assumed some additional technical conditions.

The outline of our paper is as follows: In section $2$ , we give a brief introduction to Malliavin calculus on the Wiener space and specify the notation used in the paper. Section $3$ gathers the essential ingredients of Stein’s method for the centered Gamma distribution, developed recently in [DP18]. Section $4$ contains the main theoretical findings of this paper – an upper bound for the $d_{2}$ distance between a general element $F$ living in a finite sum of Wiener chaoses and the target distribution $G(\nu)$ in terms of iterated Gamma operators, as well as the optimal Gamma approximation rate. The end of this section is devoted to applications of our main findings. Lastly, we close the paper with an appendix section with focus on the newly introduced Gamma operators.

2 Preliminaries: Gaussian Analysis and Malliavin Calculus

In this section, we provide a brief introduction to Malliavin calculus and define some of the operators used in this framework. For more details, see for example the textbooks [NP12a, Nua06, NN18].

2.1 Isonormal Gaussian Processes and Wiener Chaos

Let $\mathfrak{H}$ be a real separable Hilbert space with inner product $\langle\cdot,\cdot\rangle_{\mathfrak{H}}$ , and $X=\{X(h):h\in\mathfrak{H}\}$ be an isonormal Gaussian process, defined on some probability space $(\Omega,\mathscr{F},P)$ . This means that $X$ is a family of centered, jointly Gaussian random variables with covariance structure $\mathbb{E}[X(g)X(h)]=\langle g,h\rangle_{\mathfrak{H}}$ . We assume that $\mathscr{F}$ is the $\sigma$ -algebra generated by $X$ . For an integer $q\geqslant 1$ , we will write $\mathfrak{H}^{\otimes q}$ or $\mathfrak{H}^{\odot q}$ to denote the $q$ -th tensor product of $\mathfrak{H}$ , or its symmetric $q$ -th tensor product, respectively. If $H_{q}(x)=(-1)^{q}e^{x^{2}/2}{\frac{d^{q}}{dx^{n}}}e^{-x^{2}/2}$ is the $q$ -th Hermite polynomial, then the closed linear subspace of $L^{2}(\Omega)$ generated by the family $\{H_{q}(X(h)):h\in\mathfrak{H},\lVert h\rVert_{\mathfrak{H}}=1\}$ is called the $q$ -th Wiener chaos of $X$ and will be denoted by $\mathscr{H}_{q}$ . For $f\in\mathfrak{H}^{\odot q}$ , let $I_{q}(f)$ be the $q$ -th multiple Wiener-Itô integral of $f$ (see [NP12a, Definition 2.7.1]). An important observation is that for any $f\in\mathfrak{H}$ with $\lVert f\rVert_{\mathfrak{H}}=1$ we have that $H_{q}(X(f))=I_{q}(f^{\otimes q})$ . As a consequence $I_{q}$ provides an isometry from $\mathfrak{H}^{\odot q}$ onto the $q$ -th Wiener chaos $\mathscr{H}_{q}$ of $X$ . It is a well-known fact, called the Wiener-Itô chaotic decomposition, that any element $F\in L^{2}(\Omega)$ admits the expansion

[TABLE]

where $f_{0}=\mathbb{E}[F]$ and the $f_{q}\in\mathfrak{H}^{\odot q}$ , $q\geqslant 1$ are uniquely determined. An important result is the following isometry property of multiple integrals. Let $f\in\mathfrak{H}^{\odot p}$ and $g\in\mathfrak{H}^{\odot q}$ , where $1\leqslant q\leqslant p$ . Then

[TABLE]

2.2 The Malliavin Operators

We denote by $\mathscr{S}$ the set of smooth random variables, i.e. all random variables of the form $F=g(X(\varphi_{1}),\ldots,X(\varphi_{n}))$ , where $n\geqslant 1$ , $\varphi_{1},\ldots,\varphi_{n}\in\mathfrak{H}$ and $g:\mathbb{R}^{n}\to\mathbb{R}$ is a $C^{\infty}$ -function, whose partial derivatives have at most polynomial growth. For these random variables, we define the Malliavin derivative of $F$ with respect to $X$ as the $\mathfrak{H}$ -valued random element $DF\in L^{2}(\Omega,\mathfrak{H})$ defined as

[TABLE]

The set $\mathscr{S}$ is dense in $L^{2}(\Omega)$ and using a closure argument, we can extend the domain of $D$ to $\mathbb{D}^{1,2}$ , which is the closure of $\mathscr{S}$ in $L^{2}(\Omega)$ with respect to the norm $\lVert F\rVert_{\mathbb{D}^{1,2}}:=\mathbb{E}[F^{2}]+\mathbb{E}[\lVert DF\rVert_{\mathfrak{H}}^{2}]$ . See [NP12a] for a more general definition of higher order Malliavin derivatives and spaces $\mathbb{D}^{p,q}$ . The Malliavin derivative satisfies the following chain-rule. If $\phi:\mathbb{R}^{m}\to\mathbb{R}$ is a continuously differentiable function with bounded partial derivatives and $F=(F_{1},\ldots,F_{m})$ is a vector of elements of $\mathbb{D}^{1,q}$ for some $q$ , then $\phi(F)\in\mathbb{D}^{1,q}$ and

[TABLE]

Note that the conditions on $\phi$ are not optimal and can be weakened. For $F\in L^{2}(\Omega)$ , with chaotic expansion as in (5), we define the pseudo-inverse of the infinitesimal generator of the Ornstein-Uhlenbeck semigroup as

[TABLE]

The following integration by parts formula is one of the main ingredients to proving the main theorem of section 4.1. Let $F,G\in\mathbb{D}^{1,2}$ . Then

[TABLE]

2.3 Gamma Operators and Cumulants

Let $F$ be a random variable with characteristic function $\phi_{F}(t)=\mathbb{E}[e^{itF}]$ . We define its $n$ -th cumulant, denoted by $\kappa_{n}(F)$ , as

[TABLE]

Let $F$ be a random variable with a finite chaos expansion. We define the operators $\Gamma_{i}$ , $i\in\mathbb{N}_{0}$ via $\Gamma_{0}(F):=F$ and

[TABLE]

This is the Gamma operator used in the proof of the main theorem in [NP15], although it is defined differently there. Note that there is also an alternative definition, which can be found in most other papers in this framework, see for example Definition 8.4.1 in [NP12a] or Definition 3.6 in [BBNP12]. For the sake of completeness, we also mention the classical Gamma operators, which we also call alternative Gamma operators, which we shall denote by $\Gamma_{alt}$ . These are defined via

[TABLE]

The classical Gamma operators are related to the cumulants of $F$ by the following identity from [NP10]: For all $j\geqslant 0$ , we have

[TABLE]

If $j\geqslant 3$ , this does not hold anymore for our new Gamma operators. Instead, in our next result, we will list some useful relations between the classical and the new Gamma operators.

Proposition 2.1.

Let $F$ be a centered random variable admitting a finite chaos expansion. Then

(a)

$\Gamma_{1}(F)=\Gamma_{alt,1}(F)$ ,

(b)

$\mathbb{E}\big{[}\Gamma_{j}(F)\big{]}=\mathbb{E}\big{[}\Gamma_{alt,j}(F)\big{]}=\frac{1}{j!}\kappa_{j+1}(F)$ * for $j=1,2$ .*

(c)

$\mathbb{E}\big{[}\Gamma_{3}(F)\big{]}=2\,\mathbb{E}\big{[}\Gamma_{alt,3}(F)\big{]}-\operatorname{Var}\big{(}\Gamma_{1}(F)\big{)}=\frac{1}{3}\kappa_{4}(F)-\operatorname{Var}\big{(}\Gamma_{1}(F)\big{)}$ ,

(d)

When $F=I_{2}(f)$ , for some $f\in\mathfrak{H}^{\odot 2}$ , is an element of the second Wiener chaos, then

[TABLE]

The proofs of these statements can be found in the appendix along with an explicit representation of the Gamma operators in terms of contractions.

2.4 Useful facts on Second Wiener Chaos

Let $F=I_{2}(f)$ , for some $f\in\mathfrak{H}^{\odot 2}$ be a generic element in the second Wiener chaos. It is a classical result (see [NP12a, section 2.7.4]) that these kind of random variables can be analyzed through the associated Hilbert-Schmidt operator $A_{f}:\mathfrak{H}\to\mathfrak{H}$ that maps $g\mapsto f\mathbin{\otimes_{1}}g$ . Denote by $\{c_{f,i}:i\in\mathbb{N}\}$ the set of eigenvalues of $A_{f}$ . We also introduce the following sequence of auxiliary kernels $\Big{\{}f\mathbin{\otimes_{1}^{(p)}}f:p\geqslant 1\Big{\}}\subset\mathfrak{H}^{\odot 2}$ , defined recursively as $f\mathbin{\otimes_{1}^{(1)}}f=f$ , and, for $p\geqslant 2$ , $f\mathbin{\otimes_{1}^{(p)}}f=\Big{(}f\mathbin{\otimes_{1}^{(p-1)}}f\Big{)}\mathbin{\otimes_{1}}f$ .

Proposition 2.2.

*(see e.g. [NP12a, p. 43])

The random element $F$ admits the representation

[TABLE]

where the $(N_{i})$ are i.i.d. $\mathscr{N}(0,1)$ and the series converges in $L^{2}(\Omega)$ and almost surely. 2. 2.

For every $p\geqslant 2$

[TABLE]

*where $\operatorname{Tr}(A^{p}_{f})$ stands for the trace of the * $p$ th power of operator $A_{f}$ .

It is known that when $\nu$ is an integer, $G(\nu)\sim\overline{\chi}^{2}$ is a centered chi-squared random variable with $\nu$ degrees of freedom, and (11) shows that $G(\nu)$ is itself an element of the second Wiener chaos, where $\nu$ -many of the eigenvalues are $1$ and the remaining ones are [math]. Hence, in this case, we deduce from (12) that $\kappa_{p}(G(\nu))=2^{p-1}(p-1)!\,\nu$ . Perhaps not surprisingly, this is also the case when $\nu$ is any positive real number.

Lemma 2.3.

Let $\nu>0$ and $G(\nu)\sim CenteredGamma(\nu)$ . Then

[TABLE]

Proof.

Since the cumulant generating function of a Gamma random variable is well-known, we can easily compute that of $G(\nu)$ to be $K(t)=\frac{\nu}{2}\log\left(\frac{1}{1-2t}\right)-\nu t$ . By simple induction over $p$ , we obtain

[TABLE]

The result now follows by letting $t=0$ . ∎

Lemma 2.4.

Let $F=I_{2}(f)$ for some $f\in\mathfrak{H}^{\odot 2}$ , and denote by $A_{f}$ the corresponding Hilbert-Schmidt operator with eigenvalues $\{c_{f,i}:i\geqslant 1\}$ . Then for every $r\geqslant 1$ ,

[TABLE]

Proof.

From [APP15] equation (24), which follows by induction on $r$ , we have the representation

[TABLE]

Using the isometry property (6), we obtain

[TABLE]

The result now follows with (12). ∎

3 Stein’s Method for the centered Gamma distribution

Let $X_{r}\sim\Gamma(r,1)$ be distributed according to a Gamma distribution with shape parameter $r>0$ . It means that random variable $X_{r}$ admits the density

[TABLE]

Consider the centered Gamma random variable $G(\nu)=2\,X_{\nu/2}-\nu\sim CenteredGamma(\nu)$ . Stein’s method for $X_{\nu/2}$ has first been studied in [Luk94] and then later been refined in [Pic04]. It is well known (see e.g. [DP18, equation (24)]) that the Stein equation for the centered Gamma random variable $G(\nu)$ associated to the test function $h$ is given by the following first order ODE with polynomial coefficients

[TABLE]

where $h:\mathbb{R}\to\mathbb{R}$ is measurable and $\mathbb{E}|h(G(\nu))|<\infty$ . The following result is taken from [DP18, Theorem 2.3] and plays a crucial role in our analysis. For the reader’s convenience we restate it here. We also need the following convention that for every function $f:\mathbb{R}\to\mathbb{R}$ the quantity $\lVert f^{\prime}\rVert_{\infty}$ stands for the smallest Lipschitz constant, i.e.

[TABLE]

It is worth pointing out that $\lVert f^{\prime}\rVert_{\infty}$ coincides with the uniform norm of the derivative of $f$ whenever $f$ is differentiable.

Theorem 3.1.

([DP18, Theorem 2.3]) (a) Let $h$ be a Lipschitz-continuous function on the whole real line $\mathbb{R}$ . Then there exists a unique bounded Lipschitz-continuous solution $S(h)$ to the equation (16) on the whole real line $\mathbb{R}$ satisfying the bounds

[TABLE]

*where the constant $c_{\nu}=\max\{1,\frac{2}{\nu}\}$ .

(b) Suppose that the function $h$ is continuously differentiable on $\mathbb{R}$ such that both $h$ and $h^{\prime}$ are Lipschitz-continuous. Then there is a continuously differentiable solution $S(h)$ of equation (16) on $\mathbb{R}$ whose derivative $S(h)^{\prime}$ is Lipschitz-continuous, and moreover*

[TABLE]

3.1 Explicit Formula for the Solution of the Stein Equation

This section is entirely based on [DP18]. It is known that a Stein equation for the $\Gamma(r,1)$ distribution is given by

[TABLE]

where $h:\mathbb{R}\to\mathbb{R}$ is a measurable test function with $\mathbb{E}|h(X_{r})|<+\infty$ . Döbler and Peccati [DP18, p. 3406] showed that if $h\in\operatorname{Lip}(\mathbb{R})$ , then there exists a unique Lipschitz-continuous function $f_{h}$ on $\mathbb{R}$ solving (18), given by

[TABLE]

where for $x<0$ , $f_{h}^{-}(x)=\frac{1}{xq_{l}(x)}\int_{0}^{x}\Big{(}h(t)-\mathbb{E}\big{[}h(X_{r})\big{]}\Big{)}q_{l}(t)dt$ and $q_{l}(x)=-(-x)^{r-1}e^{-x}$ . Also $f_{h}^{+}(x)=\frac{1}{xp_{r}(x)}\int_{0}^{x}\Big{(}h(t)-\mathbb{E}\big{[}h(X_{r})\big{]}\Big{)}p_{r}(t)dt$ for $x>0$ . Furthermore, one can extend $f_{h}^{-}$ and $f_{h}^{+}$ continuously by setting $f_{h}^{-}(0)=f_{h}^{+}(0):=\frac{h(0)-\mathbb{E}[h(X_{r})]}{r}$ . Now, for a given test function $h:\mathbb{R}\to\mathbb{R}$ , set $h_{1}(x):=h(2x-\nu)$ . Following [DP18, p. 3399], if $f_{h}$ is the solution of (18) (with $r=\nu/2$ ), where $h$ is replaced by $h_{1}$ , then $S(h)(x):=\frac{1}{2}\,f_{h}\left(\frac{x+\nu}{2}\right)$ solves (16). Therefore, the unique bounded solution $S(h)$ of the Stein equation (16) admits the following explicit representation

[TABLE]

where $\hat{p}_{\nu}$ is the density of the centered Gamma distribution $G(\nu)$ given by

[TABLE]

and $\hat{q}(x):=\frac{1}{2}\,q_{l}\left(\frac{x+\nu}{2}\right)=-\,2^{-\frac{\nu}{2}}\big{(}-(x+\nu)\big{)}^{\frac{\nu}{2}-1}\,e^{-\frac{x+\nu}{2}}$ . Also note that

[TABLE]

The following lemma will be used in the proof of Proposition 3.7. Using a simple adaptation, a similar statement also holds for the solution $S(h)$ corresponding to the Stein equation (16) of the centered Gamma distribution $G(\nu)$ .

Lemma 3.2.

Let $X_{r}\sim\Gamma(r,1)$ with cumulative distribution function $F_{r}$ , and $h$ be a Lipschitz-continuous function. Then there exist two non-negative bounded functions $U^{+}$ on $(0,+\infty)$ , and $U^{-}$ on $(-\infty,0]$ such that $U^{\pm}\downarrow 0$ as $x\to\pm\infty$ , and the following estimates are in order:

(a)

for $x>0$ it holds that $\Big{|}f^{\prime}_{h}(x)\Big{|}\leq 2\|h^{\prime}\|_{\infty}U^{+}(x)$ , 2. (b)

for $x<0$ it holds that $\Big{|}f^{\prime}_{h}(x)\Big{|}\leq 2\|h^{\prime}\|_{\infty}U^{-}(x)$ .

Proof.

Let $Q_{l}(x):=\int_{x}^{0}(-q_{l}(y))dy$ . Consider

[TABLE]

It is known that both estimates in parts (a) and (b) take place with $V^{\pm}$ instead of $U^{\pm}$ (see [Döb15, Corollary 3.15. Part (b)], and [DP18, relation (35), page 4304]). Moreover, for $x>r$ , the function $V^{+}$ satisfies

[TABLE]

Also, it is straightforward to check that as $x\to+\infty$ , the function $U^{+}$ is decreasing to [math]. (It is also true that $0\leq U^{+}(x)\leq 1$ for $0<x\leq r$ [DP18, see the top of page 3403]). Part (b) is similar. ∎

3.2 An Operator Theory Approach

Let $a,b\in\mathbb{R}^{+}\cup\{\infty\}$ . Define

[TABLE]

Lemma 3.3.

Let $\mathcal{B}:=\mathcal{B}_{\infty,\infty}$ . For every given $h\in\mathcal{B}$ , define $\lVert f\rVert_{\mathcal{B}}:=\lVert f\rVert_{\infty}+\lVert f^{\prime}\rVert_{\infty}$ . Then $\|\cdot\|_{\mathcal{B}}$ is a norm on the real vector space $\mathcal{B}$ , and furthermore the pair $\left(\mathcal{B},\|\cdot\|_{\mathcal{B}}\right)$ is a Banach space, the so-called Lipschitz-space.

Proof.

It is straightforward to see that the pair $\left(\mathcal{B},\|\cdot\|_{\mathcal{B}}\right)$ is a normed space. Furthermore, it is a classical fact that it is a Banach space, see for example [Wea99, Proposition 6.1.2]. ∎

Lemma 3.4.

Consider the mapping $S:\mathcal{B}\to\mathcal{B}$ such that for every $h\in\mathcal{B}$ , the action $S(h)$ is defined as the unique bounded solution to the centered Gamma Stein equation (16), which is guaranteed to exist by Theorem 3.1 item (a). Then $S(h)\in\mathcal{B}$ , and $S$ is a bounded linear operator from the Banach space $\mathcal{B}$ to itself.

Proof.

Let $h\in\mathcal{B}$ . Then a direct application of Theorem 3.1 item (a) yields that $S(h)\in\mathcal{B}$ . To show linearity of $S$ , take $h_{1},h_{2}\in\mathcal{B}$ , and $\alpha\in\mathbb{R}$ . Then using the Gamma Stein equation (16), together with the fact that $S(h)$ is the unique bounded solution to the latter, we infer that $S(h_{1}+\alpha h_{2})=S(h_{1})+\alpha S(h_{2})$ . For the boundedness of $S:\mathcal{B}\to\mathcal{B}$ we apply Theorem 3.1 part (a) to obtain

[TABLE]

Hence $\|S\|\leq 1+c_{\nu}.$ ∎

Proposition 3.5.

Consider the bounded linear operator $S:\mathcal{B}\to\mathcal{B}$ defined as in Lemma 3.4. Then the following statements are in order.

(a)

The operator $S$ does not admit any non-zero eigenvalue, i.e. if $S(h)=\lambda h$ for some non-zero constant $\lambda\in\mathbb{R}$ , then necessary $h=0$ . 2. (b)

For every non-zero scalar $\lambda\in\mathbb{R}$ , the operator $I+\lambda S:\mathcal{B}\to\mathcal{B}$ is a one to one map, where $I:\mathcal{B}\to\mathcal{B}$ stands for the identity operator.

Proof.

(a) By contrary assume that there exists a non-zero scalar $\lambda\in\mathbb{R}$ such that

[TABLE]

We claim that $h(-\nu)=0$ . Otherwise introduce the auxiliary test function $g=\frac{h}{h(-\nu)}-1$ . Then, obviously, $g\in\mathcal{B}$ , and moreover by virtue of relation (21), we have $S(g)=\lambda(g+1)$ . Furthermore, we have $\mathbb{E}\left[g(G(\nu))\right]=-\lambda\nu$ , because $S(g)(-\nu)=\lambda$ . Therefore, the function $g$ satisfies the first order non-homogeneous ode

[TABLE]

Then general solutions of the ode (22) on the interval $(-\nu,\infty)$ are given by

[TABLE]

where $\beta:=\frac{1-\lambda\nu}{2\lambda}$ . Now, if $\beta<1$ , then as $x\to+\infty$ , we have

[TABLE]

This implies that $g(x)\to+\infty$ as $x\to+\infty$ , which is a contradiction to the fact that $g$ must be a bound function. When $\beta\geq 1$ , i.e. $\tilde{\beta}:=1-\beta\leq 0$ as $x\to+\infty$ , we obtain that for some finite constant $d_{\beta}$ that

[TABLE]

which is either an infinite number or a finite number depending on whether $\tilde{\beta}\in-\mathbb{N}\cup\{0\}$ is a negative integer or not. Therefore, in any case, we have obtained that $g(x)\to+\infty$ as $x\to+\infty$ , which is a contradiction. Hence always $h(-\nu)=0$ . This implies that $\mathbb{E}\left[h(G(\nu))\right]=0$ by using (20). On the other hand, $S(h)=\lambda h$ satisfies the first order ode (16), and therefore

[TABLE]

The general solutions of the ordinary differential equation (24) on the interval $(-\nu,\infty)$ are given by

[TABLE]

for some constant $C_{1}$ . If $C_{1}\neq 0$ , then this is a contradiction to the fact that $S(h)$ is a bounded function over the whole real line. Hence it must hold that $C_{1}=0$ . Similarly, the general solutions of the ordinary differential equation (24) on the interval $(-\infty,-\nu)$ are given by

[TABLE]

where $C_{2}$ is a general constant. Now if $C_{2}\neq 0$ , we infer that $S(h)$ is unbounded on the domain $(-\infty,-\nu)$ , which leads to a contradiction. Therefore $C_{2}=0$ , and as a direct consequence we get $h=0$ .

(b) Assume that $\lambda\neq 0$ is a non-zero scalar. Then the mapping $I+\lambda S:\mathcal{B}\to\mathcal{B}$ is a linear operator. Hence, $I+\lambda S$ is a one to one map if and only if $\operatorname{Ker}\left(I+\lambda S\right)=\{0\}$ , and the latter follows at once from part (a). ∎

Lemma 3.6.

Let $f_{n}:[a,b]\to\mathbb{R}$ be a sequence of $L$ -Lipschitz continuous functions for every $n\in\mathbb{N}$ : i.e. for all $x,y\in[a,b]$ , and every $n$ ,

[TABLE]

Assume further that $f_{n}\to f$ pointwise as $n$ tends to infinity. Then $f$ is also an $L$ -Lipschitz function and $f_{n}\to f$ uniformly.

Proof.

It is elementary. ∎

Proposition 3.7.

The bounded linear operator $S:\mathcal{B}\to\mathcal{B}$ defined as in Lemma 3.4 is a compact operator.

Proof.

Let $U_{\mathcal{B}}:=\{h\in\mathcal{B}:\lVert h\rVert_{\mathcal{B}}=\|h\|_{\infty}+\|h^{\prime}\|_{\infty}\leq 1\}$ denote the unit ball of the Banach space $\mathcal{B}$ . We need to show that the image $S\left(U_{\mathcal{B}}\right)$ of the unit ball is a precompact set in $\mathcal{B}$ , or equivalently, that every sequence $\{S(h_{n})\}_{n\geq 1}\subseteq S(U_{\mathcal{B}})$ has a convergent subsequence in the topology of the Banach space $\mathcal{B}$ . We divide the rest of the proof in three steps.

Step (1): First we show that there exists a subsequence $\{h_{n_{k}}\}_{k\geq 1}$ such that $h_{n_{k}}\to h$ pointwise for some $h\in U_{\mathcal{B}}$ . Moreover $S(h_{n_{k}})\to S(h)$ , and $S(h_{n_{k}})^{\prime}\to S(h)^{\prime}$ pointwise. Note that $\{h_{n}\}_{n\geq 1}\subseteq U_{\mathcal{B}}$ is a bounded subset of $\mathcal{B}$ . It is well known (see for example [Wea99, Chapter 2] or [Wea18, Theorem 2.4, and Proposition 2.1] as well as the survey [God15]) that the Banach space $\mathcal{B}$ is a predual space, i.e. there exists a (unique) Banach space $\text{\AE}(\mathbb{R})$ , the so called Arens-Eells space, such that $\text{\AE}(\mathbb{R})^{*}=\mathcal{B}$ . On the other hand, the Banach-Alaoglu theorem implies that the unit ball $U_{\mathcal{B}}$ is weak-∗ compact. Moreover, $\mathbb{R}$ is a separable Banach space, so the Arens-Eells Banach space $\text{\AE}(\mathbb{R})$ is, too [God15]. Hence the weak-∗topology on $U_{\mathcal{B}}$ is metrizable. Therefore, weak-∗ compact is the same as weak-∗ sequentially compact on the unit ball $U_{\mathcal{B}}$ . It follows that the sequence $\{h_{n}\}_{n\geq 1}$ contains a subsequence that converges in the weak-∗ topology to an element $h\in U_{\mathcal{B}}$ . Without loss of generality, we assume that the subsequence is given by the sequence itself. Hence there exists an element $h\in U_{\mathcal{B}}$ such that $h_{n}\to h$ in the $\text{weak}^{*}$ -topology. Furthermore, the weak-∗ topology on the bounded subsets of $\mathcal{B}$ coincides with the topology of pointwise convergence, see [Wea18, Proposition 2.1]. As a consequence, $h_{n}\to h$ pointwise (here one should not expect that $h_{n}\to h$ weakly; otherwise this implies that the unit ball is weakly sequentially compact, and therefore the Banach space $\mathcal{B}$ is reflexive which is a contradiction). An application of the Lebesgue dominated convergence theorem implies that $S(h_{n})\to S(h)$ pointwise. Taking into account these observations together with the fact that for every $n\in\mathbb{N}$ we have

[TABLE]

there exists a function $f$ such that $S(h_{n})^{\prime}\to f$ pointwise. On the other hand, for every $x\in\mathbb{R}$ we have that

[TABLE]

Recall that $h\in U_{\mathcal{B}}$ . Hence, the function $S(h)$ satisfies the Gamma Stein equation

[TABLE]

Hence $f=S(h)^{\prime}$ , and also $S(h_{n})^{\prime}\to S(h)^{\prime}$ pointwise.

Step (2): In this step, we show that $S(U_{\mathcal{B}})\subseteq C_{0}(\mathbb{R})$ is a family of functions having the equivanishing at infinity property, i.e. for every given $\varepsilon>0$ , there exists a compact interval $K\subset\mathbb{R}$ such that $\big{|}f(x)\big{|}<\varepsilon$ for all $f\in S(U_{\mathcal{B}})$ and for all $x\notin K$ . To do this, we use the explicit integral representation (19). Note that since $\lVert h\rVert_{\infty}\leq 1$ , we have $\lvert h(t)-\mathbb{E}[h(G_{\nu})]\rvert\leqslant 2$ for all $t\in\mathbb{R}$ . When $x>-\nu$ , then (recall that $\hat{p}_{\nu}$ is the density of $G(\nu)$ ):

[TABLE]

Now if $\nu\leqslant 2$ , then $\left(\frac{t+\nu}{x+\nu}\right)^{\nu/2-1}\leqslant 1$ and thus

[TABLE]

When $\nu>2$ , set $r:=\lceil\nu/2-1\rceil$ . We have

[TABLE]

where $P$ is a polynomial of degree $r$ . Since we always have $r<\nu/2$ , it follows that $\lim_{x\to\infty}\lvert S(h)(x)\rvert=0$ . When $x<-\nu$ , again using (19) of the explicit representation of the solution function $S(h)$ , we get

[TABLE]

Hence, the case $x\to-\infty$ can now be discussed similarly. Note that the upper bounds for $\lvert S(h)(x)\rvert$ that we found do not depend on the choice of the test function $h$ . Therefore, we have shown that, in addition to $S(U_{\mathcal{B}})\subseteq C_{0}(\mathbb{R})$ , the collection $S(U_{\mathcal{B}})$ is a family of functions that are equivanishing at infinity.

Step (3): Next we show that as $n\to\infty$ ,

[TABLE]

By Step $(2)$ , for a given $\varepsilon>0$ , there exists a compact interval $K\subset\mathbb{R}$ such that

[TABLE]

On the other hand, the family $(S(h_{n}):n\geq 1)$ consists of $1$ -Lipschitz-continuous functions (see part (a), Theorem 3.1), and by step (1) converges pointwise to $S(h)$ on the compact interval $K$ . Hence, Lemma 3.6 yields that

[TABLE]

Finally relations (28) and (29) readily imply that $S(h_{n})\to S(h)$ uniformly on the real line. Now, we are left to show that $\|S(h_{n})^{\prime}-S(h)^{\prime}\|_{\infty}\to 0$ . To this end, first note that for every $h\in U_{\mathcal{B}}$ , and every $x\neq y\in\mathbb{R}$ it holds that $|S(h)^{\prime}(x)-S(h)^{\prime}(y)|\leq c_{\nu}\|h^{\prime}\|_{\infty}|x-y|\leq c_{\nu}|x-y|$ . Hence, the family $\{S(h_{n})^{\prime},S(h)^{\prime}\,:n\geq 1\}$ consists of $c_{\nu}$ -Lipschitz continuous functions. On the other hand, Lemma 3.2 yields that the family $\{S(h_{n})^{\prime},S(h)^{\prime}\,:n\geq 1\}$ is equivanishing at infinity. The result now follows.

∎

Theorem 3.8.

Let $\lambda\in\mathbb{R}$ be a non-zero scalar. Then for every $h\in\mathcal{B}$ there exists a unique solution $g\in\mathcal{B}$ to the functional equation

[TABLE]

Proof.

This is a direct application of Propositions 3.5, 3.7, and the classical Fredholm alternative Theorem [Meg98, 3.4.24, page 329]. ∎

For $r>0$ , let $U_{\mathcal{B}}(r):=\{h\in\mathcal{B}:\|h\|_{\mathcal{B}}\leq r\}$ denote the ball of radius $r$ .

Proposition 3.9.

Let $r_{1}>0$ , and $\lambda\in\mathbb{R}$ be a non-zero scalar. Then there exists a universal constant $r_{2}$ (may depend on $r_{1}$ , $\lambda$ , and $\nu$ ) such that for every $h\in U_{\mathcal{B}}(r_{1})$ the unique solution $g$ of the functional equation (30) satisfies $\|g\|_{\mathcal{B}}\leq r_{2}$ .

Proof.

From Proposition 3.5 and Theorem 3.8, the linear bounded operator $I+\lambda S:\mathcal{B}\to\mathcal{B}$ is a bijective map. Hence the result follows at once using the inverse mapping Theorem [Meg98, 1.6.6 Corollary]. ∎

4 Optimal Gamma Approximation

4.1 A General Stein-Malliavin Upper Bound

In the following, we present a general Malliavin-Stein upper bound that constitutes the cornerstone to achieve our final optimal goal. We start with the following useful result. Sometimes, we will use centered versions of the Gamma-operators, i.e.

[TABLE]

Proposition 4.1.

Let $F$ be a centered random variable admitting a finite chaos expansion with $\operatorname{Var}(F)=2\nu$ . Let $G(\nu)\sim CenteredGamma(\nu)$ . Then there exists a constant $C>0$ (only depending on $\nu$ ), such that

[TABLE]

where recall that $\mathcal{B}_{1,1}:=\big{\{}h:\mathbb{R}\to\mathbb{R},\,\text{Lipschitz-continuous}\,:\,\|h\|\leq 1,\,\|h^{\prime}\|_{\infty}\leq 1\big{\}}$ .

Proof.

Consider the centered Gamma Stein equation (16). Let $h\in\mathcal{H}_{2}$ be an arbitrary test function (note that $\mathbb{E}|h(G(\nu))|<\infty$ ). Then by using the Malliavin integration by parts formula (8), we get

[TABLE]

Now the claim follows at once by a direct application of Theorem 3.1. ∎

To simplify computations, we continue with the following useful Lemmas.

Lemma 4.2.

Let $g:\mathbb{R}\to\mathbb{R}$ be a Lipschitz-continuous function, where $g$ and $g^{\prime}$ are bounded by a constant only depending on $\nu>0$ . Consider the solution $S(g)$ of the Gamma Stein equation (16) associated to the test functions $g$ . Assume that $F\in\mathbb{D}^{\infty}$ is a centered random variable with variance $\mathbb{E}[F^{2}]=2\nu$ . Then for any $r\in\mathbb{N}$ :

[TABLE]

Proof.

First note that $2\nu=\mathbb{E}[\Gamma_{1}(F)]$ . Thus

[TABLE]

Now, we use the integration-by-parts formula (8) in combination with the chain rule (7) to obtain

[TABLE]

and similarly

[TABLE]

Hence, putting everything together, the result follows. ∎

Lemma 4.3.

Let $g:\mathbb{R}\to\mathbb{R}$ be a Lipschitz-continuous function, where $g$ and $g^{\prime}$ are bounded by a constant only depending on $\nu>0$ . Assume that $S(g)$ and $S\left(S(g)\right)$ stand for the solutions of the Gamma Stein equation (16) associated to the test functions $g$ and $S(g)$ respectively. Let $F\in\mathbb{D}^{\infty}$ be a centered random variable with variance $\mathbb{E}[F^{2}]=2\nu$ . Then the following identities take place.

(a)

[TABLE]

(b)

[TABLE]

Proof.

We apply Lemma 4.2 twice to obtain

[TABLE]

Note that we cannot translate $\mathbb{E}[\Gamma_{3}(F)]$ directly into the fourth cumulant, but instead by Proposition 2.1 part (c), we have $\mathbb{E}[\Gamma_{3}(F)]=\frac{1}{3}\kappa_{4}(F)-\operatorname{Var}(\Gamma_{1}(F))$ . The variance term can be written as

[TABLE]

Putting everything together, the claim follows. ∎

Remark 4.4.

We point out that for both linear cumulant combinations appearing in the right hand sides of parts (a) and (b) in Lemma 4.3 it holds that

[TABLE]

Now, we are ready to state the main result of this section.

Theorem 4.5.

Let $F$ be a centered random variable admitting a finite chaos expansion with $\operatorname{Var}(F)=2\nu$ . Let $G(\nu)\sim CenteredGamma(\nu)$ . Then there exists a constant $C>0$ (only depending on $\nu$ ), such that

[TABLE]

Proof.

Using Proposition 4.1, Theorem 3.8 with $\lambda=2$ , and Proposition 3.9 we obtain that

[TABLE]

where $C$ stands for a general constant depending only on the parameter $\nu$ . Now, we apply Lemma 4.3 item (b) on $\mathbb{E}\left[h(F)\big{(}\operatorname{\overline{\Gamma}}_{1}(F)-2F\big{)}\right]$ , and item (a) on $\mathbb{E}\left[S(h)(F)\big{(}\operatorname{\overline{\Gamma}}_{1}(F)-2F\big{)}\right]$ . Then putting everything together the result follows by applying Cauchy-Schwarz inequality, Theorem 3.1, as well as using the fact that $\kappa_{2}(G(\nu))=\kappa_{2}(F)=2\nu$ , $\kappa_{3}(G(\nu))=8\nu$ and $\kappa_{4}(G(\nu))=48\nu$ , see (13). ∎

Remark 4.6.

The splitting technique implemented in the proof of Theorem 4.5 by using operator theory is vital to obtain an optimal upper bound. In fact, not doing it, instead of estimate (4.5), the best estimate one can achieve (under the assumption in Theorem 4.5) is a similar bound as (4.5) with the quantity $\sqrt{\operatorname{Var}\left(\Gamma_{3}(F)-2\Gamma_{2}(F)\right)}$ instead of

[TABLE]

On the other hand, it is not difficult to see that for a sequence $\{F_{n}=\sum_{1\leq i\leq\nu}c_{i,n}(N^{2}_{i}-1):n\geq 1\}$ in the second Wiener chaos with a finite number of non-zero spectral coefficients such that for every $i=1,\ldots,\nu$ , $c_{i,n}\to 1$ as $n\to\infty$ it holds that

[TABLE]

resulting in a suboptimal rate. See also illustrating Example 4.13 for further clarifications.

4.2 The Upper Bound: Second Wiener Chaos

In the present section, in order to handle the variance quantities of the Gamma operators appearing in the right hand side of estimate (4.5) in terms of cumulants, we consider the case of second Wiener chaos. In this setting, the connection is apparent thanks to Lemma 2.4.

Proposition 4.7.

Let $\nu>0$ , and $F=I_{2}(f)$ be in the second Wiener chaos such that $\mathbb{E}[F^{2}]=2\nu$ . Then, for every $r\geq 1$ , with constant $C=4\nu$ , we have

[TABLE]

In particular, by choosing $r=1$ , we obtain

[TABLE]

Proof.

Let’s prove the first estimate in (33). Then the second estimate could be proven by iteration using similar arguments. Let $r\geq 1$ . Denote by $A_{f}$ the associated Hilbert-Schmidt operator. As in the proof of Lemma 2.4, we can write

[TABLE]

where in the third step, we have used the trace inequality $\operatorname{Tr}(AB)\leq\operatorname{Tr}(A)\,\operatorname{Tr}(B)$ for non-negative operators $A,B\geq 0$ , see [Liu07]. ∎

Remark 4.8.

The estimates in (33) can also deduce from representation (14) together with the classical estimate $(4.4)$ in [BBNP12, Lemma 4.2].

Proposition 4.9.

Let $\nu>0$ , and $F=I_{2}(f)$ in the second Wiener chaos such that $\mathbb{E}[F^{2}]=2\nu$ . Assume $r\geq 1$ . Then there exists a general constant $C$ (possibly depending on the parameters $\nu$ and $r$ ) such that

[TABLE]

In particular, by choosing $r=1$ , we obtain the crucial estimate

[TABLE]

Proof.

For the first estimate, using representation (14) we can write

[TABLE]

where we have used the classical estimate $(4.4)$ in [BBNP12, Lemma 4.2]. The second estimate is a direct application of [Dra16, Corollary 1] with $P=(A^{r+1}_{f}-A^{r}_{f})^{2},C=A^{2}_{f}$ combined with $\operatorname{Var}\left(\Gamma_{r+1}(F)-2\Gamma_{r}(F)\right)=2^{2r+3}\operatorname{Tr}\left((A^{r+2}_{f}-A^{r+1}_{f})^{2}\right)$ for every $r\geq 0$ , see the proof of Lemma 2.4. ∎

4.3 The Lower Bound: Second Wiener Chaos

Proposition 4.10.

Let $\nu>0$ , and $F=I_{2}(f)$ be in the second Wiener chaos such that $\mathbb{E}[F^{2}]=2\nu$ . Then there exists a general constant $C$ (possibly depending on the parameter $\nu$ ) such that

[TABLE]

*where the quantity $\mathbf{M}(F)$ is given by (3). *

Proof.

Fix a real number $\rho>0$ whose range of values will be determined later on. Taking into account the second moment assumption, it is a classical result (see [Luk70, Chapter $7$ ]) that the characteristic functions $\phi_{F}$ and $\phi_{G(\nu)}$ are analytic inside the strip $\Delta_{\nu}:=\{z\in\mathbb{C}:\lvert\operatorname{Im}z\rvert<\frac{1}{2\sqrt{\nu}}\}$ . Moreover, in the strip of regularity $\Delta_{\nu}$ , they follow the integral representations

[TABLE]

where $\mu$ and $\mu_{\nu}$ stand for the probability measures of $F$ and $G(\nu)$ respectively. Recall that all elements in the second Wiener chaos have exponential moments, see [NP12a, Proposition $2.7.13$ , item (iii)]. Denote by $\Omega_{\rho,\nu}$ the domain

[TABLE]

Then for any $z\in\Omega_{\rho,\nu}$ , together with a Fubini’s argument, we have that

[TABLE]

Hence $\lvert\phi_{F}(z)-\phi_{G(\nu)}(z)\rvert\leq_{C_{\rho}}d_{2}(F,G(\nu))$ for every $z\in\Omega_{\rho,\nu}$ . Let $R>0$ such that the disk $D_{R}\subset\mathbb{C}$ with the origin as center and radius $R$ is contained in the domain $\Omega_{\rho,\nu}$ (note that $R$ depends only on $\nu$ , since $\rho$ is a free parameter. For example, one can choose $\min\{(2\sqrt{\nu})^{-1},e^{-1}\}<\rho<2\min\{(2\sqrt{\nu})^{-1},e^{-1}\}$ ). Now for any $z\in D_{R}$ , and using the fact that

[TABLE]

one can readily conclude that the function $\phi_{G(\nu)}(z)$ is bounded away from [math] on the disk $D_{R}$ . Also, for any $r\geq 2$ ,

[TABLE]

Therefore, for any $z\in D_{R}$ ,

[TABLE]

Hence the function $\phi_{F}(z)$ is also bounded away from [math] on the disk $D_{R}$ . Also, relation (36) implies that the following power series (complex variable) converge to some analytic function as soon as $\lvert z\rvert<R$ ;

[TABLE]

Thus we come to the conclusion that the functions $\phi_{G(\nu)}(z)$ and $\phi_{F}(z)$ are analytic on the disk $D_{R}$ . Moreover, there exists a constant $c>0$ such that $\lvert\phi_{G(\nu)}(z)\rvert,\lvert\phi_{F}(z)\rvert\geq c>0$ for every $z\in D_{R}$ . This implies that on the disk $D_{R}$ there exist two analytic functions $g$ and $g_{\nu}$ such that

[TABLE]

i.e. $g(z)=\log(\phi_{F}(z))$ and $g_{\nu}(z)=\log(\phi_{G(\nu)}(z))$ , for $z\in D_{R}$ . In fact, the functions $g$ and $g_{\nu}$ are given by the power series (37). Since the derivative of the analytic branch of the complex logarithm is $(\log z)^{\prime}=\frac{1}{z}$ (see [Con95, Corollary $2.21$ ]), one can infer that for some constant $C$ whose value may differ from line to line and for every $z\in D_{R}$ , we have

[TABLE]

Now, using Cauchy’s estimate for the coefficients of analytic functions, for any $r\geq 3$ , we obtain that

[TABLE]

Therefore, $\max\Big{\{}\Big{\lvert}\kappa_{3}(F)-\kappa_{3}(G(\nu))\Big{\rvert},\Big{\lvert}\kappa_{4}(F)-\kappa_{4}(G(\nu))\Big{\rvert}\Big{\}}\leq_{C}d_{2}(F,G(\nu))$ .

∎

4.4 Main Result: Non Asymptotic Optimal Gamma Approximation

Now we are ready to present a non asymptotic optimal Gamma approximation in full generality on the second Wiener chaos in terms of the maximum of the third and fourth cumulants. The following result provides an analogous counterpart to the same phenomenon in the case of normal approximation, see [NP15, Theorem 1.2] or Theorem 1.1 item (b).

Theorem 4.11.

Let $\nu>0$ , and $G(\nu)\sim CenteredGamma(\nu)$ . Assume that $F=I_{2}(f)$ belongs to the second Wiener chaos such that $\mathbb{E}[F^{2}]=2\nu$ . Then there exist two general constants $0<C_{1}<C_{2}$ (possibly depending on the parameter $\nu$ ) such that

[TABLE]

Recall that

[TABLE]

Proof.

For the upper bound combine Theorem 4.5 with Proposition 4.7 estimate (34), Proposition 4.9 estimate (35) as well as Lemma 2.4 with $r=1$ . The lower bound directly follows from Proposition 4.10. ∎

Remark 4.12.

In this remark we shortly comment on a natural thought relating to the generalization of the optimal rate (38) to higher order Wiener chaoses. In addition a complete lack of any non-artificial example of a sequence of random variables in a fixed Wiener chaos of order $q\geq 3$ converging towards the $G(\nu)$ distribution, our investigations imply that such an extension would come at the cost of very complicated computations involving norms of contraction operators to verify estimate (35) (possibly with a different constant). Furthermore, our method to achieve the optimal lower bound, relying on complex analysis, cannot be used anymore in higher order chaoses, and hence one requires the introduction of new ideas.

4.5 Examples

We start with the following naive example that illustrates the essential role of our operator theory technique to achieve the optimal rate. It is worth mentioning that all the rates achieved in the forthcoming examples are better (by a square power) than those that can be obtained by the Malliavin-Stein bound [NP09b, Theorem 1.5]. In the following, when $(a_{n})_{n\geqslant 1}$ and $(b_{n})_{n\geqslant 1}$ are two non-negative real number sequences, we write $a_{n}\approx_{C}b_{n}$ if $\lim_{n\to\infty}\frac{a_{n}}{b_{n}}=C$ , for some constant $C>0$ .

Example 4.13.

Let $N_{1},N_{2}\sim\mathscr{N}(0,1)$ be independent. Consider the sequence

[TABLE]

First note that $\mathbb{E}[F^{2}_{n}]=4$ for every $n\in\mathbb{N}$ . Also, using Proposition 2.2 item $2$ , and relation (13), simple computations yield that $\kappa_{4}(F_{n})-\kappa_{4}(G(2))=48\,\frac{2}{n^{2}}\approx_{C}\frac{1}{n^{2}}$ . Similarly $\kappa_{3}(F_{n})-\kappa_{3}(G(2))=8\sum_{j=1}^{2}\left(c_{j,n}^{3}-1\right)\approx_{C}\frac{1}{n^{2}}$ . Therefore, our main Theorem 4.11 implies

[TABLE]

The following important remarks are in order. (a) This example represents a typical scenario, in which, in order to obtain the optimal upper bound, one needs to join together two Gamma quantities $\Gamma_{3}(F_{n})-2\Gamma_{2}(F_{n})$ and $\Gamma_{2}(F_{n})-2\Gamma_{1}(F_{n})$ . In fact, it is not difficult, using Lemma 2.4, to see that

[TABLE]

And now consider Remark 4.6. (b) It is classical that the density function $f_{n}$ of the random variable $F_{n}$ admits the following explicit representation in terms of confluent hypergeometric functions,

[TABLE]

Also recall that the density of the target $G(2)$ is given by $f_{\nu}(x)=\frac{1}{2}e^{-\frac{x}{2}-1}\mathds{1}_{\{x>-2\}}(x)$ . Using rather long and tedious computations, one can show that the optimal estimate (39) continues to hold in the stronger distance of total variation, namely that

[TABLE]

Example 4.14.

(U-statistics) In this example, we consider a second order U-statistic with degeneracy order $1$ inspired by [AAPS17, section 3.1]. The reader may consult the excellent textbook [Ser80] for a general asymptotic theory of $U$ -statistics. Let $\{h_{i}\}_{i\geqslant 1}$ be an orthonormal basis of $\mathfrak{H}$ and for $i\geqslant 1$ set $Z_{i}:=I_{1}(h_{i})$ . Consider

[TABLE]

Then $nU_{n}\stackrel{{\scriptstyle\mathcal{D}}}{{\to}}G(1)$ as $n\to\infty$ with parameter $\nu=1$ . Furthermore to fix the variance to $2\nu=2$ , define

[TABLE]

We consider the associated Hilbert-Schmidt operator $A_{f_{n}}g=f_{n}\mathbin{\otimes_{1}}g$ . Using the fact that $(h_{i}\mathbin{\otimes}h_{j})\mathbin{\otimes_{1}}h_{k}=\langle h_{i},h_{k}\rangle_{\mathfrak{H}}\,h_{j}$ we can explicitly compute the non-zero eigenvalues $c_{1,n},\ldots,c_{n,n}$ of $A_{f_{n}}$ . They are

[TABLE]

Therefore, as $n\to\infty$ , gathering Proposition 2.2 item $2$ , relation (40) and Theorem 4.11 we get that

[TABLE]

In the next example we consider the important problem of the asymptotic behavior of the least squares estimators in the autoregressive models in the nearly non-stationary regime, where the target distribution $G(\nu)$ shows up. For more details on this fascinating subject, we refer the reader to [CW87, CW88, Whi58, Rao78, BC13, LLQM11] and references therein when the noise is a martingale difference, and [BC07] when the innovation process exhibits long-range dependence. We also refer to [GT05, Proposition 2] for a study of optimal rates in a general context of quadratic forms.

Example 4.15.

(Least square estimator in nearly non stationary $AR(1)$ model) Let $n\in\mathbb{N}$ . Let $\beta_{n}:=1-\frac{\beta}{n}$ . We consider the first order autoregressive process $X_{t}(n)=\beta_{n}X_{t-1}(n)+Z_{t}$ , where $t=1,\ldots,n$ , $X_{0}(n)=0$ for all $n$ and $(Z_{i})$ is a white noise, i.e. a sequence of i.i.d. $\mathscr{N}(0,1)$ random variables. It is classical that the least squares estimator of the unknown parameter $\beta_{n}$ , based on discrete observations $X_{1}(n),\ldots,X_{n}(n)$ , is given by

[TABLE]

Define

[TABLE]

Then [CW87, Theorem 1] implies that as $n\to\infty$ :

[TABLE]

where $B=(B_{t})_{t\in[0,1]}$ is a standard Brownian motion. In particular when $\beta=0$ , we observe that $W_{\infty}:=W^{\beta=0}_{\infty}=G(1)$ (equality in law), and hence we obtain that $W_{n}:=W^{\beta=0}_{n}\stackrel{{\scriptstyle\mathcal{D}}}{{\longrightarrow}}G(1)$ . Now, apply Example 4.14 to deduce that $d_{2}\big{(}W_{n},G(1)\big{)}\approx_{C}\frac{1}{n}$ .

Example 4.16.

(Least square estimator in $AR(2)$ model) In this example, we consider the second order autoregressive $AR(2)$ model:

[TABLE]

where $(Z_{k})$ is a white noise, and $X_{0}=X_{-1}=0$ . Further, assume that the roots of the associated characteristic polynomial $1-\beta_{1}z-\beta_{2}z^{2}$ are $e^{i\theta}$ and $e^{-i\theta}$ , and lie on the unit disk. Under this condition it is easy to see that $\beta_{1}=2\cos\theta$ and $\beta_{2}=-1$ . The least square estimator $\widehat{\boldsymbol{\beta}}_{n}=(\widehat{\beta}_{1,n},\widehat{\beta}_{2,n})^{\prime}$ of the parameter $\boldsymbol{\beta}=(\beta_{1},\beta_{2})^{\prime}=(2\cos\theta,-1)^{\prime}$ for $n\geq 2$ is given by

[TABLE]

In [CW88], the asymptotic behavior of $n(\widehat{\boldsymbol{\beta}}_{n}-\boldsymbol{\beta})=\mathbf{A}^{-1}_{n}\boldsymbol{b}_{n}$ has been derived where

[TABLE]

Following [CW88, Corollary 3.3.8], as $n\to\infty$ , one can deduce that

[TABLE]

Note that the sequence $(W^{\theta}_{n}:n\geq 1)$ belongs to the second Wiener chaos. An interesting feature of the previous limit theorem is that although the sequence does depend on the parameter $\theta$ in the model, the target distribution is independent of $\theta$ . On the other hand, relation (41) together with the assumption $(\beta_{1},\beta_{2})=(2\cos\theta,-1)$ yields that

[TABLE]

Therefore,

[TABLE]

By elementary combinatorics, we have for any function $f:\mathbb{N}\to\mathbb{R}$ that $\sum_{i=2}^{n}\sum_{j=1}^{i-1}f(i-j)=\sum_{k=1}^{n-1}(n-k)f(k)$ . Using this, and evaluating the sums of sine functions (which are just geometric sums after writing them in terms of complex exponentials), we get

[TABLE]

Note that $\big{|}\kappa_{2}(W^{\theta}_{n})-4\big{|}\approx_{C}1/n$ as $n\to\infty$ . Now we scale $W_{n}^{\theta}$ so that it has variance equal to $4$ for every $n\in\mathbb{N}$ . Set $\sigma_{n}:=\sqrt{\operatorname{Var}(W_{n}^{\theta})}$ , and let $\widetilde{W}_{n}^{\theta}:=\frac{2}{\sigma_{n}}W_{n}^{\theta}$ . Using (12), and after some tedious computations, we get that

[TABLE]

Using that $\sigma_{n}^{3}\to 8$ as $n\to\infty$ , we see that $\lim_{n\to\infty}\kappa_{3}(\widetilde{W}_{n}^{\theta})=16=8\nu$ (note that $\nu=2$ ), and furthermore,

[TABLE]

Similar computations yield that $\lvert\kappa_{4}(\widetilde{W}_{n}^{\theta})-\kappa_{4}(G(2))\rvert\approx_{C}1/n$ . Therefore, Theorem 4.11 can be applied to deduce that $d_{2}(\widetilde{W}_{n}^{\theta},G(2))\approx_{C}1/n$ .

Example 4.17.

(Quadratic forms [dWV73] and [AAPS17, section 3.2]) In this example, we consider a general quadratic form in independent standard normal random variables

[TABLE]

where $C_{n}=(c_{n}(i,j))_{1\leq i,j\leq n}$ is an $n\times n$ symmetric matrix, and $(Z_{i})$ is a sequence of i.i.d standard normal random variables. Let $\nu>0$ be an integer number. Now, we make the following assumptions:

(a)

The second moment assumption: $\sum_{1\leq i,j\leq n}c_{n}(i,j)^{2}=\nu,\quad\forall n\in\mathbb{N}$ .

(b)

There exists a sequence $\{b_{n}^{m}(i):n,i=1,2,\ldots,m=1,2,\ldots,\nu\}$ of real numbers such that as $n\to\infty$ :

[TABLE]

(c)

For every $1\leq m\leq\nu$ , as $n\to\infty$ it holds that: $\sum_{1\leq i,j\leq n}c_{n}(i,j)b_{n}^{m}(i)b_{n}^{m}(j)\rightarrow 1$ .

Now a direct application of [dWV73, Theorem 2] implies that $W_{n}:=F_{n}-\mathbb{E}[F_{n}]\stackrel{{\scriptstyle\mathcal{D}}}{{\rightarrow}}G(\nu)$ . Note that $\mathbb{E}[W^{2}_{n}]=2\nu$ for every $n\in\mathbb{N}$ relying on condition (a). Moreover, one can write $W_{n}=I_{2}(\sum_{1\leq i,j\leq n}c_{n}(i,j)h_{i}\mathbin{\widetilde{\otimes}}h_{j})$ , where $\{h_{i}\}_{i\geqslant 1}$ stands for an orthonormal basis of $\mathfrak{H}$ , and for $i\geqslant 1$ ,as before, we set $Z_{i}:=I_{1}(h_{i})$ . Therefore our main Theorem 4.11 entails that

[TABLE]

Depending on the particular choice of the matrix $C_{n}$ in the original quadratic form $F_{n}$ , we can provide explicit rates (in terms of suitable powers of $n$ ) in the asymptotic relation (43). For example, following [dWV73, remark after Theorem 2] and [AAPS17, Corollary 3.2], assume that $\{e_{m}:m=1,\ldots,\nu\}$ is a sequence of distinct orthonormal functions in $L^{2}[0,1]$ such that $e_{m}\in C^{\alpha}([0,1])$ for some $\alpha\in(0,1]$ . Here $C^{\alpha}([0,1])$ denotes the space of all Hölder continuous functions with Hölder exponent $\alpha$ . Consider the square integrable kernel $K_{\nu}$ defined as

[TABLE]

Finally, for $n\in\mathbb{N}$ and $1\leq i,j\leq n$ we set

[TABLE]

Now consider the sequence $W_{n}=F_{n}-\mathbb{E}[F_{n}]$ associated to the symmetric matrix $C_{n}=(c_{n}(i,j))$ belonging to the second Wiener chaos. Then, it is straightforward to check that the conditions (a)-(b)-(c) are in order with $b^{m}_{n}(i)=\frac{e_{m}(i/n)}{\sqrt{n}}$ . On the other hand, it has been shown [AAPS17, Corollary 3.2] that:

[TABLE]

Putting together the asymptotic estimates (43) and (44), we obtain the optimal rate $d_{2}(W_{n},G(\nu))\approx_{C}n^{-\alpha}$ . Also, the example presented on page $107$ in [NP09b] can be treated in this framework, and resulting in an improved optimal rate of $1/n$ .

5 Appendix

The following lemma provides an explicit representation of the new Gamma operators used in this paper in terms of contractions. Recall that these are not the same as e.g. in [NP10], but rather the new ones introduced in (9).

Lemma 5.1.

For $q\geqslant 1$ , lets $F=I_{q}(f)$ , for some $f\in\mathfrak{H}^{\odot q}$ be an element of the $q$ -th Wiener chaos. Then

[TABLE]

where the constants $c_{q}(r_{1},\cdots,r_{s})$ are recursively defined via $c_{q}(r)=q\,(r-1)!\,\binom{q-1}{r-1}^{2}$ , and for $s\geq 2$ ,

[TABLE]

Proof.

It follows by induction on $s$ and similar lines of arguments as in [NP10, Proof of Theorem 5.1].

∎

Proof of Proposition 2.1.

Part (a) is clear from the definition. Part (b) for $j=1$ is also trivial. For $j=2$ , we use the fact that $\Gamma_{1}=\Gamma_{alt,1}$ , as well as the integration by parts formula (8), to get

[TABLE]

For part (c), consider

[TABLE]

For part (d), we consider the representation of $\Gamma_{alt,s}$ given in equation (5.25) of [NP10]. The representation is exactly the same as for $\Gamma_{s}$ (Lemma 5.1), except for the recursive formula of the constants $c_{q}$ . For $\Gamma_{alt,j}$ they are given by $c_{alt,q}(r)=c_{q}(r)=q(r-1)!\binom{q-1}{r-1}^{2}$ , and for $s\geq 2$ ,

[TABLE]

Comparing this with our formula (46), we see that only the first factor is different, namely $q$ instead of $(sq-2r_{1}-\ldots-2r_{s-1})$ . But now for $q=2$ , the indicator $\mathds{1}_{\{r_{1}+\cdots+r_{s-1}<\frac{sq}{2}\}}$ dictates that $r_{1}=\ldots=r_{s-1}=1$ . Hence $q=2=2s-2r_{1}-\ldots-2r_{s-1}$ . Therefore, the two notions of Gamma operators coincide when $q=2$ . ∎

Acknowledgments

The authors would like to thank Simon Campese for pointing out a mistake in the proof of Theorem 4.5.

Bibliography48

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AAPS 17] B. Arras, E. Azmoodeh, G. Poly, and Y. Swan. A bound on the 2-Wasserstein distance between linear combinations of independent random variables. 2017, ar Xiv:1704.01376 v 2. To appear in Stochastic processes and their Applications .
2[ACP 14] E. Azmoodeh, S. Campese, and G. Poly. Fourth Moment Theorems for Markov diffusion generators. J. Funct. Anal. , 266(4):2341–2359, 2014.
3[AEK 18] E. Azmoodeh, P. Eichelsbacher, and L. Knichel. On the Rate of Convergence to a Gamma Distribution on Wiener Space, 2018, ar Xiv:1806.03878 v 2.
4[AMMP 16] E. Azmoodeh, D. Malicet, G. Mijoule, and G. Poly. Generalization of the Nualart-Peccati criterion. Ann. Probab. , 44(2):924–954, 2016.
5[AMPS 17] B. Arras, G. Mijoule, G. Poly, and Y. Swan. A new approach to the Stein-Tikhomirov method: with applications to the second Wiener chaos and Dickman convergence, 2017, ar Xiv:1605.06819 v 2.
6[APP 15] E. Azmoodeh, G. Peccati, and G. Poly. Convergence towards linear combinations of chi-squared random variables: a Malliavin-based approach. In In memoriam Marc Yor—Séminaire de Probabilités XLVII , volume 2137 of Lecture Notes in Math. , pages 339–367. Springer, Cham, 2015.
7[AS 17] B. Arras and Y. Swan. A stroll along the gamma. Stochastic Process. Appl. , 127(11):3661–3688, 2017.
8[BBNP 12] H. Biermé, A. Bonami, I. Nourdin, and G. Peccati. Optimal Berry-Esseen rates on the Wiener space: the barrier of third and fourth cumulants. ALEA Lat. Am. J. Probab. Math. Stat. , 9(2):473–500, 2012.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Optimal Gamma Approximation on Wiener Space

Abstract

Contents

1 Introduction and Main Result

Theorem 1.1** ((Optimal) fourth moment theorem** [NP05, NP09b, NP15]).

Theorem 1.2**.**

Theorem 1.3** (Non asymptotic optimal Gamma approximation).**

Remark 1.4**.**

2 Preliminaries: Gaussian Analysis and Malliavin Calculus

2.1 Isonormal Gaussian Processes and Wiener Chaos

2.2 The Malliavin Operators

2.3 Gamma Operators and Cumulants

Proposition 2.1**.**

2.4 Useful facts on Second Wiener Chaos

Proposition 2.2**.**

Lemma 2.3**.**

Proof.

Lemma 2.4**.**

Proof.

3 Stein’s Method for the centered Gamma distribution

Theorem 3.1**.**

3.1 Explicit Formula for the Solution of the Stein Equation

Lemma 3.2**.**

Proof.

3.2 An Operator Theory Approach

Lemma 3.3**.**

Proof.

Lemma 3.4**.**

Proof.

Proposition 3.5**.**

Proof.

Lemma 3.6**.**

Proof.

Proposition 3.7**.**

Proof.

Theorem 3.8**.**

Proof.

Proposition 3.9**.**

Proof.

4 Optimal Gamma Approximation

4.1 A General Stein-Malliavin Upper Bound

Proposition 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Lemma 4.3**.**

Proof.

Remark 4.4**.**

Theorem 4.5**.**

Proof.

Remark 4.6**.**

4.2 The Upper Bound: Second Wiener Chaos

Proposition 4.7**.**

Proof.

Remark 4.8**.**

Proposition 4.9**.**

Proof.

4.3 The Lower Bound: Second Wiener Chaos

Proposition 4.10**.**

Proof.

4.4 Main Result: Non Asymptotic Optimal Gamma Approximation

Theorem 4.11**.**

Proof.

Remark 4.12**.**

4.5 Examples

Example 4.13**.**

Example 4.14**.**

Example 4.15**.**

Example 4.16**.**

Example 4.17**.**

5 Appendix

Lemma 5.1**.**

Proof.

Proof of Proposition 2.1.

Theorem 1.1 ((Optimal) fourth moment theorem [NP05, NP09b, NP15]).

Theorem 1.2.

Theorem 1.3 (Non asymptotic optimal Gamma approximation).

Remark 1.4.

Proposition 2.1.

Proposition 2.2.

Lemma 2.3.

Lemma 2.4.

Theorem 3.1.

Lemma 3.2.

Lemma 3.3.

Lemma 3.4.

Proposition 3.5.

Lemma 3.6.

Proposition 3.7.

Theorem 3.8.

Proposition 3.9.

Proposition 4.1.

Lemma 4.2.

Lemma 4.3.

Remark 4.4.

Theorem 4.5.

Remark 4.6.

Proposition 4.7.

Remark 4.8.

Proposition 4.9.

Proposition 4.10.

Theorem 4.11.

Remark 4.12.

Example 4.13.

Example 4.14.

Example 4.15.

Example 4.16.

Example 4.17.

Lemma 5.1.