On fractional regularity of distributions of functions in Gaussian   random variables

Egor Kosov

arXiv:1812.02416·math.PR·January 1, 2020

On fractional regularity of distributions of functions in Gaussian random variables

Egor Kosov

PDF

TL;DR

This paper investigates the fractional smoothness of measures derived from Gaussian distributions through Sobolev class mappings, establishing fractional regularity results under weak nondegeneracy conditions.

Contribution

It introduces new results on the fractional regularity of Gaussian measure images, extending understanding of their smoothness properties under weak assumptions.

Findings

01

Established Nikolskii--Besov fractional regularity for Gaussian measure images.

02

Derived fractional smoothness results under weak nondegeneracy conditions.

03

Extended the theory of measure regularity in Gaussian spaces.

Abstract

We study fractional smoothness of measures on $R^{k}$ , that are images of a Gaussian measure under mappings from Gaussian Sobolev classes. As a consequence we obtain Nikolskii--Besov fractional regularity of these distributions under some weak nondegeneracy assumption.

Equations214

δ (ε) := n sup γ (Δ_{f_{n}} \leq ε)

δ (ε) := n sup γ (Δ_{f_{n}} \leq ε)

n sup ∥ f_{n} ∥_{W^{4 k, 2} (γ)} = a < \infty and ε \to 0 lim δ (ε) = 0.

n sup ∥ f_{n} ∥_{W^{4 k, 2} (γ)} = a < \infty and ε \to 0 lim δ (ε) = 0.

\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm TV}\leq C(k,a)\Bigl{(}\bigl{[}\delta\bigl{(}\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/8}\bigr{)}\bigr{]}^{1/(4k)}+\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/(32k)}\Bigr{)},

\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm TV}\leq C(k,a)\Bigl{(}\bigl{[}\delta\bigl{(}\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/8}\bigr{)}\bigr{]}^{1/(4k)}+\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/(32k)}\Bigr{)},

\int φ^{(n)} (f) d γ \leq C_{n} t sup ∣ φ (t) ∣, \forall φ \in C_{0}^{\infty} (R)

\int φ^{(n)} (f) d γ \leq C_{n} t sup ∣ φ (t) ∣, \forall φ \in C_{0}^{\infty} (R)

\sigma(\mu,t):=\sup\Bigl{\{}\int\partial_{e}\varphi\,d\mu:\,\|\varphi\|_{\infty}\leq t,\,\|\partial_{e}\varphi\|_{\infty}\leq 1\Bigr{\}},

\sigma(\mu,t):=\sup\Bigl{\{}\int\partial_{e}\varphi\,d\mu:\,\|\varphi\|_{\infty}\leq t,\,\|\partial_{e}\varphi\|_{\infty}\leq 1\Bigr{\}},

\mu\circ f^{-1}(A)=\mu\bigl{(}f^{-1}(A)\bigr{)}.

\mu\circ f^{-1}(A)=\mu\bigl{(}f^{-1}(A)\bigr{)}.

μ_{h} (A) = μ (A - h) for every Borel set A .

μ_{h} (A) = μ (A - h) for every Borel set A .

\|\mu\|_{\rm TV}:=\sup\biggl{\{}\int\varphi\,d\mu,\ \varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\varphi\|_{\infty}\leq 1\biggr{\}},

\|\mu\|_{\rm TV}:=\sup\biggl{\{}\int\varphi\,d\mu,\ \varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\varphi\|_{\infty}\leq 1\biggr{\}},

∥ φ ∥_{\infty} := x \in R^{k} sup ∣ φ (x) ∣.

∥ φ ∥_{\infty} := x \in R^{k} sup ∣ φ (x) ∣.

\|\mu\|_{\rm KR}:=\sup\biggl{\{}\int\varphi\,d\mu:\varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\varphi\|_{\infty}\leq 1,\ \|\nabla\varphi\|_{\infty}\leq 1\biggr{\}}.

\|\mu\|_{\rm KR}:=\sup\biggl{\{}\int\varphi\,d\mu:\varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\varphi\|_{\infty}\leq 1,\ \|\nabla\varphi\|_{\infty}\leq 1\biggr{\}}.

\|\mu\|_{\rm K}:=\sup\Bigl{\{}\int\varphi\,d\mu,\ \varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\nabla\varphi\|_{\infty}\leq 1\Bigr{\}}.

\|\mu\|_{\rm K}:=\sup\Bigl{\{}\int\varphi\,d\mu,\ \varphi\in C_{0}^{\infty}(\mathbb{R}^{k}),\ \|\nabla\varphi\|_{\infty}\leq 1\Bigr{\}}.

\int_{R^{k}} ∣ ρ (x + h) - ρ (x) ∣ d x \leq C ∣ h ∣^{α} .

\int_{R^{k}} ∣ ρ (x + h) - ρ (x) ∣ d x \leq C ∣ h ∣^{α} .

∥ μ_{h} - μ ∥_{TV} \leq C ∣ h ∣^{α} .

∥ μ_{h} - μ ∥_{TV} \leq C ∣ h ∣^{α} .

|h|_{H}=\sup\biggl{\{}\ell(h)\colon\,\int_{E}\ell^{2}\,d\gamma\leq 1,\ \ell\in E^{*}\biggr{\}}.

|h|_{H}=\sup\biggl{\{}\ell(h)\colon\,\int_{E}\ell^{2}\,d\gamma\leq 1,\ \ell\in E^{*}\biggr{\}}.

\|f\|_{p}:=\|f\|_{L^{p}(\gamma)}:=\biggl{(}\int|f(x)|^{p}\gamma(dx)\biggr{)}^{1/p},\quad p\in[1,\infty).

\|f\|_{p}:=\|f\|_{L^{p}(\gamma)}:=\biggl{(}\int|f(x)|^{p}\gamma(dx)\biggr{)}^{1/p},\quad p\in[1,\infty).

D^{1} φ (x) = \nabla φ (x) = j = 1 \sum n (\partial_{j} ψ) (ℓ_{1} (x), \dots, ℓ_{n} (x)) e_{j},

D^{1} φ (x) = \nabla φ (x) = j = 1 \sum n (\partial_{j} ψ) (ℓ_{1} (x), \dots, ℓ_{n} (x)) e_{j},

\bigl{(}D^{2}\varphi\bigr{)}_{i,j}(x)=(\partial_{i}\partial_{j}\psi)(\ell_{1}(x),\ldots,\ell_{n}(x)).

\bigl{(}D^{2}\varphi\bigr{)}_{i,j}(x)=(\partial_{i}\partial_{j}\psi)(\ell_{1}(x),\ldots,\ell_{n}(x)).

∥ φ ∥_{W^{p, m} (γ)} := ∥ φ ∥_{p} + i = 1 \sum m ∥ D^{i} φ ∥_{p},

∥ φ ∥_{W^{p, m} (γ)} := ∥ φ ∥_{p} + i = 1 \sum m ∥ D^{i} φ ∥_{p},

L φ (x) = Δ φ (x) - ⟨ x, \nabla φ (x)⟩

L φ (x) = Δ φ (x) - ⟨ x, \nabla φ (x)⟩

∥ L φ ∥_{L^{p} (γ_{n})} \leq c_{1} (p) ∥ φ ∥_{W^{p, 2} (γ_{n})}

∥ L φ ∥_{L^{p} (γ_{n})} \leq c_{1} (p) ∥ φ ∥_{W^{p, 2} (γ_{n})}

M_{f} (x) = (m_{i, j} (x))_{i, j \leq k}, m_{i, j} (x) := ⟨ \nabla f_{i} (x), \nabla f_{j} (x) ⟩_{H} .

M_{f} (x) = (m_{i, j} (x))_{i, j \leq k}, m_{i, j} (x) := ⟨ \nabla f_{i} (x), \nabla f_{j} (x) ⟩_{H} .

A_{f} := {a_{i, j}}

A_{f} := {a_{i, j}}

Δ_{f} := det M_{f} .

Δ_{f} := det M_{f} .

Δ_{f} \cdot M_{f}^{- 1} = A_{f} .

Δ_{f} \cdot M_{f}^{- 1} = A_{f} .

u_{\gamma}(g,\varepsilon):=\int_{0}^{\infty}(s+1)^{-2}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds.

u_{\gamma}(g,\varepsilon):=\int_{0}^{\infty}(s+1)^{-2}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds.

\int (g + ε)^{- r} d γ \leq r ε^{- r} u_{γ} (g, ε) .

\int (g + ε)^{- r} d γ \leq r ε^{- r} u_{γ} (g, ε) .

\int(g+\varepsilon)^{-r}\,d\gamma=r\int_{0}^{\varepsilon^{-1}}t^{r-1}\gamma\bigl{(}(g+\varepsilon)^{-1}\geq t\bigr{)}\,dt\\ =r\int_{0}^{\infty}(s+\varepsilon)^{-r-1}\gamma\bigl{(}g\leq s\bigr{)}\,ds\leq r\varepsilon^{-r}\int_{0}^{\infty}(s+1)^{-r-1}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds\\ \leq r\varepsilon^{-r}\int_{0}^{\infty}(s+1)^{-2}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds.

\int(g+\varepsilon)^{-r}\,d\gamma=r\int_{0}^{\varepsilon^{-1}}t^{r-1}\gamma\bigl{(}(g+\varepsilon)^{-1}\geq t\bigr{)}\,dt\\ =r\int_{0}^{\infty}(s+\varepsilon)^{-r-1}\gamma\bigl{(}g\leq s\bigr{)}\,ds\leq r\varepsilon^{-r}\int_{0}^{\infty}(s+1)^{-r-1}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds\\ \leq r\varepsilon^{-r}\int_{0}^{\infty}(s+1)^{-2}\gamma\bigl{(}g\leq\varepsilon s\bigr{)}\,ds.

\sigma(\mu,t):=\sup\Bigl{\{}\int\partial_{e}\varphi d\mu:\,\|\varphi\|_{\infty}\leq t,\,\|\partial_{e}\varphi\|_{\infty}\leq 1\Bigr{\}},

\sigma(\mu,t):=\sup\Bigl{\{}\int\partial_{e}\varphi d\mu:\,\|\varphi\|_{\infty}\leq t,\,\|\partial_{e}\varphi\|_{\infty}\leq 1\Bigr{\}},

∥ μ_{h} - μ ∥_{TV} \leq 2 σ (μ, ∣ h ∣/2), σ (μ, t) \leq 6 k ∣ h ∣ \leq t sup ∥ μ_{h} - μ ∥_{TV} .

∥ μ_{h} - μ ∥_{TV} \leq 2 σ (μ, ∣ h ∣/2), σ (μ, t) \leq 6 k ∣ h ∣ \leq t sup ∥ μ_{h} - μ ∥_{TV} .

∥ μ - ν ∥_{TV} \leq 3 k σ (μ - ν, ε) + k ε^{- 1} ∥ μ - ν ∥_{KR} .

∥ μ - ν ∥_{TV} \leq 3 k σ (μ - ν, ε) + k ε^{- 1} ∥ μ - ν ∥_{KR} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On fractional regularity of

distributions of functions in Gaussian random variables

Egor D. Kosov

Abstract.

We study fractional smoothness of measures on $\mathbb{R}^{k}$ , that are images of a Gaussian measure under mappings from Gaussian Sobolev classes. As a consequence we obtain Nikolskii–Besov fractional regularity of these distributions under some weak nondegeneracy assumption.

Keywords: Gaussian measure, distribution, Nikolskii–Besov space, total variation distance, Kantorovich norm

AMS Subject Classification: 60E05, 60E15, 28C20, 60F99

Introduction

Let $\gamma$ be a Gaussian measure on a locally convex space $E$ and $f\colon E\to\mathbb{R}^{k}$ be a polynomial mapping. It was shown in [5] and [12] that the density of the image measure $\gamma\circ f^{-1}$ belongs to a certain Nikolskii–Besov class. Here we consider a general Sobolev mapping $f\in W^{p,2}(\gamma)$ and provide an estimate of the total variation norm $\|(\gamma\circ f^{-1})_{h}-\gamma\circ f^{-1}\|_{\rm TV}$ in terms of the behavior of $\gamma(\Delta_{f}\leq t)$ (see Theorems 3.2, 4.2 and Corollaries 3.3, 4.3), where $\mu_{h}(A):=\mu(A-h)$ is the shift of the measure $\mu$ to the vector $h$ , and where $\Delta_{f}$ is the determinant of the Malliavin matrix $M_{f}$ of the mapping $f$ (all the necessary definitions are given in the first section). This result provides a quantitative estimate of smoothness of $\gamma\circ f^{-1}$ and complements the classical theorem (see [4, Theorem 9.2.4]) which asserts that such a distribution possesses a density with respect to the standard Lebesgue measure if $\Delta_{f}(x)\neq 0$ for $\gamma$ -almost every point $x$ . However, it should be mentioned that in this classical result only the inclusion of $f$ to the first Sobolev class is assumed. We also note that in [1, Theorem 2.11] the lower semi-continuity of densities of such distributions was established.

The obtained results also provide a quantitative estimate in the following qualitative theorem (see [8] and [5], which generalizes [1, Theorem 2.14]). Let $f_{n}=(f_{n,1},\ldots,f_{n,k})\colon\,E\to\mathbb{R}^{k}$ be a sequence of functions such that $f_{n,i}\in W^{4k,2}(\gamma)$ . Set

[TABLE]

and assume that

[TABLE]

If the sequence of measures $\gamma\circ f_{n}^{-1}$ converges in distribution, it also converges in variation. Corollary 4.4 of the present paper asserts that under the same assumptions one has

[TABLE]

where $\nu$ is the limiting distribution and $\|\cdot\|_{\rm KR}$ is the Kantorovich–Rubinstein norm, which metrizes weak convergence of probability measures. A similar bound is also valid for mappings from $W^{p,2}(\gamma)$ for any $p>4k-1$ , which is also an improvement of the above result.

The approach in this work is similar to the classical Malliavin method developed in [14] (see also [4]). The main idea of the method is to obtain bounds of the form

[TABLE]

which yields that the density of $\gamma\circ f^{-1}$ is infinitely differentiable. In works [5], [12], the Malliavin condition was modified to treat the case of Nikolskii–Besov fractional smoothness of distributions. In this work we similarly employ the results of [13] which estimate the quantity $\|\mu_{h}-\mu\|_{\rm TV}$ in terms of the function

[TABLE]

where the supremum is taken over all functions $\varphi\in C_{0}^{\infty}(\mathbb{R}^{k})$ and unit vectors $e$ .

To apply the classical Malliavin method one should assume some nondegeneracy of mapping $f$ , for example in the form of integrability of $\Delta_{f}^{-1}$ to some power $p>1$ . Such condition is sometimes very restrictive and difficult for verification. For example, the required integrability is not valid for polynomial mappings. Nevertheless, for polynomials on Gaussian space, the following weak nondegeneracy condition holds: $\Delta_{f}^{-1}$ is integrable to every power $\theta<\frac{1}{2d(k-1)}$ (this follows from the Carbery–Wright inequality [10], [15]). Thus, a natural question is to investigate the smoothness properties of distributions $\gamma\circ f^{-1}$ for Sobolev mappings $f$ under the weak nondegeneracy assumption of the integrability of $\Delta_{f}^{-1}$ to some power $\theta\in(0,1)$ . Corollaries 3.5 and 4.5 give the Nikolskii–Besov fractional smoothness of distributions under such weak assumption which generalizes the results of [5] about the polynomial mappings. Our results also give an estimate of the total variation distance between two such distributions under a common weak nondegeneracy assumption in terms of the Kantorovich–Rubinstein distance between these distributions.

1. Definitions and notations

In this section we introduce the definitions and notation used throughout the paper.

Let $C_{0}^{\infty}(\mathbb{R}^{n})$ denote the space of all infinitely smooth functions with compact support and let $C_{b}^{\infty}(\mathbb{R}^{n})$ denote the space of all bounded smooth functions with bounded derivatives of every order. The standard Euclidian inner product on $\mathbb{R}^{k}$ is denoted by $\langle\cdot,\cdot\rangle$ , and the standard norm is denoted by $|\cdot|$ . For the standard Lebesgue measure on $\mathbb{R}^{k}$ we will use the symbol $\lambda^{k}$ .

Let $\mu$ be a bounded measure on a measurable space. Recall that $\mu\circ f^{-1}$ denotes the image of the measure $\mu$ under a $\mu$ -measurable mapping $f$ , i.e., the following equality holds:

[TABLE]

For a Borel measure $\mu$ on $\mathbb{R}^{k}$ , its shift to the vector $h$ is the measure $\mu_{h}$ defined by the equality

[TABLE]

The total variation norm of a Borel measure $\mu$ on $\mathbb{R}^{k}$ (possibly signed) is defined by the equality

[TABLE]

where

[TABLE]

The Kantorovich–Rubinstein norm (which is sometimes called the Fortet–Mourier norm) of a Borel measure $\mu$ on $\mathbb{R}^{k}$ is defined by the formula

[TABLE]

We note here that, for probability measures, convergence in the Kantorovich–Rubinstein norm is equivalent to weak convergence (convergence in distribution for random variables). We also introduce the Kantorovich norm of a measure $\mu$ on $\mathbb{R}^{k}$ with finite first moment ( $\int|x|\,|\mu|(dx)<\infty$ ) and with $\mu(\mathbb{R}^{k})=0$ :

[TABLE]

We recall (see [2], [16], and [21]) that the Nikolskii–Besov space $B^{\alpha}(\mathbb{R}^{k}):=B^{\alpha}_{1,\infty}(\mathbb{R}^{k})$ with $\alpha\in(0,1)$ consists of all functions $\rho\in L^{1}(\mathbb{R}^{k})$ for which there is a constant $C$ such that for every $h\in\mathbb{R}^{k}$ one has

[TABLE]

When the function $\rho$ is the density (with respect to $\lambda^{k}$ ) of the measure $\mu$ the above condition can be represented in the following form:

[TABLE]

We now recall several facts about Gaussian measures on locally convex spaces.

Let $E$ be a locally convex space with the topological dual $E^{*}$ . Let $\gamma$ be a centered Gaussian measure on $E$ , i.e. it is a Radon measure such that every functional $\ell\in E^{*}$ is a normally distributed random variable with zero mean (its distribution is either the Dirac measure at zero or has a centered Gaussian density). Let $H\subset E$ be the Cameron–Martin space of the measure $\gamma$ consisting of all vectors $h$ with finite Cameron–Martin norm $|h|_{H}<\infty$ , where

[TABLE]

For the standard Gaussian measure on $\mathbb{R}^{n}$ , the Cameron–Martin space is $\mathbb{R}^{n}$ itself. For a general Radon Gaussian measure, the Cameron–Martin space is a separable Hilbert space (see [3, Theorem 3.2.7 and Proposition 2.4.6]) with the inner product $\langle\cdot,\cdot\rangle_{H}$ generated by $|\cdot|_{H}$ .

It is known (see, for example, [3, Section 2.10]) that for an arbitrary orthonormal family $\{\ell_{i}\}_{i=1}^{n}\subset E^{*}$ in $L^{2}(\gamma)$ there is an orthonormal family $\{e_{i}\}_{i=1}^{\infty}$ in $H$ such that $\ell_{i}(e_{j})=\delta_{i,j}$ . Let $\gamma_{n}$ be the distribution of the vector $(\ell_{1},\ldots,\ell_{n})$ on $\mathbb{R}^{n}$ . This distribution is the standard Gaussian measure on $\mathbb{R}^{n}$ with density $(2\pi)^{-n/2}\exp(-|x|^{2}/2)$ .

For a function $f\in L^{p}(\gamma)$ we set

[TABLE]

Let $\mathcal{FC}^{\infty}(E)$ be the set of all functions $\varphi$ of the form $\varphi(x)=\psi(\ell_{1}(x),\ldots,\ell_{n}(x))$ , where $\psi\in C_{b}^{\infty}(\mathbb{R}^{n})$ and $n\in\mathbb{N}$ .

For a function $\varphi\in\mathcal{FC}^{\infty}(E)$ of the form $\varphi(x)=\psi(\ell_{1}(x),\ldots,\ell_{n}(x))$ set

[TABLE]

The Sobolev space $W^{p,m}(\gamma)$ , $m\in\{1,2\}$ , is the closure of the class $\mathcal{FC}^{\infty}(E)$ with respect to the norm

[TABLE]

where $\|D^{1}\varphi\|_{p}:=\||\nabla\varphi|_{H}\|_{p}$ , $\|D^{2}\varphi\|_{p}:=\||D^{2}\varphi|_{HS}\|_{p}$ , and $|\cdot|_{HS}$ is the Hilbert–Schmidt norm.

Let $L$ be the Ornstein–Uhlenbeck operator defined by

[TABLE]

for $\varphi\in C_{b}^{\infty}(\mathbb{R}^{n})$ , where $\Delta$ is the Laplace operator. We note that

[TABLE]

for $p>1$ with some constant $c_{1}(p)$ depending only on $p$ (see [3, Theorem 5.7.1]).

Let $f\colon E\to\mathbb{R}^{k}$ be a mapping such that its components $f_{1},\ldots,f_{k}$ belongs to $W^{1,1}(\gamma)$ . Let us define the Malliavin matrix $M_{f}$ of the mapping $f$ by

[TABLE]

Let

[TABLE]

be the adjugate matrix of $M_{f}$ , i.e., $a_{i,j}=M^{j,i}$ , where $M^{j,i}$ is the cofactor of $m_{j,i}$ in the matrix $M_{f}$ . Set

[TABLE]

Note that

[TABLE]

For a function $g\geq 0$ we set

[TABLE]

We need the following simple lemma.

Lemma 1.1.

For a function $g\geq 0$ and arbitrary numbers $r\geq 1,\varepsilon>0$ one has

[TABLE]

Proof.

By Fubini’s theorem and Chebyshev’s inequality one has

[TABLE]

The lemma is proved. ∎

2. Smoothness properties of measures on $\mathbb{R}^{k}$

The following modulus of continuity plays a crucial role below.

Definition 2.1.

For a measure $\mu$ on $\mathbb{R}^{k}$ and $t>0$ we set

[TABLE]

where the supremum is taken over all functions $\varphi\in C_{0}^{\infty}(\mathbb{R}^{k})$ and over all unit vectors $e$ .

The following theorem is proved in [13].

Theorem 2.2.

For any measure $\mu$ on $\mathbb{R}^{k}$ one has

[TABLE]

This theorem implies that the measure $\mu$ is absolutely continuous with respect to Lebesgue measure if (and only if) $\sigma(\mu,t)\to 0$ as $t\to 0$ .

The modulus of continuity $\sigma(\mu,\cdot)$ can be used to compare different distances on the space of probability measures. In the following theorem we estimate the total variation distance between two probability measures $\mu$ and $\nu$ in terms of the Kantorovich–Rubinstein distance and the quantity $\sigma(\mu-\nu,\cdot)$ . This result generalizes some estimates from [5] and [12].

Lemma 2.3.

Let $\mu$ and $\nu$ be two probability measures on $\mathbb{R}^{k}$ . Then for any $\varepsilon\in(0,1)$ one has

[TABLE]

In particular, since $\sigma(\mu-\nu,\varepsilon)\leq\sigma(\mu,\varepsilon)+\sigma(\nu,\varepsilon)$ , we have

[TABLE]

Proof.

Set

[TABLE]

and

[TABLE]

For the measure $\omega:=\mu-\nu$ we have

[TABLE]

where $\omega*\rho_{\varepsilon}$ is the convolution of the measures $\omega$ and $\rho_{\varepsilon}\,dx$ . For the first term above, we have

[TABLE]

For the second term, we have

[TABLE]

Note that

[TABLE]

Thus, the Lipschitz constant of the function

[TABLE]

can be estimated from above by $\varepsilon^{-1}\int_{\mathbb{R}^{n}}|\nabla\rho(x)|\,dx\leq\varepsilon^{-1}\sqrt{k}$ . Moreover,

[TABLE]

for $\varepsilon\in(0,1)$ . So,

[TABLE]

In the first integral $\sigma(\omega,\varepsilon|y|/2)\leq\sigma(\omega,\varepsilon)$ by monotonicity of the function $\sigma(\omega,\cdot)$ and in the second integral $\sigma(\omega,\varepsilon|y|/2)\leq|y|/2\sigma(\omega,\varepsilon)$ , since $\sigma(\mu,t\varepsilon)\leq t\sigma(\mu,\varepsilon)$ for $t\geq 1$ . Thus,

[TABLE]

where $c_{n}=2\int_{|y|\leq 2}\rho(y)dy+\int_{|y|>2}|y|\rho(y)dy\leq 2+\sqrt{k}\leq 3\sqrt{k}$ . The lemma is proved. ∎

Remark 2.4.

By a similar reasoning, one can prove that, for an arbitrary pair of probability measures $\mu$ and $\nu$ on $\mathbb{R}^{k}$ and any $\varepsilon>0$ , one has

[TABLE]

3. One-dimensional case

In this section we study smoothness properties of the distribution $\gamma\circ f^{-1}$ on the real line generated by a Sobolev smooth function $f$ on a locally convex space equipped with a centered Gaussian measure $\gamma$ .

We start with the following technical lemma.

Lemma 3.1.

Let $p>1$ , $r\geq 1$ , $a>0$ . Then there is a constant $c(p)$ depending only on $p$ such that for every function $f\in W^{p,2}(\gamma)$ with

[TABLE]

and for every function $g\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ one has

[TABLE]

for any $\varepsilon>0$ .

Proof.

We first assume that the functions $g,f$ belong to $\mathcal{FC}^{\infty}(E)$ and are of the form $g=g(\ell_{1},\ldots,\ell_{n})$ , $f=f(\ell_{1},\ldots,\ell_{n})$ . Integrating by parts, we have

[TABLE]

where $L$ is the Ornstein–Uhlenbeck operator associated with the standard Gaussian measure $\gamma_{n}$ .

For a general function $f\in W^{p,2}(\gamma)$ , we can take a sequence $f^{n}\in\mathcal{FC}^{\infty}(E)$ such that $f^{n}\to f$ in $W^{p,2}(\gamma)$ which also converges almost everywhere along with first and second derivatives. Passing to the limit in the above inequality we obtain the same inequality for a general function $f\in W^{p,2}(\gamma)$ and a function $g\in\mathcal{FC}^{\infty}(E)$ . Now, for a function $g\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ we can take functions $g^{n}\in\mathcal{FC}^{\infty}(E)$ such that $g^{n}\to g$ in $W^{r,1}(\gamma)$ and almost everywhere. Let us consider function $\varphi\in C_{0}^{\infty}(\mathbb{R})$ such that $\varphi(t)=t$ for $t\in[-\|g\|_{\infty},\|g\|_{\infty}]$ and $|\varphi(t)|\leq 2\|g\|_{\infty}$ . Then the sequence $\{\varphi(g^{n})\}$ also converges to the function $g$ in $W^{r,1}(\gamma)$ and almost everywhere, $\|\varphi(g^{n})\|_{\infty}\leq 2\|g\|_{\infty}$ . We can pass to the limit in the above inequality and obtain a similar estimate for general functions $f\in W^{p,2}(\gamma)$ and $g\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ :

[TABLE]

By Lemma 1.1 we have

[TABLE]

Thus,

[TABLE]

The lemma is proved. ∎

Theorem 3.2.

Let $p>1$ , $a>0$ . Then there is a constant $c(p)$ , depending only on $p$ , such that for every function $f\in W^{p,2}(\gamma)$ with

[TABLE]

one has

[TABLE]

for every number $\varepsilon>0$ .

Proof.

For all $\varphi\in C_{0}^{\infty}(\mathbb{R})$ and $\varepsilon>0$ , we can write

[TABLE]

For the first term, by Lemma 3.1, we have

[TABLE]

The second term, by Lemma 1.1, does not exceed

[TABLE]

Therefore,

[TABLE]

The theorem is proved. ∎

Since $u_{\gamma}(|\nabla f|_{H},\varepsilon)\leq 1$ , taking $\varepsilon=\sqrt{t}$ in the previous theorem, we obtain the following result.

Corollary 3.3.

Let $p>1$ , $a>0$ . Then there is a constant $c(p)$ , depending only on $p$ , such that for every function $f\in W^{p,2}(\gamma)$ with

[TABLE]

one has

[TABLE]

The following corollary provides a quantitative bound in the following result from [8]: convergence in distribution of random variables $f_{n}$ from a certain Sobolev class implies convergence in variation under some uniform nondegeneracy assumption and uniform boundedness of their Sobolev norms.

Corollary 3.4.

Let $p>1$ and let $f_{n}\in W^{p,2}(\gamma)$ be a sequence such that

[TABLE]

Assume that the sequence of distributions $\gamma\circ f_{n}^{-1}$ converges weakly to the measure $\nu$ (equivalently $\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}\to 0$ ). Then $\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm TV}\to 0$ and there is a constant $C(p,a)$ such that

[TABLE]

Proof.

By Lemma 2.3 and Corollary 3.3 we have

[TABLE]

Passing to the limit as $m\to\infty$ , we obtain a similar estimate with $\nu$ in place of $\gamma\circ f_{m}^{-1}$ . We now note that

[TABLE]

Taking $\varepsilon=\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/2}$ we get

[TABLE]

The corollary is proved. ∎

The following corollary gives the Nikolskii–Besov smoothness of $\gamma\circ f^{-1}$ under the assumption of $\gamma$ -integrability of $|\nabla f|_{H}^{-1}$ to some power $\theta\in(0,1)$ . This result generalizes [5, Theorem 5.1] to the case of general Sobolev functions.

Corollary 3.5.

Let $p>1$ , $a,b>0$ , $\theta\in(0,1)$ . Set $\alpha:=\frac{p\theta}{2p+\theta}$ . There is a constant $C:=C(p,a,b,\theta)$ such that for every function $f\in W^{p,2}(\gamma)$ with

[TABLE]

one has

[TABLE]

Equivalently, the measure $\gamma\circ f^{-1}$ possesses a density from the Nikolskii–Besov class $B^{\alpha}(\mathbb{R})$ .

Proof.

Under our assumptions, we have

[TABLE]

By Theorem 3.2 for every $\varepsilon>0$ , one has

[TABLE]

Taking $\varepsilon=t^{\frac{p}{2p+\theta}}$ and applying Theorem 2.2 we get the desired bound. ∎

The following corollary generalizes [5, Theorem 5.2].

Corollary 3.6.

Let $p>1$ , $a,b>0$ , $\theta\in(0,1)$ . Set $\beta:=\frac{p\theta}{(2+\theta)p+\theta}$ . There is a constant $C_{1}:=C_{1}(p,a,b,\theta)$ such that such that, for every pair of functions $f,g\in W^{p,2}(\gamma)$ with

[TABLE]

one has

[TABLE]

Proof.

By Lemma 2.3 for each $\varepsilon\in(0,1)$ one has

[TABLE]

where $\alpha=\frac{p\theta}{2p+\theta}$ . Taking $\varepsilon=\|\gamma\circ f^{-1}-\gamma\circ g^{-1}\|_{\rm KR}^{\frac{1}{1+\alpha}}$ we get the desired bound. ∎

4. Multidimensional case

We now proceed to the case of multidimensional mappings $f=(f_{1},\ldots,f_{k})\colon E\to\mathbb{R}^{k}$ and the properties of their distributions $\gamma\circ f^{-1}$ on $\mathbb{R}^{k}$ .

We start with the following analog of Lemma 3.1.

Lemma 4.1.

Let $k\in\mathbb{N}$ , $p>1$ , $q>1$ , $r\geq 1$ , $a>0$ . Then there exists a number $C_{0}:=C_{0}(k,p,q,a)>0$ such that, for every mapping $f=(f_{1},\ldots,f_{k})\colon\,E\to\mathbb{R}^{k}$ , where $f_{i}\in W^{p,2}(\gamma)$ and

[TABLE]

for every pair of functions $u\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ , $v\in W^{q,1}(\gamma)$ with $1/q+1/p+1/r=1$ , $1-1/q-(2k+1)/p>0$ and for every number $\varepsilon\in(0,1)$ , one has

[TABLE]

Proof.

We first assume that the functions $u,v,f_{i}$ , $i=1,2,\ldots,k$ , belong to $\mathcal{FC}^{\infty}(E)$ and are of the form $u=u(\ell_{1},\ldots,\ell_{n})$ , $v=v(\ell_{1},\ldots,\ell_{n})$ , $f_{i}=f_{i}(\ell_{1},\ldots,\ell_{n})$ , for $i=1,2,\ldots,k$ . Integrating by parts, we have

[TABLE]

where $L$ is the Ornstein–Uhlenbeck operator associated with the standard Gaussian measure $\gamma_{n}$ . We now estimate each of these three terms. The first term in (4.1) can be estimated from above by

[TABLE]

The third term in (4.1) can be estimated by

[TABLE]

To estimate the second term in (4.1) we need to estimate the gradient of the determinant. We note that for an arbitrary matrix $C$ , one has $|\det C|\leq\prod_{i}|c^{i}|$ , where $\{c^{i}\}$ are columns of the matrix $C$ . We have $\langle\nabla f_{j},\nabla\Delta_{f}\rangle=\sum_{i}\det C_{i}$ , where $C_{i}=\{c_{i}^{m,r}\}$ is the matrix such that $c_{i}^{m,r}=\langle\nabla f_{m},\nabla f_{r}\rangle$ for $r\neq i$ and $c_{i}^{m,i}=\langle D^{2}f_{m}\cdot\nabla f_{i},\nabla f_{j}\rangle+\langle D^{2}f_{i}\cdot\nabla f_{m},\nabla f_{j}\rangle$ . Thus,

[TABLE]

So, the second term in (4.1) is estimated by

[TABLE]

for some constant $c_{2}(k)$ , which depends only on $k$ .

Therefore, we have

[TABLE]

for functions $u,v,f_{i}\in\mathcal{FC}^{\infty}(E)$ , $i=1,2,\ldots,k$ . For general functions $f_{i}\in W^{p,2}(\gamma)$ , $v\in W^{q,1}(\gamma)$ , we can take sequences $f_{i}^{n}\in\mathcal{FC}^{\infty}(E)$ , $v^{n}\in\mathcal{FC}^{\infty}(E)$ such that $f_{i}^{n}\to f_{i}$ in $W^{p,2}(\gamma)$ , $v^{n}\to v$ in $W^{q,1}(\gamma)$ and both sequences (along with the sequences of their derivatives) also converge almost everywhere. Passing to the limit in the above inequality we obtain the same inequality for general functions $f_{i}\in W^{p,2}(\gamma)$ , $i=1,2,\ldots,k$ , $v\in W^{q,1}(\gamma)$ , and functions $u\in\mathcal{FC}^{\infty}(E)$ . Now, for a function $u\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ , we can take functions $u^{n}\in\mathcal{FC}^{\infty}(E)$ such that $u^{n}\to u$ in $W^{r,1}(\gamma)$ and almost everywhere. Let us consider a function $\varphi\in C_{0}^{\infty}(\mathbb{R})$ such that $\varphi(t)=t$ for $t\in[-\|u\|_{\infty},\|u\|_{\infty}]$ and $|\varphi(t)|\leq 2\|u\|_{\infty}$ . Then, the sequence $\{\varphi(u^{n})\}$ also converges to the function $u$ in $W^{r,1}(\gamma)$ and almost everywhere, $\|\varphi(u^{n})\|_{\infty}\leq 2\|u\|_{\infty}$ . We can pass to the limit in the above inequality and obtain a similar estimate for general functions $f_{i}\in W^{p,2}(\gamma)$ , $i=1,2,\ldots,k$ , $v\in W^{q,1}(\gamma)$ , $u\in W^{r,1}(\gamma)\cap L^{\infty}(\gamma)$ :

[TABLE]

with $c_{3}(k,p)=2\bigl{(}c_{1}(p)+1\bigr{)}+2c_{2}(k)$ .

By Lemma 1.1 we have

[TABLE]

and

[TABLE]

Since $\varepsilon\leq 1$ and $u_{\gamma}(\Delta_{f},\varepsilon)\leq 1$ we have

[TABLE]

Thus,

[TABLE]

with $C_{0}(k,p,q,a)=c_{3}(k,p)(2a+3a^{2k+1})$ . The lemma is proved. ∎

Theorem 4.2.

Let $k\in\mathbb{N}$ , $a>0$ , and $p>4k-1$ . Then there exists a number $C_{1}:=C_{1}(p,k,a)>0$ such that, for every mapping $f=(f_{1},\ldots,f_{k})\colon\,E\to\mathbb{R}^{k}$ , where $f_{i}\in W^{p,2}(\gamma)$ and

[TABLE]

for every $\varepsilon\in(0,1)$ , one has

[TABLE]

Proof.

Fix an arbitrary function $\varphi\in C_{0}^{\infty}(\mathbb{R}^{k})$ with $\|\varphi\|_{\infty}\leq t$ , $\|\partial_{e}\varphi\|_{\infty}\leq 1$ , and an arbitrary unit vector $e\in\mathbb{R}^{k}$ . It can be easily verified that

[TABLE]

Here the left-hand side is interpreted as the standard product of a matrix and a column vector. Then by (1.1) we have

[TABLE]

which yields the following equality:

[TABLE]

For any fixed number $\varepsilon\in(0,1)$ we can write

[TABLE]

For the first term by the above reasoning we have

[TABLE]

We note that $a^{i,j}_{f}e_{i}\in W^{p/(2k-2),1}(\gamma)$ and there is a constant $c_{4}(k)$ such that

[TABLE]

We also note that $\varphi\circ f\in W^{p,1}(\gamma)$ and $(2k-2)/p+1/p+1/p\leq 1$ . Hence $\varphi\circ f\in W^{\frac{1}{1-(2k-1)/p},1}(\gamma)$ and $\|\varphi\circ f\|_{W^{\frac{1}{1-(2k-1)/p},1}(\gamma)}\leq\|\varphi\circ f\|_{W^{p,1}(\gamma)}$ . Moreover, we have

[TABLE]

Applying now Lemma 4.1 with $r=(1-(2k-1)/p)^{-1}$ and $q=p/(2k-2)$ we obtain

[TABLE]

with $C_{1}(k,p,a)=k^{2}c_{4}(k)C_{0}(k,p,p/(2k-2),a)a^{2k-2}$ .

Using Lemma 1.1, we can estimate the second term in (4.2) in the following way:

[TABLE]

Hence we have obtained the estimate

[TABLE]

Since $\|\varphi\|_{\infty}\leq t$ , the theorem is proved. ∎

Taking $\varepsilon=\sqrt{t}$ we get the following result.

Corollary 4.3.

Let $k\in\mathbb{N}$ , $a>0$ , and $p>4k-1$ . Then there exists a constant $C:=C(p,k,a)>0$ such that, for every mapping $f=(f_{1},\ldots,f_{k})\colon\,E\to\mathbb{R}^{k}$ , where $f_{i}\in W^{p,2}(\gamma)$ and

[TABLE]

for every $t\in(0,1)$ , one has

[TABLE]

The following corollary is a multidimensional analog of Corollary 3.4. It asserts that convergence in distribution of random vectors $f_{n}$ from a Sobolev class implies convergence in variation provided they are uniformly nondegenerate and uniformly bounded in the Sobolev norm.

Corollary 4.4.

Let $k\in\mathbb{N}$ , $a>0$ , and $p>4k-1$ . Let $f_{n}=(f_{n,1},\ldots,f_{n,k})\in W^{p,2}(\gamma)$ be a sequence of mappings such that

[TABLE]

Assume also that the sequence of distributions $\gamma\circ f_{n}^{-1}$ converges weakly to some measure $\nu$ (equivalently, $\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}\to 0$ ). Then $\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm TV}\to 0$ and

[TABLE]

Proof.

By Lemma 2.3 and Corollary 4.3 we have

[TABLE]

Passing to the limit as $m\to\infty$ , we obtain a similar estimate with $\nu$ in place of $\gamma\circ f_{m}^{-1}$ . Now we proceed as in Corollary 3.4:

[TABLE]

Taking $\varepsilon=2^{-1/2}\|\gamma\circ f_{n}^{-1}-\nu\|_{\rm KR}^{1/2}\leq 1$ we get

[TABLE]

The corollary is proved. ∎

We now apply Theorem 4.2 to show the Nikolskii–Besov smoothness of $\gamma\circ f^{-1}$ under our weak nondegeneracy condition: $\Delta_{f}^{-1}$ is $\gamma$ -integrable to some power $\theta\in(0,1)$ . The following corollary generalizes [5, Theorem 4.1].

Corollary 4.5.

Let $k\in\mathbb{N}$ , $a>0$ , $b>0$ , $\theta\in(0,1)$ , $p>4k-1$ . Set $\alpha:=\frac{p\theta}{2p+(4k-1)\theta}$ . Then there exists a number $C:=C(p,k,a,b,\theta)>0$ such that, for every mapping $f=(f_{1},\ldots,f_{k})\colon E\to\mathbb{R}^{k}$ , where $f_{i}\in W^{p,2}(\gamma)$ and

[TABLE]

one has

[TABLE]

In other words, the density of $\gamma\circ f^{-1}$ belongs to the Nikolskii–Besov space $B^{\alpha}(\mathbb{R}^{k})$ .

Proof.

Let us estimate $u_{\gamma}(\Delta_{f},\varepsilon)$ :

[TABLE]

By Theorem 4.2 for $\varepsilon\in(0,1)$ one has

[TABLE]

Taking $\varepsilon=t^{\frac{p}{2p+(4k-1)\theta}}$ for $t<1$ and noting that $\sigma(\gamma\circ f^{-1},t)\leq 1\leq t$ for $t\geq 1$ , by Theorem 2.2 we get the desired bound. ∎

The next corollary is a generalization of [5, Theorem 4.2] to the case of Sobolev mappings in place of polynomials.

Corollary 4.6.

Let $k\in\mathbb{N}$ , $a>0$ , $b>0$ , $\theta\in(0,1)$ , $p>4k-1$ . Set $\alpha:=\frac{p\theta}{2p+(4k-1)\theta}$ . Then there exists a number $C:=C(p,k,a,b,\theta)>0$ such that for every pair of mappings $f=(f_{1},\ldots,f_{k}),g=(g_{1},\ldots,g_{k})\colon\,E\to\mathbb{R}^{k}$ , where $f_{i},g_{i}\in W^{p,2}(\gamma)$ and

[TABLE]

one has

[TABLE]

Proof.

By Lemma 2.3, for an arbitrary $\varepsilon\in(0,1)$ , one has

[TABLE]

Taking $\varepsilon=2^{-1}\|\gamma\circ f^{-1}-\gamma\circ g^{-1}\|_{\rm KR}^{\frac{1}{1+\alpha}}$ we get the desired bound. ∎

The author is a Young Russian Mathematics award winner and would like to thank its sponsors and jury.

This research was supported by the Russian Science Foundation Grant 17-11-01058 at Lomonosov Moscow State University.

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Bally, L. Caramellino, On the distances between probability density functions, Electron. J. Probab. 19:110 (2014), 1–33.
2[2] O. V. Besov, V. P. Il’in, S. M. Nikolskiĭ, Integral representations of functions and imbedding theorems. V. I, II. Winston & Sons, Washington; Halsted Press, New York – Toronto – London, 1978, 1979.
3[3] V. I. Bogachev, Gaussian measures. Amer. Math. Soc., Providence, Rhode Island, 1998.
4[4] V.I. Bogachev, Differentiable measures and the Malliavin calculus, Amer. Math. Soc., Providence, Rhode Island, 2010.
5[5] V. I. Bogachev, E. D. Kosov, G. I. Zelenov, Fractional smoothness of distributions of polynomials and a fractional analog of the Hardy–Landau–Littlewood inequality, Amer. Math. Soc. 370:6 (2018), 4401–4432.
6[6] V. I. Bogachev, E. D. Kosov, I. Nourdin, G. Poly, Two properties of vectors of quadratic forms in Gaussian random variables, Theory Probab. Appl., 59:2 (2015), 208–221.
7[7] V. I. Bogachev, E. D. Kosov, S. N. Popova, A new approach to Nikolskii–Besov classes, to appear in Moscow Math. J.
8[8] V. I. Bogachev, G. I. Zelenov, On convergence in variation of weakly convergent multidimensional distributions, Doklady Math., 91:2 (2015), 138–141.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On fractional regularity of

Abstract.

Introduction

1. Definitions and notations

Lemma 1.1**.**

Proof.

2. Smoothness properties of measures on Rk\mathbb{R}^{k}Rk

Definition 2.1**.**

Theorem 2.2**.**

Lemma 2.3**.**

Proof.

Remark 2.4**.**

3. One-dimensional case

Lemma 3.1**.**

Proof.

Theorem 3.2**.**

Proof.

Corollary 3.3**.**

Corollary 3.4**.**

Proof.

Corollary 3.5**.**

Proof.

Corollary 3.6**.**

Proof.

4. Multidimensional case

Lemma 4.1**.**

Proof.

Theorem 4.2**.**

Proof.

Corollary 4.3**.**

Corollary 4.4**.**

Proof.

Corollary 4.5**.**

Proof.

Corollary 4.6**.**

Proof.

Lemma 1.1.

2. Smoothness properties of measures on $\mathbb{R}^{k}$

Definition 2.1.

Theorem 2.2.

Lemma 2.3.

Remark 2.4.

Lemma 3.1.

Theorem 3.2.

Corollary 3.3.

Corollary 3.4.

Corollary 3.5.

Corollary 3.6.

Lemma 4.1.

Theorem 4.2.

Corollary 4.3.

Corollary 4.4.

Corollary 4.5.

Corollary 4.6.