On fractional regularity of distributions of functions in Gaussian random variables
Egor Kosov

TL;DR
This paper investigates the fractional smoothness of measures derived from Gaussian distributions through Sobolev class mappings, establishing fractional regularity results under weak nondegeneracy conditions.
Contribution
It introduces new results on the fractional regularity of Gaussian measure images, extending understanding of their smoothness properties under weak assumptions.
Findings
Established Nikolskii--Besov fractional regularity for Gaussian measure images.
Derived fractional smoothness results under weak nondegeneracy conditions.
Extended the theory of measure regularity in Gaussian spaces.
Abstract
We study fractional smoothness of measures on , that are images of a Gaussian measure under mappings from Gaussian Sobolev classes. As a consequence we obtain Nikolskii--Besov fractional regularity of these distributions under some weak nondegeneracy assumption.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On fractional regularity of
distributions of functions in Gaussian random variables
Egor D. Kosov
Abstract.
We study fractional smoothness of measures on , that are images of a Gaussian measure under mappings from Gaussian Sobolev classes. As a consequence we obtain Nikolskii–Besov fractional regularity of these distributions under some weak nondegeneracy assumption.
Keywords: Gaussian measure, distribution, Nikolskii–Besov space, total variation distance, Kantorovich norm
AMS Subject Classification: 60E05, 60E15, 28C20, 60F99
Introduction
Let be a Gaussian measure on a locally convex space and be a polynomial mapping. It was shown in [5] and [12] that the density of the image measure belongs to a certain Nikolskii–Besov class. Here we consider a general Sobolev mapping and provide an estimate of the total variation norm in terms of the behavior of (see Theorems 3.2, 4.2 and Corollaries 3.3, 4.3), where is the shift of the measure to the vector , and where is the determinant of the Malliavin matrix of the mapping (all the necessary definitions are given in the first section). This result provides a quantitative estimate of smoothness of and complements the classical theorem (see [4, Theorem 9.2.4]) which asserts that such a distribution possesses a density with respect to the standard Lebesgue measure if for -almost every point . However, it should be mentioned that in this classical result only the inclusion of to the first Sobolev class is assumed. We also note that in [1, Theorem 2.11] the lower semi-continuity of densities of such distributions was established.
The obtained results also provide a quantitative estimate in the following qualitative theorem (see [8] and [5], which generalizes [1, Theorem 2.14]). Let be a sequence of functions such that . Set
[TABLE]
and assume that
[TABLE]
If the sequence of measures converges in distribution, it also converges in variation. Corollary 4.4 of the present paper asserts that under the same assumptions one has
[TABLE]
where is the limiting distribution and is the Kantorovich–Rubinstein norm, which metrizes weak convergence of probability measures. A similar bound is also valid for mappings from for any , which is also an improvement of the above result.
The approach in this work is similar to the classical Malliavin method developed in [14] (see also [4]). The main idea of the method is to obtain bounds of the form
[TABLE]
which yields that the density of is infinitely differentiable. In works [5], [12], the Malliavin condition was modified to treat the case of Nikolskii–Besov fractional smoothness of distributions. In this work we similarly employ the results of [13] which estimate the quantity in terms of the function
[TABLE]
where the supremum is taken over all functions and unit vectors .
To apply the classical Malliavin method one should assume some nondegeneracy of mapping , for example in the form of integrability of to some power . Such condition is sometimes very restrictive and difficult for verification. For example, the required integrability is not valid for polynomial mappings. Nevertheless, for polynomials on Gaussian space, the following weak nondegeneracy condition holds: is integrable to every power (this follows from the Carbery–Wright inequality [10], [15]). Thus, a natural question is to investigate the smoothness properties of distributions for Sobolev mappings under the weak nondegeneracy assumption of the integrability of to some power . Corollaries 3.5 and 4.5 give the Nikolskii–Besov fractional smoothness of distributions under such weak assumption which generalizes the results of [5] about the polynomial mappings. Our results also give an estimate of the total variation distance between two such distributions under a common weak nondegeneracy assumption in terms of the Kantorovich–Rubinstein distance between these distributions.
1. Definitions and notations
In this section we introduce the definitions and notation used throughout the paper.
Let denote the space of all infinitely smooth functions with compact support and let denote the space of all bounded smooth functions with bounded derivatives of every order. The standard Euclidian inner product on is denoted by , and the standard norm is denoted by . For the standard Lebesgue measure on we will use the symbol .
Let be a bounded measure on a measurable space. Recall that denotes the image of the measure under a -measurable mapping , i.e., the following equality holds:
[TABLE]
For a Borel measure on , its shift to the vector is the measure defined by the equality
[TABLE]
The total variation norm of a Borel measure on (possibly signed) is defined by the equality
[TABLE]
where
[TABLE]
The Kantorovich–Rubinstein norm (which is sometimes called the Fortet–Mourier norm) of a Borel measure on is defined by the formula
[TABLE]
We note here that, for probability measures, convergence in the Kantorovich–Rubinstein norm is equivalent to weak convergence (convergence in distribution for random variables). We also introduce the Kantorovich norm of a measure on with finite first moment () and with :
[TABLE]
We recall (see [2], [16], and [21]) that the Nikolskii–Besov space with consists of all functions for which there is a constant such that for every one has
[TABLE]
When the function is the density (with respect to ) of the measure the above condition can be represented in the following form:
[TABLE]
We now recall several facts about Gaussian measures on locally convex spaces.
Let be a locally convex space with the topological dual . Let be a centered Gaussian measure on , i.e. it is a Radon measure such that every functional is a normally distributed random variable with zero mean (its distribution is either the Dirac measure at zero or has a centered Gaussian density). Let be the Cameron–Martin space of the measure consisting of all vectors with finite Cameron–Martin norm , where
[TABLE]
For the standard Gaussian measure on , the Cameron–Martin space is itself. For a general Radon Gaussian measure, the Cameron–Martin space is a separable Hilbert space (see [3, Theorem 3.2.7 and Proposition 2.4.6]) with the inner product generated by .
It is known (see, for example, [3, Section 2.10]) that for an arbitrary orthonormal family in there is an orthonormal family in such that . Let be the distribution of the vector on . This distribution is the standard Gaussian measure on with density .
For a function we set
[TABLE]
Let be the set of all functions of the form , where and .
For a function of the form set
[TABLE]
[TABLE]
The Sobolev space , , is the closure of the class with respect to the norm
[TABLE]
where , , and is the Hilbert–Schmidt norm.
Let be the Ornstein–Uhlenbeck operator defined by
[TABLE]
for , where is the Laplace operator. We note that
[TABLE]
for with some constant depending only on (see [3, Theorem 5.7.1]).
Let be a mapping such that its components belongs to . Let us define the Malliavin matrix of the mapping by
[TABLE]
Let
[TABLE]
be the adjugate matrix of , i.e., , where is the cofactor of in the matrix . Set
[TABLE]
Note that
[TABLE]
For a function we set
[TABLE]
We need the following simple lemma.
Lemma 1.1**.**
For a function and arbitrary numbers one has
[TABLE]
Proof.
By Fubini’s theorem and Chebyshev’s inequality one has
[TABLE]
The lemma is proved. ∎
2. Smoothness properties of measures on
The following modulus of continuity plays a crucial role below.
Definition 2.1**.**
For a measure on and we set
[TABLE]
where the supremum is taken over all functions and over all unit vectors .
The following theorem is proved in [13].
Theorem 2.2**.**
For any measure on one has
[TABLE]
This theorem implies that the measure is absolutely continuous with respect to Lebesgue measure if (and only if) as .
The modulus of continuity can be used to compare different distances on the space of probability measures. In the following theorem we estimate the total variation distance between two probability measures and in terms of the Kantorovich–Rubinstein distance and the quantity . This result generalizes some estimates from [5] and [12].
Lemma 2.3**.**
Let and be two probability measures on . Then for any one has
[TABLE]
In particular, since , we have
[TABLE]
Proof.
Set
[TABLE]
and
[TABLE]
For the measure we have
[TABLE]
where is the convolution of the measures and . For the first term above, we have
[TABLE]
For the second term, we have
[TABLE]
Note that
[TABLE]
Thus, the Lipschitz constant of the function
[TABLE]
can be estimated from above by . Moreover,
[TABLE]
for . So,
[TABLE]
In the first integral by monotonicity of the function and in the second integral , since for . Thus,
[TABLE]
where . The lemma is proved. ∎
Remark 2.4**.**
By a similar reasoning, one can prove that, for an arbitrary pair of probability measures and on and any , one has
[TABLE]
3. One-dimensional case
In this section we study smoothness properties of the distribution on the real line generated by a Sobolev smooth function on a locally convex space equipped with a centered Gaussian measure .
We start with the following technical lemma.
Lemma 3.1**.**
Let , , . Then there is a constant depending only on such that for every function with
[TABLE]
and for every function one has
[TABLE]
for any .
Proof.
We first assume that the functions belong to and are of the form , . Integrating by parts, we have
[TABLE]
where is the Ornstein–Uhlenbeck operator associated with the standard Gaussian measure .
For a general function , we can take a sequence such that in which also converges almost everywhere along with first and second derivatives. Passing to the limit in the above inequality we obtain the same inequality for a general function and a function . Now, for a function we can take functions such that in and almost everywhere. Let us consider function such that for and . Then the sequence also converges to the function in and almost everywhere, . We can pass to the limit in the above inequality and obtain a similar estimate for general functions and :
[TABLE]
By Lemma 1.1 we have
[TABLE]
Thus,
[TABLE]
The lemma is proved. ∎
Theorem 3.2**.**
Let , . Then there is a constant , depending only on , such that for every function with
[TABLE]
one has
[TABLE]
for every number .
Proof.
For all and , we can write
[TABLE]
For the first term, by Lemma 3.1, we have
[TABLE]
The second term, by Lemma 1.1, does not exceed
[TABLE]
Therefore,
[TABLE]
The theorem is proved. ∎
Since , taking in the previous theorem, we obtain the following result.
Corollary 3.3**.**
Let , . Then there is a constant , depending only on , such that for every function with
[TABLE]
one has
[TABLE]
The following corollary provides a quantitative bound in the following result from [8]: convergence in distribution of random variables from a certain Sobolev class implies convergence in variation under some uniform nondegeneracy assumption and uniform boundedness of their Sobolev norms.
Corollary 3.4**.**
Let and let be a sequence such that
[TABLE]
Assume that the sequence of distributions converges weakly to the measure (equivalently ). Then and there is a constant such that
[TABLE]
Proof.
By Lemma 2.3 and Corollary 3.3 we have
[TABLE]
Passing to the limit as , we obtain a similar estimate with in place of . We now note that
[TABLE]
Taking we get
[TABLE]
The corollary is proved. ∎
The following corollary gives the Nikolskii–Besov smoothness of under the assumption of -integrability of to some power . This result generalizes [5, Theorem 5.1] to the case of general Sobolev functions.
Corollary 3.5**.**
Let , , . Set . There is a constant such that for every function with
[TABLE]
one has
[TABLE]
Equivalently, the measure possesses a density from the Nikolskii–Besov class .
Proof.
Under our assumptions, we have
[TABLE]
By Theorem 3.2 for every , one has
[TABLE]
Taking and applying Theorem 2.2 we get the desired bound. ∎
The following corollary generalizes [5, Theorem 5.2].
Corollary 3.6**.**
Let , , . Set . There is a constant such that such that, for every pair of functions with
[TABLE]
one has
[TABLE]
Proof.
By Lemma 2.3 for each one has
[TABLE]
where . Taking we get the desired bound. ∎
4. Multidimensional case
We now proceed to the case of multidimensional mappings and the properties of their distributions on .
We start with the following analog of Lemma 3.1.
Lemma 4.1**.**
Let , , , , . Then there exists a number such that, for every mapping , where and
[TABLE]
for every pair of functions , with , and for every number , one has
[TABLE]
Proof.
We first assume that the functions , , belong to and are of the form , , , for . Integrating by parts, we have
[TABLE]
where is the Ornstein–Uhlenbeck operator associated with the standard Gaussian measure . We now estimate each of these three terms. The first term in (4.1) can be estimated from above by
[TABLE]
The third term in (4.1) can be estimated by
[TABLE]
To estimate the second term in (4.1) we need to estimate the gradient of the determinant. We note that for an arbitrary matrix , one has , where are columns of the matrix . We have , where is the matrix such that for and . Thus,
[TABLE]
So, the second term in (4.1) is estimated by
[TABLE]
for some constant , which depends only on .
Therefore, we have
[TABLE]
for functions , . For general functions , , we can take sequences , such that in , in and both sequences (along with the sequences of their derivatives) also converge almost everywhere. Passing to the limit in the above inequality we obtain the same inequality for general functions , , , and functions . Now, for a function , we can take functions such that in and almost everywhere. Let us consider a function such that for and . Then, the sequence also converges to the function in and almost everywhere, . We can pass to the limit in the above inequality and obtain a similar estimate for general functions , , , :
[TABLE]
with c_{3}(k,p)=2\bigl{(}c_{1}(p)+1\bigr{)}+2c_{2}(k).
By Lemma 1.1 we have
[TABLE]
and
[TABLE]
Since and we have
[TABLE]
Thus,
[TABLE]
with . The lemma is proved. ∎
Theorem 4.2**.**
Let , , and . Then there exists a number such that, for every mapping , where and
[TABLE]
for every , one has
[TABLE]
Proof.
Fix an arbitrary function with , , and an arbitrary unit vector . It can be easily verified that
[TABLE]
Here the left-hand side is interpreted as the standard product of a matrix and a column vector. Then by (1.1) we have
[TABLE]
which yields the following equality:
[TABLE]
For any fixed number we can write
[TABLE]
For the first term by the above reasoning we have
[TABLE]
We note that and there is a constant such that
[TABLE]
We also note that and . Hence and . Moreover, we have
[TABLE]
Applying now Lemma 4.1 with and we obtain
[TABLE]
with .
Using Lemma 1.1, we can estimate the second term in (4.2) in the following way:
[TABLE]
Hence we have obtained the estimate
[TABLE]
Since , the theorem is proved. ∎
Taking we get the following result.
Corollary 4.3**.**
Let , , and . Then there exists a constant such that, for every mapping , where and
[TABLE]
for every , one has
[TABLE]
The following corollary is a multidimensional analog of Corollary 3.4. It asserts that convergence in distribution of random vectors from a Sobolev class implies convergence in variation provided they are uniformly nondegenerate and uniformly bounded in the Sobolev norm.
Corollary 4.4**.**
Let , , and . Let be a sequence of mappings such that
[TABLE]
Assume also that the sequence of distributions converges weakly to some measure (equivalently, ). Then and
[TABLE]
Proof.
By Lemma 2.3 and Corollary 4.3 we have
[TABLE]
Passing to the limit as , we obtain a similar estimate with in place of . Now we proceed as in Corollary 3.4:
[TABLE]
Taking we get
[TABLE]
The corollary is proved. ∎
We now apply Theorem 4.2 to show the Nikolskii–Besov smoothness of under our weak nondegeneracy condition: is -integrable to some power . The following corollary generalizes [5, Theorem 4.1].
Corollary 4.5**.**
Let , , , , . Set . Then there exists a number such that, for every mapping , where and
[TABLE]
one has
[TABLE]
In other words, the density of belongs to the Nikolskii–Besov space .
Proof.
Let us estimate :
[TABLE]
By Theorem 4.2 for one has
[TABLE]
Taking for and noting that for , by Theorem 2.2 we get the desired bound. ∎
The next corollary is a generalization of [5, Theorem 4.2] to the case of Sobolev mappings in place of polynomials.
Corollary 4.6**.**
Let , , , , . Set . Then there exists a number such that for every pair of mappings , where and
[TABLE]
one has
[TABLE]
Proof.
By Lemma 2.3, for an arbitrary , one has
[TABLE]
Taking we get the desired bound. ∎
The author is a Young Russian Mathematics award winner and would like to thank its sponsors and jury.
This research was supported by the Russian Science Foundation Grant 17-11-01058 at Lomonosov Moscow State University.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V. Bally, L. Caramellino, On the distances between probability density functions, Electron. J. Probab. 19:110 (2014), 1–33.
- 2[2] O. V. Besov, V. P. Il’in, S. M. Nikolskiĭ, Integral representations of functions and imbedding theorems. V. I, II. Winston & Sons, Washington; Halsted Press, New York – Toronto – London, 1978, 1979.
- 3[3] V. I. Bogachev, Gaussian measures. Amer. Math. Soc., Providence, Rhode Island, 1998.
- 4[4] V.I. Bogachev, Differentiable measures and the Malliavin calculus, Amer. Math. Soc., Providence, Rhode Island, 2010.
- 5[5] V. I. Bogachev, E. D. Kosov, G. I. Zelenov, Fractional smoothness of distributions of polynomials and a fractional analog of the Hardy–Landau–Littlewood inequality, Amer. Math. Soc. 370:6 (2018), 4401–4432.
- 6[6] V. I. Bogachev, E. D. Kosov, I. Nourdin, G. Poly, Two properties of vectors of quadratic forms in Gaussian random variables, Theory Probab. Appl., 59:2 (2015), 208–221.
- 7[7] V. I. Bogachev, E. D. Kosov, S. N. Popova, A new approach to Nikolskii–Besov classes, to appear in Moscow Math. J.
- 8[8] V. I. Bogachev, G. I. Zelenov, On convergence in variation of weakly convergent multidimensional distributions, Doklady Math., 91:2 (2015), 138–141.
