Square functions and the Hamming cube: Duality
Paata Ivanisvili, Fedor Nazarov, Alexander Volberg

TL;DR
This paper establishes a new inequality relating the gradient and function moments on the Hamming cube for 1<p≤2, using duality between Euclidean square functions and Hamming cube gradient estimates.
Contribution
It introduces a novel inequality connecting gradient norms and function moments on the Hamming cube, with a precise constant derived from hypergeometric functions.
Findings
The inequality holds for all functions on the Hamming cube with 1<p≤2.
The constant C(p) is characterized as the smallest positive zero of a confluent hypergeometric function.
The approach reveals a duality between Euclidean square functions and Hamming cube gradient estimates.
Abstract
For , any and any , we obtain where is the smallest positive zero of the confluent hypergeometric function . Our approach is based on a certain duality between the classical square function estimates on the Euclidean space and the gradient estimates on the Hamming cube.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Square functions and the Hamming cube: Duality
Paata Ivanisvili This paper is based upon work supported by the National Science Foundation under Grant No. DMS-1440140 while two of the authors were in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring and Fall 2017 semester.
Fedor Nazarov Supported by NSF DMS-0800243
Alexander Volberg Supported by NSF DMS-1600065
Abstract
For , any and any , we obtain where is the smallest positive zero of the confluent hypergeometric function . Our approach is based on a certain duality between the classical square function estimates on the Euclidean space and the gradient estimates on the Hamming cube.
\dajAUTHORdetails
title = Square functions and the Hamming cube: Duality, author = Paata Ivanisvili, Fedor Nazarov and Alexander Volberg, plaintextauthor = Paata Ivanisvili, Fedor Nazarov, Alexander Volberg, plaintexttitle = Square functions and the Hamming cube: Duality, runningtitle = Square functions and the Hamming cube: Duality, runningauthor = Paata Ivanisvili, Fedor Nazarov, Alexander Volberg, copyrightauthor = P. Ivanisvili, F. Nazarov and A. Volberg, keywords = Square function, Hamming cube, Duality, \dajEDITORdetailsyear=2018, number=1, received=6 July 2017, published=19 January 2018, doi=10.19086/da.3113,
[classification=text]
1 Main result
Consider the Hamming cube of an arbitrary dimension . For any define the discrete partial derivative as follows
[TABLE]
where is obtained from by changing the sign of ’th coordinate of . Set , and we define the norm of the discrete gradient
[TABLE]
where the summation in the last term runs over all neighbor vertices of in . Set
[TABLE]
Theorem 1.1**.**
For any , , and any we have
[TABLE]
Here is the conjugate exponent of , and by we denote the smallest positive zero of the confluent hypergeometric function (see (6) for the definition).
In Lemma A.2 we obtain a lower bound for which is precise when . If for , then becomes the smallest positive zero of the Hermite polynomial where
[TABLE]
The constant in (1) is larger then all previously known bounds [15, 2] when is in a neighborhood of , say . For example, the estimate (1) improves the Naor–Schechtman bound [15] for the class of real valued functions for all . Indeed, it follows from an application of Khinchin inequality with the sharp constant and (1) that we have the following corollary
Corollary 1.2**.**
For any , , and any we have
[TABLE]
where and average in variables and correspondingly.
We will see in Proposition 3.4 that
[TABLE]
The latter implies that the estimate (2) improves the bound of Naor–Schechtman for in the case of real valued functions (see Theorem 1 in [15] where ).
On the other hand degenerates to [math] when which should not be the case for the best possible constant by a result of Talagrand (see Section 3.5). For this endpoint case, when is close to , the result of Ben-Efraim–Lust-Piquard [2] gives the better bounds
[TABLE]
and when it is widely believed that the sharp constant in the left hand side of (3) should be instead of (see Section 3.5 for more details).
We think that the main contribution of the current paper is not just Theorem 1.1 that we obtain but rather a new duality approach that we develop between two different classes of extremal problems: square function estimates on the interval and gradient estimates on the Hamming cube, and Theorem 1.1 should be considered as an example. Roughly speaking one can take a valid estimate for a square function, dualize it by a certain double Legendre transform, and one can write its corresponding dual estimate on the Hamming cube and vice versa. To illustrate another example of our duality approach, in Section 3.4 we present a short proof of the following theorem which improves a well–known inequality of Beckner
Theorem 1.3** (see [11]).**
For any , and any we have
[TABLE]
where denotes the real part, and is understood in the sense of principal brunch in the upper half-plane.
Going back to Theorem 1.1, it will be explained later that in a “dual” sense coincides with the sharp constant found by B. Davis in the norm estimates
[TABLE]
Here is the standard Brownian motion starting at zero, and is any stopping time. It was explained in [8] that the same sharp estimates (4) and (5) hold with replaced by an integrable function on with mean zero, and replaced by the dyadic square function of .
We notice the essential difference between the Davis estimates (4), (5) and (1) that for a given power , we need the “dual” constant in the theorem. Besides, inequality (1) cannot be extended to the full range of exponents with some finite strictly positive constant unlike (4) and (5) (see [8, 4, 6] and (49)).
2 Proof of the main result
2.1 An anonymous Bellman function
In this section we want to define a function that satisfies some special properties. Let and let be the conjugate exponent of . Let
[TABLE]
be the confluent hypergeometric function. satisfies the Hermite differential equation
[TABLE]
with initial conditions and . Let be the smallest positive zero of .
Set
[TABLE]
Clearly is smooth even concave function. The concavity follows from Lemma A.1 and the fact that . Finally we define
[TABLE]
For the first time the function appeared in [8]. Later it was also used in [20, 21] in the form , . It was explained in [8] that satisfies the following properties:
[TABLE]
We should refer to (9) as the obstacle condition, and to (10) as the main inequality. We caution the reader that in [8] one may not find (10) written explicitly but one will find its infinitesimal form
[TABLE]
which follows from the main inequality by expanding it into Taylor’s series with respect to near and comparing the second order terms. Here is defined everywhere except the curve where is only differentiable once.
In fact, the reverse implication also holds, i.e., one can derive (10) from (11) for this special . This was done in the PhD thesis of Wang [21] but we will present a short proof in Section A.2, which partly follows the Davis argument. Essentially the same argument also appeared later in [1] in a slightly different setting.
The function is essential in obtaining the result in the Davis paper, namely it is used in the proof of (4), and the argument goes as follows. First one shows that
[TABLE]
is a supermartingale which is guaranteed by (11). Finally, by the optional stopping theorem,
[TABLE]
which yields (4). One may notice that is the minimal function with properties (9) and (10).
Davis mentions that the proof presented in his paper was suggested by an anonymous referee, and this explains the title of the current section.
2.2 Dualizing the Bellman function and going to the Hamming cube
Set for and . We define
[TABLE]
Lemma 2.1**.**
For each , there exists such that
[TABLE]
and we have
[TABLE]
Proof.
First let us show that for each fixed the function is convex in and concave in . The concavity in follows from Lemma A.1, and the fact that is even and smooth in .
To verify the convexity in , it is enough to show that the map is convex for . Set . Then we have
[TABLE]
Since coincides with up to a positive constant, the convexity follows from Lemma A.1 and the fact that .
Notice that for each the map
[TABLE]
satisfies the assumptions of Theorem A.6 where we take (see Section A.3 in Appendix). Therefore the conclusions of Lemma 2.1 follow from Theorem A.6. ∎
Lemma 2.2**.**
For , any , and any we have
[TABLE]
The reader notices that dualization (12) produces inequality (17) that is different from (10).
Proof.
Set
[TABLE]
Lemma 2.1 gives points and corresponding to and . It follows from (14) that to prove (17) it would be enough to find numbers and such that
[TABLE]
The right choice will be
[TABLE]
but let us explain it in details.
Notice that by Cauchy–Schwarz we have
[TABLE]
provided that . Indeed, we have
[TABLE]
Denoting for , we see that it is enough to find and such that
[TABLE]
By choosing , and substituting the values for we see that it would suffice to find such that
[TABLE]
We will choose . It follows from that we only need to have the inequality
[TABLE]
But this inequality follows from (10).
To verify the obstacle condition (16), notice that (9) for gives
[TABLE]
Finally if , then we obtain
[TABLE]
Equality (*) follows from the fact that
[TABLE]
is an even convex map.
∎
Corollary 2.3**.**
For any , all , and any , we have
[TABLE]
Proof.
It follows from the definition of that the map is decreasing in for . Therefore by (17) and the triangle inequality we obtain
[TABLE]
∎
The inequality (20) gives rise to the estimate
[TABLE]
Indeed, the reader can find in [11] the passage from (20) to (21). In fact, inequality (20) is the same as
[TABLE]
where takes the average in the coordinate , i.e.,
[TABLE]
The rest follows by iterating (22), the fact that and .
2.3 The proof of Theorem 1.1
We have
[TABLE]
and this gives inequality (1).
3 Remarks and Applications
3.1 Going from to : from Square function to the Hamming cube
Let be an integrable function on . Let denote all dyadic intervals in . Consider the dyadic martingale defined as follows
[TABLE]
where . The square function is defined as follows
[TABLE]
For convenience we always assume that the number of nonzero terms in (23) is finite so that makes sense. Let be a continuous real valued function, and suppose one wants to estimate the quantity from above in terms of . If one finds a function
[TABLE]
then one obtains (see [20]) the bound
[TABLE]
Conversely, suppose that the inequality
[TABLE]
holds for all integrable functions on and some . Then there exists such that the conditions (24), (25) are satisfied and . Indeed, consider the extremal problem
[TABLE]
This satisfies (24) (take constant), and, in fact, it satisfies (25). The latter fact can be proved by using the standard Bellman principle (see Chapter 8, [17], and survey [16]). Besides,
[TABLE]
because of (27). Therefore there is one to one correspondence between the extremal problems for the square function of the form (27) and the functions with the properties (24) and (25).
The gradient estimates on the Hamming cube are more subtle. Take any real valued and suppose that we want to estimate from above in terms of for any and for all . If one finds such that
[TABLE]
then111We do also need to assume that is decreasing in for each fixed to ensure Corollary 2.3. But if is smooth then is guaranteed by (29). Indeed, if we take in (29) we obtain that is concave for each . Next, taking and sending , we obtain by Taylor’s formula that . Therefore . one can obtain the estimate (see [11])
[TABLE]
Thus finding such is sufficient to obtain the estimate but it is unclear whether conditions (28) and (29) are necessary to obtain the bound . In other words we do not know what is the corresponding extremal problem for , i.e., what is the right Bellman function . The reason lies in the fact that there is an essential difference between the Hamming cube and the dyadic intervals, i.e., test functions do not concatenate in a good way on as it happens for dyadic martingales.
Now we formulate an abstract theorem that formalizes our duality principle in a general setting.
Theorem 3.1**.**
Let be convex sets. Take an arbitrary , and let satisfy properties (24) and (25). Assume that for each , we have
[TABLE]
Then and defined as
[TABLE]
satisfy (29) and (28), and, thereby, (30) for any and any .
One may think that finding with the property (25) is a difficult problem. Let us make a quick remark here that if it happens that is convex for each fixed , then (25) is automatically implied by its infinitesimal form, i.e., by (see the proof of Lemma A.4).
Proof.
The proof essentially repeats the proof of Lemma 2.2. Let us sketch the argument. Define . The existence of a saddle point with properties (13) and (14) is guaranteed by Lemma A.5. The convexity of the set allows us to choose from , and according to (18). The rest of the proof of the theorem is the same as in Lemma 2.2. Inequality (32) follows from (24). Convexity of is needed, for example, to ensure that if , then , so that (30) makes sense. ∎
3.2 Going from to : from Hamming cube to square function
Another interesting observation is that equality (31) was lurking in a solution of a certain Monge–Ampère equation. For example, taking in (29), and using the Taylor’s series expansion (assuming that is smooth enough) one obtains
[TABLE]
When looking for the least function with and (33), it is reasonable to assume that condition (33) should degenerate except, possibly, on the set where coincides with its obstacle . The degeneracy of (33) means that the determinant of the matrix in (33) is zero. This is a general Monge–Ampère type equation and, after an application of the exterior differential systems of Bryant–Griffiths (see [12]), we obtain that the solutions can be locally characterized as follows:
[TABLE]
where satisfies the equation
[TABLE]
In [12] we used instead of , in which case (35) becomes just the backward heat equation for . We will not formulate a formal statement but we do make a remark that such a reasoning allows us to guess the dual of , i.e., to find given . The way this guess works will be illustrated in Section 3.4.
Our final remark is that one may try to use with because (29) clearly implies (25). It will definitely give some estimate for the square function but not the sharp one. Indeed, for the sharp estimates, condition (25) for usually degenerates, namely (35) holds. On the other hand, if and (33) holds, then , and
[TABLE]
for some constants . This family of functions corresponds to the trivial inequality . Analogously, the best possible function satisfying (24) and (25) will almost never satisfy (33) except for a very particular case when .
3.3 The dual to Log-Sobolev is Chang–Wilson–Wolff
The function satisfies (33) and, therefore, it gives the log-Sobolev inequality [12]. Its dual in the sense of (34) is (see Section 3.1.1 in [12] where ). Notice that for this , inequality (25) simplifies to
[TABLE]
which is true since for . Therefore we obtain
Corollary 3.2**.**
For any integrable on , we have
[TABLE]
This corollary immediately recovers the result of Chang-Wilson-Wolf [7] well-known to probabilists, namely for any with and , we have
[TABLE]
Next, repeating a standard argument, namely, considering and applying Chebyshev’s inequality (see Theorem 3.1 in [7]), one obtains the superexponential bound
[TABLE]
for any .
We should remind that the log-Sobolev inequality via the Herbst argument [13] gives Gaussian concentration inequalities, namely,
[TABLE]
for any and any smooth with . Here is the standard Gaussian measure on .
In other words we just illustrated that estimates (39) and (38) are dual to each other in the sense of duality between functions and .
3.4 Poincaré inequality 3/2: a simple proof via duality
It was proved in [11] that for any , we have
[TABLE]
where is taken in the sense of the principal brunch in the upper half-plane. Inequality (40) improves Beckner’s bound for a particular exponent [11]. Consider
[TABLE]
It was explained in [11] that to prove (40) it is enough to check that satisfies (29), and the latter fact involved careful investigation of the roots of several very high degree polynomials with integer coefficients. Let us give a simple proof of (29) using our duality technique.
Proposition 3.3**.**
The function satisfies (29) for all and .
Proof.
is a solution of the homogeneous Monge–Ampère equation (33), and therefore it has a representation of the form (34) (see Section 3.1.4 in [12]):
[TABLE]
This leads us to the following guess
[TABLE]
which can be directly checked. Using Theorem A.6 with , and following the proof of Lemma 2.2, it is enough to check that satisfies (25). Notice that (25) is an identity for . This finishes the proof of the proposition. ∎
3.5 Sobolev inequalities
3.5.1 The Hamming cube
For , let be the best possible constant such that
[TABLE]
Our theorem implies that for . Notice that when , we have , and (41) recovers the classical Poincaré inequality. When the constant tends to zero which should not be the case for . Indeed, it follows from a deep result of Talagrand [19] that if is the best possible constant in the following estimate
[TABLE]
then for all . Now notice that , and for . When , by example (49), we must have unlike the fact that for . So one may wonder whether the positivity of may not imply the positivity of on the interval . Let us mention that this is not the case, in fact for . Indeed, it will suffice to prove that . If this is obvious. Assume . Next, we show a simple inequality
[TABLE]
Plugging , and taking the expectation, we obtain . To verify (43), without loss of generality assume that (otherwise the inequality is trivial). Consider . Its second derivative changes signs at points which satisfy the equation , i.e., when . The right hand side of (43) represents the tangent line to the graph of at the point . Clearly is convex on . Therefore (43) is true on this interval. Next, is concave on and since , we have on because . Thus (43) is true for . For , by Bernoulli we have
[TABLE]
To the best of our knowledge, the constants are unknown for . There is a remarkable result of Ben-Efraim–Lust-Piquard [2] that for .
This, combined to our theorem, gives the lower bound for . However, due to the inequalities of Bobkov–Götze and Maurey–Pisier (see the next section), it is widely believed that .
An elegant idea of Naor–Schechtman [15] based on Burkholder’s inequality [5] gives an estimate
[TABLE]
Let us show that our bound (2) obtained in Corollary 1.2 is better.
Proposition 3.4**.**
For all we have
[TABLE]
Proof.
We estimate from below by (see Lemma A.2). Using we see that to prove (44) it is enough to show the following two inequalities
[TABLE]
where is the solution of the equation on the interval . Inequality (45) simplifies to which is true because the left hand side is concave, and the right hand side is convex on . To show (46) it is enough to verify that
[TABLE]
The latter inequality we rewrite as follows
[TABLE]
Since the Trigamma function is convex
[TABLE]
we estimate from below by its tangent line at point , i.e.,
[TABLE]
here is Euler’s constant. It is enough to show that
[TABLE]
The left hand side is concave on the interval , and at the endpoint cases we have the inequality. Notice that , and this finishes the proof. ∎
3.5.2 Gaussian measure on
The application of the Central Limit Theorem to (1) gives a dimension independent Sobolev inequality.
Corollary 3.5**.**
For any smooth bounded and any , we have
[TABLE]
The best possible constant in (47), unlike , should not degenerate when . Indeed, (see [14], pp. 115) one has
[TABLE]
where the constant is the best possible in the left hand side of (48). We should mention that estimate (48) can be also easily obtained by a remarkable trick of Maurey–Pisier [18].
Notice that (47) cannot be extended for the range of exponents with some positive constant instead of . Indeed, assume the contrary. Consider and take . Using Jensen’s inequality, we obtain
[TABLE]
Therefore, taking , we obtain the contradiction with for .
3.6 Discrete surface measure
Let be a subset of the Hamming cube with cardinality . Define so that is the number of boundary edges to containing , i.e., counts the number of edges with one endpoint in and another one in the complement of such that one of the endpoints is . Clearly if is in the “strict interior” of , or in the “strict complement” of , and it is nonzero if and only if is on the “boundary” of . Notice that can be nonzero for some . The function maybe be understood as a discrete surface measure of the boundary of . Consider the following quantity
[TABLE]
It follows from Harper’s edge-isoperimetric inequality [10] that and the value is attained on the halfcube. The monotonicity of in implies that for all . Also notice that considering Hamming balls, one can easily show that for . Therefore the first nontrivial value is . In this case it follows from Bobkov’s inequality (see [3] and references therein) that , and by monotonicity we obtain that which is definitely not sharp when .
Define as follows: if and if . Clearly . Applying (1) to , we obtain
[TABLE]
Inequality (51) gives the lower bound which tends to as , but fails to be sharp when . Thus combining this result with Bobkov’s inequality we obtain the bound
[TABLE]
Appendix A Appendix
A.1 Properties of
Lemma A.1**.**
For any , we have . In addition is decreasing in , and on for .
Proof.
Consider . Notice that the zeros of and are the same. It follows from (7) that
[TABLE]
Besides we know that the solution is even. Consider the critical case . In this case and the smallest positive zero is . Therefore it follows from the Sturm comparison principle that for (see below). Moreover, the same principle applied to and with implies that has a zero inside the interval . Thus we conclude that is decreasing in .
To verify that on , first we claim that
[TABLE]
for . Indeed the proof works in the same way as the proof of Sturm’s comparison principle. For the convenience of the reader we decided to include the argument. As before, consider . It is enough to show that on . It follows from (53) that . Therefore, using the Taylor series expansion at the point [math], we see that the claim is true at some neighbourhood of zero, say with sufficiently small. Next we assume the contrary, i.e., that there is a point such that on , and (notice that the case , by the uniqueness theorem for ODEs, would imply that everywhere, which is impossible). Consider the Wronskian
[TABLE]
We have and . On the other hand, we have
[TABLE]
which is a clear contradiction, and this proves the claim.
It follows from (6) that
[TABLE]
and inequalities on imply that on . Since and on , we must have on . ∎
Lemma A.2**.**
We have for all .
Proof.
Notice that satisfies (53) with . Now consider . The function satisfies the equation
[TABLE]
and , . Notice that on . Since
[TABLE]
it follows from the Sturm comparison principle (see the previous discussions) that on . Thus we obtain that . ∎
A.2 Heat inequality
Let be defined as in (8).
Lemma A.3**.**
For any , the map
[TABLE]
is convex.
Proof.
Without loss of generality, assume that . We recall that . Since , the only interesting case to consider is when (otherwise is convex). In this case we have up to a positive constant which we are going to ignore, and, therefore, by (7) we have . Using (54), we obtain
[TABLE]
Therefore it would be enough to show that for any , the function is decreasing for . Differentiating, and using (7) again, we obtain
[TABLE]
which is nonpositive by Lemma A.1. ∎
The next lemma, together with Lemma A.3 and (11), implies that satisfies (10).
Lemma A.4** (Barthe–Mauery [1]).**
Let be a convex subset of , and let be such that
[TABLE]
Then for all with and , we have
[TABLE]
The lemma says that the global discrete inequality (58) is in fact implied by its infinitesimal form (56) under the extra condition (57).
Proof.
The argument is borrowed from [1]. The similar argument was used by Davis [8] in obtaining sharp square function estimates from the ones for the Brownian motion.
Without loss of generality assume . Consider the process
[TABLE]
Here is the standard Brownian motion starting at zero. It follows from Ito’s formula together with (56) that is a supermartingale. Let be the stopping time
[TABLE]
It follows from the optional stopping theorem that
[TABLE]
Notice that we have used , , and the fact that the map is convex together with Jensen’s inequality. ∎
A.3 Minimax theorem for noncompact sets
Let be nonempty closed convex sets in . We say that a pair is a saddle point of on if
[TABLE]
Lemma A.5**.**
The function defined on with real values possesses a saddle point on if and only if
[TABLE]
and this number is then equal to .
For the proof we refer the reader to Proposition 1.2 in [9], pp. 167.
Theorem A.6**.**
Suppose that is continuous, concave in , convex in , and there exists such that
[TABLE]
Then possesses at least one saddle point on and
[TABLE]
The theorem is Proposition 2.2 in [9], pp. 173.
Acknowledgments
We are very grateful to several people for discussions and suggestions that led us to noticing the duality between the Hamming cube and the square function: G. Aubrun for valuable remarks on optimizers in (50); D. Bilyk for providing the reference to sharp constants for Square functions [20]; R. O’Donell for providing the references; R. Latała for pointing out example (49); S. Petermichl for bringing our attention to Bellman functions in Square function estimates and Poincaré inequalities for the Gaussian measure; S. Treil for attracting our attention to Chang–Wilson–Wolff’s superexponential bound (Corollary 38) and its similarity to the Gaussian concentration inequality; R. van Handel for references, including (3) and (48), and making several important remarks. We thank an anonymous referee for helpful comments and remarks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Barthe, B. Maurey , Some remarks on isoperimetry of Gaussian type , Annales de l’Institut Henri Poincare (B) Probability and Statistics, Vol. 36, Iss. 4, pp. 419–434
- 2[2] L. Ben-Efraim, F. Lust-Piquard , Poincaré type inequalities on the discrete cube and in the CAR algebra , Probability Theory and Related Fields, Vol. 141, Iss. 3–4, pp. 569–602 (2008)
- 3[3] S. G. Bobkov, F. Götze , Discrete isoperimetric and Poincaré-type inequalities , Probab. Theory Relat. Fields 114, 245–277 (1999)
- 4[4] D. Burkholder , Sharp inequalities for martingales and stochastic integrals , Colloque Paul Lévy (Palaiseau, 1987), Ast’erisque 157-158 (1988), 75–94
- 5[5] D. L. Burkholder , Boundary Value Problems and Sharp Inequalities for Martingale Transforms , Ann. Probab., Vol. 12, No. 3, (1984), 647–702
- 6[6] D. L. Burkholder, R. F. Gundy , Extrapolation and interpolation of quasi-linear operators on martingales , Acta. Math., Vol. 124 (1970), 249–304
- 7[7] A. Chang, J. M. Wilson, Th. Wolff , Some weighted norm inequalities concerning the Schrödinger operators , Comment. Math. Helvetici, Vol. 60, 1985, 217–246
- 8[8] B. Davis , On the L p superscript 𝐿 𝑝 L^{p} norms of stochastic integrals and other martingales , Duke Math. J. Vol. 43, pp. 697–704 (1976)
