Square functions and the Hamming cube: Duality

Paata Ivanisvili; Fedor Nazarov; Alexander Volberg

arXiv:1706.01930·math.AP·January 19, 2018

Square functions and the Hamming cube: Duality

Paata Ivanisvili, Fedor Nazarov, Alexander Volberg

PDF

TL;DR

This paper establishes a new inequality relating the gradient and function moments on the Hamming cube for 1<p≤2, using duality between Euclidean square functions and Hamming cube gradient estimates.

Contribution

It introduces a novel inequality connecting gradient norms and function moments on the Hamming cube, with a precise constant derived from hypergeometric functions.

Findings

01

The inequality holds for all functions on the Hamming cube with 1<p≤2.

02

The constant C(p) is characterized as the smallest positive zero of a confluent hypergeometric function.

03

The approach reveals a duality between Euclidean square functions and Hamming cube gradient estimates.

Abstract

For $1 < p \leq 2$ , any $n \geq 1$ and any $f : {- 1, 1}^{n} \to R$ , we obtain $(E ∣\nabla f ∣^{p})^{1/ p} \geq C (p) (E ∣ f ∣^{p} - ∣ E f ∣^{p})^{1/ p}$ where $C (p)$ is the smallest positive zero of the confluent hypergeometric function $_{1} F_{1} (\frac{p}{2 ( 1 - p )}, \frac{1}{2}, \frac{x ^{2}}{2})$ . Our approach is based on a certain duality between the classical square function estimates on the Euclidean space and the gradient estimates on the Hamming cube.

Equations226

\partial_{j} f (x) = \frac{f ( x ) - f ( S _{j} ( x ))}{2}, x = (x_{1}, \dots, x_{n}) \in {- 1, 1}^{n},

\partial_{j} f (x) = \frac{f ( x ) - f ( S _{j} ( x ))}{2}, x = (x_{1}, \dots, x_{n}) \in {- 1, 1}^{n},

∣\nabla f ∣^{2} (x) := j = 1 \sum n (\partial_{j} f (x))^{2} = y \sim x \sum (\frac{f ( x ) - f ( y )}{2})^{2},

∣\nabla f ∣^{2} (x) := j = 1 \sum n (\partial_{j} f (x))^{2} = y \sim x \sum (\frac{f ( x ) - f ( y )}{2})^{2},

E f = \frac{1}{2 ^{n}} x \in {- 1, 1}^{n} \sum f (x) .

E f = \frac{1}{2 ^{n}} x \in {- 1, 1}^{n} \sum f (x) .

s_{p^{'}} (E ∣ f ∣^{p} - ∣ E f ∣^{p})^{1/ p} \leq (E ∣\nabla f ∣^{p})^{1/ p} .

s_{p^{'}} (E ∣ f ∣^{p} - ∣ E f ∣^{p})^{1/ p} \leq (E ∣\nabla f ∣^{p})^{1/ p} .

H_{m} (x) = \int_{R} (x + i y)^{m} \frac{e ^{- y^{2} /2}}{2 π} d y .

H_{m} (x) = \int_{R} (x + i y)^{m} \frac{e ^{- y^{2} /2}}{2 π} d y .

s_{p^{'}}^{p} 2^{\frac{p - 2}{2}} min (1, \frac{Γ (( p + 1 ) /2 )}{Γ ( 3/2 )}) E ∣ f - E f ∣^{p} \leq E_{x} E_{x^{'}} j = 1 \sum n x_{j}^{'} \partial_{j} f (x)^{p},

s_{p^{'}}^{p} 2^{\frac{p - 2}{2}} min (1, \frac{Γ (( p + 1 ) /2 )}{Γ ( 3/2 )}) E ∣ f - E f ∣^{p} \leq E_{x} E_{x^{'}} j = 1 \sum n x_{j}^{'} \partial_{j} f (x)^{p},

s_{p^{'}}^{p} 2^{\frac{p - 2}{2}} min (1, \frac{Γ (( p + 1 ) /2 )}{Γ ( 3/2 )}) > (p - 1)^{p} for 1 < p < 2.

s_{p^{'}}^{p} 2^{\frac{p - 2}{2}} min (1, \frac{Γ (( p + 1 ) /2 )}{Γ ( 3/2 )}) > (p - 1)^{p} for 1 < p < 2.

\frac{2}{π} (E ∣ f - E f ∣^{p})^{1/ p} \leq (E ∣\nabla f ∣^{p})^{1/ p} 1 \leq p \leq 2,

\frac{2}{π} (E ∣ f - E f ∣^{p})^{1/ p} \leq (E ∣\nabla f ∣^{p})^{1/ p} 1 \leq p \leq 2,

E ℜ (f + i ∣\nabla f ∣)^{3/2} \leq (ℜ E f)^{3/2},

E ℜ (f + i ∣\nabla f ∣)^{3/2} \leq (ℜ E f)^{3/2},

s_{q} ∥ T^{1/2} ∥_{q} \leq ∥ B_{T} ∥_{q}, q \geq 2, ∥ T^{1/2} ∥_{q} < \infty;

s_{q} ∥ T^{1/2} ∥_{q} \leq ∥ B_{T} ∥_{q}, q \geq 2, ∥ T^{1/2} ∥_{q} < \infty;

∥ B_{T} ∥_{p} \leq s_{p} ∥ T^{1/2} ∥_{p}, 0 < p \leq 2.

N_{α} (x) :=_{1} F_{1} (- \frac{α}{2}, \frac{1}{2}, \frac{x ^{2}}{2}) = m = 0 \sum \infty \frac{( - 2 x ^{2} ) ^{m}}{( 2 m )!} \frac{α}{2} (\frac{α}{2} - 1) \dots (\frac{α}{2} - m + 1) = 1 - x^{2} \frac{α}{2} + ...

N_{α} (x) :=_{1} F_{1} (- \frac{α}{2}, \frac{1}{2}, \frac{x ^{2}}{2}) = m = 0 \sum \infty \frac{( - 2 x ^{2} ) ^{m}}{( 2 m )!} \frac{α}{2} (\frac{α}{2} - 1) \dots (\frac{α}{2} - m + 1) = 1 - x^{2} \frac{α}{2} + ...

N_{α}^{''} (x) - x N_{α}^{'} (x) + α N_{α} (x) = 0 for x \in R

N_{α}^{''} (x) - x N_{α}^{'} (x) + α N_{α} (x) = 0 for x \in R

u_{α} (x) := ⎩ ⎨ ⎧ - \frac{α s _{α}^{α - 1}}{N _{α}^{'} ( s _{α} )} N_{α} (x), s_{α}^{α} - ∣ x ∣^{α}, 0 \leq ∣ x ∣ \leq s_{α}; s_{α} \leq ∣ x ∣.

u_{α} (x) := ⎩ ⎨ ⎧ - \frac{α s _{α}^{α - 1}}{N _{α}^{'} ( s _{α} )} N_{α} (x), s_{α}^{α} - ∣ x ∣^{α}, 0 \leq ∣ x ∣ \leq s_{α}; s_{α} \leq ∣ x ∣.

U (p, q) := ∣ q ∣^{α} u_{α} (\frac{p}{∣ q ∣}) with U (p, 0) = - ∣ p ∣^{α} .

U (p, q) := ∣ q ∣^{α} u_{α} (\frac{p}{∣ q ∣}) with U (p, 0) = - ∣ p ∣^{α} .

U (p, q) \geq ∣ q ∣^{α} s_{α}^{α} - ∣ p ∣^{α} for all (p, q) \in R^{2}, and when q = 0, the equality holds;

U (p, q) \geq ∣ q ∣^{α} s_{α}^{α} - ∣ p ∣^{α} for all (p, q) \in R^{2}, and when q = 0, the equality holds;

2 U (p, q) \geq U (p + a, a^{2} + q^{2}) + U (p - a, a^{2} + q^{2}) for all (p, q, a) \in R^{3} .

u_{t} + \frac{u _{pp}}{2} \leq 0 for u (p, t) = U (p, t),

u_{t} + \frac{u _{pp}}{2} \leq 0 for u (p, t) = U (p, t),

X_{t} = U (B_{t}, t) for t \geq 0

X_{t} = U (B_{t}, t) for t \geq 0

E (T^{\frac{α}{2}} s_{α}^{α} - ∣ B_{T} ∣^{α}) \leq (\ref o b s t a c l e) E U (B_{T}, T) \leq U (0, 0) = 0,

E (T^{\frac{α}{2}} s_{α}^{α} - ∣ B_{T} ∣^{α}) \leq (\ref o b s t a c l e) E U (B_{T}, T) \leq U (0, 0) = 0,

M (x, y) = q \leq 0 min p \in R sup Ψ (p, q, x, y) for x \in R, y \geq 0.

M (x, y) = q \leq 0 min p \in R sup Ψ (p, q, x, y) for x \in R, y \geq 0.

q \leq 0 min p \in R sup Ψ (p, q, x, y) = p \in R max q \leq 0 in f Ψ (p, q, x, y) = Ψ (p^{*}, q^{*}, x, y)

q \leq 0 min p \in R sup Ψ (p, q, x, y) = p \in R max q \leq 0 in f Ψ (p, q, x, y) = Ψ (p^{*}, q^{*}, x, y)

Ψ (p, q^{*}, x, y) \leq Ψ (p^{*}, q^{*}, x, y) \leq Ψ (p^{*}, q, x, y) for all (p, q) \in R \times R_{-} .

Ψ (p, q^{*}, x, y) \leq Ψ (p^{*}, q^{*}, x, y) \leq Ψ (p^{*}, q, x, y) for all (p, q) \in R \times R_{-} .

U_{q q} = ∣ q ∣^{α - 2} [α (α - 1) u_{α} (z) - 2 (α - 1) z u_{α}^{'} (z) + z^{2} u_{α}^{''} (z)] = (\ref h er mi t)

U_{q q} = ∣ q ∣^{α - 2} [α (α - 1) u_{α} (z) - 2 (α - 1) z u_{α}^{'} (z) + z^{2} u_{α}^{''} (z)] = (\ref h er mi t)

∣ q ∣^{α - 2} [- (α - 1) z u_{α}^{'} (z) + (z^{2} - α + 1) u_{α}^{''} (z)] .

(p, q) \mapsto p x + q y + ∣ q ∣^{α} u_{α} (\frac{p}{∣ q ∣})

(p, q) \mapsto p x + q y + ∣ q ∣^{α} u_{α} (\frac{p}{∣ q ∣})

M (x, y) \geq (\frac{α - 1}{α ^{β}}) (∣ x ∣^{β} - \frac{y ^{β}}{s _{α}^{β}}) and when y = 0 the equality holds;

M (x, y) \geq (\frac{α - 1}{α ^{β}}) (∣ x ∣^{β} - \frac{y ^{β}}{s _{α}^{β}}) and when y = 0 the equality holds;

2 M (x, y) \geq M (x + a, a^{2} + (y + b)^{2}) + M (x - a, a^{2} + (y - b)^{2}) .

(x_{\pm}, y_{\pm}) := (x \pm a, a^{2} + (y \pm b)^{2}) .

(x_{\pm}, y_{\pm}) := (x \pm a, a^{2} + (y \pm b)^{2}) .

2Ψ (p, q^{*}, x, y) \geq Ψ (p^{+}, q_{1}, x_{+}, y_{+}) + Ψ (p^{-}, q_{2}, x_{-}, y_{-}) .

2Ψ (p, q^{*}, x, y) \geq Ψ (p^{+}, q_{1}, x_{+}, y_{+}) + Ψ (p^{-}, q_{2}, x_{-}, y_{-}) .

p = \frac{p ^{+} + p ^{-}}{2} and q_{1} = q_{2} = - (\frac{p ^{+} - p ^{-}}{2})^{2} + (q^{*})^{2},

p = \frac{p ^{+} + p ^{-}}{2} and q_{1} = q_{2} = - (\frac{p ^{+} - p ^{-}}{2})^{2} + (q^{*})^{2},

q_{1} a^{2} + (y + b)^{2} + q_{2} a^{2} + (y - b)^{2} - 2 q^{*} y \leq - ∣ a ∣ (q_{1}^{2} - (q^{*})^{2} + q_{2}^{2} - (q^{*})^{2})

q_{1} a^{2} + (y + b)^{2} + q_{2} a^{2} + (y - b)^{2} - 2 q^{*} y \leq - ∣ a ∣ (q_{1}^{2} - (q^{*})^{2} + q_{2}^{2} - (q^{*})^{2})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Square functions and the Hamming cube: Duality

Paata Ivanisvili This paper is based upon work supported by the National Science Foundation under Grant No. DMS-1440140 while two of the authors were in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring and Fall 2017 semester.

Fedor Nazarov Supported by NSF DMS-0800243

Alexander Volberg Supported by NSF DMS-1600065

Abstract

For $1<p\leq 2$ , any $n\geq 1$ and any $f:\{-1,1\}^{n}\to\mathbb{R}$ , we obtain $(\mathbb{E}|\nabla f|^{p})^{1/p}\geq c(p)(\mathbb{E}|f|^{p}-|\mathbb{E}f|^{p})^{1/p}$ where $c(p)$ is the smallest positive zero of the confluent hypergeometric function ${}_{1}F_{1}(\frac{p}{2(1-p)},\frac{1}{2},\frac{x^{2}}{2})$ . Our approach is based on a certain duality between the classical square function estimates on the Euclidean space and the gradient estimates on the Hamming cube.

\dajAUTHORdetails

title = Square functions and the Hamming cube: Duality, author = Paata Ivanisvili, Fedor Nazarov and Alexander Volberg, plaintextauthor = Paata Ivanisvili, Fedor Nazarov, Alexander Volberg, plaintexttitle = Square functions and the Hamming cube: Duality, runningtitle = Square functions and the Hamming cube: Duality, runningauthor = Paata Ivanisvili, Fedor Nazarov, Alexander Volberg, copyrightauthor = P. Ivanisvili, F. Nazarov and A. Volberg, keywords = Square function, Hamming cube, Duality, \dajEDITORdetailsyear=2018, number=1, received=6 July 2017, published=19 January 2018, doi=10.19086/da.3113,

[classification=text]

1 Main result

Consider the Hamming cube $\{-1,1\}^{n}$ of an arbitrary dimension $n\geq 1$ . For any $f:\{-1,1\}^{n}\to\mathbb{R}$ define the discrete partial derivative $\partial_{j}f(x)$ as follows

[TABLE]

where $S_{j}(x)$ is obtained from $x$ by changing the sign of $j$ ’th coordinate of $x$ . Set $\nabla f(x):=(\partial_{1}f(x),\ldots,\partial_{n}f(x))$ , and we define the norm of the discrete gradient

[TABLE]

where the summation in the last term runs over all neighbor vertices of $x$ in $\{-1,1\}^{n}$ . Set

[TABLE]

Theorem 1.1.

For any $1<p\leq 2$ , $n\geq 1$ , and any $f:\{-1,1\}^{n}\to\mathbb{R}$ we have

[TABLE]

Here $p^{\prime}=\frac{p}{p-1}$ is the conjugate exponent of $p$ , and by $s_{q}$ we denote the smallest positive zero of the confluent hypergeometric function ${}_{1}F_{1}(-\frac{q}{2},\frac{1}{2},\frac{x^{2}}{2})$ (see (6) for the definition).

In Lemma A.2 we obtain a lower bound $s_{p^{\prime}}\geq\sqrt{2/p^{\prime}}$ for $1<p\leq 2$ which is precise when $p\to 2$ . If $p^{\prime}=2k$ for $k\in\mathbb{N}$ , then $s_{p^{\prime}}$ becomes the smallest positive zero of the Hermite polynomial $H_{2k}(x)$ where

[TABLE]

The constant $s_{p^{\prime}}$ in (1) is larger then all previously known bounds [15, 2] when $p$ is in a neighborhood of $2$ , say $p\in(1.26,2)$ . For example, the estimate (1) improves the Naor–Schechtman bound [15] for the class of real valued functions for all $1<p<2$ . Indeed, it follows from an application of Khinchin inequality with the sharp constant and (1) that we have the following corollary

Corollary 1.2.

For any $1<p\leq 2$ , $n\geq 1$ , and any $f:\{-1,1\}^{n}\to\mathbb{R}$ we have

[TABLE]

where $\mathbb{E}_{x}$ and $\mathbb{E}_{x^{\prime}}$ average in variables $x$ and $x^{\prime}=(x^{\prime}_{1},\ldots,x^{\prime}_{n})\in\{-1,1\}^{n}$ correspondingly.

We will see in Proposition 3.4 that

[TABLE]

The latter implies that the estimate (2) improves the bound of Naor–Schechtman for $1<p<2$ in the case of real valued functions (see Theorem 1 in [15] where $\beta_{p}(\mathbb{R})=1/(p-1)$ ).

On the other hand $s_{p^{\prime}}$ degenerates to [math] when $p\to 1+$ which should not be the case for the best possible constant by a result of Talagrand (see Section 3.5). For this endpoint case, when $p$ is close to $1$ , the result of Ben-Efraim–Lust-Piquard [2] gives the better bounds

[TABLE]

and when $p=1$ it is widely believed that the sharp constant in the left hand side of (3) should be $\sqrt{2/\pi}$ instead of $2/\pi$ (see Section 3.5 for more details).

We think that the main contribution of the current paper is not just Theorem 1.1 that we obtain but rather a new duality approach that we develop between two different classes of extremal problems: square function estimates on the interval $[0,1]$ and gradient estimates on the Hamming cube, and Theorem 1.1 should be considered as an example. Roughly speaking one can take a valid estimate for a square function, dualize it by a certain double Legendre transform, and one can write its corresponding dual estimate on the Hamming cube and vice versa. To illustrate another example of our duality approach, in Section 3.4 we present a short proof of the following theorem which improves a well–known inequality of Beckner

Theorem 1.3 (see [11]).

For any $n\geq 1$ , and any $f:\{-1,1\}^{n}\to\mathbb{R}$ we have

[TABLE]

where $\Re$ denotes the real part, and $z^{3/2}$ is understood in the sense of principal brunch in the upper half-plane.

Going back to Theorem 1.1, it will be explained later that $s_{p^{\prime}}$ in a “dual” sense coincides with the sharp constant found by B. Davis in the $L^{q}$ norm estimates

[TABLE]

Here $B_{t}$ is the standard Brownian motion starting at zero, and $T$ is any stopping time. It was explained in [8] that the same sharp estimates (4) and (5) hold with $B_{T}$ replaced by an integrable function $g$ on $[0,1]$ with mean zero, and $T^{1/2}$ replaced by the dyadic square function of $g$ .

We notice the essential difference between the Davis estimates (4), (5) and (1) that for a given power $p,1<p\leq 2$ , we need the “dual” constant $s_{p^{\prime}}=s_{\frac{p}{p-1}}$ in the theorem. Besides, inequality (1) cannot be extended to the full range of exponents $p$ with some finite strictly positive constant $c(p)$ unlike (4) and (5) (see [8, 4, 6] and (49)).

2 Proof of the main result

2.1 An anonymous Bellman function

In this section we want to define a function $U:\mathbb{R}^{2}\to\mathbb{R}$ that satisfies some special properties. Let $\alpha\geq 2$ and let $\beta=\frac{\alpha}{\alpha-1}\leq 2$ be the conjugate exponent of $\alpha$ . Let

[TABLE]

be the confluent hypergeometric function. $N_{\alpha}(x)$ satisfies the Hermite differential equation

[TABLE]

with initial conditions $N_{\alpha}(0)=1$ and $N^{\prime}_{\alpha}(0)=0$ . Let $s_{\alpha}$ be the smallest positive zero of $N_{\alpha}$ .

Set

[TABLE]

Clearly $u_{\alpha}(x)$ is $C^{1}(\mathbb{R})\cap C^{2}(\mathbb{R}\setminus{\{s_{\alpha}\}})$ smooth even concave function. The concavity follows from Lemma A.1 and the fact that $N^{\prime}_{\alpha}(s_{\alpha})<0$ . Finally we define

[TABLE]

For the first time the function $U(p,q)$ appeared in [8]. Later it was also used in [20, 21] in the form $\widetilde{u}(p,t)=U(p,\sqrt{t})$ , $t\geq 0$ . It was explained in [8] that $U(p,q)$ satisfies the following properties:

[TABLE]

We should refer to (9) as the obstacle condition, and to (10) as the main inequality. We caution the reader that in [8] one may not find (10) written explicitly but one will find its infinitesimal form

[TABLE]

which follows from the main inequality by expanding it into Taylor’s series with respect to $a$ near $a=0$ and comparing the second order terms. Here $\widetilde{u}_{pp}$ is defined everywhere except the curve $|p/\sqrt{t}|=s_{\alpha}$ where $\widetilde{u}$ is only differentiable once.

In fact, the reverse implication also holds, i.e., one can derive (10) from (11) for this special $U$ . This was done in the PhD thesis of Wang [21] but we will present a short proof in Section A.2, which partly follows the Davis argument. Essentially the same argument also appeared later in [1] in a slightly different setting.

The function $U(p,q)$ is essential in obtaining the result in the Davis paper, namely it is used in the proof of (4), and the argument goes as follows. First one shows that

[TABLE]

is a supermartingale which is guaranteed by (11). Finally, by the optional stopping theorem,

[TABLE]

which yields (4). One may notice that $U(p,q)$ is the minimal function with properties (9) and (10).

Davis mentions that the proof presented in his paper was suggested by an anonymous referee, and this explains the title of the current section.

2.2 Dualizing the Bellman function $U(p,q)$ and going to the Hamming cube

Set $\Psi(p,q,x,y):=px+qy+U(p,q)$ for $x\in\mathbb{R}$ and $y\geq 0$ . We define

[TABLE]

Lemma 2.1.

For each $(x,y)\in\mathbb{R}\times\mathbb{R}_{+}$ , there exists $(p^{*},q^{*})=(p^{*}(x,y),q^{*}(x,y))$ such that

[TABLE]

and we have

[TABLE]

Proof.

First let us show that for each fixed $(x,y)$ the function $\Psi(p,q,x,y)$ is convex in $q$ and concave in $p$ . The concavity in $p$ follows from Lemma A.1, and the fact that $U$ is even and $C^{1}$ smooth in $p$ .

To verify the convexity in $q$ , it is enough to show that the map $q\mapsto U(p,q)$ is convex for $|p|\leq|q|s_{\alpha}$ . Set $z=\frac{|p|}{|q|}\in[0,s_{\alpha}]$ . Then we have

[TABLE]

Since $u_{\alpha}(z)$ coincides with $N_{\alpha}(z)$ up to a positive constant, the convexity follows from Lemma A.1 and the fact that $\alpha\geq 2$ .

Notice that for each $(x,y)\in\mathbb{R}\times\mathbb{R}_{+}$ the map

[TABLE]

satisfies the assumptions of Theorem A.6 where we take $(p_{0},q_{0})=(0,0)$ (see Section A.3 in Appendix). Therefore the conclusions of Lemma 2.1 follow from Theorem A.6. ∎

Lemma 2.2.

For $\beta=\frac{\alpha}{\alpha-1}$ , any $x,a,b\in\mathbb{R}$ , and any $y\geq 0$ we have

[TABLE]

The reader notices that dualization (12) produces inequality (17) that is different from (10).

Proof.

Set

[TABLE]

Lemma 2.1 gives points $(p^{*},q^{*})$ and $(p^{\pm},q^{\pm})$ corresponding to $(x,y)$ and $(x_{\pm},y_{\pm})$ . It follows from (14) that to prove (17) it would be enough to find numbers $p\in\mathbb{R}$ and $q_{1},q_{2}\leq 0$ such that

[TABLE]

The right choice will be

[TABLE]

but let us explain it in details.

Notice that by Cauchy–Schwarz we have

[TABLE]

provided that $q_{1},q_{2}\leq q^{*}\leq 0$ . Indeed, we have

[TABLE]

Denoting $r_{j}^{2}=q_{j}^{2}-(q^{*})^{2}$ for $j=1,2$ , we see that it is enough to find $p\in\mathbb{R}$ and $r_{1},r_{2}\geq 0$ such that

[TABLE]

By choosing $p=\frac{p^{+}+p^{-}}{2}$ , and substituting the values for $x_{\pm}=x\pm a$ we see that it would suffice to find $r_{1},r_{2}\geq 0$ such that

[TABLE]

We will choose $r_{1}=r_{2}=\frac{|p^{+}-p^{-}|}{2}$ . It follows from $-|a||p^{+}-p^{-}|+a(p^{+}-p^{-})\leq 0$ that we only need to have the inequality

[TABLE]

But this inequality follows from (10).

To verify the obstacle condition (16), notice that (9) for $U(p,q)$ gives

[TABLE]

Finally if $y=0$ , then we obtain

[TABLE]

Equality (*) follows from the fact that

[TABLE]

is an even convex map.

∎

Corollary 2.3.

For any $a,x\in\mathbb{R}$ , all $y,b\in\mathbb{R}^{N}$ , and any $N\geq 1$ , we have

[TABLE]

Proof.

It follows from the definition of $M$ that the map $y\mapsto M(x,y)$ is decreasing in $y$ for $y\geq 0$ . Therefore by (17) and the triangle inequality we obtain

[TABLE]

∎

The inequality (20) gives rise to the estimate

[TABLE]

Indeed, the reader can find in [11] the passage from (20) to (21). In fact, inequality (20) is the same as

[TABLE]

where $\mathbb{E}_{x_{j}}$ takes the average in the coordinate $x_{j}$ , i.e.,

[TABLE]

The rest follows by iterating (22), the fact that $\mathbb{E}=\mathbb{E}_{x_{1}}\ldots\mathbb{E}_{x_{n}}$ and $|\nabla\mathbb{E}f|=0$ .

2.3 The proof of Theorem 1.1

We have

[TABLE]

and this gives inequality (1).

3 Remarks and Applications

3.1 Going from $U$ to $M$ : from Square function to the Hamming cube

Let $g$ be an integrable function on $[0,1]$ . Let $D([0,1])$ denote all dyadic intervals in $[0,1]$ . Consider the dyadic martingale $g_{n}$ defined as follows

[TABLE]

where $\langle g\rangle_{I}=\frac{1}{|I|}\int_{I}g$ . The square function $S(g)$ is defined as follows

[TABLE]

For convenience we always assume that the number of nonzero terms in (23) is finite so that $S(g)(x)$ makes sense. Let $O(p,q)$ be a continuous real valued function, and suppose one wants to estimate the quantity $\int_{0}^{1}O(g,S(g))$ from above in terms of $\int_{0}^{1}g$ . If one finds a function

[TABLE]

then one obtains (see [20]) the bound

[TABLE]

Conversely, suppose that the inequality

[TABLE]

holds for all integrable functions $g$ on $[0,1]$ and some $F$ . Then there exists $U(p,q)$ such that the conditions (24), (25) are satisfied and $U(p,0)\leq F(p)$ . Indeed, consider the extremal problem

[TABLE]

This $U$ satisfies (24) (take $g=p$ constant), and, in fact, it satisfies (25). The latter fact can be proved by using the standard Bellman principle (see Chapter 8, [17], and survey [16]). Besides,

[TABLE]

because of (27). Therefore there is one to one correspondence between the extremal problems for the square function of the form (27) and the functions $U(p,q)$ with the properties (24) and (25).

The gradient estimates on the Hamming cube are more subtle. Take any real valued $\widetilde{O}(x,y)$ and suppose that we want to estimate $\mathbb{E}\widetilde{O}(f,|\nabla f|)$ from above in terms of $\mathbb{E}f$ for any $f:\{-1,1\}^{n}\to\mathbb{R}$ and for all $n\geq 1$ . If one finds $M(x,y)$ such that

[TABLE]

then111We do also need to assume that $y\mapsto M(x,y)$ is decreasing in $y$ for each fixed $x$ to ensure Corollary 2.3. But if $M$ is $C^{1}$ smooth then $M_{y}\leq 0$ is guaranteed by (29). Indeed, if we take $a=0$ in (29) we obtain that $y\mapsto M(x,y)$ is concave for each $x$ . Next, taking $y=b=0$ and sending $a\to 0+$ , we obtain by Taylor’s formula that $M_{y}(x,0)\leq 0$ . Therefore $M_{y}(x,y)\leq 0$ . one can obtain the estimate (see [11])

[TABLE]

Thus finding such $M$ is sufficient to obtain the estimate but it is unclear whether conditions (28) and (29) are necessary to obtain the bound $\mathbb{E}\widetilde{O}(f,|\nabla f|)\leq M(\mathbb{E}f,0)$ . In other words we do not know what is the corresponding extremal problem for $M$ , i.e., what is the right Bellman function $M$ . The reason lies in the fact that there is an essential difference between the Hamming cube and the dyadic intervals, i.e., test functions do not concatenate in a good way on $\{-1,1\}^{n}$ as it happens for dyadic martingales.

Now we formulate an abstract theorem that formalizes our duality principle in a general setting.

Theorem 3.1.

Let $I,J\subseteq\mathbb{R}$ be convex sets. Take an arbitrary $O(p,q)\in C(I\times\mathbb{R}_{+})$ , and let $U(p,q):I\times\mathbb{R}_{+}\to\mathbb{R}$ satisfy properties (24) and (25). Assume that for each $(x,y)\in J\times\mathbb{R}_{+}$ , we have

[TABLE]

Then $M$ and $\widetilde{O}$ defined as

[TABLE]

satisfy (29) and (28), and, thereby, (30) for any $f:\{-1,1\}^{n}\to J$ and any $n\geq 1$ .

One may think that finding $U(p,q)$ with the property (25) is a difficult problem. Let us make a quick remark here that if it happens that $t\mapsto U(p,\sqrt{t})$ is convex for each fixed $p\in I$ , then (25) is automatically implied by its infinitesimal form, i.e., by $U_{pp}+U_{q}/q\leq 0$ (see the proof of Lemma A.4).

Proof.

The proof essentially repeats the proof of Lemma 2.2. Let us sketch the argument. Define $\Psi(p,q,x,y):=px+qy+U(p,|q|)$ . The existence of a saddle point $(p^{*},q^{*})$ with properties (13) and (14) is guaranteed by Lemma A.5. The convexity of the set $I$ allows us to choose $p$ from $I$ , and $q_{1},q_{2}\in(-\infty,0]$ according to (18). The rest of the proof of the theorem is the same as in Lemma 2.2. Inequality (32) follows from (24). Convexity of $J$ is needed, for example, to ensure that if $f:\{-1,1\}^{n}\to J$ , then $\mathbb{E}f\in J$ , so that (30) makes sense. ∎

3.2 Going from $M$ to $U$ : from Hamming cube to square function

Another interesting observation is that equality (31) was lurking in a solution of a certain Monge–Ampère equation. For example, taking $a,b\to 0$ in (29), and using the Taylor’s series expansion (assuming that $M$ is smooth enough) one obtains

[TABLE]

When looking for the least function $M$ with $M\geq\widetilde{O}$ and (33), it is reasonable to assume that condition (33) should degenerate except, possibly, on the set where $M$ coincides with its obstacle $\widetilde{O}$ . The degeneracy of (33) means that the determinant of the matrix in (33) is zero. This is a general Monge–Ampère type equation and, after an application of the exterior differential systems of Bryant–Griffiths (see [12]), we obtain that the solutions can be locally characterized as follows:

[TABLE]

where $U$ satisfies the equation

[TABLE]

In [12] we used $u(p,t)=-U(p,\sqrt{2t})$ instead of $U(p,q)$ , in which case (35) becomes just the backward heat equation for $u(p,t)$ . We will not formulate a formal statement but we do make a remark that such a reasoning allows us to guess the dual of $M$ , i.e., to find $U$ given $M$ . The way this guess works will be illustrated in Section 3.4.

Our final remark is that one may try to use $U(p,q):=M(p,q)$ with $O(p,q):=\widetilde{O}(p,q)$ because (29) clearly implies (25). It will definitely give some estimate for the square function but not the sharp one. Indeed, for the sharp estimates, condition (25) for $U$ usually degenerates, namely (35) holds. On the other hand, if $M_{xx}+M_{y}/y=0$ and (33) holds, then $M_{xy}=0$ , and

[TABLE]

for some constants $C,D,Q\in\mathbb{R}$ . This family of functions corresponds to the trivial inequality $\int_{0}^{1}S(g)^{2}\leq\int_{0}^{1}g^{2}$ . Analogously, the best possible function $U$ satisfying (24) and (25) will almost never satisfy (33) except for a very particular case when $U(p,q)=C(p^{2}-q^{2})+Dp+Q$ .

3.3 The dual to Log-Sobolev is Chang–Wilson–Wolff

The function $M(x,y)=x\ln x-\frac{y^{2}}{2x}$ satisfies (33) and, therefore, it gives the log-Sobolev inequality [12]. Its dual in the sense of (34) is $U(p,q)=e^{p-q^{2}/2}$ (see Section 3.1.1 in [12] where $t=q^{2}/2$ ). Notice that for this $U$ , inequality (25) simplifies to

[TABLE]

which is true since $(2k)!\geq 2^{k}k!$ for $k\geq 0$ . Therefore we obtain

Corollary 3.2.

For any integrable $g$ on $[0,1]$ , we have

[TABLE]

This corollary immediately recovers the result of Chang-Wilson-Wolf [7] well-known to probabilists, namely for any $g$ with $\int_{0}^{1}g=0$ and $\|S(g)\|_{\infty}<\infty$ , we have

[TABLE]

Next, repeating a standard argument, namely, considering $tg$ and applying Chebyshev’s inequality (see Theorem 3.1 in [7]), one obtains the superexponential bound

[TABLE]

for any $\lambda\geq 0$ .

We should remind that the log-Sobolev inequality via the Herbst argument [13] gives Gaussian concentration inequalities, namely,

[TABLE]

for any $\lambda\geq 0$ and any smooth $f:\mathbb{R}^{n}\to\mathbb{R}$ with $\|\nabla f\|_{\infty}<\infty$ . Here $\gamma$ is the standard Gaussian measure on $\mathbb{R}^{n}$ .

In other words we just illustrated that estimates (39) and (38) are dual to each other in the sense of duality between functions $M=x\ln x-\frac{y^{2}}{2x}$ and $U=e^{p-q^{2}/2}$ .

3.4 Poincaré inequality 3/2: a simple proof via duality

It was proved in [11] that for any $f:\{-1,1\}^{n}\to\mathbb{R}$ , we have

[TABLE]

where $z^{3/2}$ is taken in the sense of the principal brunch in the upper half-plane. Inequality (40) improves Beckner’s bound for a particular exponent [11]. Consider

[TABLE]

It was explained in [11] that to prove (40) it is enough to check that $M(x,y)$ satisfies (29), and the latter fact involved careful investigation of the roots of several very high degree polynomials with integer coefficients. Let us give a simple proof of (29) using our duality technique.

Proposition 3.3.

The function $M(x,y)=\Re(x+iy)^{3/2}$ satisfies (29) for all $x,a,b\in\mathbb{R}$ and $y\geq 0$ .

Proof.

$M(x,y)$ is a solution of the homogeneous Monge–Ampère equation (33), and therefore it has a representation of the form (34) (see Section 3.1.4 in [12]):

[TABLE]

This leads us to the following guess

[TABLE]

which can be directly checked. Using Theorem A.6 with $(p_{0},q_{0})=(0,0)$ , and following the proof of Lemma 2.2, it is enough to check that $U(p,q)$ satisfies (25). Notice that (25) is an identity for $U(p,q)=-\frac{4}{27}(p^{3}-3pq^{2})$ . This finishes the proof of the proposition. ∎

3.5 Sobolev inequalities

3.5.1 The Hamming cube $\{-1,1\}^{n}$

For $p\in[1,2]$ , let $c_{p}$ be the best possible constant such that

[TABLE]

Our theorem implies that $c_{p}\geq s^{p}_{p^{\prime}}$ for $p\in(1,2]$ . Notice that when $p=2$ , we have $c_{2}=s_{2}^{2}=1$ , and (41) recovers the classical Poincaré inequality. When $p\to 1+$ the constant $s^{p}_{p^{\prime}}$ tends to zero which should not be the case for $c_{p}$ . Indeed, it follows from a deep result of Talagrand [19] that if $T_{p}$ is the best possible constant in the following estimate

[TABLE]

then $T_{p}>0$ for all $p\in[1,\infty)$ . Now notice that $T_{1}=c_{1}$ , $T_{2}=c_{2}$ and $T_{p}\geq c_{p}$ for $p\in(1,2)$ . When $p>2$ , by example (49), we must have $c_{p}=0$ unlike the fact that $T_{p}>0$ for $p>2$ . So one may wonder whether the positivity of $T_{p}$ may not imply the positivity of $c_{p}$ on the interval $(1,2)$ . Let us mention that this is not the case, in fact $2c_{p}\geq T_{p}$ for $p\in(1,2)$ . Indeed, it will suffice to prove that $2\mathbb{E}|f-\mathbb{E}f|^{p}\geq\mathbb{E}|f|^{p}-|\mathbb{E}f|^{p}$ . If $\mathbb{E}f=0$ this is obvious. Assume $\mathbb{E}f\neq 0$ . Next, we show a simple inequality

[TABLE]

Plugging $x=f/\mathbb{E}f$ , and taking the expectation, we obtain $2\mathbb{E}|f-\mathbb{E}f|^{p}\geq\mathbb{E}|f|^{p}-|\mathbb{E}f|^{p}$ . To verify (43), without loss of generality assume that $p>1$ (otherwise the inequality is trivial). Consider $g(x)=2|x-1|^{p}-|x|^{p}+1$ . Its second derivative changes signs at points $x$ which satisfy the equation $|x-1|=2^{1/(2-p)}|x|$ , i.e., when $x=x_{\pm}=\frac{1}{1\pm 2^{1/(2-p)}}$ . The right hand side of (43) represents the tangent line to the graph of $g$ at the point $x=1$ . Clearly $g$ is convex on $[x_{+},\infty)$ . Therefore (43) is true on this interval. Next, $g$ is concave on $[x_{-},x_{+}]$ and since $x_{-}<0$ , we have $g\geq p(1-x)$ on $[0,x_{+}]$ because $g(0)>p(1-0)$ . Thus (43) is true for $x\geq 0$ . For $x\leq 0$ , by Bernoulli we have

[TABLE]

To the best of our knowledge, the constants $c_{p},T_{p}$ are unknown for $p\in[1,2)$ . There is a remarkable result of Ben-Efraim–Lust-Piquard [2] that $T_{p}\geq\frac{2}{\pi}$ for $1\leq p\leq 2$ .

This, combined to our theorem, gives the lower bound $T_{p}\geq\max\{\frac{2}{\pi},s_{p^{\prime}}\}$ for $1\leq p\leq 2$ . However, due to the inequalities of Bobkov–Götze and Maurey–Pisier (see the next section), it is widely believed that $c_{1}=T_{1}=\sqrt{\frac{2}{\pi}}$ .

An elegant idea of Naor–Schechtman [15] based on Burkholder’s inequality [5] gives an estimate

[TABLE]

Let us show that our bound (2) obtained in Corollary 1.2 is better.

Proposition 3.4.

For all $1<p<2$ we have

[TABLE]

Proof.

We estimate $s_{p^{\prime}}$ from below by $\sqrt{2/p^{\prime}}$ (see Lemma A.2). Using $\Gamma(3/2)=\sqrt{\pi}/2$ we see that to prove (44) it is enough to show the following two inequalities

[TABLE]

where $p_{0}\approx 1.847...$ is the solution of the equation $\Gamma((p+1)/2)=\sqrt{\pi}/2$ on the interval $(1,2)$ . Inequality (45) simplifies to $2^{2-\frac{2}{p}}>p(p-1)$ which is true because the left hand side is concave, and the right hand side is convex on $[1,2]$ . To show (46) it is enough to verify that

[TABLE]

The latter inequality we rewrite as follows

[TABLE]

Since the Trigamma function is convex

[TABLE]

we estimate $\ln(\Gamma((p+1)/2))$ from below by its tangent line at point $p=2$ , i.e.,

[TABLE]

here $\gamma$ is Euler’s constant. It is enough to show that

[TABLE]

The left hand side is concave on the interval $(1+\sqrt{2}/2,2)$ , and at the endpoint cases we have the inequality. Notice that $(1+\sqrt{2}/2)=1.7...<p_{0}=1.82...$ , and this finishes the proof. ∎

3.5.2 Gaussian measure on $\mathbb{R}^{n}$

The application of the Central Limit Theorem to (1) gives a dimension independent Sobolev inequality.

Corollary 3.5.

For any smooth bounded $f:\mathbb{R}^{n}\to\mathbb{R}$ and any $n\geq 1$ , we have

[TABLE]

The best possible constant in (47), unlike $s_{p^{\prime}}^{p}$ , should not degenerate when $p\to 1+$ . Indeed, (see [14], pp. 115) one has

[TABLE]

where the constant $\sqrt{\frac{2}{\pi}}$ is the best possible in the left hand side of (48). We should mention that estimate (48) can be also easily obtained by a remarkable trick of Maurey–Pisier [18].

Notice that (47) cannot be extended for the range of exponents $p>2$ with some positive constant $C(p)$ instead of $s_{p^{\prime}}^{p}$ . Indeed, assume the contrary. Consider $n=1$ and take $f(x)=1+ax$ . Using Jensen’s inequality, we obtain

[TABLE]

Therefore, taking $a\to 0$ , we obtain the contradiction with $pa^{2}/2>\frac{|a|^{p}}{C(p)}$ for $p>2$ .

3.6 Discrete surface measure

Let $A\subset\{-1,1\}^{n}$ be a subset of the Hamming cube with cardinality $|A|=2^{n-1}$ . Define $w_{A}:\{-1,1\}^{n}\to\mathbb{N}\cup\{0\}$ so that $w_{A}(x)$ is the number of boundary edges to $A$ containing $x$ , i.e., $w_{A}(x)$ counts the number of edges with one endpoint in $A$ and another one in the complement of $A$ such that one of the endpoints is $x$ . Clearly $w_{A}(x)=0$ if $x$ is in the “strict interior” of $A$ , or in the “strict complement” of $A$ , and it is nonzero if and only if $x$ is on the “boundary” of $A$ . Notice that $w_{A}(x)$ can be nonzero for some $x\notin A$ . The function $w_{A}$ maybe be understood as a discrete surface measure of the boundary of $A$ . Consider the following quantity

[TABLE]

It follows from Harper’s edge-isoperimetric inequality [10] that $\sigma(2)=1$ and the value is attained on the halfcube. The monotonicity of $\sigma(p)$ in $p$ implies that $\sigma(p)=1$ for all $p\geq 2$ . Also notice that considering Hamming balls, one can easily show that $\sigma(p)=0$ for $0\leq p<1$ . Therefore the first nontrivial value is $\sigma(1)$ . In this case it follows from Bobkov’s inequality (see [3] and references therein) that $\sigma(1)\geq\sqrt{\frac{2}{\pi}}\approx 0.79$ , and by monotonicity we obtain that $\sigma(p)\geq\sqrt{\frac{2}{\pi}}$ which is definitely not sharp when $p\to 2-$ .

Define $f:\{-1,1\}^{n}\to\{-1,1\}$ as follows: $f(x)=1$ if $x\in A$ and $f(x)=-1$ if $x\notin A$ . Clearly $|\nabla f(x)|^{2}=w_{A}(x)$ . Applying (1) to $f$ , we obtain

[TABLE]

Inequality (51) gives the lower bound $\sigma(p)\geq s^{p}_{p^{\prime}}$ which tends to $1$ as $p\to 2-$ , but fails to be sharp when $p\to 1+$ . Thus combining this result with Bobkov’s inequality we obtain the bound

[TABLE]

Appendix A Appendix

A.1 Properties of $N_{\alpha}(t)$

Lemma A.1.

For any $\alpha\geq 2$ , we have $0<s_{\alpha}\leq 1$ . In addition $s_{\alpha}$ is decreasing in $\alpha>0$ , and $N^{\prime}_{\alpha}(t),N^{\prime\prime}_{\alpha}(t)\leq 0$ on $[0,s_{\alpha}]$ for $\alpha>0$ .

Proof.

Consider $G_{\alpha}(t):=e^{-t^{2}/4}N_{\alpha}(t)$ . Notice that the zeros of $G_{\alpha}$ and $N_{\alpha}$ are the same. It follows from (7) that

[TABLE]

Besides we know that the solution is even. Consider the critical case $\alpha=2$ . In this case $G_{2}(t)=e^{-t^{2}/4}(1-t^{2})$ and the smallest positive zero is $s_{2}=1$ . Therefore it follows from the Sturm comparison principle that $0<s_{\alpha}<1$ for $\alpha>2$ (see below). Moreover, the same principle applied to $G_{\alpha_{1}}$ and $G_{\alpha_{2}}$ with $\alpha_{1}>\alpha_{2}$ implies that $G_{\alpha_{1}}$ has a zero inside the interval $(-s_{\alpha_{2}},s_{\alpha_{2}})$ . Thus we conclude that $s_{\alpha}$ is decreasing in $\alpha$ .

To verify that $N^{\prime}_{\alpha},N^{\prime\prime}_{\alpha}\leq 0$ on $[0,s_{\alpha}]$ , first we claim that

[TABLE]

for $\alpha_{1}>\alpha_{2}>0$ . Indeed the proof works in the same way as the proof of Sturm’s comparison principle. For the convenience of the reader we decided to include the argument. As before, consider $G_{\alpha_{j}}=e^{-t^{2}/4}N_{\alpha_{j}}$ . It is enough to show that $G_{\alpha_{2}}\geq G_{\alpha_{1}}$ on $[0,s_{\alpha_{1}}]$ . It follows from (53) that $G^{\prime\prime}_{\alpha_{2}}(0)>G^{\prime\prime}_{\alpha_{1}}(0)$ . Therefore, using the Taylor series expansion at the point [math], we see that the claim is true at some neighbourhood of zero, say $[0,\varepsilon)$ with $\varepsilon$ sufficiently small. Next we assume the contrary, i.e., that there is a point $a\in[\varepsilon,s_{\alpha_{1}}]$ such that $G_{\alpha_{2}}\geq G_{\alpha_{1}}$ on $[0,a]$ , $G_{\alpha_{2}}(a)=G_{\alpha_{1}}(a)$ and $G^{\prime}_{\alpha_{2}}(a)<G^{\prime}_{\alpha_{1}}(a)$ (notice that the case $G^{\prime}_{\alpha_{2}}(a)=G^{\prime}_{\alpha_{1}}(a)$ , by the uniqueness theorem for ODEs, would imply that $G_{\alpha_{2}}=G_{\alpha_{1}}$ everywhere, which is impossible). Consider the Wronskian

[TABLE]

We have $W(0)=0$ and $W(a)=G_{\alpha_{1}}(a)(G^{\prime}_{\alpha_{1}}(a)-G^{\prime}_{\alpha_{2}}(a))\geq 0$ . On the other hand, we have

[TABLE]

which is a clear contradiction, and this proves the claim.

It follows from (6) that

[TABLE]

and inequalities $N_{\alpha-2}\geq N_{\alpha}\geq 0$ on $[0,s_{\alpha}]$ imply that $N^{\prime\prime}_{\alpha}\leq 0$ on $[0,s_{\alpha}]$ . Since $N^{\prime}_{\alpha}(0)=0$ and $N^{\prime\prime}_{\alpha}\leq 0$ on $[0,s_{\alpha}]$ , we must have $N^{\prime}_{\alpha}\leq 0$ on $[0,s_{\alpha}]$ . ∎

Lemma A.2.

We have $s_{\alpha}\geq\sqrt{\frac{2}{\alpha}}$ for all $\alpha\geq 2$ .

Proof.

Notice that $G_{2}(x)=e^{-x^{2}/4}(1-x^{2})$ satisfies (53) with $\alpha=2$ . Now consider $V_{\alpha}(x):=G_{2}(x\sqrt{\alpha/2})$ . The function $V_{\alpha}(x)$ satisfies the equation

[TABLE]

and $V_{\alpha}(0)=1$ , $V^{\prime}_{\alpha}(0)=0$ . Notice that $V_{\alpha}(x)>0$ on $[0,\sqrt{2/\alpha})$ . Since

[TABLE]

it follows from the Sturm comparison principle (see the previous discussions) that $G_{\alpha}>V_{\alpha}>0$ on $(0,\sqrt{2/\alpha})$ . Thus we obtain that $s_{\alpha}\geq\sqrt{2/\alpha}$ . ∎

A.2 Heat inequality

Let $U(p,q)$ be defined as in (8).

Lemma A.3.

For any $p\in\mathbb{R}$ , the map

[TABLE]

is convex.

Proof.

Without loss of generality, assume that $p\geq 0$ . We recall that $U(p,\sqrt{t})=t^{\alpha/2}u_{\alpha}(p/\sqrt{t})$ . Since $\alpha\geq 2$ , the only interesting case to consider is when $p/\sqrt{t}<s_{\alpha}$ (otherwise $t^{\alpha/2}$ is convex). In this case we have $U(p,\sqrt{t})=t^{\alpha/2}N_{\alpha}(p/\sqrt{t})$ up to a positive constant which we are going to ignore, and, therefore, by (7) we have $(U(p,\sqrt{t}))_{t}+\frac{(U(p,\sqrt{t}))_{pp}}{2}=0$ . Using (54), we obtain

[TABLE]

Therefore it would be enough to show that for any $\gamma\geq 0$ , the function $\frac{N_{\gamma}(x)}{x^{\gamma}}$ is decreasing for $x\in(0,s_{\gamma+2})$ . Differentiating, and using (7) again, we obtain

[TABLE]

which is nonpositive by Lemma A.1. ∎

The next lemma, together with Lemma A.3 and (11), implies that $U(p,q)$ satisfies (10).

Lemma A.4 (Barthe–Mauery [1]).

Let $J$ be a convex subset of $\mathbb{R}$ , and let $V(p,q):J\times\mathbb{R}_{+}\to\mathbb{R}$ be such that

[TABLE]

Then for all $(p,q,a)$ with $p\pm a\in J$ and $q\geq 0$ , we have

[TABLE]

The lemma says that the global discrete inequality (58) is in fact implied by its infinitesimal form (56) under the extra condition (57).

Proof.

The argument is borrowed from [1]. The similar argument was used by Davis [8] in obtaining sharp square function estimates from the ones for the Brownian motion.

Without loss of generality assume $a\geq 0$ . Consider the process

[TABLE]

Here $B_{t}$ is the standard Brownian motion starting at zero. It follows from Ito’s formula together with (56) that $X_{t}$ is a supermartingale. Let $\tau$ be the stopping time

[TABLE]

It follows from the optional stopping theorem that

[TABLE]

Notice that we have used $P(B_{\tau}=a)=P(B_{\tau}=-a)=1/2$ , $\mathbb{E}(\tau|B_{\tau}=a)=\mathbb{E}(\tau|B_{\tau}=-a)=a^{2}$ , and the fact that the map $t\mapsto V(p,\sqrt{t})$ is convex together with Jensen’s inequality. ∎

A.3 Minimax theorem for noncompact sets

Let $P,Q$ be nonempty closed convex sets in $\mathbb{R}$ . We say that a pair $(p^{*},q^{*})\in P\times Q$ is a saddle point of $f$ on $P\times Q$ if

[TABLE]

Lemma A.5.

The function $f$ defined on $P\times Q$ with real values possesses a saddle point $(p^{*},q^{*})$ on $P\times Q$ if and only if

[TABLE]

and this number is then equal to $f(p^{*},q^{*})$ .

For the proof we refer the reader to Proposition 1.2 in [9], pp. 167.

Theorem A.6.

Suppose that $f:P\times Q\to\mathbb{R}$ is continuous, concave in $p$ , convex in $q$ , and there exists $(p_{0},q_{0})\in P\times Q$ such that

[TABLE]

Then $f$ possesses at least one saddle point on $P\times Q$ and

[TABLE]

The theorem is Proposition 2.2 in [9], pp. 173.

Acknowledgments

We are very grateful to several people for discussions and suggestions that led us to noticing the duality between the Hamming cube and the square function: G. Aubrun for valuable remarks on optimizers in (50); D. Bilyk for providing the reference to sharp constants for Square functions [20]; R. O’Donell for providing the references; R. Latała for pointing out example (49); S. Petermichl for bringing our attention to Bellman functions in Square function estimates and Poincaré inequalities for the Gaussian measure; S. Treil for attracting our attention to Chang–Wilson–Wolff’s superexponential bound (Corollary 38) and its similarity to the Gaussian concentration inequality; R. van Handel for references, including (3) and (48), and making several important remarks. We thank an anonymous referee for helpful comments and remarks.

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] F. Barthe, B. Maurey , Some remarks on isoperimetry of Gaussian type , Annales de l’Institut Henri Poincare (B) Probability and Statistics, Vol. 36, Iss. 4, pp. 419–434
2[2] L. Ben-Efraim, F. Lust-Piquard , Poincaré type inequalities on the discrete cube and in the CAR algebra , Probability Theory and Related Fields, Vol. 141, Iss. 3–4, pp. 569–602 (2008)
3[3] S. G. Bobkov, F. Götze , Discrete isoperimetric and Poincaré-type inequalities , Probab. Theory Relat. Fields 114, 245–277 (1999)
4[4] D. Burkholder , Sharp inequalities for martingales and stochastic integrals , Colloque Paul Lévy (Palaiseau, 1987), Ast’erisque 157-158 (1988), 75–94
5[5] D. L. Burkholder , Boundary Value Problems and Sharp Inequalities for Martingale Transforms , Ann. Probab., Vol. 12, No. 3, (1984), 647–702
6[6] D. L. Burkholder, R. F. Gundy , Extrapolation and interpolation of quasi-linear operators on martingales , Acta. Math., Vol. 124 (1970), 249–304
7[7] A. Chang, J. M. Wilson, Th. Wolff , Some weighted norm inequalities concerning the Schrödinger operators , Comment. Math. Helvetici, Vol. 60, 1985, 217–246
8[8] B. Davis , On the L p superscript 𝐿 𝑝 L^{p} norms of stochastic integrals and other martingales , Duke Math. J. Vol. 43, pp. 697–704 (1976)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Square functions and the Hamming cube: Duality

Abstract

1 Main result

Theorem 1.1**.**

Corollary 1.2**.**

Theorem 1.3** (see [11]).**

2 Proof of the main result

2.1 An anonymous Bellman function

2.2 Dualizing the Bellman function U(p,q)U(p,q)U(p,q) and going to the Hamming cube

Lemma 2.1**.**

Proof.

Lemma 2.2**.**

Proof.

Corollary 2.3**.**

Proof.

2.3 The proof of Theorem 1.1

3 Remarks and Applications

3.1 Going from UUU to MMM: from Square function to the Hamming cube

Theorem 3.1**.**

Proof.

3.2 Going from MMM to UUU: from Hamming cube to square function

3.3 The dual to Log-Sobolev is Chang–Wilson–Wolff

Corollary 3.2**.**

3.4 Poincaré inequality 3/2: a simple proof via duality

Proposition 3.3**.**

Proof.

3.5 Sobolev inequalities

3.5.1 The Hamming cube {−1,1}n\{-1,1\}^{n}{−1,1}n

Proposition 3.4**.**

Proof.

3.5.2 Gaussian measure on Rn\mathbb{R}^{n}Rn

Corollary 3.5**.**

3.6 Discrete surface measure

Appendix A Appendix

A.1 Properties of Nα(t)N_{\alpha}(t)Nα​(t)

Lemma A.1**.**

Proof.

Lemma A.2**.**

Proof.

A.2 Heat inequality

Lemma A.3**.**

Proof.

Lemma A.4** (Barthe–Mauery [1]).**

Proof.

A.3 Minimax theorem for noncompact sets

Lemma A.5**.**

Theorem A.6**.**

Acknowledgments

Theorem 1.1.

Corollary 1.2.

Theorem 1.3 (see [11]).

2.2 Dualizing the Bellman function $U(p,q)$ and going to the Hamming cube

Lemma 2.1.

Lemma 2.2.

Corollary 2.3.

3.1 Going from $U$ to $M$ : from Square function to the Hamming cube

Theorem 3.1.

3.2 Going from $M$ to $U$ : from Hamming cube to square function

Corollary 3.2.

Proposition 3.3.

3.5.1 The Hamming cube $\{-1,1\}^{n}$

Proposition 3.4.

3.5.2 Gaussian measure on $\mathbb{R}^{n}$

Corollary 3.5.

A.1 Properties of $N_{\alpha}(t)$

Lemma A.1.

Lemma A.2.

Lemma A.3.

Lemma A.4 (Barthe–Mauery [1]).

Lemma A.5.

Theorem A.6.