Geometric averaging operators and nonconcentration inequalities

Philip T Gressman

arXiv:1906.04599·math.CA·March 23, 2022

Geometric averaging operators and nonconcentration inequalities

Philip T Gressman

PDF

TL;DR

This paper systematically studies geometric integral inequalities related to Radon-like transforms over polynomial submanifolds, extending key results in geometric measure theory to improve understanding of $L^p$-inequalities.

Contribution

It introduces new geometric averaging operators and establishes nonconcentration inequalities, advancing the continuum combinatorial approach to $L^p$-improving inequalities.

Findings

01

Derived new geometric integral inequalities

02

Extended results in geometric measure theory

03

Enhanced understanding of Radon-like transforms

Abstract

This paper is devoted to a systematic study of certain geometric integral inequalities which arise in continuum combinatorial approaches to $L^{p}$ -improving inequalities for Radon-like transforms over polynomial submanifolds of intermediate dimension. The desired inequalities relate to and extend a number of important results in geometric measure theory.

Equations361

T f (x) := \int_{R^{n}} f (γ (t, x)) χ_{Ω} (t, x) d t,

T f (x) := \int_{R^{n}} f (γ (t, x)) χ_{Ω} (t, x) d t,

Σ_{x} := {γ (t, x) \in R^{N_{1}} t \in R^{n}, (t, x) \in Ω} .

Σ_{x} := {γ (t, x) \in R^{N_{1}} t \in R^{n}, (t, x) \in Ω} .

\begin{split}\omega&(t,x):=\!\!\!\\ &\sum_{1\leq i_{1}<\cdots<i_{r}\leq N_{2}}\!\!\det\left[\!\!\begin{array}[]{cccc}\frac{\partial\gamma}{\partial x_{i_{1}}}(t,x)&\!\!\cdots&\!\!\!\frac{\partial\gamma}{\partial x_{i_{r}}}(t,x)&\!\!\frac{\partial\gamma}{\partial t}(t,x)\end{array}\!\!\right]dx_{i_{1}}\wedge\cdots\wedge dx_{i_{r}},\end{split}

\begin{split}\omega&(t,x):=\!\!\!\\ &\sum_{1\leq i_{1}<\cdots<i_{r}\leq N_{2}}\!\!\det\left[\!\!\begin{array}[]{cccc}\frac{\partial\gamma}{\partial x_{i_{1}}}(t,x)&\!\!\cdots&\!\!\!\frac{\partial\gamma}{\partial x_{i_{r}}}(t,x)&\!\!\frac{\partial\gamma}{\partial t}(t,x)\end{array}\!\!\right]dx_{i_{1}}\wedge\cdots\wedge dx_{i_{r}},\end{split}

Φ_{x} (t_{1}, \dots, t_{k}) := \frac{ω ( t _{1} , x ) \land \dots \land ω ( t _{k} , x )}{d x _{1} \land \dots \land d x _{N_{2}}} .

Φ_{x} (t_{1}, \dots, t_{k}) := \frac{ω ( t _{1} , x ) \land \dots \land ω ( t _{k} , x )}{d x _{1} \land \dots \land d x _{N_{2}}} .

\int_{E^{k}} ∣ Φ_{x} (t_{1}, \dots, t_{k}) ∣ d t_{1} \dots d t_{k} \geq δ ∣ E ∣^{k + s}

\int_{E^{k}} ∣ Φ_{x} (t_{1}, \dots, t_{k}) ∣ d t_{1} \dots d t_{k} \geq δ ∣ E ∣^{k + s}

∣∣ T χ_{F} ∣ ∣_{L^{k + s} (R^{N_{2}})} ≲ δ^{- \frac{1}{k + s}} ∣ F ∣^{\frac{k}{k + s}}

∣∣ T χ_{F} ∣ ∣_{L^{k + s} (R^{N_{2}})} ≲ δ^{- \frac{1}{k + s}} ∣ F ∣^{\frac{k}{k + s}}

\int χ_{R \cap Ω} (ω (t, x)) d t ≲ ∣ R ∣^{\frac{1}{s}}

\int χ_{R \cap Ω} (ω (t, x)) d t ≲ ∣ R ∣^{\frac{1}{s}}

A (E)

A (E)

S (E)

S (E)

A (E)

A (E)

S (E)

S (E)

∣∣ S ∣ ∣_{μ, s} \geq ∣∣ A ∣ ∣_{μ, s} ≳ ∣∣ S ∣ ∣_{μ, s},

∣∣ S ∣ ∣_{μ, s} \geq ∣∣ A ∣ ∣_{μ, s} ≳ ∣∣ S ∣ ∣_{μ, s},

λ_{Φ}^{σ} (E) := δ \to 0^{+} lim in f {i = 1 \sum \infty c_{i} [S (E_{i})]^{σ} χ_{E} \leq i = 1 \sum \infty c_{i} χ_{E_{i}}, c_{i} \geq 0 \mbox an d diam (E_{i}) \leq δ \mbox f or a l l i [i \sum S (E_{i})]^{s}} .

λ_{Φ}^{σ} (E) := δ \to 0^{+} lim in f {i = 1 \sum \infty c_{i} [S (E_{i})]^{σ} χ_{E} \leq i = 1 \sum \infty c_{i} χ_{E_{i}}, c_{i} \geq 0 \mbox an d diam (E_{i}) \leq δ \mbox f or a l l i [i \sum S (E_{i})]^{s}} .

S (E) ≳ [λ_{Φ}^{\frac{n}{q}} (E)]^{\frac{q}{n}} ≳ ∣∣ S ∣ ∣_{μ, \frac{q}{n}} [μ (E)]^{\frac{q}{n}}

S (E) ≳ [λ_{Φ}^{\frac{n}{q}} (E)]^{\frac{q}{n}} ≳ ∣∣ S ∣ ∣_{μ, \frac{q}{n}} [μ (E)]^{\frac{q}{n}}

∣ E ∣ ≲ [diam (E)]^{n} .

∣ E ∣ ≲ [diam (E)]^{n} .

μ (E) ≲ [diam (E)]^{n}

μ (E) ≲ [diam (E)]^{n}

μ (B_{r} (x)) ≲ r^{n}

μ (B_{r} (x)) ≲ r^{n}

H^{p} (γ (E)) ≲ [diam (γ (E))]^{p} .

H^{p} (γ (E)) ≲ [diam (γ (E))]^{p} .

∣ E ∣ ≲ [A, A^{'} \in E sup ∣ det (A - A^{'}) ∣]^{n}

∣ E ∣ ≲ [A, A^{'} \in E sup ∣ det (A - A^{'}) ∣]^{n}

Φ (x_{1}, \dots, x_{n + 1}) := det (γ (x_{1}) - γ (x_{n + 1}), \dots, γ (x_{n}) - γ (x_{n + 1})),

Φ (x_{1}, \dots, x_{n + 1}) := det (γ (x_{1}) - γ (x_{n + 1}), \dots, γ (x_{n}) - γ (x_{n + 1})),

μ (R) ≲ ∣ R ∣^{\frac{q}{p}}

μ (R) ≲ ∣ R ∣^{\frac{q}{p}}

P^{σ} (E) := δ \to 0^{+} lim in f {i = 1 \sum \infty c_{i} ω_{1}, \dots, ω_{k} \in E_{i} sup \frac{ω _{1} \land \dots \land ω _{k}}{e _{1} \land \dots \land e _{r k}}^{σ} χ_{E} \leq c_{i} \geq 0 \mbox an d diam (E_{i}) \leq δ i = 1 \sum \infty c_{i} χ_{E_{i}}, \mbox f or a l l i i = 1 \sum \infty ω_{1}, \dots, ω_{k} \in E_{i} s u p \frac{ω _{1} \land \dots \land ω _{k}}{e _{1} \land \dots \land e _{r k}}^{σ}}

P^{σ} (E) := δ \to 0^{+} lim in f {i = 1 \sum \infty c_{i} ω_{1}, \dots, ω_{k} \in E_{i} sup \frac{ω _{1} \land \dots \land ω _{k}}{e _{1} \land \dots \land e _{r k}}^{σ} χ_{E} \leq c_{i} \geq 0 \mbox an d diam (E_{i}) \leq δ i = 1 \sum \infty c_{i} χ_{E_{i}}, \mbox f or a l l i i = 1 \sum \infty ω_{1}, \dots, ω_{k} \in E_{i} s u p \frac{ω _{1} \land \dots \land ω _{k}}{e _{1} \land \dots \land e _{r k}}^{σ}}

Ω := ⎩ ⎨ ⎧ (t, x) \in R^{n} \times R^{N_{2}} \frac{d λ _{Φ_{x}}^{\frac{n}{q}}}{d t} (t) \geq c δ^{\frac{n}{q}} ⎭ ⎬ ⎫

Ω := ⎩ ⎨ ⎧ (t, x) \in R^{n} \times R^{N_{2}} \frac{d λ _{Φ_{x}}^{\frac{n}{q}}}{d t} (t) \geq c δ^{\frac{n}{q}} ⎭ ⎬ ⎫

S (E \cap Ω_{x}) ≳ [λ_{Φ_{x}}^{\frac{n}{q}} (E \cap Ω_{x})]^{\frac{q}{n}} \geq [c δ^{\frac{n}{q}} ∣ E \cap Ω_{x} ∣]^{\frac{q}{n}}

S (E \cap Ω_{x}) ≳ [λ_{Φ_{x}}^{\frac{n}{q}} (E \cap Ω_{x})]^{\frac{q}{n}} \geq [c δ^{\frac{n}{q}} ∣ E \cap Ω_{x} ∣]^{\frac{q}{n}}

Q (F) := \int_{R^{N_{2}}} \int_{(R^{n})^{k}} ∣ Φ_{x} (t_{1}, \dots, t_{k}) ∣ j = 1 \prod k χ_{F} (γ (t_{j}, x)) χ_{Ω} (t_{j}, x) d t_{1} \dots d t_{k} d x

Q (F) := \int_{R^{N_{2}}} \int_{(R^{n})^{k}} ∣ Φ_{x} (t_{1}, \dots, t_{k}) ∣ j = 1 \prod k χ_{F} (γ (t_{j}, x)) χ_{Ω} (t_{j}, x) d t_{1} \dots d t_{k} d x

(γ (t_{1}, x), \dots, γ (t_{k}, x)) = (u_{1}, \dots, u_{k})

(γ (t_{1}, x), \dots, γ (t_{k}, x)) = (u_{1}, \dots, u_{k})

Q (F) \leq N \int_{(R^{N_{1}})^{k}} j = 1 \prod k χ_{F} (u_{j}) d u_{1} \dots d u_{k} = N ∣ F ∣^{k} .

Q (F) \leq N \int_{(R^{N_{1}})^{k}} j = 1 \prod k χ_{F} (u_{j}) d u_{1} \dots d u_{k} = N ∣ F ∣^{k} .

Φ_{x} (t_{1}, \dots, t_{k}) := det \frac{\partial ( γ ( t _{1} , x ) , \dots , γ ( t _{k} , x ))}{\partial ( x , t _{1} , \dots , t _{k} )} = \frac{ω ( t _{1} , x ) \land \dots \land ω ( t _{k} , x )}{d x _{1} \land \dots \land d x _{N_{2}}} .

Φ_{x} (t_{1}, \dots, t_{k}) := det \frac{\partial ( γ ( t _{1} , x ) , \dots , γ ( t _{k} , x ))}{\partial ( x , t _{1} , \dots , t _{k} )} = \frac{ω ( t _{1} , x ) \land \dots \land ω ( t _{k} , x )}{d x _{1} \land \dots \land d x _{N_{2}}} .

\left[\begin{array}[]{ccccc}\frac{\partial\gamma}{\partial x}(t_{1},x)&\frac{\partial\gamma}{\partial t}(t_{1},x)&0&\cdots&0\\ \vdots&0&\ddots&\ddots&\vdots\\ \frac{\partial\gamma}{\partial x}(t_{k-1},x)&\vdots&\ddots&\frac{\partial\gamma}{\partial t}(t_{k-1},x)&0\\ \frac{\partial\gamma}{\partial x}(t_{k},x)&0&\cdots&0&\frac{\partial\gamma}{\partial t}(t_{k},x)\end{array}\right]

\left[\begin{array}[]{ccccc}\frac{\partial\gamma}{\partial x}(t_{1},x)&\frac{\partial\gamma}{\partial t}(t_{1},x)&0&\cdots&0\\ \vdots&0&\ddots&\ddots&\vdots\\ \frac{\partial\gamma}{\partial x}(t_{k-1},x)&\vdots&\ddots&\frac{\partial\gamma}{\partial t}(t_{k-1},x)&0\\ \frac{\partial\gamma}{\partial x}(t_{k},x)&0&\cdots&0&\frac{\partial\gamma}{\partial t}(t_{k},x)\end{array}\right]

(a_{11}

(a_{11}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Geometric averaging operators and nonconcentration inequalities

Philip T. Gressman111This work was partially supported by NSF grants DMS-1361697 and DMS-1764143.

Abstract

This paper is devoted to a systematic study of certain geometric integral inequalities which arise in continuum combinatorial approaches to $L^{p}$ -improving inequalities for Radon-like transforms over polynomial submanifolds of intermediate dimension. The desired inequalities relate to and extend a number of important results in geometric measure theory.

1 Introduction
1.1 Main results
1.2 Examples
1.3 Structure of the paper
2 Proof of Theorem 1
3 Proof of Theorem 2 and basic measure inequalities
3.1 Proof of Theorem 2
3.2 Basic GMT inequalities and Frostman’s Lemma
4 Proof of Theorem 3
4.1 The case $\sigma<n/q$
4.2 The case $\sigma\geq n/q$ : Comparison to Lebesgue measure
4.3 Multisystems and Theorem 3 with $\sigma=n/q$
4.4 Remarks on calculation
5 Proof of Lemma 3
5.1 Construction of the multisystem
5.2 Underlying geometry and solution counting
6 Further applications to Radon-like operators

1 Introduction

1.1 Main results

Suppose that $\gamma(t,x)$ is a polynomial map from ${\mathbb{R}}^{n}\times{\mathbb{R}}^{N_{2}}$ into ${\mathbb{R}}^{N_{1}}$ with $r:=N_{1}-n>0$ and that $\widetilde{\Omega}$ is some Borel measurable subset of ${\mathbb{R}}^{n}\times{\mathbb{R}}^{N_{2}}$ . To this $\gamma$ and $\widetilde{\Omega}$ , one may associate the Radon-like operator

[TABLE]

which may be informally regarded as averaging functions $f$ on ${\mathbb{R}}^{N_{1}}$ over the family of sets $\{\Sigma_{x}\}_{x\in{\mathbb{R}}^{N_{2}}}$ given by

[TABLE]

The main result of this paper regarding the operator (1) is the following:

Theorem 1.

Suppose $N_{2}=rk$ for some positive integer $k$ . Let $\omega$ be the $r$ -form

[TABLE]

where each ${\partial\gamma}/{\partial x_{i_{j}}}$ is an $N_{1}\times 1$ column matrix of partial derivatives, ${\partial\gamma}/{\partial t}$ is the $N_{1}\times n$ Jacobian matrix of $\gamma$ with respect to $t$ , and the determinant is that of the $N_{1}\times N_{1}$ square matrix formed by concatenation. For each $x\in{\mathbb{R}}^{N_{2}}$ , let222Note that the ratio of forms in the definition of $\Phi_{x}$ is a well-defined real number because both numerator and denominator belong to the same one-dimensional vector space of $N_{2}$ -forms on ${\mathbb{R}}^{N_{2}}$ .

[TABLE]

Fix any real $s,\delta>0$ and suppose that $\widetilde{\Omega}\subset{\mathbb{R}}^{n}\times{\mathbb{R}}^{N_{2}}$ is a Borel set such that

[TABLE]

for every point $x\in{\mathbb{R}}^{N_{2}}$ and every Borel $E\subset{\mathbb{R}}^{n}$ such that $E\times\{x\}\subset\widetilde{\Omega}$ , where $|E|$ denotes the Lebesgue measure of $E$ . Then the Radon-like operator (1) satisfies the inequality

[TABLE]

for all Borel sets $F\subset{\mathbb{R}}^{N_{1}}$ , with the notation “ $\lesssim$ ” indicating the presence of an implicit multiplicative factor. In this case, the factor depends only on $(n,N_{1},N_{2},s,\deg\gamma)$ .

The technical structure of the proof is built on the change of variables formula, similar to various earlier approaches [gressman2013, gressman2015] in the spirit of combinatorial/continuum incidence methods developed by Christ [christ1998]. Christ’s technique, based on ideas of Bourgain [bourgain1986, bourgain1991], Wolff [wolff1995, wolff1997], Schlag [schlag1997], and others, has, since its development twenty years ago, had an impact on the subject of harmonic analysis which is difficult to overstate. It has influenced and inspired work of Bennett, Carbery, and Wright [bcw2005], Dendrinos, Laghi, and Wright [dlw2009], Erdoğan and R. Oberlin [eo2008], Hickman [hickman2016], D. Oberlin [oberlin2000II], Stovall [stovall2011, stovall2014], Tao and Wright [tw2003], and many others.

When $r=1$ , the operator (1) integrates over hypersurfaces and the integral on the left-hand side of (4) reduces to a multilinear determinant functional [gressman2010]. In this case it is known that for fixed $x$ , the inequality (4) is satisfied if and only if the Lebesgue measure $dt$ on the submanifold $\Gamma_{x}\subset{\mathbb{R}}^{N_{2}}$ parametrized by $t\mapsto\omega(t,x)$ satisfies D. Oberlin’s affine curvature condition, meaning that

[TABLE]

for all boxes $R$ with arbitrary orientations and eccentricities, with an implicit constant which is independent of $R$ . The condition (6) is called affine because the implicit constant does not change when $\Gamma_{x}$ is acted on by an equiaffine333The prefix “equi-” specifies those affine transformations which preserve Lebesgue measure. transformation and is regarded as a curvature condition because it necessarily fails when $\Gamma_{x}$ lies in any affine hyperplane. The question of whether (6) is satisfied for a given $\omega(t,x)$ is surprisingly difficult to solve and systematic approaches have only recently become available [gressman2017]. When $r>1$ , the situation is even more difficult, as there are no previously-known analogues of the Oberlin affine curvature condition which apply to (4).

To address the inherent difficulties of the case $r>1$ , this paper is devoted primarily to the general study of functionals of the form

[TABLE]

and

[TABLE]

where the sets $E$ range over all Borel subsets of some domain $\Omega\subset{\mathbb{R}}^{n}$ and the measure $\mu$ is a nonnegative Borel measure. Functionals of the forms (7) and (8) will be called nonconcentration functionals since they quantify the extent to which product sets $E^{k}$ fail to lie in the zero set of $\Phi$ . Outside of the context of Theorem 1, $\Phi:\Omega^{k}\rightarrow{\mathbb{R}}^{m}$ will be taken to be any polynomial in $(x_{1},\ldots,x_{k})$ which vanishes to order $q\geq 1$ on the diagonal $\Delta:=\left\{(x_{1},\ldots,x_{k})\in\Omega^{k}\ \left|\ x_{1}=\cdots=x_{k}\right.\right\}$ , meaning that all partial derivatives of order less than $q$ vanish identically on $\Delta$ and some partial derivative of order $q$ is nonzero at some point of $\Delta$ . When $m>1$ , the absolute values $|\cdot|$ are to be understood as some fixed but otherwise arbitrary norm on ${\mathbb{R}}^{m}$ . The general question to be answered is to determine when one has inequalities of the form

[TABLE]

and

[TABLE]

for all Borel sets $E\subset\Omega$ , where $s>0$ is a fixed real number and $c_{\mu,s}$ and $c_{\mu,s}^{\prime}$ are nonnegative constants which do not depend on $E$ . The cases $c_{\mu,s}=0$ , $c_{\mu,s}^{\prime}=0$ , and $\mu=0$ are uninteresting; to avoid these exceptions, a nonnegative Borel measure $\mu$ on $\Omega$ will be said to satisfy (9) or (10) nontrivially when $\mu$ is not the zero measure and the corresponding inequality holds with a strictly positive constant. Both (9) and (10) will be called nonconcentration inequalities.

The first significant result for nonconcentration inequalities establishes the fundamental equivalence of (9) and (10):

Theorem 2.

For any nonnegative Borel measure $\mu$ and any $s>0$ , $\mu$ satisfies (9) with positive constant if and only if $\mu$ satisfies (10) with positive constant. Moreover, if one defines $||\mathcal{A}||_{\mu,s}$ to be the supremum of all nonnegative $c_{\mu,s}$ such that (9) holds for all Borel sets $E\subset\Omega$ and likewise defines $||\mathcal{S}||_{\mu,s}$ to be the supremum of all $c_{\mu,s}^{\prime}$ satisfying (10) for all Borel $E\subset\Omega$ , then

[TABLE]

where the implicit constant depends only on on $(n,k,s,\deg\Phi)$ .

The value of Theorem 2 is that the nonconcentration functional $\mathcal{S}$ is generally much easier to calculate and estimate than $\mathcal{A}$ . In particular, it is possible to characterize existence of nontrivial measures $\mu$ satisfying (10) in terms of a geometric measure-theoretic generalization of Hausdorff measure and a corresponding generalization of Frostman’s Lemma. In in the key “dimension” for this measure, it is also possible to deduce detailed information about the Radon-Nykodym derivative of this generalized Hausdorff measure with respect to Lebesgue measure. When combined with Theorem 2, this gives an explicit calculation which can be carried out to verify the hypothesis (4). Some of the most important results in this direction are summarized in the following theorem.

Theorem 3.

For any Borel set $E\subset\Omega$ and any $\sigma>0$ , the $\sigma$ -dimensional weighted $\Phi$ -Hausdorff measure of $E$ is defined to equal the quantity

[TABLE]

Then the following statements are true:

If $\sigma>n/q$ , then $\lambda^{\sigma}_{\Phi}(\Omega)=0$ . There are no Borel measures $\mu$ satisfying (10) nontrivially when $s=1/\sigma$ . 2. 2.

If $\sigma\leq n/q$ , then there is a Borel measure $\mu$ satisfying (10) nontrivially with $s=1/\sigma$ if and only if $\lambda^{\sigma}_{\Phi}(\Omega)>0$ . 3. 3.

If $\sigma=n/q$ , $\lambda^{\sigma}_{\Phi}$ is absolutely continuous with respect to Lebesgue measure and there is an explicit estimate (see (44)) for the pointwise magnitude of the Radon-Nykodym derivative. Moreover

[TABLE]

for any Borel set $E$ and any nonnegative Borel measure $\mu$ satisfying (10), with implicit constants depending only on $(n,k,q,\deg\Phi)$ . In other words, the measure $\lambda^{n/q}_{\Phi}$ satisfies (10) itself and is, up to a multiplicative constant, the largest such measure.

1.2 Examples

It is worthwhile to briefly examine the implications of Theorem 3 in some familiar and unfamiliar settings.

Example 1 (Hausdorff measure). When $\Phi(x,y):=x-y$ for $x,y\in{\mathbb{R}}^{n}$ , $\mathcal{S}(E)$ is the diameter of $E$ and $\lambda^{\sigma}_{\Phi}$ is equal to the classical $\sigma$ -dimensional Hausdorff measure $\mathcal{H}^{\sigma}$ (see Federer [federer1969, 2.10.24]). The order of vanishing $q$ is simply $1$ . The first inequality of (13) states that

[TABLE]

In its sharp form with optimal constant, this is known as the isodiametric inequality [federer1969, 2.10.33]. Likewise, if $\mu$ is any nonnegative Borel measure satisfying

[TABLE]

for every Borel set $E\subset\Omega$ , then (13) implies that $\mu(E)\lesssim|E|$ . Thus Lebesgue measure on ${\mathbb{R}}^{n}$ is, up to a constant, the largest measure on ${\mathbb{R}}^{n}$ satisfying an isodiametric inequality (14). It should also be noted that the inequality (14) is, modulo the constant, equivalent to the upper Ahlfors regularity condition

[TABLE]

for all Euclidean balls $B_{r}(x)\subset{\mathbb{R}}^{n}$ , since every set $E$ of bounded diameter is contained in a ball of comparable diameter by virtue of Jung’s Theorem [federer1969].

Example 1 $\prime$ (Hausdorff measure). To generalize the first example, suppose that $\gamma:{\mathbb{R}}^{p}\rightarrow{\mathbb{R}}^{n}$ , $p<n$ , is any locally injective polynomial function and set $\Phi(x,y):=\gamma(x)-\gamma(y)$ . Locally the measure $\lambda^{p}_{\Phi}$ on ${\mathbb{R}}^{p}$ pushes forward to equal exactly the $p$ -dimensional Hausdorff measure on ${\mathbb{R}}^{n}$ restricted to the image of $\gamma$ . Because the multiplicity of images of $\gamma$ is bounded in terms of the degree, the measures must be comparable globally as well. The order of vanishing $q$ is still $1$ , and by (13), it follows that the $p$ -dimensional Hausdorff measure on the image of $\gamma$ also satisfies an isodiametric inequality on ${\mathbb{R}}^{n}$ , i.e.,

[TABLE]

Such an inequality can only hold in general because $\gamma$ is polynomial; if $\gamma$ were merely $C^{\infty}$ it is easy to construct a highly oscillatory curve, for example, with infinite length inside a ball of finite radius. It is also worth noting that up to multiplicative constants, the measure $\mathcal{H}^{p}$ restricted to the image of $\gamma$ is essentially the largest measure satisfying the $p$ -dimensional upper Ahlfors regularity condition equivalent to (15).

Example 2 (Determinantal measure). An interesting nontrivial example on the space of $n\times n$ matrices is to set $\Phi(A_{1},A_{2}):=\det(A_{1}-A_{2})$ for any $A_{1},A_{2}\in{\mathbb{R}}^{n\times n}$ . The order of vanishing $q$ equals $n$ . Using the the estimate (44) for the magnitude of the Radon-Nykodym derivative $d\lambda^{n}_{\Phi}/dx$ , it will be shown (see Proposition 2) that $\lambda_{\Phi}^{n}$ is comparable to Lebesgue measure on ${\mathbb{R}}^{n\times n}$ . Thus, the first inequality of (13) becomes a determinantal isodiametric inequality for subsets of ${\mathbb{R}}^{n\times n}$ , namely,

[TABLE]

for all Borel sets $E\subset{\mathbb{R}}^{n\times n}$ . The implications of this inequality for a corresponding Radon-like operator are detailed in Section 6.

Example 3 (Affine measure). For $\gamma$ as in Example 1′, let

[TABLE]

where the determinant of an ordered list of $n$ vectors in ${\mathbb{R}}^{n}$ is defined to equal the determinant of the $n\times n$ matrix whose $j$ -th column contains the ordered coordinates of the $j$ -th vector in the standard basis. The measure $\lambda^{\sigma}_{\Phi}$ pushes forward to a measure on the graph of $\gamma$ which is is dominated by D. Oberlin’s affine measure of dimension $n\sigma$ [oberlin2003] up to a uniform multiplicative constant; while it is not clear that these two measures are comparable in all cases, it is a consequence of later arguments in this paper that the measures must be comparable when $\sigma=p/q$ . For this particular value of $\sigma$ , $\lambda^{\sigma}_{\Phi}$ is comparable to the recently-defined affine hypersurface measure [gressman2017], which is the optimal measure satisfying Oberlin’s affine curvature condition

[TABLE]

for all boxes $R\subset{\mathbb{R}}^{n}$ of arbitrary orientation. Similar to the Hausdorff measure and the upper Ahlfors regularity condition, the Oberlin condition (16) is in fact equivalent to the a priori stronger inequality (10) (see Section 3.2).

Example 4 (Projective Measure on Forms). When the underlying space is taken to be the decomposable444Here “decomposable” means expressible as an $r$ -fold wedge product of $1$ -vectors. $r$ -vectors in $\Lambda^{r}({\mathbb{R}}^{rk})$ for positive integers $r$ and $k$ , let

[TABLE]

(where diameter is with respect to any metric inducing the usual topology). The form $\omega(t,x)$ defined by (2) is always decomposable (see Sections 2 and 6); if $t\mapsto\omega(t,x)$ is locally injective for each $x$ , then the push forward of the measure $\lambda_{\Phi_{x}}^{\sigma}$ on ${\mathbb{R}}^{n}$ to the graph of $\omega(\cdot,x)$ will be comparable to the restriction of $P^{\sigma}$ to the same graph. If $q$ is the smallest integer such that $\Phi_{x}(t_{1},\ldots,t_{k})$ vanishes to order $q$ on the diagonal for some $x$ , then setting

[TABLE]

for an appropriate constant $c$ depending only on $(n,q,N_{1},N_{2},\deg\gamma)$ yields the inequality (4) with $s=q/n$ by Theorem 2 together with the fact that

[TABLE]

when $\Omega_{x}$ is the set where the Radon-Nykodym derivative $d\lambda^{n/q}_{\Phi_{x}}/dt$ exceeds $c\delta^{n/q}$ .

1.3 Structure of the paper

Section 2 is a self-contained proof of Theorem 1 using a combinatorial approach much like earlier work on uniform sublevel Radon-like inequalities and averages over $n$ -dimensional submanifolds of ${\mathbb{R}}^{2n}$ [gressman2013, gressman2015]. Section 3 contains a proof of Theorem 2 using elementary convex geometry as via Lemma 1,a n earlier version of which appears in work on affine submanifold measures [gressman2017]. This section also contains some basic GMT observations about $\Phi$ -Hausdorff and weighted $\Phi$ -Hausdorff measures which will be used in the proof of Theorem 3. In particular, Section 3.2 contains a proof of the relevant generalization of Frostman’s lemma, which is a rather direct reinterpretation of Howroyd’s proof as appearing in Mattila’s book [mattila]. Section 4 provides the bulk of the proof of Theorem 3. The case $\sigma<n/q$ is essentially an immediate consequence of Lemma 2, while the case $\sigma\geq n/q$ relies on a scaling argument to show that $\Phi$ -Hausdorff measure of dimension $\sigma$ must be absolutely continuous with respect to Lebesgue measure and to consequently estimate the Radon-Nykodym derivative. At this point, the remaining portions of Theorem 3 are reduced to establishing Theorem 4, which gives an explicit construction for any $s$ of a measure (possibly zero) satisfying (10). The proof of Theorem 4 is then reduced to proving Lemma 3 (see also [gressman2017]), which is the content of Section 5. As a part of the proof of Lemma 3, Section 5 also identifies the underlying intrinsic geometric objects which play an important algebraic role in the lemma and relate closely to earlier geometric sublevel set estimates [gressman2010II]. Finally, Section 6 gives some example applications of Theorem 1 which correspond to the GMT examples from Section 1.2.

2 Proof of Theorem 1

Proof of Theorem 1.

As defined in the introduction, suppose that $\gamma(t,x)$ is a polynomial map from ${\mathbb{R}}^{n}\times{\mathbb{R}}^{N_{2}}$ into ${\mathbb{R}}^{N_{1}}$ . Let $r:=N_{1}-n$ , and suppose that $N_{2}=rk$ for some integer $k$ . The basic structure of this proof is to estimate the quantity

[TABLE]

from below and above, where $\Phi_{x}(t_{1},\ldots,t_{k})$ is defined to be the Jacobian determinant of the map $(x,t_{1},\ldots,t_{k})\mapsto(\gamma(t_{1},x),\ldots,\gamma(t_{k},x))$ . The main upper bound for $Q(F)$ comes from the change of variables formula and Bézout’s Theorem: for any $(u_{1},\ldots,u_{k})\in({\mathbb{R}}^{N_{1}})^{k}$ , since $N_{1}k=N_{2}+nk$ , Bézout’s Theorem guarantees that the number of connected components in ${\mathbb{C}}^{N_{1}k}$ of the solution set of the system of equations

[TABLE]

is at most the product of the degrees of the polynomials (see Fulton [fulton1984, Chapter 8, Section 4]). This means that the number of real solutions of the system where the Jacobian is nonvanishing cannot exceed this same upper bound, since the nonvanishing of the Jacobian at a real solution guarantees that such a solution will be isolated in complex space as well. Now by the change of variables formula, if the number of solutions $(x,t_{1},\ldots,t_{k})$ of the system (19) inside the domain of the integral $Q(F)$ is never greater than $N$ for any choice of $(u_{1},\ldots,u_{k})$ , then

[TABLE]

Without loss of generality, it may be assumed that Jacobian determinant is nonvanishing at every counted solution of the system (since the integral on the set where $|\Phi_{x}(t_{1},\ldots,t_{k})|=0$ is necessarily zero), i.e., $N$ need only bound the number of isolated solutions of (19) for a given right-hand side $(u_{1},\ldots,u_{k})$ , which Bézout’s Theorem guarantees is bounded by the product of degrees.

To estimate (18) from below, recall the definition (2) of the form $\omega$ . The key fact to establish is that the functional $\Phi_{x}$ is indeed the Jacobian determinant of the map $(x,t_{1},\ldots,t_{k})\mapsto(\gamma(t_{1},x),\ldots,\gamma(t_{k},x))$ , i.e., that

[TABLE]

To prove (21), first observe that the Jacobian matrix has block structure

[TABLE]

where $\partial\gamma/\partial x$ is an $N_{1}\times N_{2}$ block of partial derivatives of $\gamma$ (with the coordinates of $\gamma$ corresponding to rows and the partial derivatives in the coordinate directions of $x$ corresponding to columns) and $\partial\gamma/\partial t$ is a corresponding $N_{1}\times n$ block of partial derivatives. To simplify the determinant of the matrix (22), label the coordinates of $t_{j}$ as $(t_{j1},\ldots,t_{jn})$ . It will be necessary to use the identity

[TABLE]

where one defines

[TABLE]

and observes of the remainder $E_{j}$ that it is spanned by all $N_{1}$ -fold wedge products of $dx_{1},\ldots,$ $dx_{N_{2}}$ , $dt_{j1},\ldots,$ $dt_{jn}$ which omit $dt_{ji}$ for at least one index $i\in\{1,\ldots,n\}$ . The proof of the identity is essentially immediate after observing that when computing the correct coefficient of $dx_{i_{1}}\wedge\cdots\wedge dx_{i_{r}}$ in $\omega_{j}$ , it suffices to assume that $a_{ji}=0$ for $i\neq i_{1},\ldots,i_{r}$ .

To use the identity (23), first express the determinant as the coefficient of $dx_{1}\wedge\cdots\wedge dx_{N_{2}}\wedge dt_{11}\wedge\cdots\wedge dt_{1n}\wedge\cdots\wedge dt_{k1}\wedge\cdots\wedge dt_{kn}$ in an $(N_{2}+kn)$ -fold wedge product of one forms with coefficients drawn from the rows of the block-form matrix (22). The wedge of the forms in the $j$ -th block of rows is given by (23) when each coefficient $a_{ii^{\prime}}$ is replaced the $(i,i^{\prime})$ -entry of the matrix $({\partial\gamma}/{\partial x})(t_{j},x)$ and each coefficient $b_{ii^{\prime}}$ is replaced the $(i,i^{\prime})$ -entry of the matrix $({\partial\gamma}/{\partial t})(t_{j},x)$ . In particular, this yields the identity $\omega_{j}=\omega(t_{j},x)$ . To compute the Jacobian determinant (21), it suffices to take the wedge of the expressions (23) over $j=1,\ldots,k$ and show that the remainders $E_{j}$ do not influence the coefficient of $dx_{1}\wedge\cdots\wedge dx_{N_{2}}\wedge dt_{11}\wedge\cdots\wedge dt_{1n}\wedge\cdots\wedge dt_{k1}\wedge\cdots\wedge dt_{kn}$ . Because the variables $t_{j}$ appear only in the $j$ -th block of rows, there is only one way for $dt_{j1}\wedge\cdots\wedge dt_{jm}$ to be a factor in the full wedge product: it must appear explicitly in a corresponding term of (23). In other words, when taking the wedge over all $j$ , any wedge product including an $E_{j}$ will not contain all $n$ factors $dt_{j1},\ldots,dt_{jn}$ . In the place of the missing $dt_{ji}$ , every term of $E_{j}$ must necessarily contain more than $r$ factors drawn from $dx_{1},\ldots,dx_{N_{2}}$ . Since every term of the wedge product (23) must contain at least $r$ factors drawn from $dx_{1},\ldots,dx_{N_{2}}$ , it follows by the pigeonhole principle that in the full $k$ -fold wedge product representing the determinant (22), when expanded by multilinearity, any term including $E_{j}$ must be expressible as a sum of wedge products with at least one duplicate $dx_{i}$ . Thus (21) must hold.

It is worth pausing briefly to make the observation that $\omega$ must be decomposable. First note that the form $\omega$ as defined by (2) is independent of the chosen coordinate systems on ${\mathbb{R}}^{N_{2}}$ and ${\mathbb{R}}^{n}$ . If $t\mapsto\gamma(t,x)$ does not have injective differential, then $\omega(t,x)$ vanishes. Thus, when $\omega$ is nonzero, the dimension of the quotient ${\mathbb{R}}^{N_{1}}$ modulo the image of the differential $d_{t}\gamma(t,x)$ always has dimension $r=N_{1}-n$ . The image of the differential $d_{x}\gamma(t,x)$ in this quotient space is therefore at most $r$ -dimensional, meaning that whenever $\omega(t,x)$ is not zero, it is always possible to choose a coordinate system near any given $x$ for which ${\partial\gamma}/{\partial x_{i}}$ belongs to the span of the $t$ partial derivatives of $\gamma$ whenever $i>r$ . Computing the form (2) in these coordinates shows that $\omega$ must be a multiple of $dx_{1}\wedge\cdots\wedge dx_{r}$ and is therefore decomposable. Moreover, it follows that $\omega(t_{1},x)\wedge\omega(t_{2},x)$ vanishes to at least order $r$ when $t_{1}=t_{2}$ and $\omega(t_{1},x)\neq 0$ . This then implies that $\Phi_{x}(t_{1},\ldots,t_{k})$ vanishes to order at least $r(k-1)$ on the diagonal $\Delta$ at all points where $\omega(t,x)\neq 0$ .

Returning to (18), fix a Borel measurable set $F\subset{\mathbb{R}}^{N_{1}}$ . By (20),

[TABLE]

where the implicit constant can be taken to equal the maximum number of isolated solutions $(x,t_{1},\ldots,t_{k})$ of the system $(\gamma(t_{1},x),\ldots,\gamma(t_{k},x))=(u_{1},\ldots,u_{k})$ as $u_{1},\ldots,u_{k}$ range over ${\mathbb{R}}^{N_{1}}$ . Defining $F_{x}\subset{\mathbb{R}}^{m}$ to equal

[TABLE]

(which will be a Borel subset of ${\mathbb{R}}^{n}$ since $\gamma$ is a continuous function of $t$ ), it follows by Fubini that

[TABLE]

By the main hypothesis (4) of Theorem 1, it must be the case that

[TABLE]

since for each $x$ , $F_{x}\times\{x\}\subset\widetilde{\Omega}$ . However, by the definition (1) of the Radon-like operator $T$ ,

[TABLE]

for each $x$ . Inserting this equality into (24) and raising both sides to the power $1/(k+s)$ gives the conclusion (5) of Theorem 1. ∎

As a final remark concerning the proof, it should be noted that the constraint that $r=N_{1}-n$ divides $N_{2}$ is only used in proving the upper bound for (18) via the change of variables formula. As weighted nonlinear Brascamp-Lieb inequalities (generalizing the results of Bennett, Carbery, Christ, and Tao [bcct2008, bcct2010]) ultimately become available, it will be possible to remove the divisibility constraint at the cost of changing the definition of $\Phi_{x}$ to correspond to the correct weight for that context.

3 Proof of Theorem 2 and basic measure inequalities

3.1 Proof of Theorem 2

The proof of Theorem 2 begins with the following lemma, which generalizes Tchebyshev’s inequality to finite dimensional vector spaces of functions. The heart of this generalization is to show that there exists a single set of controlled measure outside of which all functions in the vector space are uniformly bounded (when properly normalized). It extends earlier results for single-variable polynomials [gressman2009] and real analytic functions [gressman2017, Lemma 3]. Although it will only be applied to Borel measures, measurability in the lemma may be taken with respect to any abstract $\sigma$ -algebra.

Lemma 1.

Suppose $\mu$ is a positive measure on some space $X$ and $\mathcal{F}$ is a $d$ -dimensional real vector space of measurable functions from $X$ into some vector space with norm $|\cdot|$ . Then for any $\tau>0$ , there is a measurable set $E_{\tau}\subset X$ such that $\mu(X\setminus E_{\tau})<\tau^{-1}$ for which every $f\in\mathcal{F}$ satisfies the inequality

[TABLE]

Proof.

The inequality (25) is vacuously true for any $f\in{\mathcal{F}}$ (regardless of $\tau$ and $E_{\tau}$ ) for which the integral on the right-hand side is infinite. It therefore suffices to prove (25) for the subspace of those $f\in\mathcal{F}$ for which the integral is finite (the triangle inequality guarantees that such functions are indeed a vector space). Since this subspace also has dimension at most $d$ , we may assume without loss of generality that every $f\in\mathcal{F}$ is $\mu$ -integrable.

Next, let $\mathcal{F}_{0}$ be the subspace consisting of all $f\in\mathcal{F}$ such that $\int|f|d\mu=0$ . If $\mathcal{F}_{0}$ is nontrivial, let $\{h_{1},\ldots,h_{\ell}\}$ be a basis of $\mathcal{F}_{0}$ and define

[TABLE]

Because $\mathcal{F}_{0}$ is a finite-dimensional vector space (by the triangle inequality again) and because each basis element $h_{i}$ vanishes identically on $X\setminus X_{0}$ , every $f\in\mathcal{F}_{0}$ is identically zero on $X\setminus X_{0}$ . Furthermore $\mu(X_{0})=0$ ; this follows because

[TABLE]

so by the Monotone Convergence Theorem and Tchebyshev’s inequality,

[TABLE]

If $\mathcal{F}_{0}$ happens to be trivial, set $X_{0}:=\emptyset$ .

Now let $\mathcal{F}_{1}$ be any subspace of $\mathcal{F}$ which has trivial intersection with $\mathcal{F}_{0}$ and satisfies $\mathcal{F}=\mathcal{F}_{0}+\mathcal{F}_{1}$ . If $\mathcal{F}_{1}$ is trivial, then (25) holds because $\mathcal{F}=\mathcal{F}_{0}$ and consequently fixing $E_{\tau}:=X\setminus X_{0}$ gives $\mu(X\setminus E_{\tau})=0$ and $\sup_{x\in E_{\tau}}|f(x)|=0$ for all $f\in\mathcal{F}$ . Thus it may be assumed that the dimension of $\mathcal{F}_{1}$ equals $d_{1}\in\{1,\ldots,d\}$ . Define $S$ to be the set of all $f\in\mathcal{F}_{1}$ such that

[TABLE]

The mapping $f\mapsto\int|f|d\mu$ is continuous with respect to the vector space topology, and because $\mathcal{F}_{0}\cap\mathcal{F}_{1}$ is trivial, $f\mapsto\int|f|d\mu$ is a norm on $\mathcal{F}_{1}$ , which implies that $S$ must be compact. Fix $\det$ to be any nonzero alternating $d_{1}$ -linear functional on $\mathcal{F}_{1}$ . By continuity and compactness, $|\det(f_{1},\ldots,f_{d_{1}})|$ attains its maximum for some $(f_{1},\ldots,f_{d_{1}})\in S^{d_{1}}$ . Note also that the value of the maximum cannot be zero, since by scaling this would force $\det$ to be identically zero. By Cramer’s rule, for any $f\in S$ ,

[TABLE]

where the circumflex $\widehat{\cdot}$ indicates that $f_{j}$ is omitted from the sequence of arguments of $\det$ . In particular, by the choice of the functions $f_{1},\ldots,f_{d_{1}}$ , the coefficient of each $f_{j}$ in this expansion of $f$ has magnitude at most one. By the triangle inequality and scaling, then, it follows that

[TABLE]

for any $f\in\mathcal{F}_{1}$ and any $x\in X$ . Now for any $\tau>0$ , fix

[TABLE]

By Tchebyshev’s inequality,

[TABLE]

note in particular that the first inequality is strict because

[TABLE]

for each $x\in X\setminus X_{0}$ . Equality of the integrals over $X\setminus X_{0}$ would force equality of the two functions $\mu$ -almost everywhere on $X\setminus X_{0}$ , which would then force $\mu(X\setminus X_{0})=0$ , meaning ultimately that $\mu=0$ and $\mathcal{F}_{1}=\{0\}$ , which has already been handled. Taking a supremum of the inequality (26) over all $x\in E_{\tau}$ gives

[TABLE]

for any $f\in\mathcal{F}_{1}$ . Since every $f\in\mathcal{F}$ must equal $f_{0}+f_{1}$ for some $f_{0}\in\mathcal{F}_{0}$ and $f_{1}\in\mathcal{F}_{1}$ and since $f_{0}$ is identically zero on the given $E_{\tau}$ , the fact that (25) holds for $f_{1}$ immediately implies that it holds for $f$ as well. ∎

Before applying this lemma to the proof of Theorem 2, a brief remark is in order. Although the set $E_{\tau}$ given by (27) is only described as measurable, this is generally an understatement; if the functions of $\mathcal{F}$ are all continuous, then $E_{\tau}$ is closed; if every $f\in\mathcal{F}$ is a polynomial, the sets $E_{\tau}$ are semialgebraic since they take the form

[TABLE]

for functions $f_{j},h_{i}\in\mathcal{F}$ which in this case are polynomials of bounded degree.

Proof of Theorem 2.

The proof follows rather directly from Lemma 1. Without loss of generality, it may be assumed that $\mu$ is not the zero measure on $\Omega$ , since in this case $||\mathcal{A}||_{\mu,s}=||\mathcal{S}||_{\mu,s}=\infty$ . In all other cases, $||\mathcal{A}||_{\mu,s}$ and $||\mathcal{S}||_{\mu,s}$ must be finite. First observe that

[TABLE]

for any measurable set $E$ with nonzero $\mu$ -measure since the integrand of $\mathcal{A}(E)$ is pointwise dominated by $\mathcal{S}(E)$ on $E^{k}$ . Consequently, for any such $E$ ,

[TABLE]

which then implies that $||\mathcal{A}||_{\mu,s}\leq||\mathcal{S}||_{\mu,s}$ . To prove the remaining inequality of (11), one applies Lemma 1 with the vector space $\mathcal{F}$ being real-valued polynomials of degree at most $\deg\Phi$ . If $m>1$ , then an arbitrary and unspecified norm $|\cdot|$ has been fixed as well; let $K^{*}$ be the unique symmetric, compact, convex subset of ${\mathbb{R}}^{m}$ such that

[TABLE]

for all $v\in{\mathbb{R}}^{m}$ , where $\cdot$ is the usual dot product. When the inequality (25) is applied iteratively in conjunction with Fubini’s Theorem, this establishes the chain of inequalities

[TABLE]

for any $y_{1},\ldots,y_{k}\in\Omega$ , where $C$ is the dimension of $\mathcal{F}$ , which depends only on $n$ and $\deg\Phi$ . Taking a supremum over $\ell\in{\mathbb{R}}^{m}$ and $y_{1},\ldots,y_{k}$ and assuming that (10) holds gives that

[TABLE]

for any $\tau>1/\mu(E)$ . If $\mu(E)\in(0,\infty)$ , fixing $\tau:=2/\mu(E)$ gives that

[TABLE]

If $\mu(E)=0$ or $\mu(E)=\infty$ , then the inequality immediately above still holds since it is trivial when $\mu(E)=0$ and since the right-hand side of (29) is infinite for any positive $\tau$ when $\mu(E)=\infty$ . Therefore the inequality holds for all $E$ , meaning that

[TABLE]

which completes the main assertion (11) of Theorem 2. In particular, the constant depends only on $(n,k,s,\deg\Phi)$ and not on $\mu$ or the norm on ${\mathbb{R}}^{m}$ . ∎

3.2 Basic GMT inequalities and Frostman’s Lemma

In this section, the focus returns to Theorem 3. The goal for the moment is to lay out some basic geometric measure theory which underlies the analytic inequality (10). To that end, given a general polynomial $\Phi:\Omega^{k}\rightarrow{\mathbb{R}}^{m}$ vanishing to order $q\geq 1$ on the diagonal as the introduction, for any $\sigma>0$ and any $E\subset\Omega$ , let

[TABLE]

To be clear, one need not assume that the sets $E_{i}$ have any regularity, but there is no loss of generality in requiring that each $E_{i}$ be Borel or even closed since continuity of $\Phi$ implies that $\mathcal{S}$ assigns the same value to $E_{i}$ and its closure $\overline{E_{i}}$ . The quantity $\mathcal{H}^{\sigma}_{\Phi}$ will be called the $\Phi$ -Hausdorff measure of dimension $\sigma$ , and as already defined in Theorem 3, $\lambda^{\sigma}_{\Phi}$ is called the weighted $\Phi$ -Hausdorff measure of dimension $\sigma$ . Note that $\mathcal{H}^{\sigma}_{\Phi}$ is a special case of the Carathéodory construction (see Federer [federer1969] and Mattila [mattila]), while $\lambda^{\sigma}_{\Phi}$ generalizes the measure that Howroyd [howroyd1995] calls the weighted Hausdorff measure. Just as in the definition of the classical Hausdorff measure, the quantities (30) and (31) both define metric outer measures on $\Omega$ and therefore restrict to well-defined measures on the Borel sets; see Folland [folland, Proposition 11.6].

The most basic inequalities satisfied by these quantities are that

[TABLE]

for any Borel set $E$ and any nonnegative Borel measure $\mu$ . The first inequality follows because

[TABLE]

The latter inequality of (32) follows simply because the infimum (31) is taken over a strictly larger set than (30). It is natural to ask when the measures $\lambda^{\sigma}_{\Phi}$ and $\mathcal{H}^{\sigma}_{\Phi}$ are equal or comparable. For the classical Hausdorff measure equality is known (see Federer [federer1969]), but for general measures this need not be the case. In the context of this present paper, the arguments of Section 4 will establish comparability in the range $\sigma\geq n/q$ (although both measures are trivial when the inequality is strict). Beyond this observation, the question of comparability of $\lambda^{\sigma}_{\Phi}$ and $\mathcal{H}^{\sigma}_{\Phi}$ in the regime $\sigma<n/q$ will for now remain unexplored.

The measure $\lambda^{\sigma}_{\Phi}$ holds fundamental significance in the study of nonconcentration inequalities because it characterizes, via a generalization of Frostman’s Lemma, the existence of nontrivial measures $\mu$ satisfying such inequalities.

Lemma 2.

Fix any $\sigma>0$ . There exists a nontrivial positive Borel measure $\mu$ on the compact set $K\subset\Omega\subset{\mathbb{R}}^{n}$ satisfying

[TABLE]

for all Borel sets $E\subset K$ if and only if $\lambda^{\sigma}_{\Phi}(K)>0$ .

Proof.

The proof follows Howroyd’s proof [howroyd1995] of Frostman’s Lemma as given by Mattila [mattila, Theorem 8.17]. By (32), the existence of nontrivial $\mu$ automatically guarantees that $\lambda_{\Phi}^{\sigma}(K)>0$ . Conversely, for any function $f$ on $K$ , let

[TABLE]

For any continuous functions $f,g$ on $K$ , it is elementary to check that

[TABLE]

It is also true that $p_{\sigma,\delta}(g)=0$ for every nonpositive function $g$ . Thus

[TABLE]

Consequently by the Hahn-Banach Theorem, there must exist a linear functional $L$ defined on the space $C^{0}(K)$ of continuous functions on $K$ such that $L(\chi_{K})=p_{\sigma,\delta}(\chi_{K})$ and $L(f)\leq p_{\sigma,\delta}(f)$ for any continuous function $f$ . If $f$ is nonnegative, $0=-p_{\sigma,\delta}(-f)\leq L(f)$ as well, so $L$ is a positive linear functional on $C^{0}(K)$ . By the Riesz Representation Theorem, there must be a nonnegative Borel measure $\mu_{0}$ on $K$ such that

[TABLE]

Now if $E$ is any Borel set with diameter smaller than $\delta$ , let $f_{j}$ be a sequence of functions in $C^{0}(K)$ which are identically $1$ on a neighborhood of $E$ , bounded above by one everywhere, and vanish outside the set $E_{j}$ of points distance at most $1/j$ from $E$ . Then

[TABLE]

where the last inequality follows because $\Phi$ is a polynomial and therefore continuous. Finally, if $\lambda_{\Phi}^{\sigma}(K)>0$ , then there must be some positive $\delta$ such that $p_{\sigma,\delta}(\chi_{K})>0$ . For this fixed value of $\delta$ , $\mu_{0}$ must be nonzero. By subdividing ${\mathbb{R}}^{n}$ into nonoverlapping boxes, there must be a dyadic box $B$ of diameter less than $\delta$ such that $\mu_{0}(B)>0$ . Now define the measure $\mu$ by $\mu(E):=\mu_{0}(E\cap B)$ . It follows that $\mu(\Omega)=\mu_{0}(B)>0$ and for any Borel set $E\subset\Omega$ of any diameter,

[TABLE]

as desired. ∎

It is worthwhile to explicitly connect Lemma 2 to D. Oberlin’s affine measure and affine curvature condition (16). It was observed by D. Oberlin [oberlin2003] and others that any measure $\mu$ on ${\mathbb{R}}^{n}$ satisfying either a nontrivial Fourier restriction inequality or $L^{p}$ -improving convolution inequality must satisfy the inequality

[TABLE]

for some $\sigma>0$ as $R$ ranges over all boxes in ${\mathbb{R}}^{d}$ of arbitrary orientations, i.e., all sets of points which may be expressed as products of finite intervals with respect to some orthogonal coordinates on ${\mathbb{R}}^{n}$ . In analogy with Oberlin’s affine measure555Note that Oberlin adjusts the exponent $\sigma$ so that the affine dimension of ${\mathbb{R}}^{n}$ is $n$ , but by the present convention, the dimension is always $1$ ., let

[TABLE]

be called the $\sigma$ -dimensional weighted affine Hausdorff measure. This weighted affine Hausdorff measure is trivially dominated by Oberlin’s affine measure of dimension $n\sigma$ . In this setting, Lemma 2 has the following consequences:

Corollary 1.

Suppose $K\subset{\mathbb{R}}^{n}$ is compact and fix any $\sigma>0$ . Then $K$ admits a nontrivial positive Borel measure $\mu$ satisfying the Oberlin affine curvature condition (34) if and only if the $\sigma$ -dimensional weighted affine Hausdorff measure of $K$ is nonzero. In particular, if ${\mathcal{A}}^{\sigma}_{w}(K)=0$ implies that for any exponents $p_{1},p_{2},r_{1},r_{2}\in[1,\infty]$ satisfying

[TABLE]

neither of the inequalities

[TABLE]

(where $\widehat{f}$ denotes the Fourier transform) hold uniformly in $f$ for any nontrivial positive Borel measure $\mu$ supported on $K$ .

Proof.

Using Oberlin’s earlier calculations [oberlin2003, Proposition 2], it suffices to set $\Phi(x_{1},\ldots,x_{n+1}):=\det(x_{1}-x_{n+1},\ldots,x_{n}-x_{n+1})$ as noted in the introduction and show that the Oberlin affine curvature condition (34) is equivalent to (33) modulo constants and that $\mathcal{A}^{\sigma}_{w}\approx\lambda^{\sigma}_{\Phi}$ . Both facts are quickly established by showing that for any bounded Borel set $E\subset{\mathbb{R}}^{n}$ there is a box $R$ such that $E\subset R$ and

[TABLE]

with implicit constants depending only on dimension. Because taking the closure of $E$ does not change the supremum, it may be assumed without loss of generality that $E$ is compact and one may fix an ensemble $x_{1},\ldots,x_{n+1}$ which achieves the supremum of $|\Phi|$ on $E^{n+1}$ . If the supremum is zero, then necessarily the span of all vectors $x-x_{n+1}$ as $x$ ranges over $E$ must have dimension strictly less than $n$ , which implies that $E$ lies in an affine hyperplane. By boundedness of $E$ , this implies that $E$ is contained in a (degenerate) box $R$ of volume zero. Otherwise the supremum is strictly positive, and by the same argument appearing in the proof of Lemma 1, it must be the case for any $x\in E$ that

[TABLE]

for constants $c_{j}\in[-1,1]$ . The set of all such points having such an expansion is an affine image of the box $[-1,1]^{n}$ and consequently has Lebesgue measure $2^{n}|\det(x_{1}-x_{n+1},\ldots,x_{n}-x_{n+1})|=2^{n}\mathcal{S}(E)$ . By the John Ellipsoid Theorem, this same set of points must be contained in an ellipsoid of comparable volume, and that ellipsoid must trivially be contained in a box $R$ of comparable volume. Thus $E\subset R$ and $|R|\lesssim\mathcal{S}(E)$ as promised.

Using this conclusion, if (34) is assumed to hold, then for any bounded Borel set $E$ ,

[TABLE]

If $E$ is unbounded, we may write $E$ as the union of an increasing family $E_{j}$ of bounded Borel sets and then observe that

[TABLE]

Likewise it must clearly be the case that $\lambda_{\Phi}^{\sigma}\lesssim\mathcal{A}_{w}^{\sigma}$ since $|R|\approx\mathcal{S}(R)$ and since $\mathcal{A}_{w}^{\sigma}$ involves an infimum over a smaller class. However, for any bounded Borel sets $E_{i}$ such that $\sum_{j}c_{j}\chi_{E_{j}}\geq\chi_{E}$ for positive $c_{j}$ ’s, it is also true that $\sum_{j}c_{j}\chi_{R_{j}}\geq\chi_{E}$ for the distinguished rectangles $R_{j}$ containing each $E_{j}$ . Moreover,

[TABLE]

which implies that $\mathcal{A}_{w}^{\sigma}\approx\lambda^{\sigma}_{\Phi}$ . The corollary now follows from Lemma 2. ∎

4 Proof of Theorem 3

The most difficult case of Theorem 3 to establish is the case $\sigma=n/q$ . After the cases $\sigma<n/q$ and $\sigma>n/q$ are settled (the former using Lemma 2 and the latter using what amounts to a scaling argument), the proof of Theorem 3 is reduced to the related Theorem 4 and ultimately to Lemma 3.

4.1 The case $\sigma<n/q$

The proof of Theorem 3 in the case $\sigma<n/q$ is an almost immediate consequence of Lemma 2. First, supposing that there is a Borel measure $\mu$ satisfying (10) nontrivially with $s=1/\sigma$ , then $\lambda^{\sigma}_{\Phi}(\Omega)>0$ by virtue of (32) applied to the set $\Omega$ directly.

On the other hand, if $\lambda_{\Phi}^{\sigma}(\Omega)>0$ , then because $\Omega$ is an open subset of ${\mathbb{R}}^{n}$ , it may be written as a countable increasing union of compact sets. By the Monotone Convergence Theorem, at least one of these compact subsets $K$ must have $\lambda_{\Phi}^{\sigma}(K)>0$ as well. By Lemma 2, $K$ must admit a measure $\mu$ satisfying (10) nontrivially on $K$ ; extending $\mu$ to be zero on the complement of $K$ gives a measure $\mu$ on $\Omega$ which satisfies (10) nontrivially as well. In fact, it is worth noting that this argument works for any value of $\sigma$ . Consequently for any $s>0$ , $\mathcal{S}$ admits a Borel measure satisfying (10) nontrivially if and only if $\lambda^{1/s}_{\Phi}(\Omega)>0$ . The reason for the restriction, as will be seen momentarily, is simply that $\lambda^{\sigma}_{\Phi}(\Omega)=0$ if $\sigma>n/q$ .

4.2 The case $\sigma\geq n/q$ : Comparison to Lebesgue measure

The goal of this section is to establish that $\mathcal{H}^{\sigma}_{\Phi}$ must vanish when $\sigma>n/q$ and to further show when $\sigma=n/q$ that $\mathcal{H}^{\sigma}_{\Phi}$ must be absolutely continuous with respect to Lebesgue measure with an upper bound on the corresponding Radon-Nykodym derivative. Fix standard coordinates on $\Omega\subset{\mathbb{R}}^{n}$ . Let $\partial$ denote the $n$ -tuple of partial derivatives $(\partial_{1},\ldots,\partial_{n})$ in the coordinate directions. Furthermore, for any $T\in{\mathrm{GL}}(n,{\mathbb{R}})$ , $T^{*}\partial$ will denote the $n$ -tuple

[TABLE]

Assuming that $\Phi:\Omega^{k}\rightarrow{\mathbb{R}}^{m}$ is any smooth function which vanishes to order at least $q$ at every point $(x,\ldots,x)\in\Omega^{k}$ for every $x\in\Omega$ , the main inequality to be proved in this section is that for almost every $x\in\Omega$

[TABLE]

where $\alpha_{1},\ldots,\alpha_{k}$ are multiindices and the subscript $j$ in $(T^{*}\partial)^{\alpha}_{j}$ indicates that the partial derivatives are applied to the argument $x_{j}$ of $\Phi$ . The implicit constant in (35) will depend only on $k,n$ , and $q$ .

To begin this calculation, fix $\delta\in(0,\infty)$ and $T\in{\mathrm{GL}}(n,{\mathbb{R}})$ , and suppose that $u_{1},\ldots,u_{k}\in[-1,1]^{n}$ and that $K\geq 1$ is a positive integer. It must be the case by Taylor’s Theorem that

[TABLE]

for any $x^{\prime}$ belonging to any fixed compact subset of $\Omega\subset{\mathbb{R}}^{n}$ . Since $\Phi$ is smooth, the error term $O(K^{-q-1}\delta^{q+1})$ is uniform as $x$ ranges over any compact set and as $u_{1},\ldots,u_{k}$ vary inside the box $B:=[-1,1]^{n}$ . In particular, if $x$ is any fixed point in $\Omega$ and $x^{\prime}\in x+\delta TB$ , then by a second application of Taylor’s Theorem to the main term on the right-hand side of (36), it follows that

[TABLE]

for small $\delta$ and large $K$ , with an implicit constant depending only on $q,k$ , and $n$ , in contrast with the error terms, which may also depend on $x$ , $T$ , etc. From this inequality, it follows that if $C:=\{C_{1},\ldots,C_{K^{n}}\}$ is the covering of $x+\delta TB$ by the collection of $K^{n}$ boxes induced by subdividing $B$ into $K$ equal parts along each axis, then

[TABLE]

As $K\rightarrow\infty$ , the diameters of all sets in the covering $C$ go to zero, so taking this limit implies that

[TABLE]

and that

[TABLE]

where just as on previous lines, the implicit constant depends only on $q,k$ , and $n$ . When $\sigma>n/q$ , the equality (38) forces $\mathcal{H}^{\sigma}_{\Phi}(\Omega)=0$ since $\Omega$ is contained in a countable union of boxes $x+\delta TB$ with centers $x\in\Omega$ . By (32), this forces $\lambda^{\sigma}_{\Phi}(\Omega)=0$ as well and rules out the existence of any nontrivial Borel measure satisfying a nonconcentration inequality when $s=1/\sigma$ .

It now suffices to assume $\sigma=n/q$ . For any $x$ in a compact subset of $\Omega$ and any sufficiently small $\delta$ , it has been established that

[TABLE]

with implicit constant depending only on $q,k$ , and $n$ . To reiterate: the restriction of $x$ to a compact set influences the a priori size of the error term but not the implicit constant of (39). Because the maximum over $\alpha_{1},\ldots,\alpha_{k}$ is a locally bounded function of $x$ and because $\delta^{n(q+1)/q}/|x+\delta TB|\rightarrow 0$ as $\delta\rightarrow 0^{+}$ , it follows that for all sufficiently small $\delta$ and all $x$ in any compact set, there is a constant $C$ (depending on the compact set and the transformation $T$ as well as on $q,k$ , and $n$ ) such that $\mathcal{H}^{n/q}_{\Phi}(x+\delta TB)\leq C|x+\delta TB|$ . This inequality forces $\mathcal{H}_{\Phi}^{n/q}$ to be locally absolutely continuous with respect to Lebesgue measure since any set of Lebesgue measure zero can be covered by a countable union of boxes of this form whose measures sum to any prescribed small value. Now because $\mathcal{H}^{n/q}_{\Phi}$ is known to be absolutely continuous with respect to Lebesgue measure, the Radon-Nykodym derivative can be estimated pointwise almost everywhere by dividing both sides of (39) by $|x+\delta TB|$ and letting $\delta\rightarrow 0^{+}$ . The result is that for almost every $x\in\Omega$ ,

[TABLE]

Because the inequality is true uniformly in $T$ , one can take an infimum of the right-hand side over a countable dense subset of ${\mathrm{GL}}(n,{\mathbb{R}})$ to conclude that

[TABLE]

with some implicit constant depending only on $q,k$ , and $n$ . This is exactly the asserted inequality (35).

It is worth observing that by homogeneity and scaling (and permuting the order of the standard coordinates), it suffices to take the infimum in $T$ over the group ${\mathrm{SL}}(n,{\mathbb{R}})$ rather than ${\mathrm{GL}}(n,{\mathbb{R}})$ . It should also be mentioned that since the coordinate system used to derive (35) was essentially arbitrary, one could strengthen (35) a priori even further by taking an infimum on the right-hand side over all coordinate systems. However, this apparent strengthening of (35) is not an actual improvement in this case: since all lower-order derivatives vanish, it turns out that replacing the standard coordinate partial derivatives with partial derivatives in new coordinates leaves the value of the right-hand side of (35) unchanged. This coordinate independence will be a key point in the final stages of the proof of Theorem 3.

4.3 Multisystems and Theorem 3 with $\sigma=n/q$

The inequalities (32) and (35) just proved establish that for a given $\Phi$ , any measure $\mu$ satisfying (10) with $s=q/n$ must be absolutely continuous with respect to Lebesgue measure and must have a Radon-Nykodym derivative controlled (up to an implicit constant) by $||\mathcal{S}||_{q/n}^{-n/q}$ times the expression on the right-hand side of (35). The purpose of this section is to introduce some additional ideas which will be used to show that the upper bound given by (35) can be used to define a measure which also satisfies (10). To prove this fact, it turns out to be necessary to work with a slightly more elaborate expression and then to show that this new, more complicated expression happens to be comparable to the the right-hand side of (35).

The added complexity which is required is to replace the standard coordinate derivatives $\partial^{\alpha}$ by a broader family of differential operators which includes coordinate partial derivatives in all smooth coordinates as well as some slightly more general operators. The new object under consideration will be called a multisystem. A multisystem ${\boldsymbol{\partial}}$ on an open set $U$ is a collection of smooth vector fields $Y^{(i)}_{j}$ , $i=1,\ldots,N$ , $j=1,\ldots,n,$ where for each fixed $i$ , $\{Y^{(i)}_{j}\}_{j=1,\ldots,n}$ commute and are linearly independent at every point in $U$ . The integer $N$ will be called the size of ${\boldsymbol{\partial}}$ , and the class of all multisystems of size $N$ will be denoted ${\mathbb{M}}^{(N)}$ . For any finite sequence of the form $\alpha:\{1,\ldots,a\}\rightarrow\{1,\ldots,n\}$ with $a\leq N$ and any $n$ -tuple of vectors $X_{1},\ldots,X_{n}$ at the point $p$ , let

[TABLE]

where $Z^{(i)}_{\ell}$ is the unique constant-coefficient linear combination of $Y^{(i)}_{1},\ldots,Y^{(i)}_{n}$ which equals $X_{\ell}$ at the point $p$ . Such $\alpha$ will be called ordered multiindices in $n$ variables and $|\alpha|$ will be used to denote the order of differentiation of $(X\cdot{\boldsymbol{\partial}})^{\alpha}$ , which equals the cardinality of the domain of $\alpha$ . As in the previous section, $T\in{\mathrm{GL}}(n,{\mathbb{R}})$ will also act on these differential operators by defining

[TABLE]

and taking $(T^{*}X\cdot{\boldsymbol{\partial}})^{\alpha}:=((T^{*}X)\cdot{\boldsymbol{\partial}})^{\alpha}$ .

Since the remainder of this paper deals with measures on ${\mathbb{R}}^{n}$ which are absolutely continuous with respect to Lebesgue measure, it will be convenient to switch back and forth between analytic and geometric descriptions of these measures. In particular, every measure $\mu$ will be identified with a density $\mu(X_{1},\ldots,X_{n})$ which acts on $n$ -tuples of vectors at the point $x$ (for $\mu$ -a.e. $x\in\Omega$ ) by means of the correspondence

[TABLE]

where the determinant is of the $n\times n$ matrix whose columns are the coefficients of the vectors $X_{i}$ in the standard basis. With all notation in place, it is now possible to state the main existence result for nonconcentration inequalities:

Theorem 4.

For any $s>0$ , let $\mu$ be the density on $\Omega$ which at the point $x$ is given by

[TABLE]

For any Borel set $E\subset\Omega$ ,

[TABLE]

with implicit constant depending only on $(n,k,s,\deg\Phi,N)$ .

It is implicit in the statement of Theorem 4 that the expression (42) is a density in the sense of (41). To see that this is the case, it suffices to observe first that (42) is zero when $X_{1},\ldots,X_{n}$ are linearly dependent. This follows because for each $\delta>0$ , there must be a matrix $T_{\delta}\in{\mathrm{GL}}(n,{\mathbb{R}})$ such that $(T^{*}_{\delta}X)_{j}=X_{j}$ for each $j$ but $\det T_{\delta}=\delta^{-1}$ . Testing (42) on this family $T_{\delta}$ and sending $\delta\rightarrow 0^{+}$ shows that the right-hand side of (42) must be zero. The next step is that when $X_{1},\ldots,X_{n}$ are linearly independent, there must be a matrix $M_{X}$ sending the standard basis $e_{1},\ldots,e_{n}$ to $X_{1},\ldots,X_{n}$ , which implies that $\det M_{X}=\det(X_{1},\ldots,X_{n})$ . Then because $GL(n,{\mathbb{R}})$ is a group, one may replace $T$ everywhere on the right-hand side of (42) by $(M_{X}^{-1})^{*}T$ , which gives

[TABLE]

as desired.

The main lemma necessary to prove Theorem 4 and complete the proof of Theorem 3 is stated below and proved in Section 5. It establishes the existence of a special multisystem ${\boldsymbol{\partial}}$ and vector fields $Y_{1},\ldots,Y_{n}$ for which it is possible to prove a kind of Bernstein or reverse Sobolev inequality on arbitrary Borel sets. Versions of such inequalities for intervals and boxes appear, for example, in work of Phong and Stein [ps1998, (2.1)] and Greenblatt [greenblatt2007, (3.21)], respectively. The adaptation of such results to arbitrary Borel sets requires substantial new ideas, even in comparison to the one-dimensional version of this result appearing in [gressman2009]. The lemma’s usefulness follows from the fact that, like Lemma 1, the set $E^{\prime}$ and the implicit constants are independent of the choice of $f$ within the vector space.

Assuming for the moment that Theorem 4 has been established, it is possible to quickly finish the proof of Theorem 3 in the remaining special case $\sigma=n/q$ . The second inequality of (13), i.e.,

[TABLE]

is simply a restatement of the corresponding basic inequality from (32) when $\sigma=n/q$ . To complete the proof of Theorem 3, it suffices to show when $s=q/n$ that the density (42) from Theorem 4 is comparable to or greater than the density on the right-hand side of (40) which dominates $d\mathcal{H}^{n/q}_{\Phi}/dx$ . Once this is known, if $\mu$ is the measure promised by Theorem 4 when $s=q/n$ ,

[TABLE]

for any Borel set $E$ , with uniform implicit constants depending only on the parameters $(q,k,n,\deg\Phi)$ , because $\mu$ dominates $\mathcal{H}^{n/q}_{\Phi}$ by comparison of densities and $\mu$ satisfies (10) by Theorem 4. Combining with the basic inequalities (32) gives

[TABLE]

for all Borel sets $E$ , with implicit constants depending only on $(q,k,n,\deg\Phi)$ . To reiterate, $\mu$ is dominated by $\mathcal{H}^{n/q}_{\Phi}$ by virtue of the basic inequalities (32), so the densities from (42) and (40) must in fact be comparable, and thus the upper bound (40) improves to become

[TABLE]

with implicit constants depending only on $(k,n,q,\deg\Phi)$ .

Thus, assuming Theorem 4 it suffices to compare the densities from (35) and (42), and show that the latter dominates the former. In so doing, it further suffices to fix $X_{1},\ldots,X_{n}$ to be the standard coordinate vectors on $\Omega\subset{\mathbb{R}}^{n}$ . Now because $\Phi$ vanishes to order $q$ on $\Delta$ , it must be the case that

[TABLE]

whenever $|\alpha_{1}|+\cdots+|\alpha_{k}|=q$ since the two differential operators have equal highest-order parts and the lower-order terms are all differential operators of order $q-1$ and lower (Note that for any ordered multiindex $\alpha_{j}$ , the operator $\partial^{\alpha}_{j}$ makes sense as a standard multiindex because the coordinate vector fields commute.) Therefore the inequality

[TABLE]

must hold. Thus the final portions of Theorem 3 will follow once the proof of Theorem 4 is complete.

Theorem 4 is itself a rather direct consequence of the following lemma:

Lemma 3.

Suppose that $\mu$ is a nonnegative Borel measure on $\Omega\subset{\mathbb{R}}^{n}$ which is absolutely continuous with respect to Lebesgue measure with locally integrable Radon-Nykodym derivative. Let $d\geq 1$ and $N\geq 1$ be fixed positive integers. Given any bounded Borel set $E\subset\Omega$ of finite, nonzero $\mu$ -measure, there exists an open set $U$ , a multisystem ${\boldsymbol{\partial}}$ of size $N$ on $U$ , vector fields $Y_{1},\ldots,Y_{n}$ on $U$ , and a Borel set $E^{\prime}\subset U\cap E$ such that

$\mu(E^{\prime})\gtrsim\mu(E)$ ** 2. 2.

$\mu(Y_{1},\ldots,Y_{n})\gtrsim\mu(E)$ * at every point of $E^{\prime}$ .* 3. 3.

For every polynomial map $f:\Omega\rightarrow{\mathbb{R}}^{m}$ of degree at most $d$ and every ordered multiindex $\alpha$ with $|\alpha|\leq N$ ,

[TABLE]

The implicit constants depend only on $(n,d,N)$ .

Proof of Theorem 4 assuming Lemma 3..

At this point, the proof of Theorem 4 is almost the same as the proof of Theorem 2. Let $E$ be a bounded Borel measurable set with positive $\mu$ measure. Fix an integer $N>0$ and let the multisystem ${\boldsymbol{\partial}}$ , vector fields $Y_{1},\ldots,Y_{n}$ , and sets $E^{\prime}$ and $U$ be as in Lemma 3. Let $y$ be any point in $E^{\prime}$ . If $\alpha_{1},\ldots,\alpha_{k}$ are ordered multiindices such that $|\alpha_{i}|\leq N$ for all $i=1,\ldots,k$ , then

[TABLE]

Taking a maximum over $\alpha_{1},\ldots,\alpha_{k}$ and comparing to the definition (42) of the density $\mu$ (fixing $T$ to be the identity), it follows that

[TABLE]

This is exactly the desired inequality (43). If $\mu(E)=0$ , the inequality (43) is trivial, so the only remaining case is when $E$ is an unbounded Borel set. In this case, $E=\bigcup_{M=1}^{\infty}E_{M}$ , where $E_{M}:=E\cap\left\{x\in\Omega\ \left|\ |x|\leq M\right.\right\}$ . Then by Monotone Convergence,

[TABLE]

as desired. ∎

4.4 Remarks on calculation

Before proceeding with the proof of Lemma 3, it is perhaps worthwhile to make some elementary remarks regarding the infimum appearing in (42) or (40) since from a practical perspective it represents the most difficult part of any actual calculation of the density. If $\alpha$ is any ordered multiindex of order $d$ , then by multilinearity it follows for any invertible square matrices $T$ and $O$ that

[TABLE]

If, for example, $O$ is an orthogonal matrix, it must then be the case that

[TABLE]

by simply using the fact that $|O_{jk}^{-1}|\leq 1$ and making the conservative estimate that the number of terms in the expanded multilinear sum is never greater than $n^{Nk}$ . This simple calculation shows that the infimum over $T\in{\mathrm{GL}}(n,{\mathbb{R}})$ in (42) is always comparable (up to a factor depending only on $n,k,N,$ and $s$ ) to the infimum over all matrices in some fixed subset $\mathcal{G}\subset{\mathrm{GL}}(n,{\mathbb{R}})$ provided that every matrix $T\in{\mathrm{GL}}(n,{\mathbb{R}})$ has a factorization $T=GO$ where $G\in\mathcal{G}$ and $O$ is orthogonal. The propositions below demonstrate two slightly different applications of this same idea.

The first example is based on the Singular Value Decomposition. Using this simplification, it is possible to characterize the positivity of the density (44) pointwise in terms of a height-type criterion for certain Newton-like polytopes. Algebraically, the proposition is closely related to the Hilbert-Mumford criterion, which was first proved in the real-valued case proved by Birkes [birkes1971].

Proposition 1.

For any $x\in\Omega$ , if $\Phi$ vanishes to order $q$ at $(x,\ldots,x)$ , then

[TABLE]

if and only if for every orthogonal matrix $O$ , the point $(q/n,\ldots,q/n)\in[0,\infty)^{n}$ belongs to the convex hull in $[0,\infty)^{n}$ of the set

[TABLE]

Proof.

By the SVD, every $T\in GL(n,{\mathbb{R}})$ factors as $T=O_{1}DO_{2}$ where $O_{1},O_{2}\in O(n,{\mathbb{R}})$ and $D$ is a nonnegative diagonal matrix. If the diagonal entries of $D$ are denoted $(t_{1},\ldots,t_{n})$ , the expansion analogous to (46) gives that

[TABLE]

where ${\bf 1}:=(1,\ldots,1)\in{\mathbb{Z}}^{n}$ . It is also trivially true that the inequality (48) is reversed when the factor of $n^{qk}$ is omitted. Thus it suffices to find necessary and sufficient conditions for the quantity on the right-hand side of (48) to be nonzero. For convenience, let $a$ denote any $k$ -tuple of multiindices $(\alpha_{1},\ldots,\alpha_{k})$ with $|\alpha_{1}|+\cdots+|\alpha_{k}|=q$ , and define $\Sigma a:=\alpha_{1}+\cdots+\alpha_{k}$ and

[TABLE]

If $(q/n){\bf 1}$ belongs to the convex hull of the set (47) for every $O$ , then for every $O$ it must be possible to find $a_{1},\ldots,a_{N_{O}}$ and $\theta_{1},\ldots,\theta_{N_{O}}\in[0,1]$ such that $\theta_{1}+\cdots+\theta_{N_{O}}=1$ ,

[TABLE]

and $C_{a_{j}}>0$ for $j=1,\ldots,N_{O}$ . Because a maximum of terms always dominates any convex combination, it follows that

[TABLE]

The quantities $C_{a}$ are continuous functions of $O$ and nonzero at the particular $O$ in question, so each $C_{a_{j}}$ is strictly positive on a neighborhood of $O$ and consequently the infimum (49) must be bounded below by a positive quantity on a neighborhood of $O\in O(n,{\mathbb{R}})$ . By compactness of the orthogonal group, the infimum (48) must be strictly positive.

If, on the other hand, there is some $O\in O(n,{\mathbb{R}})$ such that $(q/n)\bf 1$ does not belong to the convex hull of (47), then the Separating Hyperplane Theorem guarantees the existence of $\ell\in{\mathbb{R}}^{n}$ such that $\ell\cdot\Sigma a>(q/n)\ell\cdot\bf 1$ for all $a$ . Taking $t=(e^{-s\ell_{1}},\ldots,e^{-s\ell n})$ gives

[TABLE]

as $s\rightarrow\infty$ for all $a$ . Consequently the infimum (48) must be zero. ∎

For the second example, recall the determinantal Hausdorff measure from Section 1.2. In that section, it was claimed that

[TABLE]

for any Borel set $E\subset{\mathbb{R}}^{n\times n}$ . By virtue of Theorem 3, to prove this inequality, it suffices to show that the density (44) is uniformly bounded below. This calculation is relatively straightforward for triangular matrices $T$ and is recorded in the following proposition:

Proposition 2.

Let

[TABLE]

where $A_{1}$ and $A_{2}$ denote matrices in ${\mathbb{R}}^{n\times n}$ . Then the Radon-Nykodym derivative $d\lambda^{n}_{\Phi}/dx$ is uniformly bounded below by a constant depending only on $n$ .

Proof.

Before beginning, note that the correct $\Phi$ -Hausdorff dimension for this problem is $n$ because $n^{2}$ is the dimension of the parameter space ${\mathbb{R}}^{n\times n}$ and $q=n$ is the order of vanishing of $\Phi$ on the diagonal.

Order the entries $(i,j)$ of $n\times n$ matrices lexicographically and let $\partial_{ij}$ correspond to differentiation in the direction of the $(i,j)$ entry. For any $T\in GL(n\times n,{\mathbb{R}})$ , one may write $T=LQ$ for a lower triangular matrix $L$ and an orthogonal matrix $Q$ (this is just the so-called $QR$ decomposition applied to $T^{*}$ ). Consequently, in taking the infimum (44), up to a uniform constant, it suffices to assume that $T$ is lower triangular; in this case the directional derivatives $Y_{ij}:=(T^{*}\partial)_{ij}$ are spanned by $\partial_{i^{\prime}j^{\prime}}$ for those entries $(i^{\prime},j^{\prime})$ which are lexicographically greater than or equal to $(i,j)$ .

Because the determinant is a linear function of each column and each row of a matrix,

[TABLE]

if either the indices $i_{1},\ldots,i_{n}$ or the indices $j_{1},\ldots,j_{n}$ are not distinct. When both the $i$ ’s and the $j$ ’s are distinct, the value of the derivative is $\pm 1$ depending on the relative orderings of the indices. By definition of the directional derivatives $Y_{ij}$ , the differential operator $Y_{1\ell_{1}}\cdots Y_{n\ell_{n}}$ can always be written as a linear combination of derivatives $\partial_{i_{1}j_{1}}\cdots\partial_{i_{n}j_{n}}$ where $(i_{1},j_{1})\geq(1,\ell_{1}),\ldots,(i_{n},j_{n})\geq(n,\ell_{n})$ lexicographically. However, among all such possible choices of the entries $(i_{1},j_{1}),\ldots,(i_{n},j_{n})$ , there is only one possibility where the $i$ ’s and $j^{\prime}s$ are distinct: $(i_{1},j_{1})=(1,\ell_{1}),\ldots,(i_{n},j_{n})=(n,\ell_{n})$ . This is because $i_{1}\geq 1,\ldots,i_{n}\geq n$ , so by the Pigeonhole Principle, the $i$ ’s can only be distinct when $i_{1}=1,\ldots i_{n}=n$ . This forces $j_{1}\geq\ell_{1},\ldots,j_{n}\geq\ell_{n}$ , which implies $j_{1}=\ell_{1},\ldots,j_{n}=\ell_{n}$ for the same reason because $\ell_{1},\ldots,\ell_{n}$ are already distinct. Therefore

[TABLE]

where $c_{ij}$ is the coefficient of $\partial_{ij}$ in the expansion of $Y_{ij}$ . It follows that

[TABLE]

since each entry $(i,j)$ appears in a $1/n$ fraction of all permutations $\sigma$ . Because $T$ is lower triangular, the product of all $|c_{ij}|$ is just the absolute value of the determinant. Therefore

[TABLE]

for any lower triangular matrix $T$ . Raising both sides to the power $n$ gives exactly the desired lower bound for the density (44). ∎

As a final remark on calculation, note that the simplifications used above apply equally well to Theorem 4. Using the QR decomposition as above, for example, it is possible to show that the function $\Phi$ on ${\mathbb{R}}^{2}\times{\mathbb{R}}^{2}$ given by

[TABLE]

satisfies the nonconcentration inequality

[TABLE]

which is an interesting result because this $\Phi$ is degenerate when $\sigma=n/q=2/2$ . The necessary calculation is relatively simple when one assumes without loss of generality that one of the two vectors in the pair $T^{*}X$ points in the $y$ -direction.

5 Proof of Lemma 3

5.1 Construction of the multisystem

Proof.

The proof begins by establishing that it suffices to assume that the functions $f$ are scalar-valued, i.e., that $m=1$ . When $m>1$ , as previously noted in (28), there must exist a symmetric, compact, convex set $K^{*}\subset{\mathbb{R}}^{m}$ such that

[TABLE]

for all $v\in{\mathbb{R}}^{m}$ . Taking $f:=(f_{1},\ldots,f_{m})$ to be a polynomial map of degree $d$ and assuming the lemma for the case $m=1$ gives

[TABLE]

so the implicit constant can taken to be independent of $m$ and of the choice of norm $|\cdot|$ on ${\mathbb{R}}^{m}$ .

Let $\mathcal{F}_{0}$ be the vector space of polynomials $f$ of degree at most $d$ and let $D:=\dim\mathcal{F}_{0}$ . Because $E$ is bounded, all polynomials of degree $d$ are bounded on $E$ , and because $E$ has nonzero $\mu$ measure, no nontrivial polynomial can vanish identically on $E$ . Thus $f\mapsto\sup_{x\in E}|f(x)|$ is a norm on $\mathcal{F}_{0}$ , and as in the proof of Lemma 1, one may fix $\det$ to be any nonzero alternating $D$ -linear form on $\mathcal{F}_{0}$ . Using this $\det$ just as was done earlier, it is possible to find $f_{1},\ldots,f_{D}\in{\mathcal{F}}_{0}$ such that $\sup_{x\in E}|f_{j}(x)|\leq 1$ and

[TABLE]

for any $f\in\mathcal{F}_{0}$ with constants $c_{j}$ satisfying $|c_{j}|\leq\sup_{x\in E}|f(x)|$ for each $j=1,\ldots,D$ . For any $n$ -tuple $(j_{1},j_{2},\ldots,j_{n})$ of indices in $\{1,\ldots,D\}$ such that $j_{1}<j_{2}<\cdots<j_{n}$ , let $U_{j_{1},\ldots,j_{n}}$ be the open set of points $x\in\Omega$ such that

[TABLE]

where $\left.df\right|_{x}$ denotes the exterior derivative of $f$ at the point $x$ . The union of all $U_{j_{1},\ldots,j_{n}}$ over all possible $j_{1}<\cdots<j_{n}$ must be all of $\Omega$ because at every point $x$ there must be some $j_{1}<\cdots<j_{n}$ for which $\left.df_{j_{1}}\wedge\cdots\wedge df_{j_{n}}\right|_{x}$ is nonzero. Since these open sets cover $\Omega$ , they cover $E$ as well, and there must consequently be a single choice of $j_{1}<\cdots<j_{n}$ such that $\mu(E\cap U_{j_{1},\ldots,j_{n}})\geq D^{-n}\mu(E)$ . On $U:=U_{j_{1},\dots,j_{n}}$ , define vector fields $Y_{1},\ldots,Y_{n}$ by means of the formula

[TABLE]

where $df$ in the numerator appears in position $i$ of the wedge product and replaces $df_{j_{i}}$ . This means that $Y_{i}f_{j_{i^{\prime}}}$ vanishes if $i\neq i^{\prime}$ and is identically one on $U_{j_{1},\ldots,j_{n}}$ if $i=i^{\prime}$ , which further means that the $Y_{i}$ are locally coordinate vector fields and commute with one another. Moreover, by (50) and the definition of $U_{j_{1},\ldots,j_{n}}$ , it must be the case that

[TABLE]

at every point of $U_{j_{1},\ldots,j_{n}}$ . Furthermore

[TABLE]

and by the change of variables formula, the last integral will be bounded above by the maximum number of nondegenerate solutions (i.e., solutions where the Jacobian determinant of the system is nonzero) of the system of equations

[TABLE]

in $E\cap U$ for $a_{1},\ldots,a_{n}\in[-1,1]$ since $|f_{j_{i}}(x)|\leq 1$ on $E$ . Letting $S$ denote a uniform upper bound for this number of solutions, it follows from Tchebyshev’s inequality that there is a measurable set $E^{\prime}\subset E\cap U$ with $\mu(E^{\prime})\geq\frac{1}{2}D^{-n}\mu(E)$ such that

[TABLE]

This completes the proof of Lemma 3 in the case $N=1$ .

By induction, assume the lemma has been established up to some level $N-1$ . For convenience, let the sets $E^{\prime}$ and $U$ at stage $N-1$ be denoted $E_{N-1}$ and $U_{N-1}$ , respectively. Suppose also that the lemma has been proved for some class of functions $\mathcal{F}_{N-1}$ which includes all polynomials of degree $d$ . Stage $N$ follows by applying the already-established base case of the lemma to the space of functions $\mathcal{F}_{N}$ on $U_{N-1}$ which defined to be the span of $\mathcal{F}_{N-1}$ and $Y_{i}\mathcal{F}_{N-1}$ , $i=1,\ldots,n$ . Postponing for the moment the problem of counting solutions of systems of equations during this induction procedure, it must be the case that for any $N$ , there is an open set $U_{N}$ and some measurable $E_{N}\subset E$ such that $\mu(E_{N}\cap U_{N})\gtrsim\mu(E)$ for some implicit constant depending on $(n,d,N)$ and there is a multisystem ${\boldsymbol{\partial}}$ of size $N$ , formed by extending the multisystem ${\boldsymbol{\partial}}$ of size $N-1$ to add new vector fields $Y^{(N)}_{j}:=Y_{j}$ defined by (51) on $U_{N}$ as above. For this extended multisystem, it must be the case that

[TABLE]

for all $j_{1},\ldots,j_{N}$ and all $f\in\mathcal{F}_{0}$ . Moreover, because each collection $Y^{(i)}_{1},\ldots,Y^{(i)}_{n}$ is locally given by coordinate vector fields with local coordinate functions which themselves belong to the finite-dimensional function space $\mathcal{F}_{N-1}$ , it follows that

[TABLE]

when $f_{1},\ldots,f_{n}$ are the functions used to construct the $Y^{(i)}_{\ell}$ . In particular, the coefficients $|Y^{(i+1)}_{j}f_{\ell}|$ are bounded uniformly in $j$ and $\ell$ (and uniformly in $E$ and $\mu$ ). By induction, this implies that the final vector fields $Y^{(N)}_{j}$ are linear combinations of the $Y^{(i)}_{\ell}$ for $i<N$ with coefficients that are uniformly bounded. Because the vectors $Y^{(N)}_{j}$ may be written as linear combinations of all previous $Y^{(i)}_{\ell}$ with bounded linear coefficients, it follows from (53) that

[TABLE]

with implicit constant independent of $\mu$ and $E$ whenever $\alpha$ is an ordered multiindex with $|\alpha|\leq N$ . Taking the vector fields $Y^{(N)}_{1},\ldots,Y^{(N)}_{n}$ to be vector fields promised in the statement of the lemma together with $E^{\prime}:=E_{N}$ and $U:=U_{N}$ completes the proof with the exception of the unfinished business of counting solutions of systems of equations. ∎

5.2 Underlying geometry and solution counting

The problem of counting solutions is an independent algebraic issue which has already been addressed elsewhere in the case of real analytic functions [gressman2017], so the reader who is not interested in the precise nature of the implicit constants in Theorem 3 may skip the rest of this section and consider Theorem 3 fully proved. For those who continue reading, there are two main purposes to this section. The first is to establish that the systems of equations encountered in the previous section have a bounded number of isolated solutions with an upper bound depending only on the constants $(n,d,N)$ as promised. The second major purpose of this section is to demonstrate that there is an intrinsic geometric object which governs the possible number of solutions. This means that a finite upper bound will continue to hold uniformly even when the functions $f$ belong, for example, to some o-minimal structure. This intrinsic geometric object is also closely related to certain geometric differential operators which were constructed some time ago to study uniform coordinate-independent sublevel set estimates [gressman2010II]. In a very precise way, the object described below allows one to extend those earlier differential operators to a broader class which includes rational functions of the simpler objects.

Throughout this section, the open set $\Omega\subset{\mathbb{R}}^{n}$ and the polynomials of bounded degree on $\Omega$ will be regarded as simply an abstract smooth manifold $\mathcal{M}$ of dimension $n$ and a finite-dimensional vector space $\mathcal{F}$ of smooth functions on $\mathcal{M}$ . Given such a pair $(\mathcal{M},\mathcal{F})$ , a new pair $(\mathcal{M}^{\prime},\mathcal{F}^{\prime})$ , representing a sort of abstract derivative of the original pair, is constructed as follows. Let $\mathcal{M}^{\prime}$ be the bundle $\Lambda^{n}_{*}(\mathcal{M})$ of nonvanishing $n$ -forms over points of $\mathcal{M}$ , i.e., points of $\mathcal{M}^{\prime}$ are nonvanishing $n$ -forms $\omega_{x}$ , where the subscript $x$ is used to indicate that $\omega_{x}$ acts as an alternating $n$ -linear form on the tangent space at $x\in\mathcal{M}$ . Let $\mathcal{F}^{\prime}$ be the vector space of smooth functions on $\mathcal{M}^{\prime}$ spanned by the functions

[TABLE]

and

[TABLE]

The construction of $(\mathcal{M}^{\prime},\mathcal{F}^{\prime})$ allows one to extend the class of functions $\mathcal{F}$ to a broader class involving derivatives of the functions in $\mathcal{F}$ without constructing vector fields or coordinate systems. The cost of the construction is the change of dimension of $\mathcal{M}$ from $n$ to $n+1$ , which roughly corresponds to including a new indeterminate variable. If $\mathcal{M}$ is the one-dimensional interval $(a,b)$ , for example, then one can show that $\mathcal{M}$ is diffeomorphic to $(a,b)\times{\mathbb{R}}_{\neq 0}$ and $\mathcal{F}^{\prime}$ is spanned by the functions $f(t)$ for $f\in\mathcal{F}$ and functions of the form $sf^{\prime}(t)$ where $s\neq 0$ is the new indeterminate. In higher dimensions, the situation is somewhat more complex but still analogous.

Iterating the construction of $\mathcal{M}^{\prime}$ and $\mathcal{F}^{\prime}$ gives a sequence of manifolds $\mathcal{M}^{(i)}$ and function spaces $\mathcal{F}^{(i)}$ on $\mathcal{M}^{(i)}$ , $i=0,\ldots,N$ (with $\mathcal{M}^{(0)}:=\mathcal{M}$ and $\mathcal{F}^{(0)}:=\mathcal{F}$ ). The spaces $\mathcal{M}^{(i)}$ have dimension $n+i$ and have fiber bundle projections $p_{i}$

[TABLE]

For convenience, let $\pi^{(i)}$ be the projection map $p_{1}\circ\cdots\circ p_{i}$ from $\mathcal{M}^{(i)}$ to $\mathcal{M}^{(0)}$ . The space $\mathcal{F}^{(i)}$ is spanned by functions of the forms

[TABLE]

and

[TABLE]

for $f_{1},\ldots,f_{n+i-1}\in\mathcal{F}^{(i-1)}$ . For convenience, define $\dot{\mathcal{F}}^{(i)}$ to be the vector space of functions on $\mathcal{M}^{(i)}$ which are of the form (54) only. One may also also regard $\mathcal{F}^{(i-1)}$ to be a subspace of $\mathcal{F}^{(i)}$ by composing with the projection $p_{i}$ .

The manifolds $\mathcal{M}^{(N)}$ completely capture the analysis and geometry of the vector fields $Y_{j}^{(i)}$ and the function spaces $\mathcal{F}_{N}$ constructed in Lemma 3. In a practical sense, this is because the problem of counting solutions can be lifted from $\mathcal{M}$ to $\mathcal{M}^{(N)}$ . This idea is formalized by the following lemma.

Lemma 4.

Suppose $\mathcal{F}_{0}$ consists of a finite-dimensional vector space of smooth functions on $\mathcal{M}$ . Let $\mathcal{F}_{1},\ldots,\mathcal{F}_{N}$ be the vector spaces of functions as constructed in the proof of Lemma 3, i.e., $\mathcal{F}_{i}$ is the span of $\mathcal{F}_{i-1}$ and $Y_{j}\mathcal{F}_{i-1}$ , $j=1,\ldots,n$ , for vector fields $Y_{j}$ defined as in (51) for some $f_{j_{1}},\ldots,f_{j_{n}}\in\mathcal{F}_{i-1}$ . Then the number of nondegenerate solutions $x\in U$ of the system

[TABLE]

where $f_{1},\ldots,f_{n}\in\mathcal{F}_{N}$ , $a_{1},\ldots,a_{n}\in{\mathbb{R}}$ , for a given open set $U$ is equal to the number of nondegenerate solutions $p\in(\pi^{(N)})^{-1}(U)$ of a corresponding system

[TABLE]

where $F_{1},\ldots,F_{n+N}\in\mathcal{F}^{(N)}$ , $b_{1},\ldots,b_{n+N}\in{\mathbb{R}}$ .

Although the manifold $\mathcal{M}^{(N)}$ is somewhat more abstract than $\mathcal{M}$ itself, Lemma 4 is a significant result for two reasons. The first is that it allows one to sidestep inherent difficulties of understanding the vector fields $Y_{i}$ when counting solutions. The second is that the functions in $\mathcal{F}^{(N)}$ are never more complex than derivatives of the functions in $\mathcal{F}$ and polynomials, as shown by the following proposition:

Proposition 3.

Suppose that $\varphi$ is a diffeomorphism from some open set $U\subset{\mathbb{R}}^{n}$ onto some open subset of $\mathcal{M}$ . For each $N$ , there is a diffeomorphism $\varphi^{(N)}$ from $U\times{\mathbb{R}}_{\neq 0}^{N}$ onto $(\pi^{(N)})^{-1}(\varphi(U))$ such that for every $F_{1},\ldots,F_{n+N-1}\in\mathcal{F}^{(N-1)}$ ,

[TABLE]

where the determinant on the right-hand side is the usual Jacobian determinant in the coordinates $(x,t_{1},\ldots,t_{N-1})\in U\times{\mathbb{R}}^{N-1}_{\neq 0}$ .

Proof.

By induction on $N$ , let $\varphi^{(N)}$ be given by

[TABLE]

where $dx_{1},\ldots,dx_{n}$ are differentials of the coordinate functions $x_{1},\ldots,x_{n}$ on $\varphi^{-1}(U)$ induced by $\varphi$ . As can be seen from the formula, these coordinates have the property that the canonical projection from $\mathcal{M}^{(N)}$ to $\mathcal{M}^{(N-1)}$ corresponds to dropping the variable $t_{N}$ . It is easy to check in these coordinates that

[TABLE]

for any $F_{1},\ldots,F_{n+N-1}\in\mathcal{F}^{(N-1)}$ . Definition (54) immediately gives (57). ∎

An important corollary is that when the functions $\mathcal{F}$ are polynomials of bounded degree in a suitable coordinate system (as will always be the case when applying the result to Lemma 3), the functions $\mathcal{F}^{(N)}$ may also be regarded as polynomials of a suitably bounded degree in the appropriate coordinates as well. Thus the number of nondegenerate solutions to the system (56) would immediately be bounded by Bézout’s Theorem just as applied in the proof of Theorem 1.

The proof of Lemma 4 proceeds by showing that every function $f\in\mathcal{F}_{N}$ (the function space analogous to Lemma 3) must agree with a function in $\mathcal{F}^{(N)}$ (the function space on $\mathcal{M}^{(N)}$ ) on a suitably-constructed $n$ -dimensional submanifold of $\mathcal{M}^{(N)}$ which is defined implicitly via a system of equations in $\mathcal{F}^{(N)}$ . This implies that the system of equations (55) involving the somewhat mysteriously-constructed functions $f_{j_{1}},\ldots,f_{j_{n}}$ can be naturally lifted to an system on $\mathcal{M}^{(N)}$ where the functions in the system belong to $\mathcal{F}^{(N)}$ . Because both $\mathcal{F}_{N}$ and $\mathcal{F}^{(N)}$ are vector spaces, the only part of this assertion which is somewhat cumbersome to prove is that ratios of wedge products a la (51) appear as values of functions in $\mathcal{F}^{(N)}$ restricted to suitable submanifolds. This is accomplished by a trivial induction on $N$ combined with the following proposition, which shows how to identify quantities like (51) via the identity (59) and also demonstrates in (58) how to inductively identify the $n$ -dimensional submanifold of $\mathcal{M}^{(N)}$ on which the desired identities hold.

Proposition 4.

Suppose $F_{j}\in\dot{\mathcal{F}}^{(j)}$ for each $j=1,\ldots,N$ and let

[TABLE]

Then

The set ${\mathcal{M}}^{(N)}_{F}$ is a manifold and the projection $\pi^{(N)}$ is a diffeomorphism of any open subset of ${\mathcal{M}}^{(N)}_{F}$ and its image.

Next suppose that $h_{1},\ldots,h_{n}$ and $g_{1},\ldots,g_{n}$ are smooth functions on some open subset $O\subset\mathcal{M}$ for which there exist $H_{1},\ldots,H_{n},G_{1},\ldots,G_{n}\in\mathcal{F}^{(N)}$ such that for each $j=1,\ldots,n$ , $H_{j}$ restricts to $h_{j}$ on ${\mathcal{M}}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(O)$ and likewise for $G_{j}$ and $g_{j}$ . In other words, $h_{j}\circ\pi^{(N)}=H_{j}$ on ${\mathcal{M}}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(O)$ and $g_{j}\circ\pi^{(N)}=G_{j}$ on ${\mathcal{M}}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(O)$ for each $j=1,\ldots,n$ . If one defines

[TABLE]

the following must also be true:

The image $\pi^{(N+1)}(\mathcal{M}^{(N+1)}_{F})\cap O\subset\mathcal{M}$ consists of exactly those points in $\pi^{(N)}(\mathcal{M}^{(N)}_{F})\cap O$ at which $dg_{1}\wedge\cdots\wedge dg_{n}\neq 0$ . 2. 3.

There is a function in $\mathcal{F}^{(N+1)}$ which restricts to

[TABLE]

at every point of $O$ where the denominator is nonzero, namely

[TABLE]

on ${\mathcal{M}}^{(N+1)}_{F}\cap(\pi^{(N+1)})^{-1}(O)$ .

Proof.

From the formula (57) in the coordinates $\varphi^{(N)}$ on $\mathcal{M}^{(N)}\cap(\pi^{(N)})^{-1}(U)$ , it is clear that every $F_{j}\in\dot{\mathcal{F}}^{(j)}$ must equal $t_{1}\cdots t_{j}$ times a polynomial in $(t_{1},\ldots,t_{j-1})$ with coefficients that are smooth functions of $x$ . There are several important consequences of this simple observation. The first is that $F_{j}$ is independent of $t_{k}$ when $k>j$ . When $k=j$ , it also follows that

[TABLE]

This means that the Jacobian matrix $\partial(F_{1},\ldots,F_{N})/\partial(t_{1},\ldots,t_{N})$ always has full rank at every point of $\mathcal{M}_{F}^{(N)}$ since the Jacobian matrix it is triangular and its diagonal entries are never zero (since $F_{j}=1$ on $\mathcal{M}^{(N)}_{F}$ for each $j$ and by assumption $t_{j}\neq 0$ for each $j$ as well). By the Implicit Function Theorem, this guarantees that $\mathcal{M}^{(N)}_{F}$ is always a manifold regardless of the choice of the particular $F_{j}$ ’s. Moreover, because of this triangular structure and the linearity of $F_{j}$ as a function of $t_{j}$ , it is easy to see that for a given $(x,t_{1},\ldots,t_{i})\in\mathcal{M}^{(i)}_{F}$ , there is at most a unique value of $t_{i+1}$ such that $(x,t_{1},\ldots,t_{i+1})\in\mathcal{M}^{(i+1)}_{F}$ , and such a solution exists if and only if $F_{i+1}(x,t_{1},\ldots,t_{i},t)$ is not an identically zero function of $t$ . As already noted, if such a value of $t_{i+1}$ exists, it is necessarily true that the Jacobian determinant $\det\partial(F_{1},\ldots,F_{i+1})/\partial(t_{1},\ldots,t_{i+1})$ must be nonvanishing at $(x,t_{1},\ldots,t_{i+1})$ . Therefore by the Implicit Function Theorem, the projection $\pi^{(N)}$ must be a diffeomorphism of any open subset of $\mathcal{M}^{(N)}_{F}$ and its image. This establishes the first conclusion of the proposition.

Because $\pi^{(N)}$ is a diffeomorphism of any open subset of $\mathcal{M}^{(N)}_{F}$ and its image, one may define coordinates on $\mathcal{M}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(U)$ using $\varphi$ by lifting the coordinate function $\varphi$ via $(\pi^{(N)})^{-1}$ , i.e., by mapping $x\in U\cap\varphi^{-1}\pi^{(N)}(\mathcal{M}^{(N)}_{F})$ to $(\pi^{(N)})^{-1}(\varphi(x))$ , where $U$ is any suitable open subset of $\mathcal{M}$ on which a coordinate system $\varphi$ is defined. Let $X_{1},\ldots,X_{n}$ denote the associated coordinate vector fields. It follows that $d\pi^{(N)}(X_{i})=\partial/\partial x_{i}$ for each $i=1,\ldots,n$ . In the coordinates $\varphi^{(N)}$ on $\mathcal{M}^{(N)}$ , this means that

[TABLE]

for each $i=1,\ldots,n$ . Since each $F_{j}$ is constant on $\mathcal{M}^{(N)}_{F}$ , it must be the case that $X_{i}F_{j}=0$ on $\mathcal{M}^{(N)}_{F}$ for each pair of indices $i,j$ . Therefore by applying the usual row operations to the Jacobian determinant (57) (assuming that distinct rows of the matrix correspond to partial derivatives with respect to distinct coordinate variables), it must be the case that

[TABLE]

on $\mathcal{M}^{(N)}_{F}$ (using the triangular structure of $\partial F/\partial t$ and (60)). If it is also known that $G_{j}$ restricts to $g_{j}$ on $\mathcal{M}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(O)$ , then $X_{i}G_{j}=X_{i}(g_{j}\circ\pi^{(N)})=(d\pi^{(N)}(X_{i})g_{j})\circ\pi^{(N)}=(\partial g_{j}/\partial x_{i})\circ\pi^{(N)}$ , so

[TABLE]

in the coordinates $(x,t_{1},\ldots,t_{N+1})$ when $(x,t_{1},\ldots,t_{N})\in\mathcal{M}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(U)$ .

Now assuming that $F_{N+1}$ is selected in such a way that (58) holds, it follows that for a given point $(x,t_{1},\ldots,t_{N})\in\mathcal{M}^{(N)}_{F}\cap(\pi^{(N)})^{-1}(U)$ , the equation $F_{N+1}(x,t_{1},\ldots,t_{N+1})=1$ will have a solution $t_{N+1}$ if and only if $\det(\partial g/\partial x)\neq 0$ at the point $x\in U$ , which will occur exactly when $dg_{1}\wedge\cdots\wedge dg_{n}\neq 0$ . Because every point of $O$ is contained in an open set $U$ on which a coordinate system is defined, this forces the second conclusion of the proposition to be true, namely, that $\pi^{(N+1)}(\mathcal{M}_{F}^{(N+1)})\cap O$ will be exactly the subset of $\pi^{(N)}(\mathcal{M}_{F}^{(N)})\cap O$ at which $dg_{1}\wedge\cdots\wedge dg_{n}\neq 0$ .

As for the third conclusion of the proposition, assuming that $x\in U$ is a point at which $dg_{1}\wedge\cdots\wedge dg_{n}\neq 0$ and that $(x,t_{1},\ldots,t_{N})\in\mathcal{M}_{F}^{(N)}$ ,

[TABLE]

assuming $1=t_{N+1}\det(\partial g/\partial x)$ , which must be the case when $(x,t_{1},\ldots,t_{N+1})\in\mathcal{M}^{(N+1)}_{F}$ . Because $U$ was arbitrary, the formula holds on all of $O$ as well. ∎

The proof of Lemma 4 follows quickly from Proposition 4. By induction on $N$ , once it is known that there are suitable $F_{i}\in\dot{\mathcal{F}}^{(i)}$ for $i=1,\ldots,N$ such that every function $g\in\mathcal{F}_{N}$ of the form

[TABLE]

for $f\in\mathcal{F}$ has a corresponding function $G$ in $\mathcal{M}^{(N)}$ which restricts to $g$ on $\mathcal{M}^{(N)}_{F}$ , the third conclusion of the proposition establishes that the same property must hold at stage $N+1$ as well. This is because the functions $f_{j_{1}},\ldots,f_{j_{n}}$ in the denominator of (51) defining the new vector fields $Y^{(N+1)}_{i}$ belong to the span of $\mathcal{F}_{N}$ and $Y_{i}^{(N)}\mathcal{F}_{N}$ , which means by induction that each such function is the restriction to $\mathcal{M}^{(N)}_{F}$ of a function in $\mathcal{F}^{(N)}$ . These extended functions define $F_{n+1}$ via (58). The key point is that the vector fields $Y^{(N+1)}_{1},\ldots,Y^{(N+1)}_{n}$ all have the same denominator, so the same choice of $F_{N+1}$ defining $\mathcal{M}^{(N+1)}_{F}$ works simultaneously for the application of any one of the vector fields $Y^{(N+1)}_{i}$ via the identity (59).

A consequence of this observation is that when $H_{1},\ldots,H_{n}\in\mathcal{F}^{(N)}$ restrict to $h_{1},\ldots,h_{n}$ on some open subset of $\mathcal{M}_{F}^{(N)}\cap(\pi^{(N)})^{-1}(O)$ , then every solution of the system of equations

[TABLE]

for $x\in O$ will correspond to a solution of the augmented system

[TABLE]

in $\mathcal{M}^{(N)}\cap(\pi^{(N)})^{-1}(O)$ (in the sense that $(\pi^{(N)})^{-1}$ will map solutions in $O$ injectively to solutions in $\mathcal{M}^{(N)}\cap(\pi^{(N)})^{-1}(O)$ of the augmented system) and that the mapping preserves nondegeneracy in the sense that $\det(\partial h/\partial x)\neq 0$ for a solution point in $O$ if and only if $\det(\partial(H_{1},\ldots,H_{n},F_{1},\ldots,F_{N})/\partial(x,t_{1},\ldots,t_{N}))\neq 0$ . This latter observation follows immediately from the equality of (57) (when fixing $(G_{1},\ldots,G_{n+N}):=(H_{1},\ldots,H_{n},F_{1},\ldots,F_{N})$ ) and (62). Thus Lemma 4 must be true. This completes the proof of Lemma 4 and consequently the proofs of Lemma 3 and Theorems 3 and 4 as well.

6 Further applications to Radon-like operators

To close, it is illuminating to return to the context of averaging operators (1) of Theorem 1 and explicitly see how Theorem 3 applies, as was abstractly indicated by Example 4 in Section 1.2. For convenience, it will be assumed that the map $\gamma(t,x)$ has the form

[TABLE]

where $\gamma_{0}:{\mathbb{R}}^{n}\times{\mathbb{R}}^{N_{2}}\rightarrow{\mathbb{R}}^{r}$ for some integer $r$ (in which case $N_{1}:=n+r$ ) and $N_{2}=rk$ for some integer $k\geq 2$ . A short calculation gives that

[TABLE]

because the determinants in the original definition (2) have block structure in the first $n$ rows and last $n$ columns. If the coordinates of $\gamma_{0}$ are labelled $(\gamma_{0})_{1},\ldots,(\gamma_{0})_{r}$ , then this formula for $\omega(t,x)$ agrees with the wedge product

[TABLE]

where $d_{x}$ is the exterior derivative in the $x$ variables only. From this observation, it follows that $\Phi$ has the particularly simple form

[TABLE]

where $\partial\gamma_{0}/\partial x$ is the $r\times rk$ Jacobian matrix of $\gamma_{0}$ .

Example 1 (Hausdorff measure). Let $\mathcal{C}_{\ell}$ be the real associative algebra666The algebra $\mathcal{C}_{\ell}$ is an example of a Clifford algebra. generated by elements $1,e_{1},\ldots,e_{\ell}$ which are subject to the relations $1e_{j}=e_{j}1=e_{j}$ for all $j$ , $e_{i}e_{j}=-e_{j}e_{i}$ when $j\neq i$ , and $e_{i}^{2}=1$ . The dimension of the algebra as a vector space over the reals is $2^{\ell}$ , and

[TABLE]

for any real numbers $a_{1},\ldots,a_{\ell}$ . Consequently if $M_{1},\ldots,M_{\ell}$ are the $2^{\ell}\times 2^{\ell}$ matrices which express the action of left multiplication in $\mathcal{C}_{\ell}$ by $e_{1},\ldots,e_{\ell}$ , respectively, in the standard basis, then

[TABLE]

If $n\leq\ell$ and one defines a mapping

[TABLE]

for polynomial functions $\Gamma_{1},\ldots,\Gamma_{\ell}$ , then the Radon-like operator

[TABLE]

where $x,y\in\mathcal{C}_{\ell}$ , has the corresponding functional $\Phi$

[TABLE]

where $|\cdot|$ denotes the Euclidean distance of points in $\mathcal{C}$ when expressed in coordinates with respect to the standard basis. This $\Phi$ vanishes to order $2^{\ell}$ on the diagonal, so when $\sigma=n2^{-\ell}$ and $s=2^{\ell}/n$ the optimal measure of Theorem 3 is comparable to the $n$ -dimensional Hausdorff measure on the image of $\Gamma$ , assuming that $\Gamma(t)$ is locally injective. If $\widetilde{\Omega}:=\Omega\times\mathcal{C}_{\ell}\times\mathcal{C}_{\ell}$ for a set $\Omega$ on which $(\det(\partial\Gamma/\partial t)^{T}(\partial\Gamma/\partial t))^{1/2}\gtrsim\delta^{n/2^{\ell}}$ , then (4) must apply and consequently

[TABLE]

for all Borel sets $F\subset{\mathbb{R}}^{n}\times\mathcal{C}_{\ell}$ . In particular, note that the image of $\Gamma$ need not have any curvature whatsoever; in this case, the multiplicative structure of the Clifford algebra grants the operator (63) a sort of rotational curvature regardless of the higher-order geometric properties of $\Gamma$ . If $\Gamma$ simply parametrizes a linear subspace, then (63) becomes a restricted $n$ -plane transform; the estimate (64) can be taken to be global in $t$ and consequently scaling and Knapp examples give that the integrability exponents appearing in (64) are sharp.

Example 2 (Determinantal measure). Generalizing the first example, suppose that $\Gamma:{\mathbb{R}}^{n}\rightarrow{\mathbb{R}}^{n^{\prime}\times n^{\prime}}$ is a polynomial map. The Radon-like operator

[TABLE]

where $y,x\in{\mathbb{R}}^{n^{\prime}}$ and $\Gamma(t)x$ denotes matrix-vector multiplication, has functional

[TABLE]

The order of vanishing $q$ of $\Phi$ on the diagonal must be at least $n^{\prime}$ . The associated measure $\mathcal{H}^{n/n^{\prime}}_{\Phi}$ from Theorem 3 is comparable to the $n/n^{\prime}$ -dimensional determinantal Hausdorff measure from Section 1.2 restricted to the image of $\Gamma$ (assuming, for example, that $\Gamma$ is locally injective). The measure must be absolutely continuous with respect to Lebesgue measure, so whenever it is nonzero, one can take $\widetilde{\Omega}:=\Omega\times{\mathbb{R}}^{n^{\prime}}\times{\mathbb{R}}^{n^{\prime}}$ where $\Omega$ is any set on which the Radon-Nykodym derivative is at least comparable to $\delta^{n/n^{\prime}}$ . Then (4) will hold and the conclusion (5) of Theorem 1 will hold with $k=2$ and $s=n^{\prime}/n$ . An extreme case occurs when $n=n^{\prime 2}$ and $\Gamma$ is simply a linear isomorphism. Fixing $dT$ to Lebesgue measure on ${\mathbb{R}}^{n^{\prime}\times n^{\prime}}$ , then the isodiametric determinantal inequality on ${\mathbb{R}}^{n^{\prime}\times n^{\prime}}$ proved in Proposition 2 implies the global, scaling-invariant inequality

[TABLE]

for all Borel sets $F\subset{\mathbb{R}}^{n^{\prime}\times n^{\prime}}\times{\mathbb{R}}^{n^{\prime}}$ .

A modification of this example also applies to the case of convolution with measures on quadratic submanifolds of dimension $n$ in ${\mathbb{R}}^{2n}$ . Specifically, fixing

[TABLE]

under the assumption that $Q^{\ell}_{ij}=Q^{\ell}_{ji}$ for each $i,j,\ell=1,\ldots,n$ , then the operator

[TABLE]

has a corresponding functional $\Phi$ given by

[TABLE]

where $Q(\cdot,a)$ denotes the $n\times n$ matrix whose $(i,j)$ -entry equals

[TABLE]

Since $\Phi$ is a polynomial of degree exactly $n$ , the density (44) is a constant function. In the framework of geometric invariant theory, the infimum (44) is comparable to the infimum over the ${\mathrm{SL}}(n,{\mathbb{R}})$ -orbit of the polynomial $p(t):=\det Q(\cdot,t)$ , where elements of ${\mathrm{SL}}(n,{\mathbb{R}})$ act by linear coordinate changes (see, for example, the work of Richardson and Slodowy [rs1990] extending the Kempf-Ness minimum vector construction to the context of real algebraic geometry). Thus the infimum is zero if and only if $p$ belongs to the nullcone of the representation. Because the nullcone is exactly the zero set of all ${\mathrm{SL}}(n,{\mathbb{R}})$ -invariant polynomials in the coefficients (which is a finitely generated algebra), this reduces the problem of applying Theorem 1 to (66) to a finite list of calculations once a set of generating ${\mathrm{SL}}(n,{\mathbb{R}})$ -invariant polynomials is known. This approach complements earlier work of the author [gressman2015] which formulates a slightly weaker result in terms of the critical integrability exponent of the polynomial $\det Q(\cdot,t)$ .

Example 3 (Affine measure). For the Radon-like operator

[TABLE]

where $x^{\prime}\in{\mathbb{R}}$ , $x\in{\mathbb{R}}^{k}$ , and $\Gamma:{\mathbb{R}}^{n}\rightarrow{\mathbb{R}}^{k}$ is a polynomial map (and $\cdot$ is the dot product), the corresponding functional $\Phi$ equals

[TABLE]

up to a factor of $\pm 1$ . The order of vanishing $q$ must be at least $k$ but will generally be much larger. If $\sigma=n/q$ and $\Gamma$ is locally injective, then the sharp measure from Theorem 3 is comparable to Oberlin’s affine measure on the image of $\Gamma$ ; for general submanifolds, this measure will be comparable to affine submanifold measure as recently constructed by the author elsewhere [gressman2017] (although the comparability may fail in special cases, e.g., when $\Gamma$ includes no mixed monomials). Unlike the Clifford algebra example, the nondegeneracy of affine submanifold measure on $\Gamma$ depends on higher-order geometry of $\Gamma$ and not just its first derivatives. Once again, because this measure is necessarily absolutely continuous with respect to Lebesgue measure, if the image of $\Gamma$ has nonzero affine Hausdorff measure, then a suitable $\widetilde{\Omega}$ can be defined to apply Theorem 1 to (67).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Geometric averaging operators and nonconcentration inequalities

Abstract

Contents

1 Introduction

1.1 Main results

Theorem 1**.**

Theorem 2**.**

Theorem 3**.**

1.2 Examples

1.3 Structure of the paper

2 Proof of Theorem 1

Proof of Theorem 1.

3 Proof of Theorem 2 and basic measure inequalities

3.1 Proof of Theorem 2

Lemma 1**.**

Proof.

Proof of Theorem 2.

3.2 Basic GMT inequalities and Frostman’s Lemma

Lemma 2**.**

Proof.

Corollary 1**.**

Proof.

4 Proof of Theorem 3

4.1 The case σ<n/q\sigma<n/qσ<n/q

4.2 The case σ≥n/q\sigma\geq n/qσ≥n/q: Comparison to Lebesgue measure

4.3 Multisystems and Theorem 3 with σ=n/q\sigma=n/qσ=n/q

Theorem 4**.**

Lemma 3**.**

Proof of Theorem 4 assuming Lemma 3..

4.4 Remarks on calculation

Proposition 1**.**

Proof.

Proposition 2**.**

Proof.

5 Proof of Lemma 3

5.1 Construction of the multisystem

Proof.

5.2 Underlying geometry and solution counting

Lemma 4**.**

Proposition 3**.**

Proof.

Proposition 4**.**

Proof.

6 Further applications to Radon-like operators

References

Theorem 1.

Theorem 2.

Theorem 3.

Lemma 1.

Lemma 2.

Corollary 1.

4.1 The case $\sigma<n/q$

4.2 The case $\sigma\geq n/q$ : Comparison to Lebesgue measure

4.3 Multisystems and Theorem 3 with $\sigma=n/q$

Theorem 4.

Lemma 3.

Proposition 1.

Proposition 2.

Lemma 4.

Proposition 3.

Proposition 4.