Geometric averaging operators and nonconcentration inequalities
Philip T Gressman

TL;DR
This paper systematically studies geometric integral inequalities related to Radon-like transforms over polynomial submanifolds, extending key results in geometric measure theory to improve understanding of $L^p$-inequalities.
Contribution
It introduces new geometric averaging operators and establishes nonconcentration inequalities, advancing the continuum combinatorial approach to $L^p$-improving inequalities.
Findings
Derived new geometric integral inequalities
Extended results in geometric measure theory
Enhanced understanding of Radon-like transforms
Abstract
This paper is devoted to a systematic study of certain geometric integral inequalities which arise in continuum combinatorial approaches to -improving inequalities for Radon-like transforms over polynomial submanifolds of intermediate dimension. The desired inequalities relate to and extend a number of important results in geometric measure theory.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Geometric averaging operators and nonconcentration inequalities
Philip T. Gressman111This work was partially supported by NSF grants DMS-1361697 and DMS-1764143.
Abstract
This paper is devoted to a systematic study of certain geometric integral inequalities which arise in continuum combinatorial approaches to -improving inequalities for Radon-like transforms over polynomial submanifolds of intermediate dimension. The desired inequalities relate to and extend a number of important results in geometric measure theory.
Contents
1 Introduction
1.1 Main results
Suppose that is a polynomial map from into with and that is some Borel measurable subset of . To this and , one may associate the Radon-like operator
[TABLE]
which may be informally regarded as averaging functions on over the family of sets given by
[TABLE]
The main result of this paper regarding the operator (1) is the following:
Theorem 1**.**
Suppose for some positive integer . Let be the -form
[TABLE]
where each is an column matrix of partial derivatives, is the Jacobian matrix of with respect to , and the determinant is that of the square matrix formed by concatenation. For each , let222Note that the ratio of forms in the definition of is a well-defined real number because both numerator and denominator belong to the same one-dimensional vector space of -forms on .
[TABLE]
Fix any real and suppose that is a Borel set such that
[TABLE]
for every point and every Borel such that , where denotes the Lebesgue measure of . Then the Radon-like operator (1) satisfies the inequality
[TABLE]
for all Borel sets , with the notation “” indicating the presence of an implicit multiplicative factor. In this case, the factor depends only on .
The technical structure of the proof is built on the change of variables formula, similar to various earlier approaches [gressman2013, gressman2015] in the spirit of combinatorial/continuum incidence methods developed by Christ [christ1998]. Christ’s technique, based on ideas of Bourgain [bourgain1986, bourgain1991], Wolff [wolff1995, wolff1997], Schlag [schlag1997], and others, has, since its development twenty years ago, had an impact on the subject of harmonic analysis which is difficult to overstate. It has influenced and inspired work of Bennett, Carbery, and Wright [bcw2005], Dendrinos, Laghi, and Wright [dlw2009], Erdoğan and R. Oberlin [eo2008], Hickman [hickman2016], D. Oberlin [oberlin2000II], Stovall [stovall2011, stovall2014], Tao and Wright [tw2003], and many others.
When , the operator (1) integrates over hypersurfaces and the integral on the left-hand side of (4) reduces to a multilinear determinant functional [gressman2010]. In this case it is known that for fixed , the inequality (4) is satisfied if and only if the Lebesgue measure on the submanifold parametrized by satisfies D. Oberlin’s affine curvature condition, meaning that
[TABLE]
for all boxes with arbitrary orientations and eccentricities, with an implicit constant which is independent of . The condition (6) is called affine because the implicit constant does not change when is acted on by an equiaffine333The prefix “equi-” specifies those affine transformations which preserve Lebesgue measure. transformation and is regarded as a curvature condition because it necessarily fails when lies in any affine hyperplane. The question of whether (6) is satisfied for a given is surprisingly difficult to solve and systematic approaches have only recently become available [gressman2017]. When , the situation is even more difficult, as there are no previously-known analogues of the Oberlin affine curvature condition which apply to (4).
To address the inherent difficulties of the case , this paper is devoted primarily to the general study of functionals of the form
[TABLE]
and
[TABLE]
where the sets range over all Borel subsets of some domain and the measure is a nonnegative Borel measure. Functionals of the forms (7) and (8) will be called nonconcentration functionals since they quantify the extent to which product sets fail to lie in the zero set of . Outside of the context of Theorem 1, will be taken to be any polynomial in which vanishes to order on the diagonal , meaning that all partial derivatives of order less than vanish identically on and some partial derivative of order is nonzero at some point of . When , the absolute values are to be understood as some fixed but otherwise arbitrary norm on . The general question to be answered is to determine when one has inequalities of the form
[TABLE]
and
[TABLE]
for all Borel sets , where is a fixed real number and and are nonnegative constants which do not depend on . The cases , , and are uninteresting; to avoid these exceptions, a nonnegative Borel measure on will be said to satisfy (9) or (10) nontrivially when is not the zero measure and the corresponding inequality holds with a strictly positive constant. Both (9) and (10) will be called nonconcentration inequalities.
The first significant result for nonconcentration inequalities establishes the fundamental equivalence of (9) and (10):
Theorem 2**.**
For any nonnegative Borel measure and any , satisfies (9) with positive constant if and only if satisfies (10) with positive constant. Moreover, if one defines to be the supremum of all nonnegative such that (9) holds for all Borel sets and likewise defines to be the supremum of all satisfying (10) for all Borel , then
[TABLE]
where the implicit constant depends only on on .
The value of Theorem 2 is that the nonconcentration functional is generally much easier to calculate and estimate than . In particular, it is possible to characterize existence of nontrivial measures satisfying (10) in terms of a geometric measure-theoretic generalization of Hausdorff measure and a corresponding generalization of Frostman’s Lemma. In in the key “dimension” for this measure, it is also possible to deduce detailed information about the Radon-Nykodym derivative of this generalized Hausdorff measure with respect to Lebesgue measure. When combined with Theorem 2, this gives an explicit calculation which can be carried out to verify the hypothesis (4). Some of the most important results in this direction are summarized in the following theorem.
Theorem 3**.**
For any Borel set and any , the -dimensional weighted -Hausdorff measure of is defined to equal the quantity
[TABLE]
Then the following statements are true:
If , then . There are no Borel measures satisfying (10) nontrivially when . 2. 2.
If , then there is a Borel measure satisfying (10) nontrivially with if and only if . 3. 3.
If , is absolutely continuous with respect to Lebesgue measure and there is an explicit estimate (see (44)) for the pointwise magnitude of the Radon-Nykodym derivative. Moreover
[TABLE]
for any Borel set and any nonnegative Borel measure satisfying (10), with implicit constants depending only on . In other words, the measure satisfies (10) itself and is, up to a multiplicative constant, the largest such measure.
1.2 Examples
It is worthwhile to briefly examine the implications of Theorem 3 in some familiar and unfamiliar settings.
Example 1 (Hausdorff measure). When for , is the diameter of and is equal to the classical -dimensional Hausdorff measure (see Federer [federer1969, 2.10.24]). The order of vanishing is simply . The first inequality of (13) states that
[TABLE]
In its sharp form with optimal constant, this is known as the isodiametric inequality [federer1969, 2.10.33]. Likewise, if is any nonnegative Borel measure satisfying
[TABLE]
for every Borel set , then (13) implies that . Thus Lebesgue measure on is, up to a constant, the largest measure on satisfying an isodiametric inequality (14). It should also be noted that the inequality (14) is, modulo the constant, equivalent to the upper Ahlfors regularity condition
[TABLE]
for all Euclidean balls , since every set of bounded diameter is contained in a ball of comparable diameter by virtue of Jung’s Theorem [federer1969].
Example 1 (Hausdorff measure). To generalize the first example, suppose that , , is any locally injective polynomial function and set . Locally the measure on pushes forward to equal exactly the -dimensional Hausdorff measure on restricted to the image of . Because the multiplicity of images of is bounded in terms of the degree, the measures must be comparable globally as well. The order of vanishing is still , and by (13), it follows that the -dimensional Hausdorff measure on the image of also satisfies an isodiametric inequality on , i.e.,
[TABLE]
Such an inequality can only hold in general because is polynomial; if were merely it is easy to construct a highly oscillatory curve, for example, with infinite length inside a ball of finite radius. It is also worth noting that up to multiplicative constants, the measure restricted to the image of is essentially the largest measure satisfying the -dimensional upper Ahlfors regularity condition equivalent to (15).
Example 2 (Determinantal measure). An interesting nontrivial example on the space of matrices is to set for any . The order of vanishing equals . Using the the estimate (44) for the magnitude of the Radon-Nykodym derivative , it will be shown (see Proposition 2) that is comparable to Lebesgue measure on . Thus, the first inequality of (13) becomes a determinantal isodiametric inequality for subsets of , namely,
[TABLE]
for all Borel sets . The implications of this inequality for a corresponding Radon-like operator are detailed in Section 6.
Example 3 (Affine measure). For as in Example 1′, let
[TABLE]
where the determinant of an ordered list of vectors in is defined to equal the determinant of the matrix whose -th column contains the ordered coordinates of the -th vector in the standard basis. The measure pushes forward to a measure on the graph of which is is dominated by D. Oberlin’s affine measure of dimension [oberlin2003] up to a uniform multiplicative constant; while it is not clear that these two measures are comparable in all cases, it is a consequence of later arguments in this paper that the measures must be comparable when . For this particular value of , is comparable to the recently-defined affine hypersurface measure [gressman2017], which is the optimal measure satisfying Oberlin’s affine curvature condition
[TABLE]
for all boxes of arbitrary orientation. Similar to the Hausdorff measure and the upper Ahlfors regularity condition, the Oberlin condition (16) is in fact equivalent to the a priori stronger inequality (10) (see Section 3.2).
Example 4 (Projective Measure on Forms). When the underlying space is taken to be the decomposable444Here “decomposable” means expressible as an -fold wedge product of -vectors. -vectors in for positive integers and , let
[TABLE]
(where diameter is with respect to any metric inducing the usual topology). The form defined by (2) is always decomposable (see Sections 2 and 6); if is locally injective for each , then the push forward of the measure on to the graph of will be comparable to the restriction of to the same graph. If is the smallest integer such that vanishes to order on the diagonal for some , then setting
[TABLE]
for an appropriate constant depending only on yields the inequality (4) with by Theorem 2 together with the fact that
[TABLE]
when is the set where the Radon-Nykodym derivative exceeds .
1.3 Structure of the paper
Section 2 is a self-contained proof of Theorem 1 using a combinatorial approach much like earlier work on uniform sublevel Radon-like inequalities and averages over -dimensional submanifolds of [gressman2013, gressman2015]. Section 3 contains a proof of Theorem 2 using elementary convex geometry as via Lemma 1,a n earlier version of which appears in work on affine submanifold measures [gressman2017]. This section also contains some basic GMT observations about -Hausdorff and weighted -Hausdorff measures which will be used in the proof of Theorem 3. In particular, Section 3.2 contains a proof of the relevant generalization of Frostman’s lemma, which is a rather direct reinterpretation of Howroyd’s proof as appearing in Mattila’s book [mattila]. Section 4 provides the bulk of the proof of Theorem 3. The case is essentially an immediate consequence of Lemma 2, while the case relies on a scaling argument to show that -Hausdorff measure of dimension must be absolutely continuous with respect to Lebesgue measure and to consequently estimate the Radon-Nykodym derivative. At this point, the remaining portions of Theorem 3 are reduced to establishing Theorem 4, which gives an explicit construction for any of a measure (possibly zero) satisfying (10). The proof of Theorem 4 is then reduced to proving Lemma 3 (see also [gressman2017]), which is the content of Section 5. As a part of the proof of Lemma 3, Section 5 also identifies the underlying intrinsic geometric objects which play an important algebraic role in the lemma and relate closely to earlier geometric sublevel set estimates [gressman2010II]. Finally, Section 6 gives some example applications of Theorem 1 which correspond to the GMT examples from Section 1.2.
2 Proof of Theorem 1
Proof of Theorem 1.
As defined in the introduction, suppose that is a polynomial map from into . Let , and suppose that for some integer . The basic structure of this proof is to estimate the quantity
[TABLE]
from below and above, where is defined to be the Jacobian determinant of the map . The main upper bound for comes from the change of variables formula and Bézout’s Theorem: for any , since , Bézout’s Theorem guarantees that the number of connected components in of the solution set of the system of equations
[TABLE]
is at most the product of the degrees of the polynomials (see Fulton [fulton1984, Chapter 8, Section 4]). This means that the number of real solutions of the system where the Jacobian is nonvanishing cannot exceed this same upper bound, since the nonvanishing of the Jacobian at a real solution guarantees that such a solution will be isolated in complex space as well. Now by the change of variables formula, if the number of solutions of the system (19) inside the domain of the integral is never greater than for any choice of , then
[TABLE]
Without loss of generality, it may be assumed that Jacobian determinant is nonvanishing at every counted solution of the system (since the integral on the set where is necessarily zero), i.e., need only bound the number of isolated solutions of (19) for a given right-hand side , which Bézout’s Theorem guarantees is bounded by the product of degrees.
To estimate (18) from below, recall the definition (2) of the form . The key fact to establish is that the functional is indeed the Jacobian determinant of the map , i.e., that
[TABLE]
To prove (21), first observe that the Jacobian matrix has block structure
[TABLE]
where is an block of partial derivatives of (with the coordinates of corresponding to rows and the partial derivatives in the coordinate directions of corresponding to columns) and is a corresponding block of partial derivatives. To simplify the determinant of the matrix (22), label the coordinates of as . It will be necessary to use the identity
[TABLE]
where one defines
[TABLE]
and observes of the remainder that it is spanned by all -fold wedge products of , which omit for at least one index . The proof of the identity is essentially immediate after observing that when computing the correct coefficient of in , it suffices to assume that for .
To use the identity (23), first express the determinant as the coefficient of in an -fold wedge product of one forms with coefficients drawn from the rows of the block-form matrix (22). The wedge of the forms in the -th block of rows is given by (23) when each coefficient is replaced the -entry of the matrix and each coefficient is replaced the -entry of the matrix . In particular, this yields the identity . To compute the Jacobian determinant (21), it suffices to take the wedge of the expressions (23) over and show that the remainders do not influence the coefficient of . Because the variables appear only in the -th block of rows, there is only one way for to be a factor in the full wedge product: it must appear explicitly in a corresponding term of (23). In other words, when taking the wedge over all , any wedge product including an will not contain all factors . In the place of the missing , every term of must necessarily contain more than factors drawn from . Since every term of the wedge product (23) must contain at least factors drawn from , it follows by the pigeonhole principle that in the full -fold wedge product representing the determinant (22), when expanded by multilinearity, any term including must be expressible as a sum of wedge products with at least one duplicate . Thus (21) must hold.
It is worth pausing briefly to make the observation that must be decomposable. First note that the form as defined by (2) is independent of the chosen coordinate systems on and . If does not have injective differential, then vanishes. Thus, when is nonzero, the dimension of the quotient modulo the image of the differential always has dimension . The image of the differential in this quotient space is therefore at most -dimensional, meaning that whenever is not zero, it is always possible to choose a coordinate system near any given for which belongs to the span of the partial derivatives of whenever . Computing the form (2) in these coordinates shows that must be a multiple of and is therefore decomposable. Moreover, it follows that vanishes to at least order when and . This then implies that vanishes to order at least on the diagonal at all points where .
Returning to (18), fix a Borel measurable set . By (20),
[TABLE]
where the implicit constant can be taken to equal the maximum number of isolated solutions of the system as range over . Defining to equal
[TABLE]
(which will be a Borel subset of since is a continuous function of ), it follows by Fubini that
[TABLE]
By the main hypothesis (4) of Theorem 1, it must be the case that
[TABLE]
since for each , . However, by the definition (1) of the Radon-like operator ,
[TABLE]
for each . Inserting this equality into (24) and raising both sides to the power gives the conclusion (5) of Theorem 1. ∎
As a final remark concerning the proof, it should be noted that the constraint that divides is only used in proving the upper bound for (18) via the change of variables formula. As weighted nonlinear Brascamp-Lieb inequalities (generalizing the results of Bennett, Carbery, Christ, and Tao [bcct2008, bcct2010]) ultimately become available, it will be possible to remove the divisibility constraint at the cost of changing the definition of to correspond to the correct weight for that context.
3 Proof of Theorem 2 and basic measure inequalities
3.1 Proof of Theorem 2
The proof of Theorem 2 begins with the following lemma, which generalizes Tchebyshev’s inequality to finite dimensional vector spaces of functions. The heart of this generalization is to show that there exists a single set of controlled measure outside of which all functions in the vector space are uniformly bounded (when properly normalized). It extends earlier results for single-variable polynomials [gressman2009] and real analytic functions [gressman2017, Lemma 3]. Although it will only be applied to Borel measures, measurability in the lemma may be taken with respect to any abstract -algebra.
Lemma 1**.**
Suppose is a positive measure on some space and is a -dimensional real vector space of measurable functions from into some vector space with norm . Then for any , there is a measurable set such that for which every satisfies the inequality
[TABLE]
Proof.
The inequality (25) is vacuously true for any (regardless of and ) for which the integral on the right-hand side is infinite. It therefore suffices to prove (25) for the subspace of those for which the integral is finite (the triangle inequality guarantees that such functions are indeed a vector space). Since this subspace also has dimension at most , we may assume without loss of generality that every is -integrable.
Next, let be the subspace consisting of all such that . If is nontrivial, let be a basis of and define
[TABLE]
Because is a finite-dimensional vector space (by the triangle inequality again) and because each basis element vanishes identically on , every is identically zero on . Furthermore ; this follows because
[TABLE]
so by the Monotone Convergence Theorem and Tchebyshev’s inequality,
[TABLE]
If happens to be trivial, set .
Now let be any subspace of which has trivial intersection with and satisfies . If is trivial, then (25) holds because and consequently fixing gives and for all . Thus it may be assumed that the dimension of equals . Define to be the set of all such that
[TABLE]
The mapping is continuous with respect to the vector space topology, and because is trivial, is a norm on , which implies that must be compact. Fix to be any nonzero alternating -linear functional on . By continuity and compactness, attains its maximum for some . Note also that the value of the maximum cannot be zero, since by scaling this would force to be identically zero. By Cramer’s rule, for any ,
[TABLE]
where the circumflex indicates that is omitted from the sequence of arguments of . In particular, by the choice of the functions , the coefficient of each in this expansion of has magnitude at most one. By the triangle inequality and scaling, then, it follows that
[TABLE]
for any and any . Now for any , fix
[TABLE]
By Tchebyshev’s inequality,
[TABLE]
note in particular that the first inequality is strict because
[TABLE]
for each . Equality of the integrals over would force equality of the two functions -almost everywhere on , which would then force , meaning ultimately that and , which has already been handled. Taking a supremum of the inequality (26) over all gives
[TABLE]
for any . Since every must equal for some and and since is identically zero on the given , the fact that (25) holds for immediately implies that it holds for as well. ∎
Before applying this lemma to the proof of Theorem 2, a brief remark is in order. Although the set given by (27) is only described as measurable, this is generally an understatement; if the functions of are all continuous, then is closed; if every is a polynomial, the sets are semialgebraic since they take the form
[TABLE]
for functions which in this case are polynomials of bounded degree.
Proof of Theorem 2.
The proof follows rather directly from Lemma 1. Without loss of generality, it may be assumed that is not the zero measure on , since in this case . In all other cases, and must be finite. First observe that
[TABLE]
for any measurable set with nonzero -measure since the integrand of is pointwise dominated by on . Consequently, for any such ,
[TABLE]
which then implies that . To prove the remaining inequality of (11), one applies Lemma 1 with the vector space being real-valued polynomials of degree at most . If , then an arbitrary and unspecified norm has been fixed as well; let be the unique symmetric, compact, convex subset of such that
[TABLE]
for all , where is the usual dot product. When the inequality (25) is applied iteratively in conjunction with Fubini’s Theorem, this establishes the chain of inequalities
[TABLE]
for any , where is the dimension of , which depends only on and . Taking a supremum over and and assuming that (10) holds gives that
[TABLE]
for any . If , fixing gives that
[TABLE]
If or , then the inequality immediately above still holds since it is trivial when and since the right-hand side of (29) is infinite for any positive when . Therefore the inequality holds for all , meaning that
[TABLE]
which completes the main assertion (11) of Theorem 2. In particular, the constant depends only on and not on or the norm on . ∎
3.2 Basic GMT inequalities and Frostman’s Lemma
In this section, the focus returns to Theorem 3. The goal for the moment is to lay out some basic geometric measure theory which underlies the analytic inequality (10). To that end, given a general polynomial vanishing to order on the diagonal as the introduction, for any and any , let
[TABLE]
To be clear, one need not assume that the sets have any regularity, but there is no loss of generality in requiring that each be Borel or even closed since continuity of implies that assigns the same value to and its closure . The quantity will be called the -Hausdorff measure of dimension , and as already defined in Theorem 3, is called the weighted -Hausdorff measure of dimension . Note that is a special case of the Carathéodory construction (see Federer [federer1969] and Mattila [mattila]), while generalizes the measure that Howroyd [howroyd1995] calls the weighted Hausdorff measure. Just as in the definition of the classical Hausdorff measure, the quantities (30) and (31) both define metric outer measures on and therefore restrict to well-defined measures on the Borel sets; see Folland [folland, Proposition 11.6].
The most basic inequalities satisfied by these quantities are that
[TABLE]
for any Borel set and any nonnegative Borel measure . The first inequality follows because
[TABLE]
The latter inequality of (32) follows simply because the infimum (31) is taken over a strictly larger set than (30). It is natural to ask when the measures and are equal or comparable. For the classical Hausdorff measure equality is known (see Federer [federer1969]), but for general measures this need not be the case. In the context of this present paper, the arguments of Section 4 will establish comparability in the range (although both measures are trivial when the inequality is strict). Beyond this observation, the question of comparability of and in the regime will for now remain unexplored.
The measure holds fundamental significance in the study of nonconcentration inequalities because it characterizes, via a generalization of Frostman’s Lemma, the existence of nontrivial measures satisfying such inequalities.
Lemma 2**.**
Fix any . There exists a nontrivial positive Borel measure on the compact set satisfying
[TABLE]
for all Borel sets if and only if .
Proof.
The proof follows Howroyd’s proof [howroyd1995] of Frostman’s Lemma as given by Mattila [mattila, Theorem 8.17]. By (32), the existence of nontrivial automatically guarantees that . Conversely, for any function on , let
[TABLE]
For any continuous functions on , it is elementary to check that
[TABLE]
It is also true that for every nonpositive function . Thus
[TABLE]
Consequently by the Hahn-Banach Theorem, there must exist a linear functional defined on the space of continuous functions on such that and for any continuous function . If is nonnegative, as well, so is a positive linear functional on . By the Riesz Representation Theorem, there must be a nonnegative Borel measure on such that
[TABLE]
Now if is any Borel set with diameter smaller than , let be a sequence of functions in which are identically on a neighborhood of , bounded above by one everywhere, and vanish outside the set of points distance at most from . Then
[TABLE]
where the last inequality follows because is a polynomial and therefore continuous. Finally, if , then there must be some positive such that . For this fixed value of , must be nonzero. By subdividing into nonoverlapping boxes, there must be a dyadic box of diameter less than such that . Now define the measure by . It follows that and for any Borel set of any diameter,
[TABLE]
as desired. ∎
It is worthwhile to explicitly connect Lemma 2 to D. Oberlin’s affine measure and affine curvature condition (16). It was observed by D. Oberlin [oberlin2003] and others that any measure on satisfying either a nontrivial Fourier restriction inequality or -improving convolution inequality must satisfy the inequality
[TABLE]
for some as ranges over all boxes in of arbitrary orientations, i.e., all sets of points which may be expressed as products of finite intervals with respect to some orthogonal coordinates on . In analogy with Oberlin’s affine measure555Note that Oberlin adjusts the exponent so that the affine dimension of is , but by the present convention, the dimension is always ., let
[TABLE]
be called the -dimensional weighted affine Hausdorff measure. This weighted affine Hausdorff measure is trivially dominated by Oberlin’s affine measure of dimension . In this setting, Lemma 2 has the following consequences:
Corollary 1**.**
Suppose is compact and fix any . Then admits a nontrivial positive Borel measure satisfying the Oberlin affine curvature condition (34) if and only if the -dimensional weighted affine Hausdorff measure of is nonzero. In particular, if implies that for any exponents satisfying
[TABLE]
neither of the inequalities
[TABLE]
(where denotes the Fourier transform) hold uniformly in for any nontrivial positive Borel measure supported on .
Proof.
Using Oberlin’s earlier calculations [oberlin2003, Proposition 2], it suffices to set as noted in the introduction and show that the Oberlin affine curvature condition (34) is equivalent to (33) modulo constants and that . Both facts are quickly established by showing that for any bounded Borel set there is a box such that and
[TABLE]
with implicit constants depending only on dimension. Because taking the closure of does not change the supremum, it may be assumed without loss of generality that is compact and one may fix an ensemble which achieves the supremum of on . If the supremum is zero, then necessarily the span of all vectors as ranges over must have dimension strictly less than , which implies that lies in an affine hyperplane. By boundedness of , this implies that is contained in a (degenerate) box of volume zero. Otherwise the supremum is strictly positive, and by the same argument appearing in the proof of Lemma 1, it must be the case for any that
[TABLE]
for constants . The set of all such points having such an expansion is an affine image of the box and consequently has Lebesgue measure . By the John Ellipsoid Theorem, this same set of points must be contained in an ellipsoid of comparable volume, and that ellipsoid must trivially be contained in a box of comparable volume. Thus and as promised.
Using this conclusion, if (34) is assumed to hold, then for any bounded Borel set ,
[TABLE]
If is unbounded, we may write as the union of an increasing family of bounded Borel sets and then observe that
[TABLE]
Likewise it must clearly be the case that since and since involves an infimum over a smaller class. However, for any bounded Borel sets such that for positive ’s, it is also true that for the distinguished rectangles containing each . Moreover,
[TABLE]
which implies that . The corollary now follows from Lemma 2. ∎
4 Proof of Theorem 3
The most difficult case of Theorem 3 to establish is the case . After the cases and are settled (the former using Lemma 2 and the latter using what amounts to a scaling argument), the proof of Theorem 3 is reduced to the related Theorem 4 and ultimately to Lemma 3.
4.1 The case
The proof of Theorem 3 in the case is an almost immediate consequence of Lemma 2. First, supposing that there is a Borel measure satisfying (10) nontrivially with , then by virtue of (32) applied to the set directly.
On the other hand, if , then because is an open subset of , it may be written as a countable increasing union of compact sets. By the Monotone Convergence Theorem, at least one of these compact subsets must have as well. By Lemma 2, must admit a measure satisfying (10) nontrivially on ; extending to be zero on the complement of gives a measure on which satisfies (10) nontrivially as well. In fact, it is worth noting that this argument works for any value of . Consequently for any , admits a Borel measure satisfying (10) nontrivially if and only if . The reason for the restriction, as will be seen momentarily, is simply that if .
4.2 The case : Comparison to Lebesgue measure
The goal of this section is to establish that must vanish when and to further show when that must be absolutely continuous with respect to Lebesgue measure with an upper bound on the corresponding Radon-Nykodym derivative. Fix standard coordinates on . Let denote the -tuple of partial derivatives in the coordinate directions. Furthermore, for any , will denote the -tuple
[TABLE]
Assuming that is any smooth function which vanishes to order at least at every point for every , the main inequality to be proved in this section is that for almost every
[TABLE]
where are multiindices and the subscript in indicates that the partial derivatives are applied to the argument of . The implicit constant in (35) will depend only on , and .
To begin this calculation, fix and , and suppose that and that is a positive integer. It must be the case by Taylor’s Theorem that
[TABLE]
for any belonging to any fixed compact subset of . Since is smooth, the error term is uniform as ranges over any compact set and as vary inside the box . In particular, if is any fixed point in and , then by a second application of Taylor’s Theorem to the main term on the right-hand side of (36), it follows that
[TABLE]
for small and large , with an implicit constant depending only on , and , in contrast with the error terms, which may also depend on , , etc. From this inequality, it follows that if is the covering of by the collection of boxes induced by subdividing into equal parts along each axis, then
[TABLE]
As , the diameters of all sets in the covering go to zero, so taking this limit implies that
[TABLE]
and that
[TABLE]
where just as on previous lines, the implicit constant depends only on , and . When , the equality (38) forces since is contained in a countable union of boxes with centers . By (32), this forces as well and rules out the existence of any nontrivial Borel measure satisfying a nonconcentration inequality when .
It now suffices to assume . For any in a compact subset of and any sufficiently small , it has been established that
[TABLE]
with implicit constant depending only on , and . To reiterate: the restriction of to a compact set influences the a priori size of the error term but not the implicit constant of (39). Because the maximum over is a locally bounded function of and because as , it follows that for all sufficiently small and all in any compact set, there is a constant (depending on the compact set and the transformation as well as on , and ) such that . This inequality forces to be locally absolutely continuous with respect to Lebesgue measure since any set of Lebesgue measure zero can be covered by a countable union of boxes of this form whose measures sum to any prescribed small value. Now because is known to be absolutely continuous with respect to Lebesgue measure, the Radon-Nykodym derivative can be estimated pointwise almost everywhere by dividing both sides of (39) by and letting . The result is that for almost every ,
[TABLE]
Because the inequality is true uniformly in , one can take an infimum of the right-hand side over a countable dense subset of to conclude that
[TABLE]
with some implicit constant depending only on , and . This is exactly the asserted inequality (35).
It is worth observing that by homogeneity and scaling (and permuting the order of the standard coordinates), it suffices to take the infimum in over the group rather than . It should also be mentioned that since the coordinate system used to derive (35) was essentially arbitrary, one could strengthen (35) a priori even further by taking an infimum on the right-hand side over all coordinate systems. However, this apparent strengthening of (35) is not an actual improvement in this case: since all lower-order derivatives vanish, it turns out that replacing the standard coordinate partial derivatives with partial derivatives in new coordinates leaves the value of the right-hand side of (35) unchanged. This coordinate independence will be a key point in the final stages of the proof of Theorem 3.
4.3 Multisystems and Theorem 3 with
The inequalities (32) and (35) just proved establish that for a given , any measure satisfying (10) with must be absolutely continuous with respect to Lebesgue measure and must have a Radon-Nykodym derivative controlled (up to an implicit constant) by times the expression on the right-hand side of (35). The purpose of this section is to introduce some additional ideas which will be used to show that the upper bound given by (35) can be used to define a measure which also satisfies (10). To prove this fact, it turns out to be necessary to work with a slightly more elaborate expression and then to show that this new, more complicated expression happens to be comparable to the the right-hand side of (35).
The added complexity which is required is to replace the standard coordinate derivatives by a broader family of differential operators which includes coordinate partial derivatives in all smooth coordinates as well as some slightly more general operators. The new object under consideration will be called a multisystem. A multisystem on an open set is a collection of smooth vector fields , , where for each fixed , commute and are linearly independent at every point in . The integer will be called the size of , and the class of all multisystems of size will be denoted . For any finite sequence of the form with and any -tuple of vectors at the point , let
[TABLE]
where is the unique constant-coefficient linear combination of which equals at the point . Such will be called ordered multiindices in variables and will be used to denote the order of differentiation of , which equals the cardinality of the domain of . As in the previous section, will also act on these differential operators by defining
[TABLE]
and taking .
Since the remainder of this paper deals with measures on which are absolutely continuous with respect to Lebesgue measure, it will be convenient to switch back and forth between analytic and geometric descriptions of these measures. In particular, every measure will be identified with a density which acts on -tuples of vectors at the point (for -a.e. ) by means of the correspondence
[TABLE]
where the determinant is of the matrix whose columns are the coefficients of the vectors in the standard basis. With all notation in place, it is now possible to state the main existence result for nonconcentration inequalities:
Theorem 4**.**
For any , let be the density on which at the point is given by
[TABLE]
For any Borel set ,
[TABLE]
with implicit constant depending only on .
It is implicit in the statement of Theorem 4 that the expression (42) is a density in the sense of (41). To see that this is the case, it suffices to observe first that (42) is zero when are linearly dependent. This follows because for each , there must be a matrix such that for each but . Testing (42) on this family and sending shows that the right-hand side of (42) must be zero. The next step is that when are linearly independent, there must be a matrix sending the standard basis to , which implies that . Then because is a group, one may replace everywhere on the right-hand side of (42) by , which gives
[TABLE]
as desired.
The main lemma necessary to prove Theorem 4 and complete the proof of Theorem 3 is stated below and proved in Section 5. It establishes the existence of a special multisystem and vector fields for which it is possible to prove a kind of Bernstein or reverse Sobolev inequality on arbitrary Borel sets. Versions of such inequalities for intervals and boxes appear, for example, in work of Phong and Stein [ps1998, (2.1)] and Greenblatt [greenblatt2007, (3.21)], respectively. The adaptation of such results to arbitrary Borel sets requires substantial new ideas, even in comparison to the one-dimensional version of this result appearing in [gressman2009]. The lemma’s usefulness follows from the fact that, like Lemma 1, the set and the implicit constants are independent of the choice of within the vector space.
Assuming for the moment that Theorem 4 has been established, it is possible to quickly finish the proof of Theorem 3 in the remaining special case . The second inequality of (13), i.e.,
[TABLE]
is simply a restatement of the corresponding basic inequality from (32) when . To complete the proof of Theorem 3, it suffices to show when that the density (42) from Theorem 4 is comparable to or greater than the density on the right-hand side of (40) which dominates . Once this is known, if is the measure promised by Theorem 4 when ,
[TABLE]
for any Borel set , with uniform implicit constants depending only on the parameters , because dominates by comparison of densities and satisfies (10) by Theorem 4. Combining with the basic inequalities (32) gives
[TABLE]
for all Borel sets , with implicit constants depending only on . To reiterate, is dominated by by virtue of the basic inequalities (32), so the densities from (42) and (40) must in fact be comparable, and thus the upper bound (40) improves to become
[TABLE]
with implicit constants depending only on .
Thus, assuming Theorem 4 it suffices to compare the densities from (35) and (42), and show that the latter dominates the former. In so doing, it further suffices to fix to be the standard coordinate vectors on . Now because vanishes to order on , it must be the case that
[TABLE]
whenever since the two differential operators have equal highest-order parts and the lower-order terms are all differential operators of order and lower (Note that for any ordered multiindex , the operator makes sense as a standard multiindex because the coordinate vector fields commute.) Therefore the inequality
[TABLE]
must hold. Thus the final portions of Theorem 3 will follow once the proof of Theorem 4 is complete.
Theorem 4 is itself a rather direct consequence of the following lemma:
Lemma 3**.**
Suppose that is a nonnegative Borel measure on which is absolutely continuous with respect to Lebesgue measure with locally integrable Radon-Nykodym derivative. Let and be fixed positive integers. Given any bounded Borel set of finite, nonzero -measure, there exists an open set , a multisystem of size on , vector fields on , and a Borel set such that
** 2. 2.
* at every point of .* 3. 3.
For every polynomial map of degree at most and every ordered multiindex with ,
[TABLE]
The implicit constants depend only on .
Proof of Theorem 4 assuming Lemma 3..
At this point, the proof of Theorem 4 is almost the same as the proof of Theorem 2. Let be a bounded Borel measurable set with positive measure. Fix an integer and let the multisystem , vector fields , and sets and be as in Lemma 3. Let be any point in . If are ordered multiindices such that for all , then
[TABLE]
Taking a maximum over and comparing to the definition (42) of the density (fixing to be the identity), it follows that
[TABLE]
This is exactly the desired inequality (43). If , the inequality (43) is trivial, so the only remaining case is when is an unbounded Borel set. In this case, , where . Then by Monotone Convergence,
[TABLE]
as desired. ∎
4.4 Remarks on calculation
Before proceeding with the proof of Lemma 3, it is perhaps worthwhile to make some elementary remarks regarding the infimum appearing in (42) or (40) since from a practical perspective it represents the most difficult part of any actual calculation of the density. If is any ordered multiindex of order , then by multilinearity it follows for any invertible square matrices and that
[TABLE]
If, for example, is an orthogonal matrix, it must then be the case that
[TABLE]
by simply using the fact that and making the conservative estimate that the number of terms in the expanded multilinear sum is never greater than . This simple calculation shows that the infimum over in (42) is always comparable (up to a factor depending only on and ) to the infimum over all matrices in some fixed subset provided that every matrix has a factorization where and is orthogonal. The propositions below demonstrate two slightly different applications of this same idea.
The first example is based on the Singular Value Decomposition. Using this simplification, it is possible to characterize the positivity of the density (44) pointwise in terms of a height-type criterion for certain Newton-like polytopes. Algebraically, the proposition is closely related to the Hilbert-Mumford criterion, which was first proved in the real-valued case proved by Birkes [birkes1971].
Proposition 1**.**
For any , if vanishes to order at , then
[TABLE]
if and only if for every orthogonal matrix , the point belongs to the convex hull in of the set
[TABLE]
Proof.
By the SVD, every factors as where and is a nonnegative diagonal matrix. If the diagonal entries of are denoted , the expansion analogous to (46) gives that
[TABLE]
where . It is also trivially true that the inequality (48) is reversed when the factor of is omitted. Thus it suffices to find necessary and sufficient conditions for the quantity on the right-hand side of (48) to be nonzero. For convenience, let denote any -tuple of multiindices with , and define and
[TABLE]
If belongs to the convex hull of the set (47) for every , then for every it must be possible to find and such that ,
[TABLE]
and for . Because a maximum of terms always dominates any convex combination, it follows that
[TABLE]
The quantities are continuous functions of and nonzero at the particular in question, so each is strictly positive on a neighborhood of and consequently the infimum (49) must be bounded below by a positive quantity on a neighborhood of . By compactness of the orthogonal group, the infimum (48) must be strictly positive.
If, on the other hand, there is some such that does not belong to the convex hull of (47), then the Separating Hyperplane Theorem guarantees the existence of such that for all . Taking gives
[TABLE]
as for all . Consequently the infimum (48) must be zero. ∎
For the second example, recall the determinantal Hausdorff measure from Section 1.2. In that section, it was claimed that
[TABLE]
for any Borel set . By virtue of Theorem 3, to prove this inequality, it suffices to show that the density (44) is uniformly bounded below. This calculation is relatively straightforward for triangular matrices and is recorded in the following proposition:
Proposition 2**.**
Let
[TABLE]
where and denote matrices in . Then the Radon-Nykodym derivative is uniformly bounded below by a constant depending only on .
Proof.
Before beginning, note that the correct -Hausdorff dimension for this problem is because is the dimension of the parameter space and is the order of vanishing of on the diagonal.
Order the entries of matrices lexicographically and let correspond to differentiation in the direction of the entry. For any , one may write for a lower triangular matrix and an orthogonal matrix (this is just the so-called decomposition applied to ). Consequently, in taking the infimum (44), up to a uniform constant, it suffices to assume that is lower triangular; in this case the directional derivatives are spanned by for those entries which are lexicographically greater than or equal to .
Because the determinant is a linear function of each column and each row of a matrix,
[TABLE]
if either the indices or the indices are not distinct. When both the ’s and the ’s are distinct, the value of the derivative is depending on the relative orderings of the indices. By definition of the directional derivatives , the differential operator can always be written as a linear combination of derivatives where lexicographically. However, among all such possible choices of the entries , there is only one possibility where the ’s and are distinct: . This is because , so by the Pigeonhole Principle, the ’s can only be distinct when . This forces , which implies for the same reason because are already distinct. Therefore
[TABLE]
where is the coefficient of in the expansion of . It follows that
[TABLE]
since each entry appears in a fraction of all permutations . Because is lower triangular, the product of all is just the absolute value of the determinant. Therefore
[TABLE]
for any lower triangular matrix . Raising both sides to the power gives exactly the desired lower bound for the density (44). ∎
As a final remark on calculation, note that the simplifications used above apply equally well to Theorem 4. Using the QR decomposition as above, for example, it is possible to show that the function on given by
[TABLE]
satisfies the nonconcentration inequality
[TABLE]
which is an interesting result because this is degenerate when . The necessary calculation is relatively simple when one assumes without loss of generality that one of the two vectors in the pair points in the -direction.
5 Proof of Lemma 3
5.1 Construction of the multisystem
Proof.
The proof begins by establishing that it suffices to assume that the functions are scalar-valued, i.e., that . When , as previously noted in (28), there must exist a symmetric, compact, convex set such that
[TABLE]
for all . Taking to be a polynomial map of degree and assuming the lemma for the case gives
[TABLE]
so the implicit constant can taken to be independent of and of the choice of norm on .
Let be the vector space of polynomials of degree at most and let . Because is bounded, all polynomials of degree are bounded on , and because has nonzero measure, no nontrivial polynomial can vanish identically on . Thus is a norm on , and as in the proof of Lemma 1, one may fix to be any nonzero alternating -linear form on . Using this just as was done earlier, it is possible to find such that and
[TABLE]
for any with constants satisfying for each . For any -tuple of indices in such that , let be the open set of points such that
[TABLE]
where denotes the exterior derivative of at the point . The union of all over all possible must be all of because at every point there must be some for which is nonzero. Since these open sets cover , they cover as well, and there must consequently be a single choice of such that . On , define vector fields by means of the formula
[TABLE]
where in the numerator appears in position of the wedge product and replaces . This means that vanishes if and is identically one on if , which further means that the are locally coordinate vector fields and commute with one another. Moreover, by (50) and the definition of , it must be the case that
[TABLE]
at every point of . Furthermore
[TABLE]
and by the change of variables formula, the last integral will be bounded above by the maximum number of nondegenerate solutions (i.e., solutions where the Jacobian determinant of the system is nonzero) of the system of equations
[TABLE]
in for since on . Letting denote a uniform upper bound for this number of solutions, it follows from Tchebyshev’s inequality that there is a measurable set with such that
[TABLE]
This completes the proof of Lemma 3 in the case .
By induction, assume the lemma has been established up to some level . For convenience, let the sets and at stage be denoted and , respectively. Suppose also that the lemma has been proved for some class of functions which includes all polynomials of degree . Stage follows by applying the already-established base case of the lemma to the space of functions on which defined to be the span of and , . Postponing for the moment the problem of counting solutions of systems of equations during this induction procedure, it must be the case that for any , there is an open set and some measurable such that for some implicit constant depending on and there is a multisystem of size , formed by extending the multisystem of size to add new vector fields defined by (51) on as above. For this extended multisystem, it must be the case that
[TABLE]
for all and all . Moreover, because each collection is locally given by coordinate vector fields with local coordinate functions which themselves belong to the finite-dimensional function space , it follows that
[TABLE]
when are the functions used to construct the . In particular, the coefficients are bounded uniformly in and (and uniformly in and ). By induction, this implies that the final vector fields are linear combinations of the for with coefficients that are uniformly bounded. Because the vectors may be written as linear combinations of all previous with bounded linear coefficients, it follows from (53) that
[TABLE]
with implicit constant independent of and whenever is an ordered multiindex with . Taking the vector fields to be vector fields promised in the statement of the lemma together with and completes the proof with the exception of the unfinished business of counting solutions of systems of equations. ∎
5.2 Underlying geometry and solution counting
The problem of counting solutions is an independent algebraic issue which has already been addressed elsewhere in the case of real analytic functions [gressman2017], so the reader who is not interested in the precise nature of the implicit constants in Theorem 3 may skip the rest of this section and consider Theorem 3 fully proved. For those who continue reading, there are two main purposes to this section. The first is to establish that the systems of equations encountered in the previous section have a bounded number of isolated solutions with an upper bound depending only on the constants as promised. The second major purpose of this section is to demonstrate that there is an intrinsic geometric object which governs the possible number of solutions. This means that a finite upper bound will continue to hold uniformly even when the functions belong, for example, to some o-minimal structure. This intrinsic geometric object is also closely related to certain geometric differential operators which were constructed some time ago to study uniform coordinate-independent sublevel set estimates [gressman2010II]. In a very precise way, the object described below allows one to extend those earlier differential operators to a broader class which includes rational functions of the simpler objects.
Throughout this section, the open set and the polynomials of bounded degree on will be regarded as simply an abstract smooth manifold of dimension and a finite-dimensional vector space of smooth functions on . Given such a pair , a new pair , representing a sort of abstract derivative of the original pair, is constructed as follows. Let be the bundle of nonvanishing -forms over points of , i.e., points of are nonvanishing -forms , where the subscript is used to indicate that acts as an alternating -linear form on the tangent space at . Let be the vector space of smooth functions on spanned by the functions
[TABLE]
and
[TABLE]
The construction of allows one to extend the class of functions to a broader class involving derivatives of the functions in without constructing vector fields or coordinate systems. The cost of the construction is the change of dimension of from to , which roughly corresponds to including a new indeterminate variable. If is the one-dimensional interval , for example, then one can show that is diffeomorphic to and is spanned by the functions for and functions of the form where is the new indeterminate. In higher dimensions, the situation is somewhat more complex but still analogous.
Iterating the construction of and gives a sequence of manifolds and function spaces on , (with and ). The spaces have dimension and have fiber bundle projections
[TABLE]
For convenience, let be the projection map from to . The space is spanned by functions of the forms
[TABLE]
and
[TABLE]
for . For convenience, define to be the vector space of functions on which are of the form (54) only. One may also also regard to be a subspace of by composing with the projection .
The manifolds completely capture the analysis and geometry of the vector fields and the function spaces constructed in Lemma 3. In a practical sense, this is because the problem of counting solutions can be lifted from to . This idea is formalized by the following lemma.
Lemma 4**.**
Suppose consists of a finite-dimensional vector space of smooth functions on . Let be the vector spaces of functions as constructed in the proof of Lemma 3, i.e., is the span of and , , for vector fields defined as in (51) for some . Then the number of nondegenerate solutions of the system
[TABLE]
where , , for a given open set is equal to the number of nondegenerate solutions of a corresponding system
[TABLE]
where , .
Although the manifold is somewhat more abstract than itself, Lemma 4 is a significant result for two reasons. The first is that it allows one to sidestep inherent difficulties of understanding the vector fields when counting solutions. The second is that the functions in are never more complex than derivatives of the functions in and polynomials, as shown by the following proposition:
Proposition 3**.**
Suppose that is a diffeomorphism from some open set onto some open subset of . For each , there is a diffeomorphism from onto such that for every ,
[TABLE]
where the determinant on the right-hand side is the usual Jacobian determinant in the coordinates .
Proof.
By induction on , let be given by
[TABLE]
where are differentials of the coordinate functions on induced by . As can be seen from the formula, these coordinates have the property that the canonical projection from to corresponds to dropping the variable . It is easy to check in these coordinates that
[TABLE]
for any . Definition (54) immediately gives (57). ∎
An important corollary is that when the functions are polynomials of bounded degree in a suitable coordinate system (as will always be the case when applying the result to Lemma 3), the functions may also be regarded as polynomials of a suitably bounded degree in the appropriate coordinates as well. Thus the number of nondegenerate solutions to the system (56) would immediately be bounded by Bézout’s Theorem just as applied in the proof of Theorem 1.
The proof of Lemma 4 proceeds by showing that every function (the function space analogous to Lemma 3) must agree with a function in (the function space on ) on a suitably-constructed -dimensional submanifold of which is defined implicitly via a system of equations in . This implies that the system of equations (55) involving the somewhat mysteriously-constructed functions can be naturally lifted to an system on where the functions in the system belong to . Because both and are vector spaces, the only part of this assertion which is somewhat cumbersome to prove is that ratios of wedge products a la (51) appear as values of functions in restricted to suitable submanifolds. This is accomplished by a trivial induction on combined with the following proposition, which shows how to identify quantities like (51) via the identity (59) and also demonstrates in (58) how to inductively identify the -dimensional submanifold of on which the desired identities hold.
Proposition 4**.**
Suppose for each and let
[TABLE]
Then
The set is a manifold and the projection is a diffeomorphism of any open subset of and its image.
Next suppose that and are smooth functions on some open subset for which there exist such that for each , restricts to on and likewise for and . In other words, on and on for each . If one defines
[TABLE]
the following must also be true:
The image consists of exactly those points in at which . 2. 3.
There is a function in which restricts to
[TABLE]
at every point of where the denominator is nonzero, namely
[TABLE]
on .
Proof.
From the formula (57) in the coordinates on , it is clear that every must equal times a polynomial in with coefficients that are smooth functions of . There are several important consequences of this simple observation. The first is that is independent of when . When , it also follows that
[TABLE]
This means that the Jacobian matrix always has full rank at every point of since the Jacobian matrix it is triangular and its diagonal entries are never zero (since on for each and by assumption for each as well). By the Implicit Function Theorem, this guarantees that is always a manifold regardless of the choice of the particular ’s. Moreover, because of this triangular structure and the linearity of as a function of , it is easy to see that for a given , there is at most a unique value of such that , and such a solution exists if and only if is not an identically zero function of . As already noted, if such a value of exists, it is necessarily true that the Jacobian determinant must be nonvanishing at . Therefore by the Implicit Function Theorem, the projection must be a diffeomorphism of any open subset of and its image. This establishes the first conclusion of the proposition.
Because is a diffeomorphism of any open subset of and its image, one may define coordinates on using by lifting the coordinate function via , i.e., by mapping to , where is any suitable open subset of on which a coordinate system is defined. Let denote the associated coordinate vector fields. It follows that for each . In the coordinates on , this means that
[TABLE]
for each . Since each is constant on , it must be the case that on for each pair of indices . Therefore by applying the usual row operations to the Jacobian determinant (57) (assuming that distinct rows of the matrix correspond to partial derivatives with respect to distinct coordinate variables), it must be the case that
[TABLE]
on (using the triangular structure of and (60)). If it is also known that restricts to on , then , so
[TABLE]
in the coordinates when .
Now assuming that is selected in such a way that (58) holds, it follows that for a given point , the equation will have a solution if and only if at the point , which will occur exactly when . Because every point of is contained in an open set on which a coordinate system is defined, this forces the second conclusion of the proposition to be true, namely, that will be exactly the subset of at which .
As for the third conclusion of the proposition, assuming that is a point at which and that ,
[TABLE]
assuming , which must be the case when . Because was arbitrary, the formula holds on all of as well. ∎
The proof of Lemma 4 follows quickly from Proposition 4. By induction on , once it is known that there are suitable for such that every function of the form
[TABLE]
for has a corresponding function in which restricts to on , the third conclusion of the proposition establishes that the same property must hold at stage as well. This is because the functions in the denominator of (51) defining the new vector fields belong to the span of and , which means by induction that each such function is the restriction to of a function in . These extended functions define via (58). The key point is that the vector fields all have the same denominator, so the same choice of defining works simultaneously for the application of any one of the vector fields via the identity (59).
A consequence of this observation is that when restrict to on some open subset of , then every solution of the system of equations
[TABLE]
for will correspond to a solution of the augmented system
[TABLE]
in (in the sense that will map solutions in injectively to solutions in of the augmented system) and that the mapping preserves nondegeneracy in the sense that for a solution point in if and only if . This latter observation follows immediately from the equality of (57) (when fixing ) and (62). Thus Lemma 4 must be true. This completes the proof of Lemma 4 and consequently the proofs of Lemma 3 and Theorems 3 and 4 as well.
6 Further applications to Radon-like operators
To close, it is illuminating to return to the context of averaging operators (1) of Theorem 1 and explicitly see how Theorem 3 applies, as was abstractly indicated by Example 4 in Section 1.2. For convenience, it will be assumed that the map has the form
[TABLE]
where for some integer (in which case ) and for some integer . A short calculation gives that
[TABLE]
because the determinants in the original definition (2) have block structure in the first rows and last columns. If the coordinates of are labelled , then this formula for agrees with the wedge product
[TABLE]
where is the exterior derivative in the variables only. From this observation, it follows that has the particularly simple form
[TABLE]
where is the Jacobian matrix of .
Example 1 (Hausdorff measure). Let be the real associative algebra666The algebra is an example of a Clifford algebra. generated by elements which are subject to the relations for all , when , and . The dimension of the algebra as a vector space over the reals is , and
[TABLE]
for any real numbers . Consequently if are the matrices which express the action of left multiplication in by , respectively, in the standard basis, then
[TABLE]
If and one defines a mapping
[TABLE]
for polynomial functions , then the Radon-like operator
[TABLE]
where , has the corresponding functional
[TABLE]
where denotes the Euclidean distance of points in when expressed in coordinates with respect to the standard basis. This vanishes to order on the diagonal, so when and the optimal measure of Theorem 3 is comparable to the -dimensional Hausdorff measure on the image of , assuming that is locally injective. If for a set on which , then (4) must apply and consequently
[TABLE]
for all Borel sets . In particular, note that the image of need not have any curvature whatsoever; in this case, the multiplicative structure of the Clifford algebra grants the operator (63) a sort of rotational curvature regardless of the higher-order geometric properties of . If simply parametrizes a linear subspace, then (63) becomes a restricted -plane transform; the estimate (64) can be taken to be global in and consequently scaling and Knapp examples give that the integrability exponents appearing in (64) are sharp.
Example 2 (Determinantal measure). Generalizing the first example, suppose that is a polynomial map. The Radon-like operator
[TABLE]
where and denotes matrix-vector multiplication, has functional
[TABLE]
The order of vanishing of on the diagonal must be at least . The associated measure from Theorem 3 is comparable to the -dimensional determinantal Hausdorff measure from Section 1.2 restricted to the image of (assuming, for example, that is locally injective). The measure must be absolutely continuous with respect to Lebesgue measure, so whenever it is nonzero, one can take where is any set on which the Radon-Nykodym derivative is at least comparable to . Then (4) will hold and the conclusion (5) of Theorem 1 will hold with and . An extreme case occurs when and is simply a linear isomorphism. Fixing to Lebesgue measure on , then the isodiametric determinantal inequality on proved in Proposition 2 implies the global, scaling-invariant inequality
[TABLE]
for all Borel sets .
A modification of this example also applies to the case of convolution with measures on quadratic submanifolds of dimension in . Specifically, fixing
[TABLE]
under the assumption that for each , then the operator
[TABLE]
has a corresponding functional given by
[TABLE]
where denotes the matrix whose -entry equals
[TABLE]
Since is a polynomial of degree exactly , the density (44) is a constant function. In the framework of geometric invariant theory, the infimum (44) is comparable to the infimum over the -orbit of the polynomial , where elements of act by linear coordinate changes (see, for example, the work of Richardson and Slodowy [rs1990] extending the Kempf-Ness minimum vector construction to the context of real algebraic geometry). Thus the infimum is zero if and only if belongs to the nullcone of the representation. Because the nullcone is exactly the zero set of all -invariant polynomials in the coefficients (which is a finitely generated algebra), this reduces the problem of applying Theorem 1 to (66) to a finite list of calculations once a set of generating -invariant polynomials is known. This approach complements earlier work of the author [gressman2015] which formulates a slightly weaker result in terms of the critical integrability exponent of the polynomial .
Example 3 (Affine measure). For the Radon-like operator
[TABLE]
where , , and is a polynomial map (and is the dot product), the corresponding functional equals
[TABLE]
up to a factor of . The order of vanishing must be at least but will generally be much larger. If and is locally injective, then the sharp measure from Theorem 3 is comparable to Oberlin’s affine measure on the image of ; for general submanifolds, this measure will be comparable to affine submanifold measure as recently constructed by the author elsewhere [gressman2017] (although the comparability may fail in special cases, e.g., when includes no mixed monomials). Unlike the Clifford algebra example, the nondegeneracy of affine submanifold measure on depends on higher-order geometry of and not just its first derivatives. Once again, because this measure is necessarily absolutely continuous with respect to Lebesgue measure, if the image of has nonzero affine Hausdorff measure, then a suitable can be defined to apply Theorem 1 to (67).
References
