Square functions for bi-Lipschitz maps and directional operators
Francesco Di Plinio, Shaoming Guo, Christoph Thiele, Pavel, Zorin-Kranich

TL;DR
This paper establishes new square function bounds for bi-Lipschitz perturbations and directional operators, providing insights into Hilbert transforms along curves and a novel proof related to direction fields.
Contribution
It introduces a Littlewood-Paley diagonalization for bi-Lipschitz maps and derives new bounds for directional operators, advancing understanding of related harmonic analysis problems.
Findings
Diagonalization result for bi-Lipschitz maps
Square function bounds for directional operators
Alternative proof of Katz's theorem on direction fields
Abstract
First we prove a Littlewood-Paley diagonalization result for bi-Lipschitz perturbations of the identity map on the real line. This result entails a number of corollaries for the Hilbert transform along lines and monomial curves in the plane. Second, we prove a square function bound for a single scale directional operator. As a corollary we give a new proof of part of a theorem of Katz on direction fields with finitely many directions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Square functions for bi-Lipschitz maps and directional operators
Francesco Di Plinio
Department of Mathematics, University of Virginia, Kerchof Hall, Box 400137, Charlottesville, VA 22904-4137, USA
,
Shaoming Guo
Indiana University Bloomington, 831 E Third St, Bloomington, IN 47405, USA
,
Christoph Thiele
and
Pavel Zorin-Kranich
Mathematical Institute, University of Bonn, Endenicher Allee 60, 53115 Bonn, Germany
Abstract.
First we prove a Littlewood-Paley diagonalization result for bi-Lipschitz perturbations of the identity map on the real line. This result entails a number of corollaries for the Hilbert transform along lines and monomial curves in the plane. Second, we prove a square function bound for a single scale directional operator. As a corollary we give a new proof of part of a theorem of Katz on direction fields with finitely many directions.
2010 Mathematics Subject Classification:
42B25
1. Introduction
This paper grew out of a study of variable directional operators in the plane. We present two main results together with some corollaries.
It is a folklore conjecture and discussed by several authors, for example [Ste93], [LL10], [Guo17], that Lipschitz is the critical regularity assumption on a direction field to yield boundedness of some associated directional operators. Possibly at the heart of positive results in this direction appears to be a one dimensional Littlewood-Paley diagonalization estimate for bi-Lipschitz maps, which is our first main theorem.
Theorem 1.1**.**
Let be a Lipschitz function with and consider the change of variable .
Let be a Schwartz function on such that identically equals on and vanishes outside of . Let be another Schwartz function on such that is supported on . Let be the Littlewood–Paley operators associated to , where . Then
[TABLE]
Note that when the Lipschitz norm of becomes too large, then in general fails to be a bijection and the estimate of the theorem breaks down. By rescaling with and a convexity argument the estimate of the theorem remains true for the following expressions in place of the left hand side:
[TABLE]
We call this result a Littlewood-Paley diagonalization result, since it compares for suitable normalization of and
[TABLE]
with the diagonal term
[TABLE]
The diagonal term by Littlewood-Paley theory and the Fefferman-Stein maximal theorem can and typically will be controlled in norm by that of any of the following square functions
[TABLE]
where denotes the Hardy-Littlewood maximal operator.
An application of Theorem 1.1 is to the directional Hilbert transform in the plane defined for measurable as
[TABLE]
Corollary 1.3**.**
Assume that has Lipschitz constant for almost every . With notation as in Theorem 1.1, we have
[TABLE]
Here and convolution with act in the second variable.
As outlined above, this theorem reduces bounds for to bounds for a square function. The part of the following corollary is then immediate.
Corollary 1.4**.**
Let be such that has Lipschitz constant for almost every . Assume further with notation as in Theorem 1.1 that
[TABLE]
for some and . If , then
[TABLE]
If , then
[TABLE]
Lacey and Li [LL10] proved (1.5) for all (including a weak type endpoint), and they stated a condition [LL10, Conjecture 1.14] on under which they extended (1.5) to all in a neighborhood of . That condition is known to hold for analytic vector fields, and more generally for a class of vector fields previously considered by Bourgain [Bou89]. Lacey and Li have also deduced (1.6) from (1.5) with with Lipschitz assumption on the vector field replaced by .
The estimate (1.5) for all is known for -parameter vector fields [Bat13] and vector fields constant along Lipschitz curves [Guo17]. In these cases the conclusion (1.7) has been obtained in [BT13] and [Guo17], respectively. Our argument for the corollary follows closely [BT13], the main additional observation being that (1.5) can be used as a black box, whereas in [BT13] elements of the proof of this estimate for one-parameter vector fields have been used.
We also recall that the Lipschitz regularity hypothesis in Corollary 1.4 cannot be substantially relaxed. Once the segments of integration emanating from the points of a fixed vertical line start to overlap, they may do so in a bad way and one can disprove boundedness by testing on characteristic functions of Perron trees, see e.g. [Ste93, Section X.1].
Adding curvature to the picture by defining
[TABLE]
where may be interpreted either as or , we may argue similarly as above but remove the conditionality thanks to the results in [Guo+16]. We obtain
Corollary 1.9**.**
For every , , and every , there exits such that for every Lipschitz function with , we have
[TABLE]
Our second main result concerns bounds for the square function of the single scale directional operator
[TABLE]
associated to a Schwartz function .
Theorem 1.12**.**
Let be a measurable function. Then
[TABLE]
The operator is in general not bounded on unless . Even if we assume to be Lipschitz in the vertical direction, we may not apply our first main theorem if does not have suitable compact support, and the operator remains unbounded in general.
As an application of this result, we elaborate on a remark made by Demeter in [Dem10].
Corollary 1.14**.**
Assume the measurable function takes at most different values. Then
[TABLE]
Indeed, Demeter proves the sharper endpoint version of this estimate for , reproducing an earlier result by Katz [Kat99]. Demeter proposes an alternative proof of this result using an inequality by Chang, Wilson, and Wolff [CWW85], in the same vein as in his proof of [Dem10, Theorem 2]. Theorem 1.12 allows to follow through with this proposal, albeit only for . For the operator obtained by replacing in (1.11) with a one-dimensional singular integral kernel, the same quantitative estimate as (1.15), up to -losses in the power of when is sufficiently close to , holds when the finite range of is assumed to have additional structure [DD14]. For instance, one may take . Thus, it is of interest whether the methods behind Corollary 1.14 may be applied to the singular integral case, with the aim of lifting the structure restrictions appearing in [DD14].
FDP was partially supported by NSF grants DMS-1500449 and DMS-1650810, by the Severo Ochoa Program SEV-2013-0323 and by Basque Government BERC Program 2014-2017. SG and CT acknowledge support by the NSF under grant DMS-1440140 through participation in the harmonic analysis program at MSRI in Spring 2017. CT and PZK acknowledge support by DFG-SFB 1060 and the Hausdorff Center for Mathematics in Bonn.
2. Lipschitz vector fields
2.1. Carleson embeddings with compactly supported test functions
We refer to [DT15, Section 2 and 3] for the general theory of outer measure spaces. In this section we use the outer measure space with the collection of distinguished sets consisting of the tents
[TABLE]
and an outer measure generated by .
Let be a Dini modulus of continuity, that is, is a function that is subadditive in the sense
[TABLE]
and has finite Dini norm . Let be the class of testing functions that satisfy
[TABLE]
For locally integrable functions we define the embeddings
[TABLE]
Theorem 2.4** (cf. [DT15, Theorem 4.1]).**
For every we have
[TABLE]
Moreover, we have the endpoint estimates
[TABLE]
The main difference from [DT15, Theorem 4.1] is the supremum over in the definition of , whereas [DT15, Theorem 4.1] uses a fixed . This supremum does not affect the proof strongly, but is important for our application. The precise choice of the class of test functions is not important for this application, but the Dini regularity condition appears naturally in the proof.
We linearize the supremum in the definition of by choosing for each pair a function for which the supremum is almost attained. Denote then . This is an normalized wave packet at scale . The almost orthogonality of these wave packets is captured by the following estimate.
Lemma 2.5**.**
If then
[TABLE]
Proof.
Using the cancellation condition (2.1) and the support condition we write
[TABLE]
We use the almost orthogonality statement in Lemma 2.5 to deduce a square function estimate for .
Lemma 2.6**.**
[TABLE]
Proof.
We begin with a measurable selection of functions that almost extremize . Expand the square of the left hand side of (2.7)
[TABLE]
using the estimate
[TABLE]
in the last inequality. It suffices to verify
[TABLE]
By Lemma 2.5 and using bounded support of the ’s we have
[TABLE]
This finishes the proof of Lemma 2.6. ∎
Proof of Theorem 2.4.
We may assume that the superlevel sets , where is the uncentered Hardy–Littlewood maximal function, have finite measure for all , since otherwise the right-hand side of the conclusion is infinite.
Let be a Whitney decomposition of the superlevel set . Let denote the center and the diameter of . Let
[TABLE]
and note that
[TABLE]
The claim of the theorem will therefore follow from the more precise results
[TABLE]
Let . Then no ball with is contained in a Whitney cube. It follows that, for some constant that depends only on the dimension, the ball is not contained in . Hence
[TABLE]
This completes the proof of (2.9). Now we show (2.10). The Calderón–Zygmund decomposition , associated to the Whitney decomposition has the properties
- (1)
, 2. (2)
, 3. (3)
, 4. (4)
.
Using the bounded support condition on the wave packets and Lemma 2.6 we obtain
[TABLE]
Hence (2.10) holds with replaced by . By sublinearity of the embedding map and subadditivity of the outer norm it remains to show (2.10) holds with replaced by . More explicitly, for every tent we want to show
[TABLE]
We know
[TABLE]
By logarithmic convexity of sizes it therefore suffices to show
[TABLE]
Claim 2.12**.**
.
Proof of Claim 2.12..
Notice that, due to support constraints, can only be non-zero if . Moreover, under this condition and choosing that almost extremizes we obtain
[TABLE]
Hence
[TABLE]
This finishes the proof of Claim 2.12. ∎
In order to show (2.11) notice that only the Whitney cubes contribute to .
[TABLE]
This finishes the proof of Theorem 2.4. ∎
2.2. Carleson embeddings with tails
It is possible to adapt the proofs in Section 2.1 to embeddings defined using test functions with tails. Since we do not need testing functions with sharp decay rates for tails, we will instead estimate such embeddings by averaging the results in Section 2.1.
In this section we work in dimension and consider the following embedding maps:
[TABLE]
where
[TABLE]
The smoothness and decay conditions in these embeddings are not optimal, but they suffice for our purposes. Decomposing the testing functions and into series of compactly supported bump functions as in [Mus+06, Lemma 3.1], see also Lemma 3.4 in this article, we can deduce the embeddings
[TABLE]
for from Theorem 2.4.
2.3. Jones beta numbers
Let be a Lipschitz function and let be its distributional derivative, so that . Let be a compactly supported bump function with
[TABLE]
and
[TABLE]
Let be an normalized mean zero bump function at scale . Let
[TABLE]
be the average slope of near at scale and let
[TABLE]
This definition includes the supremum over the range of uncertainty around , which seems convenient.
Lemma 2.20**.**
With the notation (2.19) we have
[TABLE]
Proof.
Let , . By the fundamental theorem of calculus and Calderón’s reproducing formula for we can write
[TABLE]
We estimate the two terms on the right-hand side separately. In the first term we note , where is also an normalized mean zero bump function at scale , by assumption (2.17). Therefore
[TABLE]
Since is almost constant at scale , this can be further estimated by
[TABLE]
We split the second term via
[TABLE]
Then
[TABLE]
and this can be absorbed into the estimate for . The latter term from (2.21) is bounded by
[TABLE]
Since , the function in the square brackets is a mean zero normalized bump function at scale with constant by the fundamental theorem of calculus, so
[TABLE]
This finishes the proof of Lemma 2.20. ∎
Lemma 2.22** (cf. [Jon89, Lemma 3]).**
.
Proof.
We have to show
[TABLE]
with the implicit constant independent of .
We estimate the size on the tent centered at with height separately for the two terms in the conclusion of Lemma 2.20. For the first term we consider the square of the size:
[TABLE]
For the second term we consider the size
[TABLE]
The conclusion follows from (2.16). ∎
Corollary 2.23** (cf. [Jon89, Lemma 4]).**
Let and
[TABLE]
Then
[TABLE]
The difference from the original formulation of Jones’s beta number estimate is that we take a supremum over an uncertainty region in all available parameters.
2.4. Littlewood–Paley diagonalization of Lipschitz change of variables
Proof of Theorem 1.1.
Since the Lipschitz norm of is strictly smaller than , the change of variable is invertible and bi-Lipschitz. Denote its inverse function by , so that .
Write
[TABLE]
This integral is a linear combination of the functions that we view as non-linear deformations of wave packets centered at . The main idea is to replace the non-linear change of variable in the argument of by the linear change of variable , where is the average slope of the function in the sense of (2.18). Since , the function
[TABLE]
has Fourier support inside , so it is annihilated by .
It remains to estimate the error that has been made in approximating the non-linear change of coordinates in the argument of by a linear one. To this end we compute the difference of the arguments:
[TABLE]
By the Lipschitz property of and since we have
[TABLE]
and it follows that both and have (signed) distance of the order from zero. Therefore
[TABLE]
It follows that
[TABLE]
Multiplying this with a function and integrating in we obtain the estimate
[TABLE]
The sum over can be dominated by since all functions are almost (up to a multiplicative factor) constant on Carleson boxes . By [DT15, Proposition 3.6] and outer Hölder inequality [DT15, Proposition 3.4] this is bounded by
[TABLE]
Since the function is bi-Lipschitz, it does not affect outer norms up to a multiplicative constant. To see this note that
[TABLE]
for a sufficiently large constant .
Thus we obtain the estimate
[TABLE]
Estimating the first term using (2.16), the middle term using Corollary 2.23, and the last term using (2.15) we obtain the claim. ∎
2.5. Application to truncated directional Hilbert transforms
Proof of Corollary 1.3.
By Minkowski’s integral inequality we obtain
[TABLE]
This finishes the proof of Corollary 1.3. ∎
In the remaining part of this section we prove Corollary 1.4. As an initial reduction observe that it suffices to estimate the restriction of to a vertical strip; more precisely we need an estimate of the form
[TABLE]
for functions supported in the vertical strip . This reduction will be important in the case . Also, it is easy to see that we may replace by the smoothly truncated operator
[TABLE]
where is a smooth even function with , for some large and . This is possible because the maps are uniformly bi-Lipschitz for , so is a bounded operator on .
We note that the operators (as well as the analogous ones obtained with from (2.25) in place of ) are also trivially bounded in uniformly in . To see this split
[TABLE]
The first term is a one-dimensional truncated Hilbert transform on each horizontal line, and therefore bounded on any , . The second term can be written as
[TABLE]
This is in turn bounded by
[TABLE]
where denotes the Hardy–Littlewood maximal function in the -th variable. The differential operator is bounded on the subspace of functions with for and therefore we obtain estimates for this term.
Remark 2.26**.**
The same argument can be used to estimate on functions with small horizontal frequencies, thus simplifying an argument in [GT16, Section 3].
Below, we work with from (2.25) in place of , and omit the tilde for simplicity of notation. By linearity and the Calderón reproducing formula it suffices to estimate the operator
[TABLE]
in . By superposition of Corollary 1.3 we obtain estimates for the off-diagonal term
[TABLE]
so it suffices to estimate the diagonal term
[TABLE]
By discretization and Littlewood–Paley theory it suffices to show
[TABLE]
or, more generally,
[TABLE]
for arbitrary functions supported in the strip . In the case this follows immediately from the single band hypothesis (1.5) and Fubini’s theorem.
In order to obtain the larger range of ’s in the case we use the technique for proving vector-valued estimates introduced in [BT13] (see also [DS15] for more applications of this technique).
Theorem 2.27**.**
Let and let be a sequence of subadditive operators. Let and suppose that for every pair of (non-null, finite measure) measurable sets , with there exist subsets , with
[TABLE]
for every and every function supported on we have
[TABLE]
Then for any functions we have
[TABLE]
Proof.
By the monotone convergence theorem it suffices to consider a finite sequence of operators as long as we obtain estimates that do not depend on its length. The hypothesis (2.28) continues to hold for the operator defined on -valued functions, and we know
[TABLE]
with some constant given by the qualitative boundedness assumption on ’s and depending on the length of the sequence of operators. By duality of Lorentz spaces this is equivalent to
[TABLE]
for all finite measure sets and all functions with . We have to find a universal upper bound for .
Let be measurable sets with finite measure and be the major subsets given by the hypothesis. Then for any function with we have
[TABLE]
by Hölder’s inequality and the hypothesis. It follows that for any function with we have
[TABLE]
Taking a supremum over we obtain . ∎
Corollary 1.4 will be obtained via an application of Theorem 2.27 to the operators , with the choice . The corresponding assumption (2.28) in Theorem 2.27 will follow by interpolation of the estimates
[TABLE]
where , , and are as in Theorem 2.27, are arbitrary measurable subsets, , and is in a neighborhood of [math].
The set of pairs for which the estimate (2.29) holds is clearly convex. Hence it suffices to establish (2.29) near the vertices of the dashed triangle in Figure 1. The intersection of the line with this triangle corresponds to the range of ’s claimed in (1.7).
We will use Estimates 16, 17, 21, and 22 from [BT13], which do not rely on the single parameter assumption on the vector field made in [Bat13, BT13]. One twist is in the proof of Estimate 21, where we have to use a version of [BT13, Theorem 8] for Lipschitz vector fields. This result goes back to [LL06]; a slightly simplified version of the proof of the required covering lemma in [BT13] is presented in Appendix A. The covering lemma for Lipschitz vector fields only holds for parallelograms of bounded length. This is the reason for restricting the operator to a vertical strip: we can apply the covering lemma to the intersection of parallelograms with this vertical strip. The other difficulty is that we are dealing with a (smooth) truncation of the Hilbert kernel, so the results of [Bat13] do not directly apply. The easiest way to work around this seems to be running the argument in [Bat13] with more general wave packets which can be used to assemble also the truncated Hilbert kernel .
2.5.1. Using the single band estimate below
The hypothesis (1.5) shows in particular that (2.29) holds with .
2.5.2. Using the Córdoba–Fefferman covering argument
By Estimates 16, 17, and 22 in [BT13] we can estimate the left-hand side of (2.29) by
[TABLE]
for any integer , where both sums are over positive dyadic numbers.
The (geometric) sum over has two critical points: and . This gives the estimate
[TABLE]
The sum over has a critical point with , and we obtain the estimate
[TABLE]
This proves the claim with , . We can make approach by choosing suitably large.
2.5.3. Using the Lacey–Li covering argument
By Estimates 16, 17, and 21 from [BT13] we can estimate the left-hand side of (2.29) by
[TABLE]
The sum over now has two critical points with and with and is dominated by the minimum of the two corresponding terms, so we have the estimate
[TABLE]
The sum over has a critical point at . This gives the estimate
[TABLE]
Making small we can make approach . This completes the proof of Corollary 1.4.
Remark 2.30**.**
The upper part of the solid polygon in Figure 1 yields the hypothesis of Theorem 2.27 for any . This implies that the operator maps into a directional Triebel–Lizorkin space of type (provided that is Lipschitz in the vertical direction). More precisely,
[TABLE]
Indeed, the left-hand side is monotonically decreasing in , so it suffices to consider . With a suitable choice of we may write . For notational simplicity we consider only the contribution of . By the Fefferman–Stein maximal inequality we may replace by larger Littlewood–Paley projections such that .
In the diagonal term we use the Fefferman–Stein maximal inequality, the vector-valued estimate provided by Theorem 2.27 with , monotonicity of norms, and Littlewood–Paley theory to estimate
[TABLE]
In the off-diagonal term we use monotonicity of norms, Littlewood–Paley theory, and Corollary 1.3 to estimate
[TABLE]
2.6. Application to Hilbert transforms along Lipschitz variable parabolas
Proof of Corollary 1.9.
In the following, we will assume for notational convenience that almost everywhere. The region that can be handled similarly, while the region is trivial by Fubini as the operator acts only in the first variable. By the trivial analogue of Corollary 1.3, it suffices to show
[TABLE]
We use where is as defined before acting in the second variable. We note that for
[TABLE]
we have by an application of the fundamental theorem of calculus
[TABLE]
Hence we have for the integral over small values of
[TABLE]
[TABLE]
[TABLE]
The former term (2.32) can be estimated using the vector-valued estimate for the maximally truncated Hilbert transform. Using integrability of near zero we estimate the latter term (2.33) by
[TABLE]
Here we have used the Fefferman–Stein maximal inequality and Littlewood-Paley theory.
We turn to the remaining part of the kernel with and . Note we may restrict the summation over to , as for the domain of integration is empty. We will break up the integral into lacunary pieces parametrized by and estimate the pieces separately, with suitable power decay in allowing to geometrically sum the estimates.
We introduce Littlewood-Paley projections in the first variable and write and to distinguish projections in first and second variable. Consider the averaging operator
[TABLE]
We note similarly to above for the averaged part of the integral pieces:
[TABLE]
[TABLE]
[TABLE]
The factor in the index of the averaging operator is chosen because it is roughly in the domain of integration. In the former term (2.34) we change variables, replacing by on the positive and similarly on the negative axis and do a partial integration in , noting that by the mean zero property the primitive of the kernel of is a bump function again, to estimate this term by
[TABLE]
[TABLE]
plus two similar boundary terms, which are all estimated by the Fefferman-Stein maximal inequality with power decay in . The latter term (2.35) above is estimated by the same change of variables by
[TABLE]
[TABLE]
which is again estimated by the Fefferman-Stein maximal inequality with decay in .
A similar estimate can be obtained if instead of the sharp cut-off we use a smooth cut-off. More precisely, we will choose cut-off functions as defined in the following operator:
[TABLE]
where is smooth and supported on and for , and where is the largest integer power of less than . Note the auxiliary factor is bounded above and below respectively by and .
Then, with the above arguments, it suffices to estimate the rough part of each piece with some that may depend on as follows:
[TABLE]
Here we point out that this estimate has essentially been established in [Guo+16]. First of all, we recognize that the left hand side of (2.37) is essentially the term (5.13) in [Guo+16], there one has a large power of in the index of but this makes their bound only stronger. By the local smoothing estimates and a certain interpolation argument, the bounds of (2.37) for all have been established in Subsection 5.3 in [Guo+16]. To prove bounds for all , we cite the pointwise estimate (3.19) in [Guo+16], which implies for these that
[TABLE]
A further interpolation gives the desired estimate (2.37) for all for slightly smaller . This finishes the proof of the square function estimate (2.31). ∎
3. Single scale operator
In this section we prove Theorem 1.12. The strategy is to use duality and outer Hölder inequality to reduce the estimate to two estimates of Carleson embedding flavor, the “energy embedding” in Section 3.2 and the “mass embedding” in Section 3.3.
3.1. Tiles and the outer measure space
We subdivide the parameter space into tiles. Each tile can be represented in three equivalent ways:
- (1)
by a shearing matrix
[TABLE]
and the spatial location , . 2. (2)
by the corresponding spatial parallelogram
[TABLE] 3. (3)
or by the corresponding frequency parallelogram and the spatial location
[TABLE]
Figure 2 shows the spatial and the frequency parallelograms of a tile (with ). The frequency picture also includes the symmetric parallelograms (in a lighter shade of gray), because the Fourier transforms of the wave packets associated to tiles will concentrate on both these parallelograms. However, for combinatorial purposes it suffices to consider only the upper parallelogram. The slope of a tile is the number . It is the slope of the lower and the upper side of the corresponding spatial parallelogram. The spatial parallelogram seems to be the most concise description of a tile, so we denote tiles by the letter (for “parallelogram”).
The fact that we are dealing with a single scale operator in Section 3 is reflected in that we define an outer measure on a finite set of tiles with , that is, tiles with the fixed horizontal scale . (The restriction to finite sets of tiles avoids technicalities associated with infinite sums. All estimates will be independent of the specific finite set, so we can pass to the set of all tiles at the end of the argument.) The outer measure is generated by a function whose domain is the collection of all non-empty subsets of . We denote by the parallelogram with the same slope and center as but side lengths multiplied by . For set
[TABLE]
where is a large number to be chosen later. The three sizes that we need are
[TABLE]
3.2. Wave packets and the energy embedding
Let be the set of functions on that satisfy
[TABLE]
for some sufficiently large that will be chosen later and
[TABLE]
We think of as morally supported on and of as morally supported on for .
The normalized wave packets associated to a tile are the functions of the form
[TABLE]
The normalized wave packets, , are the functions . Note that . The spatial and the frequency parallelograms of a tile correspond to the moral space/frequency support of the wave packets associated to this tile.
3.2.1. Almost orthogonality
The fundamental property of the wave packets is their almost orthogonality for tiles with different scales or slopes.
Lemma 3.2**.**
[TABLE]
where can be made arbitrarily large provided that the order of decay in the definition of is sufficiently large.
Proof.
Without loss of generality suppose . We will estimate
[TABLE]
for . This is sufficient because the spatial location of the tiles only affects the phase of the Fourier transforms of the associated wave packets, but not their magnitude.
Correlation decay due to shearing
Let and be a vertical strip of width . The critical intersection is a parallelogram centered at zero of width and height . By the vanishing moments assumption we have
[TABLE]
on the critical intersection. Using the fact that the Fourier transforms and are normalized functions and the decay of these Fourier transforms at infinity we obtain
[TABLE]
Choosing as we may provided that , we obtain
[TABLE]
and this gives the second estimate in the conclusion of the lemma.
Correlation decay for separated scales
Let . Using again the fact that the Fourier transforms and are normalized functions and the decay of Fourier transforms near and at infinity we obtain
[TABLE]
Choosing we obtain
[TABLE]
and this gives the third estimate in the conclusion of the lemma. ∎
3.2.2. Bessel inequality
Lemma 3.3**.**
For each tile fix an normalized wave packet adapted to . Then
[TABLE]
Proof.
Schur’s test
[TABLE]
shows that it suffices to prove
[TABLE]
For a fixed tile we split the above sum according to the shearing matrix of the tile . For a given shearing matrix we distinguish the cases and .
In the case the tile has larger scale than , so the tail of the associated wave packet is more important. For let
[TABLE]
and let , for . Then
[TABLE]
and
[TABLE]
where the first estimate inside the minimum is due to spatial separation and the other two estimates come from Lemma 3.2. Summing this over and we obtain
[TABLE]
In the region we make a similar decomposition with
[TABLE]
The resulting estimate is similar to the above with the roles of and reversed. ∎
3.2.3. Splitting into compactly supported wave packets
In order to obtain a localized Bessel inequality we decompose wave packets into compactly supported parts as in [Mus+06, Lemma 3.1].
Lemma 3.4**.**
For every there exists such that if , then there exists a decomposition
[TABLE]
Sketch of proof.
Let be a smooth function supported on and identically equal to on . Write for its dilates. Let also be smooth functions supported on with
[TABLE]
For and let
[TABLE]
then for and we have
[TABLE]
provided that is sufficiently large. The claimed splitting is given by
[TABLE]
3.2.4. Energy embedding
The energy embedding is defined by
[TABLE]
where the supremum is taken over all normalized wave packets adapted to with a sufficiently large order of decay .
Lemma 3.5**.**
.
Proof.
Let be a maximal collection of tiles with . If also has size , then using subadditivity of it is easy to see that also has size , contradicting maximality. Hence by maximality we have . On the other hand,
[TABLE]
by Lemma 3.3. ∎
Lemma 3.6**.**
.
Proof.
Let and let , , be wave packets that almost extremize . Splitting the corresponding members of using Lemma 3.4 we obtain decompositions , where each is an normalized wave packet adapted to (with a lower order of decay ) and supported on .
By Lemma 3.3 and the support condition we have
[TABLE]
and summing in we obtain
[TABLE]
so that as required. ∎
3.3. Covering lemma for parallelograms and the mass embedding
For completeness we include a slightly streamlined proof of a covering lemma from [BT13]. Covering lemmas of this type go back to [CF75]. We consider parallelograms with two vertical edges as shown below:
R$$A$$B$$C$$D$$I
The height is the common length of and . The shadow is the projection of onto the horizontal axis. The slope is the common slope of the edges and . The uncertainty interval is the interval between the slopes of and . It is the interval of length centered at .
Lemma 3.7** (cf. [BT13, Lemma 7]).**
Let a finite collection of parallelograms with vertical edges and dyadic shadow. Then there exists such that
[TABLE]
and for every we have
[TABLE]
In particular, for every measurable function the sets
[TABLE]
satisfy
[TABLE]
In [BT13] the conclusion (3.10) is stated for one-variable vector fields, but this structural assumption is not used in the proof.
In the proof of Lemma 3.7 we denote by the parallelogram with the same center, slope, and shadow as but height (this definition of is used only here). We need the following geometric observation:
Lemma 3.11**.**
Let be two parallelograms with , , and . If , then .
Let denote the Hardy–Littlewood maximal operator in the vertical direction:
[TABLE]
where the supremum is taken over all intervals containing .
Proof of Lemma 3.7.
We select using the following iterative procedure. Initialize
[TABLE]
While , choose an with maximal . Update
[TABLE]
This procedure terminates after finitely many steps since at each step at least the selected parallelogram is removed from .
By construction
[TABLE]
and (3.8) follows by the weak inequality for .
We prove (3.9) by induction on . For the statement clearly holds. Suppose that (3.9) holds for a given , we will show that it also holds with replaced by . For each let
[TABLE]
All terms in (3.9) in which some occurs at least twice are estimated by the inductive hypothesis. In the remaining terms we may arrange the ’s in the order reverse to the selection order (losing a factor ), and omitting some vanishing terms we obtain the estimate
[TABLE]
We claim that for every we have
[TABLE]
To see this let , so that in particular and . If , then , and Lemma 3.11 shows that , so that on , contradicting . Therefore , so , and Lemma 3.11 shows that
[TABLE]
The inequality (3.15) follows, since otherwise on , contradicting . Hence
[TABLE]
This completes the proof of (3.9). In order to see (3.10) observe that its left-hand side is monotonically increasing in , so it suffices to consider integer values , and in this case the left-hand side of (3.10) is dominated by the left-hand side of (3.9). ∎
3.3.1. Mass embedding
The mass embedding is given by
[TABLE]
Lemma 3.16**.**
Let . If the constant in the definition of is sufficiently large depending on , then .
Recall that now again denotes the parallelogram expanded by the factor both in the horizontal and in the vertical direction.
Proof.
Let , , and let be a collection of tiles such that for . We have to show
[TABLE]
Note that the definition of makes sense for arbitrary parallelograms (not only the dyadic ones that we call tiles). For the enlarged parallelograms we still have , so it suffices to show (3.17) with and a collection of arbitrary parallelograms , provided that the constant in the definition of is at least .
Enlarging the parallelograms in such a way that their shadows become intervals in adjacent dyadic grids and the uncertainty intervals stay the same we preserve the hypothesis up to a multiplicative constant. Hence we may assume that the parallelograms have dyadic shadows.
In view of (3.8) it suffices to consider the parallelograms in the subset provided by Lemma 3.7. By the density assumption and Hölder’s inequality we have
[TABLE]
where in the last passage we have used the estimate (3.10). After division by the middle factor of the right hand side we obtain the claim. ∎
3.4. Estimate for the square function
We finally prove Theorem 1.12. Note that is the integral of against an normalized wave packet associated to a tile that contains and whose uncertainty interval contains . Hence the left-hand side of (1.13) is bounded by
[TABLE]
Dualizing with a function we obtain
[TABLE]
For every we have . Therefore by [DT15, Proposition 3.6] and outer Hölder inequality [DT15, Proposition 3.4] the above is bounded by
[TABLE]
The first term is bounded by by Lemmas 3.5 and 3.6 and interpolation [DT15, Proposition 3.5]. The second term is bounded by by Lemma 3.16 and interpolation [DT15, Proposition 3.5].
3.5. Application to a maximal operator with a restricted set of directions
In this section we prove Corollary 1.14.
Although the operator (1.11) is unbounded for general direction fields , it is clearly bounded (on any , ) with norm as long as is allowed to take at most values. This trivial estimate has been improved to on by Katz [Kat99]. Note that we also have the trivial estimate on , and by interpolation one obtains logarithmic dependence on of the operator norm of (1.11) on also for all . Demeter [Dem10] gives an alternative proof of Katz’s result, and furthermore hints at yet another different proof via reduction to the square function bound Theorem 1.12 by means of the good- inequality with sharp constant due to Chang, Wilson, and Wolff [CWW85]. The first appearance of a similar reduction to square function in the context of maximal multipliers goes back to Grafakos, Honzík, and Seeger [GHS06], and analogous approaches have been since used in Demeter [Dem10] and Demeter with the first author [DD14]. We have not been able to reproduce the endpoint using this technique. However, notice that our square function approach, after interpolation, recovers the result for up to an arbitrarily small loss in the exponent of the logarithm.
Proof of Corollary 1.14.
For , define the dyadic martingale averaging operator
[TABLE]
where the summation runs over all standard dyadic squares in with side length . Further define
[TABLE]
[TABLE]
Let denote the non-dyadic Hardy–Littlewood maximal operator. Chang, Wilson, and Wolff [CWW85, Corollary 3.1] prove that there are universal constants and such that for all and
[TABLE]
Denote the finitely many values of by , , and write for the operator with the constant direction field . Corollary 1.14 follows by Marcinkiewicz interpolation from the weak type inequality
[TABLE]
for . Gearing up for Chang, Wilson, and Wolff we estimate
[TABLE]
Using (3.19) we estimate
[TABLE]
provided .
The function in (3.21) is pointwise dominated by the standard Hardy–Littlewood maximal operator, because and compose to some averaging operator at scale [math]. Therefore
[TABLE]
To control (3.22) we introduce a suitable Littlewood–Paley decomposition in the second variable, note that commutes with , and estimate pointwise
[TABLE]
where is the -maximal operator in the vertical direction for any fixed with as in (3.12), is the usual two-dimensional Hardy–Littlewood maximal operator, and the pointwise estimate follows from [GHS06, Sublemma 4.2] applied in the vertical direction. The Fefferman–Stein maximal inequalities and Theorem 1.12 give
[TABLE]
With Tchebysheff we obtain
[TABLE]
and this concludes the proof of Corollary 1.14. ∎
Appendix A Lacey–Li covering argument
Lacey and Li [LL10] have introduced a certain family of maximal operators associated to a vector field , which they called the “Lipschitz–Kakeya” maximal operator:
[TABLE]
where, using the notation from Section 3.3, is the collection of those parallelograms with ; that is, the vector field points within the uncertainty interval of on (at least a) -portion of . These authors proved that such maximal operators have weak type operator norm if the vector field is Lipschitz. In the same paper, they have further showed that an bound for this operator for any implies the estimate for the single band version of the directional Hilbert transform. Bateman and Thiele [BT13] gave a streamlined proof of the weak type estimate for this maximal operator in the case of a one-variable vector field and used it to obtain square function estimates of the type (1.2) for the directional Hilbert transform.
In this section we further simplify the proof of the weak type estimate for this maximal operator, also taking care of Lipschitz vector fields. We use the notation from Section 3.3 and write . The main part of the proof is the following covering argument.
Theorem A.1**.**
Let and let be a finite collection of parallelograms with vertical edges and dyadic shadow such that for each we have
[TABLE]
and . Then there is a subset such that
[TABLE]
The set is constructed as in Lemma 3.7, so that (A.2) holds by construction. In the remaining part of this section we will show (A.3). Expanding the square on the left-hand side of (A.3) and using symmetry we obtain the estimate
[TABLE]
where is the set of pairs such that and has been chosen before . The former term is clearly bounded by the right-hand side of (A.3). In the latter term we notice first that by (3.15) we have
[TABLE]
and this is also bounded by the right-hand side of (A.3). Hence it suffices to estimate
[TABLE]
where
[TABLE]
First we clarify the position of relative to when .
Lemma A.5**.**
Suppose . Then
[TABLE]
Proof.
We distinguish two cases:
- (1)
. In this case we use the definition of . 2. (2)
. In this case we have
[TABLE]
and in particular . If the conclusion was false, then , and by Lemma 3.11 we obtain . This contradicts the hypothesis that was added to after .
∎
The next lemma gives a condition for two parallelograms to have comparable slopes. This is the only place where the Lipschitz hypothesis is used. Denote the projection onto the first coordinate by .
Lemma A.6**.**
Assume . Suppose and . Then
[TABLE]
Proof.
Let . The distance of the points such that and is bounded above by
[TABLE]
Choosing and and using the Lipschitz hypothesis and Lemma A.5 we obtain
[TABLE]
The conclusion follows. ∎
The basic estimate for the size of the intersection of two parallelograms is the size of the intersection of infinite stripes containing them:
Lemma A.7**.**
Let . Then
[TABLE]
Proof.
By a shearing transformation we may assume that the central line segment of is horizontal. Let be the central slope of . Then is contained in a parallelogram of height and base . On the other hand, . ∎
We decompose the set dyadically according to the distance between and . Specifically, for let
[TABLE]
For a fixed we will estimate the contribution of to (A.4) using a stopping time argument. For a dyadic interval denote .
Lemma A.9**.**
Let be a dyadic interval such that there exists with . Then
[TABLE]
Proof.
Let be the parallelogram with that has been chosen last. Let
[TABLE]
Since , it suffices to show
[TABLE]
Assume for contradiction that (A.10) fails. Let . By Lemma A.5 we have and thus
[TABLE]
In particular
[TABLE]
The parallelogram has been selected for after the parallelogram and the parallelograms . To obtain a contradiction with the construction of it suffices to show that
[TABLE]
where is the vertical directional maximal function, is larger than on the parallelogram .
First assume there exists with . Note that
[TABLE]
Applying Lemma 3.11 to the rectangles and we obtain
[TABLE]
on , which proves Lemma A.9 in the given case.
Hence we may assume
[TABLE]
for every . We then have on that
[TABLE]
This completes the proof of Lemma A.9. ∎
Corollary A.11**.**
[TABLE]
Proof.
Let be the set of maximal dyadic intervals contained in that do not contain for any . For each let denote its dyadic parent. Then by maximality of and Lemma A.9 we have
[TABLE]
The set is a covering of , so the conclusion of the lemma follows after summing over all intervals in . ∎
We are now in position to complete the proof of Theorem A.1 by estimating (A.4):
[TABLE]
where is a system of representatives for maximal intervals , in the penultimate step we have used the density hypothesis in the form , and in the last step we have used Lemma A.6 to conclude that the projections there have bounded overlap.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bat 13] Michael Bateman “Single annulus L p superscript 𝐿 𝑝 L^{p} estimates for Hilbert transforms along vector fields” In Rev. Mat. Iberoam. 29.3 , 2013, pp. 1021–1069 DOI: 10.4171/RMI/748 · doi ↗
- 2[Bou 89] J. Bourgain “A remark on the maximal function associated to an analytic vector field” In Analysis at Urbana, Vol. I (Urbana, IL, 1986–1987) 137 , London Math. Soc. Lecture Note Ser. Cambridge Univ. Press, Cambridge, 1989, pp. 111–132
- 3[BT 13] Michael Bateman and Christoph Thiele “ L p superscript 𝐿 𝑝 L^{p} estimates for the Hilbert transforms along a one-variable vector field” In Anal. PDE 6.7 , 2013, pp. 1577–1600 DOI: 10.2140/apde.2013.6.1577 · doi ↗
- 4[CF 75] A. Cordoba and R. Fefferman “A geometric proof of the strong maximal theorem” In Ann. of Math. (2) 102.1 , 1975, pp. 95–100
- 5[CWW 85] S.-Y.. Chang, J.. Wilson and T.. Wolff “Some weighted norm inequalities concerning the Schrödinger operators” In Comment. Math. Helv. 60.2 , 1985, pp. 217–246 DOI: 10.1007/BF 02567411 · doi ↗
- 6[DD 14] Ciprian Demeter and Francesco Di Plinio “Logarithmic L p superscript 𝐿 𝑝 L^{p} bounds for maximal directional singular integrals in the plane” In J. Geom. Anal. 24.1 , 2014, pp. 375–416 DOI: 10.1007/s 12220-012-9340-2 · doi ↗
- 7[Dem 10] Ciprian Demeter “Singular integrals along N 𝑁 N directions in ℝ 2 superscript ℝ 2 \mathbb{R}^{2} ” In Proc. Amer. Math. Soc. 138.12 , 2010, pp. 4433–4442 DOI: 10.1090/S 0002-9939-2010-10442-2 · doi ↗
- 8[DS 15] Ciprian Demeter and Prabath Silva “Some new light on a few classical results” In Colloq. Math. 140.1 , 2015, pp. 129–147 DOI: 10.4064/cm 140-1-11 · doi ↗
