Endpoint estimates for the maximal function over prime numbers
Bartosz Trojan

TL;DR
This paper establishes almost sure convergence of ergodic averages over prime numbers for functions in a specific Orlicz space, extending classical results to a new setting involving primes and advanced function spaces.
Contribution
It proves endpoint estimates for the maximal function over primes in the context of ergodic theory, particularly for functions in the Orlicz space $L( ext{log} L)^2( ext{log} ext{log} L)$.
Findings
Almost sure convergence of ergodic averages over primes for functions in the specified Orlicz space.
Extension of classical ergodic theorems to prime number averages with endpoint estimates.
New techniques for handling maximal functions over primes in ergodic theory.
Abstract
Given an ergodic dynamical system , we prove that for each function belonging to the Orlicz space , the ergodic averages \[ \frac{1}{\pi(N)} \sum_{p \in \mathbb{P}_N} f\big(T^p x\big), \] converge for -almost all , where is the set of prime numbers not larger that and .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Endpoint estimates for the maximal function over prime numbers
Bartosz Trojan
Bartosz Trojan
Institute of Mathematics of Polish Academy of Science
ul. Śniadeckich 8
00-656 Warszawa
Poland
Abstract.
Given an ergodic dynamical system , we prove that for each function belonging to the Orlicz space , the ergodic averages
[TABLE]
converge for -almost all , where is the set of prime numbers not larger that and .
Key words and phrases:
weak maximal ergodic inequality, Orlicz space, prime numbers, pointwise convergence
2010 Mathematics Subject Classification:
Primary: 37A45. Secondary: 46E30, 42B25.
1. Introduction
Let be an ergodic dynamical system, that is is a probability space with a measurable and measure preserving transformation . The classical Birkhoff theorem [2] states that for any function from with , the ergodic averages
[TABLE]
converge for -almost all . This classical result, among others, motivates studying ergodic averages over subsequences of integers. In this article we are interested in pointwise convergence of the following averages,
[TABLE]
where is the set of prime numbers not larger than and . The problem of ergodic averages along prime numbers was initially studied by Bourgain in [4] where the case of functions belonging to has been covered. It was extended by Wierdl in [22] to all , for , see also [6, Section 9]. However, the endpoint , was left open for more than twenty years. Following the method developed in [7] by Buczolich and Mauldin, LaVictoire in [13] has shown that for each ergodic dynamical system there exists such that the sequence diverges on a set of positive measure.
The purpose of this article is to find an Orlicz space close to where the almost everywhere convergence holds. We show the following theorem (see Theorem 7.4).
Theorem A**.**
For each , the limit
[TABLE]
exists for -almost all .
In light of the pointwise convergence obtained by Bourgain in [5], see also [16], to prove Theorem A it suffices to show the weak maximal ergodic inequality for functions in Orlicz space . This inequality is deduce from the following restricted weak Orlicz estimate.
Theorem B**.**
There is such that for any subset ,
[TABLE]
for all .
By appealing to the Calderón transference principle, see [8], Theorem B is deduced from the corresponding result for integers with the counting measure and the shift operator. To be more precise, for a function , we define
[TABLE]
Our main result is following theorem (see Theorem 6.3).
Theorem C**.**
There is such that for any subset of a finite cardinality
[TABLE]
for all .
Theorem C together with estimates are sufficiently strong to imply the maximal inequality for all spaces, for , giving an alternative proof of the Wierld’s theorem [22].
Let us now give some details about the proof of Theorem C. Without loss of generality, we may restrict the supremum to dyadic numbers. It is more convenient to work with weighted averages instead of where
[TABLE]
and
[TABLE]
Given , for each , we decompose the operator into two parts and , in such a way that the maximal function associated with has norm , whereas the one corresponding to has norm \lesssim\exp\big{(}-c\sqrt{t}\big{)}\|f\|_{\ell^{2}}. When applied to the distribution function \big{|}\big{\{}\sup_{n\in\mathbb{N}}\mathcal{M}_{2^{n}}({\mathds{1}_{{F}}})>\lambda\big{\}}\big{|}, we can optimize both estimates by taking . This idea originated to Ch. Fefferman [9], see also Bourgain [3]. Ionescu introduced this technique in a related discrete context, see [11]. The decomposition of uses the circle method of Hardy and Littlewood. However, to achieve the exponential decay of the error term, due to the Page’s theorem, the approximating multiplier has to contain the second term of the asymptotic as well. Thus, the possible existence of the Siegel zero entails that in the neighborhood of the rational point the approximating multiplier depends on the rational number . We refer to Sections 3 and 5 for details. Thanks to the log-convexity of , the weak type estimates are reduced to showing
[TABLE]
for with . At this stage we exploit the behavior of the Gauss sums described in Theorem 2.1.
Let us emphasize that under the Generalized Riemann Hypothesis we can obtain in Proposition 3.1, and consequently in Theorem 3.2, a better error estimate. However, it is not clear whether one can prove Theorem 6.1 with the bounds proportional to .
The paper is organized as follows. In Section 2, we collect necessary facts about Dirichlet characters and the zero-free region. Then we evaluate the Gauss sum that appears in the approximating multiplier (Theorem 2.1). Section 3 is devoted to construction of the approximating multipliers. In Sections 5 and 6, we show and the weak type estimates, respectively. In Section 7, we give two applications of Theorem C. Namely, we show how to deduce the maximal ergodic inequality for functions from , (Theorem 7.1). Next we apply the transference principle (Proposition 7.3) and show almost everywhere convergence of the ergodic averages for , (Theorem 7.4).
Notation
Throughout the whole article, we write () if there is an absolute constant such that , (). Moreover, stands for a large positive constant which value may vary from occurrence to occurrence. If and hold simultaneously then we write . The set of positive integers and the set of prime numbers are denoted by and , respectively. For , we set . Let .
2. Gauss sums
We start by recalling some basic facts from number theory. A general reference here is the book [17].
A homomorphism
[TABLE]
is called a Dirichlet character modulo . The simplest example, called the principal character modulo , is defined as
[TABLE]
A character modulo is primitive, if is the least integer , such that for all and . For each character there is the unique primitive character modulo for some , such that
[TABLE]
The character is quadratic if it takes only values with at least one . Recall that, if is a primitive quadratic character with modulus , then
- •
, and is square-free, or
- •
, , and is square-free.
Given a Dirichlet character and with , we define the Dirichlet -function by the formula
[TABLE]
In fact, extends to the analytic function in . There is an absolute constant , such that if is a Dirichlet character modulo , then the region
[TABLE]
contains at most one zero of , which we denote by . The zero is real and the corresponding character is quadratic. The character having zero in (1) is called exceptional. Since implies that , we may assume that .
The Gauss sum of a Dirichlet character modulo is defined as
[TABLE]
where A_{q}=\big{\{}1\leq a\leq q:\gcd(a,q)=1\big{\}}, and . Let us recall that for each there is such that
[TABLE]
We set
[TABLE]
Let us denote by the Möbious function, which is defined for , where are distinct primes, as
[TABLE]
and . The following theorem plays the crucial role in Section 6.
Theorem 2.1**.**
Let be a quadratic Dirichlet character modulo induced by having the conductor . For , we set . Then
[TABLE]
provided that is square-free, and . Otherwise the sum equals zero.
Proof.
By [17, Theorem 9.12], if then
[TABLE]
otherwise the sum equals zero. In particular, for , we have
[TABLE]
Hence, entails that is square-free and . Next, using (4) and (3) we get
[TABLE]
Because , we have . Hence,
[TABLE]
Finally, since is square-free, and , we deduce that . Therefore,
[TABLE]
which together with (5) completes the proof. ∎
Let us observe that the identity (4) together with (2) imply that
[TABLE]
for any . Moreover, entails that is square-free or and is square-free.
3. Approximating multipliers
Let us denote by the averaging operator over prime numbers, that is for a function we have
[TABLE]
where and . Since sums over primes are very irregular, it is more convenient to work with
[TABLE]
where
[TABLE]
By the partial summation, we easily see that
[TABLE]
thus
[TABLE]
To better understand the operators , we use the Hardy–Littlewood circle method. Let denote the Fourier transform on defined for any function as
[TABLE]
If , we set
[TABLE]
To simplify the notation we denote by the inverse Fourier transform on or the inverse Fourier transform on the torus , depending on the context. Let be the Fourier multiplier corresponding to , i.e.,
[TABLE]
Then for a finitely supported function , we have
[TABLE]
For , we set
[TABLE]
To simplify the notation we write for . Let . Recall that
[TABLE]
For , we notice that the operators are not averaging operators. Moreover, by the partial summation and (10), we get
[TABLE]
Hence,
[TABLE]
Moreover,
[TABLE]
thus
[TABLE]
Therefore,
[TABLE]
Given , and , we set
[TABLE]
if there is no exceptional character modulo , and
[TABLE]
when there is an exceptional character modulo and is the corresponding zero.
Proposition 3.1**.**
There is such that if ,
[TABLE]
for some , , and 1\leq Q\leq\exp\big{(}c\sqrt{\log N}\big{)}, then
[TABLE]
Proof.
Observe that for a prime , if and only if . Hence,
[TABLE]
Let . For , we have
[TABLE]
thus
[TABLE]
For , we set
[TABLE]
Then, by the partial summation, we obtain
[TABLE]
Analogously, for any , we can write
[TABLE]
By the Page’s theorem, there is an absolute constant such that for each , 1\leq q\leq\exp\big{(}c\sqrt{\log x}\big{)}, and ,
[TABLE]
if there is no exceptional character modulo , and
[TABLE]
when there is an exceptional character modulo , and is the concomitant zero. Therefore, by (15) and (16), we obtain
[TABLE]
which is bounded by NQ\exp\big{(}-c\sqrt{\log N}\big{)}. Finally, by the prime number theorem
[TABLE]
and the proposition follows. ∎
Next, we select , a smooth function such that , and
[TABLE]
We may assume that is a convolution of two smooth functions with supports contained in \big{(}-\tfrac{1}{2},\tfrac{1}{2}\big{)}. For , we set
[TABLE]
We define a family of approximating multipliers, by the formula
[TABLE]
where
[TABLE]
and . We set .
Theorem 3.2**.**
There are such that for all and ,
[TABLE]
where is defined by (8).
Proof.
Let
[TABLE]
where the constant is determined in Proposition 3.1. By the Dirichlet’s principle, there are coprime integers and , satisfying , and such that
[TABLE]
Let us first consider the case when . We select satisfying
[TABLE]
For and , with , we have
[TABLE]
[TABLE]
which implies that
[TABLE]
For , by (6) we obtain
[TABLE]
If is square-free or and is square-free then there is such that , thus
[TABLE]
By Proposition 3.1,
[TABLE]
Since , whenever
[TABLE]
we obtain
[TABLE]
Finally, if and are not square-free then by Proposition 3.1,
[TABLE]
It remains to deal with . By the Vinogradov’s inequality (see [21, Theorem 1, Chapter IX] or [18, Theorem 8.5]), we get
[TABLE]
Next, we show that
[TABLE]
Select such that
[TABLE]
For , if , then , and hence
[TABLE]
[TABLE]
which entails that
[TABLE]
If , then by (6), we get
[TABLE]
hence by (18),
[TABLE]
and the theorem follows. ∎
4. Equidistribution of weak norms
In this section we prove that the maximal function associated with kernels has weak -norm equidistributed in residue classes. Before embarking on the proof, let us recall two lemmas essential for the argument.
Lemma 4.1**.**
[14*, Lemma 1]**
There is such that for all and ,*
[TABLE]
Lemma 4.2**.**
[14*, Lemma 2]**
For all , any with , , and any finitely supported function ,*
[TABLE]
The following theorem is the main result of this section.
Theorem 4.3**.**
There is such that for any with , , , and any finitely supported function ,
[TABLE]
Proof.
Observe that, by the mean value theorem, for ,
[TABLE]
thus
[TABLE]
In particular, by the Hardy–Littlewood maximal theorem, there is such that for all , and any ,
[TABLE]
For and , we set
[TABLE]
Then, by (19), we have
[TABLE]
Moreover, for any , we have
[TABLE]
Since , by Young’s convolution inequality and Lemma 4.1, we obtain
[TABLE]
Thus
[TABLE]
which together with (20) imply that
[TABLE]
where the last inequality is a consequence of . Therefore, in view of Lemma 4.2, we immediately get
[TABLE]
which is the desired conclusion. ∎
Essentially the same reasoning as in the proof of Theorem 4.3 leads to the following theorem.
Theorem 4.4**.**
There is such that for all with , , , and any finitely supported function ,
[TABLE]
5. theory
We are now in the position to prove boundedness of the maximal function associated to the multipliers .
Theorem 5.1**.**
For each there is such that for all , and any finitely supported function ,
[TABLE]
Proof.
We divide the supremum into two parts: and . Then the following holds true.
Claim 5.2**.**
For each there is such that for all , and any finitely supported function ,
[TABLE]
For the proof, we apply [15, Lemma 1] to write
[TABLE]
Let us fix . Then by the Plancherel’s theorem we get
[TABLE]
where I_{j}^{i}=\big{\{}j2^{i}+1,j2^{i}+2,\ldots,(j+1)2^{i}\big{\}}. By (6), we obtain
[TABLE]
where \Delta^{q}_{m}=\big{|}\widehat{M_{2^{m}}}-\widehat{M_{2^{m-1}}}\big{|}+\big{|}\widehat{M^{\beta_{q}}_{2^{m}}}-\widehat{M^{\beta_{q}}_{2^{m-1}}}\big{|}. In view of (12), we have
[TABLE]
uniformly with respect to , , and . Since supports of are disjoint while varies over , we obtain
[TABLE]
which together with (22) imply (21).
It remains now to treat supremum over . For each we set
[TABLE]
and . In view of the Landau’s theorem [17, Corollary 11.9], there are distinct ’s. Therefore, it suffices to show the following claim.
Claim 5.3**.**
For each there is such that for all , , any finitely supported function ,
[TABLE]
Let us fix . We define
[TABLE]
and
[TABLE]
Observe that the functions and are periodic where
[TABLE]
By the Plancherel’s theorem, for , we have
[TABLE]
because by (11),
[TABLE]
Therefore, by the triangle inequality
[TABLE]
Since contains at most rational numbers, by the Cauchy–Schwarz inequality we get
[TABLE]
Observe that
[TABLE]
thus
[TABLE]
Hence,
[TABLE]
Now, by multiple change of variables and periodicity we get
[TABLE]
Using Theorem 4.4, we can estimate
[TABLE]
Notice that
[TABLE]
Since supports of are disjoint while varies over , by (6) we get
[TABLE]
Therefore,
[TABLE]
which together with (24) imply (23) and the theorem follows. ∎
Given and , we define the multiplier
[TABLE]
Corollary 5.4**.**
There are such that for each , and any finitely supported function ,
[TABLE]
Proof.
Since
[TABLE]
our assertion follows from Theorem 3.2 and Theorem 5.1. Indeed, by the Plancherel’s theorem and Theorem 3.2 we get
[TABLE]
On the other hand, by Theorem 5.1,
[TABLE]
which concludes the proof. ∎
6. Weak type estimates
In this section we investigate the weak type estimates for the multipliers \big{(}\Pi_{n}^{t}:n\geq t\big{)}. Then together with results from Section 5 we deduce Theorem C.
Theorem 6.1**.**
There is such that for all and any finitely supported function ,
[TABLE]
Proof.
Let us fix for some . Let . Suppose that is a quadratic Dirichlet character modulo induced by having the conductor . We claim that the following holds true.
Claim 6.2**.**
There is such that for any finitely supported function ,
[TABLE]
The constant is independent of , and .
Let us first see that from Claim 6.2, we can deduce the theorem. Indeed, from (25) we easily get
[TABLE]
Recall that (see e.g. [19]),
[TABLE]
thus
[TABLE]
Hence, by log-convexity of , (see [12, 20]) we obtain
[TABLE]
which is bounded by .
What is left now is to prove Claim 6.2. Let . For , we have
[TABLE]
where
[TABLE]
Hence, by Theorem 4.3, we obtain
[TABLE]
Next, by Young’s convolution inequality we get
[TABLE]
and
[TABLE]
Now, by Theorem 2.1, we can compute
[TABLE]
where in the last inequality we have used Lemma 4.2 together with Lemma 4.1. Since (see e.g. [19])
[TABLE]
we conclude that
[TABLE]
proving the claim and the theorem follows. ∎
Theorem 6.3**.**
There is such that for any subset of a finite cardinality and all ,
[TABLE]
Proof.
We start by proving the following statement.
Claim 6.4**.**
There are such that for each , there are two sequences of operators and such that , and for any finitely supported function ,
[TABLE]
and
[TABLE]
Without loss of generality, we may assume that is non-negative finitely supported function on . For , we set
[TABLE]
Since by the prime number theorem,
[TABLE]
we have
[TABLE]
Hence, by the Hardy–Littlewood theorem,
[TABLE]
For , we set
[TABLE]
In view of Corollary 5.4 and Theorem 6.1, we obtain (27) and (26), respectively, and the claim follows.
Now, the theorem is an easy consequence of Claim 6.4. Indeed, given a subset of a finite cardinality, for any , we can write
[TABLE]
Thus, taking
[TABLE]
we get the desired conclusion. ∎
In view of (7), Theorem 6.3 entails the following corollary, which is precisely Theorem C.
Corollary 6.5**.**
There is such that for any subset of a finite cardinality and all ,
[TABLE]
7. Applications
In this section we show two applications of Theorem 6.3 and Corollary 6.5. First, we prove that the restricted weak Orlicz estimates together with strong bounds are sufficient to get maximal inequalities for all . Next, we conclude almost everywhere convergence of ergodic averages for functions in some Orlicz space close to .
7.1. theory
Theorem 7.1**.**
For each there is such that for any function ,
[TABLE]
Proof.
With loss of generality, we may restrict the supremum to dyadic numbers. We claim the following holds true.
Claim 7.2**.**
There is such that for any subset of finite cardinality, and any ,
[TABLE]
Since are averaging operators, we may assume that . Observe that the function
[TABLE]
attains its maximum at
[TABLE]
The maximal value equals , thus
[TABLE]
Hence, by Theorem 6.3, we get
[TABLE]
which is what we claimed.
Next, we notice that by Theorem 3.2 and Theorem 5.1, we have
[TABLE]
Let us consider . Set . Since , the weak is normable (see [10]), thus at the cost of the additional factor of , we get
[TABLE]
for any . Now, by the Marcinkiewicz interpolation theorem, [1, Theorem 11.9], based on (28) and (29) we obtain
[TABLE]
where satisfies
[TABLE]
Since
[TABLE]
the theorem follows. ∎
7.2. Pointwise convergence
Let be a probability space with a measurable and measure preserving transformation . We consider the following averages
[TABLE]
With a help of the Calderón transference principle from [8] applied to Corollary 6.5, we deduce the following proposition.
Proposition 7.3**.**
There is such that for any subset , and all ,
[TABLE]
Proof.
Fix and . For , we define a finite subset of by setting
[TABLE]
Then for , ,
[TABLE]
Hence,
[TABLE]
By Corollary 6.5,
[TABLE]
Since preserves the measure , by integrating with respect to we obtain
[TABLE]
We now divide by and take approaching infinity to get
[TABLE]
Finally, taking tending to infinity by the monotone convergence theorem we conclude the proof. ∎
We are now in the position to show -almost everywhere convergence of the ergodic averages for a function from the Orlicz space . Let us recall that consists of functions such that
[TABLE]
where . The space is a Banach space with the norm
[TABLE]
where is the decreasing rearrangement of , that is
[TABLE]
and
[TABLE]
Theorem 7.4**.**
There is such that for each ,
[TABLE]
In particular, for each ,
[TABLE]
for -almost all .
Proof.
We first prove the following claim.
Claim 7.5**.**
There is such that for each , and any ,
[TABLE]
Indeed, by monotonicity, if , then
[TABLE]
Otherwise, , which entails that
[TABLE]
In view of Proposition 7.3,
[TABLE]
which together with (31) and (32) easily lead to (30).
Now, to show the theorem, let us fix . We set
[TABLE]
and
[TABLE]
Since for , we have
[TABLE]
Moreover, if then for and , we have . Since , we get
[TABLE]
Because the space is log-convex (see [12, 20]), by Claim 7.5, we get
[TABLE]
On the other hand, by (33) we have
[TABLE]
which together with (34) conclude the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Arias de Reyna, Pointwise convergence of Fourier series , Lecture Notes in Mathematics, Springer-Verlag, 2002.
- 2[2] G.D. Birkhoff, Proof of the ergodic theorem , Proc. Natl. Acad. Sci. USA 17 (1931), 656–660.
- 3[3] J. Bourgain, Estimations de certaines fonctions maximales , C. R. Acad. Sci. Paris Sér. I Math. 301 (1985), no. 10, 499–502.
- 4[4] by same author, An approach to pointwise ergodic theorems , Geometric Aspects of Functional Analysis, Springer, 1988, pp. 204–223.
- 5[5] by same author, On the maximal ergodic theorem for certain subsets of the integers , Israel J. Math. 61 (1988), 39–72.
- 6[6] by same author, Pointwise ergodic theorems for arithmetic sets. With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein. , Publ. Math.-Paris 69 (1989), no. 1, 5–45.
- 7[7] Z. Buczolich and R.D. Mauldin, Divergent square averages , Ann. Math. 171 (2010), no. 3, 1479–1530.
- 8[8] A.P. Calerón, Ergodic theory and translatina-invariant operators , Proc. Natl. Acad. Sci. 59 (1968), no. 2, 349–353.
