Endpoint estimates for the maximal function over prime numbers

Bartosz Trojan

arXiv:1907.04753·math.DS·July 11, 2019

TL;DR

This paper proves that ergodic averages along the prime numbers converge almost everywhere for every function in the Orlicz space $L(\log L)^{2}(\log\log L)$, a space close to $L^{1}$, where such convergence is known to fail.

Contribution

It proves endpoint estimates for the maximal function over primes in the context of ergodic theory, particularly for functions in the Orlicz space $L(\log L)^{2}(\log\log L)$.

Findings

01

Almost sure convergence of ergodic averages over primes for functions in the specified Orlicz space.

02

Extension of classical ergodic theorems to prime number averages with endpoint estimates.

03

New techniques for handling maximal functions over primes in ergodic theory.

Abstract

Given an ergodic dynamical system $(X, \mathcal{B}, \mu, T)$, we prove that for each function $f$ belonging to the Orlicz space $L(\log L)^{2}(\log\log L)(X, \mu)$, the ergodic averages \[ \frac{1}{\pi(N)} \sum_{p \in \mathbb{P}_N} f\big(T^p x\big) \] converge for $\mu$-almost all $x \in X$, where $\mathbb{P}_{N}$ is the set of prime numbers not larger than $N$ and $\pi(N) = \#\mathbb{P}_{N}$.

Equations

\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f\big(T^{p}x\big),

\frac{1}{N}\sum_{n=0}^{N-1}f\big(T^{n}x\big)

\mathscr{A}_{N}f(x)=\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f\big(T^{p}x\big)

\lim_{N\to\infty}\mathscr{A}_{N}f(x)

\mu\Big\{x\in X:\sup_{N\in\mathbb{N}}\mathscr{A}_{N}\big(\mathds{1}_{A}\big)(x)>\lambda\Big\}\leq C\lambda^{-1}\log^{2}(e/\lambda)\mu(A)

\mathcal{A}_{N}f(x)=\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p).

\Big|\Big\{x\in\mathbb{Z}:\sup_{N\in\mathbb{N}}\mathcal{A}_{N}\big(\mathds{1}_{F}\big)(x)>\lambda\Big\}\Big|\leq C\lambda^{-1}\log^{2}(e/\lambda)\lvert F\rvert

\mathcal{M}_{N}f(x)=\frac{1}{\vartheta(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p)\log p,

\vartheta(N)=\sum_{p\in\mathbb{P}_{N}}\log p.

\Big|\Big\{x\in\mathbb{Z}:\sup_{t\leq n}\Big|\sum_{a\in A_{q}}\mathcal{F}^{-1}\big(\widehat{L^{a,q}_{2^{n}}}(\cdot-a/q)\eta_{s}(\cdot-a/q)\hat{f}\big)(x)\Big|>\lambda\Big\}\Big|\leq C\frac{1}{\lambda\varphi(q)}\|f\|_{\ell^{1}}

\chi:\big(\mathbb{Z}/q\mathbb{Z}\big)^{\times}\rightarrow\mathbb{C}^{\times},

\mathds{1}_{q}(x)=\begin{cases}1 & \text{if } \gcd(x,q)=1,\\ 0 & \text{otherwise.}\end{cases}

\chi(n)=\begin{cases}\chi^{\star}(n) & \text{if } (n,q)=1,\\ 0 & \text{otherwise.}\end{cases}

L(s,\chi)=\sum_{n\geq 1}\frac{\chi(n)}{n^{s}}.

\Big\{z\in\mathbb{C}:1-\frac{c}{\log q}<\Re z<1\Big\}

G(\chi,n)=\frac{1}{\varphi(q)}\sum_{r\in A_{q}}\chi(r)e^{2\pi irn/q}

\varphi(q)\geq C_{\epsilon}q^{1-\epsilon}.

\tau(\chi)=\varphi(q)G(\chi,1).

\mu(q)=\begin{cases}(-1)^{n} & \text{if } \alpha_{1}=\dots=\alpha_{n}=1,\\ 0 & \text{otherwise,}\end{cases}

\sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q}=\mu(r)q_{0}\frac{\varphi(r)}{\varphi(q)}\chi^{\star}(-x)

\sum_{a\in A_{q}}\chi(a)e^{2\pi iax/q}=\frac{\varphi(q)}{\varphi(q/r)}\chi^{\star}\big(x/r\big)\chi^{\star}\big(q/(rq_{0})\big)\mu\big(q/(rq_{0})\big)\tau(\chi^{\star}),

G(\chi,a)=\frac{\mu(q/q_{0})}{\varphi(q)}\chi^{\star}(a)\chi^{\star}(q/q_{0})\tau(\chi^{\star}).

\sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q}=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(q/q_{0})\chi^{\star}\big(x/r\big)\chi^{\star}\big(q/(rq_{0})\big)\tau(\chi^{\star})^{2}=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(x)\tau(\chi^{\star})^{2}.

\sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q}=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(-x)q_{0}.

\varphi(q/r)\varphi(r)=\varphi(q),

\big|G(\chi,a)\big|\leq\frac{\sqrt{q_{0}}}{\varphi(q)}\leq C_{\epsilon}q^{-\frac{1}{2}+\epsilon}.

Full text

Endpoint estimates for the maximal function over prime numbers

Bartosz Trojan

Institute of Mathematics of Polish Academy of Science

ul. Śniadeckich 8

00-656 Warszawa

Poland

[email protected]

Abstract.

Given an ergodic dynamical system $(X,\mathcal{B},\mu,T)$, we prove that for each function $f$ belonging to the Orlicz space $L(\log L)^{2}(\log\log L)(X,\mu)$, the ergodic averages

\[ \frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f\big(T^{p}x\big) \]

converge for $\mu$-almost all $x\in X$, where $\mathbb{P}_{N}$ is the set of prime numbers not larger than $N$ and $\pi(N)=\#\mathbb{P}_{N}$.

Key words and phrases:

weak maximal ergodic inequality, Orlicz space, prime numbers, pointwise convergence

2010 Mathematics Subject Classification:

Primary: 37A45. Secondary: 46E30, 42B25.

1. Introduction

Let $(X,\mathcal{B},\mu,T)$ be an ergodic dynamical system, that is $(X,\mathcal{B},\mu)$ is a probability space with a measurable and measure preserving transformation $T:X\rightarrow X$ . The classical Birkhoff theorem [2] states that for any function $f$ from $L^{p}(X,\mu)$ with $p\in[1,\infty)$ , the ergodic averages

\[ \frac{1}{N}\sum_{n=0}^{N-1}f\big(T^{n}x\big) \]

converge for $\mu$ -almost all $x\in X$ . This classical result, among others, motivates studying ergodic averages over subsequences of integers. In this article we are interested in pointwise convergence of the following averages,

\[ \mathscr{A}_{N}f(x)=\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f\big(T^{p}x\big), \]

where $\mathbb{P}_{N}$ is the set of prime numbers not larger than $N$ and $\pi(N)=\#\mathbb{P}_{N}$. The problem of ergodic averages along prime numbers was initially studied by Bourgain in [4], where the case of functions belonging to $L^{2}(X,\mu)$ was covered. It was extended by Wierdl in [22] to all $L^{p}(X,\mu)$, for $p>1$, see also [6, Section 9]. However, the endpoint $p=1$ was left open for more than twenty years. Following the method developed in [7] by Buczolich and Mauldin, LaVictoire in [13] has shown that for each ergodic dynamical system there exists $f\in L^{1}(X,\mu)$ such that the sequence $(\mathscr{A}_{N}f:N\in\mathbb{N})$ diverges on a set of positive measure.
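To make the averages $\mathscr{A}_{N}f$ concrete, the following is a minimal Python sketch for one particular system that is not taken from the paper: the rotation $Tx=x+\alpha \pmod 1$ on $[0,1)$ with Lebesgue measure. The names `primes_up_to` and `prime_average`, as well as the choices of `alpha` and `f`, are illustrative assumptions.

```python
import math


def primes_up_to(n):
    """Sieve of Eratosthenes: all primes p <= n, in increasing order."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, math.isqrt(n) + 1):
        if sieve[p]:
            # Cross out p*p, p*p + p, ... (all proper multiples not yet removed).
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [p for p in range(2, n + 1) if sieve[p]]


def prime_average(f, x, alpha, n):
    """A_N f(x) = (1/pi(N)) * sum over primes p <= N of f(T^p x),
    for the assumed rotation T x = x + alpha (mod 1). Requires n >= 2."""
    primes = primes_up_to(n)
    return sum(f((x + p * alpha) % 1.0) for p in primes) / len(primes)


if __name__ == "__main__":
    alpha = math.sqrt(2) - 1                      # an irrational rotation angle (assumption)
    f = lambda y: math.cos(2 * math.pi * y)       # a bounded test function with mean zero
    for n in (10**2, 10**3, 10**4, 10**5):
        print(n, prime_average(f, 0.0, alpha, n))
```

For a continuous $f$ and irrational $\alpha$ one expects the printed values to drift towards $\int f=0$; the difficulty addressed in the paper concerns unbounded $f$ close to $L^{1}$, which a numerical experiment of this kind cannot probe.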

The purpose of this article is to find an Orlicz space close to $L^{1}(X,\mu)$ where the almost everywhere convergence holds. We show the following theorem (see Theorem 7.4).

Theorem A.

For each $f\in L(\log L)^{2}(\log\log L)(X,\mu)$ , the limit

\[ \lim_{N\to\infty}\mathscr{A}_{N}f(x) \]

exists for $\mu$ -almost all $x\in X$ .

In light of the pointwise convergence obtained by Bourgain in [5], see also [16], to prove Theorem A it suffices to show the weak maximal ergodic inequality for functions in the Orlicz space $L(\log L)^{2}(\log\log L)(X,\mu)$. This inequality is deduced from the following restricted weak Orlicz estimate.

Theorem B.

There is $C>0$ such that for any subset $A\subset X$ ,

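\[ \mu\Big\{x\in X:\sup_{N\in\mathbb{N}}\mathscr{A}_{N}\big(\mathds{1}_{A}\big)(x)>\lambda\Big\}\leq C\lambda^{-1}\log^{2}(e/\lambda)\mu(A) \]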

for all $1>\lambda>0$ .

By appealing to the Calderón transference principle, see [8], Theorem B is deduced from the corresponding result for integers $\mathbb{Z}$ with the counting measure and the shift operator. To be more precise, for a function $f:\mathbb{Z}\rightarrow\mathbb{C}$ , we define

\[ \mathcal{A}_{N}f(x)=\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p). \]

Our main result is the following theorem (see Theorem 6.3).

Theorem C.

There is $C>0$ such that for any subset $F\subset\mathbb{Z}$ of a finite cardinality

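\[ \Big|\Big\{x\in\mathbb{Z}:\sup_{N\in\mathbb{N}}\mathcal{A}_{N}\big(\mathds{1}_{F}\big)(x)>\lambda\Big\}\Big|\leq C\lambda^{-1}\log^{2}(e/\lambda)\lvert F\rvert \]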

for all $0<\lambda<1$ .

Theorem C together with $\ell^{2}(\mathbb{Z})$ estimates is sufficiently strong to imply the maximal inequality for all $\ell^{p}(\mathbb{Z})$ spaces, for $p>1$, giving an alternative proof of Wierdl's theorem [22].

Let us now give some details about the proof of Theorem C. Without loss of generality, we may restrict the supremum to dyadic numbers. It is more convenient to work with weighted averages $\mathcal{M}_{N}f$ instead of $\mathcal{A}_{N}f$ where

\[ \mathcal{M}_{N}f(x)=\frac{1}{\vartheta(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p)\log p, \]

and

\[ \vartheta(N)=\sum_{p\in\mathbb{P}_{N}}\log p. \]

Given $t>0$, for each $n\in\mathbb{N}$, we decompose the operator $\mathcal{M}_{2^{n}}$ into two parts $A_{n}^{t}$ and $B_{n}^{t}$, in such a way that the maximal function associated with $A_{n}^{t}$ has $\ell^{1,\infty}(\mathbb{Z})$ norm $\lesssim t\|f\|_{\ell^{1}}$, whereas the one corresponding to $B_{n}^{t}$ has $\ell^{2}(\mathbb{Z})$ norm $\lesssim\exp\big(-c\sqrt{t}\big)\|f\|_{\ell^{2}}$. When applied to the distribution function $\big|\big\{\sup_{n\in\mathbb{N}}\mathcal{M}_{2^{n}}(\mathds{1}_{F})>\lambda\big\}\big|$, we can optimize both estimates by taking $t\simeq\log^{2}(e/\lambda)$. This idea originated with Ch. Fefferman [9], see also Bourgain [3]. Ionescu introduced this technique in a related discrete context, see [11]. The decomposition of $\mathcal{M}_{2^{n}}$ uses the circle method of Hardy and Littlewood. However, to achieve the exponential decay of the error term, due to Page's theorem, the approximating multiplier has to contain the second term of the asymptotic as well. Thus, the possible existence of a Siegel zero entails that in the neighborhood of the rational point $a/q$ the approximating multiplier $\widehat{L^{a,q}_{2^{n}}}(\cdot-a/q)$ depends on the rational number $a/q$. We refer to Sections 3 and 5 for details. Thanks to the log-convexity of $\ell^{1,\infty}(\mathbb{Z})$, the weak type estimates are reduced to showing

\[ \Big|\Big\{x\in\mathbb{Z}:\sup_{t\leq n}\Big|\sum_{a\in A_{q}}\mathcal{F}^{-1}\big(\widehat{L^{a,q}_{2^{n}}}(\cdot-a/q)\eta_{s}(\cdot-a/q)\hat{f}\big)(x)\Big|>\lambda\Big\}\Big|\leq C\frac{1}{\lambda\varphi(q)}\|f\|_{\ell^{1}} \]

for $2^{s}\leq q<2^{s+1}$ with $1\leq s\leq\sqrt{t}$. At this stage we exploit the behavior of the Gauss sums described in Theorem 2.1.

Let us emphasize that under the Generalized Riemann Hypothesis we can obtain in Proposition 3.1, and consequently in Theorem 3.2, a better error estimate. However, it is not clear whether one can prove Theorem 6.1 with the bounds proportional to $\sqrt{t}\|f\|_{\ell^{1}}$ .

The paper is organized as follows. In Section 2, we collect necessary facts about Dirichlet characters and the zero-free region. Then we evaluate the Gauss sum that appears in the approximating multiplier (Theorem 2.1). Section 3 is devoted to the construction of the approximating multipliers. In Section 4, we show that the weak $\ell^{1}$ norms of the relevant maximal functions are equidistributed in residue classes. In Sections 5 and 6, we show $\ell^{2}$ and the weak type estimates, respectively. In Section 7, we give two applications of Theorem C. Namely, we show how to deduce the maximal ergodic inequality for functions from $\ell^{p}(\mathbb{Z})$ (Theorem 7.1). Next we apply the transference principle (Proposition 7.3) and show almost everywhere convergence of the ergodic averages $(\mathscr{A}_{N}f:N\in\mathbb{N})$ for $f\in L(\log L)^{2}(\log\log L)(X,\mu)$ (Theorem 7.4).

Notation

Throughout the whole article, we write $A\lesssim B$ ($A\gtrsim B$) if there is an absolute constant $C>0$ such that $A\leq CB$ ($A\geq CB$). Moreover, $C$ stands for a large positive constant whose value may vary from occurrence to occurrence. If $A\lesssim B$ and $A\gtrsim B$ hold simultaneously then we write $A\simeq B$. The set of positive integers and the set of prime numbers are denoted by $\mathbb{N}$ and $\mathbb{P}$, respectively. For $x>0$, we set $\mathbb{Z}_{x}=[1,x]\cap\mathbb{N}$. Let $\mathbb{N}_{0}=\mathbb{N}\cup\{0\}$.

2. Gauss sums

We start by recalling some basic facts from number theory. A general reference here is the book [17].

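A homomorphism

\[ \chi:\big(\mathbb{Z}/q\mathbb{Z}\big)^{\times}\rightarrow\mathbb{C}^{\times}, \]

is called a Dirichlet character modulo $q$. The simplest example, called the principal character modulo $q$, is defined as

\[ \mathds{1}_{q}(x)=\begin{cases}1 & \text{if } \gcd(x,q)=1,\\ 0 & \text{otherwise.}\end{cases} \]

A character $\chi$ modulo $q$ is primitive if $q$ is the least integer $d$ such that $\chi(m)=\chi(n)$ for all $m\equiv n\pmod{d}$ and $(mn,q)=1$. For each character $\chi$ there is a unique primitive character $\chi^{\star}$ modulo $q_{0}$, for some $q_{0}\mid q$, such that

\[ \chi(n)=\begin{cases}\chi^{\star}(n) & \text{if } (n,q)=1,\\ 0 & \text{otherwise.}\end{cases} \]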

The character is quadratic if it takes only values $\{-1,0,1\}$ with at least one $-1$ . Recall that, if $\chi^{\star}$ is a primitive quadratic character with modulus $q_{0}$ , then

•

$q_{0}\equiv 1\pmod{4}$ , and $q_{0}$ is square-free, or

•

$4\mid q_{0}$ , $q_{0}/4\equiv 2\text{ or }3\pmod{4}$ , and $q_{0}/4$ is square-free.

Given a Dirichlet character $\chi$ and $s\in\mathbb{C}$ with $\Re s>1$, we define the Dirichlet $L$-function by the formula

\[ L(s,\chi)=\sum_{n\geq 1}\frac{\chi(n)}{n^{s}}. \]

In fact, $L(\>\cdot\>,\chi)$ extends to an analytic function in $\{z\in\mathbb{C}:\Re z>0\}$. There is an absolute constant $c>0$ such that if $\chi$ is a Dirichlet character modulo $q$, then the region

\[ \Big\{z\in\mathbb{C}:1-\frac{c}{\log q}<\Re z<1\Big\} \tag{1} \]

contains at most one zero of $L(\>\cdot\>,\chi)$, which we denote by $\beta_{q}$. The zero $\beta_{q}$ is real and the corresponding character is quadratic. A character whose $L$-function has a zero in (1) is called exceptional. Since $L(\beta,\chi)=0$ implies that $L(1-\beta,\chi)=0$, we may assume that $\frac{1}{2}\leq\beta_{q}<1$.

The Gauss sum of a Dirichlet character $\chi$ modulo $q$ is defined as

\[ G(\chi,n)=\frac{1}{\varphi(q)}\sum_{r\in A_{q}}\chi(r)e^{2\pi irn/q} \]

where $A_{q}=\big\{1\leq a\leq q:\gcd(a,q)=1\big\}$, and $\varphi(q)=\#A_{q}$. Let us recall that for each $\epsilon>0$ there is $C_{\epsilon}>0$ such that

\[ \varphi(q)\geq C_{\epsilon}q^{1-\epsilon}. \tag{2} \]

We set

\[ \tau(\chi)=\varphi(q)G(\chi,1). \]

Let us denote by $\mu$ the Möbius function, which is defined for $q=p_{1}^{\alpha_{1}}\dots p_{n}^{\alpha_{n}}$, where $p_{1},\ldots,p_{n}$ are distinct primes, as

\[ \mu(q)=\begin{cases}(-1)^{n} & \text{if } \alpha_{1}=\dots=\alpha_{n}=1,\\ 0 & \text{otherwise,}\end{cases} \]

and $\mu(1)=1$. The following theorem plays a crucial role in Section 6.

Theorem 2.1.

Let $\chi$ be a quadratic Dirichlet character modulo $q$ induced by $\chi^{\star}$ having the conductor $q_{0}$. For $x\in\mathbb{Z}$, we set $r=\gcd(q,x)$. Then

\[ \sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q}=\mu(r)q_{0}\frac{\varphi(r)}{\varphi(q)}\chi^{\star}(-x) \]

provided that $q/q_{0}$ is square-free, $\gcd(q/q_{0},q_{0})=1$ and $r\mid q/q_{0}$. Otherwise the sum equals zero.
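The identity in Theorem 2.1 can be checked numerically in a small case. The sketch below is an illustration only: it assumes the specific choice $q_{0}=5$, $q=15$, with $\chi^{\star}$ the Legendre symbol modulo $5$, and the helper names (`mobius`, `phi`, `legendre`, `gauss`, `lhs`, `rhs`) are hypothetical.

```python
import cmath
from math import gcd


def mobius(n):
    """Moebius function: (-1)^k for a product of k distinct primes, 0 otherwise, mu(1) = 1."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:      # p^2 divides the original n
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result


def phi(n):
    """Euler's totient, phi(n) = #A_n, by direct counting (fine for small n)."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)


def legendre(n, p):
    """Legendre symbol (n/p) for an odd prime p, via Euler's criterion."""
    v = pow(n % p, (p - 1) // 2, p)
    return -1 if v == p - 1 else v


Q0, Q = 5, 15                      # conductor and modulus (assumed example)
A_Q = [a for a in range(1, Q + 1) if gcd(a, Q) == 1]


def chi_star(n):
    return legendre(n, Q0)         # primitive quadratic character modulo Q0


def chi(n):
    return chi_star(n) if gcd(n, Q) == 1 else 0   # character modulo Q induced by chi_star


def e(t):
    return cmath.exp(2j * cmath.pi * t)


def gauss(n):
    """G(chi, n) = (1/phi(q)) * sum_{r in A_q} chi(r) e^{2 pi i r n / q}."""
    return sum(chi(r) * e(r * n / Q) for r in A_Q) / phi(Q)


def lhs(x):
    """Left-hand side of Theorem 2.1."""
    return sum(gauss(a) * e(x * a / Q) for a in A_Q)


def rhs(x):
    """Right-hand side of Theorem 2.1, including the conditions under which it vanishes."""
    r, m = gcd(Q, x), Q // Q0
    if mobius(m) == 0 or gcd(m, Q0) != 1 or m % r != 0:
        return 0
    return mobius(r) * Q0 * phi(r) / phi(Q) * chi_star(-x)


if __name__ == "__main__":
    for x in range(1, 16):
        print(x, round(lhs(x).real, 6), rhs(x))
```

Changing `Q0` and `Q` to other pairs with `Q0` an odd prime gives further test cases; the Legendre symbol helper does not cover conductors divisible by $4$.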

Proof.

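By [17, Theorem 9.12], if $r\mid q/q_{0}$ then

\[ \sum_{a\in A_{q}}\chi(a)e^{2\pi iax/q}=\frac{\varphi(q)}{\varphi(q/r)}\chi^{\star}\big(x/r\big)\chi^{\star}\big(q/(rq_{0})\big)\mu\big(q/(rq_{0})\big)\tau(\chi^{\star}), \tag{3} \]

otherwise the sum equals zero. In particular, for $a\in A_{q}$, we have

\[ G(\chi,a)=\frac{\mu(q/q_{0})}{\varphi(q)}\chi^{\star}(a)\chi^{\star}(q/q_{0})\tau(\chi^{\star}). \tag{4} \]

Hence, $G(\chi,a)\neq 0$ entails that $q/q_{0}$ is square-free and $\gcd(q/q_{0},q_{0})=1$. Next, using (4) and (3) we get

\begin{align*} \sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q} &=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(q/q_{0})\chi^{\star}\big(x/r\big)\chi^{\star}\big(q/(rq_{0})\big)\tau(\chi^{\star})^{2} \\ &=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(x)\tau(\chi^{\star})^{2}. \end{align*}

Because $\lvert\tau(\chi^{\star})\rvert=\sqrt{q_{0}}$, we have $\tau(\chi^{\star})^{2}=q_{0}\chi^{\star}(-1)$. Hence,

\[ \sum_{a\in A_{q}}G(\chi,a)e^{2\pi ixa/q}=\frac{\mu(r)}{\varphi(q/r)}\chi^{\star}(-x)q_{0}. \tag{5} \]

Finally, since $q/q_{0}$ is square-free, $\gcd(q/q_{0},q_{0})=1$ and $r\mid q/q_{0}$, we deduce that $\gcd(q/r,r)=1$. Therefore,

\[ \varphi(q/r)\varphi(r)=\varphi(q), \]

which together with (5) completes the proof. ∎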

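Let us observe that the identity (4) together with (2) implies that

\[ \big|G(\chi,a)\big|\leq\frac{\sqrt{q_{0}}}{\varphi(q)}\leq C_{\epsilon}q^{-\frac{1}{2}+\epsilon} \tag{6} \]

for any $\epsilon>0$. Moreover, $G(\chi,a)\neq 0$ entails that $q$ is square-free, or $4\mid q$ and $q/4$ is square-free.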

3. Approximating multipliers

Let us denote by $\mathcal{A}_{N}$ the averaging operator over prime numbers, that is, for a function $f:\mathbb{Z}\rightarrow\mathbb{C}$ we have

\[ \mathcal{A}_{N}f(x)=\frac{1}{\pi(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p) \]

where $\mathbb{P}_{N}=[1,N]\cap\mathbb{P}$ and $\pi(N)=\#\mathbb{P}_{N}$. Since sums over primes are very irregular, it is more convenient to work with

\[ \mathcal{M}_{N}f(x)=\frac{1}{\vartheta(N)}\sum_{p\in\mathbb{P}_{N}}f(x+p)\log p \]

where

\[ \vartheta(N)=\sum_{p\in\mathbb{P}_{N}}\log p. \]

By partial summation, we easily see that

[TABLE]

thus

[TABLE]
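As a concrete companion to the two operators on $\mathbb{Z}$, here is a minimal Python sketch that evaluates the unweighted average $\mathcal{A}_{N}f(x)$ and the $\log p$-weighted average $\mathcal{M}_{N}f(x)$ directly from their definitions. The function names (`averaging`, `weighted`, `theta`) are illustrative assumptions, and the trial-division prime test is chosen for brevity rather than speed.

```python
import math


def primes_up_to(n):
    """All primes p <= n, by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1)
            if all(p % d for d in range(2, math.isqrt(p) + 1))]


def theta(n):
    """Chebyshev function: sum of log p over primes p <= n."""
    return sum(math.log(p) for p in primes_up_to(n))


def averaging(f, x, n):
    """A_N f(x) = (1/pi(N)) * sum_{p <= N} f(x + p). Requires n >= 2."""
    primes = primes_up_to(n)
    return sum(f(x + p) for p in primes) / len(primes)


def weighted(f, x, n):
    """M_N f(x) = (1/theta(N)) * sum_{p <= N} f(x + p) * log p. Requires n >= 2."""
    return sum(f(x + p) * math.log(p) for p in primes_up_to(n)) / theta(n)


if __name__ == "__main__":
    indicator = lambda y: 1.0 if y == 0 else 0.0   # f = indicator of F = {0}
    # At x = -7 only the prime p = 7 contributes to either sum.
    print(averaging(indicator, -7, 10), weighted(indicator, -7, 10))
    # theta(N)/N should be close to 1 for large N, by the prime number theorem.
    for n in (10**2, 10**3, 10**4):
        print(n, theta(n) / n)
```

Both operators fix constant functions, which is a quick sanity check on the normalizations $\pi(N)$ and $\vartheta(N)$.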

To better understand the operators $\mathcal{M}_{N}$ , we use the Hardy–Littlewood circle method. Let $\mathcal{F}$ denote the Fourier transform on $\mathbb{R}$ defined for any function $f\in L^{1}(\mathbb{R})$ as

[TABLE]

If $f\in\ell^{1}(\mathbb{Z})$ , we set

[TABLE]

To simplify the notation we denote by $\mathcal{F}^{-1}$ the inverse Fourier transform on $\mathbb{R}$ or the inverse Fourier transform on the torus $\mathbb{T}\equiv[0,1)$ , depending on the context. Let $\mathfrak{m}_{N}$ be the Fourier multiplier corresponding to $\mathcal{M}_{N}$ , i.e.,

[TABLE]

Then for a finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ , we have

[TABLE]

For $\frac{1}{2}\leq\beta\leq 1$ , we set

[TABLE]

To simplify the notation we write $M_{N}$ for $M_{N}^{1}$ . Let $M_{0}\equiv 0$ . Recall that

[TABLE]

For $\beta<1$, we notice that the operators $M_{N}^{\beta}$ are not averaging operators. Moreover, by partial summation and (10), we get

[TABLE]

Hence,

[TABLE]

Moreover,

[TABLE]

thus

[TABLE]

Therefore,

[TABLE]

Given $q\in\mathbb{N}$ , and $a\in A_{q}$ , we set

[TABLE]

if there is no exceptional character modulo $q$ , and

[TABLE]

when there is an exceptional character $\chi_{q}$ modulo $q$ and $\beta_{q}$ is the corresponding zero.

Proposition 3.1.

There is $c>0$ such that if $\xi\in\mathbb{T}$ ,

[TABLE]

for some $1\leq q\leq Q$ , $a\in A_{q}$ , and $1\leq Q\leq\exp\big{(}c\sqrt{\log N}\big{)}$ , then

[TABLE]

Proof.

Observe that for a prime $p$ , $p\mid q$ if and only if $(p\bmod q,q)>1$ . Hence,

[TABLE]

Let $\theta=\xi-a/q$ . For $p\equiv r\pmod{q}$ , we have

[TABLE]

thus

[TABLE]

For $x\geq 2$ , we set

[TABLE]

Then, by partial summation, we obtain

[TABLE]

Analogously, for any $\frac{1}{2}\leq\beta\leq 1$ , we can write

[TABLE]

By Page's theorem, there is an absolute constant $c>0$ such that for each $x\geq 2$, $1\leq q\leq\exp\big(c\sqrt{\log x}\big)$, and $r\in A_{q}$,

[TABLE]

if there is no exceptional character modulo $q$ , and

[TABLE]

when there is an exceptional character $\chi$ modulo $q$ , and $\beta$ is the concomitant zero. Therefore, by (15) and (16), we obtain

[TABLE]

which is bounded by $NQ\exp\big{(}-c\sqrt{\log N}\big{)}$ . Finally, by the prime number theorem

[TABLE]

and the proposition follows. ∎
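The arithmetic input of the proof is that primes are nearly evenly spread over the reduced residue classes modulo $q$. The sketch below illustrates this numerically. It assumes a Chebyshev-type sum $\theta(x;q,r)=\sum_{p\leq x,\ p\equiv r\ (\mathrm{mod}\ q)}\log p$, which may differ from the exact quantity used in the proof, and the names `theta` and `class_ratios` are hypothetical.

```python
import math
from math import gcd


def primes_up_to(n):
    """All primes p <= n, by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1)
            if all(p % d for d in range(2, math.isqrt(p) + 1))]


def theta(x, q=1, r=0):
    """Sum of log p over primes p <= x with p congruent to r modulo q.
    With the defaults q = 1, r = 0 this is the full Chebyshev sum."""
    return sum(math.log(p) for p in primes_up_to(x) if (p - r) % q == 0)


def phi(q):
    """Euler's totient by direct counting."""
    return sum(1 for a in range(1, q + 1) if gcd(a, q) == 1)


def class_ratios(x, q):
    """For each reduced residue r mod q, the ratio theta(x; q, r) / (x / phi(q)).
    Values near 1 indicate equidistribution among the classes."""
    scale = x / phi(q)
    return {r: theta(x, q, r) / scale
            for r in range(1, q + 1) if gcd(r, q) == 1}


if __name__ == "__main__":
    for r, ratio in class_ratios(20000, 12).items():
        print(r, round(ratio, 4))
```

Such an experiment only shows the main term $x/\varphi(q)$; the secondary term attached to a possible exceptional zero, which the approximating multiplier must carry, is not visible at this scale.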

Next, we select $\eta:\mathbb{R}\rightarrow\mathbb{R}$ , a smooth function such that $0\leq\eta\leq 1$ , and

[TABLE]

We may assume that $\eta$ is a convolution of two smooth functions with supports contained in $\big{(}-\tfrac{1}{2},\tfrac{1}{2}\big{)}$ . For $s\in\mathbb{N}_{0}$ , we set

[TABLE]

We define a family of approximating multipliers, by the formula

[TABLE]

where

[TABLE]

and $\mathscr{R}_{0}=\{1\}$ . We set $\nu_{n}=\sum_{s\geq 0}\nu_{n}^{s}$ .

Theorem 3.2.

There are $C,c>0$ such that for all $n\in\mathbb{N}_{0}$ and $\xi\in\mathbb{T}$ ,

[TABLE]

where $\mathfrak{m}_{N}$ is defined by (8).

Proof.

Let

[TABLE]

where the constant $c$ is determined in Proposition 3.1. By Dirichlet's principle, there are coprime integers $a$ and $q$, satisfying $1\leq a\leq q\leq 2^{n}Q_{n}^{-1}$, and such that

[TABLE]
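Dirichlet's principle is constructive for small parameters. The following Python sketch finds such a rational approximation by brute force, in the standard form $\lvert\xi-a/q\rvert\leq 1/(qM)$ with $1\leq q\leq M$. The integer bound `m` stands in for $2^{n}Q_{n}^{-1}$, and the function name is an assumption made for illustration.

```python
import math
from fractions import Fraction


def dirichlet_approximation(xi, m):
    """Return coprime integers (a, q) with 1 <= q <= m and |xi - a/q| <= 1/(q*m).

    Existence is guaranteed by the pigeonhole principle; the search simply
    tries each denominator with the nearest numerator."""
    for q in range(1, m + 1):
        a = round(q * xi)
        if abs(q * xi - a) <= 1.0 / m:
            frac = Fraction(a, q)          # reduces a/q to lowest terms
            return frac.numerator, frac.denominator
    raise ArithmeticError("no approximation found (floating-point rounding?)")


if __name__ == "__main__":
    xi = math.sqrt(2) - 1
    for m in (10, 100, 1000):
        a, q = dirichlet_approximation(xi, m)
        print(m, f"{a}/{q}", abs(xi - a / q))
```

On the torus the numerator $a=0$ returned for $\xi$ near $0$ plays the role of $a=q=1$ in the convention $1\leq a\leq q$ used above.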

Let us first consider the case when $1\leq q\leq Q_{n}$ . We select $s_{1}\in\mathbb{N}_{0}$ satisfying

[TABLE]

For $s\leq s_{1}$ and $a^{\prime}/q^{\prime}\in\mathscr{R}_{s}$ , with $a^{\prime}/q^{\prime}\neq a/q$ , we have

[TABLE]

Therefore, by (6) and (11),

[TABLE]

which implies that

[TABLE]

For $s>s_{1}$ , by (6) we obtain

[TABLE]

If $q$ is square-free or $4\mid q$ and $q/4$ is square-free then there is $s_{0}\in\mathbb{N}_{0}$ such that $a/q\in\mathscr{R}_{s_{0}}$ , thus

[TABLE]

By Proposition 3.1,

[TABLE]

Since $1-\eta_{s_{0}}(\xi-a/q)>0$ , whenever

[TABLE]

we obtain

[TABLE]

Finally, if $q$ and $q/4$ are not square-free then by Proposition 3.1,

[TABLE]

It remains to deal with $Q_{n}\leq q\leq 2^{n}Q_{n}^{-1}$. By Vinogradov's inequality (see [21, Theorem 1, Chapter IX] or [18, Theorem 8.5]), we get

[TABLE]

Next, we show that

[TABLE]

Select $s_{2}\in\mathbb{N}_{0}$ such that

[TABLE]

For $s\leq s_{2}$ , if $a^{\prime}/q^{\prime}\in\mathscr{R}_{s}$ , then $1\leq q^{\prime}\leq Q_{n}^{\frac{1}{2}}$ , and hence

[TABLE]

Therefore, by (6) and (11),

[TABLE]

which entails that

[TABLE]

If $s>s_{2}$ , then by (6), we get

[TABLE]

hence by (18),

[TABLE]

and the theorem follows. ∎

4. Equidistribution of weak $\ell^{1}$ norms

In this section we prove that the maximal function associated with kernels $(M^{\beta}_{2^{n}}:n\in\mathbb{N}_{0})$ has weak $\ell^{1}(\mathbb{Z})$ -norm equidistributed in residue classes. Before embarking on the proof, let us recall two lemmas essential for the argument.

Lemma 4.1.

[14, Lemma 1]

There is $C>0$ such that for all $s\in\mathbb{N}$ and $u\in\mathbb{R}$,

[TABLE]

Lemma 4.2.

[14, Lemma 2]

For all $p\geq 1$, any $1\leq Q\leq 2^{2s}$ with $s\in\mathbb{N}$, $r\in\{1,\ldots,Q\}$, and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$,

[TABLE]

The following theorem is the main result of this section.

Theorem 4.3.

There is $C>0$ such that for any $1\leq Q\leq 2^{2s}$ with $s\in\mathbb{N}$ , $r\in\{1,\ldots,Q\}$ , $\frac{1}{2}\leq\beta\leq 1$ , and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

Proof.

Observe that, by the mean value theorem, for $x\in\mathbb{N}$ ,

[TABLE]

thus

[TABLE]

In particular, by the Hardy–Littlewood maximal theorem, there is $C>0$ such that for all $\frac{1}{2}\leq\beta\leq 1$ , and any $f\in\ell^{1}(\mathbb{Z})$ ,

[TABLE]

For $r\in\{1,\ldots,Q\}$ and $\lambda>0$ , we set

[TABLE]

Then, by (19), we have

[TABLE]

Moreover, for any $r,r^{\prime}\in\{1,\ldots,Q\}$ , we have

[TABLE]

Since $\eta_{s}=\eta_{s}\eta_{s-1}$ , by Young’s convolution inequality and Lemma 4.1, we obtain

[TABLE]

Thus

[TABLE]

which together with (20) implies that

[TABLE]

where the last inequality is a consequence of $1\leq Q\leq 2^{2s}$ . Therefore, in view of Lemma 4.2, we immediately get

[TABLE]

which is the desired conclusion. ∎

Essentially the same reasoning as in the proof of Theorem 4.3 leads to the following theorem.

Theorem 4.4.

There is $C>0$ such that for all $1\leq Q\leq 2^{2s}$ with $s\in\mathbb{N}$ , $r\in\{1,\ldots,Q\}$ , $\frac{1}{2}\leq\beta\leq 1$ , and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

5. $\ell^{2}$ theory

We are now in a position to prove $\ell^{2}(\mathbb{Z})$ boundedness of the maximal function associated to the multipliers $(\nu_{n}^{s}:n\in\mathbb{N})$.

Theorem 5.1.

For each $\epsilon>0$ there is $C>0$ such that for all $s\in\mathbb{N}_{0}$ , and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

Proof.

We divide the supremum into two parts: $0\leq n<2^{s+4}$ and $2^{s+4}\leq n$ . Then the following holds true.

Claim 5.2.

For each $\epsilon>0$ there is $C>0$ such that for all $s\in\mathbb{N}_{0}$ , and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

For the proof, we apply [15, Lemma 1] to write

[TABLE]

Let us fix $i\in\{0,\ldots,s\}$. Then by Plancherel's theorem we get

[TABLE]

where $I_{j}^{i}=\big{\{}j2^{i}+1,j2^{i}+2,\ldots,(j+1)2^{i}\big{\}}$ . By (6), we obtain

[TABLE]

where $\Delta^{q}_{m}=\big{|}\widehat{M_{2^{m}}}-\widehat{M_{2^{m-1}}}\big{|}+\big{|}\widehat{M^{\beta_{q}}_{2^{m}}}-\widehat{M^{\beta_{q}}_{2^{m-1}}}\big{|}$ . In view of (12), we have

[TABLE]

uniformly with respect to $\xi\in\mathbb{T}$ , $q\in\mathbb{N}$ , and $\frac{1}{2}\leq\beta_{q}\leq 1$ . Since supports of $\eta_{s}(\cdot-a/q)$ are disjoint while $a/q$ varies over $\mathscr{R}_{s}$ , we obtain

[TABLE]

which together with (22) implies (21).

It remains now to treat the supremum over $n\geq 2^{s+4}$. For each $\frac{1}{2}\leq\beta<1$ we set

[TABLE]

and $\mathscr{R}_{s}^{1}=\mathscr{R}_{s}$. In view of Landau's theorem [17, Corollary 11.9], there are $\mathcal{O}(\log s)$ distinct $\beta$'s. Therefore, it suffices to show the following claim.

Claim 5.3.

For each $\epsilon>0$ there is $C>0$ such that for all $s\in\mathbb{N}_{0}$ , $\frac{1}{2}\leq\beta\leq 1$ , any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

Let us fix $\frac{1}{2}\leq\beta\leq 1$ . We define

[TABLE]

and

[TABLE]

Observe that the functions $x\mapsto I(x,y)$ and $x\mapsto J(x,y)$ are $Q_{s}$ periodic where

[TABLE]

By Plancherel's theorem, for $u\in\mathbb{Z}_{Q_{s}}$, we have

[TABLE]

because by (11),

[TABLE]

Therefore, by the triangle inequality

[TABLE]

Since $\mathscr{R}_{s}$ contains at most $2^{2(s+1)}$ rational numbers, by the Cauchy–Schwarz inequality we get

[TABLE]

Observe that

[TABLE]

thus

[TABLE]

Hence,

[TABLE]

Now, by repeated changes of variables and periodicity we get

[TABLE]

Using Theorem 4.4, we can estimate

[TABLE]

Notice that

[TABLE]

Since supports of $\eta_{s}(\cdot-a/q)$ are disjoint while $a/q$ varies over $\mathscr{R}_{s}$ , by (6) we get

[TABLE]

Therefore,

[TABLE]

which together with (24) implies (23) and the theorem follows. ∎

Given $t>0$ and $n>t$ , we define the multiplier

[TABLE]

Corollary 5.4.

There are $C,c>0$ such that for each $t>0$, and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$,

[TABLE]

Proof.

Since

[TABLE]

our assertion follows from Theorem 3.2 and Theorem 5.1. Indeed, by Plancherel's theorem and Theorem 3.2 we get

[TABLE]

On the other hand, by Theorem 5.1,

[TABLE]

which concludes the proof. ∎

6. Weak type estimates

In this section we investigate the weak type estimates for the multipliers $\big{(}\Pi_{n}^{t}:n\geq t\big{)}$ . Then together with results from Section 5 we deduce Theorem C.

Theorem 6.1.

There is $C>0$ such that for all $t>0$ and any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

Proof.

Let us fix $2^{s}\leq q<2^{s+1}$ for some $1\leq s\leq\sqrt{t}$ . Let $\frac{1}{2}\leq\beta\leq 1$ . Suppose that $\chi$ is a quadratic Dirichlet character modulo $q$ induced by $\chi^{\star}$ having the conductor $q_{0}$ . We claim that the following holds true.

Claim 6.2.

There is $C>0$ such that for any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

The constant $C$ is independent of $q$ , $\beta$ and $\chi$ .

Let us first see that from Claim 6.2, we can deduce the theorem. Indeed, from (25) we easily get

[TABLE]

Recall that (see e.g. [19]),

[TABLE]

thus

[TABLE]

Hence, by log-convexity of $\ell^{1,\infty}(\mathbb{Z})$ , (see [12, 20]) we obtain

[TABLE]

which is bounded by $C\lambda^{-1}t\|f\|_{\ell^{1}}$ .

What is left now is to prove Claim 6.2. Let $r\in\{1,\ldots,q\}$ . For $x\equiv r\bmod q$ , we have

[TABLE]

where

[TABLE]

Hence, by Theorem 4.3, we obtain

[TABLE]

Next, by Young’s convolution inequality we get

[TABLE]

and

[TABLE]

Now, by Theorem 2.1, we can compute

[TABLE]

where in the last inequality we have used Lemma 4.2 together with Lemma 4.1. Since (see e.g. [19])

[TABLE]

we conclude that

[TABLE]

proving the claim and the theorem follows. ∎

Theorem 6.3.

There is $C>0$ such that for any subset $F\subset\mathbb{Z}$ of a finite cardinality and all $0<\lambda<1$ ,

[TABLE]

Proof.

We start by proving the following statement.

Claim 6.4.

There are $C,c>0$ such that for each $t>0$ , there are two sequences of operators $(A_{n}^{t}:n\in\mathbb{N})$ and $(B_{n}^{t}:n\in\mathbb{N})$ such that $\mathcal{M}_{2^{n}}=A_{n}^{t}+B_{n}^{t}$ , and for any finitely supported function $f:\mathbb{Z}\rightarrow\mathbb{C}$ ,

[TABLE]

and

[TABLE]

Without loss of generality, we may assume that $f$ is a non-negative finitely supported function on $\mathbb{Z}$. For $1\leq n<t$, we set

[TABLE]

Since by the prime number theorem,

[TABLE]

we have

[TABLE]

Hence, by the Hardy–Littlewood theorem,

[TABLE]

For $t\leq n$ , we set

[TABLE]

In view of Corollary 5.4 and Theorem 6.1, we obtain (27) and (26), respectively, and the claim follows.

Now, the theorem is an easy consequence of Claim 6.4. Indeed, given a subset $F\subset\mathbb{Z}$ of a finite cardinality, for any $t>0$ , we can write

[TABLE]

Thus, taking

[TABLE]

we get the desired conclusion. ∎
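The choice of $t$ can be made explicit. The following is a sketch of the optimization only, assuming that Claim 6.4 provides the two bounds described in the introduction, namely a weak type bound $\lesssim t\|f\|_{\ell^{1}}$ for $(A_{n}^{t})$ and a strong bound $\lesssim\exp(-c\sqrt{t})\|f\|_{\ell^{2}}$ for $(B_{n}^{t})$; the constants $C$, $c$, $C'$ below are generic and not those of the paper.

```latex
% Split the level set, then apply the weak (1,1) bound to A and Chebyshev's
% inequality together with the l^2 bound to B, with f = 1_F,
% so that \|f\|_{\ell^1} = \|f\|_{\ell^2}^2 = |F|.
\begin{align*}
\Big|\Big\{\sup_{n}\mathcal{M}_{2^{n}}(\mathds{1}_{F})>\lambda\Big\}\Big|
&\leq \Big|\Big\{\sup_{n}\big|A_{n}^{t}\mathds{1}_{F}\big|>\tfrac{\lambda}{2}\Big\}\Big|
 +\Big|\Big\{\sup_{n}\big|B_{n}^{t}\mathds{1}_{F}\big|>\tfrac{\lambda}{2}\Big\}\Big| \\
&\leq 2C\,t\,\lambda^{-1}\lvert F\rvert
 +4C^{2}\lambda^{-2}e^{-2c\sqrt{t}}\lvert F\rvert .
\end{align*}
% Choose \sqrt{t} = (2c)^{-1}\log(e/\lambda), so that e^{-2c\sqrt{t}} = \lambda/e:
\begin{align*}
2C\,t\,\lambda^{-1}\lvert F\rvert+4C^{2}\lambda^{-2}e^{-2c\sqrt{t}}\lvert F\rvert
&=\frac{C}{2c^{2}}\lambda^{-1}\log^{2}(e/\lambda)\lvert F\rvert
 +\frac{4C^{2}}{e}\lambda^{-1}\lvert F\rvert \\
&\leq C'\lambda^{-1}\log^{2}(e/\lambda)\lvert F\rvert ,
\end{align*}
% where the last step uses \log(e/\lambda) \geq 1 for 0 < \lambda < 1.
```

The exponential gain on the $\ell^{2}$ side is what allows $t$ to grow only like $\log^{2}(e/\lambda)$; a polynomial gain would force a power of $\lambda^{-1}$ instead.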

In view of (7), Theorem 6.3 entails the following corollary, which is precisely Theorem C.

Corollary 6.5.

There is $C>0$ such that for any subset $F\subset\mathbb{Z}$ of a finite cardinality and all $0<\lambda<1$ ,

[TABLE]

7. Applications

In this section we show two applications of Theorem 6.3 and Corollary 6.5. First, we prove that the restricted weak Orlicz estimates together with strong $\ell^{2}$ bounds are sufficient to get $\ell^{p}$ maximal inequalities for all $1<p\leq 2$ . Next, we conclude almost everywhere convergence of ergodic averages for functions in some Orlicz space close to $L^{1}$ .

7.1. $\ell^{p}$ theory

Theorem 7.1.

For each $p\in(1,2]$ there is $C>0$ such that for any function $f\in\ell^{p}(\mathbb{Z})$ ,

[TABLE]

Proof.

Without loss of generality, we may restrict the supremum to dyadic numbers. We claim the following holds true.

Claim 7.2.

There is $C>0$ such that for any subset $F\subset\mathbb{Z}$ of finite cardinality, and any $p_{0}\in(1,\infty)$ ,

[TABLE]

Since $\mathcal{M}_{N}$ are averaging operators, we may assume that $0<\lambda<1$ . Observe that the function

[TABLE]

attains its maximum at

[TABLE]

The maximal value equals $4e^{p_{0}-3}(p_{0}-1)^{-2}$ , thus

[TABLE]

Hence, by Theorem 6.3, we get

[TABLE]

which is what we claimed.
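The stated maximal value $4e^{p_{0}-3}(p_{0}-1)^{-2}$ can be sanity-checked numerically. The sketch below assumes that the function in question is $\lambda\mapsto\lambda^{p_{0}-1}\log^{2}(e/\lambda)$, which is consistent with that value but is a reconstruction and not quoted from the text; under this assumption the critical point is $\lambda=\exp\big(1-2/(p_{0}-1)\big)$, which lies in $(0,1)$ when $1<p_{0}<3$. All function names are hypothetical.

```python
import math


def g(lam, p0):
    """Assumed function: lam^(p0 - 1) * log^2(e / lam), for lam > 0."""
    return lam ** (p0 - 1) * math.log(math.e / lam) ** 2


def critical_point(p0):
    """Solution of g'(lam) = 0 with log(e/lam) = 2 / (p0 - 1)."""
    return math.exp(1 - 2 / (p0 - 1))


def max_value(p0):
    """Closed form stated in the text: 4 e^{p0 - 3} / (p0 - 1)^2."""
    return 4 * math.exp(p0 - 3) / (p0 - 1) ** 2


if __name__ == "__main__":
    for p0 in (1.25, 1.5, 2.0, 2.5):
        lam = critical_point(p0)
        # Compare the closed form with a grid search over (0, 1).
        grid_max = max(g(k / 10**5, p0) for k in range(1, 10**5))
        print(p0, lam, max_value(p0), grid_max)
```

The blow-up of the constant like $(p_{0}-1)^{-2}$ as $p_{0}\to 1$ is the price of trading the factor $\log^{2}(e/\lambda)$ for a power of $\lambda$.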

Next, we notice that by Theorem 3.2 and Theorem 5.1, we have

[TABLE]

Let us consider $p\in(1,2)$ . Set $p_{0}=(1+p)/2$ . Since $p_{0}>1$ , the weak $\ell^{p_{0}}(\mathbb{Z})$ is normable (see [10]), thus at the cost of the additional factor of $(p-1)^{-1}$ , we get

[TABLE]

for any $f\in\ell^{p,1}(\mathbb{Z})$ . Now, by the Marcinkiewicz interpolation theorem, [1, Theorem 11.9], based on (28) and (29) we obtain

[TABLE]

where $\theta\in(0,1)$ satisfies

[TABLE]

Since

[TABLE]

the theorem follows. ∎

7.2. Pointwise convergence

Let $(X,\mathcal{B},\mu)$ be a probability space with a measurable and measure preserving transformation $T:X\rightarrow X$ . We consider the following averages

[TABLE]

With the help of the Calderón transference principle from [8] applied to Corollary 6.5, we deduce the following proposition.

Proposition 7.3.

There is $C>0$ such that for any subset $A\in\mathcal{B}$ , and all $0<\lambda<1$ ,

[TABLE]

Proof.

Fix $A\in\mathcal{B}$ and $x\in X$. For $R>L>0$, we define a finite subset $F\subset\mathbb{Z}$ by setting

[TABLE]

Then for $0\leq n\leq R-N$ , $N\leq L$ ,

[TABLE]

Hence,

[TABLE]

By Corollary 6.5,

[TABLE]

Since $T$ preserves the measure $\mu$ , by integrating with respect to $x\in X$ we obtain

[TABLE]

We now divide by $R$ and take $R$ approaching infinity to get

[TABLE]

Finally, taking $L$ tending to infinity by the monotone convergence theorem we conclude the proof. ∎

We are now in a position to show $\mu$-almost everywhere convergence of the ergodic averages $(\mathscr{A}_{N}f:N\in\mathbb{N})$ for a function $f$ from the Orlicz space $L(\log L)^{2}(\log\log L)(X,\mu)$. Let us recall that $L(\log L)^{2}(\log\log L)(X,\mu)$ consists of functions such that

[TABLE]

where $\log^{+}t=\max\{0,\log t\}$ . The space $L(\log L)^{2}(\log\log L)(X,\mu)$ is a Banach space with the norm

[TABLE]

where $f^{*}$ is the decreasing rearrangement of $f$ , that is

[TABLE]

and

[TABLE]

Theorem 7.4.

There is $C>0$ such that for each $f\in L(\log L)^{2}(\log\log L)(X,\mu)$ ,

[TABLE]

In particular, for each $f\in L(\log L)^{2}(\log\log L)(X,\mu)$ ,

[TABLE]

for $\mu$ -almost all $x\in X$ .

Proof.

We first prove the following claim.

Claim 7.5.

There is $C>0$ such that for each $A\in\mathcal{B}$ , and any $0<\lambda<1$ ,

[TABLE]

Indeed, by monotonicity, if $\lambda\geq\mu(A)$ , then

[TABLE]

Otherwise, $\lambda\leq\mu(A)$ , which entails that

[TABLE]

In view of Proposition 7.3,

[TABLE]

which together with (31) and (32) easily lead to (30).

Now, to show the theorem, let us fix $f\in L(\log L)^{2}(\log\log L)(X,\mu)$ . We set

[TABLE]

and

[TABLE]

Since $\lvert{f(x)}\rvert\leq a_{j}$ for $x\in A_{j}$ , we have

[TABLE]

Moreover, if $j>k$ then for $x\in A_{j}$ and $y\in A_{k}$ , we have $\lvert{f(x)}\rvert\geq\lvert{f(y)}\rvert$ . Since $\mu(A_{j})=2^{-j}$ , we get

[TABLE]

Because the space $L^{1,\infty}(X,\mu)$ is log-convex (see [12, 20]), by Claim 7.5, we get

[TABLE]

On the other hand, by (33) we have

[TABLE]

which together with (34) concludes the proof. ∎

Bibliography

[1] J. Arias de Reyna, Pointwise convergence of Fourier series, Lecture Notes in Mathematics, Springer-Verlag, 2002.
[2] G.D. Birkhoff, Proof of the ergodic theorem, Proc. Natl. Acad. Sci. USA 17 (1931), 656–660.
[3] J. Bourgain, Estimations de certaines fonctions maximales, C. R. Acad. Sci. Paris Sér. I Math. 301 (1985), no. 10, 499–502.
[4] J. Bourgain, An approach to pointwise ergodic theorems, Geometric Aspects of Functional Analysis, Springer, 1988, pp. 204–223.
[5] J. Bourgain, On the maximal ergodic theorem for certain subsets of the integers, Israel J. Math. 61 (1988), 39–72.
[6] J. Bourgain, Pointwise ergodic theorems for arithmetic sets. With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein, Publ. Math.-Paris 69 (1989), no. 1, 5–45.
[7] Z. Buczolich and R.D. Mauldin, Divergent square averages, Ann. Math. 171 (2010), no. 3, 1479–1530.
[8] A.P. Calderón, Ergodic theory and translation-invariant operators, Proc. Natl. Acad. Sci. 59 (1968), no. 2, 349–353.
