Good weights for the Erdős discrepancy problem

Nikos Frantzikinakis

arXiv:1903.01881·math.NT·July 16, 2020


TL;DR

This paper extends the Erdős discrepancy problem to weighted variants involving structured and random weights, proving unboundedness of weighted sums of multiplicative functions and employing measure-preserving systems analysis.

Contribution

It introduces weighted versions of the Erdős discrepancy problem, combining structured and random weights, and develops new structural results for measure-preserving systems related to multiplicative functions.

Findings

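01 For suitable weights, weighted sums of non-null bounded multiplicative functions, and of products of shifts of such functions, are unbounded.

02 Structured (zero entropy) weights with irrationality features, such as $e(k^{l}\alpha)$ with $l\geq 2$ and $\alpha$ irrational, lead to unbounded weighted discrepancy.

03 The analysis for structured weights relies on a structural result for measure preserving systems associated with bounded multiplicative functions.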

Abstract

The Erdős discrepancy problem, now a theorem by T. Tao, asks whether every sequence with values plus or minus one has unbounded discrepancy along all homogeneous arithmetic progressions. We establish weighted variants of this problem, for weights given either by structured sequences that enjoy some irrationality features, or certain random sequences. As an intermediate result, we establish unboundedness of weighted sums of bounded multiplicative functions and products of shifts of such functions. A key ingredient in our analysis for the structured weights is a structural result for measure preserving systems naturally associated with bounded multiplicative functions that was recently obtained in joint work with B. Host.

Equations

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\Big|=+\infty.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,w(k)\Big|=+\infty.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,e(k^{l}\alpha)\Big|=+\infty$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,g(k+1)\,e(k^{l}\alpha)\Big|=+\infty,$$

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,a(n+h)\,\overline{a(n)}=0;$$

$$\liminf_{N\to\infty}\mathbb{E}^{\log}_{n\in[N]}\,|a(n)|^{2}>0.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big\|\sum_{k=1}^{n}w(k)\,a(dk)\Big\|_{\mathcal{H}}=+\infty.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dQ(k))\,w(k)\Big|=+\infty.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,\phi(P(k))\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,w(k)\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,w(k)\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,f(k+1)\Big|=+\infty.$$

$$\limsup_{n\to\infty}\Big|\sum_{k=1}^{n}\lambda(k)\,\lambda(k+1)\Big|\geq 5$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,f(k+1)\,e(k^{l}\alpha)\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,\phi(P(k))\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,X_{k}(\omega)\Big|=+\infty.$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\Big|=+\infty.$$

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,f(n+h)\,\overline{f(n)}=0,\quad h\in{\mathbb{N}},$$

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,e(k^{l}\alpha)\Big|=+\infty,$$

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,f(n+h)\,\overline{f(n)}\,e(n^{l}\alpha)=0.$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,b(d(k+1))\,w(k)\Big|=+\infty$$

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}a_{j}(d\,(k+h_{j}))\,w(k)\Big|=+\infty.$$

$$f^{2}(k)={\mathbb{E}}_{d\in\Phi}\,f(d)\,\overline{f^{2}(dk)}\,f(dk^{2})$$

$${\mathbb{E}}_{n\in A}\,a(n):=\frac{1}{|A|}\sum_{n\in A}a(n),\qquad \mathbb{E}^{\log}_{n\in A}\,a(n):=\frac{1}{\sum_{n\in A}\frac{1}{n}}\sum_{n\in A}\frac{a(n)}{n}.$$

$${\mathbb{E}}_{n\in A}\,a(n):=\lim_{N\to\infty}{\mathbb{E}}_{n\in A\cap[N]}\,a(n),\qquad \mathbb{E}^{\log}_{n\in A}\,a(n):=\lim_{N\to\infty}\mathbb{E}^{\log}_{n\in A\cap[N]}\,a(n)$$

$${\mathbb{E}}_{n\in{\mathbf{N}}}\,a(n):=\lim_{l\to\infty}{\mathbb{E}}_{n\in[N_{l}]}\,a(n),\qquad \mathbb{E}^{\log}_{n\in{\mathbf{N}}}\,a(n):=\lim_{l\to\infty}\mathbb{E}^{\log}_{n\in[N_{l}]}\,a(n)$$

$$\lim_{N\to\infty}\frac{1}{|\Phi_{N}|}\,\big|(r^{-1}\Phi_{N})\triangle\Phi_{N}\big|=0$$

$$\Phi_{N}:=\{p_{1}^{k_{1}}\cdots p_{N}^{k_{N}}\colon 0\leq k_{1},\ldots,k_{N}\leq N\},\quad N\in{\mathbb{N}},$$

$${\mathbb{E}}_{n\in\Phi}\,a(n):=\lim_{N\to\infty}{\mathbb{E}}_{n\in\Phi_{N}}\,a(n).$$

$${\mathbb{E}}_{n\in\Phi}\,(a(rn)-a(n))=0.$$


Full text

Keywords: Multiplicative functions, discrepancy, Erdős discrepancy problem, Elliott conjecture, Furstenberg correspondence.

Year 2020, number 8. Received 30 April 2020; published 14 July 2020. DOI: 10.19086/da.13688

Good Weights for the Erdős Discrepancy Problem

Nikos Frantzikinakis (supported by the Hellenic Foundation for Research and Innovation, Project No: 1684)

Abstract

The Erdős discrepancy problem, now a theorem by T. Tao, asks whether every sequence with values plus or minus one has unbounded discrepancy along all homogeneous arithmetic progressions. We establish weighted variants of this problem, for weights given either by structured sequences that enjoy some irrationality features, or certain random sequences. As an intermediate result, we establish that weighted sums of bounded multiplicative functions and products of shifts of such functions are unbounded. A key ingredient in our analysis for the structured weights is a structural result for measure preserving systems naturally associated with bounded multiplicative functions that was recently obtained in joint work with B. Host.

1 Introduction and main results

1.1 Introduction

The Erdős discrepancy problem is an elementary question that dates back to the 1930’s and asks if there is a sequence $a\colon{\mathbb{N}}\to\{-1,1\}$ that is evenly distributed along all homogeneous arithmetic progressions, in the sense that the sequence of partial sums $(\sum_{k=1}^{n}a(dk))_{n\in{\mathbb{N}}}$ is bounded uniformly in $d\in{\mathbb{N}}$ . The problem remained dormant for a long time and it was not until 2010 that interest was rejuvenated, when it became the subject of the Polymath5 project (see [7, 13] for related details). The problem was finally solved in 2015 by T. Tao [15] who proved the following (henceforth, with $\mathbb{S}$ we denote the unit circle and with $\mathbb{U}$ the complex unit disc):

Theorem 1.1 (Tao [15]).

For every sequence $a\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\Big|=+\infty. \tag{1}$$

We seek to obtain weighted variants of the previous result. To facilitate exposition, we introduce the following notion:

Definition 1.2.

We say that a sequence $w\colon{\mathbb{N}}\to\mathbb{U}$ is a good weight for the Erdős discrepancy problem, or simply, a good weight, if for every $a\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,w(k)\Big|=+\infty. \tag{2}$$

Theorem 1.1 implies that $w=1$ (and more generally $w=f$ where $f\colon{\mathbb{N}}\to\mathbb{S}$ is a completely multiplicative function) is a good weight for the Erdős discrepancy problem. On the other hand, sequences with bounded partial sums, like the sequence $(e(k\alpha))_{k\in{\mathbb{N}}}$ , where $\alpha\in{\mathbb{R}}\setminus{\mathbb{Z}}$ and $e(t):=e^{2\pi it}$ , are not good weights, and more generally, a product of a completely multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ with a sequence that has bounded partial sums is not a good weight (take $a=\bar{f}$ ). It is less clear if some other oscillatory sequences like $(e(k^{l}\alpha))_{k\in{\mathbb{N}}}$ , where $l\geq 2$ and $\alpha$ is irrational, or random sequences of $\pm 1$ ’s are good weights. We will show in Corollary 1.5 and Theorem 1.7 that they are; that is, for every $a\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,e(k^{l}\alpha)\Big|=+\infty$$

and a similar statement holds if we use as weights random sequences of $\pm 1$ . Moreover, in Theorem 1.4 we give a rather general criterion that allows us to show that a large class of zero entropy sequences that enjoy certain irrationality features are good weights for the Erdős discrepancy problem.
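The claim above that $(e(k\alpha))_{k\in{\mathbb{N}}}$ has bounded partial sums when $\alpha\in{\mathbb{R}}\setminus{\mathbb{Z}}$ can be checked with a geometric sum. The following short computation is an illustration and is not taken from the paper; it also makes explicit why this weight is not good (take $a=1$).

```latex
% Illustrative check: partial sums of e(k\alpha) for \alpha \notin \mathbb{Z}.
\[
\sum_{k=1}^{n} e(k\alpha) = e(\alpha)\,\frac{e(n\alpha)-1}{e(\alpha)-1},
\qquad\text{so}\qquad
\Big|\sum_{k=1}^{n} e(k\alpha)\Big| \le \frac{2}{|e(\alpha)-1|} = \frac{1}{|\sin(\pi\alpha)|}.
\]
% With a = 1 the weighted sum does not depend on d, hence
\[
\sup_{d,n\in\mathbb{N}}\Big|\sum_{k=1}^{n} a(dk)\,e(k\alpha)\Big|
\le \frac{1}{|\sin(\pi\alpha)|} < +\infty .
\]
```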

As a related result of independent interest, we show that certain weighted sums of multiplicative functions are unbounded. For instance, we prove in Corollary 1.10 that if $l\geq 2$, $\alpha$ is irrational, and $f,g\colon{\mathbb{N}}\to\mathbb{S}$ are multiplicative functions, then

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,g(k+1)\,e(k^{l}\alpha)\Big|=+\infty,$$

and in Theorem 1.11 we prove an analogous result when the weights are given by random sequences of $\pm 1$’s.

1.2 Results related to the weighted Erdős discrepancy problem

The next result gives sufficient conditions for a bounded sequence of complex numbers to be a good weight for the Erdős discrepancy problem. In order to explain the exact assumptions needed, we use ergodic terminology that is explained in Section 3.2, and in Corollary 1.5 we give some explicit examples. See also Section 1.6 for our notation regarding averages; for reasons that are explained in Section 3.2 we use logarithmic averages.

Definition 1.3.

We say that the sequence $a\colon{\mathbb{N}}\to\mathbb{U}$

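• has vanishing self-correlations, if for every $h\in{\mathbb{N}}$ we have

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,a(n+h)\,\overline{a(n)}=0;$$

• is non-null for logarithmic averages, or simply, non-null, if

$$\liminf_{N\to\infty}\mathbb{E}^{\log}_{n\in[N]}\,|a(n)|^{2}>0.$$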

Our main result regarding structured (zero entropy) weights is the following one:

Theorem 1.4.

Suppose that $w\colon{\mathbb{N}}\to\mathbb{U}$ is non-null and totally ergodic, and has zero entropy and vanishing self-correlations. Then $w$ is a good weight for the Erdős discrepancy problem.

Remarks.

• As was the case in [15], if $\mathcal{H}$ is an arbitrary inner product space and $a\colon{\mathbb{N}}\to\mathcal{H}$ is such that $\left\|a(k)\right\|_{\mathcal{H}}=1$ for all $k\in{\mathbb{N}}$, then our argument works without any change and shows that

$$\sup_{d,n\in{\mathbb{N}}}\Big\|\sum_{k=1}^{n}w(k)\,a(dk)\Big\|_{\mathcal{H}}=+\infty.$$

• Using Theorem 1.9 below, it is straightforward to adapt the proof of Theorem 1.4 in order to get the following stronger conclusion: For $Q(k)=\prod_{j=1}^{\ell}(k+h_{j})$, $k\in{\mathbb{N}}$, where $\ell\in{\mathbb{N}}$, $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$, and $w$ is as before, we have for every sequence $a\colon{\mathbb{N}}\to\mathbb{S}$ that

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dQ(k))\,w(k)\Big|=+\infty.$$

But our methods do not allow us to deal with the non-weighted version (where $w=1$) even when $Q(k)=k(k+1)$, $k\in{\mathbb{N}}$.

• The zero entropy assumption cannot be removed. To see this, let $a(k)=f(k)$ and $w(k)=(-1)^{k}\overline{f(k)}$, $k\in{\mathbb{N}}$, where $f\colon{\mathbb{N}}\to\{-1,1\}$ is any multiplicative function that satisfies the Elliott conjecture, in which case $w$ has vanishing self-correlations and is totally ergodic (in fact Bernoulli). Also, the assumption that the self-correlations of $w$ vanish cannot be removed. To see this, let $a=1$ and $w(k)=e(k\alpha)$, $k\in{\mathbb{N}}$, where $\alpha$ is irrational. On the other hand, it is not clear whether the assumption of total ergodicity can be removed.
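To make the last remark concrete, the computation below contrasts the self-correlations of the linear phase $e(k\alpha)$ with those of the quadratic phase $e(k^{2}\alpha)$ for irrational $\alpha$. It is an illustrative sketch rather than part of the paper's argument; the only outside input is the standard fact that $\frac{1}{N}\sum_{n\leq N}e(n\beta)\to 0$ when $\beta\notin{\mathbb{Z}}$, together with the fact that a vanishing Cesàro average forces a vanishing logarithmic average.

```latex
% (i) Linear phase w(k) = e(k\alpha): the correlation is constant in n and never vanishes.
\[
w(n+h)\,\overline{w(n)} = e\big((n+h)\alpha - n\alpha\big) = e(h\alpha),
\qquad
\mathbb{E}^{\log}_{n\in\mathbb{N}}\, w(n+h)\,\overline{w(n)} = e(h\alpha) \neq 0 .
\]
% (ii) Quadratic phase w(k) = e(k^2\alpha): the correlation is a non-trivial linear phase in n.
\[
w(n+h)\,\overline{w(n)} = e\big((n+h)^{2}\alpha - n^{2}\alpha\big)
= e(h^{2}\alpha)\, e(2h\alpha\, n).
\]
% Since \alpha is irrational, 2h\alpha is not an integer, so
\[
\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} e(2h\alpha\, n) = 0
\quad\Longrightarrow\quad
\mathbb{E}^{\log}_{n\in\mathbb{N}}\, w(n+h)\,\overline{w(n)} = 0
\quad\text{for every } h\in\mathbb{N}.
\]
```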

Corollary 1.5.

Let $a\colon{\mathbb{N}}\to\mathbb{S}$ be a sequence, $\phi\colon{\mathbb{T}}\to\mathbb{U}$ be Riemann integrable with $\int\phi=0$ and $\int|\phi|\neq 0$ , and let $P\colon{\mathbb{R}}\to{\mathbb{T}}$ be a polynomial with degree at least $2$ and irrational leading coefficient. Then

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,\phi(P(k))\Big|=+\infty.$$

It follows that for $l\geq 2$ and $\alpha$ irrational, the sequence $(e(k^{l}\alpha))_{k\in{\mathbb{N}}}$ and the sequence that assigns values $-1,0,$ or $1$ according to whether $\{k^{l}\alpha\}$ is in the interval $[0,1/3)$ , $[1/3,2/3)$ , or $[2/3,1)$ , are good weights.
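As a quick sanity check (added here for illustration), one can verify that the two examples just mentioned satisfy the hypotheses of Corollary 1.5 with $P(t)=t^{l}\alpha$, which has degree $l\geq 2$ and irrational leading coefficient $\alpha$.

```latex
% Example 1: \phi(t) = e(t), so that \phi(P(k)) = e(k^{l}\alpha).
\[
\int_{\mathbb{T}} e(t)\,dt = 0,
\qquad
\int_{\mathbb{T}} |e(t)|\,dt = 1 \neq 0 .
\]
% Example 2: the step function taking the values -1, 0, 1 on the three thirds of [0,1),
% so that \phi(P(k)) = \phi(\{k^{l}\alpha\}).
\[
\phi = -\mathbf{1}_{[0,1/3)} + \mathbf{1}_{[2/3,1)} :
\qquad
\int\phi = -\tfrac{1}{3} + 0 + \tfrac{1}{3} = 0,
\qquad
\int|\phi| = \tfrac{1}{3} + \tfrac{1}{3} = \tfrac{2}{3} \neq 0 .
\]
```

Both functions are Riemann integrable and take values in $\mathbb{U}$, so the corollary applies to each of them.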

The proof of Theorem 1.4 has a few interesting features. Unlike the proof of Theorem 1.1 in [15], we are not using explicitly or implicitly results from [10, 11, 14] on averages of multiplicative functions in short intervals, and also we do not carry out a separate analysis in the case where the sequence $(a(k))_{k\in{\mathbb{N}}}$ is a pretentious multiplicative function. To compensate for this, our argument crucially uses the following ergodic result that was proved in [3] using a combination of ergodic theory and number theory tools developed in [2] and [16] (the notions involved are defined in Section 3):

Theorem 1.6 (F., Host [3]).

All Furstenberg systems of a multiplicative function with values on $\mathbb{U}$ are disjoint from all zero entropy totally ergodic systems.

To get a sense of why Theorem 1.6 is useful, we note that it implies (via Proposition 4.1 below) that if $w$ is a totally ergodic sequence with zero entropy and $f\colon{\mathbb{N}}\to\mathbb{U}$ is a multiplicative function, then the self-correlations of the sequence $f\cdot w$ split into a product of the self-correlations of $f$ and the self-correlations of $w$ . Hence, if we assume that $w$ has vanishing self-correlations, then the same holds for $f\cdot w$ , and this property implies Theorem 1.4 (see Proposition 2.7).

Lastly, we give examples of good weights that are given by random sequences. The first result applies to independent symmetric random variables and its proof is rather elementary.

Theorem 1.7.

Let $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ be a sequence of independent random variables with ${\mathbb{P}}(X_{k}=-1)={\mathbb{P}}(X_{k}=1)=\frac{1}{2}$ , $k\in{\mathbb{N}}$ , and $a\colon{\mathbb{N}}\to\mathbb{U}$ be a non-null sequence. Then $\omega$ -almost surely the sequence $(a(k)\,X_{k}(\omega))_{k\in{\mathbb{N}}}$ is a good weight for the Erdős discrepancy problem.

The second result applies to independent random variables that are not necessarily symmetric as long as they take a fixed non-zero complex value not too rarely. Its proof, due to M. Kolountzakis, is simple, but makes essential use of Theorem 1.1 (via the criterion given in Lemma 5.5 below).

Theorem 1.8.

Let $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ be a sequence of independent, complex valued, random variables. Suppose that for some $c\in{\mathbb{C}}\setminus\{0\}$ the sequence $\rho_{k}:={\mathbb{P}}(X_{k}=c)$ , $k\in{\mathbb{N}}$ , is decreasing and satisfies $\sum_{k\in{\mathbb{N}}}\rho_{k}^{l}=+\infty$ for every $l\in{\mathbb{N}}$ . Then $\omega$ -almost surely the sequence $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ is a good weight for the Erdős discrepancy problem.

Remark.

The assumption of monotonicity cannot be removed. To see this, take ${\mathbb{P}}(X_{k}=1)=1$ if $k$ is prime, and ${\mathbb{P}}(X_{k}=0)=1$ for all other $k\in{\mathbb{N}}$ , and let $a\colon{\mathbb{N}}\to\{-1,1\}$ be a completely multiplicative function that is equal to $(-1)^{n}$ on the $n$ -th prime. Then $\omega$ -almost surely we have $\sup_{d,n\in{\mathbb{N}}}\big{|}\sum_{k=1}^{n}a(dk)\,X_{k}(\omega)\big{|}\leq 1$ .
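The bound in this remark follows from a one-line computation, sketched here for convenience. The notation $\pi(n)$ for the number of primes up to $n$ is introduced only for this illustration.

```latex
% Almost surely X_k = 1 if k is prime and X_k = 0 otherwise; a is completely
% multiplicative with a(p_j) = (-1)^j on the j-th prime p_j, and |a(d)| = 1.
\[
\sum_{k=1}^{n} a(dk)\,X_k(\omega)
= a(d)\sum_{\substack{p\le n\\ p\ \mathrm{prime}}} a(p)
= a(d)\sum_{j=1}^{\pi(n)} (-1)^{j}
= \begin{cases} -a(d), & \pi(n)\ \text{odd},\\ 0, & \pi(n)\ \text{even}. \end{cases}
\]
% Here \rho_k is the indicator of the primes, so \sum_k \rho_k^l = +\infty for every l:
% only the monotonicity hypothesis of Theorem 1.8 fails.
```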

If we take $c=1$ and decreasing $\rho_{k}$ such that $\rho_{k}\geq\frac{1}{\log{k}}$ and ${\mathbb{P}}(X_{k}=0)=1-\rho_{k}$ for $k\geq 2$ , then Theorem 1.8 applies, and gives that the indicator functions of certain sparse random subsets of the integers are good weights for the Erdős discrepancy problem.
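The divergence hypothesis of Theorem 1.8 for such $\rho_{k}$ can be verified as follows. This is a short illustrative check; the threshold $k_{0}$ is a name introduced only here.

```latex
% Fix l \in \mathbb{N}. Since (\log k)^{l}/k \to 0, there is k_0 = k_0(l) \ge 3
% such that (\log k)^{l} \le k for all k \ge k_0. Using \rho_k \ge 1/\log k:
\[
\sum_{k\in\mathbb{N}} \rho_k^{\,l}
\ \ge\ \sum_{k\ge k_0} \frac{1}{(\log k)^{l}}
\ \ge\ \sum_{k\ge k_0} \frac{1}{k}
\ =\ +\infty .
\]
```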

1.3 Results related to weighted sums of multiplicative functions

As was the case in the proof of Theorem 1.1 in [15], the unboundedness of weighted discrepancy sums for arbitrary unit modulus sequences follows from similar unboundedness properties of unit modulus completely multiplicative functions. We state next some related results that are of independent interest.

Theorem 1.9.

Let $f\colon{\mathbb{N}}\to\mathbb{U}$ be a non-null multiplicative function and $w\colon{\mathbb{N}}\to\mathbb{U}$ be non-null, totally ergodic, with zero entropy, and vanishing self-correlations. Then

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,w(k)\Big|=+\infty. \tag{3}$$

In fact, the following stronger property holds: If $w$ is as before, $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ are multiplicative functions, and $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ are such that the sequence $(\prod_{j=1}^{\ell}f_{j}(k+h_{j}))_{k\in{\mathbb{N}}}$ is non-null, then we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,w(k)\Big|=+\infty. \tag{4}$$

Remark.

Note that for $w=1$, although (3) holds for all completely multiplicative functions with values on $\mathbb{S}$, it fails for some non-null multiplicative functions with values on $\mathbb{U}$. For instance, it fails for $f(k)=(-1)^{k+1}$, $k\in{\mathbb{N}}$, and for all non-trivial Dirichlet characters.
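The two counterexamples in this remark can be checked directly. The computation below is an illustrative sketch; $\chi$ and $q$ are names introduced only here, for a non-trivial Dirichlet character and its modulus.

```latex
% f(k) = (-1)^{k+1} equals 1 on odd k and -1 on even k. It is multiplicative
% (coprime m, n are never both even) but not completely multiplicative,
% since f(4) = -1 while f(2)^2 = 1. Its partial sums are bounded:
\[
\sum_{k=1}^{n} (-1)^{k+1} = \begin{cases} 1, & n\ \text{odd},\\ 0, & n\ \text{even}. \end{cases}
\]
% A non-trivial Dirichlet character \chi mod q is q-periodic with
% \sum_{k=1}^{q}\chi(k) = 0, so complete periods cancel:
\[
\Big|\sum_{k=1}^{n}\chi(k)\Big|
= \Big|\sum_{k=q\lfloor n/q\rfloor+1}^{n}\chi(k)\Big| \le q .
\]
```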

Regarding the non-weighted version of (4), not much is known for $\ell\geq 2$ . For instance, it is not known whether for every completely multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,f(k+1)\Big|=+\infty.$$

This problem was raised by J. Teräväinen and A. Klurman, who remarked that it is not even clear how to prove that

$$\limsup_{n\to\infty}\Big|\sum_{k=1}^{n}\lambda(k)\,\lambda(k+1)\Big|\geq 5$$

where $\lambda$ is the Liouville function. On the other hand, it is an immediate consequence of the next corollary, that if $f\colon{\mathbb{N}}\to\mathbb{S}$ is a multiplicative function, $l\geq 2$ , and $\alpha$ is irrational, then we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,f(k+1)\,e(k^{l}\alpha)\Big|=+\infty.$$

Corollary 1.10.

Let $\phi\colon{\mathbb{T}}\to\mathbb{U}$ be a Riemann integrable function with $\int\phi=0$ and $\int|\phi|\neq 0$ , and $P\colon{\mathbb{R}}\to{\mathbb{T}}$ be a polynomial with degree at least $2$ and irrational leading coefficient. Then for all multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ and $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ such that the sequence $(\prod_{j=1}^{\ell}f_{j}(k+h_{j}))_{k\in{\mathbb{N}}}$ is non-null, we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,\phi(P(k))\Big|=+\infty.$$

Regarding weights given by random $\pm 1$ sequences, we have the following result:

Theorem 1.11.

Let $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ be a sequence of independent random variables with ${\mathbb{P}}(X_{k}=-1)={\mathbb{P}}(X_{k}=1)=\frac{1}{2}$ , $k\in{\mathbb{N}}$ . Then $\omega$ -almost surely the following holds: For every $\ell\in{\mathbb{N}}$ , all multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ , and $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ such that the sequence $(\prod_{j=1}^{\ell}f_{j}(k+h_{j}))_{k\in{\mathbb{N}}}$ is non-null, we have

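$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}f_{j}(k+h_{j})\,X_{k}(\omega)\Big|=+\infty. \tag{5}$$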

Remarks.

$\bullet$ It is not hard to show that for any fixed collection of arbitrary sequences $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ , we have that (5) holds $\omega$ -almost surely. So the important point in Theorem 1.11 is that the set of $\omega$ ’s for which the conclusion holds is independent of the (uncountably many) multiplicative functions $f_{1},\ldots,f_{\ell}$ .

$\bullet$ For $\ell=1$, Theorem 1.8 gives better results that apply to not necessarily symmetric random variables. But for $\ell\geq 2$ the method of proof of Theorem 1.8 fails to give (5) (since the relevant unweighted result is not known).

Theorem 1.11 is based on Theorem 5.3 below, which is proved by combining some simple counting arguments and concentration of measure estimates for sums of independent random variables.

1.4 Proof strategy

Let us first recall the proof strategy of Theorem 1.1 given in [15]. An immediate consequence of Theorem 1.1 is that for every completely multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\Big|=+\infty. \tag{6}$$

It turns out that a variant of this special case (see Proposition 2.5 below for $w=1$ ) is the key ingredient in the proof of Theorem 1.1. The proof of (6) given in [15] proceeds by considering separately the case where $f$ is structured (“pretentious”) and random (“non-pretentious”). The latter case can be treated (as in Proposition 2.6 below) using the identities

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,f(n+h)\,\overline{f(n)}=0,\quad h\in{\mathbb{N}}, \tag{7}$$

which hold for random-like (“non-pretentious”) multiplicative functions.

Likewise, our arguments rely on weighted variants of (6) and (7) that are of independent interest. For instance, we prove that if $l\geq 2$ and $\alpha$ is irrational, then for every multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}f(k)\,e(k^{l}\alpha)\Big|=+\infty, \tag{8}$$

and we also prove stronger results involving weighted sums of products of shifts of several multiplicative functions. To prove (8) we rely on one of the main results in [3], which implies that for every $l\in{\mathbb{N}}$ and $\alpha$ irrational we have

$$\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,f(n+h)\,\overline{f(n)}\,e(n^{l}\alpha)=0. \tag{9}$$

The fact that (9) holds for every multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ (which is not true for (7)) makes the proof of (8), and ultimately the proof that $(e(k^{l}\alpha))_{k\in{\mathbb{N}}}$ is a good weight, simpler than the argument given for (6) in [15]. One reason is that we do not have to carry out a separate analysis in the case where $f$ is structured (“pretentious”), as was the case in [15].

The proofs of the results concerning random weights are simpler. Theorem 1.7 is based on a variant of (9) that uses random weights and is proved in Theorem 5.3 via elementary techniques. Theorem 1.8 is deduced from Theorem 1.1 using an elementary argument given in Section 5.2.

1.5 Some open problems

A possible strengthening of Theorem 1.1 is given in the following problem (for $w=1$ and $a=b$ the problem was previously proposed by J. Teräväinen and A. Klurman at the December 2018 workshop of the American Institute of Mathematics “Sarnak’s Conjecture”):

Problem 1.

Is it true that for every $a,b\colon{\mathbb{N}}\to\mathbb{S}$ we have

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}a(dk)\,b(d(k+1))\,w(k)\Big|=+\infty$$

when $w(k)=1$ , $k\in{\mathbb{N}}$ , or when $w(k)=e(k^{2}\alpha)$ , $k\in{\mathbb{N}}$ , with $\alpha$ irrational?

When $w=1$ the problem is open even when $a=b=f$, where $f\colon{\mathbb{N}}\to\mathbb{S}$ is a completely multiplicative function (see the remarks in Section 1.3). More generally, one can ask whether for the previous choices of the sequence $w$, for every $a_{1},\ldots,a_{\ell}\colon{\mathbb{N}}\to\mathbb{S}$ and all $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ we have

$$\sup_{d,n\in{\mathbb{N}}}\Big|\sum_{k=1}^{n}\prod_{j=1}^{\ell}a_{j}(d\,(k+h_{j}))\,w(k)\Big|=+\infty.$$

Corollary 1.10 shows that the answer is yes when $a_{1},\ldots,a_{\ell}$ are multiplicative functions with values on $\mathbb{S}$ and $w$ is the sequence $(e(k^{2}\alpha))_{k\in{\mathbb{N}}}$ with $\alpha$ irrational. But unlike the previous discrepancy statements, we do not have a way to reduce Problem 1 to one about weighted sums of multiplicative functions. Any such reduction probably depends upon obtaining an integral representation result, analogous to Proposition 2.4 below, for sequences of the form $A(k_{1},\ldots,k_{\ell})={\mathbb{E}}_{d\in\Phi}\prod_{j=1}^{\ell}a_{j}(dk_{j})$ , $k_{1},\ldots,k_{\ell}\in{\mathbb{N}}$ , where $\Phi$ is a multiplicative Følner sequence (see Section 2.1) along which all previous averages exist. Note though that more complicated “higher order multiplicative functions” arise this way, for instance, if $f\colon{\mathbb{N}}\to\mathbb{S}$ is defined by $f(k)=e((n_{1}\alpha_{1}+\cdots+n_{l}\alpha_{l})^{2})$ , where $k=p_{1}^{n_{1}}\cdots p_{l}^{n_{l}}$ is the unique factorization of $k\in{\mathbb{N}}$ , and $\alpha_{1},\ldots,\alpha_{l}\in{\mathbb{R}}$ , then

$$f^{2}(k)={\mathbb{E}}_{d\in\Phi}\,f(d)\,\overline{f^{2}(dk)}\,f(dk^{2})$$

for every $k\in{\mathbb{N}}$ .

In a different direction, it seems likely that the zero integral condition in Corollary 1.5 can be removed. Proving this would probably require combining the arguments of this article with a detailed analysis of the pretentious case (similar to the one in [15]), and it is not clear how to do this.

Problem 2.

Is it true that Corollary 1.5 holds even if we do not assume that $\int\phi=0$ ?

Let us say that a subset $S$ of ${\mathbb{N}}$ is good for the Erdős discrepancy problem, or simply, good, if the indicator function ${\bf 1}_{S}$ is a good weight for the Erdős discrepancy problem. By taking the sequence $(a(k))_{k\in{\mathbb{N}}}$ in (2) to be an appropriate multiplicative function one easily verifies that the sets $\{n\not\equiv 0\pmod{r}\}$ for $r\geq 3$, $\{2^{n},n\in{\mathbb{N}}\}$, and $\{p_{n},n\in{\mathbb{N}}\}$, where $p_{n}$ is the $n$-th prime, are bad. On the other hand, it is easy to deduce from Theorem 1.1 that the sets $r{\mathbb{Z}}$ for $r\in{\mathbb{N}}$ and $\{n^{l},n\in{\mathbb{N}}\}$ for $l\in{\mathbb{N}}$, are good. But it is not at all clear whether certain simple sets that lack multiplicative structure are good.

Problem 3.

Are the sets $\{p_{n}+1,n\in{\mathbb{N}}\}$ , $\{n^{2}\pm 1,n\in{\mathbb{N}}\}$ , $\{2^{n}+1,n\in{\mathbb{N}}\}$ , or $\{[n^{c}],n\in{\mathbb{N}}\}$ for $c>1$ not an integer, good for the Erdős discrepancy problem?

Theorem 1.8 implies that random subsets of the integers with positive density, and certain sparse random subsets with density roughly $(\log{N})^{-1}$ in $[N]$ , are almost surely good. But how about sparser random subsets?

Problem 4.

Let $a\in(0,1]$ and $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ be a sequence of independent random variables with ${\mathbb{P}}(X_{k}=1)=k^{-a}$ , ${\mathbb{P}}(X_{k}=0)=1-k^{-a}$ , $k\in{\mathbb{N}}$ . Is it true that $\omega$ -almost surely the sequence $(X_{k}(\omega))_{k\in{\mathbb{N}}}$ is a good weight for the Erdős discrepancy problem?

1.6 Notation

With $\mathbb{U}$ we denote the complex unit disc $\{z\in{\mathbb{C}}\colon|z|\leq 1\}$ and with $\mathbb{S}$ we denote the complex unit circle $\{z\in{\mathbb{C}}\colon|z|=1\}$ . With ${\mathbb{T}}$ we denote the $1$ -dimensional torus that we identify with ${\mathbb{R}}/{\mathbb{Z}}$ . With ${\mathbb{N}}$ we denote the positive integers and with ${\mathbb{Z}}^{+}$ the non-negative integers. For $N\in{\mathbb{N}}$ we let $[N]:=\{1,\ldots,N\}$ . For $t\in{\mathbb{R}}$ we also let $e(t):=e^{2\pi it}$ .

If $A$ is a non-empty finite subset of ${\mathbb{N}}$ we let

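$${\mathbb{E}}_{n\in A}\,a(n):=\frac{1}{|A|}\sum_{n\in A}a(n),\qquad \mathbb{E}^{\log}_{n\in A}\,a(n):=\frac{1}{\sum_{n\in A}\frac{1}{n}}\sum_{n\in A}\frac{a(n)}{n}.$$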

If $A$ is an infinite subset of ${\mathbb{N}}$ we let

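$${\mathbb{E}}_{n\in A}\,a(n):=\lim_{N\to\infty}{\mathbb{E}}_{n\in A\cap[N]}\,a(n),\qquad \mathbb{E}^{\log}_{n\in A}\,a(n):=\lim_{N\to\infty}\mathbb{E}^{\log}_{n\in A\cap[N]}\,a(n)$$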

whenever the limits exist.

With ${\mathbf{N}}=([N_{l}])_{l\in{\mathbb{N}}}$ we denote a sequence of intervals with $N_{l}\to\infty$ . We let

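$${\mathbb{E}}_{n\in{\mathbf{N}}}\,a(n):=\lim_{l\to\infty}{\mathbb{E}}_{n\in[N_{l}]}\,a(n),\qquad \mathbb{E}^{\log}_{n\in{\mathbf{N}}}\,a(n):=\lim_{l\to\infty}\mathbb{E}^{\log}_{n\in[N_{l}]}\,a(n)$$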

whenever the limits exist. Using partial summation one sees that if ${\mathbb{E}}_{n\in{\mathbb{N}}}\,a(n)=0$ , then also $\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,a(n)=0$ (but the converse does not hold in general).
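The partial summation step mentioned above can be spelled out as follows. This is a sketch under the assumption, consistent with the sequences considered in this article, that $|a(n)|\leq 1$; the names $S$, $\varepsilon$ and $n_{0}$ are introduced only for this illustration.

```latex
% Let S(n) := \sum_{k\le n} a(k). The hypothesis says S(n)/n \to 0.
% Summation by parts with weights 1/n gives
\[
\sum_{n=1}^{N}\frac{a(n)}{n}
= \frac{S(N)}{N} + \sum_{n=1}^{N-1} S(n)\Big(\frac{1}{n}-\frac{1}{n+1}\Big)
= \frac{S(N)}{N} + \sum_{n=1}^{N-1} \frac{S(n)}{n(n+1)} .
\]
% Given \varepsilon > 0, choose n_0 with |S(n)| \le \varepsilon n for n \ge n_0,
% and use the trivial bound |S(n)| \le n for n < n_0:
\[
\Big|\sum_{n=1}^{N}\frac{a(n)}{n}\Big|
\le 1 + \sum_{n<n_0}\frac{1}{n+1} + \varepsilon\sum_{n=n_0}^{N-1}\frac{1}{n+1}
\le 1 + \log n_0 + \varepsilon\log N .
\]
% Dividing by \sum_{n\le N} 1/n \ge \log N and letting N \to \infty, then \varepsilon \to 0:
\[
\limsup_{N\to\infty}\big|\mathbb{E}^{\log}_{n\in[N]}\,a(n)\big| \le \varepsilon
\quad\text{for every } \varepsilon>0 .
\]
```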

2 Reduction to statements about multiplicative functions

2.1 Multiplicative averages

We denote by ${\mathbb{Q}}^{+}$ the multiplicative group of positive rationals.

Definition 2.1.

We say that $\Phi=(\Phi_{N})_{N\in{\mathbb{N}}}$ is a multiplicative Følner sequence, if $\Phi_{N}$ is a finite subset of ${\mathbb{N}}$ for every $N\in{\mathbb{N}}$ , and for every $r\in{\mathbb{Q}}^{+}$ we have

$$\lim_{N\to\infty}\frac{1}{|\Phi_{N}|}\,\big|(r^{-1}\Phi_{N})\triangle\Phi_{N}\big|=0 \tag{10}$$

where $r^{-1}\Phi_{N}:=\{n\in{\mathbb{N}}\colon rn\in\Phi_{N}\}$ .

An example of a multiplicative Følner sequence is given by

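$$\Phi_{N}:=\{p_{1}^{k_{1}}\cdots p_{N}^{k_{N}}\colon 0\leq k_{1},\ldots,k_{N}\leq N\},\quad N\in{\mathbb{N}},$$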

where $(p_{n})_{n\in{\mathbb{N}}}$ denotes the sequence of primes.
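For the example above, namely the sets $\Phi_{N}$ of integers $p_{1}^{k_{1}}\cdots p_{N}^{k_{N}}$ with all exponents between $0$ and $N$, the Følner property of Definition 2.1 can be checked by direct counting when $r$ is a prime or the reciprocal of a prime. The count below is an illustrative sketch; a general $r\in{\mathbb{Q}}^{+}$ can be handled by a similar, slightly longer count.

```latex
% There are N+1 choices for each of the N exponents, so |\Phi_N| = (N+1)^N.
% Case r = p_i with i \le N: multiplying by p_i raises the i-th exponent by one, so
\[
p_i^{-1}\Phi_N = \{n\in\mathbb{N}\colon p_i n\in\Phi_N\}
= \{p_1^{k_1}\cdots p_N^{k_N}\colon 0\le k_j\le N\ (j\neq i),\ 0\le k_i\le N-1\}
\subset \Phi_N ,
\]
\[
\frac{\big|(p_i^{-1}\Phi_N)\triangle\Phi_N\big|}{|\Phi_N|}
= \frac{(N+1)^{N-1}}{(N+1)^{N}} = \frac{1}{N+1}\ \longrightarrow\ 0 .
\]
% Case r = 1/p_i: now the i-th exponent ranges over 1,...,N+1, and the symmetric
% difference consists of the elements with k_i = 0 or k_i = N+1:
\[
\frac{\big|((1/p_i)^{-1}\Phi_N)\triangle\Phi_N\big|}{|\Phi_N|}
= \frac{2\,(N+1)^{N-1}}{(N+1)^{N}} = \frac{2}{N+1}\ \longrightarrow\ 0 .
\]
```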

Definition 2.2.

If $\Phi=(\Phi_{N})_{N\in{\mathbb{N}}}$ is a multiplicative Følner sequence and $a\colon{\mathbb{N}}\to\mathbb{U}$ is such that the average below exists, we define the multiplicative average of the sequence $a$ along $\Phi$ by

$${\mathbb{E}}_{n\in\Phi}\,a(n):=\lim_{N\to\infty}{\mathbb{E}}_{n\in\Phi_{N}}\,a(n).$$

Note that property (10) implies the following dilation invariance property of the multiplicative averages: For every $a\colon{\mathbb{Q}}^{+}\to\mathbb{U}$ , multiplicative Følner sequence $\Phi$ , and $r\in{\mathbb{Q}}^{+}$ , we have

$${\mathbb{E}}_{n\in\Phi}\,(a(rn)-a(n))=0. \tag{11}$$
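The dilation invariance of multiplicative averages, that is, ${\mathbb{E}}_{n\in\Phi}(a(rn)-a(n))=0$ for $a\colon{\mathbb{Q}}^{+}\to\mathbb{U}$ and $r\in{\mathbb{Q}}^{+}$, can be derived from the Følner property of Definition 2.1 along the following lines. This is a sketch of one possible argument, not necessarily the one the author has in mind; the notation $r\Phi_{N}:=\{rn\colon n\in\Phi_{N}\}$ and $s:=1/r$ is introduced only here.

```latex
% Step 1: substitute m = rn and use |a| \le 1.
\[
\Big|\sum_{n\in\Phi_N} a(rn) - \sum_{n\in\Phi_N} a(n)\Big|
= \Big|\sum_{m\in r\Phi_N} a(m) - \sum_{m\in\Phi_N} a(m)\Big|
\le |r\Phi_N \triangle \Phi_N| .
\]
% Step 2: |r\Phi_N| = |\Phi_N|, so the two one-sided differences have the same size.
\[
|r\Phi_N\setminus\Phi_N| = |\Phi_N| - |r\Phi_N\cap\Phi_N| = |\Phi_N\setminus r\Phi_N| .
\]
% Step 3: with s = 1/r we have s^{-1}\Phi_N = \{n\in\mathbb{N}\colon n/r\in\Phi_N\}
% = r\Phi_N\cap\mathbb{N}, and \Phi_N \subset \mathbb{N}, hence
\[
|\Phi_N\setminus r\Phi_N| = |\Phi_N\setminus s^{-1}\Phi_N|
\le \big|(s^{-1}\Phi_N)\triangle\Phi_N\big| .
\]
% Conclusion: apply the Folner property to s = 1/r.
\[
\Big|\mathbb{E}_{n\in\Phi_N}\,a(rn) - \mathbb{E}_{n\in\Phi_N}\,a(n)\Big|
\le \frac{2\,\big|(s^{-1}\Phi_N)\triangle\Phi_N\big|}{|\Phi_N|}\ \longrightarrow\ 0 .
\]
```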

2.2 Reduction to multiplicative functions via Bochner’s theorem

A variant of the next lemma was proved in [15, Section 2] using Fourier analysis on an appropriate finite Abelian group (of the form $({\mathbb{Z}}/M{\mathbb{Z}})^{r}$ for large $M,r\in{\mathbb{N}}$ ) and a compactness argument. We use a somewhat different approach (also used in [1, Section 10.2]) that invokes Bochner’s theorem on positive definite functions. We first introduce some notation.

Definition 2.3.

With ${\mathcal{M}}$ we denote the set of all completely multiplicative functions $f\colon{\mathbb{N}}\to\mathbb{S}$ .

Endowed with pointwise multiplication and the topology of pointwise convergence, the set ${\mathcal{M}}$ is a compact (metrizable) Abelian group.

Proposition 2.4.

Let $A\colon{\mathbb{N}}^{2}\to{\mathbb{C}}$ be defined by

[TABLE]

where $a\colon{\mathbb{N}}\to{\mathbb{C}}$ is a bounded sequence and $\Phi=(\Phi_{N})_{N\in{\mathbb{N}}}$ is a multiplicative Følner sequence such that all the averages above exist. Then there exists a (positive) measure $\sigma$ on the space ${\mathcal{M}}$ , with total mass equal to ${\mathbb{E}}_{d\in\Phi}|a(d)|^{2}$ , such that

[TABLE]

Proof.

We first extend the sequence $a$ to the positive rationals ${\mathbb{Q}}^{+}$ by letting $a(r)=0$ for $r\in{\mathbb{Q}}^{+}\setminus{\mathbb{N}}$ . We define $B\colon{\mathbb{Q}}^{+}\to{\mathbb{C}}$ as follows

[TABLE]

Using the dilation invariance property (11) and our assumption that the averages defining the sequence $A$ exist, we deduce that the averages below exist and we have

[TABLE]

We are going to use this identity in order to verify that $B$ is a positive definite sequence on ${\mathbb{Q}}^{+}$ with pointwise multiplication. Indeed, for all $c_{1},\ldots,c_{N}\in{\mathbb{C}}$ and $r_{1},\ldots,r_{N}\in{\mathbb{Q}}^{+}$ , we have

[TABLE]

Note that the dual group of $({\mathbb{Q}}^{+},\cdot)$ consists of the completely multiplicative functions on ${\mathbb{Q}}^{+}$ with unit modulus, and any such $\psi\colon{\mathbb{Q}}^{+}\to\mathbb{S}$ satisfies $\psi(m/n)=f(m)\,\overline{f(n)}$ , $m,n\in{\mathbb{N}}$ , for some completely multiplicative function $f\in{\mathcal{M}}$ . A well known theorem of Bochner gives that there exists a (positive) Borel measure $\sigma$ on the space ${\mathcal{M}}$ such that

[TABLE]

The total mass of $\sigma$ is $B(1)={\mathbb{E}}_{d\in\Phi}|a(d)|^{2}$ . Lastly, we have

[TABLE]

and the proof is complete. ∎

Using the previous representation theorem we get the following criterion:

Proposition 2.5.

Let $w\colon{\mathbb{N}}\to\mathbb{U}$ be such that for every probability measure $\sigma$ on the space ${\mathcal{M}}$ we have

[TABLE]

Then $w$ is a good weight for the Erdős discrepancy problem.

Proof.

Arguing by contradiction, suppose that $w$ is not a good weight for the Erdős discrepancy problem. Then there exists a sequence $a\colon{\mathbb{N}}\to\mathbb{S}$ such that

[TABLE]

We average with respect to $d$ over a multiplicative Følner sequence of intervals $\Phi=(\Phi_{N})_{N\in{\mathbb{N}}}$ , chosen so that all relevant averages below exist (such a sequence can always be found using a diagonalisation argument), and deduce that

[TABLE]

Expanding the square we get that the expression in (12) is equal to

[TABLE]

where

[TABLE]

By Proposition 2.4, there exists a (positive) measure $\sigma$ on the space ${\mathcal{M}}$ , with total mass ${\mathbb{E}}_{d\in\Phi}|a(d)|^{2}=1$ , such that

[TABLE]

We deduce that the expression (13), and hence the expression in (12), is equal to

[TABLE]

Hence,

[TABLE]

This contradicts our assumption and completes the proof. ∎

2.3 Reduction to correlation estimates

As was the case in [15], a key step in the proof of our main results is an elementary observation that allows us to deduce unboundedness of partial sums from vanishing of self-correlations (which are defined using logarithmic averages for reasons explained in the next section).

Proposition 2.6.

Let $b\colon{\mathbb{N}}\to\mathbb{U}$ be a non-null sequence such that for every $h\in{\mathbb{N}}$ we have

[TABLE]

Then

[TABLE]

Proof.

Arguing by contradiction, suppose that the conclusion fails. Then there exists $C>0$ such that

[TABLE]

Using this, we can find a sequence of intervals ${\mathbf{N}}=([N_{l}])_{l\in{\mathbb{N}}}$ , with $N_{l}\to\infty$ , such that all averages $\mathbb{E}^{\log}_{n\in{\mathbf{N}}}$ written below exist and for every $H\in{\mathbb{N}}$ we have

[TABLE]

Since the sequence $b$ is non-null, we have

[TABLE]

Next, notice that

[TABLE]

since by our assumption $\mathbb{E}^{\log}_{n\in{\mathbf{N}}}\,b(n+h_{1})\,\overline{b(n+h_{2})}=0$ for $h_{1}\neq h_{2}$ and we also used twice that the logarithmic averages of a bounded sequence are translation invariant. From the above we deduce that $HB\leq 4C^{2}$ and we get a contradiction by choosing $H>4C^{2}/B$ .

∎
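The mechanism behind Proposition 2.6 can be seen on a toy example. The sketch below takes $b(n)=(-1)^{n}$, whose partial sums are bounded, and confirms that its self-correlation at shift $h=1$ does not vanish, as the proposition requires. It also computes the mean square of block sums of length $H$, which is the quantity the proof bounds by $4C^{2}$. The formula used for the logarithmic average (sum of $a(n)/n$ divided by sum of $1/n$ over $n\in[N]$) is our assumption about the convention in the paper's notation section, and the function name `log_average` is ours.

```python
def log_average(a, N):
    """Logarithmic average over [N]: sum of a(n)/n divided by sum of 1/n."""
    total = sum(a(n) / n for n in range(1, N + 1))
    weight = sum(1.0 / n for n in range(1, N + 1))
    return total / weight

def b(n):
    # Alternating signs: the partial sums take only the values -1 and 0.
    return (-1) ** n

N = 2000
max_partial_sum = 0
running = 0
for n in range(1, N + 1):
    running += b(n)
    max_partial_sum = max(max_partial_sum, abs(running))

# Self-correlation at shift h = 1: b(n+1) * conj(b(n)) equals -1 for every n.
corr_h1 = log_average(lambda n: b(n + 1) * b(n), N)

# Mean square of block sums of length H; the proof bounds this by 4 * C^2.
H = 7
block_mean_square = log_average(
    lambda n: abs(sum(b(n + h) for h in range(1, H + 1))) ** 2, N
)
```

Here `max_partial_sum` plays the role of $C$. The block mean square stays bounded in $H$, which would be impossible if all self-correlations vanished and $b$ were non-null, since the diagonal terms alone would then contribute a quantity growing linearly in $H$.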

Proposition 2.7.

Let $w\colon{\mathbb{N}}\to\mathbb{U}$ be a non-null sequence such that for every multiplicative function $f\colon{\mathbb{N}}\to\mathbb{S}$ and every $h\in{\mathbb{N}}$ we have

[TABLE]

Then $w$ is a good weight for the Erdős discrepancy problem.

Proof.

Arguing by contradiction, suppose that the conclusion fails. Then by Proposition 2.5 there exist a probability measure $\sigma$ on the space ${\mathcal{M}}$ and $C>0$ such that

[TABLE]

Using this and a diagonalization argument, we can find a sequence of intervals ${\mathbf{N}}=([N_{l}])_{l\in{\mathbb{N}}}$ , with $N_{l}\to\infty$ , such that $\mathbb{E}^{\log}_{n\in{\mathbf{N}}}|w(n)|^{2}$ and all averages $\mathbb{E}^{\log}_{n\in{\mathbf{N}}}$ written below exist and for every $H\in{\mathbb{N}}$ we have

[TABLE]

We let

[TABLE]

where the positiveness follows since the sequence $w$ is non-null by our assumption. Next, notice that

[TABLE]

since by our assumption $\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,(f\cdot w)(n+h_{1})\,\overline{(f\cdot w)(n+h_{2})}=0$ for $h_{1}\neq h_{2}$ . Since $\sigma$ is a probability measure, we deduce using the bounded convergence theorem that

[TABLE]

Combining (14) and (15) we deduce that $H\,A\leq 4C$ and we get a contradiction by choosing $H>4C/A$ . ∎

3 Notions and results from ergodic theory

The proofs of our main results regarding structured (zero entropy) sequences depend on some notions and results in ergodic theory that we describe next. The material in this section is not needed for the results concerning random weights.

3.1 Measure preserving systems

A measure preserving system, or simply a system, is a quadruple $(X,{\mathcal{X}},\mu,T)$ where $(X,{\mathcal{X}},\mu)$ is a probability space and $T\colon X\to X$ is an invertible, measurable, measure preserving transformation. We typically omit the $\sigma$ -algebra ${\mathcal{X}}$ and write $(X,\mu,T)$ . Throughout, for $n\in{\mathbb{N}}$ we denote by $T^{n}$ the composition $T\circ\cdots\circ T$ ( $n$ times) and let $T^{-n}:=(T^{n})^{-1}$ and $T^{0}:=\operatorname{id}_{X}$ . Also, for $f\in L^{1}(\mu)$ and $n\in{\mathbb{Z}}$ we denote by $T^{n}f$ the function $f\circ T^{n}$ .

We say that the system $(X,\mu,T)$ is ergodic if the only functions $f\in L^{1}(\mu)$ that satisfy $Tf=f$ are the constant ones. It is totally ergodic if $(X,\mu,T^{d})$ is ergodic for every $d\in{\mathbb{N}}$ .

3.2 Furstenberg systems

For the reader's convenience, we reproduce here some ergodic notions and constructions that can also be found in [2, 3]. For the purposes of this article, all averages in the definitions below are taken to be logarithmic. The reason is that we later on invoke results from ergodic theory, like Theorem 3.7 below, that are only known when the joint Furstenberg systems are defined using logarithmic averages. This limitation comes from the number theoretic input used in the proof of Theorem 3.7, in particular, the identities in [3, Theorem 3.1].

Definition 3.1.

Let ${\mathbf{N}}:=([N_{l}])_{l\in{\mathbb{N}}}$ be a sequence of intervals with $N_{l}\to\infty$ . We say that a finite collection of bounded sequences $\mathcal{A}=\{a_{1},\ldots,a_{\ell}\}$ admits log-correlations on ${\mathbf{N}}$ , if the limits

[TABLE]

exist for all $m\in{\mathbb{N}}$ , all $h_{1},\ldots,h_{m}\in{\mathbb{Z}}$ , and all $\tilde{a}_{1},\ldots,\tilde{a}_{m}\in\mathcal{A}\cup\overline{\mathcal{A}}$ .

For every finite collection of sequences that admits log-correlations on a given sequence of intervals, we use a variant of the correspondence principle of Furstenberg [5, 6] in order to associate a measure preserving system that captures the statistical properties of these sequences.

Definition 3.2.

Let $a_{1},\ldots,a_{\ell}\colon{\mathbb{Z}}\to\mathbb{U}$ be sequences that admit log-correlations on the sequence of intervals ${\mathbf{N}}:=([N_{l}])_{l\in{\mathbb{N}}}$ . We let $\mathcal{A}:=\{a_{1},\ldots,a_{\ell}\}$ , $X:=(\mathbb{U}^{\ell})^{\mathbb{Z}}$ , $T$ be the shift transformation on $X$ , and $\mu$ be the weak-star limit of the sequence of measures $(\mathbb{E}^{\log}_{n\in[N_{l}]}\,\delta_{T^{n}a})_{l\in{\mathbb{N}}}$ where $a:=(a_{1},\ldots,a_{\ell})$ is thought of as an element of $X$ . We call $(X,\mu,T)$ the joint Furstenberg system associated with ( $\mathcal{A}$ , ${\mathbf{N}}$ ).

Remark.

If we are given sequences $a_{1},\ldots,a_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ that are defined on ${\mathbb{N}}$ , we extend them to ${\mathbb{Z}}$ in an arbitrary way. It is easy to check that the measure $\mu$ will not depend on the extension.

Note that a collection of sequences $a_{1},\ldots,a_{\ell}\colon{\mathbb{Z}}\to\mathbb{U}$ may have several non-isomorphic joint Furstenberg systems depending on which sequence of intervals ${\mathbf{N}}$ we use in the evaluation of their joint correlations. For convenience of exposition, we sometimes associate a property of ergodic nature with a given finite collection of sequences if all joint Furstenberg systems of the collection have this property. In particular, we often use the following terminology:

Definition 3.3.

We say that a sequence $a\colon{\mathbb{Z}}\to\mathbb{U}$ is totally ergodic and/or has zero entropy, if all its Furstenberg systems are totally ergodic and/or have zero entropy.

Remark.

In [8], a zero entropy sequence is called completely deterministic.

Examples of zero entropy sequences include the sequences $(e(n^{l}\alpha))_{n\in{\mathbb{N}}}$ where $l\in{\mathbb{N}}$ and $\alpha\in{\mathbb{R}}$ ; these sequences are also totally ergodic if $\alpha$ is irrational (see Proposition 4.2 below).

3.3 Disjointness properties

We will use the following notion that was introduced by Furstenberg in [4]:

Definition 3.4.

We say that two systems $(X,\mu,T)$ and $(Y,\nu,S)$ are disjoint, if the only $T\times S$ invariant measure on the product space $(X\times Y,\mu\times\nu)$ , with first and second marginals the measures $\mu$ and $\nu$ respectively, is the product measure $\mu\times\nu$ .

The notion of disjointness in ergodic theory naturally introduces the following notion of statistical disjointness of two finite collections of bounded sequences.

Definition 3.5.

We say that two finite collections $\mathcal{A}$ and $\mathcal{B}$ of sequences with values on $\mathbb{U}$ are statistically disjoint, if all the joint Furstenberg systems of the collection $\mathcal{A}$ are (measure-theoretically) disjoint from all the joint Furstenberg systems of the collection $\mathcal{B}$ .

The next result shows that if two collections of sequences are statistically disjoint, then all their joint correlations decouple into products of joint correlations of $\mathcal{A}$ and joint correlations of $\mathcal{B}$ .

Proposition 3.6.

Let $\mathcal{A}=\{a_{1},\ldots,a_{\ell}\}$ and $\mathcal{A}^{\prime}=\{a^{\prime}_{1},\ldots,a^{\prime}_{\ell^{\prime}}\}$ be two collections of sequences with values on $\mathbb{U}$ that are statistically disjoint. Then

[TABLE]

for all choices $A_{n}=\prod_{j=1}^{m}\tilde{a}_{j}(n+h_{j})$ , $A^{\prime}_{n}=\prod_{j=1}^{m^{\prime}}\tilde{a}^{\prime}_{j}(n+h_{j}^{\prime})$ , $n\in{\mathbb{N}}$ , where $m,m^{\prime},h_{j},h_{j}^{\prime}\in{\mathbb{N}}$ and $\tilde{a}_{j}\in\mathcal{A}\cup\overline{\mathcal{A}}$ , $\tilde{a}^{\prime}_{j}\in\mathcal{A}^{\prime}\cup\overline{\mathcal{A}^{\prime}}$ are arbitrary.

Proof.

Arguing by contradiction, suppose that the conclusion fails. Then there exists a sequence of intervals ${\mathbf{N}}=([N_{l}])_{l\in{\mathbb{N}}}$ , with $N_{l}\to\infty$ , on which the family $\mathcal{A}\cup\mathcal{A}^{\prime}$ admits log-correlations and we have

[TABLE]

for some choice of $A_{n}=\prod_{j=1}^{m}\tilde{a}_{j}(n+h_{j})$ , $A^{\prime}_{n}=\prod_{j=1}^{m^{\prime}}\tilde{a}^{\prime}_{j}(n+h_{j}^{\prime})$ , $n\in{\mathbb{N}}$ , where $m,m^{\prime},h_{j},h_{j}^{\prime}\in{\mathbb{N}}$ and $\tilde{a}_{j}\in\mathcal{A}\cup\overline{\mathcal{A}}$ , $\tilde{a}^{\prime}_{j}\in\mathcal{A}^{\prime}\cup\overline{\mathcal{A}^{\prime}}$ . Let $(X,\mu,T)$ and $(X^{\prime},\mu^{\prime},T^{\prime})$ be the joint Furstenberg systems associated with ( $\mathcal{A}$ , ${\mathbf{N}}$ ) and ( $\mathcal{A}^{\prime}$ , ${\mathbf{N}}$ ) respectively.

We let $x_{0}:=(a_{1},\ldots,a_{\ell})\in X$ and $x_{0}^{\prime}:=(a^{\prime}_{1},\ldots,a^{\prime}_{\ell^{\prime}})\in X^{\prime}$ . After passing to a subsequence of ${\mathbf{N}}$ (which for simplicity we denote again by ${\mathbf{N}}$ ), we can assume that the weak-star limit

[TABLE]

exists and defines a $T\times T^{\prime}$ invariant measure on $X\times X^{\prime}$ . The projection of $\rho$ on $X$ is the weak-star limit $\lim_{l\to\infty}\mathbb{E}^{\log}_{n\in[N_{l}]}\delta_{T^{n}x_{0}}$ , which is the measure $\mu$ . Likewise, the projection of $\rho$ on $X^{\prime}$ is the measure $\mu^{\prime}$ . Since the families $\mathcal{A}$ and $\mathcal{A}^{\prime}$ are statistically disjoint, the systems $(X,\mu,T)$ and $(X^{\prime},\mu^{\prime},T^{\prime})$ are disjoint, hence

[TABLE]

Now for $x=(x_{1}(n),\ldots,x_{\ell}(n))_{n\in{\mathbb{Z}}}\in X$ we let

[TABLE]

Likewise, for $x^{\prime}=(x^{\prime}_{1}(n),\ldots,x^{\prime}_{\ell^{\prime}}(n))_{n\in{\mathbb{Z}}}\in X^{\prime}$ we let

[TABLE]

With the above notation, we define the function $F(x):=\prod_{j=1}^{m}G_{h_{j},j}(x)$ , $x\in X$ , where for $j=1,\ldots,m$ if $\tilde{a}_{j}=a_{k_{j}}$ or $\overline{a}_{k_{j}}$ for some $k_{j}\in\{1,\ldots,\ell\}$ we set $G_{h_{j},j}$ to be $F_{h_{j},k_{j}}$ or $\overline{F}_{h_{j},k_{j}}$ respectively. Likewise, we define the function $F^{\prime}(x^{\prime}):=\prod_{j=1}^{m^{\prime}}G^{\prime}_{h^{\prime}_{j},j}(x^{\prime})$ , $x^{\prime}\in X^{\prime}$ . Then using (16) and the definition of the measures $\mu,\mu^{\prime}$ and the measure $\rho$ given by (17), we get that

[TABLE]

This contradicts (18) and completes the proof. ∎

The next result follows by combining the structural result of [3, Theorem 1.5] with the disjointness statement of [2, Proposition 3.12].

Theorem 3.7 (F., Host [2, 3]).

All joint Furstenberg systems of any collection of multiplicative functions with values on $\mathbb{U}$ are disjoint from all zero entropy totally ergodic systems.

Restating Theorem 3.7 using terminology introduced in the previous definitions we get the following result:

Theorem 3.8.

Every finite collection of multiplicative functions with values on $\mathbb{U}$ is statistically disjoint from every totally ergodic sequence with zero entropy.

4 Proof of main results for structured weights

4.1 Proof of Theorems 1.4 and 1.9

First we show that the assumption of Proposition 2.6 is satisfied for various sequences of interest.

Proposition 4.1.

Suppose that $w\colon{\mathbb{N}}\to\mathbb{U}$ is a totally ergodic sequence with zero entropy and vanishing self-correlations. Let also $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ be multiplicative functions, $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ , and $b(n):=w(n)\,\prod_{j=1}^{\ell}f_{j}(n+h_{j})$ , $n\in{\mathbb{N}}$ . Then for every $h\in{\mathbb{N}}$ we have

[TABLE]

Remark.

For the purpose of proving Theorem 1.4 we only need to consider the case where $\ell=1$ and $f_{1}$ is completely multiplicative of unit modulus. But this special case does not seem to offer significant simplifications.

Proof.

By Theorem 3.8, the collections of sequences $\{f_{1},\ldots,f_{\ell}\}$ and $\{w\}$ are statistically disjoint. By Proposition 3.6, we have that the difference between the average

[TABLE]

and the product of averages

[TABLE]

converges to zero as $N\to\infty$ . Since by our assumption $\mathbb{E}^{\log}_{n\in{\mathbb{N}}}\,w(n+h)\,\overline{w(n)}=0$ for every $h\in{\mathbb{N}}$ , the result follows. ∎

Proof of Theorems 1.4 and 1.9.

Theorem 1.4 follows immediately from Propositions 2.7 and 4.1 (for $\ell=1$ , $h_{1}=0$ ).

To prove Theorem 1.9, we note first that by Theorem 3.8, the collections of sequences $\{f_{1},\ldots,f_{\ell}\}$ and $\{w\}$ are statistically disjoint. Hence, Proposition 3.6 gives that the difference

[TABLE]

converges to $0$ as $N\to\infty$ . Using this and our assumption that the sequences $(w(n))_{n\in{\mathbb{N}}}$ and $(\prod_{j=1}^{\ell}f_{j}(n+h_{j}))_{n\in{\mathbb{N}}}$ are non-null, we deduce that their product is also non-null. With this in mind, Theorem 1.9 follows from Propositions 2.6 and 4.1. ∎

4.2 Proof of Corollaries 1.5 and 1.10

We will need the following fact:

Proposition 4.2.

Let $P\in{\mathbb{R}}[t]$ be a non-constant polynomial with irrational leading coefficient and let $\phi\colon{\mathbb{T}}\to\mathbb{U}$ be Riemann integrable. Then the sequence $(\phi(P(n)))_{n\in{\mathbb{N}}}$ has zero entropy, is totally ergodic, and has a unique Furstenberg system.

Remark.

In order to have total ergodicity it is essential that the leading coefficient of $P$ (and not just any non-constant coefficient) is irrational. For example, if $a(n):=e(\frac{n^{3}}{3}+n^{2}\alpha)$ , $n\in{\mathbb{N}}$ , where $\alpha$ is irrational, then it turns out that the sequence $(a(n))$ is not totally ergodic. We thank S. Pattison for pointing this out; see Sections 5.3 and 5.4 in [12] for a related discussion.

Proof.

Let $d:=\deg{P}$ . We start with the well known fact (see [6, Section 1.7] or [12, Section 4.4]) that there exists a unipotent affine transformation $S\colon{\mathbb{T}}^{d}\to{\mathbb{T}}^{d}$ , with unique invariant measure the Haar measure $m_{{\mathbb{T}}^{d}}$ , so that the system $({\mathbb{T}}^{d},m_{{\mathbb{T}}^{d}},S)$ is totally ergodic (here we used that the leading coefficient of $P$ is irrational), a Riemann integrable function $\Psi\colon{\mathbb{T}}^{d}\to\mathbb{U}$ , and $y_{0}\in{\mathbb{T}}^{d}$ , such that

[TABLE]

(For instance, when $P(n)=n^{2}\alpha$ , $n\in{\mathbb{N}}$ , we can take $S(t,s)=(t+\alpha,s+2t+\alpha)$ , $\Psi(t,s)=\phi(s)$ , $t,s\in{\mathbb{T}}$ , and $y_{0}=(0,0)$ , since then $S^{n}y_{0}=(n\alpha,n^{2}\alpha)$ .) We let $X:=\mathbb{U}^{\mathbb{Z}}$ and $T$ be the shift transformation on $X$ . We define the map $\pi\colon{\mathbb{T}}^{d}\to X$ by

[TABLE]

Clearly we have $T\circ\pi=\pi\circ S$ . Next, let $m\in{\mathbb{N}}$ and $\ell_{-m},\ldots,\ell_{m}\in{\mathbb{Z}}$ . We define the function

[TABLE]

where we used the following conventions: for $z\in\mathbb{U}$ and $k<0$ we have $z^{k}:=\overline{z^{-k}}$ and $0^{0}=0$ . Note that the linear span of all such functions forms a conjugation closed subalgebra of $C(X)$ that separates points, hence it is dense in $C(X)$ .

Next note that for $x_{0}:=(\phi(P(n)))_{n\in{\mathbb{Z}}}\in X$ we have

[TABLE]

where to justify the second identity we use (19), for the third we use the unique ergodicity of $S$ and the fact that $\Psi\circ S^{n}$ is Riemann integrable for $n\in{\mathbb{Z}}$ , and for the fourth we use (20). By linearity and density, it follows that the sequence of measures $({\mathbb{E}}_{n\in[N]}\delta_{T^{n}x_{0}})_{N\in{\mathbb{N}}}$ (and hence the sequence $(\mathbb{E}^{\log}_{n\in[N]}\delta_{T^{n}x_{0}})_{N\in{\mathbb{N}}}$ ) converges weak-star to a measure $\mu$ on $X$ , which is equal to the image of the measure $m_{{\mathbb{T}}^{d}}$ under $\pi$ . From the above, we deduce that the sequence $(\phi(P(n)))_{n\in{\mathbb{Z}}}$ has a unique Furstenberg system, which is $(X,\mu,T)$ , and $\pi$ is a factor map from the system $({\mathbb{T}}^{d},m_{{\mathbb{T}}^{d}},S)$ to the system $(X,\mu,T)$ . Since the system $({\mathbb{T}}^{d},m_{{\mathbb{T}}^{d}},S)$ is totally ergodic and has zero entropy, the same holds for its factor $(X,\mu,T)$ . This completes the proof. ∎
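The quadratic example mentioned in the proof can be checked numerically: under the skew product $S(t,s)=(t+\alpha,s+2t+\alpha)$ the orbit of $(0,0)$ is $S^{n}(0,0)=(n\alpha,n^{2}\alpha)$ modulo $1$, so the quadratic phase $n^{2}\alpha$ appears in the second coordinate. The following Python sketch iterates the map in floating point and compares with the closed form. The choice $\alpha=\sqrt{2}$, the iteration range, and the helper `circle_distance` are illustrative assumptions.

```python
import math

def S(point, alpha):
    """One step of the skew product S(t, s) = (t + alpha, s + 2t + alpha) on the 2-torus."""
    t, s = point
    return ((t + alpha) % 1.0, (s + 2 * t + alpha) % 1.0)

def circle_distance(x, y):
    """Distance between x and y on the circle R/Z."""
    d = abs(x - y) % 1.0
    return min(d, 1.0 - d)

alpha = math.sqrt(2)  # any irrational alpha would do; sqrt(2) is an arbitrary choice
point = (0.0, 0.0)
worst = 0.0
for n in range(1, 201):
    point = S(point, alpha)
    # Compare the n-th iterate with the closed form (n*alpha, n^2*alpha) mod 1.
    worst = max(
        worst,
        circle_distance(point[0], (n * alpha) % 1.0),
        circle_distance(point[1], (n * n * alpha) % 1.0),
    )
```

The value of `worst` should be at the level of accumulated rounding error. The induction behind the closed form is $s_{n+1}=s_{n}+2n\alpha+\alpha$, which sums to $n^{2}\alpha$.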

Proof of Corollaries 1.5 and 1.10.

It suffices to verify that the sequence $w(n):=\phi(P(n))$ , $n\in{\mathbb{N}}$ , satisfies the assumptions of Theorem 1.4. Since $P$ has an irrational non-constant coefficient, the sequence $(P(n))_{n\in{\mathbb{N}}}$ is equidistributed in ${\mathbb{T}}$ , which gives that $\mathbb{E}^{\log}_{n\in{\mathbb{N}}}|w(n)|^{2}=\int|\phi|^{2}>0$ , so $w$ is non-null. Moreover, it follows from Proposition 4.2 that $w$ has zero entropy and is totally ergodic. It remains to verify that it has vanishing self-correlations, meaning,

[TABLE]

for every $h\in{\mathbb{N}}$ . In fact, we establish a stronger property: If $\phi,\psi\colon{\mathbb{T}}\to{\mathbb{C}}$ are Riemann integrable, then for every $h\in{\mathbb{N}}$ we have

[TABLE]

Using standard Weyl estimates this is easily shown to be the case when $\phi(t):=e(kt)$ and $\psi(t):=e(lt)$ for some $k,l\in{\mathbb{Z}}$ (this is the only point where we use the assumption that $P$ has an irrational non-linear coefficient). Using linearity and uniform approximation by trigonometric polynomials, we deduce that (21) holds for all $\phi,\psi\in C({\mathbb{T}})$ . Finally, we deduce that (21) holds for all Riemann integrable $\phi,\psi$ by approximating them in $L^{1}(m_{\mathbb{T}})$ by continuous functions and using that the sequence $(P(n+h))_{n\in{\mathbb{N}}}$ is equidistributed in ${\mathbb{T}}$ for every $h\in{\mathbb{Z}}$ . This completes the proof. ∎
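As a numerical sanity check of the vanishing self-correlations in the simplest case, take $w(n)=e(n^{2}\alpha)$. Then $w(n+h)\,\overline{w(n)}=e(2hn\alpha+h^{2}\alpha)$ is a linear phase in $n$, and its averages decay like $1/N$ when $2h\alpha$ is not an integer. The sketch below uses Cesàro averages rather than the logarithmic averages of the text (convergence of Cesàro averages implies convergence of logarithmic averages to the same limit). The value $\alpha=\sqrt{2}$, the cutoff $N=20000$, and the function names are illustrative assumptions.

```python
import cmath
import math

def e(x):
    """The standard character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)

def self_correlation(P, h, N):
    """Cesaro average over n in [N] of e(P(n+h)) * conj(e(P(n)))."""
    return sum(e(P(n + h)) * e(P(n)).conjugate() for n in range(1, N + 1)) / N

alpha = math.sqrt(2)          # arbitrary irrational, chosen for illustration
P = lambda n: n * n * alpha   # P(n) = n^2 * alpha

# Absolute values of the self-correlations for a few shifts h.
correlations = {h: abs(self_correlation(P, h, 20000)) for h in (1, 2, 3)}
```

Each entry of `correlations` should be close to zero, in line with the geometric-sum bound $|\sum_{n\leq N}e(n\theta)|\leq 1/|\sin(\pi\theta)|$ applied with $\theta=2h\alpha$.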

5 Proof of main results for random weights

5.1 Proof of Theorems 1.7 and 1.11

For $N\in{\mathbb{N}}$ , we denote by ${\bf 1}_{[N]}$ the indicator function of the set $[N]$ and let

[TABLE]

We also let $B_{\varepsilon}$ be an $\varepsilon$ -net of points in $\mathbb{U}$ of minimal cardinality (thus $|B_{\varepsilon}|\leq 4\varepsilon^{-2}$ ) and define

[TABLE]

We need two lemmas. The first is an approximation property.

Lemma 5.1.

Let $f\colon{\mathbb{N}}\to\mathbb{U}$ be a multiplicative function. Then for every $\varepsilon>0$ and $N\in{\mathbb{N}}$ , there exists $g\in\mathcal{M}_{\varepsilon,N}$ such that

[TABLE]

Proof.

Since $B_{\varepsilon}$ is an $\varepsilon$ -net of $\mathbb{U}$ , and an element of ${\mathcal{M}}$ can take arbitrary prescribed values on prime powers, as long as these values are taken in $\mathbb{U}$ , there exists $g\in\mathcal{M}_{\varepsilon,N}$ such that $g(1)=f(1)$ and

[TABLE]

For $n\in\{2,\ldots,N\}$ , let $n=k_{1}\cdots k_{l}$ , where $l\leq\log_{2}{N}$ , be the unique factorization of $n$ into prime powers $k_{1},\ldots,k_{l}$ . Using the multiplicativity of $f$ and $g$ , the estimate (22), and telescoping, we get

[TABLE]

This completes the proof. ∎
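The telescoping step rests on the elementary inequality $|\prod_{i=1}^{l}a_{i}-\prod_{i=1}^{l}b_{i}|\leq\sum_{i=1}^{l}|a_{i}-b_{i}|$ for $a_{i},b_{i}\in\mathbb{U}$. This is how a per-prime-power error of size $\varepsilon$ (our reading of estimate (22)) turns into an error of at most $\varepsilon\log_{2}N$ for $n\leq N$. The following Python sketch tests the inequality on random inputs. The function names and the sampling scheme are illustrative assumptions.

```python
import cmath
import random

def telescoping_gap(a, b):
    """Return (|prod a - prod b|, sum |a_i - b_i|) for equal-length lists in the unit disc."""
    prod_a = prod_b = 1 + 0j
    for x, y in zip(a, b):
        prod_a *= x
        prod_b *= y
    return abs(prod_a - prod_b), sum(abs(x - y) for x, y in zip(a, b))

def random_disc_point(rng):
    """A random complex number of modulus at most 1."""
    return rng.random() * cmath.exp(2j * cmath.pi * rng.random())

rng = random.Random(0)  # fixed seed so the run is reproducible
violations = 0
for _ in range(1000):
    length = rng.randint(1, 10)
    a = [random_disc_point(rng) for _ in range(length)]
    b = [random_disc_point(rng) for _ in range(length)]
    lhs, rhs = telescoping_gap(a, b)
    if lhs > rhs + 1e-12:   # small tolerance for floating-point rounding
        violations += 1
```

No violations are expected, since the inequality follows by writing the difference of products as a telescoping sum whose $i$-th term has modulus at most $|a_{i}-b_{i}|$.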

For $\varepsilon>0$ and $\ell,N\in{\mathbb{N}}$ , we let

[TABLE]

The next lemma gives an upper bound on the number of elements of $\mathcal{M}_{\ell,\varepsilon,N}$ that suffices for our purposes.

Lemma 5.2.

Let $\varepsilon>0$ and $\ell\in{\mathbb{N}}$ . Then for all large enough $N\in{\mathbb{N}}$ we have

[TABLE]

Proof.

Notice first that because of multiplicativity, an $\ell$ -tuple $(f_{1},\ldots,f_{\ell})\in\mathcal{M}_{\ell,\varepsilon,N}$ is uniquely determined by the values $(f_{1}(k_{1}),\ldots,f_{\ell}(k_{\ell}))$ , where $k_{1},\ldots,k_{\ell}$ range over all prime powers in $[N]$ . Since for large enough $N$ there are at most $2\frac{N}{\log{N}}$ prime powers up to $N$ and $f_{j}(k)\in B_{\varepsilon}$ for $j=1,\ldots,\ell$ , we deduce that

[TABLE]

The asserted bound follows since $|B_{\varepsilon}|\leq 4\varepsilon^{-2}$ . ∎
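The only arithmetic input in the counting argument is that there are at most $2\frac{N}{\log N}$ prime powers up to $N$ once $N$ is large, so that an $\ell$-tuple is specified by at most $2\ell\frac{N}{\log N}$ choices from $B_{\varepsilon}$. The Python sketch below counts prime powers with a basic sieve and compares the count with $2N/\log N$ for a few values of $N$. The function name `count_prime_powers` and the sample values of $N$ are illustrative assumptions.

```python
import math

def count_prime_powers(N):
    """Count integers p^k <= N with p prime and k >= 1, using a simple sieve."""
    is_prime = [True] * (N + 1)
    is_prime[0:2] = [False, False]
    for p in range(2, math.isqrt(N) + 1):
        if is_prime[p]:
            for multiple in range(p * p, N + 1, p):
                is_prime[multiple] = False
    count = 0
    for p in range(2, N + 1):
        if is_prime[p]:
            power = p
            while power <= N:   # counts p, p^2, p^3, ... up to N
                count += 1
                power *= p
    return count

# Print N, the number of prime powers up to N, and the comparison value 2N / log N.
for N in (100, 1000, 10000, 100000):
    print(N, count_prime_powers(N), 2 * N / math.log(N))
```

The count is dominated by the primes themselves, and higher powers contribute only on the order of $\sqrt{N}$, which is why a prime number theorem type bound with constant $2$ suffices.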

Combining the previous two lemmas we can prove the following result, which is an essential ingredient of the proofs of Theorems 1.7 and 1.11.

Theorem 5.3.

Let $(X_{n}(\omega))_{n\in{\mathbb{N}}}$ be a sequence of independent random variables with ${\mathbb{P}}(X_{n}=-1)={\mathbb{P}}(X_{n}=1)=\frac{1}{2}$ , $n\in{\mathbb{N}}$ . Then for every $a\colon{\mathbb{N}}\to\mathbb{U}$ we have that $\omega$ -almost surely the following holds: For every $\ell\in{\mathbb{N}}$ , all multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ , and all $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ , we have

[TABLE]

Remarks.

$\bullet$ As was the case with Theorem 1.11, the important point in this statement is that the set of $\omega$ ’s for which (24) holds can be chosen independently of the (uncountably many) multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ .

$\bullet$ We note that for $\ell=1$ the previous result can also be proved using an orthogonality criterion by utilizing the fact that for every $b\colon{\mathbb{N}}\to\mathbb{U}$ we have $\omega$ -almost surely ${\mathbb{E}}_{n\in{\mathbb{N}}}\,b(n)\,X_{np}(\omega)\,X_{nq}(\omega)=0$ for all $p\neq q$ . But this method does not seem to be of much help when $\ell\geq 2$ and it is the $\ell=2$ case that is needed in the proof of Theorem 1.7.

Proof.

Since $\ell$ and $h_{1},\ldots,h_{\ell}$ take values on a countable set, it suffices to show that for all fixed $\ell\in{\mathbb{N}}$ , $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ , and $a\colon{\mathbb{N}}\to\mathbb{U}$ , the following statement holds $\omega$ -almost surely: For all multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ we have

[TABLE]

To prove this, we first note that using standard concentration of measure estimates (for example Bernstein’s exponential inequality) we have for every fixed sequence $b\colon{\mathbb{N}}\to\mathbb{U}$ and every $N\in{\mathbb{N}}$ and $\delta>0$ that

[TABLE]

We let

[TABLE]

Using the notation introduced in (23), we get for every large enough $N\in{\mathbb{N}}$ that

[TABLE]

where the first estimate follows from the union bound and (25), and the second estimate follows from Lemma 5.2. Using the Borel-Cantelli lemma we deduce that $\omega$ -almost surely we have

[TABLE]

Using Lemma 5.1, the fact that $\varepsilon_{N}\log{N}\to 0$ , and telescoping, we deduce that $\omega$ -almost surely we have

[TABLE]

This completes the proof. ∎
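The probabilistic input of the proof, concentration of $\frac{1}{N}\sum_{n\leq N}b(n)X_{n}(\omega)$ around zero for a fixed bounded $b$, can be illustrated by simulation. The sketch below draws one realization of random signs and evaluates the averages for a few fixed real-valued sequences. It also evaluates a Hoeffding-type tail bound $2\exp(-N\delta^{2}/2)$, which we use here in place of the Bernstein inequality cited in the text. The seed, the value of $N$, and the test sequences are illustrative assumptions.

```python
import math
import random

def signed_average(b, signs):
    """Average of b(n) * X_n over n = 1..N, where signs[n-1] = X_n."""
    N = len(signs)
    return sum(b(n) * signs[n - 1] for n in range(1, N + 1)) / N

rng = random.Random(12345)          # fixed seed, chosen arbitrarily
N = 20000
signs = [rng.choice((-1, 1)) for _ in range(N)]

# A few fixed bounded test sequences b: N -> [-1, 1] (illustrative choices).
test_sequences = {
    "constant": lambda n: 1.0,
    "alternating": lambda n: (-1.0) ** n,
    "cosine": lambda n: math.cos(n * math.sqrt(2)),
}
averages = {name: abs(signed_average(b, signs)) for name, b in test_sequences.items()}

# Hoeffding-type tail bound for real-valued b with |b| <= 1: 2 * exp(-N * delta^2 / 2).
delta = 0.05
tail_bound = 2 * math.exp(-N * delta ** 2 / 2)
```

The exponential decay of `tail_bound` in $N$ is what allows the union bound over the finite family $\mathcal{M}_{\ell,\varepsilon,N}$, whose cardinality grows only like $\exp(O(N/\log N))$ for fixed $\varepsilon$, to remain summable.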

Proof of Theorems 1.7 and 1.11.

Let $f_{1},\ldots,f_{\ell}$ and $h_{1},\ldots,h_{\ell}$ be as in Theorem 1.11. Note that $\omega$ -almost surely the sequence $(X_{k}(\omega)\prod_{j=1}^{\ell}f_{j}(k+h_{j}))_{k\in{\mathbb{N}}}$ is non-null, since $\omega$ -almost surely $|X_{k}(\omega)|=1$ , $k\in{\mathbb{N}}$ , and by assumption $(\prod_{j=1}^{\ell}f_{j}(k+h_{j}))_{k\in{\mathbb{N}}}$ is non-null. Likewise, if $a\colon{\mathbb{N}}\to\mathbb{U}$ is a non-null sequence and $f\colon{\mathbb{N}}\to\mathbb{S}$ is a multiplicative function, then $\omega$ -almost surely $(a(k)\,X_{k}(\omega)f(k))_{k\in{\mathbb{N}}}$ is non-null.

Since all fixed parameters that appear below take values on a countable set, by Proposition 2.7 (for Theorem 1.7) and Proposition 2.6 (for Theorem 1.11) it suffices to show that for every fixed $b\colon{\mathbb{N}}\to\mathbb{S}$ , all $h,\ell\in{\mathbb{N}}$ , and all $h_{1},\ldots,h_{\ell}\in{\mathbb{Z}}^{+}$ , we have $\omega$ -almost surely the following (for Theorem 1.7 we only need to use the case $\ell=1$ , $h_{1}=0$ ): For all multiplicative functions $f_{1},\ldots,f_{\ell}\colon{\mathbb{N}}\to\mathbb{U}$ we have

[TABLE]

(Note that then (26) also holds with $\mathbb{E}^{\log}_{n\in{\mathbb{N}}}$ in place of ${\mathbb{E}}_{n\in{\mathbb{N}}}$ .) We partition the positive integers into the following two sets

[TABLE]

We let

[TABLE]

Note that ${\mathbb{P}}(Y_{n}=-1)={\mathbb{P}}(Y_{n}=1)=\frac{1}{2}$ for all $n\in{\mathbb{N}}$ . Moreover, for $n\in S_{1}$ (and fixed $h\in{\mathbb{N}}$ ) the random variables $Y_{n}(\omega)$ are independent, and the same holds for the random variables $Y_{n}(\omega)$ for $n\in S_{2}$ . For $i=1,2$ we consider independent random variables $Z_{n,i}(\omega)$ , $n\in{\mathbb{N}}$ , such that ${\mathbb{P}}(Z_{n,i}=-1)={\mathbb{P}}(Z_{n,i}=1)=\frac{1}{2}$ , $n\in{\mathbb{N}}$ , and $Z_{n,i}:=Y_{n}$ for $n\in S_{i}$ . For $i=1,2$ , we apply Theorem 5.3 for the random variables $(Z_{n,i}(\omega))_{n\in{\mathbb{N}}}$ and $a_{i}(n):=b(n)\,{\bf 1}_{S_{i}}(n)$ (then $a_{i}(n)\,Z_{n,i}=b(n)\,{\bf 1}_{S_{i}}(n)\,Y_{n}$ , $n\in{\mathbb{N}}$ ), and deduce that $\omega$ -almost surely we have

[TABLE]

for $i=1,2$ . Adding the two identities we get (26). This completes the proof. ∎

5.2 Proof of Theorem 1.8

We will use the following finitistic strengthening of Theorem 1.1 that can be deduced from Theorem 1.1 using a compactness argument:

Theorem 5.4.

For every $C>0$ there exists $m\in{\mathbb{N}}$ such that for every sequence $a\colon[m]\to\mathbb{S}$ there exist $d,n\in{\mathbb{N}}$ with $dn\leq m$ such that $|\sum_{k=1}^{n}a(dk)|>C$ .

We deduce from this some necessary conditions for a sequence to be a good weight for the Erdős discrepancy problem.

Lemma 5.5.

Let $w\colon{\mathbb{N}}\to{\mathbb{C}}$ be a sequence and $c\in{\mathbb{C}}\setminus\{0\}$ . Suppose that for infinitely many $m\in{\mathbb{N}}$ there exists $r\in{\mathbb{N}}$ such that

[TABLE]

Then $w$ is a good weight for the Erdős discrepancy problem.

Remark.

The conclusion fails if we simply assume that $w$ is equal to a non-zero constant on a union of arbitrarily long intervals. To see this, let $(a(k))_{k\in{\mathbb{N}}}$ be a completely multiplicative function that is equal to $(-1)^{k}$ on a sequence of intervals of even lengths that increase to infinity (such a multiplicative function can be explicitly constructed). Let also $w$ be the indicator function of the union of this sequence of intervals. Then $\sup_{d,n\in{\mathbb{N}}}\big{|}\sum_{k=1}^{n}a(dk)\,w(k)\big{|}\leq 1$ .

Proof.

Let $a\colon{\mathbb{N}}\to\mathbb{S}$ be a sequence and $C>0$ . Let $m\in{\mathbb{N}}$ be so that Theorem 5.4 applies for this $C$ and (27) holds for some $c\in{\mathbb{C}}\setminus\{0\}$ and $r\in{\mathbb{N}}$ . We use Theorem 5.4 for the sequence $(a(rm!+k))_{k\in[m]}$ and we get that there exist $d,n\in{\mathbb{N}}$ , with $dn\leq m$ , such that

[TABLE]

We let

[TABLE]

Note that

[TABLE]

Since $d,n\leq m$ , using the previous identity, and (27), (28), we deduce that

[TABLE]

Hence, either $|S_{d}\big{(}r\frac{m!}{d}+n\big{)}|\geq\frac{C}{2}$ or $|S_{d}\big{(}r\frac{m!}{d}\big{)}|\geq\frac{C}{2}$ . Since $C$ was arbitrary, we deduce that $\sup_{d,N\in{\mathbb{N}}}|S_{d}(N)|=+\infty$ . This completes the proof. ∎
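The key identity in this proof can be checked numerically under our reading of the omitted displays. We assume that $S_{d}(N):=\sum_{k=1}^{N}a(dk)\,w(k)$ and that condition (27) states $w(r\frac{m!}{i}+j)=c$ for all $i,j\in[m]$. Under these assumptions, $S_{d}(r\frac{m!}{d}+n)-S_{d}(r\frac{m!}{d})=c\sum_{k=1}^{n}a(rm!+dk)$ whenever $dn\leq m$. The Python sketch below verifies this for a random sign sequence $a$ and a weight that is arbitrary away from the special positions. All parameter values and helper names are illustrative.

```python
import math
import random

def S(a, w, d, N):
    """Weighted partial sum S_d(N) = sum_{k=1}^{N} a(d*k) * w(k) (assumed definition)."""
    return sum(a(d * k) * w(k) for k in range(1, N + 1))

m, r, c = 4, 5, 2.0          # illustrative parameters; c is the non-zero constant
M = math.factorial(m)        # m! is divisible by every d in [m]

rng = random.Random(1)
signs = {}
def a(n):
    """A random +-1 sequence, generated lazily and cached so a(n) is well defined."""
    if n not in signs:
        signs[n] = rng.choice((-1, 1))
    return signs[n]

# Positions r*m!/i + j with i, j in [m], where the weight is forced to equal c.
constant_positions = {r * M // i + j for i in range(1, m + 1) for j in range(1, m + 1)}
noise = {}
def w(k):
    """Weight equal to c on the special positions and arbitrary (random) elsewhere."""
    if k in constant_positions:
        return c
    if k not in noise:
        noise[k] = rng.uniform(-1, 1)
    return noise[k]

max_error = 0.0
for d in range(1, m + 1):
    for n in range(1, m // d + 1):           # ensures d*n <= m
        base = r * M // d
        difference = S(a, w, d, base + n) - S(a, w, d, base)
        unweighted = sum(a(r * M + d * k) for k in range(1, n + 1))
        max_error = max(max_error, abs(difference - c * unweighted))
```

Since the unweighted sum on the right can be made larger than $C$ in absolute value by Theorem 5.4, the identity forces one of the two weighted partial sums on the left to be large, which is the dichotomy used at the end of the proof.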

Proof of Theorem 1.8.

Let $c\in{\mathbb{C}}\setminus\{0\}$ be such that $\sum_{k\in{\mathbb{N}}}\rho_{k}^{l}=+\infty$ for every $l\in{\mathbb{N}}$ , where $\rho_{k}:={\mathbb{P}}(X_{k}=c)$ , $k\in{\mathbb{N}}$ . Let $m\geq 4$ . By Lemma 5.5, it suffices to show that $\omega$ -almost surely there exists $r\in{\mathbb{N}}$ such that

[TABLE]

One easily verifies that for any fixed $m\geq 4$ the random variables $X_{r\frac{m!}{i}+j}$ , $i,j\in[m]$ , $r\in m!{\mathbb{N}}+1$ , are independent. Hence,

[TABLE]

Since $(\rho_{k})_{k\in{\mathbb{N}}}$ is decreasing, we have that $\prod_{i,j\in[m]}\rho_{r\frac{m!}{i}+j}\geq\rho^{m^{2}}_{r(m+1)!}$ for all $r\in{\mathbb{N}}$ . Moreover, since $\sum_{k\in{\mathbb{N}}}\rho_{k}^{m^{2}}=+\infty$ , using again that $(\rho_{k})_{k\in{\mathbb{N}}}$ is decreasing, we get that $\sum_{r\in m!{\mathbb{N}}+1}\rho^{m^{2}}_{r(m+1)!}=+\infty$ . We deduce that

[TABLE]

Since the sets involved in the above probabilities are independent, the Borel-Cantelli theorem applies, and gives that $\omega$ -almost surely for infinitely many $r\in{\mathbb{N}}$ we have that $X_{r\frac{m!}{i}+j}(\omega)=c$ for all $i,j\in[m]$ . This completes the proof. ∎

Acknowledgments

I would like to thank M. Kolountzakis for providing the proof of Theorem 1.8 and other useful remarks, and S. Pattison for pointing out a correction in the statement of Proposition 4.2. I would also like to thank the American Institute of Mathematics (AIM) for its hospitality; part of this work was motivated by problems raised during the 2018 workshop “Sarnak’s Conjecture”.

Bibliography

[1] N. Frantzikinakis, B. Host. Higher order Fourier analysis of multiplicative functions and applications. J. Amer. Math. Soc. 30 (2017), 67–157.
[2] N. Frantzikinakis, B. Host. The logarithmic Sarnak conjecture for ergodic weights. Ann. of Math. (2) 187 (2018), 869–931.
[3] N. Frantzikinakis, B. Host. Furstenberg systems of bounded multiplicative functions and applications. To appear in Int. Math. Res. Not., arXiv:1804.08556.
[4] H. Furstenberg. Disjointness in ergodic theory, minimal sets, and a problem in diophantine approximation. Math. Systems Theory 1 (1967), 1–49.
[5] H. Furstenberg. Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J. Analyse Math. 31 (1977), 204–256.
[6] H. Furstenberg. Recurrence in Ergodic Theory and Combinatorial Number Theory. Princeton University Press, Princeton 1981.
[7] W. T. Gowers. Erdős and arithmetic progressions. Erdős Centennial, Bolyai Society Mathematical Studies, 25, L. Lovász, I. Z. Ruzsa, V. T. Sós eds., Springer 2013, 265–287.
[8] T. Kamae. Subsequences of normal sequences. Israel J. Math. 16 (1973), 121–149.
