Good weights for the Erd\H{o}s discrepancy problem
Nikos Frantzikinakis

TL;DR
This paper extends the Erdős discrepancy problem to weighted variants involving structured and random weights, proving unboundedness of weighted sums of multiplicative functions and employing measure-preserving systems analysis.
Contribution
It introduces weighted versions of the Erdős discrepancy problem, combining structured and random weights, and develops new structural results for measure-preserving systems related to multiplicative functions.
Findings
Weighted sums of bounded multiplicative functions are unbounded.
Structured weights with irrationality features lead to unbounded discrepancy.
The analysis leverages measure-preserving systems associated with multiplicative functions.
Abstract
The Erd\H{o}s discrepancy problem, now a theorem by T. Tao, asks whether every sequence with values plus or minus one has unbounded discrepancy along all homogeneous arithmetic progressions. We establish weighted variants of this problem, for weights given either by structured sequences that enjoy some irrationality features, or certain random sequences. As an intermediate result, we establish unboundedness of weighted sums of bounded multiplicative functions and products of shifts of such functions. A key ingredient in our analysis for the structured weights, is a structural result for measure preserving systems naturally associated with bounded multiplicative functions that was recently obtained in joint work with B. Host.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration
\dajAUTHORdetails
title = Good Weights for the Erdős Discrepancy Problem, author = Nikos Frantzikinakis, plaintextauthor = Nikos Frantzikinakis, plaintexttitle = Good Weights for the Erdos Discrepancy Problem, keywords =Multiplicative functions, discrepancy, Erdős discrepancy problem, Elliott conjecture, Furstenberg correspondence., \dajEDITORdetailsyear=2020, number=8, received=30 April 2020, published=14 July 2020, doi=10.19086/da.13688,
[classification=text]
Good Weights for the Erdős Discrepancy Problem
Nikos Frantzikinakis Supported by the Hellenic Foundation for Research and Innovation, Project No: 1684.
Abstract
The Erdős discrepancy problem, now a theorem by T. Tao, asks whether every sequence with values plus or minus one has unbounded discrepancy along all homogeneous arithmetic progressions. We establish weighted variants of this problem, for weights given either by structured sequences that enjoy some irrationality features, or certain random sequences. As an intermediate result, we establish that weighted sums of bounded multiplicative functions and products of shifts of such functions are unbounded. A key ingredient in our analysis for the structured weights, is a structural result for measure preserving systems naturally associated with bounded multiplicative functions that was recently obtained in joint work with B. Host.
1 Introduction and main results
1.1 Introduction
The Erdős discrepancy problem is an elementary question that dates back to the 1930’s and asks if there is a sequence that is evenly distributed along all homogeneous arithmetic progressions, in the sense that the sequence of partial sums is bounded uniformly in . The problem remained dormant for a long time and it was not until 2010 that interest was rejuvenated, when it became the subject of the Polymath5 project (see [7, 13] for related details). The problem was finally solved in 2015 by T. Tao [15] who proved the following (henceforth, with we denote the unit circle and with the complex unit disc):
Theorem 1.1** (Tao [15]).**
For every sequence we have
[TABLE]
We seek to obtain weighted variants of the previous result. To facilitate exposition, we introduce the following notion:
Definition 1.2**.**
We say that a sequence is a good weight for the Erdős discrepancy problem, or simply, a good weight, if for every we have
[TABLE]
Theorem 1.1 implies that (and more generally where is a completely multiplicative function) is a good weight for the Erdős discrepancy problem. On the other hand, sequences with bounded partial sums, like the sequence , where and , are not good weights, and more generally, a product of a completely multiplicative function with a sequence that has bounded partial sums is not a good weight (take ). It is less clear if some other oscillatory sequences like , where and is irrational, or random sequences of ’s are good weights. We will show in Corollary 1.5 and Theorem 1.7 that they are; that is, for every we have
[TABLE]
and a similar statement holds if we use as weights random sequences of . Moreover, in Theorem 1.4 we give a rather general criterion that allows us to show that a large class of zero entropy sequences that enjoy certain irrationality features are good weights for the Erdős discrepancy problem.
On a related result of independent interest, we show that certain weighted sums of multiplicative functions are unbounded. For instance, we prove in Corollary 1.10 that if , is irrational, and are multiplicative functions, then
[TABLE]
and in Theorems 1.11 we prove an analogous result when the weights are given by random sequences of ’s.
1.2 Results related to the weighted Erdős discrepancy problem
The next result gives necessary conditions for a bounded sequence of complex numbers to be a good weight for the Erdős discrepancy problem. In order to explain the exact assumptions needed, we use ergodic terminology that is explained in Section 3.2, and in Corollary 1.5 we give some explicit examples. See also Section 1.6 for our notation regarding averages; for reasons that are explained in Section 3.2 we use logarithmic averages.
Definition 1.3**.**
We say that the sequence
- •
has vanishing self-correlations, if for every we have
[TABLE]
- •
is non-null for logarithmic averages, or simply, non-null, if
[TABLE]
Our main result regarding structured (zero entropy) weights is the following one:
Theorem 1.4**.**
Suppose that is non-null and totally ergodic, and has zero entropy and vanishing self-correlations. Then is a good weight for the Erdős discrepancy problem.
Remarks**.**
- •
As was the case in [15], if is an arbitrary inner product space and is such that for all , then our argument works without any change and shows that
[TABLE]
- •
Using Theorem 1.9 below, it is straightforward to adapt the proof of Theorem 1.4 in order to get the following stronger conclusion: For , , where , , and is as before, we have for every sequence that
[TABLE]
But our methods do not allow us to deal with the non-weighted version (where ) even when , .
- •
The zero entropy assumption cannot be removed. To see this, let and , , where is any multiplicative function that satisfies the Elliott conjecture, in which case has vanishing self-correlations and is totally ergodic (in fact Bernoulli). Also, the assumption that the self-correlations of vanish cannot be removed. To see this, let and , , where is irrational. On the other hand, it is not clear whether the assumption of total ergodicity can be removed.
Corollary 1.5**.**
Let be a sequence, be Riemann integrable with and , and let be a polynomial with degree at least and irrational leading coefficient. Then
[TABLE]
It follows that for and irrational, the sequence and the sequence that assigns values or according to whether is in the interval , , or , are good weights.
The proof of Theorem 1.4 has a few interesting features. Unlike the proof of Theorem 1.1 in [15], we are not using explicitly or implicitly results from [10, 11, 14] on averages of multiplicative functions in short intervals, and also we do not carry out a separate analysis in the case where the sequence is a pretentious multiplicative function. To compensate for this, our argument crucially uses the following ergodic result that was proved in [3] using a combination of ergodic theory and number theory tools developed in [2] and [16] (the notions involved are defined in Section 3):
Theorem 1.6** (F., Host [3]).**
All Furstenberg systems of a multiplicative function with values on are disjoint from all zero entropy totally ergodic systems.
To get a sense of why Theorem 1.6 is useful, we note that it implies (via Proposition 4.1 below) that if is a totally ergodic sequence with zero entropy and is a multiplicative function, then the self-correlations of the sequence split into a product of the self-correlations of and the self-correlations of . Hence, if we assume that has vanishing self-correlations, then the same holds for , and this property implies Theorem 1.4 (see Proposition 2.7).
Lastly, we give examples of good weights that are given by random sequences. The first result applies to independent symmetric random variables and its proof is rather elementary.
Theorem 1.7**.**
Let be a sequence of independent random variables with , , and be a non-null sequence. Then -almost surely the sequence is a good weight for the Erdős discrepancy problem.
The second result applies to independent random variables that are not necessarily symmetric as long as they take a fixed non-zero complex value not too rarely. Its proof, due to M. Kolountzakis, is simple, but makes essential use of Theorem 1.1 (via the criterion given in Lemma 5.5 below).
Theorem 1.8**.**
Let be a sequence of independent, complex valued, random variables. Suppose that for some the sequence , , is decreasing and satisfies for every . Then -almost surely the sequence is a good weight for the Erdős discrepancy problem.
Remark**.**
The assumption of monotonicity cannot be removed. To see this, take if is prime, and for all other , and let be a completely multiplicative function that is equal to on the -th prime. Then -almost surely we have \sup_{d,n\in{\mathbb{N}}}\big{|}\sum_{k=1}^{n}a(dk)\,X_{k}(\omega)\big{|}\leq 1.
If we take and decreasing such that and for , then Theorem 1.8 applies, and gives that the indicator functions of certain sparse random subsets of the integers are good weights for the Erdős discrepancy problem.
1.3 Results related to weighted sums of multiplicative functions
As was the case in the proof of Theorem 1.1 in [15], the unboundedness of weighted discrepancy sums for arbitrary unit modulus sequences follows from similar unboundedness properties of unit modulus completely multiplicative functions. We state next some related results that are of independent interest.
Theorem 1.9**.**
Let be a non-null multiplicative function and be non-null, totally ergodic, with zero entropy, and vanishing self-correlations. Then
[TABLE]
In fact, the following stronger property holds: If is as before, are multiplicative functions, and are such that the sequence is non-null, then we have
[TABLE]
Remark**.**
Note that for although (3) holds for all completely multiplicative functions with values on , it fails for some non-null multiplicative functions with values on . For instance it fails for , , and for all non-trivial Dirichlet characters.
Regarding the non-weighted version of (4), not much is known for . For instance, it is not known whether for every completely multiplicative function we have
[TABLE]
This problem was raised by J. Teräväinen and A. Klurman, who remarked that it is not even clear how to prove that
[TABLE]
where is the Liouville function. On the other hand, it is an immediate consequence of the next corollary, that if is a multiplicative function, , and is irrational, then we have
[TABLE]
Corollary 1.10**.**
Let be a Riemann integrable function with and , and be a polynomial with degree at least and irrational leading coefficient. Then for all multiplicative functions and such that the sequence is non-null, we have
[TABLE]
Regarding weights given by random sequences, we have the following result:
Theorem 1.11**.**
Let be a sequence of independent random variables with , . Then -almost surely the following holds: For every , all multiplicative functions , and such that the sequence is non-null, we have
[TABLE]
Remarks**.**
It is not hard to show that for any fixed collection of arbitrary sequences , we have that (5) holds -almost surely. So the important point in Theorem 1.11 is that the set of ’s for which the conclusion holds is independent of the (uncountably many) multiplicative functions .
For , Theorem 1.7 gives better results that apply to not necessarily symmetric random variables. But for the method of proof of Theorem 1.7 fails to give (5) (since the relevant unweighted result is not known).
Theorem 1.11 is based on Theorem 5.3 below, which is proved by combining some simple counting arguments and concentration of measure estimates for sums of independent random variables.
1.4 Proof strategy
Let us first recall the proof strategy of Theorem 1.1 given in [15]. An immediate consequence of Theorem 1.1 is that for every completely multiplicative function we have
[TABLE]
It turns out that a variant of this special case (see Proposition 2.5 below for ) is the key ingredient in the proof of Theorem 1.1. The proof of (6) given in [15] proceeds by considering separately the case where is structured (“pretentious”) and random (“non-pretentious”). The latter case can be treated (as in Proposition 2.6 below) using the identities
[TABLE]
which hold for random-like (“non-pretentious”) multiplicative functions.
Likewise, our arguments rely on weighted variants of (6) and (7) that are of independent interest. For instance, we prove that if and is irrational, then for every multiplicative function we have
[TABLE]
and we also prove stronger results involving weighted sums of products of shifts of several multiplicative functions. To prove (8) we rely on one of the main results in [3], which implies that for every and irrational we have
[TABLE]
The fact that (9) holds for every multiplicative function (which is not true for (7)) simplifies the proof of (8), versus the argument given for the proof of (6) in [15], and ultimately of the fact that is a good weight. One reason is that we do not have to carry out a separate analysis in the case where is structured (“pretentious”), as was the case in [15].
The proofs of the results concerning random weights are simpler. Theorem 1.7 is based on a variant of (9) that uses random weights and is proved in Theorem 5.3 via elementary techniques. Theorem 1.8 is deduced from Theorem 1.1 using an elementary argument given in Section 5.2.
1.5 Some open problems
A possible strengthening of Theorem 1.1 is given in the following problem (for and the problem was previously proposed by J. Teräväinen and A. Klurman at the December 2018 workshop of the American Institute of Mathematics “Sarnak’s Conjecture”):
Problem 1**.**
Is it true that for every we have
[TABLE]
when , , or when , , with irrational?
When the problem is open even when , where is a completely multiplicative function (see remarks on Section 1.3). More generally, one can ask whether for the previous choices of the sequence , for every and all we have
[TABLE]
Corollary 1.10 shows that the answer is yes when are multiplicative functions with values on and is the sequence with irrational. But unlike the previous discrepancy statements, we do not have a way to reduce Problem 1 to one about weighted sums of multiplicative functions. Any such reduction probably depends upon obtaining an integral representation result, analogous to Proposition 2.4 below, for sequences of the form , , where is a multiplicative Følner sequence (see Section 2.1) along which all previous averages exist. Note though that more complicated “higher order multiplicative functions” arise this way, for instance, if is defined by , where is the unique factorization of , and , then
[TABLE]
for every .
On a different direction, it seems likely that the zero integral condition in Corollary 1.5 can be removed. Proving this would probably necessitate to combine arguments of this article with a detailed analysis of the pretentious case (similar to the one in [15]), and it is not clear how to do this.
Problem 2**.**
Is it true that Corollary 1.5 holds even if we do not assume that ?
Let us say that a subset of is good for the Erdős discrepancy problem, or simply, good, if the indicator function is a good weight for the Erdős discrepancy problem. By taking the sequence in (2) to be an appropriate multiplicative function one easily verifies that the sets for , , and , where is the -th prime, are bad. On the other hand, it is easy to deduce form Theorem 1.1 that the sets for and for , are good. But it is not at all clear whether certain simple sets that lack multiplicative structure are good.
Problem 3**.**
Are the sets , , , or for not an integer, good for the Erdős discrepancy problem?
Theorem 1.8 implies that random subsets of the integers with positive density, and certain sparse random subsets with density roughly in , are almost surely good. But how about sparser random subsets?
Problem 4**.**
Let and be a sequence of independent random variables with , , . Is it true that -almost surely the sequence is a good weight for the Erdős discrepancy problem?
1.6 Notation
With we denote the complex unit disc and with we denote the complex unit circle . With we denote the -dimensional torus that we identify with . With we denote the positive integers and with the non-negative integers. For we let . For we also let .
If is a non-empty finite subset of we let
[TABLE]
If is an infinite subset of we let
[TABLE]
whenever the limits exist.
With we denote a sequence of intervals with . We let
[TABLE]
whenever the limits exist. Using partial summation one sees that if , then also (but the converse does not hold in general).
2 Reduction to statements about multiplicative functions
2.1 Multiplicative averages
We denote by the multiplicative group of positive rationals.
Definition 2.1**.**
We say that is a multiplicative Følner sequence, if is a finite subset of for every , and for every we have
[TABLE]
where .
An example of a multiplicative Følner sequence is given by
[TABLE]
where denotes the sequence of primes.
Definition 2.2**.**
If is a multiplicative Følner sequence and is such that the average below exists, we define the multiplicative average of the sequence along by
[TABLE]
Note that property (10) implies the following dilation invariance property of the multiplicative averages: For every , multiplicative Følner sequence , and , we have
[TABLE]
2.2 Reduction to multiplicative functions via Bochner’s theorem
A variant of the next lemma was proved in [15, Section 2] using Fourier analysis on an appropriate finite Abelian group (of the form for large ) and a compactness argument. We use a somewhat different approach (also used in [1, Section 10.2]) that invokes Bochner’s theorem on positive definite functions. We first introduce some notation.
Definition 2.3**.**
With we denote the set of all completely multiplicative functions .
Endowed with pointwise multiplication and the topology of pointwise convergence, the set is a compact (metrizable) Abelian group.
Proposition 2.4**.**
Let be defined by
[TABLE]
where is a bounded sequence and is a multiplicative Følner sequence such that all the averages above exist. Then there exists a (positive) measure on the space , with total mass equal to , such that
[TABLE]
Proof.
We first extend the sequence to the positive rationals by letting for . We define as follows
[TABLE]
Using the dilation invariance property (11) and our assumption that the averages defining the sequence exist, we deduce that the averages below exist and we have
[TABLE]
We are going to use this identity in order to verify that is a positive definite sequence on with pointwise multiplication. Indeed, for all and , we have
[TABLE]
Note that the dual group of consists of the completely multiplicative functions on with unit modulus, and any such satisfies , , for some completely multiplicative function . A well known theorem of Bochner gives that there exists a (positive) Borel measure on the space such that
[TABLE]
The total mass of is . Lastly, we have
[TABLE]
and the proof is complete. ∎
Using the previous representation theorem we get the following criterion:
Proposition 2.5**.**
Let be such that for every probability measure on the space we have
[TABLE]
Then is a good weight for the Erdős discrepancy problem.
Proof.
Arguing by contradiction, suppose that is not a good weight for the Erdős discrepancy problem. Then there exists a sequence such that
[TABLE]
We average with respect to over a multiplicative Følner sequence of intervals , chosen so that all relevant averages below exist (such a sequence can always be found using a diagonalisation argument), and deduce that
[TABLE]
Expanding the square we get that the expression in (12) is equal to
[TABLE]
where
[TABLE]
By Lemma 2.4, there exists a (positive) measure on the space , with total mass , such that
[TABLE]
We deduce that the expression (13), and hence the expression in (12), is equal to
[TABLE]
Hence,
[TABLE]
This contradicts our assumption and completes the proof. ∎
2.3 Reduction to correlation estimates
As was the case in [15], a key step in the proof of our main results is an elementary observation that allows to deduce unboundedness of partial sums from vanishing of self-correlations (which are defined using logarithmic averages because of reasons explained in the next section).
Proposition 2.6**.**
Let be a non-null sequence such that for every we have
[TABLE]
Then
[TABLE]
Proof.
Arguing by contradiction, suppose that the conclusion fails. Then there exists such that
[TABLE]
Using this, we can find a sequence of intervals , with , such that all averages written below exist and for every we have
[TABLE]
Since the sequence is non-null, we have
[TABLE]
Next, notice that
[TABLE]
since by our assumption for and we also used twice that the logarithmic averages of a bounded sequence are translation invariant. From the above we deduce that and we get a contradiction by choosing .
∎
Proposition 2.7**.**
Let be a non-null sequence such that for every multiplicative function and every we have
[TABLE]
Then is a good weight for the Erdős discrepancy problem.
Proof.
Arguing by contradiction, suppose that the conclusion fails. Then by Proposition 2.5 there exist a sequence , a probability measure on the space , and , such that
[TABLE]
Using this and a diagonalization argument, we can find a sequence of intervals , with , such that and all averages written below exist and for every we have
[TABLE]
We let
[TABLE]
where the positiveness follows since the sequence is non-null by our assumption. Next, notice that
[TABLE]
since by our assumption for . Since is a probability measure, we deduce using the bounded convergence theorem that
[TABLE]
Combining (14) and (15) we deduce that and we get a contradiction by choosing . ∎
3 Notions and results from ergodic theory
The proof of our main results regarding structured (zero entropy) sequences depend on some notions and results in ergodic theory that we describe next. The material in this section is not needed for the results concerning random weights.
3.1 Measure preserving systems
A measure preserving system, or simply a system, is a quadruple where is a probability space and is an invertible, measurable, measure preserving transformation. We typically omit the -algebra and write . Throughout, for we denote by the composition ( times) and let and . Also, for and we denote by the function .
We say that the system is ergodic if the only functions that satisfy are the constant ones. It is totally ergodic if is ergodic for every .
3.2 Furstenberg systems
For readers convenience, we reproduce here some ergodic notions and constructions that can also be found in [2, 3]. For the purposes of this article, all averages in the definitions below are taken to be logarithmic. The reason is that we later on invoke results from ergodic theory, like Theorem 3.7 below, that are only known when the joint Furstenberg systems are defined using logarithmic averages. This limitation comes from the number theoretic input used in the proof of Theorem 3.7, in particular, the identities in [3, Theorem 3.1].
Definition 3.1**.**
Let be a sequence of intervals with . We say that a finite collection of bounded sequences admits log-correlations on , if the limits
[TABLE]
exist for all , all , and all .
For every finite collection of sequences that admits log-correlations on a given sequence of intervals, we use a variant of the correspondence principle of Furstenberg [5, 6] in order to associate a measure preserving system that captures the statistical properties of these sequences.
Definition 3.2**.**
Let be sequences that admit log-correlations on the sequence of intervals . We let , , be the shift transformation on , and be the weak-star limit of the sequence of measures where is thought of as an element of . We call the joint Furstenberg system associated with (, ).
Remark**.**
If we are given sequences that are defined on , we extend them to in an arbitrary way. It is easy to check that the measure will not depend on the extension.
Note that a collection of sequences may have several non-isomorphic joint Furstenberg systems depending on which sequence of intervals we use in the evaluation of their joint correlations. For convenience of exposition, we sometimes associate a property of ergodic nature with a given finite collection of sequences if all joint Furstenberg systems of the collection have this property. In particular, we often use the following terminology:
Definition 3.3**.**
We say that a sequence is totally ergodic and/or has zero entropy, if all its Furstenberg systems are totally ergodic and/or have zero entropy.
Remark**.**
In [8], a zero entropy sequence is called completely deterministic.
Examples of zero entropy sequences include the sequences where and ; these sequences are also totally ergodic if is irrational (see Proposition 4.2 below).
3.3 Disjointness properties
We will use the following notion that was introduced by Furstenberg in [4]:
Definition 3.4**.**
We say that two systems and are disjoint, if the only invariant measure on the product space , with first and second marginals the measures and respectively, is the product measure .
The notion of disjointness in ergodic theory naturally introduces the following notion of statistical disjointness of two finite collections of bounded sequences.
Definition 3.5**.**
We say that two finite collections and of sequences with values on , are statistically disjoint, if all the joint Furstenberg systems of the collection are (measure-theoretically) disjoint form all the joint Furstenberg systems of the collection .
The next result shows that if two collections of sequences are statistically disjoint, then all their joint correlations decouple into products of joint correlations of and joint correlations of .
Proposition 3.6**.**
Let and be two collections of sequences with values on that are statistically disjoint.Then
[TABLE]
for all choices , , , where and , are arbitrary.
Proof.
Arguing by contradiction, suppose that the conclusion fails. Then there exists a sequence of intervals , with , on which the family admits log-correlations and we have
[TABLE]
for some choice of , , , where and , . Let and be the joint Furstenberg systems associated with (, ) and (, ) respectively.
We let and . After passing to a subsequence of (which for simplicity we denote again by ), we can assume that the weak-star limit
[TABLE]
exists and defines a invariant measure on . The projection of on is the weak-star limit , which is the measure . Likewise, the projection of on is the measure . Since the families and are statistically disjoint, the systems and are disjoint, hence
[TABLE]
Now for we let
[TABLE]
Likewise, for we let
[TABLE]
With the above notation, we define the function , , where for if or for some we set to be or respectively. Likewise, we define the function , . Then using (16) and the definition of the measures and the measure given by (17), we get that
[TABLE]
This contradicts (18) and completes the proof. ∎
The next result follows by combining the structural result of [3, Theorem 1.5] with the disjointness statement of [2, Proposition 3.12].
Theorem 3.7** (F., Host [2, 3]).**
All joint Furstenberg systems of any collection of multiplicative functions with values on are disjoint from all zero entropy totally ergodic systems.
Restating Theorem 3.7 using terminology introduced in the previous definitions we get the following result:
Theorem 3.8**.**
Every finite collection of multiplicative functions with values on is statistically disjoint from every totally ergodic sequence with zero entropy.
4 Proof of main results for structured weights
4.1 Proof of Theorems 1.4 and 1.9
First we show that the assumption of Proposition 2.6 is satisfied for various sequences of interest.
Proposition 4.1**.**
Suppose that is a totally ergodic sequence with zero entropy and vanishing self-correlations. Let also be multiplicative functions, , and , . Then for every we have
[TABLE]
Remark**.**
For the purpose of proving Theorem 1.4 we only need to consider the case where and is completely multiplicative of unit modulus. But this special case does not seem to offer significant simplifications.
Proof.
By Theorem 3.8, the collection of sequences and are statistically disjoint. By Proposition 3.6, we have that the difference between the average
[TABLE]
and the product of averages
[TABLE]
converges to zero as . Since by our assumption for every , the result follows. ∎
Proof of Theorems 1.4 and 1.9.
Theorem 1.4 follows immediately from Propositions 2.7 and 4.1 (for , ).
To prove Theorem 1.9, we note first that by Theorem 3.8, the collection of sequences and are statistically disjoint. Hence, Proposition 3.6 gives that the difference
[TABLE]
converges to [math] as . Using this and our assumption that the sequences and are non-null, we deduce that their product is also non-null. With this in mind, Theorem 1.9 follows from Propositions 2.6 and 4.1. ∎
4.2 Proof of Corollaries 1.5 and 1.10
We will need the following fact:
Proposition 4.2**.**
Let be a non-constant polynomial with irrational leading coefficient and let be Riemann integrable. Then the sequence has zero entropy, is totally ergodic, and has a unique Furstenberg system.
Remark**.**
In order to have total ergodicity it is essential that the leading coefficient of (and not just any non-constant coefficient) is irrational. For example, if ), , where is irrational, then it turns out that the sequence is not totally ergodic. We thank S. Pattison for pointing this out, see Sections 5.3 and 5.4 in [12] for a related discussion.
Proof.
Let . We start with the well known fact (see [6, Section 1.7] or [12, Section 4.4]) that there exists a unipotent affine transformation , with unique invariant measure the Haar measure , so that the system is totally ergodic (here we used that the leading coefficient of is irrational), a Riemann integrable function , and , such that
[TABLE]
(For instance, when , , we can take , , , and .) We let and be the shift transformation on . We define the map by
[TABLE]
Clearly we have . Next, let and . We define the function
[TABLE]
where we used the following conventions: for and we have and . Note that the linear span of all such functions forms a conjugation closed subalgebra of that separates points, hence it is dense in .
Next note that for we have
[TABLE]
where to justify the second identity we use (19), for the third we use the unique ergodicity of and the fact that is Riemann integrable for , and for the fourth we use (20). By linearity and density, it follows that the sequence of measures (and hence the sequence ) converges weak-star to a measure on , which is equal to the image of the measure under . From the above, we deduce that the sequence has a unique Furstenberg system, which is , and is a factor map from the system to the system . Since the system is totally ergodic and has zero entropy, the same holds for its factor . This completes the proof. ∎
Proof of Corollaries 1.5 and 1.10.
It suffices to verify that the sequence , , satisfies the assumptions of Theorem 1.4. Since has a non-constant coefficient irrational, the sequence is equidistributed in , which gives that , so is non-null. Moreover, it follows from Proposition 4.2 that has zero entropy and is totally ergodic. It remains to verify that it has vanishing self-correlations, meaning,
[TABLE]
for every . In fact, we establish a stronger property: If are Riemann integrable, then for every we have
[TABLE]
Using standard Weyl estimates this is easily shown to be the case when and for some (this is the only point where we use the assumption that has a non-linear coefficient irrational). Using linearity and uniform approximation by trigonometric polynomials, we deduce that (21) holds for all . Finally, we deduce that (21) holds for all Riemann integrable by approximating them in by continuous functions and using that the sequence is equidistributed in for every . This completes the proof. ∎
5 Proof of main results for random weights
5.1 Proof of Theorems 1.7 and 1.11
For , we denote by the indicator function of the set and let
[TABLE]
We also let be an -net of points in of minimal cardinality (thus ) and define
[TABLE]
We need two lemmas. The first is an approximation property.
Lemma 5.1**.**
Let be a multiplicative function. Then for every and , there exists such that
[TABLE]
Proof.
Since is an -net of , and an element of can take arbitrary prescribed values on prime powers, as long as these values are taken in , there exists such that and
[TABLE]
For , let , where , be the unique factorization of into prime powers . Using the multiplicativity of and , the estimate (22), and telescoping, we get
[TABLE]
This completes the proof. ∎
For and , we let
[TABLE]
The next lemma gives an upper bound on the elements of that suffices for our purposes.
Lemma 5.2**.**
Let and . Then for all large enough we have
[TABLE]
Proof.
Notice first that because of multiplicativity, an -tuple is uniquely determined by the values , where range over all prime powers in . Since for large enough there are at most prime powers up to and for , we deduce that
[TABLE]
The asserted bound follows since . ∎
Combining the previous two lemmas we can prove the following result, which is an essential ingredient of the proofs of Theorems 1.7 and 1.11.
Theorem 5.3**.**
Let be a sequence of independent random variables with , . Then for every we have that -almost surely the following holds: For every , all multiplicative functions , and all , we have
[TABLE]
Remarks**.**
As was the case with Theorem 1.11, the important point in this statement is that the set of ’s for which (24) holds can be chosen independently of the (uncountably many) multiplicative functions .
We note that for the previous result can also be proved using an orthogonality criterion by utilizing the fact that for every we have -almost surely for all . But this method does not seem to be of much help when and it is the case that is needed in the proof of Theorem 1.7.
Proof.
Since and take values on a countable set, it suffices to show that for all fixed , , and , the following statement holds -almost surely: For all multiplicative functions we have
[TABLE]
To prove this, we first note that using standard concentration of measure estimates (for example Bernstein’s exponential inequality) we have for every fixed sequence and every and that
[TABLE]
We let
[TABLE]
Using the notation introduced in (23), we get for every large enough that
[TABLE]
where the first estimate follows from the union bound and (25), and the second estimate follows from Lemma 5.2. Using the Borel-Cantelli lemma we deduce that -almost surely we have
[TABLE]
Using Lemma 5.1, the fact that , and telescoping, we deduce that -almost surely we have
[TABLE]
This completes the proof. ∎
Proof of Theorems 1.7 and 1.11.
Let and be as in Theorem 1.11. Note that -almost surely the sequence is non-null, since -almost surely , , and by assumption is non-null. Likewise, if is a non-null sequence and is a multiplicative function, then -almost surely is non-null.
Since all fixed parameters that appear below take values on a countable set, by Proposition 2.7 (for Theorem 1.7) and Proposition 2.6 (for Theorem 1.11) it suffices to show that for every fixed , all , and all , we have -almost surely the following (for Theorem 1.7 we only need to use the case , ): For all multiplicative functions we have
[TABLE]
(Note that then (26) also holds with in place of .) We partition the positive integers into the following two sets
[TABLE]
We let
[TABLE]
Note that for all . Moreover, for (and fixed ) the random variables are independent, and the same holds for the random variables for . For we consider independent random variables , , such that , , and for . For , we apply Theorem 5.3 for the random variables and (then , ), and deduce that -almost surely we have
[TABLE]
for . Adding the two identities we get (26). This completes the proof. ∎
5.2 Proof of Theorem 1.8
We will use the following finitistic strengthening of Theorem 1.1 that can be deduced from Theorem 1.1 using a compactness argument:
Theorem 5.4**.**
For every there exists such that for every sequence there exist with such that .
We deduce from this some necessary conditions for a sequence to be a good weight for the Erdős discrepancy problem.
Lemma 5.5**.**
Let be a sequence and . Suppose that for infinitely many there exists such that
[TABLE]
Then is a good weight for the Erdős discrepancy problem.
Remark**.**
The conclusion fails if we simply assume that is equal to a non-zero constant on a union of arbitrarily long intervals. To see this, let be a completely multiplicative function that is equal to on a sequence of intervals with lengths even numbers that increase to infinity (such a multiplicative function can be explicitly constructed). Let also be the indicator function of the union of this sequence of intervals. Then \sup_{d,n\in{\mathbb{N}}}\big{|}\sum_{k=1}^{n}a(dk)\,w(k)\big{|}\leq 1.
Proof.
Let be a sequence and . Let be so that Theorem 5.4 applies for this and (27) holds for some and . We use Theorem 5.4 for the sequence and we get that there exist , with , such that
[TABLE]
We let
[TABLE]
Note that
[TABLE]
Since , using the previous identity, and (27), (28), we deduce that
[TABLE]
Hence, either |S_{d}\big{(}r\frac{m!}{d}+n\big{)}|\geq\frac{C}{2} or |S_{d}\big{(}r\frac{m!}{d}\big{)}|\geq\frac{C}{2}. Since was arbitrary, we deduce that . This completes the proof. ∎
Proof of Theorem 1.8.
Let be such that for every , where , . Let . By Lemma 5.5, it suffices to show that -almost surely there exists such that
[TABLE]
One easily verifies that for any fixed the random variables , , , are independent. Hence,
[TABLE]
Since is decreasing, we have that for all . Moreover, since , using again that is decreasing, we get that . We deduce that
[TABLE]
Since the sets involved in the above probabilities are independent, the Borel-Cantelli theorem applies, and gives that -almost surely for infinitely many we have that for all . This completes the proof. ∎
Acknowledgments
I would like to thank M. Kolountzakis for providing the proof of Theorem 1.8 and other useful remarks, and S. Pattison for pointing out a correction in the statement of Proposition 4.2. I would also like to thank the American Institute of Mathematics (AIM) for its hospitality; part of this work was motivated by problems raised during the 2018 workshop “Sarnak’s Conjecture”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Frantzikinakis, B. Host. Higher order Fourier analysis of multiplicative functions and applications. J. Amer. Math. Soc. 30 (2017), 67–157.
- 2[2] N. Frantzikinakis, B. Host. The logarithmic Sarnak conjecture for ergodic weights. Ann. of Math. (2) 187 (2018), 869–931.
- 3[3] N. Frantzikinakis, B. Host. Furstenberg systems of bounded multiplicative functions and applications. To appear in Int. Math. Res. Not. , ar Xiv:1804.08556
- 4[4] H. Furstenberg. Disjointness in ergodic theory, minimal sets, and a problem in diophantine approximation. Math. Systems Theory 1 (1967), 1–49.
- 5[5] H. Furstenberg. Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J. Analyse Math. 31 (1977), 204–256.
- 6[6] H. Furstenberg. Recurrence in Ergodic Theory and Combinatorial Number Theory. Princeton University Press, Princeton 1981.
- 7[7] W. T. Gowers. Erdős and arithmetic progressions. Erdős Centeniall, Bolyai Society Mathematical Studies, 25, L. Lovasz, I. Z. Rusza, V. T. Sos eds., Springer 2013, 265–287.
- 8[8] T. Kamae. Subsequences of normal sequences. Israel J. Math. 16 (1973), 121–149.
