On consecutive values of random completely multiplicative functions
Joseph Najnudel

TL;DR
This paper investigates the independence and empirical distribution convergence of consecutive values of random completely multiplicative functions, revealing conditions under which these values behave independently and their empirical measures stabilize almost surely.
Contribution
It establishes independence of consecutive values for large indices under specific distributions and proves almost sure convergence of empirical measures for these functions.
Findings
Consecutive values become independent for large n under certain distributions.
Empirical measures of the functions converge almost surely as N increases.
Rate of convergence of the empirical measure is estimated.
Abstract
In this article, we study the behavior of consecutive values of random completely multiplicative functions whose values are i.i.d. at primes. We prove that for uniform on the unit circle, or uniform on the set of roots of unity of a given order, and for fixed , are independent if is large enough. Moreover, with the same assumption, we prove the almost sure convergence of the empirical measure when goes to infinity, with an estimate of the rate of convergence. At the end of the paper, we also show that for any probability distribution on the unit circle followed by , the empirical measure converges almost surely when .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On consecutive values of random completely multiplicative functions
Joseph Najnudel
Abstract.
In this article, we study the behavior of consecutive values of random completely multiplicative functions whose values are i.i.d. at primes. We prove that for uniform on the unit circle, or uniform on the set of roots of unity of a given order, and for fixed , are independent if is large enough. Moreover, with the same assumption, we prove the almost sure convergence of the empirical measure when goes to infinity, with an estimate of the rate of convergence. At the end of the paper, we also show that for any probability distribution on the unit circle followed by , the empirical measure converges almost surely when .
1. Introduction
Many arithmetic functions of interest are multiplicative, i.e. their value at is the product of their values at and , for all coprime integers . For example, it is the case for the Möbius function, which is defined by if is divisible by the square of at least one prime number, if is the product of an even number of distinct primes, and if is the product of an odd number of distinct primes. Similarly, Dirichlet characters are multiplicative, as well as the Liouville function, which is equal to on integers with prime factors, counted with multiplicity: these functions are even completely multiplicative, which means that their value at is the product of their values at and for all integers . The behavior of the Möbius and the Liouville functions is far from being known with complete accuracy, even if partial results have been proven. This difficulty can be encoded by the corresponding Dirichlet series, which involve the Riemann zeta function. For example, the partial sum, up to , of the Möbius function in known to be negligible with respect to , and it is conjectured to be negligible with respect to for all : the first statement can quite easily be proven to be equivalent to the prime number theorem, whereas the second is equivalent to the Riemann hypothesis.
It has been noticed that the same bound for all is obtained if we take the partial sums of i.i.d., bounded and centered random variables. This suggests the naive idea to compare the Möbius function on square-free integers with i.i.d. random variables on . However, a major difference between the two situations is that in the random case, we lose the multiplicativity of the function. A less naive randomized version of Möbius functions can be obtained as follows: one takes i.i.d. uniform random variables on on prime numbers, [math] on prime powers of order larger than or equal to , and one completes the definition by multiplicativity.
In [29], Wintner considers a completely multiplicative function with i.i.d. values at primes, uniform on (which corresponds to a randomized version of the Liouville function rather than the Möbius function), and proves that we have almost surely the same bound () for the partial sums, as for the sums of i.i.d. random variables, or for the partial sums of Möbius function if the Riemann hypothesis is true. The estimate in [29] has been refined by Halász in [9], and then by Lau, Tenenbaum and Wu in [17]. Some lower bounds can also be deduced from moment estimates by Harper [10]. In order to get more general results, it can be useful to consider complex-valued random multiplicative functions. For example, it has been proven by Bohr and Jessen [2] that for , the law of , for uniformly distributed on , tends to a limiting random variable when goes to infinity. This limiting random variable can be written as , when is a random completely multiplicative function such that are i.i.d. uniform on the unit circle, denoting, as in all the sequel of the present paper, the set of prime numbers. The fact that the series just above converges is a direct consequence (by partial summation) of the analog of the result of Wintner for the partial sums of : one can prove that almost surely, for .
This discussion shows that it is often much less difficult to prove accurate results for random multiplicative function than for the arithmetic functions which are usually considered. In some informal sense, the arithmetic difficulties are diluted into the randomization, which is much simpler to deal with.
In the present paper, we study another example of results which are stronger and less difficult to prove in the random setting than in the deterministic one. The example we detail in this article is motivated by the following question, initially posed in the deterministic setting: for , what can we say about the distribution of the -tuples , or , where and are the Möbius and the Liouville functions, varies from to , tends to infinity? This question is only very partially solved. One knows (it is essentially a consequence of the prime number theorem), that for , the proportion of integers such that is equal to or tends to . For the Möbius function, the limiting proportions are for or and for [math]. It has been proven by Hildebrand [15] that for , the eight possible values of appears infinitely often. This result has been improved by Matomäki, Radziwiłł and Tao [19], who prove that these eight values appear with a positive lower density: in other words, for all ,
[TABLE]
The similar result is proven for the nine possible values of . A conjecture by Chowla [3] states that for all , each possible pattern of appears with asymptotic density . This conjecture is still open, however, partial results have been recently proven, in particular in papers by Tao and Teräväinen ([24], [25], [26]).
In the present paper, we prove results similar to this conjecture for random completely multiplicative functions . The random functions we will consider take i.i.d. values on the unit circle on prime numbers. Their distribution is then entirely determined by the distribution of . The two particular cases we will study in the largest part of the paper are the following: is uniform on the unit circle , and is uniform on the set of -th roots of unity, for . In this case, we will show the following results: for all , and for all large enough depending on , the variables are independent, and exactly i.i.d. uniform on the unit circle if is uniform. Moreover, the empirical distribution
[TABLE]
tends almost surely to the uniform distribution on if is uniform on , and to the uniform distribution on if is uniform on . In particular, the analog of Chowla’s conjecture holds almost surely in the case where is uniform on . We have also an estimate on the speed of convergence of the empirical measure: in the case of the uniform distribution on , each of the possible patterns for almost surely occurs with a proportion for running between and , for all . We have a similar result in the uniform case, if the test functions we consider are sufficiently smooth. It would be interesting to have similar results when the distribution of on the unit circle is not specified. For , we are unfortunately not able to show similar results, but we nevertheless can prove that the empirical distribution of almost surely converges to a limiting distribution for any distribution of on the unit circle. We specify this distribution, which is always uniform on or uniform on for some , and in the latter case, we give an estimate of the rate of convergence. This rate corresponds to a negative power of , which is much slower than what we obtain when is uniform on .
The techniques we use in our proofs are elementary in general, mixing classical tools in probability theory and number theory. However, a part of our arguments need to use deep results on diophantine equations, in order to bound the number and the size of their solutions.
The sequel of the paper is organized as follows. In Sections 2 and 3, we study the law of for large depending on , first in the case where is uniform on , then in the case where is uniform on . In Section 4, we study the empirical measure of in the case of uniform on . In the proof of the convergence of this empirical measure, we need to estimate the second moment of sums of the form . The problem of estimating moments of order different from two for such sums is discussed in Section 5. The proof of convergence of empirical measure in the case of uniform variables on is given in Section 6. Finally, we consider the case of a general distribution for in Section 7.
2. Independence in the uniform case
In this section, we suppose that are i.i.d. uniform random variables on the unit circle. By convenience, we will extend our multiplicative function to positive rational numbers by setting : the result is independent of the choice of and , and we have for all rationals . Moreover, is uniform on the unit circle for all positive rational . In this section, we will show that for fixed , are independent if is sufficiently large. The following result gives a criterion for such independence:
Lemma 2.1**.**
For all , the variables are independent if and only if are linearly independent on .
Proof.
Since the variables are uniform on the unit circle, they are independent if and only if
[TABLE]
for all . This equality is equivalent to
[TABLE]
i.e.
[TABLE]
or
[TABLE]
∎
We then get the following result:
Proposition 2.2**.**
The variables are i.i.d. as soon as . In particular, for fixed, this property is true for all but finitely many .
Remark 2.3**.**
The same result is proven in [27], Theorem 3. (i), with an asymptotically better bound, namely where is a constant. However, their proof uses a deep result by Shorey [23] on linear forms in the logarithms of algebraic numbers, involving technical tools by Gelfond and Baker, whereas our proof is elementary. Moreover, the constant involved in [27] is not given, even if it is explicitly computable.
Proof.
Let us assume that we have a linear dependence (1) between : necessarily . Moreover, the integers for which cannot be divisible by a prime larger than : otherwise this factor remains in the product since none of the for can be divisible by , and then the product cannot be equal to . We can rewrite the dependence as follows:
[TABLE]
for a subset of and for . Let us assume that the cardinality is as small as possible. Taking the decomposition in prime factors, we get for all ,
[TABLE]
where denotes the exponent of in the prime factorization. If , , then we can write these equalities in a matricial way . The minimality of ensures that the matrix has rank . Moreover, since all the prime factors of are smaller than , all the rows of indexed by prime numbers larger than are identically zero, and then the rank of is at most , the number of primes smaller than or equal to . Moreover, we can extract a subset of of cardinality such that the restriction of to the rows with indices in is invertible. We have with obvious notation: , and then by Cramer’s rule, the entries of can be written as the quotients of determinants of matrices obtained from by replacing one column by , by the determinant of . All the entries involved in these matrices are -adic valuations of integers smaller than or equal to , so they are at most . By Hadamard inequality, the absolute value of the determinants are smaller than or equal to . Since , we deduce, after multiplying by , that there exists a linear dependence between involving only integers of absolute value at most : let us keep the notation of (1) for this dependence. Let be the smallest nonnegative integer such that : from the fact that the Vandermonde matrices are invertible, one deduces that . Using the fact that
[TABLE]
we deduce, by writing the dependence above:
[TABLE]
if and
[TABLE]
if . Since the first factor in the left-hand side of these inequalities is a non-zero integer, it is at least . From the bounds we have on the ’s, we deduce
[TABLE]
for and
[TABLE]
for . Hence
[TABLE]
if , which implies, since ,
[TABLE]
If , we deduce
[TABLE]
i.e.
[TABLE]
Now, one has obviously for all , and then for all integers , and more accurately, it is known that , which tends to at infinity by the prime number theorem, reaches its maximum at : this fact is in particular an immediate consequence of [22], Corollary 2, equation (3.7). Hence,
[TABLE]
where
[TABLE]
and then
[TABLE]
Let us assume that . The function is increasing for . Moreover, by studying the function for , we check that for all . Hence, since ,
[TABLE]
[TABLE]
which contradicts the previous inequality.
Hence,
[TABLE]
and this bound is of course also available for . ∎
This result implies that theoretically, for fixed , one can find all the values of such that are not independent by brute force computation. In practice, the bound we have obtained is far from optimal, and is too poor to be directly useable except for very small values of , for which a more careful reasoning can solve the problem directly. Here is an example for :
Proposition 2.4**.**
For , the variables are independent except if .
Proof.
If with integers not all equal to zero, then as soon as has a prime factor larger than or equal to : otherwise, this prime factor cannot be cancelled by the factors for . Hence, the values of such that have only prime factors and , and at most one of them has both factors since it should then be divisible by . Moreover, if , there can be at most one power of and one power of among . One deduces that dependence is only possible if among , there are three numbers, respectively of the form , for integers . The quotient between two of these integers is between and since we here assume . Hence, and then . Similarly, , which implies . The numbers and are then both divisible by ; since they differ by at most , . The numbers and are both divisible by , and then . Therefore, and . If , the only possible values of such that can be different from zero are and , which are multiplicatively independent. If , the only possible values are and , which are also independent, if , the values to consider are and . The only remaining values of are , which are exceptions since
[TABLE]
∎
The results above give an upper bound, for fixed , of the maximal value of such that are not independent. By considering two consecutive squares and their geometric mean, whose logarithms are linearly dependent, one deduces the lower bound for the maximal . As written in a note by Dubickas [4], this bound can be improved to a quantity equivalent to , by considering the identity:
[TABLE]
In [4], as an improvement of a result of [27], it is also shown that for all , the lower bound occurs for infinitely many values of .
A computer search gives, for between and , and , the following largest values for which we do not have independent variables: 1, 5, 7, 14, 23, 24, 47, 71, 71, 71, 239. For example, if and , the five integers have only the four prime factors , so we necessarily have a dependence, namely:
[TABLE]
It would remain to check if there are dependences for .
3. Independence in the case of roots of unity
We now suppose that are i.i.d., uniform on the set of -th roots of unity, being a fixed integer. If , we get symmetric Bernoulli random variables. For all integers , we will denote by the largest divisor of such that is a -th power. The analog of Lemma 2.1 in the present setting is the following:
Lemma 3.1**.**
For , the variables are all uniform on the set of -th roots of unity if and only if for all between and . They are independent if and only if the only -tuple , such that
[TABLE]
is .
Proof.
For any , , we have
[TABLE]
which is equal to if is divisible by for all , and to [math] otherwise. The condition giving is equivalent to the fact that is a multiple of , which is . Hence, is a uniform -th root of unity, which implies the first part of the proposition.
The variables are independent if and only if for all ,
[TABLE]
Since is a uniform -th root of unity, both sides of the equality depend only on the values of modulo for . This implies that we can assume, without loss of generality, that for all . If all the ’s are zero, both sides are obviously equal to . Otherwise, the right-hand side is equal to zero, and then we have independence if and only if it is also the case of the left-hand side, i.e. for all , ,
[TABLE]
which is true if and only if
[TABLE]
∎
We then have the following result, similar to Proposition 2.2:
Proposition 3.2**.**
For fixed , there exists an explicitely computable such that are independent as soon as .
The bound can be deduced from bounds on the solutions of certain diophantine equations which are available in the literature: we do not take care of its precise value, which is anyway far too large to be of any use if we want to find in practice the values of such that are not independent.
Proof.
For each value of such that are dependent, there exist , not all zero, such that
[TABLE]
There are finitely many choices, depending only on and , for the -tuples and , so it is sufficient to show that the values of corresponding to each choice of -tuples is bounded by an explicitely computable quantity. At least two of the ’s are non-zero: otherwise is divisible by for all , being the unique index such that , and then is divisible by : this contradicts the inequality .
On the other hand, if is a prime larger than , at most one of the terms is non-zero, and then all the terms are divisible by , since it is the case for their sum.
We deduce that is the product of a power of order and a number whose prime factors are all smaller than . Moreover, one can assume that is ”-th power free”, i.e. that all its -adic valuations are strictly smaller than . Hence there exist
[TABLE]
and an integer such that . The value of the exponents are fixed by the ’s, and at least two of them are strictly larger than , since at least two of the ’s are non-zero. Let us first assume that there exist distinct and such that and . One finds an explicitly computable bound on in this case as soon as we find an explicitly computable bound for the solutions of each diophantine equation in and :
[TABLE]
for each such that and , . These equations can be rewritten as: , where and
[TABLE]
This polynomial has all simple roots (the -th roots of ) and then at least two of them; it has at least three if since is supposed to be at least in this case. By a result of Baker [1], all the solutions are bounded by an explicitly computable quantity, which gives the desired result (the same result with an ineffective bound was already proven by Siegel).
In remains to deal with the case where for all such that . In this case, is even and is divisible by , which implies that when . By looking at the prime factors larger than , one deduces that for all such that , is a square times a product of distinct primes smaller than or equal to . If at least three of the ’s are non-zero, it then suffices to find an explicitly computable bound for the solutions of each system of diophantine equations:
[TABLE]
for squarefree, , . From these equations, we deduce, for :
[TABLE]
The four roots of the right-hand side are the square roots of and , which are all distinct since , , . Again by Baker’s result, one deduces that the solutions are explicitly bounded, which then gives an explicit bound for .
The remaining case is when exactly two of the ’s are non-zero, with , and then . The dependence modulo then means that is a square for distinct between and , which implies that and are both squares where . These squares have difference smaller than , which implies that they are smaller than . Moreover, divides , and then , which gives . ∎
Here, we explicitly solve a particular case:
Proposition 3.3**.**
For , are independent for all and not for .
Proof.
A dependence means that there exists a product of distinct non-square integers among which is a square. For a prime , at most one -adic valuation is non-zero, which implies that all the -adic valuations are even. Hence, the factors involved in the product are all squares multiplied by or . Since they differ by at most , they cannot be in the same of the three ”categories”, which implies, since the product is a square, that there exist three numbers, respectively of the form , , , in the interval between and . Now, Hajdu and Pintér [6] have determined all the triples of distinct integers in intervals of length at most 12 whose product is a square. For length , the only positive triple is , which implies that the only dependence in the present setting is . ∎
Remark 3.4**.**
The list given in [6] shows that for , there are dependences for quite large values of as soon as . For example, we have for and for .
4. Convergence of the empirical measure in the uniform case
In this section, are uniform on the unit circle, and is a fixed integer. For , we consider the empirical measure of the first -tuples:
[TABLE]
It is reasonable to expect that tends to the uniform distribution on , which is the common distribution of for all but finitely many values of . In order to prove this result, we will estimate the second moment of the Fourier transform of , given by
[TABLE]
Proposition 4.1**.**
Let be integers, not all equal to zero. Then, for all ,
[TABLE]
and there exists , independent of and , such that
[TABLE]
Moreover, under the same assumption,
[TABLE]
[TABLE]
Finally, for , one can take or equal to [math], and for , one can take if is proportional to and otherwise.
Proof.
We have, using the completely multiplicative extension of to all :
[TABLE]
and then the left-hand side is equal to the number of couples in such that
[TABLE]
The number of trivial solutions of this equation is equal to , which gives a lower bound on the second moment we have to estimate. On the other hand, the derivative of the rational fraction can be written as the product of , which is strictly positive on , by the polynomial
[TABLE]
The polyomial has degree at most and is non-zero, since and then is non-constant. We deduce that has at most zeros, and then on , is strictly monotonic on each of at most intervals of , whose bounds are [math], the positive zeros of and . Hence, for each choice of , there are at most values of satisfying (2), i.e. at most one in each interval, which gives the upper bound for the moment we are estimating.
Moreover, since is strictly monotonic on an interval of the form for some , we deduce that for any non-trivial solution of (2), the minimum of and is at most . Hence, there are finitely many possibilities for the common value of the two sides of (2), and for each of these values, at most possibilities for and for . Hence, for fixed , the total number of non-trivial solutions of (2) is finite, which gives the bound of the proposition.
The statement involving the empirical measure is deduced by taking and by dividing everything by .
The claim for is an immediate consequence of the following statement we will prove now: the only integers , , such that
[TABLE]
are , , proportional to , which corresponds to the equality:
[TABLE]
If have the same sign and are not all zero, is strictly monotonic in , and then we cannot get a solution of (3) with . By changing all the signs if necessary, we may assume that one of the integers is strictly negative and the others are nonnegative. For , the fraction obtained by writing can only be simplified by prime factors dividing two of the integers , and then only by a power of . If and then , the numerator and the denominator have different parity, and then the fraction is irreducible for all : we do not get any solution of (3) in this case. Otherwise, or is strictly negative. If solves (3), let us define and if , and and if . The denominators of the two fractions corresponding to the two sides of (3) are respectively a power of and the same power of : if (3) is satisfied, these denominators should differ only by a power of , since the fractions can be only simplified by such a power. Hence, for some , and by looking at the numerators of the fractions, we deduce that there exists such that
[TABLE]
If , the ratios and are at least since and , and then the ratio between the right-hand side and the left-hand side of the previous equality is at least , which gives
[TABLE]
On the other hand, the -adic valution of the right-hand side is since modulo 4, whereas the valuation of the left-hand side is at least , which gives
[TABLE]
We then get a contradiction for , except in the case , where we already know that there is no solution of (3). If , we get
[TABLE]
In this case, the prime factors of , which are odd (), should divide or , then or , and finally or . Hence, is a power of . Similarly, the odd factors of , and then of , should divide or , and then or : is the product of a power of and a power of . If we write , , we must have . If , we have . If , we get , and if , we get , which is impossible. If , is divisible by , and then is congruent to or modulo , which implies , and then . Now, by induction, one proves that the order of modulo is equal to (i.e. is a primitive root modulo the powers of ). This result is classical, and can be deduced, for example, from Rosen [21], Theorem 8.9. For sake of completeness, we give a proof here. The result is easy to check be direct computation for and . Let us assume that it is true for all values until . The order of modulo is a multiple of the order of modulo , and then a multiple of by assumption. On the other hand, it is a divisor of by Euler’s theorem. Hence, it is either or . Moreover, since is assumed to be the order of modulo but strictly smaller than the order of modulo , we have
[TABLE]
where is not divisible by . Raising to the cube, we deduce
[TABLE]
where
[TABLE]
is not divisible by (recall that here). Hence, the order of modulo is not : it can only be , which proves by induction that is a primitive root of for all . Now, in the present situation, the order of modulo , i.e. , should divide , since modulo , and then ( is not possible) which implies , i.e. .
If and , we get , , and then , . We should solve . Taking the -adic valuation gives , taking the -adic valuation gives , and then , which implies .
If and , we get , , , , which gives the equation . Taking the -adic valuation gives , taking the -adic valuation gives , and then should be proportional to : in this case, we get one of the solutions already mentioned.
If , should be or , and then , , , , , , . We have to solve . Taking the -adic valuation gives , taking the -adic valuation gives , and then .
∎
Corollary 4.2**.**
For all , converges in , and then in probability, to , i.e. to the corresponding Fourier coefficient of the uniform distribution on . In other words, converges weakly in probability to .
In this setting, we also have a strong law of large numbers, with an estimate of the rate of convergence, for sufficiently smooth test functions. Before stating the corresponding result, we will show the following lemma, which will be useful:
Lemma 4.3**.**
Let , , and let be a sequence of random variables such that and for all ,
[TABLE]
Then, almost surely, : more precisely, we have for ,
[TABLE]
where depends only on and .
Proof.
For , and , we have:
[TABLE]
Since , we deduce that the probability that
[TABLE]
for all is at least , where depends only on and , and then only on and . Now, if (4) occurs for all , if we take the binary expansion with , and if for all , then we get if , and
[TABLE]
if . Adding these inequalities from to , we deduce that , where is the number of ’s in the binary expansion of . Hence,
[TABLE]
where depends only on and (recall that ), and then only on and . We then have, for :
[TABLE]
which gives the desired result after replacing by . ∎
From this lemma, we deduce the following:
Proposition 4.4**.**
Almost surely, weakly converges to . More precisely, the following holds with probability one: for all , for all continuous functions from to such that
[TABLE]
* denoting any norm on , and for all ,*
[TABLE]
Remark 4.5**.**
By Cauchy-Schwarz inequality, we have
[TABLE]
which implies that the assumption on given in the proposition is satisfied for all in the Sobolev space as soon as .
Unfortunately, the proposition does not apply if is a product of indicators of arcs. The weak convergence implies that
[TABLE]
even in this case, but we don’t know at which rate this convergence occurs.
Proof.
From Proposition 4.1, and Lemma 4.3 applied to , and
[TABLE]
we get, for all , ,
[TABLE]
For fixed , we apply this estimate to and get
[TABLE]
Since , we deduce, by Borel-Cantelli lemma, that almost surely,
[TABLE]
for all but finitely many . Therefore, almost surely,
[TABLE]
i.e.
[TABLE]
for , . Almost surely, this estimates simultaneously occurs for all rationals and (with a random implicit constant in , depending on and ) and then for all reals and .
Let us now assume that this almost sure property holds, let us fix , , and let be a function satisfying the assumptions of the proposition. Since the Fourier coefficients of are summable (i.e. is in the Wiener algebra of ), the corresponding Fourier series converges uniformly to a function which is necessarily equal to , since it has the same Fourier coefficients. We can then write:
[TABLE]
which implies
[TABLE]
By assumption, the last sum is dominated by
[TABLE]
which is finite by the assumptions made in the proposition, and then .
∎
5. Moments of order different from two
Since we have a law of large numbers on , with rate of decay of order , it is natural to look if we have a central limit theorem. In order to do that, a possibility consists in studying moments of sums in of products of variables from to . For the sums , we do not have convergence to a non-zero Gaussian random variable after normalization by . Indeed, the second moment of the absolute value of the renormalized sum is equal to , so if this variable converges to a non-zero complex Gaussian variable, we need to have the convergence of
[TABLE]
towards a non-zero constant. In [12], Harper, Nikeghbali and Radziwiłł prove that the quantity just above decays at most like when goes to infinity, whereas a conjecture by Helson [14] states that it tends to [math]. The order of magnitude of the left-hand side has later been found by Harper in [10]: it is , which in particular proves Helson’s conjecture.
On the other hand, an equivalent of the moments of of even integer order are computed in [12] and [13], and they are not bounded with respect to : the moment of order is equivalent to an explicit constant times . The order of magnitude of the moments of any positive order, not necessarily integer, is given by Harper in [10] and [11].
In the case of sums different from , the moment computations involve arithmetic problems of different nature: here, we look in some detail the case of the sum . In this case, the fact that consecutive, and then necessarily coprime integers are involved gives more independence than when we study the sum . In particular, it seems reasonable to expect that satisfies the same central limit theorem as the sum of i.i.d. uniform variables on the unit circle, and that this fact can be proven by moment computations. The convergence of the second moment is obvious, and we will now show that the convergence of the fourth moment also occurs. We start with the following result:
Proposition 5.1**.**
We have
[TABLE]
where (resp. ) is the number of solutions of the diophantine equation such that the integers satisfy (resp. ). Moreover, for all , there exists , independent of , such that for all ,
[TABLE]
Hence,
[TABLE]
Proof.
Expanding the fourth moment, we immediately obtain that it is equal to the total number of solutions of the previous diophantine equation, with . One has trivial solutions: for which , for which , for which . It remains to count the number of non-trivial solutions. Such a solution has a minimal element among . This element is unique: if two minimal elements are on the same side, then necessarily , if two minimal elements are on different sides, then the other elements should be equal, which also gives a trivial solution. Dividing the number of solutions by four, we can assume that is the unique smallest integer, which implies that is the largest one. For , we get solutions, and for , we get solutions, the factor coming from the possible exchange between and .
The lower bound comes from the solutions and for , and from the solutions of the form for .
Let us now prove the upper bound. We start by slightly simplifying the equation by introducing the odd integers , , , , which should satisfy:
[TABLE]
If are large, then and should be odd and close to each other. It is then quite natural to introduce
[TABLE]
which is expected to be small with respect to . More precisely, since and are closer to each other than and , we need
[TABLE]
and then , since
[TABLE]
The last equality, gives, after factorizing the left-hand side and replacing by :
[TABLE]
and in particular
[TABLE]
If we neglect the term , expected to be small with respect to , we get the positivity of a quadratic form in and , which gives a restriction on the possible values of the ratio . More precisely, if we assume , we deduce
[TABLE]
and then
[TABLE]
since it is an odd integer, and then , which gives a contradiction. Any solution should then satisfy . We now discuss in function of the value of . For , we have necessarily , and then , and then there are only possibilities for the couple . Now, and should be divisors of , and by the classical divisor bound, we deduce that there are possibilities for when and are chosen. Hence, the number of solutions for is bounded by the estimate we have claimed.
It remains to bound the number of solutions for : we will get a bound for the number of solutions for each value of , which will be multiplied by at the end. Each solution should satisfy
[TABLE]
i.e. by writing the quadratic form in and as a difference of squares:
[TABLE]
We know that , and then , which gives, for each value of , possibilities for . For the moment, let us admit that for each of these possibilities, there are choices for and . Then, for fixed , we have choices for . For each choice, are fixed, and then also , and finally and . Hence, we have solutions for each , and then solutions by counting all the possible .
The claim we have admitted is a consequence of the following fact we will prove now: for , the number of representations of in integers by the quadratic form is , uniformly in the strictly positive integer . Indeed, for such a representation, the ideal should be a divisor of in the ring of integers of , and each such ideal gives at most couples representing . Indeed, the group of invertible elements in has order at most . This fact is classical (see for example Jarvis [16], Chapter 6), and can be proven as follows: if is invertible in for , then is also invertible in , and
[TABLE]
is an integer since it is in , whereas
[TABLE]
is an invertible integer, necessarily equal to . Hence, is a complex number of modulus , with real part equal to or , i.e. a fourth root or a sixth root of unity. We now only need to bound the number of divisors of in by , uniformly in . The number of divisors of is , where we have the prime ideal decomposition
[TABLE]
Now, by considering the decomposition of prime numbers as products of ideals, we deduce:
[TABLE]
denoting an ideal of norm , and then the number of divisors of is
[TABLE]
where is the number of divisors, in the usual sense, of the integer . This gives the desired bound . ∎
Remark 5.2**.**
Using the previous proof, one can show the following quite curious property: all the solutions of in integers satisfy . Indeed, let us assume the contrary. With the previous notation, , and then , which gives , i.e.
[TABLE]
[TABLE]
a contradiction since implies . The bound is sharp, since we have the solutions of the form , where
[TABLE]
A consequence of the previous proposition corresponds to a bound on all the moments of order [math] to :
Corollary 5.3**.**
We have, for all ,
[TABLE]
where and .
Proof.
Hölder inequality implies that the logarithm of the -th moment of a nonnegative random variable is a convex function of . Now, we have proven that this logarithm is equal to [math] for and and to for . The corollary can now be deduced from the following fact, easy to check: if is a convex fonction from to such that , , then
[TABLE]
and
[TABLE]
∎
We have proven that the fourth moment of converges to , which is also the limit of the fourth moment of where are i.i.d. random variables, uniform on the unit circle. Unfortunately, we are not able to prove a similar convergence for higher moments, and then we do not know how to prove a central limit theorem. However, the following result holds:
Proposition 5.4**.**
If for all integers , the number of non-trivial solutions of the diophantine equation
[TABLE]
is negligible with respect to the number of trivial solutions when (i.e. ) then we have
[TABLE]
where denotes a standard Gaussian complex variable, i.e. where are independent standard real Gaussian variables.
Proof.
If
[TABLE]
then for integers , the moment is equal to times the number of solutions of
[TABLE]
If , there are at most choices for , and once these integers are fixed, at most choices for by the divisor bound. Hence, the moment tends to zero when , and we have the same conclusion for . Finally, if , by assumption, the moment is equivalent to times the number of trivial solutions of the corresponding diophantine equation, i.e. to the corresponding moment for the sum of i.i.d. variables, uniform on the unit circle. By the central limit theorem,
[TABLE]
We have then proven that for all integers ,
[TABLE]
which gives the claim. ∎
We have proven the assumption of the previous proposition for , however, our method does not generalize to larger values of . The divisor bound gives immediately a domination by for the number of solutions, and then it seems reasonable to expect that the arithmetic constraints implied by the equation are sufficient to save at least a small power of . Note that the situation is different for the sum : for example, for , the number of non-trivial solutions of the equation for is not , as we can see by considering the equalities for and odd, .
The previous proposition giving a ”conditional CLT” can be generalized to the sums of the form
[TABLE]
when the ’s have the same sign. The situation is more difficult if the ’s have different signs since the divisor bound alone does not directly give a useful bound on the number of solutions.
6. Convergence of the empirical measure in the case of roots of unity
Here, we suppose that are i.i.d. uniform on the set of -th roots of unity, being fixed. With the notation of the previous section, we now get:
Proposition 6.1**.**
Let be integers, not all divisible by , let and let . Then,
[TABLE]
and
[TABLE]
where depends only on .
Proof.
We can obviously assume that are between [math] and , which gives finitely many possibilities for these integers, depending only on and . We can then suppose that are fixed at the beginning. We have to upper bound the number of couples on such that
[TABLE]
where, in this proof, denotes the set of -th powers of positive rational numbers. Now, any positive integer can be decomposed as a product of a ”smooth” integer whose prime factors are all strictly smaller than , and a ”rough” integer whose prime factors are all larger than or equal to . If the ”rough” integer is denoted , the condition just above implies:
[TABLE]
Now, the numerator and the denominator of this expression can both be written in a unique way as a product of a -th perfect power and an integer whose -adic valuation is between [math] and for all . If the quotient is a -th power, necessarily the numerator and the denominator have the same ”-th power free” part. Hence, there exists a -th power free integer such that
[TABLE]
being the set of -th powers of positive integers. Hence, the number of couples we have to estimate is bounded by
[TABLE]
where is the number of integers such that
[TABLE]
If a prime number divides and for , it divides , and then . Hence, the rough parts of are pairwise coprime. Now, if are the -th power free integers such that , we have . Now, are coprime, and then is -th power free, which implies . Hence
[TABLE]
Let us now fix an index such that is not multiple of . We have
[TABLE]
where denotes the product of the distinct prime factors of . The condition on means that for all , ,
[TABLE]
i.e. is divisible by and
[TABLE]
where . Since is coprime with , the last congruence is equivalent to a congruence modulo between and a fixed integer, which is not divisible by if and only if divides . We deduce that the condition on implies that , i.e.
[TABLE]
where is a -th power free integer whose prime factors are strictly smaller than , is an integer and is an integer depending only on , and , which is divisible by . For a fixed value of , the values of should be in the interval
[TABLE]
whose size is at most
[TABLE]
by the concavity of the power , the fact that since is not divisible by , which implies . Now, the conditions on for imply a condition of congruence for , modulo all the primes dividing one of the ’s for . These primes do not divide , since has all prime factors smaller than , and divides . They also do not divide , since this integer has the same prime factors as , which is prime with . Hence, we get a condition of congruence for modulo all primes dividing for some . For each of these primes, this gives at most congruence classes for , and then, by the chinese reminder theorem, we get at most classes modulo , where denotes the number of prime factors of an integer. The number of integers satisfying the congruence conditions is then at most:
[TABLE]
where denotes the number of divisors of . Now, has prime factors smaller than and -adic valuations smaller than , which certainly gives . Hence, by considering all the possible values of , and all the possible , which should divide , we deduce
[TABLE]
If , we have necessarily
[TABLE]
Using the divisor bound, we deduce that for all , there exists such that for all ,
[TABLE]
and then
[TABLE]
i.e.
[TABLE]
which implies
[TABLE]
since the right-hand side is nonnegative. Summing the square of this bound for all possible gives
[TABLE]
Now, since all numbers up to have prime factors smaller than this quantity, we deduce, using the multiplicativity of the radical:
[TABLE]
which, by Mertens’ theorem, is smaller than a constant, depending on and , times . We deduce that there exists a constant , such that
[TABLE]
Now, it is clear that
[TABLE]
since this sum counts all the integers from to , regrouped in function of the -th power free part of . Using the inequality , available for all , we deduce
[TABLE]
This result gives the first inequality of the proposition, for
[TABLE]
The second inequality is obtained by taking and dividing by . ∎
Corollary 6.2**.**
For all , converges in , and then in probability, to the corresponding Fourier coefficient of the uniform distribution on . In other words, converges weakly in probability to .
We also have a strong law of large numbers.
Proposition 6.3**.**
Almost surely, weakly converges to . More precisely, for all , the proportion of such that is almost surely for all .
Proof.
By Lemma 4.3 and Proposition 6.1, we deduce that almost surely, for all , , ,
[TABLE]
Since we have finitely many values of , we can take the uniform in . Then, by inverting discrete Fourier transform on , we deduce the claim. ∎
7. More general distributions on the unit circle
In this section, are i.i.d., with any distribution on the unit circle. We will study the empirical distribution of , but not of the patterns for . More precisely, the goal of the section is to prove a strong law of large numbers for when goes to infinity. We will use the following result, due to Halász, Montgomery and Tenenbaum (see [7], [8], [5], [20], [28] p. 343):
Proposition 7.1**.**
Let be a multiplicative function such that for all . For , we set
[TABLE]
Then:
[TABLE]
where is an absolute constant.
From this result, we show the following:
Proposition 7.2**.**
Let be a random multiplicative function such that are i.i.d., with , and . Then, almost surely, for all
[TABLE]
Proof.
First, we observe that for integers, ,
[TABLE]
where, by a classical refinement of the prime number theorem,
[TABLE]
for all . The bracket is dominated by , the second part of the last integral is dominated by
[TABLE]
and the error term of the first part is dominated by . Hence
[TABLE]
where
[TABLE]
Now, for all ,
[TABLE]
which gives
[TABLE]
Now, the integral of on is conditionally convergent: tends to when and the convergence of the integral at is easily deduced from an integration by parts. Hence, the integral of on any interval of is uniformly bounded, which implies
[TABLE]
We deduce
[TABLE]
Bounding the sum on primes smaller than by taking the absolute value, we get:
[TABLE]
and then by taking , for large enough depending on ,
[TABLE]
[TABLE]
and then by letting and using the symmetry of the imaginary part for ,
[TABLE]
for . This estimate can also be deduced from known bounds on the Riemann zeta function on the line . Now, for all whose real part is in , we have
[TABLE]
The first term is at least the sum of divided by , and then at least . The second term is by the previous discussion. Hence,
[TABLE]
Now, let , and . The variables are centered, independent, bounded by . By Hoeffding’s lemma (see, for example, Massart [18], p. 21), for all ,
[TABLE]
and then by independence,
[TABLE]
[TABLE]
Applying the same inequality to , we deduce
[TABLE]
[TABLE]
The derivative of the last sum in with respect to is dominated by
[TABLE]
and then the sum cannot vary more than when runs between two consecutive multiples of . Hence,
[TABLE]
If we define, for , as the integer part of , we deduce, by Borel-Cantelli lemma, that almost surely, for all but finitely many ,
[TABLE]
If this event occurs, we deduce, using (5),
[TABLE]
Then, by Proposition 7.1, we get
[TABLE]
Since , we deduce
[TABLE]
which gives the claimed result along the sequence . Now, if , we have, since all the ’s have modulus at most ,
[TABLE]
This allows to remove the restriction to the sequence . ∎
Using Fourier transform, we deduce a law of large numbers for the empirical measure , under the assumptions of this section.
Proposition 7.3**.**
If for all integers , , then almost surely, tends to the uniform measure on the unit circle.
Proof.
For all , takes its values on the unit circle, and it is not a.s. equal to . Applying the previous proposition to , we deduce that tends to zero almost surely, which gives the desired result. ∎
Proposition 7.4**.**
If for , almost surely, but for all strict divisors of , then almost surely, tends to the uniform measure on . More precisely, almost surely, for all , the proportion of such that is , as soon as
[TABLE]
this infimum being strictly positive.
Proof.
The infimum is strictly positive since by assumption, for all . Now, we apply the previous result to for all , and we get the claim after doing a discrete Fourier inversion. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Baker: Bounds for solutions of hyperelliptic equations , Proc. Cambridge Phil. Soc. 65 (1969), 439–444.
- 2[2] H. Bohr and B. Jessen: Über die Wertverteilung der Riemannschen Zetafunktion, Erste Mitteilung , Acta Math. 54 (1930), 1–35, Zweite Mitteilung , Acta Math. 58 (1932), 1–55.
- 3[3] S. Chowla: The Riemann hypothesis and Hilbert’s tenth problem , Gordon and Breach, New York, 1965.
- 4[4] A. Dubickas: A note on the multiplicative dependence of consecutive integers , Scient. works of Lith. Math. Soc.: suppl. to ”Liet. Matem. Rink.”, Technika, Vilnius (1998), 21–23.
- 5[5] A. Granville, K. Soundararajan: Decay of Mean Values of Multiplicative Functions , Canad. J. Math. 55 (2003), 1191–1230.
- 6[6] L. Hajdu, A. Pintér: Square product of three integers in short intervals , Math. of Computation 68 (1999), 1299–1301.
- 7[7] G. Halász: Über die Mittelwerte multiplikativer zahlentheoretischer Funktionen , Acta Math. Acad. Sci. Hung. 19 (1968), 365–403.
- 8[8] G. Halász: On the distribution of additive and the mean values of multiplicative arithmetic functions , Studia Sci. Math. Hung. 6 (1971), 211–233.
