On consecutive values of random completely multiplicative functions

Joseph Najnudel

arXiv:1702.01470·math.PR·April 27, 2020

On consecutive values of random completely multiplicative functions

Joseph Najnudel

PDF

TL;DR

This paper investigates the independence and empirical distribution convergence of consecutive values of random completely multiplicative functions, revealing conditions under which these values behave independently and their empirical measures stabilize almost surely.

Contribution

It establishes independence of consecutive values for large indices under specific distributions and proves almost sure convergence of empirical measures for these functions.

Findings

01

Consecutive values become independent for large n under certain distributions.

02

Empirical measures of the functions converge almost surely as N increases.

03

Rate of convergence of the empirical measure is estimated.

Abstract

In this article, we study the behavior of consecutive values of random completely multiplicative functions $(X_{n})_{n \geq 1}$ whose values are i.i.d. at primes. We prove that for $X_{2}$ uniform on the unit circle, or uniform on the set of roots of unity of a given order, and for fixed $k \geq 1$ , $X_{n + 1}, \dots, X_{n + k}$ are independent if $n$ is large enough. Moreover, with the same assumption, we prove the almost sure convergence of the empirical measure $N^{- 1} \sum_{n = 1}^{N} δ_{(X_{n + 1}, \dots, X_{n + k})}$ when $N$ goes to infinity, with an estimate of the rate of convergence. At the end of the paper, we also show that for any probability distribution on the unit circle followed by $X_{2}$ , the empirical measure converges almost surely when $k = 1$ .

Equations341

N \to \infty lim in f \frac{1}{N} n = 1 \sum N \mathds 1_{λ (n + 1) = ϵ_{1}, λ (n + 2) = ϵ_{2}, λ (n + 3) = ϵ_{3}} > 0.

N \to \infty lim in f \frac{1}{N} n = 1 \sum N \mathds 1_{λ (n + 1) = ϵ_{1}, λ (n + 2) = ϵ_{2}, λ (n + 3) = ϵ_{3}} > 0.

\frac{1}{N} n = 1 \sum N δ_{(X_{n + 1}, \dots, X_{n + k})}

\frac{1}{N} n = 1 \sum N δ_{(X_{n + 1}, \dots, X_{n + k})}

E [X_{n + 1}^{m_{1}} \dots X_{n + k}^{m_{k}}] = 0

E [X_{n + 1}^{m_{1}} \dots X_{n + k}^{m_{k}}] = 0

E [X_{(n + 1)^{m_{1}} \dots (n + k)^{m_{k}}}] = 0,

E [X_{(n + 1)^{m_{1}} \dots (n + k)^{m_{k}}}] = 0,

(n + 1)^{m_{1}} \dots (n + 1)^{m_{k}} \neq = 1

(n + 1)^{m_{1}} \dots (n + 1)^{m_{k}} \neq = 1

m_{1} lo g (n + 1) + \dots + m_{k} lo g (n + k) \neq = 0.

m_{1} lo g (n + 1) + \dots + m_{k} lo g (n + k) \neq = 0.

lo g (n + j) = ℓ \in A \sum r_{ℓ} lo g (n + ℓ),

lo g (n + j) = ℓ \in A \sum r_{ℓ} lo g (n + ℓ),

v_{p} (n + j) = ℓ \in A \sum v_{p} (n + ℓ) r_{ℓ},

v_{p} (n + j) = ℓ \in A \sum v_{p} (n + ℓ) r_{ℓ},

lo g (n + j) - (lo g n + r = 1 \sum q (- 1)^{r - 1} \frac{j ^{r}}{r n ^{r}}) \leq \frac{j ^{q + 1}}{( q + 1 ) n ^{q + 1}},

lo g (n + j) - (lo g n + r = 1 \sum q (- 1)^{r - 1} \frac{j ^{r}}{r n ^{r}}) \leq \frac{j ^{q + 1}}{( q + 1 ) n ^{q + 1}},

j = 1 \sum k j^{q} m_{j} \frac{1}{q n ^{q}} \leq j = 1 \sum k \frac{∣ m _{j} ∣ j ^{q + 1}}{( q + 1 ) n ^{q + 1}}

j = 1 \sum k j^{q} m_{j} \frac{1}{q n ^{q}} \leq j = 1 \sum k \frac{∣ m _{j} ∣ j ^{q + 1}}{( q + 1 ) n ^{q + 1}}

j = 1 \sum k m_{j} lo g n \leq j = 1 \sum k \frac{∣ m _{j} ∣ j}{n}

j = 1 \sum k m_{j} lo g n \leq j = 1 \sum k \frac{∣ m _{j} ∣ j}{n}

\frac{1}{q n ^{q}} \leq \frac{D k ^{q + 2}}{( q + 1 ) n ^{q + 1}}

\frac{1}{q n ^{q}} \leq \frac{D k ^{q + 2}}{( q + 1 ) n ^{q + 1}}

lo g n \leq \frac{D k ^{2}}{n} .

lo g n \leq \frac{D k ^{2}}{n} .

1 \leq (\frac{q}{q + 1} \lor \frac{1}{lo g n}) \frac{D k ^{q + 2}}{n} \leq \frac{D k ^{q + 2}}{n}

1 \leq (\frac{q}{q + 1} \lor \frac{1}{lo g n}) \frac{D k ^{q + 2}}{n} \leq \frac{D k ^{q + 2}}{n}

n \leq D k^{k + 1} \leq [π (k) lo g (n + k) / lo g 2]^{π (k)} k^{k + 1} .

n \leq D k^{k + 1} \leq [π (k) lo g (n + k) / lo g 2]^{π (k)} k^{k + 1} .

2 n \leq 2 [π (k) lo g (2 n) / lo g 2]^{π (k)} k^{k + 1},

2 n \leq 2 [π (k) lo g (2 n) / lo g 2]^{π (k)} k^{k + 1},

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 [π (k) / lo g 2]^{π (k)} k^{k + 1} .

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 [π (k) / lo g 2]^{π (k)} k^{k + 1} .

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 (2 k)^{c k / l o g k} k^{k + 1}

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 (2 k)^{c k / l o g k} k^{k + 1}

c = \frac{1}{2} \frac{π ( 113 ) lo g 113}{113} \leq 0.63

c = \frac{1}{2} \frac{π ( 113 ) lo g 113}{113} \leq 0.63

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 (2^{0.63 k / l o g 2}) k^{0.63 k / l o g k} k^{k + 1} \leq 2 e^{1.26 k} k^{k + 1} \leq (e^{1.26} k)^{k + 1} \leq (3.6 k)^{k + 1} .

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \leq 2 (2^{0.63 k / l o g 2}) k^{0.63 k / l o g k} k^{k + 1} \leq 2 e^{1.26 k} k^{k + 1} \leq (e^{1.26} k)^{k + 1} \leq (3.6 k)^{k + 1} .

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \geq \frac{( 100 k ) ^{k + 1}}{(( k + 1 ) lo g ( 100 k ) ) ^{π (k)}} \geq \frac{( 100 k ) ^{k + 1}}{( k + 1 ) ^{2.52 π (k + 1)}}

\frac{2 n}{[ lo g ( 2 n ) ] ^{π (k)}} \geq \frac{( 100 k ) ^{k + 1}}{(( k + 1 ) lo g ( 100 k ) ) ^{π (k)}} \geq \frac{( 100 k ) ^{k + 1}}{( k + 1 ) ^{2.52 π (k + 1)}}

\geq \frac{( 100 k ) ^{k + 1}}{( k + 1 ) ^{(2.52) (1.26) (k + 1) / l o g (k + 1)}} \geq (100 k e^{- 3.18})^{k + 1} \geq (4 k)^{k + 1},

\geq \frac{( 100 k ) ^{k + 1}}{( k + 1 ) ^{(2.52) (1.26) (k + 1) / l o g (k + 1)}} \geq (100 k e^{- 3.18})^{k + 1} \geq (4 k)^{k + 1},

n \leq 2 n \leq (100 k)^{k + 1},

n \leq 2 n \leq (100 k)^{k + 1},

8^{4} \cdot 9^{3} \cdot 1 2^{- 6} = 6^{- 6} \cdot 8^{2} \cdot 9^{3} = 4^{3} \cdot 8^{- 2} = 3^{2} \cdot 4 \cdot 6^{- 2} = 2^{2} \cdot 4^{- 1} = 1.

8^{4} \cdot 9^{3} \cdot 1 2^{- 6} = 6^{- 6} \cdot 8^{2} \cdot 9^{3} = 4^{3} \cdot 8^{- 2} = 3^{2} \cdot 4 \cdot 6^{- 2} = 2^{2} \cdot 4^{- 1} = 1.

(n^{3} - 3 n - 2) (n^{3} - 3 n + 2) n^{3} = (n^{3} - 4 n) (n^{3} - n)^{2} .

(n^{3} - 3 n - 2) (n^{3} - 3 n + 2) n^{3} = (n^{3} - 4 n) (n^{3} - n)^{2} .

24 0^{65} \cdot 24 3^{31} \cdot 24 5^{55} \cdot 25 0^{- 40} \cdot 25 2^{- 110} = 1.

24 0^{65} \cdot 24 3^{31} \cdot 24 5^{55} \cdot 25 0^{- 40} \cdot 25 2^{- 110} = 1.

\forall p \in P, j = 1 \sum k m_{j} v_{p} (n + j) \equiv 0 (mod. q)

\forall p \in P, j = 1 \sum k m_{j} v_{p} (n + j) \equiv 0 (mod. q)

E [X_{s}^{ℓ}] = p \in P \prod E [X_{p}^{ℓ v_{p} (s)}],

E [X_{s}^{ℓ}] = p \in P \prod E [X_{p}^{ℓ v_{p} (s)}],

E [j = 1 \prod k X_{n + j}^{m_{j}}] = j = 1 \prod k E [X_{n + j}^{m_{j}}] .

E [j = 1 \prod k X_{n + j}^{m_{j}}] = j = 1 \prod k E [X_{n + j}^{m_{j}}] .

E [j = 1 \prod k X_{n + j}^{m_{j}}] = E p \in P \prod X_{p}^{\sum_{1 \leq j \leq k} m_{j} v_{p} (n + j)} = p \in P \prod E [X_{p}^{\sum_{1 \leq j \leq k} m_{j} v_{p} (n + j)}] = 0,

E [j = 1 \prod k X_{n + j}^{m_{j}}] = E p \in P \prod X_{p}^{\sum_{1 \leq j \leq k} m_{j} v_{p} (n + j)} = p \in P \prod E [X_{p}^{\sum_{1 \leq j \leq k} m_{j} v_{p} (n + j)}] = 0,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On consecutive values of random completely multiplicative functions

Joseph Najnudel

Abstract.

In this article, we study the behavior of consecutive values of random completely multiplicative functions $(X_{n})_{n\geq 1}$ whose values are i.i.d. at primes. We prove that for $X_{2}$ uniform on the unit circle, or uniform on the set of roots of unity of a given order, and for fixed $k\geq 1$ , $X_{n+1},\dots,X_{n+k}$ are independent if $n$ is large enough. Moreover, with the same assumption, we prove the almost sure convergence of the empirical measure $N^{-1}\sum_{n=1}^{N}\delta_{(X_{n+1},\dots,X_{n+k})}$ when $N$ goes to infinity, with an estimate of the rate of convergence. At the end of the paper, we also show that for any probability distribution on the unit circle followed by $X_{2}$ , the empirical measure converges almost surely when $k=1$ .

1. Introduction

Many arithmetic functions of interest are multiplicative, i.e. their value at $mn$ is the product of their values at $m$ and $n$ , for all coprime integers $m,n\geq 1$ . For example, it is the case for the Möbius function, which is defined by $\mu(n)=0$ if $n\geq 1$ is divisible by the square of at least one prime number, $\mu(n)=1$ if $n$ is the product of an even number of distinct primes, and $\mu(n)=-1$ if $n$ is the product of an odd number of distinct primes. Similarly, Dirichlet characters are multiplicative, as well as the Liouville function, which is equal to $(-1)^{k}$ on integers with $k$ prime factors, counted with multiplicity: these functions are even completely multiplicative, which means that their value at $mn$ is the product of their values at $m$ and $n$ for all integers $m,n\geq 1$ . The behavior of the Möbius and the Liouville functions is far from being known with complete accuracy, even if partial results have been proven. This difficulty can be encoded by the corresponding Dirichlet series, which involve the Riemann zeta function. For example, the partial sum, up to $x$ , of the Möbius function in known to be negligible with respect to $x$ , and it is conjectured to be negligible with respect to $x^{r}$ for all $r>1/2$ : the first statement can quite easily be proven to be equivalent to the prime number theorem, whereas the second is equivalent to the Riemann hypothesis.

It has been noticed that the same bound $x^{r}$ for all $r>1/2$ is obtained if we take the partial sums of i.i.d., bounded and centered random variables. This suggests the naive idea to compare the Möbius function on square-free integers with i.i.d. random variables on $\{-1,1\}$ . However, a major difference between the two situations is that in the random case, we lose the multiplicativity of the function. A less naive randomized version of Möbius functions can be obtained as follows: one takes i.i.d. uniform random variables on $\{-1,1\}$ on prime numbers, [math] on prime powers of order larger than or equal to $2$ , and one completes the definition by multiplicativity.

In [29], Wintner considers a completely multiplicative function with i.i.d. values at primes, uniform on $\{-1,1\}$ (which corresponds to a randomized version of the Liouville function rather than the Möbius function), and proves that we have almost surely the same bound $x^{r}$ ( $r>1/2$ ) for the partial sums, as for the sums of i.i.d. random variables, or for the partial sums of Möbius function if the Riemann hypothesis is true. The estimate in [29] has been refined by Halász in [9], and then by Lau, Tenenbaum and Wu in [17]. Some lower bounds can also be deduced from moment estimates by Harper [10]. In order to get more general results, it can be useful to consider complex-valued random multiplicative functions. For example, it has been proven by Bohr and Jessen [2] that for $\sigma>1/2$ , the law of $\zeta(\sigma+iTU)$ , for $U$ uniformly distributed on $[0,1]$ , tends to a limiting random variable when $T$ goes to infinity. This limiting random variable can be written as $\sum_{n\geq 1}X_{n}n^{-\sigma}$ , when $(X_{n})_{n\geq 1}$ is a random completely multiplicative function such that $(X_{p})_{p\in\mathcal{P}}$ are i.i.d. uniform on the unit circle, $\mathcal{P}$ denoting, as in all the sequel of the present paper, the set of prime numbers. The fact that the series just above converges is a direct consequence (by partial summation) of the analog of the result of Wintner for the partial sums of $(X_{n})_{n\geq 1}$ : one can prove that almost surely, $\sum_{n\leq x}X_{n}=o(x^{r})$ for $r>1/2$ .

This discussion shows that it is often much less difficult to prove accurate results for random multiplicative function than for the arithmetic functions which are usually considered. In some informal sense, the arithmetic difficulties are diluted into the randomization, which is much simpler to deal with.

In the present paper, we study another example of results which are stronger and less difficult to prove in the random setting than in the deterministic one. The example we detail in this article is motivated by the following question, initially posed in the deterministic setting: for $k\geq 1$ , what can we say about the distribution of the $k$ -tuples $(\mu(n+1),\dots,\mu(n+k))$ , or $(\lambda(n+1),\dots,\lambda(n+k))$ , where $\mu$ and $\lambda$ are the Möbius and the Liouville functions, $n$ varies from $1$ to $N$ , $N$ tends to infinity? This question is only very partially solved. One knows (it is essentially a consequence of the prime number theorem), that for $k=1$ , the proportion of integers such that $\lambda$ is equal to $1$ or $-1$ tends to $1/2$ . For the Möbius function, the limiting proportions are $3/\pi^{2}$ for $1$ or $-1$ and $1-(6/\pi^{2})$ for [math]. It has been proven by Hildebrand [15] that for $k=3$ , the eight possible values of $(\lambda(n+1),\lambda(n+2),\lambda(n+3))$ appears infinitely often. This result has been improved by Matomäki, Radziwiłł and Tao [19], who prove that these eight values appear with a positive lower density: in other words, for all $(\epsilon_{1},\epsilon_{2},\epsilon_{3})\in\{-1,1\}^{3}$ ,

[TABLE]

The similar result is proven for the nine possible values of $(\mu(n+1),\mu(n+2))$ . A conjecture by Chowla [3] states that for all $k\geq 1$ , each possible pattern of $(\lambda(n+1),\dots,\lambda(n+k))$ appears with asymptotic density $2^{-k}$ . This conjecture is still open, however, partial results have been recently proven, in particular in papers by Tao and Teräväinen ([24], [25], [26]).

In the present paper, we prove results similar to this conjecture for random completely multiplicative functions $(X_{n})_{n\geq 1}$ . The random functions we will consider take i.i.d. values on the unit circle on prime numbers. Their distribution is then entirely determined by the distribution of $X_{2}$ . The two particular cases we will study in the largest part of the paper are the following: $X_{2}$ is uniform on the unit circle $\mathbb{U}$ , and $X_{2}$ is uniform on the set $\mathbb{U}_{q}$ of $q$ -th roots of unity, for $q\geq 2$ . In this case, we will show the following results: for all $k\geq 1$ , and for all $n\geq 1$ large enough depending on $k$ , the variables $X_{n+1},\dots,X_{n+k}$ are independent, and exactly i.i.d. uniform on the unit circle if $X_{2}$ is uniform. Moreover, the empirical distribution

[TABLE]

tends almost surely to the uniform distribution on $\mathbb{U}^{k}$ if $X_{2}$ is uniform on $\mathbb{U}$ , and to the uniform distribution on $\mathbb{U}_{q}^{k}$ if $X_{2}$ is uniform on $\mathbb{U}_{q}$ . In particular, the analog of Chowla’s conjecture holds almost surely in the case where $X_{2}$ is uniform on $\{-1,1\}$ . We have also an estimate on the speed of convergence of the empirical measure: in the case of the uniform distribution on $\mathbb{U}_{q}$ , each of the $q^{k}$ possible patterns for $(X_{n+1},\dots,X_{n+k})$ almost surely occurs with a proportion $q^{-k}+O(N^{-t})$ for $n$ running between $1$ and $N$ , for all $t<1/2$ . We have a similar result in the uniform case, if the test functions we consider are sufficiently smooth. It would be interesting to have similar results when the distribution of $X_{2}$ on the unit circle is not specified. For $k\geq 2$ , we are unfortunately not able to show similar results, but we nevertheless can prove that the empirical distribution of $X_{n}$ almost surely converges to a limiting distribution for any distribution of $X_{2}$ on the unit circle. We specify this distribution, which is always uniform on $\mathbb{U}$ or uniform on $\mathbb{U}_{q}$ for some $q\geq 1$ , and in the latter case, we give an estimate of the rate of convergence. This rate corresponds to a negative power of $\log N$ , which is much slower than what we obtain when $X_{2}$ is uniform on $\mathbb{U}_{q}$ .

The techniques we use in our proofs are elementary in general, mixing classical tools in probability theory and number theory. However, a part of our arguments need to use deep results on diophantine equations, in order to bound the number and the size of their solutions.

The sequel of the paper is organized as follows. In Sections 2 and 3, we study the law of $(X_{n+1},\dots,X_{n+k})$ for $n$ large depending on $k$ , first in the case where $X_{2}$ is uniform on $\mathbb{U}$ , then in the case where $X_{2}$ is uniform on $\mathbb{U}_{q}$ . In Section 4, we study the empirical measure of $(X_{n+1},\dots,X_{n+k})$ in the case of $X_{2}$ uniform on $\mathbb{U}$ . In the proof of the convergence of this empirical measure, we need to estimate the second moment of sums of the form $\sum_{n=N^{\prime}+1}^{N}\prod_{j=1}^{k}X_{n+j}^{m_{j}}$ . The problem of estimating moments of order different from two for such sums is discussed in Section 5. The proof of convergence of empirical measure in the case of uniform variables on $\mathbb{U}_{q}$ is given in Section 6. Finally, we consider the case of a general distribution for $X_{2}$ in Section 7.

2. Independence in the uniform case

In this section, we suppose that $(X_{p})_{p\in\mathcal{P}}$ are i.i.d. uniform random variables on the unit circle. By convenience, we will extend our multiplicative function to positive rational numbers by setting $X_{p/q}:=X_{p}/X_{q}$ : the result is independent of the choice of $p$ and $q$ , and we have $X_{r}X_{s}=X_{rs}$ for all rationals $r,s>0$ . Moreover, $X_{r}$ is uniform on the unit circle for all positive rational $r\neq 1$ . In this section, we will show that for fixed $k\geq 1$ , $(X_{n+1},\dots,X_{n+k})$ are independent if $n$ is sufficiently large. The following result gives a criterion for such independence:

Lemma 2.1.

For all $n,k\geq 1$ , the variables $(X_{n+1},\dots,X_{n+k})$ are independent if and only if $\log(n+1),\dots,\log(n+k)$ are linearly independent on $\mathbb{Q}$ .

Proof.

Since the variables $(X_{n+1},\dots,X_{n+k})$ are uniform on the unit circle, they are independent if and only if

[TABLE]

for all $(m_{1},\dots,m_{k})\in\mathbb{Z}^{k}\backslash\{(0,0,\dots,0)\}$ . This equality is equivalent to

[TABLE]

i.e.

[TABLE]

or

[TABLE]

∎

We then get the following result:

Proposition 2.2.

The variables $(X_{n+1},\dots,X_{n+k})$ are i.i.d. as soon as $n\geq(100k)^{k+1}$ . In particular, for $k$ fixed, this property is true for all but finitely many $n$ .

Remark 2.3.

The same result is proven in [27], Theorem 3. (i), with an asymptotically better bound, namely $n\geq e^{ck\log\log(k+2)/\log(k+1)}$ where $c>0$ is a constant. However, their proof uses a deep result by Shorey [23] on linear forms in the logarithms of algebraic numbers, involving technical tools by Gelfond and Baker, whereas our proof is elementary. Moreover, the constant $c$ involved in [27] is not given, even if it is explicitly computable.

Proof.

Let us assume that we have a linear dependence (1) between $\log(n+1),\dots,\log(n+k)$ : necessarily $k\geq 2$ . Moreover, the integers $n+j$ for which $m_{j}\neq 0$ cannot be divisible by a prime larger than $k$ : otherwise this factor remains in the product $\prod_{\ell=1}^{k}(n+\ell)^{m_{j}}$ since none of the $n+\ell$ for $\ell\neq j$ can be divisible by $p$ , and then the product cannot be equal to $1$ . We can rewrite the dependence as follows:

[TABLE]

for a subset $A$ of $\{1,\dots,k\}\backslash\{j\}$ and for $R:=(r_{\ell})_{\ell\in A}\in\mathbb{Q}^{A}$ . Let us assume that the cardinality $|A|$ is as small as possible. Taking the decomposition in prime factors, we get for all $p\in\mathcal{P}$ ,

[TABLE]

where $v_{p}$ denotes the exponent of $p$ in the prime factorization. If $M:=(v_{p}(n+\ell))_{p\in\mathcal{P},\ell\in A}$ , $V:=(v_{p}(n+j))_{p\in\mathcal{P}}$ , then we can write these equalities in a matricial way $V=MR$ . The minimality of $|A|$ ensures that the matrix $M$ has rank $|A|$ . Moreover, since all the prime factors of $(n+\ell)_{\ell\in A}$ are smaller than $k$ , all the rows of $M$ indexed by prime numbers larger than $k$ are identically zero, and then the rank $|A|$ of $M$ is at most $\pi(k)$ , the number of primes smaller than or equal to $k$ . Moreover, we can extract a subset $\mathcal{Q}$ of $\mathcal{P}$ of cardinality $|A|$ such that the restriction $M^{(\mathcal{Q})}$ of $M$ to the rows with indices in $\mathcal{Q}$ is invertible. We have with obvious notation: $V^{(\mathcal{Q})}=M^{(\mathcal{Q})}R$ , and then by Cramer’s rule, the entries of $R$ can be written as the quotients of determinants of matrices obtained from $M^{(\mathcal{Q})}$ by replacing one column by $V^{(\mathcal{Q})}$ , by the determinant of $M^{(\mathcal{Q})}$ . All the entries involved in these matrices are $p$ -adic valuations of integers smaller than or equal to $n+k$ , so they are at most $\log(n+k)/\log 2$ . By Hadamard inequality, the absolute value of the determinants are smaller than or equal to $([\log(n+k)/\log(2)]^{|A|})|A|^{|A|/2}$ . Since $|A|\leq\pi(k)$ , we deduce, after multiplying by $\det(M^{(\mathcal{Q})})$ , that there exists a linear dependence between $\log(n+1),\dots,\log(n+k)$ involving only integers of absolute value at most $D:=[\sqrt{\pi(k)}\log(n+k)/\log 2]^{\pi(k)}$ : let us keep the notation of (1) for this dependence. Let $q$ be the smallest nonnegative integer such that $\sum_{j=1}^{k}j^{q}m_{j}\neq 0$ : from the fact that the Vandermonde matrices are invertible, one deduces that $q\leq k-1$ . Using the fact that

[TABLE]

we deduce, by writing the dependence above:

[TABLE]

if $q\geq 1$ and

[TABLE]

if $q=0$ . Since the first factor in the left-hand side of these inequalities is a non-zero integer, it is at least $1$ . From the bounds we have on the $m_{j}$ ’s, we deduce

[TABLE]

for $q\geq 1$ and

[TABLE]

for $q=0$ . Hence

[TABLE]

if $n\geq 3$ , which implies, since $q\leq k-1$ ,

[TABLE]

If $n\geq k\vee 3$ , we deduce

[TABLE]

i.e.

[TABLE]

Now, one has obviously $\pi(k)\leq 2k/3$ for all $k\geq 2$ , and then $\sqrt{\pi(k)}/\log 2\leq\sqrt{2k}$ for all integers $k\geq 2$ , and more accurately, it is known that $(\pi(k)\log k)/k$ , which tends to $1$ at infinity by the prime number theorem, reaches its maximum at $k=113$ : this fact is in particular an immediate consequence of [22], Corollary 2, equation (3.7). Hence,

[TABLE]

where

[TABLE]

and then

[TABLE]

Let us assume that $2n\geq(100k)^{k+1}$ . The function $x\mapsto x/\log^{\pi(k)}(x)$ is increasing for $x\geq e^{\pi(k)}$ . Moreover, by studying the function $x\mapsto\log\log(100x)/\log(x+1)$ for $x\geq 2$ , we check that $\log(100k)\leq(k+1)^{1.52}$ for all $k\geq 2$ . Hence, since $\pi(k)\leq\pi(k+1)$ ,

[TABLE]

which contradicts the previous inequality.

Hence,

[TABLE]

and this bound is of course also available for $n\leq k\vee 3$ . ∎

This result implies that theoretically, for fixed $k$ , one can find all the values of $n$ such that $(X_{n+1},\dots,X_{n+k})$ are not independent by brute force computation. In practice, the bound we have obtained is far from optimal, and is too poor to be directly useable except for very small values of $k$ , for which a more careful reasoning can solve the problem directly. Here is an example for $k=5$ :

Proposition 2.4.

For $n\geq 1$ , the variables $(X_{n+1},X_{n+2},X_{n+3},X_{n+4},X_{n+5})$ are independent except if $n\in\{1,2,3,4,5,7\}$ .

Proof.

If $\prod_{j=1}^{5}(n+j)^{m_{j}}=1$ with integers $m_{1},\dots m_{5}$ not all equal to zero, then $m_{j}=0$ as soon as $n+j$ has a prime factor larger than or equal to $5$ : otherwise, this prime factor cannot be cancelled by the factors $(n+k)^{m_{k}}$ for $k\neq j$ . Hence, the values of $n+j$ such that $m_{j}\neq 0$ have only prime factors $2$ and $3$ , and at most one of them has both factors since it should then be divisible by $6$ . Moreover, if $n\geq 4$ , there can be at most one power of $2$ and one power of $3$ among $n+1,\dots,n+5$ . One deduces that dependence is only possible if among $n+1,\dots,n+5$ , there are three numbers, respectively of the form $2^{k},3^{\ell},2^{r}.3^{s}$ , for integers $k,\ell,r,s>0$ . The quotient between two of these integers is between $1/2$ and $2$ since we here assume $n\geq 4$ . Hence, $2^{k}\geq 2^{r}.3^{s}/2\geq 2^{r}$ and then $k\geq r$ . Similarly, $3^{\ell}\geq 2^{r}.3^{s}/2\geq 3^{s}$ , which implies $\ell\geq s$ . The numbers $2^{k}$ and $2^{r}.3^{s}$ are then both divisible by $2^{r}$ ; since they differ by at most $4$ , $r\leq 2$ . The numbers $3^{\ell}$ and $2^{r}.3^{s}$ are both divisible by $3^{s}$ , and then $s\leq 1$ . Therefore, $2^{r}.3^{s}\leq 12$ and $n\leq 11$ . If $9\leq n\leq 11$ , the only possible values of $n+j$ such that $m_{j}$ can be different from zero are $12$ and $16$ , which are multiplicatively independent. If $n=8$ , the only possible values are $9$ and $12$ , which are also independent, if $n=6$ , the values to consider are $8$ and $9$ . The only remaining values of $n$ are $1,2,3,4,5,7$ , which are exceptions since

[TABLE]

∎

The results above give an upper bound, for fixed $k$ , of the maximal value of $n$ such that $(X_{n+1},\dots,X_{n+k})$ are not independent. By considering two consecutive squares and their geometric mean, whose logarithms are linearly dependent, one deduces the lower bound $([k/2]-1)^{2}-1\geq(k-1)(k-5)/4$ for the maximal $n$ . As written in a note by Dubickas [4], this bound can be improved to a quantity equivalent to $(k/4)^{3}$ , by considering the identity:

[TABLE]

In [4], as an improvement of a result of [27], it is also shown that for all $\epsilon>0$ , the lower bound $e^{\log^{2}k/[(4+\epsilon)\log\log k]}$ occurs for infinitely many values of $k$ .

A computer search gives, for $k$ between $3$ and $13$ , and $n\leq 1000$ , the following largest values for which we do not have independent variables: 1, 5, 7, 14, 23, 24, 47, 71, 71, 71, 239. For example, if $k=13$ and $n=239$ , the five integers $240,243,245,250,252$ have only the four prime factors $2,3,5,7$ , so we necessarily have a dependence, namely:

[TABLE]

It would remain to check if there are dependences for $n>1000$ .

3. Independence in the case of roots of unity

We now suppose that $(X_{p})_{p\geq 1}$ are i.i.d., uniform on the set of $q$ -th roots of unity, $q\geq 1$ being a fixed integer. If $q=2$ , we get symmetric Bernoulli random variables. For all integers $s\geq 2$ , we will denote by $\mu_{s,q}$ the largest divisor $d$ of $q$ such that $s$ is a $d$ -th power. The analog of Lemma 2.1 in the present setting is the following:

Lemma 3.1.

For $n,k\geq 1$ , the variables $(X_{n+1},\dots,X_{n+k})$ are all uniform on the set of $q$ -th roots of unity if and only if $\mu_{n+j,q}=1$ for all $j$ between $1$ and $k$ . They are independent if and only if the only $k$ -tuple $(m_{1},\dots,m_{k})$ , $0\leq m_{j}<q/\mu_{n+j,q}$ such that

[TABLE]

is $(0,0,\dots,0)$ .

Proof.

For any $s\geq 2$ , $\ell\in\mathbb{Z}$ , we have

[TABLE]

which is equal to $1$ if $\ell v_{p}(s)$ is divisible by $q$ for all $p\in\mathcal{P}$ , and to [math] otherwise. The condition giving $1$ is equivalent to the fact that $\ell$ is a multiple of $q/(\operatorname{gcd}(q,(v_{p}(s))_{p\in\mathcal{P}}))$ , which is $q/\mu_{s,q}$ . Hence, $X_{s}$ is a uniform $(q/\mu_{s,q})$ -th root of unity, which implies the first part of the proposition.

The variables $(X_{n+1},\dots,X_{n+k})$ are independent if and only if for all $m_{1},\dots,m_{k}\in\mathbb{Z}$ ,

[TABLE]

Since $X_{n+j}$ is a uniform $(q/\mu_{n+j,q})$ -th root of unity, both sides of the equality depend only on the values of $m_{j}$ modulo $q/\mu_{n+j,q}$ for $1\leq j\leq k$ . This implies that we can assume, without loss of generality, that $0\leq m_{j}<q/\mu_{n+j,q}$ for all $j$ . If all the $m_{j}$ ’s are zero, both sides are obviously equal to $1$ . Otherwise, the right-hand side is equal to zero, and then we have independence if and only if it is also the case of the left-hand side, i.e. for all $(m_{1},\dots,m_{k})\neq(0,0,\dots,0)$ , $0\leq m_{j}<q/\mu_{n+j,q}$ ,

[TABLE]

which is true if and only if

[TABLE]

∎

We then have the following result, similar to Proposition 2.2:

Proposition 3.2.

For fixed $k,q\geq 1$ , there exists an explicitely computable $n_{0}(k,q)$ such that $(X_{n+1},\dots,X_{n+k})$ are independent as soon as $n\geq n_{0}(k,q)$ .

The bound $n_{0}(k,q)$ can be deduced from bounds on the solutions of certain diophantine equations which are available in the literature: we do not take care of its precise value, which is anyway far too large to be of any use if we want to find in practice the values of $n$ such that $(X_{n+1},\dots,X_{n+k})$ are not independent.

Proof.

For each value of $n\geq 1$ such that $(X_{n+1},\dots,X_{n+k})$ are dependent, there exist $0\leq m_{j}<q/\mu_{n+j,q}$ , not all zero, such that

[TABLE]

There are finitely many choices, depending only on $k$ and $q$ , for the $k$ -tuples $(\mu_{n+j,q})_{1\leq j\leq k}$ and $(m_{j})_{1\leq j\leq k}$ , so it is sufficient to show that the values of $n$ corresponding to each choice of $k$ -tuples is bounded by an explicitely computable quantity. At least two of the $m_{j}$ ’s are non-zero: otherwise $m_{j}v_{p}(n+j)$ is divisible by $q$ for all $p\in\mathcal{P}$ , $j$ being the unique index such that $m_{j}\neq 0$ , and then $m_{j}$ is divisible by $q/\mu_{n+j,q}$ : this contradicts the inequality $0<m_{j}<q/\mu_{n+j,q}$ .

On the other hand, if $p$ is a prime larger than $k$ , at most one of the terms $m_{j}v_{p}(n+j)$ is non-zero, and then all the terms are divisible by $q$ , since it is the case for their sum.

We deduce that $n+j$ is the product of a power of order $\rho_{j}:=q/\operatorname{gcd}(m_{j},q)$ and a number $A_{j}$ whose prime factors are all smaller than $k$ . Moreover, one can assume that $A_{j}$ is ” $\rho_{j}$ -th power free”, i.e. that all its $p$ -adic valuations are strictly smaller than $\rho_{j}$ . Hence there exist

[TABLE]

and an integer $B_{j}\geq 1$ such that $n+j=A_{j}B_{j}^{\rho_{j}}$ . The value of the exponents $\rho_{j}$ are fixed by the $m_{j}$ ’s, and at least two of them are strictly larger than $1$ , since at least two of the $m_{j}$ ’s are non-zero. Let us first assume that there exist distinct $j$ and $j^{\prime}$ such that $\rho_{j}\geq 2$ and $\rho_{j^{\prime}}\geq 3$ . One finds an explicitly computable bound on $n$ in this case as soon as we find an explicitly computable bound for the solutions of each diophantine equation in $x$ and $z$ :

[TABLE]

for each $A,A^{\prime},d$ such that $1\leq A,A^{\prime}\leq(k!)^{q}$ and $-k<d<k$ , $d\neq 0$ . These equations can be rewritten as: $y^{\rho_{j}}=f(x)$ , where $y=Az$ and

[TABLE]

This polynomial has all simple roots (the $\rho_{j^{\prime}}$ -th roots of $-d/A^{\prime}$ ) and then at least two of them; it has at least three if $\rho_{j}=2$ since $\rho_{j^{\prime}}$ is supposed to be at least $3$ in this case. By a result of Baker [1], all the solutions are bounded by an explicitly computable quantity, which gives the desired result (the same result with an ineffective bound was already proven by Siegel).

In remains to deal with the case where $\rho_{j}=2$ for all $j$ such that $m_{j}\neq 0$ . In this case, $q$ is even and $m_{j}$ is divisible by $q/2$ , which implies that $m_{j}=q/2$ when $m_{j}\neq 0$ . By looking at the prime factors larger than $k$ , one deduces that for all $j$ such that $m_{j}\neq 0$ , $n+j$ is a square times a product of distinct primes smaller than or equal to $k$ . If at least three of the $m_{j}$ ’s are non-zero, it then suffices to find an explicitly computable bound for the solutions of each system of diophantine equations:

[TABLE]

for $1\leq A,B,C\leq k!$ squarefree, $-k<d_{1},d_{2}<k$ , $d_{1},d_{2},d_{1}-d_{2}\neq 0$ . From these equations, we deduce, for $x=BCyz$ :

[TABLE]

The four roots of the right-hand side are the square roots of $-d_{1}/A$ and $-d_{2}/A$ , which are all distinct since $d_{1}\neq d_{2}$ , $d_{1}\neq 0$ , $d_{2}\neq 0$ . Again by Baker’s result, one deduces that the solutions are explicitly bounded, which then gives an explicit bound for $n$ .

The remaining case is when exactly two of the $m_{j}$ ’s are non-zero, with $\rho_{j}=2$ , and then $m_{j}=q/2$ . The dependence modulo $q$ then means that $(n+j)(n+j^{\prime})$ is a square for distinct $j,j^{\prime}$ between $1$ and $k$ , which implies that $(n+j)/g$ and $(n+j^{\prime})/g$ are both squares where $g=\operatorname{gcd}(n+j,n+j^{\prime})$ . These squares have difference smaller than $k$ , which implies that they are smaller than $k^{2}$ . Moreover, $g$ divides $|j-j^{\prime}|\leq k$ , and then $g\leq k$ , which gives $n\leq k^{3}$ . ∎

Here, we explicitly solve a particular case:

Proposition 3.3.

For $q=2$ , $(X_{n+1},\dots,X_{n+5})$ are independent for all $n\geq 2$ and not for $n=1$ .

Proof.

A dependence means that there exists a product of distinct non-square integers among $n+1,\dots,n+5$ which is a square. For a prime $p\geq 5$ , at most one $p$ -adic valuation is non-zero, which implies that all the $p$ -adic valuations are even. Hence, the factors involved in the product are all squares multiplied by $2,3$ or $6$ . Since they differ by at most $4$ , they cannot be in the same of the three ”categories”, which implies, since the product is a square, that there exist three numbers, respectively of the form $2x^{2}$ , $3y^{2}$ , $6z^{2}$ , in the interval between $n+1$ and $n+5$ . Now, Hajdu and Pintér [6] have determined all the triples of distinct integers in intervals of length at most 12 whose product is a square. For length $5$ , the only positive triple is $(2,3,6)$ , which implies that the only dependence in the present setting is $X_{2}X_{3}X_{6}=1$ . ∎

Remark 3.4.

The list given in [6] shows that for $q=2$ , there are dependences for quite large values of $n$ as soon as $k\geq 6$ . For example, we have $X_{240}X_{243}X_{245}=1$ for $k=6$ and $X_{10082}X_{10086}X_{10092}=1$ for $k=11$ .

4. Convergence of the empirical measure in the uniform case

In this section, $(X_{p})_{p\in\mathcal{P}}$ are uniform on the unit circle, and $k\geq 1$ is a fixed integer. For $N\geq 1$ , we consider the empirical measure of the $N$ first $k$ -tuples:

[TABLE]

It is reasonable to expect that $\mu_{k,N}$ tends to the uniform distribution on $\mathbb{U}^{k}$ , which is the common distribution of $(X_{n+1},\dots X_{n+k})$ for all but finitely many values of $n$ . In order to prove this result, we will estimate the second moment of the Fourier transform of $\mu_{k,N}$ , given by

[TABLE]

Proposition 4.1.

Let $m_{1},\dots,m_{k}$ be integers, not all equal to zero. Then, for all $N>N^{\prime}\geq 0$ ,

[TABLE]

and there exists $C_{m_{1},\dots,m_{k}}\geq 0$ , independent of $N$ and $N^{\prime}$ , such that

[TABLE]

Moreover, under the same assumption,

[TABLE]

Finally, for $k\in\{1,2\}$ , one can take $C_{m_{1}}$ or $C_{m_{1},m_{2}}$ equal to [math], and for $k=3$ , one can take $C_{m_{1},m_{2},m_{3}}=2$ if $(m_{1},m_{2},m_{3})$ is proportional to $(2,1,-4)$ and $C_{m_{1},m_{2},m_{3}}=0$ otherwise.

Proof.

We have, using the completely multiplicative extension of $X_{r}$ to all $r\in\mathbb{Q}_{+}^{*}$ :

[TABLE]

and then the left-hand side is equal to the number of couples $(n_{1},n_{2})$ in $\{N^{\prime}+1,\dots,N\}^{2}$ such that

[TABLE]

The number of trivial solutions $n_{1}=n_{2}$ of this equation is equal to $N-N^{\prime}$ , which gives a lower bound on the second moment we have to estimate. On the other hand, the derivative of the rational fraction $\prod_{j=1}^{k}(X+j)^{m_{j}}$ can be written as the product of $\prod_{j=1}^{k}(X+j)^{m_{j}-1}$ , which is strictly positive on $\mathbb{R}_{+}$ , by the polynomial

[TABLE]

The polyomial $Q$ has degree at most $k-1$ and is non-zero, since $(m_{1},\dots,m_{k})\neq(0,\dots,0)$ and then $\prod_{j=1}^{k}(X+j)^{m_{j}}$ is non-constant. We deduce that $Q$ has at most $k-1$ zeros, and then on $\mathbb{R}_{+}$ , $\prod_{j=1}^{k}(X+j)^{m_{j}}$ is strictly monotonic on each of at most $k$ intervals of $\mathbb{R}_{+}$ , whose bounds are [math], the positive zeros of $Q$ and $+\infty$ . Hence, for each choice of $n_{1}$ , there are at most $k$ values of $n_{2}$ satisfying (2), i.e. at most one in each interval, which gives the upper bound $k(N-N^{\prime})$ for the moment we are estimating.

Moreover, since $\prod_{j=1}^{k}(X+j)^{m_{j}}$ is strictly monotonic on an interval of the form $[A,\infty)$ for some $A>0$ , we deduce that for any non-trivial solution $(n_{1},n_{2})$ of (2), the minimum of $n_{1}$ and $n_{2}$ is at most $A$ . Hence, there are finitely many possibilities for the common value of the two sides of (2), and for each of these values, at most $k$ possibilities for $n_{1}$ and for $n_{2}$ . Hence, for fixed $(m_{1},\dots,m_{k})$ , the total number of non-trivial solutions of (2) is finite, which gives the bound $N-N^{\prime}+C_{m_{1},\dots,m_{k}}$ of the proposition.

The statement involving the empirical measure is deduced by taking $N^{\prime}=0$ and by dividing everything by $N^{2}$ .

The claim for $k\leq 3$ is an immediate consequence of the following statement we will prove now: the only integers $n_{1}>n_{2}\geq 1$ , $(m_{1},m_{2},m_{3})\neq(0,0,0)$ , such that

[TABLE]

are $n_{1}=7$ , $n_{2}=2$ , $(m_{1},m_{2},m_{3})$ proportional to $(2,1,-4)$ , which corresponds to the equality:

[TABLE]

If $m_{1},m_{2},m_{3}$ have the same sign and are not all zero, $(n+1)^{m_{1}}(n+2)^{m_{2}}(n+3)^{m_{3}}$ is strictly monotonic in $n\geq 1$ , and then we cannot get a solution of (3) with $n_{1}>n_{2}$ . By changing all the signs if necessary, we may assume that one of the integers $m_{1},m_{2},m_{3}$ is strictly negative and the others are nonnegative. For $n\geq 1$ , the fraction obtained by writing $(n+1)^{m_{1}}(n+2)^{m_{2}}(n+3)^{m_{3}}$ can only be simplified by prime factors dividing two of the integers $n+1,n+2,n+3$ , and then only by a power of $2$ . If $m_{2}<0$ and then $m_{1},m_{3}\geq 0$ , the numerator and the denominator have different parity, and then the fraction is irreducible for all $n$ : we do not get any solution of (3) in this case. Otherwise, $m_{1}$ or $m_{3}$ is strictly negative. If $(n_{1},n_{2})$ solves (3), let us define $s:=1$ and $j:=n_{2}+1$ if $m_{1}<0$ , and $s:=-1$ and $j:=n_{2}+3$ if $m_{3}<0$ . The denominators of the two fractions corresponding to the two sides of (3) are respectively a power of $j$ and the same power of $n_{1}+2-s$ : if (3) is satisfied, these denominators should differ only by a power of $2$ , since the fractions can be only simplified by such a power. Hence, $n_{1}+2-s=2^{\ell}j$ for some $\ell\geq 0$ , and by looking at the numerators of the fractions, we deduce that there exists $r\geq 0$ such that

[TABLE]

If $\ell\geq 2$ , the ratios $(2^{\ell}j+s)/(j+s)$ and $(2^{\ell}j+2s)/(j+2s)$ are at least $(4\cdot 2+2)/(2+2)=5/2$ since $j\geq n_{2}+1\geq 2$ and $|2s|\leq 2$ , and then the ratio between the right-hand side and the left-hand side of the previous equality is at least $(5/2)^{m_{2+s}+m_{2}}2^{-r}$ , which gives

[TABLE]

On the other hand, the $2$ -adic valution of the right-hand side is $m_{2+s}$ since $2^{\ell}j+2s\equiv 2$ modulo 4, whereas the valuation of the left-hand side is at least $r$ , which gives

[TABLE]

We then get a contradiction for $\ell\geq 2$ , except in the case $m_{2+s}=m_{2}=0$ , where we already know that there is no solution of (3). If $\ell=1$ , we get

[TABLE]

In this case, the prime factors of $2j+s$ , which are odd ( $|s|=1$ ), should divide $j+s$ or $j+2s$ , then $2j+2s$ or $2j+4s$ , and finally $s$ or $3s$ . Hence, $2j+s$ is a power of $3$ . Similarly, the odd factors of $j+2s$ , and then of $2j+4s$ , should divide $2j+s$ or $2j+2s$ , and then $s$ or $3s$ : $2j+4s$ is the product of a power of $2$ and a power of $3$ . If we write $2j+s=3^{a}$ , $2j+4s=2^{b}3^{c}$ , we must have $|3^{a}-2^{b}3^{c}|=3$ . If $a\leq 1$ , we have $2j+s\leq 3$ . If $s=1$ , we get $n_{2}+1=j\leq 1$ , and if $s=-1$ , we get $n_{2}+3=j\leq 2$ , which is impossible. If $a\geq 2$ , $3^{a}$ is divisible by $9$ , and then $2^{b}3^{c}$ is congruent to $3$ or $6$ modulo $9$ , which implies $c=1$ , and then $|3^{a-1}-2^{b}|=1$ . Now, by induction, one proves that the order of $2$ modulo $3^{a-1}$ is equal to $2.3^{a-2}$ (i.e. $2$ is a primitive root modulo the powers of $3$ ). This result is classical, and can be deduced, for example, from Rosen [21], Theorem 8.9. For sake of completeness, we give a proof here. The result is easy to check be direct computation for $a=2$ and $a=3$ . Let us assume that it is true for all values until $a\geq 3$ . The order of $2$ modulo $3^{a}$ is a multiple of the order of $2$ modulo $3^{a-1}$ , and then a multiple of $2.3^{a-2}$ by assumption. On the other hand, it is a divisor of $2.3^{a-1}$ by Euler’s theorem. Hence, it is either $2.3^{a-2}$ or $2.3^{a-1}$ . Moreover, since $2.3^{a-3}$ is assumed to be the order of $2$ modulo $3^{a-2}$ but strictly smaller than the order of $2$ modulo $3^{a-1}$ , we have

[TABLE]

where $u$ is not divisible by $3$ . Raising to the cube, we deduce

[TABLE]

where

[TABLE]

is not divisible by $3$ (recall that $a\geq 3$ here). Hence, the order of $2$ modulo $3^{a}$ is not $2.3^{a-2}$ : it can only be $2.3^{a-1}$ , which proves by induction that $2$ is a primitive root of $3^{a-1}$ for all $a\geq 2$ . Now, in the present situation, the order of $2$ modulo $3^{a-1}$ , i.e. $2.3^{a-2}$ , should divide $2b$ , since $2^{b}\equiv\pm 1$ modulo $3^{a-1}$ , and then $b\geq 3^{a-2}$ ( $b=0$ is not possible) which implies $2^{3^{a-2}}\leq 3^{a-1}+1$ , i.e. $a\in\{2,3\}$ .

If $a=2$ and $s=1$ , we get $2j+1=9$ , $j=4$ , and then $n_{1}=7$ , $n_{2}=3$ . We should solve $4^{m_{1}}5^{m_{2}}6^{m_{3}}=8^{m_{1}}9^{m_{2}}10^{m_{3}}$ . Taking the $3$ -adic valuation gives $m_{3}=2m_{2}$ , taking the $5$ -adic valuation gives $m_{3}=m_{2}$ , and then $m_{2}=m_{3}=0$ , which implies $m_{1}=0$ .

If $a=2$ and $s=-1$ , we get $2j-1=9$ , $j=5$ , $n_{1}=7$ , $n_{2}=2$ , which gives the equation $3^{m_{1}}4^{m_{2}}5^{m_{3}}=8^{m_{1}}9^{m_{2}}10^{m_{3}}$ . Taking the $2$ -adic valuation gives $2m_{2}=3m_{1}+m_{3}$ , taking the $3$ -adic valuation gives $m_{1}=2m_{2}$ , and then $(m_{1},m_{2},m_{3})$ should be proportional to $(2,1,-4)$ : in this case, we get one of the solutions already mentioned.

If $a=3$ , $2^{b}$ should be $8$ or $10$ , and then $b=3$ , $2j+s=27$ , $2j+4s=24$ , $j=14$ , $s=-1$ , $n_{1}=25$ , $n_{2}=11$ . We have to solve $12^{m_{1}}13^{m_{2}}14^{m_{3}}=26^{m_{1}}27^{m_{2}}28^{m_{3}}$ . Taking the $3$ -adic valuation gives $m_{1}=3m_{2}$ , taking the $13$ -adic valuation gives $m_{1}=m_{2}$ , and then $m_{1}=m_{2}=m_{3}=0$ .

∎

Corollary 4.2.

For all $(m_{1},\dots,m_{k})\in\mathbb{Z}^{k}$ , $\hat{\mu}_{k,N}(m_{1},\dots,m_{k})$ converges in $L^{2}$ , and then in probability, to $\mathds{1}_{m_{1}=\dots=m_{k}=0}$ , i.e. to the corresponding Fourier coefficient of the uniform distribution $\mu_{k}$ on $\mathbb{U}^{k}$ . In other words, $\mu_{k,N}$ converges weakly in probability to $\mu_{k}$ .

In this setting, we also have a strong law of large numbers, with an estimate of the rate of convergence, for sufficiently smooth test functions. Before stating the corresponding result, we will show the following lemma, which will be useful:

Lemma 4.3.

Let $\epsilon>\delta\geq 0$ , $C>0$ , and let $(A_{n})_{n\geq 0}$ be a sequence of random variables such that $A_{0}=0$ and for all $N>N^{\prime}\geq 0$ ,

[TABLE]

Then, almost surely, $A_{N}=O(N^{1/2+\epsilon})$ : more precisely, we have for $M>0$ ,

[TABLE]

where $K_{\epsilon,\delta}>0$ depends only on $\delta$ and $\epsilon$ .

Proof.

For $\ell,q\geq 0$ , $M>0$ and $\epsilon^{\prime}:=(\delta+\epsilon)/2\in(\delta,\epsilon)$ , we have:

[TABLE]

Since $\epsilon^{\prime}>\delta$ , we deduce that the probability that

[TABLE]

for all $\ell,q\geq 0$ is at least $1-DCM^{-2}$ , where $D$ depends only on $\epsilon^{\prime}$ and $\delta$ , and then only on $\delta$ and $\epsilon$ . Now, if (4) occurs for all $\ell,q\geq 0$ , if we take the binary expansion $N=\sum_{j=0}^{\infty}\delta_{j}2^{j}$ with $\delta_{j}\in\{0,1\}$ , and if $N_{r}=\sum_{j=r}^{\infty}\delta_{j}2^{j}$ for all $r\geq 0$ , then we get $|A_{N_{r}}-A_{N_{r+1}}|=0$ if $\delta_{r}=0$ , and

[TABLE]

if $\delta_{r}=1$ . Adding these inequalities from $r=0$ to $\infty$ , we deduce that $|A_{N}|\leq M\mu(N)N^{1/2+\epsilon^{\prime}}$ , where $\mu(N)$ is the number of $1$ ’s in the binary expansion of $N$ . Hence,

[TABLE]

where $B>0$ depends only on $\epsilon^{\prime}$ and $\epsilon$ (recall that $\epsilon>\epsilon^{\prime}$ ), and then only on $\delta$ and $\epsilon$ . We then have, for $M^{\prime}:=BM$ :

[TABLE]

which gives the desired result after replacing $M^{\prime}$ by $M$ . ∎

From this lemma, we deduce the following:

Proposition 4.4.

Almost surely, $\mu_{k,N}$ weakly converges to $\mu_{k}$ . More precisely, the following holds with probability one: for all $u>k/2$ , for all continuous functions $f$ from $\mathbb{U}^{k}$ to $\mathbb{C}$ such that

[TABLE]

$||\cdot||$ * denoting any norm on $\mathbb{R}^{k}$ , and for all $\epsilon>0$ ,*

[TABLE]

Remark 4.5.

By Cauchy-Schwarz inequality, we have

[TABLE]

which implies that the assumption on $f$ given in the proposition is satisfied for all $f$ in the Sobolev space $H^{s}$ as soon as $s>k$ .

Unfortunately, the proposition does not apply if $f$ is a product of indicators of arcs. The weak convergence implies that

[TABLE]

even in this case, but we don’t know at which rate this convergence occurs.

Proof.

From Proposition 4.1, and Lemma 4.3 applied to $\epsilon>0$ , $\delta=0$ and

[TABLE]

we get, for all $m\in\mathbb{Z}^{k}\backslash\{0\}$ , $M>0$ ,

[TABLE]

For fixed $u>k/2$ , we apply this estimate to $M=||m||^{u}$ and get

[TABLE]

Since $-2u<-k$ , we deduce, by Borel-Cantelli lemma, that almost surely,

[TABLE]

for all but finitely many $m\in\mathbb{Z}^{k}\backslash\{0\}$ . Therefore, almost surely,

[TABLE]

i.e.

[TABLE]

for $m\in\mathbb{Z}^{k}\backslash\{0\}$ , $N\geq 1$ . Almost surely, this estimates simultaneously occurs for all rationals $u>k/2$ and $\epsilon>0$ (with a random implicit constant in $O$ , depending on $u$ and $\epsilon$ ) and then for all reals $u>k/2$ and $\epsilon>0$ .

Let us now assume that this almost sure property holds, let us fix $u>k/2$ , $\epsilon>0$ , and let $f$ be a function satisfying the assumptions of the proposition. Since the Fourier coefficients of $f$ are summable (i.e. $f$ is in the Wiener algebra of $\mathbb{U}^{k}$ ), the corresponding Fourier series converges uniformly to a function which is necessarily equal to $f$ , since it has the same Fourier coefficients. We can then write:

[TABLE]

which implies

[TABLE]

By assumption, the last sum is dominated by

[TABLE]

which is finite by the assumptions made in the proposition, and then $O(N^{-1/2+\epsilon})$ .

∎

5. Moments of order different from two

Since we have a law of large numbers on $\mu_{k,N}$ , with rate of decay of order $N^{-1/2+\epsilon}$ , it is natural to look if we have a central limit theorem. In order to do that, a possibility consists in studying moments of sums in $n$ of products of variables from $X_{n+1}$ to $X_{n+k}$ . For the sums $\sum_{n=1}^{N}X_{n}$ , we do not have convergence to a non-zero Gaussian random variable after normalization by $1/\sqrt{N}$ . Indeed, the second moment of the absolute value of the renormalized sum $\frac{1}{\sqrt{N}}\sum_{n=1}^{N}X_{n}$ is equal to $1$ , so if this variable converges to a non-zero complex Gaussian variable, we need to have the convergence of

[TABLE]

towards a non-zero constant. In [12], Harper, Nikeghbali and Radziwiłł prove that the quantity just above decays at most like $(\log\log N)^{-3+o(1)}$ when $N$ goes to infinity, whereas a conjecture by Helson [14] states that it tends to [math]. The order of magnitude of the left-hand side has later been found by Harper in [10]: it is $(\log\log N)^{-1/4}$ , which in particular proves Helson’s conjecture.

On the other hand, an equivalent of the moments of $\frac{1}{\sqrt{N}}\left|\sum_{n=1}^{N}X_{n}\right|$ of even integer order are computed in [12] and [13], and they are not bounded with respect to $N$ : the moment of order $2p$ is equivalent to an explicit constant times $(\log N)^{(p-1)^{2}}$ . The order of magnitude of the moments of any positive order, not necessarily integer, is given by Harper in [10] and [11].

In the case of sums different from $\sum_{n=1}^{N}X_{n}$ , the moment computations involve arithmetic problems of different nature: here, we look in some detail the case of the sum $\sum_{n=1}^{N}X_{n}X_{n+1}$ . In this case, the fact that consecutive, and then necessarily coprime integers are involved gives more independence than when we study the sum $\sum_{n=1}^{N}X_{n}$ . In particular, it seems reasonable to expect that $\sum_{n=1}^{N}X_{n}X_{n+1}$ satisfies the same central limit theorem as the sum of i.i.d. uniform variables on the unit circle, and that this fact can be proven by moment computations. The convergence of the second moment is obvious, and we will now show that the convergence of the fourth moment also occurs. We start with the following result:

Proposition 5.1.

We have

[TABLE]

where $\mathcal{N}(N)$ (resp. $\mathcal{N}_{=}(N)$ ) is the number of solutions of the diophantine equation $a(a+1)d(d+1)=b(b+1)c(c+1)$ such that the integers $a,b,c,d$ satisfy $0<a<b<c<d\leq N$ (resp. $0<a<b=c<d\leq N$ ). Moreover, for all $\epsilon>0$ , there exists $C_{\epsilon}>0$ , independent of $N$ , such that for all $N\geq 8$ ,

[TABLE]

Hence,

[TABLE]

Proof.

Expanding the fourth moment, we immediately obtain that it is equal to the total number of solutions of the previous diophantine equation, with $a,b,c,d\in\{1,2,\dots,N\}$ . One has $2N^{2}-N$ trivial solutions: $N(N-1)$ for which $a=c\neq b=d$ , $N(N-1)$ for which $a=b\neq c=d$ , $N$ for which $a=b=c=d$ . It remains to count the number of non-trivial solutions. Such a solution has a minimal element among $a,b,c,d$ . This element is unique: if two minimal elements are on the same side, then necessarily $a=b=c=d$ , if two minimal elements are on different sides, then the other elements should be equal, which also gives a trivial solution. Dividing the number of solutions by four, we can assume that $a$ is the unique smallest integer, which implies that $d$ is the largest one. For $b=c$ , we get $\mathcal{N}_{=}(N)$ solutions, and for $b\neq c$ , we get $2\mathcal{N}_{=}(N)$ solutions, the factor $2$ coming from the possible exchange between $b$ and $c$ .

The lower bound $N/2$ comes from the solutions $(1,3,3,8)$ and $(1,2,5,9)$ for $8\leq N\leq 24$ , and from the solutions of the form $(n,2n+1,3n,6n+2)$ for $N\geq 25$ .

Let us now prove the upper bound. We start by slightly simplifying the equation by introducing the odd integers $A=2a+1$ , $B=2b+1$ , $C=2c+1$ , $D=2d+1$ , which should satisfy:

[TABLE]

If $A,B,C,D$ are large, then $AD$ and $BC$ should be odd and close to each other. It is then quite natural to introduce

[TABLE]

which is expected to be small with respect to $A,B,C,D$ . More precisely, since $B$ and $C$ are closer to each other than $A$ and $D$ , we need

[TABLE]

and then $\delta>0$ , since

[TABLE]

The last equality, gives, after factorizing the left-hand side and replacing $BC$ by $AD-2\delta$ :

[TABLE]

and in particular

[TABLE]

If we neglect the term $4\delta(\delta+1)$ , expected to be small with respect to $AD$ , we get the positivity of a quadratic form in $A$ and $D$ , which gives a restriction on the possible values of the ratio $D/A$ . More precisely, if we assume $1<D/A\leq 2\delta+2$ , we deduce

[TABLE]

and then

[TABLE]

$AD\leq 2\delta+1$ since it is an odd integer, and then $BC=AD-2\delta\leq 1$ , which gives a contradiction. Any solution should then satisfy $D/A>2\delta+2$ . We now discuss in function of the value of $\delta$ . For $\delta>\sqrt{N}$ , we have necessarily $A<D/(2\sqrt{N}+2)\leq(2N+1)/(2\sqrt{N}+2)=O(\sqrt{N})$ , and then $a=O(\sqrt{N})$ , and then there are only $O(N^{3/2})$ possibilities for the couple $(a,d)$ . Now, $b$ and $c$ should be divisors of $a(a+1)d(d+1)=O(N^{4})$ , and by the classical divisor bound, we deduce that there are $O(N^{\epsilon})$ possibilities for $(b,c)$ when $a$ and $d$ are chosen. Hence, the number of solutions for $\delta>\sqrt{N}$ is bounded by the estimate we have claimed.

It remains to bound the number of solutions for $\delta\leq\sqrt{N}$ : we will get a bound for the number of solutions for each value of $\delta$ , which will be multiplied by $\sqrt{N}$ at the end. Each solution should satisfy

[TABLE]

i.e. by writing the quadratic form in $A$ and $D$ as a difference of squares:

[TABLE]

We know that $D\geq A(2\delta+2)$ , and then $0<D-(2\delta+1)A\leq 2N+1$ , which gives, for each value of $\delta$ , $O(N)$ possibilities for $D-(2\delta+1)A$ . For the moment, let us admit that for each of these possibilities, there are $O(N^{\epsilon})$ choices for $B-C$ and $A$ . Then, for fixed $\delta$ , we have $O(N^{1+\epsilon})$ choices for $(D-(2\delta+1)A,A,B-C)$ . For each choice, $B-C,A,D$ are fixed, and then also $BC=AD-2\delta$ , and finally $B$ and $C$ . Hence, we have $O(N^{1+\epsilon})$ solutions for each $\delta\leq\sqrt{N}$ , and then $O(N^{3/2+\epsilon})$ solutions by counting all the possible $\delta$ .

The claim we have admitted is a consequence of the following fact we will prove now: for $\epsilon>0$ , the number of representations of $M$ in integers by the quadratic form $X^{2}+PY^{2}$ is $O(M^{\epsilon})$ , uniformly in the strictly positive integer $P$ . Indeed, for such a representation, the ideal $(X+Y\sqrt{-P})$ should be a divisor of $(M)$ in the ring of integers $\mathcal{O}_{P}$ of $\mathbb{Q}[\sqrt{-P}]$ , and each such ideal gives at most $6$ couples $(X,Y)$ representing $M$ . Indeed, the group of invertible elements in $\mathcal{O}_{P}$ has order at most $6$ . This fact is classical (see for example Jarvis [16], Chapter 6), and can be proven as follows: if $\alpha+\beta\sqrt{-P}$ is invertible in $\mathcal{O}_{P}$ for $\alpha,\beta\in\mathbb{Q}$ , then $\alpha-\beta\sqrt{-P}$ is also invertible in $\mathcal{O}_{P}$ , and

[TABLE]

is an integer since it is in $\mathcal{O}_{P}$ , whereas

[TABLE]

is an invertible integer, necessarily equal to $1$ . Hence, $\alpha+\beta\sqrt{-P}$ is a complex number of modulus $1$ , with real part equal to $-1,-1/2,0,1/2$ or $1$ , i.e. a fourth root or a sixth root of unity. We now only need to bound the number of divisors of $(M)$ in $\mathcal{O}_{P}$ by $O(M^{\epsilon})$ , uniformly in $P$ . The number of divisors of $(M)$ is $\prod_{\mathfrak{p}}(v_{\mathfrak{p}}(M)+1)$ , where we have the prime ideal decomposition

[TABLE]

Now, by considering the decomposition of prime numbers as products of ideals, we deduce:

[TABLE]

$\mathfrak{p}_{p}$ denoting an ideal of norm $p$ , and then the number of divisors of $(M)$ is

[TABLE]

where $\tau(M)$ is the number of divisors, in the usual sense, of the integer $M$ . This gives the desired bound $O(M^{\epsilon})$ . ∎

Remark 5.2.

Using the previous proof, one can show the following quite curious property: all the solutions of $a(a+1)d(d+1)=b(b+1)c(c+1)$ in integers $0<a<b\leq c<d$ satisfy $d/a>3+2\sqrt{2}$ . Indeed, let us assume the contrary. With the previous notation, $3+2\sqrt{2}\geq d/a\geq D/A>2\delta+2$ , and then $\delta=1$ , which gives $A^{2}-6AD+D^{2}+8\geq 0$ , i.e.

[TABLE]

a contradiction since $1<d/a\leq 3+2\sqrt{2}$ implies $a^{2}-6ad+d^{2}\leq 0$ . The bound $3+2\sqrt{2}$ is sharp, since we have the solutions of the form $(u_{2k},u_{2k+1},u_{2k+1},u_{2k+2})$ , where

[TABLE]

A consequence of the previous proposition corresponds to a bound on all the moments of order [math] to $4$ :

Corollary 5.3.

We have, for all $q\in[0,2]$ ,

[TABLE]

where $c_{q}=2^{-(q-1)_{-}}\geq 1/2$ and $C_{q}=2^{(q-1)_{+}}\leq 2$ .

Proof.

Hölder inequality implies that the logarithm of the $2q$ -th moment of a nonnegative random variable is a convex function of $q$ . Now, we have proven that this logarithm is equal to [math] for $q=0$ and $q=1$ and to $\ln 2+o(1)$ for $q=2$ . The corollary can now be deduced from the following fact, easy to check: if $f$ is a convex fonction from $[0,2]$ to $\mathbb{R}$ such that $f(0)=f(1)=0$ , $f(2)=1$ , then

[TABLE]

and

[TABLE]

∎

We have proven that the fourth moment of $\left|\frac{1}{\sqrt{N}}\sum_{n=1}^{N}X_{n}X_{n+1}\right|$ converges to $2$ , which is also the limit of the fourth moment of $\left|\frac{1}{\sqrt{N}}\sum_{n=1}^{N}Z_{n}\right|$ where $(Z_{n})_{n\geq 1}$ are i.i.d. random variables, uniform on the unit circle. Unfortunately, we are not able to prove a similar convergence for higher moments, and then we do not know how to prove a central limit theorem. However, the following result holds:

Proposition 5.4.

If for all integers $q\geq 1$ , the number of non-trivial solutions $(n_{1},\dots,n_{2q})\in\{1,\dots,N\}^{2q}$ of the diophantine equation

[TABLE]

is negligible with respect to the number of trivial solutions when $N\rightarrow\infty$ (i.e. $o(N^{q})$ ) then we have

[TABLE]

where $\mathcal{N}_{\mathbb{C}}$ denotes a standard Gaussian complex variable, i.e. $(\mathcal{N}_{1}+i\mathcal{N}_{2})/\sqrt{2}$ where $\mathcal{N}_{1},\mathcal{N}_{2}$ are independent standard real Gaussian variables.

Proof.

If

[TABLE]

then for integers $q_{1},q_{2}\geq 0$ , the moment $\mathbb{E}[Y_{N}^{q_{1}}\overline{Y_{N}}^{\,q_{2}}]$ is equal to $N^{-(q_{1}+q_{2})/2}$ times the number of solutions $(n_{1},\dots,n_{q_{1}+q_{2}})\in\{1,\dots,N\}^{q_{1}+q_{2}}$ of

[TABLE]

If $0\leq q_{1}<q_{2}$ , there are at most $N^{q_{1}}$ choices for $n_{1},\dots,n_{q_{1}}$ , and once these integers are fixed, at most $N^{o(1)}$ choices for $n_{q_{1}+1},\dots,n_{q_{1}+q_{2}}$ by the divisor bound. Hence, the moment tends to zero when $N\rightarrow\infty$ , and we have the same conclusion for $0\leq q_{2}<q_{1}$ . Finally, if $0\leq q_{1}=q_{2}=q$ , by assumption, the moment is equivalent to $N^{-q}$ times the number of trivial solutions of the corresponding diophantine equation, i.e. to the corresponding moment for the sum of i.i.d. variables, uniform on the unit circle. By the central limit theorem,

[TABLE]

We have then proven that for all integers $q_{1},q_{2}\geq 0$ ,

[TABLE]

which gives the claim. ∎

We have proven the assumption of the previous proposition for $q\in\{1,2\}$ , however, our method does not generalize to larger values of $q$ . The divisor bound gives immediately a domination by $N^{q+o(1)}$ for the number of solutions, and then it seems reasonable to expect that the arithmetic constraints implied by the equation are sufficient to save at least a small power of $N$ . Note that the situation is different for the sum $\sum_{n=1}^{N}X_{n}$ : for example, for $q=2$ , the number of non-trivial solutions of the equation $n_{1}n_{2}=n_{3}n_{4}$ for $1\leq n_{1},n_{2},n_{3},n_{4}\leq N$ is not $o(N^{2})$ , as we can see by considering the equalities $a(2b)=b(2a)$ for $a$ and $b$ odd, $a<b$ .

The previous proposition giving a ”conditional CLT” can be generalized to the sums of the form

[TABLE]

when the $m_{j}$ ’s have the same sign. The situation is more difficult if the $m_{j}$ ’s have different signs since the divisor bound alone does not directly give a useful bound on the number of solutions.

6. Convergence of the empirical measure in the case of roots of unity

Here, we suppose that $(X_{p})_{p\in\mathcal{P}}$ are i.i.d. uniform on the set $\mathbb{U}_{q}$ of $q$ -th roots of unity, $q\geq 1$ being fixed. With the notation of the previous section, we now get:

Proposition 6.1.

Let $m_{1},\dots,m_{k}$ be integers, not all divisible by $q$ , let $\epsilon>0$ and let $N>N^{\prime}\geq 0$ . Then,

[TABLE]

and

[TABLE]

where $C_{q,k,\epsilon}>0$ depends only on $q,k,\epsilon$ .

Proof.

We can obviously assume that $m_{1},\dots,m_{k}$ are between [math] and $q-1$ , which gives finitely many possibilities for these integers, depending only on $q$ and $k$ . We can then suppose that $m_{1},\dots,m_{k}$ are fixed at the beginning. We have to upper bound the number of couples $(n_{1},n_{2})$ on $\{N^{\prime}+1,\dots,N\}^{2}$ such that

[TABLE]

where, in this proof, $(\mathbb{Q}_{+}^{*})^{q}$ denotes the set of $q$ -th powers of positive rational numbers. Now, any positive integer $r$ can be decomposed as a product of a ”smooth” integer whose prime factors are all strictly smaller than $k$ , and a ”rough” integer whose prime factors are all larger than or equal to $k$ . If the ”rough” integer is denoted $\sharp_{k}(r)$ , the condition just above implies:

[TABLE]

Now, the numerator and the denominator of this expression can both be written in a unique way as a product of a $q$ -th perfect power and an integer whose $p$ -adic valuation is between [math] and $q-1$ for all $p\in\mathcal{P}$ . If the quotient is a $q$ -th power, necessarily the numerator and the denominator have the same ” $q$ -th power free” part. Hence, there exists a $q$ -th power free integer $g$ such that

[TABLE]

$\mathbb{N}^{q}$ being the set of $q$ -th powers of positive integers. Hence, the number of couples $(n_{1},n_{2})$ we have to estimate is bounded by

[TABLE]

where $\mathcal{N}(q,k,g,N^{\prime},N)$ is the number of integers $n\in\{N^{\prime}+1,\dots,N\}$ such that

[TABLE]

If a prime number $p\in\mathcal{P}$ divides $n+j$ and $n+j^{\prime}$ for $j\neq j^{\prime}\in\{1,\dots,k\}$ , it divides $|j-j^{\prime}|\in\{1,\dots,k-1\}$ , and then $p<k$ . Hence, the rough parts of $(n+j)^{m_{j}}$ are pairwise coprime. Now, if $g_{1},\dots,g_{k}$ are the $q$ -th power free integers such that $\sharp_{k}[(n+j)^{m_{j}}]\in g_{j}\mathbb{N}^{q}$ , we have $g_{1}g_{2}\dots g_{k}\in g\mathbb{N}^{q}$ . Now, $g_{1},\dots,g_{k}$ are coprime, and then $g_{1}g_{2}\dots g_{k}$ is $q$ -th power free, which implies $g_{1}\dots g_{k}=g$ . Hence

[TABLE]

Let us now fix an index $j_{0}$ such that $m_{j_{0}}$ is not multiple of $q$ . We have

[TABLE]

where $\operatorname{rad}(g_{j})$ denotes the product of the distinct prime factors of $g_{j}$ . The condition on $(n+j_{0})^{m_{j_{0}}}$ means that for all $p\in\mathcal{P}$ , $p\geq k$ ,

[TABLE]

i.e. $v_{p}(g_{j_{0}})$ is divisible by $\operatorname{gcd}(m_{j_{0}},q)$ and

[TABLE]

where $\rho_{j_{0}}:=q/\operatorname{gcd}(m_{j_{0}},q)$ . Since $m_{j_{0}}/\operatorname{gcd}(m_{j_{0}},q)$ is coprime with $\rho_{j_{0}}$ , the last congruence is equivalent to a congruence modulo $\rho_{j_{0}}$ between $v_{p}(n+j_{0})$ and a fixed integer, which is not divisible by $\rho_{j_{0}}$ if and only if $p$ divides $g_{j_{0}}$ . We deduce that the condition on $(n+j_{0})^{m_{j_{0}}}$ implies that $\sharp_{k}(n+j_{0})\in h(q,m_{j_{0}},g_{j_{0}})\mathbb{N}^{\rho_{j_{0}}}$ , i.e.

[TABLE]

where $\alpha$ is a $\rho_{j_{0}}$ -th power free integer whose prime factors are strictly smaller than $k$ , $A$ is an integer and $h(q,m_{j_{0}},g_{j_{0}})$ is an integer depending only on $q$ , $m_{j_{0}}$ and $g_{j_{0}}$ , which is divisible by $\operatorname{rad}(g_{j_{0}})$ . For a fixed value of $\alpha$ , the values of $A$ should be in the interval

[TABLE]

whose size is at most

[TABLE]

by the concavity of the power $1/\rho_{j_{0}}$ , the fact that $\rho_{j_{0}}\geq 2$ since $m_{j_{0}}$ is not divisible by $q$ , which implies $x^{1/\rho_{j_{0}}}\leq 1+\sqrt{x}$ . Now, the conditions on $n+j$ for $j\neq j_{0}$ imply a condition of congruence for $\alpha h(q,m_{j_{0}},g_{j_{0}})A^{\rho_{j_{0}}}$ , modulo all the primes dividing one of the $g_{j}$ ’s for $j\neq j_{0}$ . These primes do not divide $\alpha$ , since $\alpha$ has all prime factors smaller than $k$ , and $g_{j}$ divides $\sharp_{k}[(n+j)^{m_{j}}]$ . They also do not divide $h(q,m_{j_{0}},g_{j_{0}})$ , since this integer has the same prime factors as $g_{j_{0}}$ , which is prime with $g_{j}$ . Hence, we get a condition of congruence for $A^{\rho_{j_{0}}}$ modulo all primes dividing $g_{j}$ for some $j\neq j_{0}$ . For each of these primes, this gives at most $\rho_{j_{0}}\leq q$ congruence classes for $A$ , and then, by the chinese reminder theorem, we get at most $q^{\omega\left(\prod_{j\neq j_{0}}g_{j}\right)}$ classes modulo $\prod_{j\neq j_{0}}\operatorname{rad}(g_{j})$ , where $\omega$ denotes the number of prime factors of an integer. The number of integers $A\in I$ satisfying the congruence conditions is then at most:

[TABLE]

where $\tau(g)$ denotes the number of divisors of $g$ . Now, $\alpha$ has prime factors smaller than $k$ and $p$ -adic valuations smaller than $q$ , which certainly gives $\alpha\leq(k!)^{q}$ . Hence, by considering all the possible values of $\alpha$ , and all the possible $g_{1},\dots,g_{k}$ , which should divide $g$ , we deduce

[TABLE]

If $\mathcal{N}(q,k,g,N^{\prime},N)>0$ , we have necessarily

[TABLE]

Using the divisor bound, we deduce that for all $\epsilon>0$ , there exists $C^{(1)}_{q,k,\epsilon}$ such that for all $g\leq(1+k)^{kq}N^{kq}$ ,

[TABLE]

and then

[TABLE]

i.e.

[TABLE]

which implies

[TABLE]

since the right-hand side is nonnegative. Summing the square of this bound for all possible $g$ gives

[TABLE]

Now, since all numbers up to $(1+k)^{kq}N^{kq}$ have prime factors smaller than this quantity, we deduce, using the multiplicativity of the radical:

[TABLE]

which, by Mertens’ theorem, is smaller than a constant, depending on $k$ and $q$ , times $\log^{q-1}(1+N)$ . We deduce that there exists a constant $C^{(2)}_{q,k,\epsilon}>0$ , such that

[TABLE]

Now, it is clear that

[TABLE]

since this sum counts all the integers $n$ from $N^{\prime}+1$ to $N$ , regrouped in function of the $q$ -th power free part of $\sharp_{k}\left(\prod_{j=1}^{k}(n+j)^{m_{j}}\right)$ . Using the inequality $x^{2}\leq(x-a)_{+}^{2}+2ax$ , available for all $a,x\geq 0$ , we deduce

[TABLE]

This result gives the first inequality of the proposition, for

[TABLE]

The second inequality is obtained by taking $N^{\prime}=0$ and dividing by $N^{2}$ . ∎

Corollary 6.2.

For all $(m_{1},\dots,m_{k})\in\mathbb{Z}^{k}$ , $\hat{\mu}_{k,N}(m_{1},\dots,m_{k})$ converges in $L^{2}$ , and then in probability, to the corresponding Fourier coefficient of the uniform distribution $\mu_{k,q}$ on $\mathbb{U}_{q}^{k}$ . In other words, $\mu_{k,N}$ converges weakly in probability to $\mu_{k,q}$ .

We also have a strong law of large numbers.

Proposition 6.3.

Almost surely, $\mu_{k,N}$ weakly converges to $\mu_{k,q}$ . More precisely, for all $(t_{1},\dots,t_{k})\in(\mathbb{U}_{q})^{k}$ , the proportion of $n\leq N$ such that $(X_{n+1},\dots,X_{n+k})=(t_{1},\dots,t_{k})$ is almost surely $q^{-k}+O(N^{-1/2+\epsilon})$ for all $\epsilon>0$ .

Proof.

By Lemma 4.3 and Proposition 6.1, we deduce that almost surely, for all $\epsilon>0$ , $0\leq m_{1},\dots,m_{k}\leq q-1$ , $(m_{1},\dots,m_{k})\neq(0,0,\dots,0)$ ,

[TABLE]

Since we have finitely many values of $m_{1},\dots,m_{k}$ , we can take the $O$ uniform in $m_{1},\dots,m_{k}$ . Then, by inverting discrete Fourier transform on $\mathbb{U}_{q}^{k}$ , we deduce the claim. ∎

7. More general distributions on the unit circle

In this section, $(X_{p})_{p\in\mathcal{P}}$ are i.i.d., with any distribution on the unit circle. We will study the empirical distribution of $(X_{n})_{n\geq 1}$ , but not of the patterns $(X_{n+1},\dots,X_{n+k})_{n\geq 1}$ for $k\geq 2$ . More precisely, the goal of the section is to prove a strong law of large numbers for $N^{-1}\sum_{n=1}^{N}\delta_{X_{n}}$ when $N$ goes to infinity. We will use the following result, due to Halász, Montgomery and Tenenbaum (see [7], [8], [5], [20], [28] p. 343):

Proposition 7.1.

Let $(Y_{n})_{n\geq 1}$ be a multiplicative function such that $|Y_{n}|\leq 1$ for all $n\geq 1$ . For $N\geq 3,T>0$ , we set

[TABLE]

Then:

[TABLE]

where $C>0$ is an absolute constant.

From this result, we show the following:

Proposition 7.2.

Let $(Y_{n})_{n\geq 1}$ be a random multiplicative function such that $(Y_{p})_{p\in\mathcal{P}}$ are i.i.d., with $\mathbb{P}[|Y_{p}|\leq 1]=1$ , $\mathbb{P}[Y_{p}=1]<1$ and $\mathbb{P}[Y_{p}=-1]<1$ . Then, almost surely, for all $c\in(0,1-|\mathbb{E}[\Re(Y_{2})]|)$

[TABLE]

Proof.

First, we observe that for $1<N^{\prime}<N$ integers, $\lambda>0$ ,

[TABLE]

where, by a classical refinement of the prime number theorem,

[TABLE]

for all $A>1$ . The bracket is dominated by $1/\log(N^{\prime})$ , the second part of the last integral is dominated by

[TABLE]

and the error term of the first part is dominated by $(1+\lambda)/\log^{A}(N^{\prime})$ . Hence

[TABLE]

where

[TABLE]

Now, for all $a\geq 1$ ,

[TABLE]

which gives

[TABLE]

Now, the integral of $(\sin y)/y$ on $\mathbb{R}^{*}_{+}$ is conditionally convergent: $(\sin y)/y$ tends to $1$ when $y\rightarrow 0$ and the convergence of the integral at $\infty$ is easily deduced from an integration by parts. Hence, the integral of $(\sin y/y)$ on any interval of $\mathbb{R}^{*}_{+}$ is uniformly bounded, which implies

[TABLE]

We deduce

[TABLE]

Bounding the sum on primes smaller than $N^{\prime}$ by taking the absolute value, we get:

[TABLE]

and then by taking $N^{\prime}=e^{(\log N)^{10/A}}$ , for $N$ large enough depending on $A$ ,

[TABLE]

and then by letting $A\rightarrow\infty$ and using the symmetry of the imaginary part for $\lambda\mapsto-\lambda$ ,

[TABLE]

for $N\rightarrow\infty$ . This estimate can also be deduced from known bounds on the Riemann zeta function on the line $\Re=1+1/\log N$ . Now, for all $\rho$ whose real part is in $[-1,1)$ , we have

[TABLE]

The first term is at least the sum of $1-|\Re(\rho)|$ divided by $p$ , and then at least $[1-|\Re(\rho)|+o(1)]\log\log N$ . The second term is $o(\log\log N)$ by the previous discussion. Hence,

[TABLE]

Now, let $\rho:=\mathbb{E}[Y_{2}]$ , and $Z_{p,\lambda}:=\Re[(Y_{p}-\rho)p^{-i\lambda}]$ . The variables $(Z_{p,\lambda})_{p\in\mathcal{P}}$ are centered, independent, bounded by $2$ . By Hoeffding’s lemma (see, for example, Massart [18], p. 21), for all $u\geq 0$ ,

[TABLE]

and then by independence,

[TABLE]

Applying the same inequality to $-Z_{p,\lambda}$ , we deduce

[TABLE]

The derivative of the last sum in $p$ with respect to $\lambda$ is dominated by

[TABLE]

and then the sum cannot vary more than $O(1)$ when $\lambda$ runs between two consecutive multiples of $\log^{-1}N$ . Hence,

[TABLE]

If we define, for $k\geq 1$ , $N_{k}$ as the integer part of $e^{k^{1/5}}$ , we deduce, by Borel-Cantelli lemma, that almost surely, for all but finitely many $k\geq 1$ ,

[TABLE]

If this event occurs, we deduce, using (5),

[TABLE]

Then, by Proposition 7.1, we get

[TABLE]

Since $-(1-|\Re(\rho)|)\geq-1>-5$ , we deduce

[TABLE]

which gives the claimed result along the sequence $(N_{k})_{k\geq 1}$ . Now, if $N\in[N_{k},N_{k+1}]$ , we have, since all the $Y_{n}$ ’s have modulus at most $1$ ,

[TABLE]

This allows to remove the restriction to the sequence $(N_{k})_{k\geq 1}$ . ∎

Using Fourier transform, we deduce a law of large numbers for the empirical measure $\mu_{N}=\frac{1}{N}\sum_{n=1}^{N}\delta_{X_{n}}$ , under the assumptions of this section.

Proposition 7.3.

If for all integers $q\geq 1$ , $\mathbb{P}[X_{2}\in\mathbb{U}_{q}]<1$ , then almost surely, $\mu_{N}$ tends to the uniform measure on the unit circle.

Proof.

For all $m\neq 0$ , $X_{2}^{m}$ takes its values on the unit circle, and it is not a.s. equal to $1$ . Applying the previous proposition to $Y_{n}=X_{n}^{m}$ , we deduce that $\hat{\mu}_{N}(m)$ tends to zero almost surely, which gives the desired result. ∎

Proposition 7.4.

If for $q\geq 2$ , $X_{2}\in\mathbb{U}_{q}$ almost surely, but $\mathbb{P}[X_{2}\in\mathbb{U}_{r}]<1$ for all strict divisors $r$ of $q$ , then almost surely, $\mu_{N}$ tends to the uniform measure on $\mathbb{U}_{q}$ . More precisely, almost surely, for all $t\in\mathbb{U}_{q}$ , the proportion of $n\leq N$ such that $X_{n}=t$ is $q^{-1}+O((\log N)^{-c})$ , as soon as

[TABLE]

this infimum being strictly positive.

Proof.

The infimum is strictly positive since by assumption, $\mathbb{P}[X_{2}^{m}=1]<1$ for all $m\in\{1,\dots,q-1\}$ . Now, we apply the previous result to $Y_{n}=X_{n}^{m}$ for all $m\in\{1,\dots,q-1\}$ , and we get the claim after doing a discrete Fourier inversion. ∎

Bibliography29

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Baker: Bounds for solutions of hyperelliptic equations , Proc. Cambridge Phil. Soc. 65 (1969), 439–444.
2[2] H. Bohr and B. Jessen: Über die Wertverteilung der Riemannschen Zetafunktion, Erste Mitteilung , Acta Math. 54 (1930), 1–35, Zweite Mitteilung , Acta Math. 58 (1932), 1–55.
3[3] S. Chowla: The Riemann hypothesis and Hilbert’s tenth problem , Gordon and Breach, New York, 1965.
4[4] A. Dubickas: A note on the multiplicative dependence of consecutive integers , Scient. works of Lith. Math. Soc.: suppl. to ”Liet. Matem. Rink.”, Technika, Vilnius (1998), 21–23.
5[5] A. Granville, K. Soundararajan: Decay of Mean Values of Multiplicative Functions , Canad. J. Math. 55 (2003), 1191–1230.
6[6] L. Hajdu, A. Pintér: Square product of three integers in short intervals , Math. of Computation 68 (1999), 1299–1301.
7[7] G. Halász: Über die Mittelwerte multiplikativer zahlentheoretischer Funktionen , Acta Math. Acad. Sci. Hung. 19 (1968), 365–403.
8[8] G. Halász: On the distribution of additive and the mean values of multiplicative arithmetic functions , Studia Sci. Math. Hung. 6 (1971), 211–233.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On consecutive values of random completely multiplicative functions

Abstract.

1. Introduction

2. Independence in the uniform case

Lemma 2.1**.**

Proof.

Proposition 2.2**.**

Remark 2.3**.**

Proof.

Proposition 2.4**.**

Proof.

3. Independence in the case of roots of unity

Lemma 3.1**.**

Proof.

Proposition 3.2**.**

Proof.

Proposition 3.3**.**

Proof.

Remark 3.4**.**

4. Convergence of the empirical measure in the uniform case

Proposition 4.1**.**

Proof.

Corollary 4.2**.**

Lemma 4.3**.**

Proof.

Proposition 4.4**.**

Remark 4.5**.**

Proof.

5. Moments of order different from two

Proposition 5.1**.**

Proof.

Remark 5.2**.**

Corollary 5.3**.**

Proof.

Proposition 5.4**.**

Proof.

6. Convergence of the empirical measure in the case of roots of unity

Proposition 6.1**.**

Proof.

Corollary 6.2**.**

Proposition 6.3**.**

Proof.

7. More general distributions on the unit circle

Proposition 7.1**.**

Proposition 7.2**.**

Proof.

Proposition 7.3**.**

Proof.

Proposition 7.4**.**

Proof.

Lemma 2.1.

Proposition 2.2.

Remark 2.3.

Proposition 2.4.

Lemma 3.1.

Proposition 3.2.

Proposition 3.3.

Remark 3.4.

Proposition 4.1.

Corollary 4.2.

Lemma 4.3.

Proposition 4.4.

Remark 4.5.

Proposition 5.1.

Remark 5.2.

Corollary 5.3.

Proposition 5.4.

Proposition 6.1.

Corollary 6.2.

Proposition 6.3.

Proposition 7.1.

Proposition 7.2.

Proposition 7.3.

Proposition 7.4.