Algorithmic classification of noncorrelated binary pattern sequences

Jakub Konieczny

arXiv:1905.03283·math.NT·August 24, 2021

TL;DR

This paper presents an algorithm to verify noncorrelation in binary pattern sequences, computes the number of such sequences up to length 4, and proposes a conjecture with partial verification for longer sequences.

Contribution

It introduces an algorithmic method for verifying noncorrelation and provides exact counts for sequences of certain lengths, along with a new sufficient condition for specific pattern classes.

Findings

01

Exactly 2272 noncorrelated sequences of length ≤ 4

02

A sufficient condition for noncorrelation when patterns do not end with 0

03

Conjecture on the necessity of the condition verified for lengths ≤ 5

Abstract

We show that it is possible to algorithmically verify if a given pattern sequence is noncorrelated. As an application, we compute that there are exactly $2272$ noncorrelated binary pattern sequences of length $\leq 4$ . If we restrict our attention to patterns that do not end with $0$ , we put forward a sufficient condition for a pattern sequence to be noncorrelated. We conjecture that this condition is also necessary, and verify this conjecture for lengths $\leq 5$ .

Equations

$$\lim_{N\to\infty}\left|\left\{0\leq n<N\ \middle|\ t(An+B)=+1\right\}\right|/N=1/2$$

$$\gamma_{a}(m):=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n)\overline{a}(n+m),$$

$$\gamma_{t}(2^{\ell})=-1/3\quad\text{for all }\ell\in\mathbb{N}_{0},$$

$$\lim_{N\to\infty}\frac{1}{N}\sum_{m=0}^{N-1}\gamma_{t}(m)^{2}=0,$$

$$a(n)=a_{A}(n)=(-1)^{\#(A,n)},$$

$$a_{A}(n)=(-1)^{\#(A,n)}.$$

$$a_{A}\cdot a_{B}=a_{A\oplus B}.$$

$$\#(v,n)=\sum_{i=0}^{k-1}\#(iv,n)$$

$$\lim_{N\to\infty}\frac{\left|\left\{n<N\ \middle|\ a(n)=i,\ a(n+m)=i^{\prime}\right\}\right|}{N}=\frac{1}{4}\quad\text{for all }i,i^{\prime}\in\{+1,-1\}.$$

$$\Lambda_{i}a(n)=a(kn+i).$$

$$\mathcal{N}_{k}(a)=\left\{n\mapsto a(k^{\alpha}n+r)\ \middle|\ \alpha,r\in\mathbb{N}_{0},\ 0\leq r<k^{\alpha}\right\}\subseteq\mathbb{C}^{\mathbb{N}_{0}}.$$

$$\mathrm{M}_{a}:=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n),\qquad\mathrm{M}_{a}^{\log}:=\lim_{N\to\infty}\frac{1}{\log N}\sum_{n=0}^{N-1}\frac{1}{n+1}a(n).$$

$$\#(v,kn+i)=\begin{cases}\#(v,n)+1&\text{if }v\text{ is a suffix of }\mathtt{0}^{\left|v\right|-1}(kn+i)_{k},\\ \#(v,n)&\text{otherwise.}\end{cases}$$

$$\left(2^{k^{\ell-1}}\right)^{k-1}\cdot 2^{k^{\ell-1}-1}=2^{k^{\ell}-1}.$$

$$a([u\mathtt{0}]_{k})=a([u]_{k}).$$

$$a([u\mathtt{0}]_{k})=(-1)^{\#(A,u\mathtt{0})}=(-1)^{\#(A,u)+1}=-a([u]_{k}),$$

$$\sum_{i=0}^{k-1}\#(vi,n)-\#(v,n)=\begin{cases}1&\text{if }v\text{ is a suffix of }n,\\ 0&\text{otherwise.}\end{cases}$$

$$\gamma_{a,b}(m):=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n)\overline{b}(n+m),$$

$$\gamma_{a,b}^{\log}(m):=\lim_{N\to\infty}\frac{1}{\log N}\sum_{n=0}^{N-1}\frac{1}{n+1}a(n)\overline{b}(n+m),$$

$$\gamma_{a,b}^{\log}(m)=\frac{1}{k}\sum_{i=0}^{k-1}\gamma_{a_{i}^{\prime},b_{i}^{\prime}}^{\log}(m_{i}^{\prime}),$$

$$a_{i}^{\prime}=\Lambda_{i}a,\qquad b_{i}^{\prime}=\Lambda_{m+i\bmod k}b,\qquad m_{i}^{\prime}=\left\lfloor\frac{m+i}{k}\right\rfloor.$$

$$\sum_{n=0}^{N-1}\frac{a(n)b(n+m)}{n+1}=\frac{1}{k}\sum_{i=0}^{k-1}\sum_{n=0}^{\left\lfloor N/k\right\rfloor-1}\frac{a_{i}^{\prime}(n)b_{i}^{\prime}(n+m_{i}^{\prime})}{n+1}+O(1),$$

$$\frac{1}{\log N}\sum_{n=0}^{N-1}\frac{a(n)b(n+m)}{n+1}$$

$$\gamma_{a,b}(m)=\frac{1}{k}\sum_{i=0}^{k-1}\gamma_{a_{i}^{\prime},b_{i}^{\prime}}(m_{i}^{\prime}),$$

$$\gamma_{a,b}(m;x)=\frac{1}{\left\lfloor x\right\rfloor}\sum_{n=0}^{\left\lfloor x\right\rfloor-1}a(n)\overline{b}(n+m).$$

$$\gamma_{a,b}(m;x)=\frac{1}{k}\sum_{i=0}^{k-1}\gamma_{a_{i}^{\prime},b_{i}^{\prime}}(m_{i}^{\prime};x/k)+O(1/x),$$

$$\gamma_{a,b}(1;x)=\frac{1}{k}\sum_{i=0}^{k-2}\gamma_{a_{i}^{\prime},b_{i}^{\prime}}(0;x/k)+\frac{1}{k}\gamma_{a_{k-1}^{\prime},b_{k-1}^{\prime}}(1;x/k)+O(1/x).$$

$$\gamma_{a,b}(1;x)=\sum_{a^{\prime},b^{\prime}\in\mathcal{N}}w_{a^{\prime},b^{\prime}}^{(t)}\gamma_{a^{\prime},b^{\prime}}(0;x/k^{t})+\frac{1}{k^{t}}\gamma_{a^{(t)},b^{(t)}}(1;x/k^{t})+O(k^{t}/x).$$

$$\limsup_{x\to\infty}\ \gamma_{a,b}(1;x)-\gamma_{a,b}^{(t)}(1)=O(1/k^{t}).$$

$$\gamma_{a,b}(1)=\lim_{x\to\infty}\gamma_{a,b}(1;x)=\lim_{t\to\infty}\gamma_{a,b}^{(t)}(1).$$

Full text

Algorithmic classification of

noncorrelated binary pattern sequences

Jakub Konieczny

Camille Jordan Institute, Claude Bernard University Lyon 1, 43 Boulevard du 11 novembre 1918, 69622 Villeurbanne Cedex, France

Faculty of Mathematics and Computer Science, Jagiellonian University in Kraków, Łojasiewicza 6, 30-348 Kraków, Poland

[email protected]

Abstract.

The main subject of this paper is the class of binary pattern sequences, that is, sequences of the form $(-1)^{\#(A,n)}$ where $A$ is a set of strings of $\mathtt{0}$ s and $\mathtt{1}$ s, and $\#(A,n)$ denotes the total number of times patterns from $A$ appear in the binary expansion of $n$ . A sequence is said to be noncorrelated if the corresponding spectral measure is equal to the Lebesgue measure.

We show that it is possible to algorithmically verify if a given binary pattern sequence is noncorrelated. As an application, we compute that there are exactly $2272$ noncorrelated binary pattern sequences of length $\leq 4$ . If we restrict our attention to patterns that do not end with $\mathtt{0}$ , we put forward a sufficient condition for a pattern sequence to be noncorrelated. We conjecture that this condition is also necessary, and verify this conjecture for lengths $\leq 5$ .

2010 Mathematics Subject Classification:

Primary: 47B15; Secondary: 11B50

1. Introduction

Uniformity properties of sequences defined in terms of digital expansions have long been studied. Consider, for instance, the Thue–Morse sequence $t(n)=(-1)^{s_{2}(n)}$ , where $s_{2}(n)$ denotes the sum of binary digits of $n$ , discussed at length by Allouche and Shallit in the survey paper [AS99]. It was shown by Gelfond [Gel68] (see also [MS98]) that $t(n)$ is equidistributed in arithmetic progressions:

$$\lim_{N\to\infty}\left|\left\{0\leq n<N\ \middle|\ t(An+B)=+1\right\}\right|/N=1/2\tag{1}$$

for all $A\in\mathbb{N}$ and $B\in\mathbb{N}_{0}$ , and the rate of convergence can be made explicit. Analogous results hold also for other bases, with mild additional assumptions to account for the fact that $s_{k}(n)\equiv n\bmod{k-1}$ . Mauduit and Sárközy [MS98] also observed that the Thue–Morse sequence admits large self-correlations. Here, the (self-)correlation coefficients of a sequence $a\colon\mathbb{N}\to\mathbb{C}$ are defined by

$$\gamma_{a}(m):=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n)\overline{a}(n+m),\tag{2}$$

and a simple computation shows that $\gamma_{t}(1)=-1/3\neq 0$ (see Section 3 for details). By the same token,

$$\gamma_{t}(2^{\ell})=-1/3\quad\text{for all }\ell\in\mathbb{N}_{0},\tag{3}$$

meaning in particular that $\gamma_{t}(m)\not\to 0$ as $m\to\infty$ . On the other hand, the coefficients $\gamma_{t}(m)$ tend to be rather small; in particular

$$\lim_{N\to\infty}\frac{1}{N}\sum_{m=0}^{N-1}\gamma_{t}(m)^{2}=0,\tag{4}$$

which follows e.g. from results in [Coq76]. The spectral measure $\mu_{a}$ on $\mathbb{R}/\mathbb{Z}$ associated to a sequence $a\colon\mathbb{N}\to\mathbb{C}$ is characterised by the identity $\int_{\mathbb{R}/\mathbb{Z}}\exp(2\pi imt)d\mu_{a}(t)=\gamma_{a}(m)$ , and (4) is equivalent to absolute continuity of $\mu_{t}$ .
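The value $\gamma_{t}(1)=-1/3$ and the identity (3) can be checked numerically. The following is a minimal sketch, not taken from the paper: the helper names `thue_morse` and `empirical_correlation` and the cutoff `N` are illustrative choices, and the finite sums only approximate the limits in (2).

```python
def thue_morse(n: int) -> int:
    """t(n) = (-1)^{s_2(n)}, where s_2(n) is the sum of binary digits of n."""
    return -1 if bin(n).count("1") % 2 else 1


def empirical_correlation(a, m: int, N: int) -> float:
    """Finite-N approximation (1/N) * sum_{n<N} a(n) a(n+m) of gamma_a(m)."""
    return sum(a(n) * a(n + m) for n in range(N)) / N


N = 2 ** 14  # illustrative cutoff, not a value used in the paper
gamma_1 = empirical_correlation(thue_morse, 1, N)
# Lags m = 1, 2, 4, 8, i.e. m = 2^l for l = 0, 1, 2, 3.
gamma_powers = [empirical_correlation(thue_morse, 2 ** l, N) for l in range(4)]
print(gamma_1, gamma_powers)
```

All printed values should be close to $-1/3$, in line with (3).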

Many other notions of uniformity have been investigated for the Thue–Morse sequence. In an influential paper, Mauduit and Rivat showed that $t(n)$ and its analogues in different bases are equidistributed along the primes [MR10]. Drmota, Mauduit and Rivat [DMR19] showed that $t(n^{2})$ is a normal sequence, meaning that each finite sequence of $\pm 1$ s appears with the expected frequency. Spiegelhofer [Spi18] proved that $t(n)$ has level of distribution $1$ , which is a far-reaching quantitative generalisation of (1) and can be used to show equidistribution along Piatetski–Shapiro sequences $\left\lfloor n^{c}\right\rfloor$ , $1<c<2$ (see [FM96] for analogous, but somewhat weaker, results in different bases). It was also shown by the author [Kon19] that $t(n)$ has small Gowers norms, meaning that it is uniform from the point of view of higher order Fourier analysis.

Another oft-studied sequence carries the name of Rudin–Shapiro and is given by $r(n)=(-1)^{\#(\mathtt{11},n)}$ , where $\#(\mathtt{11},n)$ denotes the number of times the pattern $\mathtt{11}$ appears in the binary expansion of $n$ , allowing overlaps. Similarly to the Thue–Morse sequence, the Rudin–Shapiro sequence is equidistributed in arithmetic progressions and along the primes [MR15], and has small Gowers norms [Kon19]. However, in contrast to (3), the Rudin–Shapiro sequence is noncorrelated, by which we mean that $\gamma_{r}(m)=0$ for all $m\geq 1$ or, equivalently, that the spectral measure $\mu_{r}$ is the Lebesgue measure. Intuitively, noncorrelated sequences are free of any sort of periodic behaviour.

The Thue–Morse and the Rudin–Shapiro sequences are special cases of what we call binary pattern sequences. In general, a binary pattern sequence takes the form

$$a(n)=a_{A}(n)=(-1)^{\#(A,n)},\tag{5}$$

where $A$ is a finite set of patterns over the alphabet $\{\mathtt{0},\mathtt{1}\}$ and $\#(A,n)$ denotes the total number of appearances of patterns from $A$ in the binary expansion of $n$ (see Section 2 for details). Pattern sequences were studied in a more general context by Morton and Mourant [MM89, Mor90], Coquet, Kamae and Mendès France [CKMF77], and Boyd, Cook and Morton [BCM89]. Generalised Rudin–Shapiro sequences and their correlation coefficients were studied by Allouche and Liardet [AL91]. Finally, Zheng, Peng and Kamae [ZPK18] studied correlation coefficients of binary pattern sequences, and obtained a complete classification of noncorrelated sequences corresponding to sets of patterns of length $\leq 3$ . Examples of sets $A$ that give rise to noncorrelated sequences include:

• $\{\mathtt{11}\}$ (then $a_{A}(n)=r(n)$ is the Rudin–Shapiro sequence);

• $\{\mathtt{11},\mathtt{1}\}$ (then $a_{A}(n)=r(n)t(n)$ ) and $\{\mathtt{10},\mathtt{1}\}$ (then $a_{A}(n)=(-1)^{n}r(n)$ );

• $\{\mathtt{101},\mathtt{111}\}$ , or more generally $\{\mathtt{101},\mathtt{111}\}\cup B$ for a set $B\subseteq\{\mathtt{0},\mathtt{1}\}^{2}$ .
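The examples above can be explored with a direct implementation of $a_{A}$ in base $2$ . The sketch below is illustrative only: the function names and the cutoff `N` are assumptions, the expansion of $n$ is padded with leading zeros as described in Section 2, and the finite sums merely approximate the correlation coefficients.

```python
def count_occurrences(v: str, w: str) -> int:
    """Number of occurrences of the word v in the word w, overlaps allowed."""
    return sum(w.startswith(v, i) for i in range(len(w) - len(v) + 1))


def pattern_sequence(A, n: int) -> int:
    """a_A(n) = (-1)^{#(A, n)} in base 2, with (n)_2 padded by leading zeros."""
    expansion = bin(n)[2:] if n > 0 else ""
    total = sum(count_occurrences(v, "0" * (len(v) - 1) + expansion) for v in A)
    return -1 if total % 2 else 1


def t(n: int) -> int:
    return pattern_sequence({"1"}, n)  # Thue-Morse


def r(n: int) -> int:
    return pattern_sequence({"11"}, n)  # Rudin-Shapiro


# The two product identities from the list, checked on an initial segment.
identities_hold = all(
    pattern_sequence({"11", "1"}, n) == r(n) * t(n)
    and pattern_sequence({"10", "1"}, n) == (-1) ** n * r(n)
    for n in range(1024)
)

# Finite-N correlation sums of the Rudin-Shapiro sequence for lags 1..4.
N = 2 ** 12  # illustrative cutoff
rs = [r(n) for n in range(N + 8)]
correlations = {m: sum(rs[n] * rs[n + m] for n in range(N)) / N for m in range(1, 5)}
print(identities_hold, correlations)
```

Small values in `correlations` are consistent with, but of course do not prove, the statement that $r$ is noncorrelated.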

In this paper, we extend the result of [ZPK18] to patterns of length $\leq 4$ and put the findings in a wider context provided by the theory of automatic and regular sequences. Many of the ideas we use have their analogues and prototypes in [ZPK18]; throughout the paper we give references to the relevant results therein.

Unfortunately, there does not appear to be a simple criterion that determines if a given pattern sequence is noncorrelated (except for the partial information suggested by Conjecture 1.2 below). Due to practical limitations we only state a counting result here, as opposed to a complete list. (Footnote 1: The list, together with the code which can be used to produce it, is available from the author.)

Theorem A.

There are precisely $2272$ noncorrelated binary pattern sequences corresponding to patterns of length $\leq 4$ .

As a key step towards obtaining the above result, we reduce the task of verifying whether a given binary pattern sequence is noncorrelated to a finite computation, which can then be automated. The time complexity of the resulting algorithm is polynomial in $2^{\ell}$ , where $\ell$ denotes the length of patterns under consideration. Since it takes approximately $2^{\ell}$ bits to specify a binary pattern sequence, this is optimal up to improvements in the exponent.

Theorem B.

There exists an algorithm which, given a finite set of patterns $A\subseteq\{\mathtt{0},\mathtt{1}\}^{\ell}$ , performs $2^{O(\ell)}$ operations and decides if the corresponding pattern sequence $a_{A}$ is noncorrelated.

While we keep the exposition fairly self-contained, we also wish to emphasize that the above problem can be seen as a part of a larger theory. We note that binary pattern sequences are $2$ -automatic (see Section 2 for the relevant definitions). A crucial component of our reasoning is Theorem 3.5, which asserts that the correlation sequences coming from automatic sequences are regular. While this result will not come as a surprise to the experts in the field, to the best of our knowledge it does not appear in print elsewhere. Its importance stems from the fact that a regular sequence admits a simple recursive description, and hence many properties are easily verified for such a sequence. In our particular application, we reduce the task of determining if a pattern sequence is noncorrelated to the ostensibly simpler task of determining if a $2$ -regular sequence is identically zero.

The problem of classifying noncorrelated pattern sequences becomes more tractable if we impose additional assumptions on the set of patterns under consideration. Let us call a binary pattern sequence $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ dilation-invariant if $a(2n)=a(n)$ for all $n\in\mathbb{N}_{0}$ , or equivalently, if $a=a_{A}$ for a set $A$ that contains only patterns that begin and end with $\mathtt{1}$ (see Section 2.4 for details). In the dilation-invariant case, we have a conjectural classification, which we are able to confirm in one direction in full generality, and in the opposite direction for patterns of length $\leq 5$ .

Theorem C.

Let $A$ be a set of patterns over the alphabet $\{\mathtt{0},\mathtt{1}\}$ , all of which begin and end with $\mathtt{1}$ . Let $\ell$ be the length of the longest word in $A$ and let $a=a_{A}$ be the corresponding binary pattern sequence. If $\ell\geq 2$ and $\mathtt{1}\{\mathtt{0},\mathtt{1}\}^{\ell-2}\mathtt{1}\subseteq A$ then $a$ is noncorrelated. Conversely, if $2\leq\ell\leq 5$ and $a$ is noncorrelated then $\mathtt{1}\{\mathtt{0},\mathtt{1}\}^{\ell-2}\mathtt{1}\subseteq A$ .
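The hypothesis of Theorem C is a purely combinatorial condition on $A$ and is easy to test mechanically. The following sketch is an illustration, not the author's code; `full_family` and `satisfies_theorem_c_condition` are hypothetical helper names. A result of `True` implies noncorrelation by Theorem C, whereas `False` is known to imply correlation only for $2\leq\ell\leq 5$ .

```python
from itertools import product


def full_family(l: int) -> set:
    """The set 1{0,1}^(l-2)1 of binary words of length l >= 2 beginning and ending with 1."""
    return {"1" + "".join(mid) + "1" for mid in product("01", repeat=l - 2)}


def satisfies_theorem_c_condition(A) -> bool:
    """Check whether l >= 2 and 1{0,1}^(l-2)1 is contained in A, where l is the
    length of the longest word in A. All words must begin and end with 1."""
    if not A or any(not v or v[0] != "1" or v[-1] != "1" for v in A):
        raise ValueError("expected non-empty words that begin and end with 1")
    l = max(len(v) for v in A)
    return l >= 2 and full_family(l) <= set(A)


print(satisfies_theorem_c_condition({"11"}))                # Rudin-Shapiro
print(satisfies_theorem_c_condition({"101", "111", "11"}))  # contains 1{0,1}1
print(satisfies_theorem_c_condition({"1"}))                 # Thue-Morse, l = 1
```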

Conjecture 1.1.

Let $A$ , $\ell$ and $a$ be as in Theorem C. If $a$ is noncorrelated then $\ell\geq 2$ and $\mathtt{1}\{\mathtt{0},\mathtt{1}\}^{\ell-2}\mathtt{1}\subseteq A$ .

If $A=\mathtt{1}\{\mathtt{0},\mathtt{1}\}^{\ell-2}\mathtt{1}$ then the fact that $a_{A}$ is noncorrelated follows from [ZPK18]. More generally, Theorem 1.3 in [ZPK18] (see also [AL91]) provides a classification of all noncorrelated binary pattern sequences $a_{A}$ for sets of patterns of the form $A=w_{1}\{\mathtt{0},\mathtt{1}\}^{\ell_{1}}w_{2}\{\mathtt{0},\mathtt{1}\}^{\ell_{2}}\dots w_{s}\{\mathtt{0},\mathtt{1}\}^{\ell_{s}}w_{s+1}$ where $s\in\mathbb{N}$ , $w_{i}$ are words over the alphabet $\{\mathtt{0},\mathtt{1}\}$ and $\ell_{1},\ell_{2},\dots,\ell_{s}\in\mathbb{N}_{0}$ . Conjecture 1.1 is consistent with said classification.

Returning to the general case, we notice that each binary pattern sequence $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ can be written as the product of a periodic sequence $h$ and a dilation-invariant pattern sequence $b$ (Lemma 2.9). The correlation coefficients of $a$ and $b$ are closely related (see also Remark 5.6), and in all cases that we were able to check (i.e., $\ell\leq 4$ ), if $a$ is noncorrelated then so is $b$ . This motivates us to put forward the following conjecture.

Conjecture 1.2.

Let $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a noncorrelated binary pattern sequence. Then $a$ is the product of a periodic sequence and a dilation-invariant noncorrelated binary pattern sequence.

Above we restricted our attention to base $2$ for the sake of brevity. In the remaining part of the paper, we work in arbitrary base $k\geq 2$ . In particular, the natural base- $k$ variant of Theorem B holds true. The same applies to the first part of Theorem C, except that it is less clear what the base- $k$ variant should be and the resulting statement is vacuous for many values of $k$ (see Proposition 5.3). When it comes to computations, we only consider base $2$ since for larger bases the number of distinct pattern sequences becomes so large that merely listing them all is already infeasible even for modest pattern lengths.

Acknowledgements

While writing this paper, the author was supported by the ERC grant ErgComNum 682150 at the Hebrew University of Jerusalem. During the review process, the author was working within the framework of the LABEX MILYON (ANR-10-LABX-0070) of Université de Lyon, within the program ”Investissements d’Avenir” (ANR-11-IDEX-0007) operated by the French National Research Agency (ANR). The author also acknowledges support from the Foundation for Polish Science (FNP).

The author wishes to express his gratitude to Boris Adamczewski, Jakub Byszewski, Aihua Fan and Tamar Ziegler for helpful conversations and to the anonymous Referee for the careful reading of this paper and constructive suggestions.

2. Background and definitions

Convention: Throughout the paper, $k$ denotes the base and is considered to be fixed. In particular, all constructions and constants are allowed to depend on $k$ unless explicitly stated otherwise.

2.1. Pattern sequences

We let $\Sigma_{k}=\{\mathtt{0},\mathtt{1},\dots,k-1\}$ denote the set of digits in base $k$ . For a set $X$ , we let $X^{*}$ denote the monoid consisting of words over the alphabet $X$ , equipped with the operation of concatenation and neutral element $\epsilon$ , the empty word. For $v\in X^{*}$ , $\left|v\right|$ denotes the length of $v$ . For $n\in\mathbb{N}_{0}$ , $(n)_{k}\in\Sigma_{k}^{*}$ denotes the expansion of $n$ in base $k$ (without leading zeros). Conversely, for $v\in\Sigma_{k}^{*}$ , $[v]_{k}\in\mathbb{N}_{0}$ denotes the integer encoded by $v$ .

Let $X$ be a set. We say that a word $v\in X^{*}$ appears in another word $w\in X^{*}$ , or that $v$ is a factor of $w$ , if there exist $x,y\in X^{*}$ such that $w=xvy$ . We call $v$ a prefix (resp. suffix) of $w$ if we may take $x=\epsilon$ (resp. $y=\epsilon$ ). We further define $\#(v,w)$ to be the number of times $v$ appears in $w$ , that is, the number of distinct pairs $(x,y)\in X^{*}\times X^{*}$ such that $w=xvy$ . We note that this definition allows for overlaps, so for instance $\#(\mathtt{010},\mathtt{01010})=2$ . More generally, for a finite set $A\subseteq X^{*}$ , we define $\#(A,w)=\sum_{v\in A}\#(v,w)$ .

Accordingly, for $n\in\mathbb{N}_{0}$ and $v\in\Sigma_{k}^{*}\setminus\{\mathtt{0}\}^{*}$ , $\#(v,n)$ denotes the number of times that $v$ appears in the base- $k$ expansion of $n$ padded with sufficiently many leading zeros, that is, $\#(v,n)=\#(v,\mathtt{0}^{\left|v\right|-1}(n)_{k})$ . The inclusion of the leading zeros in the expansion of $n$ ensures better behaviour of the map $n\mapsto\#(v,n)$ ; in particular, for each $n,m\in\mathbb{N}_{0}$ and sufficiently large $\alpha\in\mathbb{N}_{0}$ we have $\#(v,k^{\alpha}m+n)=\#(v,k^{\alpha}m)+\#(v,n)$ . The assumption that $v$ is not a string of zeros ensures that $\#(v,n)$ is well-defined, in the sense that for fixed $n$ , the sequence $\#(v,\mathtt{0}^{\alpha}(n)_{k})$ ( $\alpha\in\mathbb{N}_{0}$ ) is eventually constant.
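The role of the padding can be seen in a small experiment. The sketch below is a possible implementation of $\#(v,w)$ and $\#(v,n)$ under the conventions of this section; the function names are illustrative, and digits are rendered as single characters, so it assumes $k\leq 10$ .

```python
def digits(n: int, k: int) -> str:
    """Base-k expansion (n)_k without leading zeros; the empty word for n = 0."""
    out = ""
    while n > 0:
        out = str(n % k) + out
        n //= k
    return out


def count_in_word(v: str, w: str) -> int:
    """#(v, w): occurrences of v in w, overlaps allowed."""
    return sum(w.startswith(v, i) for i in range(len(w) - len(v) + 1))


def count_in_integer(v: str, n: int, k: int = 2) -> int:
    """#(v, n) = #(v, 0^{|v|-1} (n)_k) for a word v that is not a string of zeros."""
    if set(v) <= {"0"}:
        raise ValueError("v must contain a non-zero digit")
    return count_in_word(v, "0" * (len(v) - 1) + digits(n, k))


print(count_in_word("010", "01010"))   # overlapping occurrences are both counted
print(count_in_integer("01", 1))       # the occurrence uses one padding zero

# Additivity #(v, k^alpha m + n) = #(v, k^alpha m) + #(v, n) for large alpha.
alpha = 8
additive = all(
    count_in_integer(v, 2 ** alpha * m + n)
    == count_in_integer(v, 2 ** alpha * m) + count_in_integer(v, n)
    for v in ("1", "11", "01", "101")
    for m in range(16)
    for n in range(16)
)
print(additive)
```

Here `alpha = 8` is large enough because the test values of `n` have at most four binary digits and the patterns have length at most three.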

We will call a set $A\subseteq\Sigma_{k}^{*}$ admissible if $A$ is finite and $A\cap\{\mathtt{0}\}^{*}=\emptyset$ , so that we may define $\#(A,n)=\sum_{v\in A}\#(v,n)$ . For any admissible set $A$ , the corresponding pattern sequence is defined by (cf. [ZPK18, Definition 1.1])

$$a_{A}(n)=(-1)^{\#(A,n)}.$$

If additionally $\left|u\right|\leq\ell$ for all $u\in A$ then we say that $a$ is a pattern sequence of length $\leq\ell$ , or equivalently we define the length of $a$ as the least possible value of $\max_{u\in A}\left|u\right|$ among all representations of $a$ in the form (5), where $A\subseteq\Sigma_{k}^{*}$ is an admissible set. Note that one pattern sequence can have multiple representations of the aforementioned form.

For two sets $A,B$ , we let $A\oplus B$ denote the symmetric difference $(A\setminus B)\cup(B\setminus A)$ .

Lemma 2.1.

The class of pattern sequences $\mathbb{N}_{0}\to\{+1,-1\}$ is closed under multiplication.

Proof.

It is enough to note that for any admissible sets $A,B\subseteq\Sigma_{k}^{*}$ we have

$$a_{A}\cdot a_{B}=a_{A\oplus B}.$$ ∎

It will usually be convenient to impose further restrictions on the set of patterns $A$ . Depending on the context we require that either $A$ has no leading zeros (in the sense that $\mathtt{0}$ is not a prefix of $v$ for any $v\in A$ ) or that $A$ has constant length (in the sense that there is some $\ell\in\mathbb{N}$ such that $\left|v\right|=\ell$ for all $v\in A$ ).

Lemma 2.2.

Let $\ell\in\mathbb{N}_{0}$ and let $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a pattern sequence of length $\leq\ell$ . Then there exist admissible sets $B,C\subseteq\Sigma_{k}^{*}$ such that $B$ has no leading zeros, $C$ has constant length $\ell$ , and $a=a_{B}=a_{C}$ . Moreover, $B$ and $C$ are uniquely determined by $a$ .

Proof.

Pick any admissible set $A\subseteq\Sigma_{k}^{*}$ with $a=a_{A}$ . Note that for each $v\in\Sigma_{k}^{*}$ and each $n\in\mathbb{N}_{0}$ we have

$$\#(v,n)=\sum_{i=0}^{k-1}\#(iv,n).\tag{6}$$

To construct $B$ , begin with $A$ and as long as $A$ contains at least one word starting with $\mathtt{0}$ , say $\mathtt{0}v$ , replace $A$ with $A\oplus\left\{iv\ \middle|\ i\in\Sigma_{k}\right\}\oplus\{v\}$ . Because of (6), this operation does not change the sequence $a_{A}$ . Since each iteration decreases the total number of leading zeros in the patterns in $A$ , after a finite number of steps this procedure must terminate and the resulting set of patterns has no leading zeros.

To construct $C$ , likewise, begin with $A$ and as long as $A$ contains at least one word $v$ with length $\left|v\right|<\ell$ , pick the shortest such word $v$ and replace $A$ with $A\oplus\left\{iv\ \middle|\ i\in\Sigma_{k}\right\}\oplus\{v\}$ . Like before, this operation does not change the sequence $a_{A}$ . Each iteration either decreases the number of words in $A$ with least possible length, or increases the length of the shortest word in $A$ . At the same time, no words of length larger than $\ell$ are introduced. Hence, after a finite number of steps this procedure must terminate and the resulting set of patterns has constant length equal to $\ell$ .

It remains to show uniqueness. Using Lemma 2.1, we may assume that $A=\emptyset$ . For the sake of contradiction, suppose that one of $B$ and $C$ is non-empty. Consider first the case when $B\neq\emptyset$ and let $v$ be the shortest word in $B$ . Then $1=a_{B}([v]_{k})=-1$ , since $\mathtt{0}^{\ell}v$ contains exactly one pattern from $B$ , namely $v$ . Hence, we have reached a contradiction. Consider next the case when $C\neq\emptyset$ and choose the word $\mathtt{0}^{m}v\in C$ where $m$ is largest possible. Then we again reach the contradiction: $1=a_{C}([v]_{k})=-1$ . ∎
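The two rewriting procedures from the proof of Lemma 2.2 translate directly into code. The following is a sketch in base $2$ under the assumption that the input set is admissible; the function names and the test set `A` are illustrative and not taken from the paper.

```python
def a(A, n: int) -> int:
    """a_A(n) in base 2, with (n)_2 padded by |v| - 1 leading zeros for each v."""
    s = bin(n)[2:] if n else ""
    total = 0
    for v in A:
        w = "0" * (len(v) - 1) + s
        total += sum(w.startswith(v, i) for i in range(len(w) - len(v) + 1))
    return -1 if total % 2 else 1


def expand(A: set, v: str, k: int = 2) -> set:
    """Return A (+) {iv : i in Sigma_k} (+) {v}, where (+) is symmetric difference."""
    return A ^ {str(i) + v for i in range(k)} ^ {v}


def no_leading_zeros_form(A, k: int = 2) -> set:
    B = set(A)
    while True:
        bad = [w for w in B if w.startswith("0")]
        if not bad:
            return B
        B = expand(B, bad[0][1:], k)  # bad[0] = 0v, pass v


def constant_length_form(A, l: int, k: int = 2) -> set:
    C = set(A)
    while True:
        short = [w for w in C if len(w) < l]
        if not short:
            return C
        C = expand(C, min(short, key=len), k)  # shortest word first


A = {"011", "1"}  # hypothetical example with a leading zero and mixed lengths
B = no_leading_zeros_form(A)
C = constant_length_form(A, 3)
print(sorted(B), sorted(C))
print(all(a(A, n) == a(B, n) == a(C, n) for n in range(256)))
```

By the uniqueness part of the lemma, the resulting sets should not depend on the order in which words are processed.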

Remark 2.3.

We focus our attention on $\pm 1$ -valued sequences for two basic reasons. The first one is practical: The noncorrelation phenomenon that we are interested in relies on occurrence of certain arithmetic coincidences, which become less likely as the number of possible values increases; accordingly, the computational part of the problem becomes increasingly resource-intensive as sequences under consideration become more complicated. The second reason is conceptual: For a $\pm 1$ -valued sequence $a$ with mean $\mathrm{M}_{a}=0$ , noncorrelation is tantamount to equidistribution of the pairs $a(n),a(n+m)$ . More precisely, for each $m\in\mathbb{N}$ , if we additionally assume that the limits mentioned above exist then $\gamma_{a}(m)=0$ if and only if

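$$\lim_{N\to\infty}\frac{\left|\left\{n<N\ \middle|\ a(n)=i,\ a(n+m)=i^{\prime}\right\}\right|}{N}=\frac{1}{4}\quad\text{for all }i,i^{\prime}\in\{+1,-1\}.$$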

The analogous characterisation is false if $a$ is allowed to take more than $2$ values.
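The equivalence stated in Remark 2.3 can be unpacked as a short computation. In the sketch below, $p_{i,i^{\prime}}$ is ad hoc notation (not used in the paper) for the limiting frequency of the event $a(n)=i$ , $a(n+m)=i^{\prime}$ , which is assumed to exist.

```latex
% p_{i,i'} := \lim_{N\to\infty} |\{ n < N : a(n) = i,\ a(n+m) = i' \}| / N   (assumed to exist)
\begin{align*}
p_{+,+}+p_{+,-} &= p_{-,+}+p_{-,-} = \tfrac{1}{2}
  && (\mathrm{M}_{a}=0),\\
p_{+,+}+p_{-,+} &= p_{+,-}+p_{-,-} = \tfrac{1}{2}
  && (\text{the shifted sequence } n\mapsto a(n+m) \text{ also has mean } 0),\\
\gamma_{a}(m) &= p_{+,+}+p_{-,-}-p_{+,-}-p_{-,+} = 4p_{+,+}-1 .
\end{align*}
```

The last equality uses $p_{+,-}=p_{-,+}=\tfrac{1}{2}-p_{+,+}$ and $p_{-,-}=p_{+,+}$ , which follow from the first two lines. Hence $\gamma_{a}(m)=0$ precisely when $p_{+,+}=\tfrac{1}{4}$ , in which case all four frequencies equal $\tfrac{1}{4}$ .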

2.2. Automatic sequences

In this section we briefly discuss the basics of the theory of automatic sequences; for extensive background see [AS03a]. For $i\in\Sigma_{k}$ , we define the operators $\Lambda_{i}$ acting on sequences $\mathbb{N}_{0}\to\mathbb{C}$ by

$$\Lambda_{i}a(n)=a(kn+i).$$

The $k$ -kernel of a sequence $a\colon\mathbb{N}_{0}\to\mathbb{C}$ consists of all sequences $\mathbb{N}_{0}\to\mathbb{C}$ that can be obtained from $a$ by repeated application of $\Lambda_{i}$ ’s, that is,

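$$\mathcal{N}_{k}(a)=\left\{n\mapsto a(k^{\alpha}n+r)\ \middle|\ \alpha,r\in\mathbb{N}_{0},\ 0\leq r<k^{\alpha}\right\}\subseteq\mathbb{C}^{\mathbb{N}_{0}}.$$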

It will also be convenient to introduce the shift operator $S$ acting on sequences $\mathbb{N}_{0}\to\mathbb{C}$ by $Sa(n)=a(n+1)$ . For future reference, we record how the introduced operators interact.

Lemma 2.4.

For each $0\leq i<k-1$ we have $\Lambda_{i}S=\Lambda_{i+1}$ . Moreover, $\Lambda_{k-1}S=S\Lambda_{0}$ .

Proof.

Direct computation. ∎
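The computation can also be carried out mechanically. The sketch below checks, on an arbitrary test sequence, that $\Lambda_{i}Sa(n)=a(kn+i+1)=\Lambda_{i+1}a(n)$ for $0\leq i<k-1$ , and that on the last digit the carry gives $\Lambda_{k-1}Sa(n)=a(k(n+1))=S\Lambda_{0}a(n)$ . The names `Lambda`, `shift` and `agree`, the base `k = 3` and the test sequence are illustrative assumptions.

```python
def Lambda(i: int, a, k: int):
    """The operator Lambda_i: (Lambda_i a)(n) = a(k n + i)."""
    return lambda n: a(k * n + i)


def shift(a):
    """The shift operator S: (S a)(n) = a(n + 1)."""
    return lambda n: a(n + 1)


def agree(f, g, N: int = 200) -> bool:
    """Compare two sequences on the initial segment 0 <= n < N."""
    return all(f(n) == g(n) for n in range(N))


k = 3
a = lambda n: n * n + 7 * n + 1  # arbitrary injective test sequence

# Lambda_i S = Lambda_{i+1} for 0 <= i < k - 1.
first = all(agree(Lambda(i, shift(a), k), Lambda(i + 1, a, k)) for i in range(k - 1))
# Lambda_{k-1} S = S Lambda_0, since a(kn + k) = a(k(n + 1)).
second = agree(Lambda(k - 1, shift(a), k), shift(Lambda(0, a, k)))
print(first, second)
```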

A sequence $a$ is $k$ -automatic (or just automatic, if $k$ is clear from the context) if and only if $\mathcal{N}_{k}(a)$ is finite. Many equivalent definitions of automaticity are possible, and we briefly mention some of them to provide context. Details and terminology can be found in [AS03a]. As the name suggests, a sequence is $k$ -automatic if and only if it is computed by a deterministic finite $k$ -automaton with output. Any fixed point of a $k$ -uniform morphism is $k$ -automatic, and conversely any $k$ -automatic sequence can be obtained as a letter-to-letter coding of a fixed point of a $k$ -uniform morphism. When $k$ is a prime and $a$ is a sequence taking values in a finite field $\mathbb{F}$ of characteristic $k$ , yet another criterion due to Christol shows that $a$ is automatic if and only if the associated formal power series is algebraic over $\mathbb{F}$ .
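Finiteness of the $k$ -kernel can be observed experimentally for the two sequences from the introduction. The sketch below enumerates the subsequences $n\mapsto a(k^{\alpha}n+r)$ up to a fixed depth and compares them on a finite prefix; the function names and the parameters `prefix` and `max_depth` are illustrative assumptions. Because only prefixes are compared and the depth is bounded, the count is merely a lower bound on the size of the kernel.

```python
def thue_morse(n: int) -> int:
    return -1 if bin(n).count("1") % 2 else 1


def rudin_shapiro(n: int) -> int:
    # Bit j of n & (n >> 1) is set exactly when bits j and j + 1 of n are both
    # set, so the popcount equals the number of occurrences of the pattern 11.
    return -1 if bin(n & (n >> 1)).count("1") % 2 else 1


def kernel_signatures(a, k: int = 2, prefix: int = 64, max_depth: int = 8) -> set:
    """Distinct prefixes (a(k^alpha n + r))_{n < prefix} over all
    alpha <= max_depth and 0 <= r < k^alpha."""
    seen = set()
    for alpha in range(max_depth + 1):
        for r in range(k ** alpha):
            seen.add(tuple(a(k ** alpha * n + r) for n in range(prefix)))
    return seen


print(len(kernel_signatures(thue_morse)))     # subsequences found: t and -t
print(len(kernel_signatures(rudin_shapiro)))  # r, -r and two further sequences
```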

It is a well-known fact that the class of $k$ -automatic complex-valued sequences is closed under addition, multiplication, conjugation and restriction to subsequences, that is, if $a,b\colon\mathbb{N}_{0}\to\mathbb{C}$ are $k$ -automatic, then so are $n\mapsto a(n)+b(n)$ , $n\mapsto a(n)\cdot b(n)$ , $n\mapsto\overline{a}(n)$ and $n\mapsto a(An+B)$ for any $A\in\mathbb{N}$ , $B\in\mathbb{N}_{0}$ . More generally, if $a,b\colon\mathbb{N}_{0}\to\mathbb{C}$ are $k$ -automatic and $h\colon\mathbb{C}^{2}\to\mathbb{C}$ is arbitrary, then the sequence $n\mapsto h(a(n),b(n))$ is $k$ -automatic.

For a sequence $a\colon\mathbb{N}_{0}\to\mathbb{C}$ , we define the mean and the logarithmic mean:

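$$\mathrm{M}_{a}:=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n),\qquad\mathrm{M}_{a}^{\log}:=\lim_{N\to\infty}\frac{1}{\log N}\sum_{n=0}^{N-1}\frac{1}{n+1}a(n).$$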

We note that $\mathrm{M}_{a}$ is not guaranteed to exist, even when the sequence $a$ is automatic. (Consider, for instance, the sequence defined by $a(0)=0$ and $a(n)=(-1)^{\alpha}$ if $k^{\alpha}\leq n<k^{\alpha+1}$ , $\alpha\in\mathbb{N}_{0}$ .) On the other hand, we have the following positive result for logarithmic means.

Theorem 2.5 ([AS03a, Thm. 8.4.8]).

Let $a\colon\mathbb{N}_{0}\to\mathbb{C}$ be a $k$ -automatic sequence. Then $\mathrm{M}_{a}^{\log}$ exists.

We also record the fact that if $a\colon\mathbb{N}_{0}\to\mathbb{C}$ is a bounded sequence and $\mathrm{M}_{a}$ exists then $\mathrm{M}_{a}^{\log}$ also exists and $\mathrm{M}_{a}^{\log}=\mathrm{M}_{a}$ , see e.g. [AS03a, Prop. 8.4.4 (a)].
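The contrast between the two notions of mean is visible numerically for the sequence $a(0)=0$ , $a(n)=(-1)^{\alpha}$ for $k^{\alpha}\leq n<k^{\alpha+1}$ mentioned above. The following sketch uses $k=2$ ; the function names and the cutoffs are illustrative choices, and finite averages only hint at the limiting behaviour.

```python
import math


def a(n: int, k: int = 2) -> int:
    """a(0) = 0 and a(n) = (-1)^alpha when k^alpha <= n < k^(alpha + 1)."""
    if n == 0:
        return 0
    alpha = 0
    while k ** (alpha + 1) <= n:
        alpha += 1
    return -1 if alpha % 2 else 1


def mean(N: int, k: int = 2) -> float:
    """Ordinary average (1/N) * sum_{n<N} a(n)."""
    return sum(a(n, k) for n in range(N)) / N


def log_mean(N: int, k: int = 2) -> float:
    """Logarithmic average (1/log N) * sum_{n<N} a(n) / (n + 1)."""
    return sum(a(n, k) / (n + 1) for n in range(N)) / math.log(N)


# Ordinary averages along N = 2^alpha keep changing sign, so M_a does not exist.
print([round(mean(2 ** e), 3) for e in range(8, 12)])
# Logarithmic averages are small, consistent with the existence of M_a^log.
print(round(log_mean(2 ** 14), 3))
```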

Pattern sequences are, unsurprisingly, automatic. In fact, we have the following characterisation of pattern sequences in terms of their $k$ -kernels (cf. [ZPK18, Lemma 2.2]).

Lemma 2.6.

Let $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a sequence with $a(0)=+1$ and $\ell\in\mathbb{N}$ . Then the following conditions are equivalent:

(i) There exists a set $A\subseteq\Sigma_{k}^{\ell}\setminus\{\mathtt{0}^{\ell}\}$ with $a=a_{A}$ .

(ii) For each $b\in\mathcal{N}_{k}(a)$ , the sequence $b/a$ has period $k^{\ell-1}$ .

Proof.

(i) $\Rightarrow$ (ii): Let $i\in\Sigma_{k}$ and $n\in\mathbb{N}_{0}$ . Then each factor of $(n)_{k}$ is also a factor of $(kn+i)_{k}=(n)_{k}i$ and conversely each factor of $(kn+i)_{k}$ that is not a suffix is a factor of $(n)_{k}$ . More precisely, for each $v\in\Sigma_{k}^{*}\setminus\{\mathtt{0}\}^{*}$ we have

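$$\#(v,kn+i)=\begin{cases}\#(v,n)+1&\text{if }v\text{ is a suffix of }\mathtt{0}^{\left|v\right|-1}(kn+i)_{k},\\ \#(v,n)&\text{otherwise.}\end{cases}$$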

Consequently, $\Lambda_{i}a(n)/a(n)=-1$ if the suffix of $\mathtt{0}^{\ell-1}(kn+i)_{k}$ of length $\ell$ belongs to $A$ and $\Lambda_{i}a(n)/a(n)=+1$ otherwise. It follows that $h_{i}:=\Lambda_{i}a/a$ is $k^{\ell-1}$ -periodic. Since for each $i\in\Sigma_{k}$ , the operator $\Lambda_{i}$ maps $k^{\ell-1}$ -periodic sequences to $k^{\ell-2}$ -periodic sequences (or constant sequences, if $\ell=1$ ), it follows that all sequences in the $k$ -kernel of $a$ take the form $a\cdot h$ where $h$ is $k^{\ell-1}$ -periodic.

(ii) $\Rightarrow$ (i): For each $i\in\Sigma_{k}$ , let $h_{i}:=\Lambda_{i}a/a$ . Note that $h_{0}(0)=a(0)/a(0)=+1$ and that the sequences $h_{i}$ take values in $\{+1,-1\}$ . Conversely, given any $k$ -tuple of $k^{\ell-1}$ -periodic $\{+1,-1\}$ -valued sequences $h_{i}^{\prime}$ ( $i\in\Sigma_{k}$ ) with $h_{0}^{\prime}(0)=+1$ , we can inductively construct a sequence $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ with $a(0)=+1$ and $h_{i}=h_{i}^{\prime}$ for all $i\in\Sigma_{k}$ . Hence, the number of sequences that satisfy (ii) is

$$\left(2^{k^{\ell-1}}\right)^{k-1}\cdot 2^{k^{\ell-1}-1}=2^{k^{\ell}-1}.$$

On the other hand, the number of subsets of $\Sigma_{k}^{\ell}\setminus\{\mathtt{0}^{\ell}\}$ is also equal to $2^{k^{\ell}-1}$ , and by the previously proven implication and Lemma 2.2, each of these choices gives rise to a different sequence satisfying (ii). It follows that each sequence satisfying (ii) has a representation as in (i). ∎
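Both ingredients of the proof of Lemma 2.6 can be checked on small cases. The sketch below works in base $2$ with $\ell=2$ : it verifies that the ratios $h_{i}=\Lambda_{i}a/a$ have period $2^{\ell-1}$ for every $A\subseteq\Sigma_{2}^{2}\setminus\{\mathtt{00}\}$ , that the $2^{3}$ subsets give pairwise different sequences, and the counting identity for a few small parameters. All names and test ranges are illustrative assumptions.

```python
from itertools import combinations


def a_A(A, n: int) -> int:
    """a_A(n) in base 2, with (n)_2 padded by |v| - 1 leading zeros for each v."""
    s = bin(n)[2:] if n else ""
    total = 0
    for v in A:
        w = "0" * (len(v) - 1) + s
        total += sum(w.startswith(v, i) for i in range(len(w) - len(v) + 1))
    return -1 if total % 2 else 1


def ratio(A, i: int, n: int) -> int:
    """h_i(n) = Lambda_i a(n) / a(n); for +-1 values division is multiplication."""
    return a_A(A, 2 * n + i) * a_A(A, n)


def ratios_have_period(A, l: int, N: int = 128) -> bool:
    p = 2 ** (l - 1)
    return all(ratio(A, i, n) == ratio(A, i, n + p) for i in (0, 1) for n in range(N))


words = ["01", "10", "11"]  # Sigma_2^2 without the word 00
subsets = [set(c) for size in range(len(words) + 1) for c in combinations(words, size)]

all_periodic = all(ratios_have_period(A, 2) for A in subsets)
distinct_count = len({tuple(a_A(A, n) for n in range(16)) for A in subsets})
counting_ok = all(
    (2 ** (k ** (l - 1))) ** (k - 1) * 2 ** (k ** (l - 1) - 1) == 2 ** (k ** l - 1)
    for k in (2, 3)
    for l in (1, 2, 3)
)
print(all_periodic, distinct_count, counting_ok)
```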

2.3. Regular sequences

The class of $k$ -regular sequences was introduced by Allouche and Shallit [AS92, AS03b] as a natural generalization of the class of $k$ -automatic sequences.

Let $R$ be a ring contained in $\mathbb{C}$ . A sequence $f\colon\mathbb{N}_{0}\to\mathbb{C}$ is $(R,k)$ -regular if $\mathcal{N}_{k}(f)$ is contained in a finitely generated $R$ -module. Note that if $R^{\prime}\subseteq\mathbb{C}$ is another ring and $R\subseteq R^{\prime}$ then any $(R,k)$ -regular sequence is also $(R^{\prime},k)$ -regular. In our context, the choice of the ring $R$ does not play a major role: For the sake of brevity, we set $R=\mathbb{Q}$ throughout the paper and omit $R$ from the notation. (Strictly speaking we could have worked with $R=\mathbb{Z}[1/k]$ , making some results marginally stronger.) The fact that the ring under consideration is in fact a field leads to a slightly more succinct definition of regularity: A sequence $f\colon\mathbb{N}_{0}\to\mathbb{C}$ is $k$ -regular if and only if its $k$ -kernel spans a finite dimensional vector space over $\mathbb{Q}$ : $\dim\operatorname{span}_{\mathbb{Q}}\mathcal{N}_{k}(f)<\infty$ .

The class of $k$ -regular sequences enjoys closure properties analogous to those of $k$ -automatic sequences: If $f,g\colon\mathbb{N}_{0}\to\mathbb{C}$ are $k$ -regular, then so are $n\mapsto f(n)+g(n)$ , $n\mapsto f(n)\cdot g(n)$ , $n\mapsto\overline{f}(n)$ , $n\mapsto zf(n)$ ( $z\in\mathbb{C}$ ) and $n\mapsto f(An+B)$ ( $A\in\mathbb{N},\ B\in\mathbb{N}_{0}$ ). In particular, $k$ -regular sequences $\mathbb{N}_{0}\to\mathbb{C}$ form an involutive algebra over $\mathbb{C}$ (with addition and multiplication defined pointwise).

We will need a method to verify if a given regular sequence is identically zero. The following lemma provides a simple criterion.

Lemma 2.7.

Let $f\colon\mathbb{N}_{0}\to\mathbb{C}$ be $k$ -regular and non-zero. Then there exists $g\in\mathcal{N}_{k}(f)$ with $g(0)\neq 0$ .

Proof.

For the sake of contradiction, suppose that $g(0)=0$ for all $g\in\mathcal{N}_{k}(f)$ . We show by induction on $\alpha$ that $g(n)=0$ for all $g\in\mathcal{N}_{k}(f)$ and $0\leq n<k^{\alpha}$ . If $\alpha=0$ then $n=0$ , so there is nothing to prove. If $\alpha>0$ and $n<k^{\alpha}$ then $n=kn^{\prime}+i$ where $i\in\Sigma_{k}$ and $n^{\prime}<k^{\alpha-1}$ . Since $\Lambda_{i}g\in\mathcal{N}_{k}(f)$ , we have $g(n)=\Lambda_{i}g(n^{\prime})=0$ by the inductive assumption. Thus every $g\in\mathcal{N}_{k}(f)$ vanishes identically; in particular $f=0$ , contradicting the assumption that $f$ is non-zero. ∎
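The induction above rests on the identity $g(kn^{\prime}+i)=\Lambda_{i}g(n^{\prime})$ , which expresses every value of $f$ as the value at $0$ of some element of the kernel. The following Python sketch illustrates this under the assumption that the operators act by $\Lambda_{i}g(n)=g(kn+i)$ ; the helper names `Lambda` and `value_via_kernel` are hypothetical and introduced only for this illustration.

```python
def Lambda(i, g, k):
    """Return the sequence n -> g(k*n + i) (assumed action of the operator Lambda_i)."""
    return lambda n: g(k * n + i)


def value_via_kernel(f, n, k):
    """Evaluate f(n) as g(0) for a kernel element g obtained by peeling off
    the base-k digits of n, least significant digit first."""
    g = f
    while n > 0:
        n, i = divmod(n, k)   # n = k*n' + i
        g = Lambda(i, g, k)   # f(k*n' + i) = (Lambda_i f)(n')
    return g(0)


# Any sequence works for the identity itself; regularity is only needed
# to make the kernel finite-dimensional.
f = lambda n: n * n + 1
print(all(value_via_kernel(f, n, 3) == f(n) for n in range(200)))
```

In particular, if every kernel element vanishes at $0$ then every value of $f$ vanishes, which is the content of the lemma.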

2.4. Invariant sequences

We will say that a sequence $a\colon\mathbb{N}_{0}\to\mathbb{C}$ is dilation-invariant if $a(kn)=a(n)$ for all $n\in\mathbb{N}_{0}$ . The dilation-invariant pattern sequences admit a simple description. Following the convention in Section 2.1, we will say that a set $A\subseteq\Sigma_{k}^{*}$ has no trailing zeros if $\mathtt{0}$ is not a suffix of any $v\in A$ .

Lemma 2.8.

Let $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a pattern sequence. Then $a$ is dilation-invariant if and only if there exists a set $A\subseteq\Sigma_{k}^{*}$ that has no leading and no trailing zeros and such that $a=a_{A}$ .

Proof.

If $A\subseteq\Sigma_{k}^{*}$ has no trailing zeros then $\#(v,n)=\#(v,kn)$ for all $v\in A$ and $n\in\mathbb{N}_{0}$ , so $a_{A}$ is dilation-invariant.

Conversely, suppose that $a$ is dilation-invariant and let $A\subseteq\Sigma_{k}^{*}$ be a set of patterns without leading zeros such that $a=a_{A}$ , which exists by Lemma 2.2. Suppose for the sake of contradiction that $A$ contains a pattern ending with $\mathtt{0}$ , say $u\mathtt{0}\in A$ for some $u\in\Sigma_{k}^{*}$ , and let $u$ be as short as possible. Since $a$ is dilation-invariant, we have

[TABLE]

On the other hand, each $v\in A$ either ends in a non-zero digit (in which case $\#(v,u\mathtt{0})=\#(v,u)$ ), or ends in $\mathtt{0}$ and is not a factor of $u\mathtt{0}$ (in which case $\#(v,u\mathtt{0})=\#(v,u)=0$ ), or is equal to $u\mathtt{0}$ (in which case $\#(v,u\mathtt{0})=1$ and $\#(v,u)=0$ ). As a consequence,

[TABLE]

which contradicts (10) and finishes the argument. ∎
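The easy direction of Lemma 2.8 can be checked numerically. The sketch below is a minimal illustration, assuming that $\#(v,n)$ counts (possibly overlapping) occurrences of $v$ in the base- $k$ expansion of $n$ padded with leading zeros, and that the expansion of $0$ is the empty word; the function names are hypothetical. Patterns are represented as tuples of digits.

```python
def digits(n, k):
    """Base-k digits of n, most significant first (empty list for n = 0)."""
    out = []
    while n > 0:
        n, d = divmod(n, k)
        out.append(d)
    return out[::-1]


def count(v, n, k):
    """Number of occurrences of the pattern v in the expansion of n,
    padded with len(v) - 1 leading zeros (an assumption of this sketch)."""
    v = list(v)
    word = [0] * (len(v) - 1) + digits(n, k)
    return sum(word[j:j + len(v)] == v for j in range(len(word) - len(v) + 1))


def pattern_sequence(A, n, k):
    """a_A(n) = (-1)^{#(A, n)} for a finite set A of patterns."""
    return (-1) ** sum(count(v, n, k) for v in A)


def is_dilation_invariant(A, k, bound):
    """Check a(kn) = a(n) for all n < bound (a finite test, not a proof)."""
    return all(pattern_sequence(A, k * n, k) == pattern_sequence(A, n, k)
               for n in range(bound))


print(is_dilation_invariant({(1, 1)}, 2, 500))  # pattern 11 has no trailing zero
print(is_dilation_invariant({(1, 0)}, 2, 500))  # pattern 10 ends with 0
```

For the pattern $\mathtt{10}$ the invariance already fails at $n=1$ , since $\mathtt{10}$ occurs in the expansion of $2$ but not in that of $1$ .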

We also record the fact that every pattern sequence is the product of a dilation-invariant sequence and a periodic sequence. As we will see (cf. Remark 5.6) the introduction of the multiplicative factor affects the correlation coefficients in a relatively simple way, which motivates our focus on dilation-invariant sequences.

Lemma 2.9.

Let $\ell\in\mathbb{N}_{0}$ and let $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a pattern sequence of length $\leq\ell$ . Then there exists a unique dilation-invariant pattern sequence $b$ of length $\leq\ell$ such that $a/b$ is $k^{\ell-1}$ -periodic.

Proof.

By Lemma 2.2, we may assume that $a=a_{A}$ for a set $A\subseteq\Sigma_{k}^{*}$ without leading zeros. Reasoning along similar lines as in the proof of Lemma 2.2, we note that for any word $v$ , we have

[TABLE]

In particular, letting $D(v):=\left\{vi\ \middle|\ i\in\Sigma_{k}\right\}\cup\{v\}$ , we see that the sequence $a_{D(v)}$ is $k^{\left|v\right|}$ -periodic. We construct a sequence of sets $A:=A_{0},A_{1},\dots,A_{t}=:B$ , where $A_{j+1}=A_{j}\oplus D(v)$ if $A_{j}$ contains the word $v\mathtt{0}$ for some $v\in\Sigma_{k}^{*}$ and $t$ is the first index such that no word in $A_{t}$ ends with $\mathtt{0}$ . This construction is guaranteed to terminate because each step decreases the total length of words in $A_{j}$ that end with $\mathtt{0}$ . Letting $b:=a_{B}$ we observe that $a/b$ is the product of $k^{\ell-1}$ -periodic sequences and hence $k^{\ell-1}$ -periodic. ∎
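The construction in this proof is effective and can be sketched in a few lines of Python. The code below is an illustration only: patterns are strings of decimal digits (so it assumes $k\leq 10$ ), the input set is assumed to have no leading zeros, and the names `remove_trailing_zeros`, `occurrences` and `sign` are hypothetical.

```python
def to_base(n, k):
    """Base-k expansion of n as a string (empty string for n = 0)."""
    s = ""
    while n > 0:
        n, d = divmod(n, k)
        s = str(d) + s
    return s


def occurrences(v, n, k):
    """Occurrences of v in the expansion of n; no padding is needed
    because v is assumed to start with a non-zero digit."""
    w = to_base(n, k)
    return sum(w.startswith(v, j) for j in range(len(w) - len(v) + 1))


def sign(A, n, k):
    """a_A(n) = (-1)^{#(A, n)}."""
    return (-1) ** sum(occurrences(v, n, k) for v in A)


def remove_trailing_zeros(A, k):
    """Repeatedly replace A by the symmetric difference of A and D(v),
    where v0 is a word of A ending with 0 and D(v) = {v0, ..., v(k-1), v}."""
    B = set(A)
    while True:
        bad = [w for w in B if w.endswith("0")]
        if not bad:
            return B
        v = bad[0][:-1]
        B ^= {v + str(i) for i in range(k)} | {v}


A = {"10", "110", "1"}            # binary patterns, length ell = 3
B = remove_trailing_zeros(A, 2)
ratio = [sign(A, n, 2) * sign(B, n, 2) for n in range(64)]
print(sorted(B), ratio[:8])
```

Since the sequences take values in $\{+1,-1\}$ , the product `sign(A, n, 2) * sign(B, n, 2)` equals the quotient $a(n)/b(n)$ , whose periodicity can then be inspected directly.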

3. Correlation coefficients

In this section we study correlation coefficients of $k$ -automatic sequences and show that they are $k$ -regular (Corollary 3.5). This allows us to reduce the task of verifying if a given $k$ -automatic sequence is noncorrelated to checking if a given $k$ -regular sequence is identically zero on $\mathbb{N}$ , which can be accomplished with the help of Lemma 2.7.

3.1. Definitions

For two sequences $a,b\colon\mathbb{N}_{0}\to\mathbb{C}$ , we define the correlation coefficients:

$$\gamma_{a,b}(m):=\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}a(n)\overline{b}(n+m),$$

if the limit exists (otherwise, $\gamma_{a,b}(m)$ is considered undefined). We are often interested in the case where $a=b$ , when we write $\gamma_{a}$ in place of $\gamma_{a,a}$ . Unfortunately, the limit defining $\gamma_{a,b}(m)$ is not guaranteed to converge even if $a$ and $b$ are automatic. This motivates us to consider the logarithmic correlation coefficients, defined by

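$$\gamma^{\log}_{a,b}(m):=\lim_{N\to\infty}\frac{1}{\log N}\sum_{n=0}^{N-1}\frac{a(n)\overline{b}(n+m)}{n+1}.$$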

If $a$ and $b$ are automatic and $m\in\mathbb{N}_{0}$ , then the sequence $n\mapsto a(n)\overline{b}(n+m)$ is also automatic. Since Theorem 2.5 guarantees existence of logarithmic means of automatic sequences, we have the following fact.

Corollary 3.1.

Let $a,b\colon\mathbb{N}_{0}\to\mathbb{C}$ be $k$ -automatic sequences. Then the coefficients $\gamma^{\log}_{a,b}(m)$ are well-defined for all $m\in\mathbb{N}_{0}$ . Moreover, if the coefficient $\gamma_{a,b}(m)$ is well-defined for some $m\in\mathbb{N}_{0}$ then $\gamma_{a,b}(m)=\gamma^{\log}_{a,b}(m)$ .
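As a numerical illustration of these definitions, the following Python sketch estimates both kinds of coefficients by truncating the limits at a finite $N$ . It assumes real-valued sequences (so complex conjugation is omitted) and that the logarithmic average uses the weights $1/(n+1)$ ; the function names are hypothetical, and the Thue-Morse sequence $t$ serves as the test case.

```python
import math


def thue_morse(n):
    """t(n) = (-1)^(number of 1s in the binary expansion of n)."""
    return -1 if bin(n).count("1") % 2 else 1


def correlation(a, b, m, N):
    """Finite-N approximation of gamma_{a,b}(m) for real-valued a, b."""
    return sum(a(n) * b(n + m) for n in range(N)) / N


def log_correlation(a, b, m, N):
    """Finite-N approximation of the logarithmic coefficient, with
    weights 1/(n+1) and normalisation 1/log N (assumed form)."""
    return sum(a(n) * b(n + m) / (n + 1) for n in range(N)) / math.log(N)


N = 2 ** 14
print(correlation(thue_morse, thue_morse, 1, N))
print(log_correlation(thue_morse, thue_morse, 1, N))
```

The unweighted estimate approaches the known value $\gamma_{t}(1)=-1/3$ quickly along powers of $2$ , while the logarithmic estimate converges much more slowly (the error is of order $1/\log N$ ), so it should be read only as a rough indication.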

3.2. Recurrence

Our next goal is to obtain a recursive description of the correlation coefficients discussed above. Recall that for a $k$ -automatic sequence $a$ , the kernel $\mathcal{N}_{k}(a)$ is finite and closed under the operators $\Lambda_{i}$ defined in (7) for all $i\in\Sigma_{k}$ .

Lemma 3.2.

Let $\mathcal{N}$ be a finite set of sequences $\mathbb{N}_{0}\to\mathbb{C}$ , closed under the operators $\Lambda_{i}$ for all $i\in\Sigma_{k}$ . Then for all $a,b\in\mathcal{N}$ it holds that

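$$\gamma^{\log}_{a,b}(m)=\frac{1}{k}\sum_{i\in\Sigma_{k}}\gamma^{\log}_{a_{i}^{\prime},b_{i}^{\prime}}(m_{i}^{\prime}),\tag{14}$$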

where $a_{i}^{\prime},b_{i}^{\prime}$ and $m_{i}^{\prime}$ are given by

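$$a_{i}^{\prime}:=\Lambda_{i}a,\qquad b_{i}^{\prime}:=\Lambda_{(i+m)\bmod k}b,\qquad m_{i}^{\prime}:=\left\lfloor\frac{i+m}{k}\right\rfloor.\tag{15}$$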

Proof.

Rescaling if necessary, we assume that all sequences in $\mathcal{N}$ are $1$ -bounded (that is, $\left|a(n)\right|\leq 1$ for all $a\in\mathcal{N}$ and $n\in\mathbb{N}_{0}$ ). For each $N>0$ , splitting $[N]$ into residue classes modulo $k$ we obtain

[TABLE]

where $a_{i}^{\prime},b_{i}^{\prime}$ and $m_{i}^{\prime}$ are given by (15), and we use the estimate ${1}/\left(kn+i+1\right)={1}/{k(n+1)}+O(1/(n+1)^{2})$ together with the fact that the series $\sum_{n=0}^{\infty}1/(n+1)^{2}$ converges. Dividing by $\log N$ and recalling that $1/\log(N/k)=1/\log N+O(1/\log^{2}N)$ we obtain

[TABLE]

Letting $N\to\infty$ , we obtain (14). ∎
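To see the recurrence at work, consider the Thue-Morse sequence $t$ with $k=2$ , for which $\Lambda_{0}t=t$ and $\Lambda_{1}t=-t$ , so that the kernel is $\{t,-t\}$ . The Python sketch below assumes that the recurrence takes the form obtained by splitting into residue classes modulo $k$ , namely $\gamma_{a,b}(m)=\frac{1}{k}\sum_{i}\gamma_{\Lambda_{i}a,\Lambda_{j}b}(\lfloor(i+m)/k\rfloor)$ with $j=(i+m)\bmod k$ , and that it applies to the unweighted coefficients of $t$ , which exist. The names `SIGN` and `gamma` are hypothetical.

```python
from fractions import Fraction
from functools import lru_cache

K = 2
# For Thue-Morse, Lambda_i t = SIGN[i] * t, so every kernel element is +t or -t
# and gamma_{e*t, d*t}(m) = e * d * gamma_t(m).
SIGN = {0: 1, 1: -1}


@lru_cache(maxsize=None)
def gamma(m):
    """Exact value of gamma_t(m) computed from the assumed recurrence."""
    if m == 0:
        return Fraction(1)
    if m == 1:
        # The recurrence at m = 1 reads gamma(1) = -(gamma(0) + gamma(1)) / 2,
        # a fixed-point equation whose solution is -1/3.
        return Fraction(-1, 3)
    total = Fraction(0)
    for i in range(K):
        j = (i + m) % K        # residue selecting the kernel element of b
        shift = (i + m) // K   # new lag, strictly smaller than m when m >= 2
        total += SIGN[i] * SIGN[j] * gamma(shift)
    return total / K


print([gamma(m) for m in range(8)])
```

The case $m=1$ is the only one where the recurrence refers back to itself, which is why it is solved separately; this mirrors the special treatment of $m=1$ in the existence argument for the unweighted coefficients.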

While the coefficients $\gamma^{\log}_{a,b}(m)$ are better-behaved in general, our original motivation concerns the coefficients $\gamma_{a,b}(m)$ (where additionally $a=b$ ). Fortunately, existence of the latter is easy to ensure under mild additional assumptions.

Lemma 3.3.

Let $\mathcal{N}$ be a finite set of sequences $\mathbb{N}_{0}\to\mathbb{C}$ , closed under the operators $\Lambda_{i}$ for all $i\in\Sigma_{k}$ . Suppose that $\gamma_{a,b}(0)$ exists for each $a,b\in\mathcal{N}$ . Then $\gamma_{a,b}(m)$ also exists for all $a,b\in\mathcal{N}$ and $m\in\mathbb{N}_{0}$ and, using the notation from (15), these coefficients satisfy

[TABLE]

Proof.

Rescaling if necessary, we may assume that all sequences in $\mathcal{N}$ are $1$ -bounded. Generalizing the definition of $\gamma_{a,b}(m)$ slightly, for $x\geq 1$ let us put

[TABLE]

Then, following the same reasoning as in Lemma 3.2, we find the recursive relation

[TABLE]

where $a_{i}^{\prime},b_{i}^{\prime}$ and $m_{i}^{\prime}$ are given by (15).

In particular, for $m=1$ we obtain

[TABLE]

Iterating (19) $t$ times, we conclude that there exist weights $w_{a^{\prime},b^{\prime}}^{(t)}\geq 0$ ( $a^{\prime},b^{\prime}\in\mathcal{N}$ ) with $\sum_{a^{\prime},b^{\prime}\in\mathcal{N}}w_{a^{\prime},b^{\prime}}^{(t)}=1-1/k^{t}$ and sequences $a^{(t)},b^{(t)}\in\mathcal{N}$ such that

[TABLE]

Since $\gamma_{a^{\prime},b^{\prime}}(0;y)\to\gamma_{a^{\prime},b^{\prime}}(0)$ as $y\to\infty$ for each $a^{\prime},b^{\prime}\in\mathcal{N}$ , letting $x\to\infty$ in (20) we conclude that there exists a number $\gamma_{a,b}^{(t)}(1)=\sum_{a^{\prime},b^{\prime}\in\mathcal{N}}w_{a^{\prime},b^{\prime}}^{(t)}\gamma_{a^{\prime},b^{\prime}}(0)$ such that

[TABLE]

It follows that the sequence $\gamma_{a,b}^{(t)}(1)$ ( $t\in\mathbb{N}$ ) is Cauchy, and $\gamma_{a,b}(1)$ is well-defined:

[TABLE]

We are now ready to prove by induction on $m$ that the coefficients $\gamma_{a,b}(m)$ are well-defined for all $m\in\mathbb{N}_{0}$ and $a,b\in\mathcal{N}$ . The case $m=0$ is included in the assumptions, and we have dealt with $m=1$ above. Suppose now that $m\geq 2$ . For each $i\in\Sigma_{k}$ , since $\left\lfloor x/k\right\rfloor\leq x/k<x$ for all $x>0$ , we have

[TABLE]

Hence, existence of $\gamma_{a,b}(m)$ follows from (19) and the inductive assumption. Finally, to obtain (16) it remains to pass to the limit $x\to\infty$ in (18) (or use Lemma 3.2 combined with the remark after Theorem 2.5). ∎

3.3. Regularity

We are now ready to show that the logarithmic correlation sequences coming from $k$ -automatic sequences are $k$ -regular. In fact, bearing in mind applications in Section 4 we record a slightly more precise statement. Recall that for a sequence $a$ , the sequence $Sa$ is given by $Sa(n)=a(n+1)$ . Similar ideas can be seen in [AS03b, Thm. 6].

Proposition 3.4.

Let $\mathcal{N}$ be a finite set of sequences $\mathbb{N}_{0}\to\mathbb{C}$ , closed under the operators $\Lambda_{i}$ for all $i\in\Sigma_{k}$ . Let $\mathcal{M}=\left\{S^{e}\gamma^{\log}_{a,b}\ \middle|\ a,b\in\mathcal{N},e\in\{0,1\}\right\}$ . Then $\operatorname{span}_{\mathbb{Q}}\mathcal{M}$ is closed under the operators $\Lambda_{i}$ for all $i\in\Sigma_{k}$ .

Proof.

Pick any $g=S^{e}\gamma^{\log}_{a,b}\in\mathcal{M}$ ( $a,b\in\mathcal{N}$ , $e\in\{0,1\}$ ) and $j\in\Sigma_{k}$ . It follows from Lemma 3.2 that

[TABLE]

where for each $i\in\Sigma_{k}$ , $a_{i}^{\prime},b_{i}^{\prime}\in\mathcal{N}$ and

[TABLE]

It remains to note that each of the functions of $n$ appearing under the sum on the right hand side of (24) belongs to $\mathcal{M}$ . ∎

Theorem 3.5.

If $a\colon\mathbb{N}_{0}\to\mathbb{C}$ is $k$ -automatic then the sequence $\gamma_{a}^{\log}$ is $k$ -regular and $\dim\operatorname{span}_{\mathbb{Q}}\mathcal{N}_{k}(\gamma_{a,a}^{\log})\leq 2\left|\mathcal{N}_{k}(a)\right|^{2}$ .

4. Verifying noncorrelation

We now discuss the practical details of how one can check if a given pattern sequence is noncorrelated. We begin by setting up the notation and adapting the general results from previous sections to the situation at hand; this is done in subsections 4.1 and 4.2. Then, in subsections 4.3 and 4.4 we discuss how the relevant computations can be performed. Finally, in subsection 4.5 we discuss the complexity of the resulting algorithm, which finishes the proof of Theorem B. Implementation of this algorithm allows us to verify Theorem A by direct computation.

4.1. Setup

Throughout this section, $A\subseteq\Sigma_{k}^{\ell}$ denotes an admissible set and $a\colon\mathbb{N}_{0}\to\{+1,-1\}$ denotes the corresponding pattern sequence:

$$a(n)=a_{A}(n)=(-1)^{\#(A,n)}.$$

We also introduce the sequence $f\colon\mathbb{N}_{0}\to\mathbb{R}$ given by

[TABLE]

Our task amounts to verifying that $f$ is well-defined (i.e., that the limits defining $\gamma_{a}(m)$ exist for all $m\in\mathbb{N}$ ) and determining whether it is identically zero. The existence question is easily accounted for (cf. [ZPK18, Section 3]).

Lemma 4.1.

For each $b,c\in\mathcal{N}_{k}(a)$ and $m\in\mathbb{N}_{0}$ , the coefficient $\gamma_{b,c}(m)$ exists.

Proof.

By Lemma 2.6, all sequences in $\mathcal{N}_{k}(a)$ are products of $a$ and $k^{\ell-1}$ -periodic sequences. Hence, there is a $k^{\ell-1}$ -periodic sequence $h$ such that $b(n)c(n)=a(n)^{2}h(n)=h(n)$ for all $n\in\mathbb{N}_{0}$ , and consequently

[TABLE]

exists. Existence of $\gamma_{b,c}(m)$ for $m\in\mathbb{N}$ now follows from Lemma 3.3. ∎

Recall that $f=\mathbf{1}_{\mathbb{N}}\cdot\gamma^{\log}_{a}$ is $k$ -regular by Theorem 3.5. In principle, in order to decide if $f$ is identically zero, it is now enough to follow the arguments in Section 3 to describe the structure of the $k$ -kernel of $f$ and then apply Lemma 2.7. In practice, we essentially follow this route, but we also take advantage of the fact that $f$ is a $k$ -regular sequence of a rather specific form.

4.2. Recursive relations

As a first step towards describing the recursive relations that define $f$ , we introduce a set that spans $\mathcal{N}_{k}(f)$ , in analogy to Proposition 3.4. It will be convenient to introduce the restricted averages

[TABLE]

Note that these averages are well-defined thanks to Theorem 2.5. Additionally, it follows from Lemma 2.6 and Lemma 4.1 that the logarithmic averages can be replaced with unweighted averages:

[TABLE]

As a direct consequence of the relevant definitions, we have

[TABLE]

Proposition 4.2.

Each sequence in $\mathcal{N}_{k}(f)$ is a linear combination of the sequences $\mathbf{1}_{k^{\ell}\mathbb{N}_{0}+q}\cdot S^{e}\gamma^{(r)}$ , where $e\in\{0,1\}$ , $0\leq r<k^{\ell}$ and $0\leq q\leq k^{\ell}$ . In particular, $\dim\operatorname{span}_{\mathbb{Q}}\mathcal{N}_{k}(f)\leq 2k^{\ell}(k^{\ell}+1)$ .

The proof of the above proposition will follow directly once we describe the behaviour of the base sequences $\mathbf{1}_{k^{\ell}\mathbb{N}_{0}+q}\cdot S^{e}\gamma^{(r)}$ under the operators $\Lambda_{i}$ ( $i\in\Sigma_{k}$ ). To simplify this description, it will be convenient to introduce the auxiliary sequence $h\colon\mathbb{N}_{0}\to\{+1,-1\}$ , given by

[TABLE]

The following basic fact is analogous to [ZPK18, Lemma 2.1].

Lemma 4.3.

The sequence $h$ given by (28) is $k^{\ell}$ -periodic.

Proof.

Follows immediately from Lemma 2.6. ∎

Lemma 4.4.

Let $e\in\{0,1\}$ , $i\in\Sigma_{k}$ , $0\leq q\leq k^{\ell}$ and $0\leq r<k^{\ell}$ . If $i\neq q\bmod{k}$ then $\Lambda_{i}\mathbf{1}_{k^{\ell}\mathbb{N}_{0}+q}=0$ . If $i=q\bmod{k}$ then

[TABLE]

where the value of $e$ and the ranges of the summations are given by

[TABLE]

Proof.

The case $e=0$ follows by a standard adaptation of the proof of Lemma 3.2. Then, the case $e=1$ is derived using Lemma 2.4. ∎

4.3. Small shifts

Bearing in mind that we hope to apply Lemma 2.7, we need to be able to compute the values $S^{e}\gamma^{(r)}(0)=\gamma^{(r)}(e)$ for $e\in\{0,1\}$ and $0\leq r<k^{\ell}$ . This can, in principle, be accomplished by straightforward adaptations of the arguments in Lemma 3.3 and Lemma 4.1. Here, we discuss the practical details of how the computations are performed. Recall that $\gamma^{(r)}(0)=1$ , so we only need to compute $\gamma^{(r)}(1)$ .

For $0\leq r<k^{\ell}$ , let $\nu=\nu(r)$ denote the first position where a digit distinct from $k-1$ appears in the base- $k$ expansion $(r)_{k}$ ; if $r=k^{\alpha}-1$ for some $\alpha\geq 0$ then $\nu=\alpha$ . We consider $r$ in nondecreasing order with respect to $\nu(r)$ . We have three ranges to consider: $\nu(r)=0$ , $1\leq\nu(r)<\ell$ and $\nu(r)=\ell$ .

If $\nu(r)=0$ then it follows from Lemma 4.4 that

[TABLE]

here and elsewhere, the summation over $r^{\prime}$ runs through $r^{\prime}\in k^{\ell-1}\Sigma_{k}+\left\lfloor r/k\right\rfloor$ . Since we can readily compute $h(r)$ and $h(r+1)$ , we can compute $\gamma^{(r)}(1)$ .

If $1\leq\nu(r)<\ell$ then another application of Lemma 4.4 yields

[TABLE]

For all $r^{\prime}$ appearing in the above sum we have $\nu(r^{\prime})=\nu(r)-1$ , and hence $\gamma^{(r^{\prime})}(1)$ has been previously computed. Hence, again, we can directly compute $\gamma^{(r)}(1)$ .

Finally, if $\nu=\ell$ (meaning that $r=k^{\ell}-1$ ) then (31) continues to hold, and we have $\nu(r^{\prime})=\ell-1$ for all summands on the right-hand side except for the one corresponding to $r^{\prime}=r$ . Hence, we can compute $\gamma^{(r)}(1)$ as

[TABLE]
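The order of evaluation described in this subsection can be made explicit. The following Python sketch computes $\nu(r)$ as the number of trailing digits equal to $k-1$ in the base- $k$ expansion of $r$ , lists the residues in an order compatible with $\nu$ , and produces the indices $r^{\prime}\in k^{\ell-1}\Sigma_{k}+\lfloor r/k\rfloor$ on which a given $r$ depends. The function names are hypothetical, and the sketch covers only the scheduling, not the formulae for $\gamma^{(r)}(1)$ themselves.

```python
def nu(r, k):
    """Number of trailing digits equal to k - 1 in the base-k expansion of r;
    for r = k**alpha - 1 this returns alpha (in particular nu(0) = 0)."""
    count = 0
    while r % k == k - 1:
        r //= k
        count += 1
    return count


def evaluation_order(k, ell):
    """Residues 0 <= r < k**ell in nondecreasing order of nu(r)."""
    return sorted(range(k ** ell), key=lambda r: nu(r, k))


def predecessors(r, k, ell):
    """The indices r' = i * k**(ell-1) + floor(r / k), i in Sigma_k."""
    base = r // k
    return [i * k ** (ell - 1) + base for i in range(k)]


k, ell = 2, 3
for r in evaluation_order(k, ell):
    print(r, nu(r, k), predecessors(r, k, ell))
```

For $1\leq\nu(r)<\ell$ every predecessor has a strictly smaller value of $\nu$ , so processing the residues in this order guarantees that the required values are already available; only $r=k^{\ell}-1$ lists itself among its predecessors.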

4.4. Basis construction

Recall that our general strategy calls for a construction of a spanning set of $\operatorname{span}_{\mathbb{Q}}\mathcal{N}_{k}(f)$ . For technical reasons, it appears to be slightly more convenient and efficient to instead work with the potentially larger space

[TABLE]

It remains true that $f=0$ if and only if $\mathcal{M}=\{0\}$ , and that $\mathcal{M}$ is closed under $\Lambda_{i}$ for all $i\in\Sigma_{k}$ . Additionally, $\mathcal{M}$ admits a decomposition

[TABLE]

where the sequences $\eta_{q}$ are given by

[TABLE]

By Lemma 2.7, to show that $\mathcal{M}=\{0\}$ it suffices to verify that $g(0)=0$ for each $g\in\mathcal{M}$ , which is trivially satisfied for $g\in\mathcal{M}_{q}$ for all $1\leq q\leq k^{\ell}$ .

We proceed to construct a list of sequences $f_{1},f_{2},\dots\in\mathcal{M}$ which spans $\mathcal{M}$ . Additionally, we ensure that for each $t\geq 1$ , the sequence $f_{t}$ belongs to $\mathcal{M}_{q_{t}}$ for some $0\leq q_{t}\leq k^{\ell}$ and we keep track of the value of $q_{t}$ . By Proposition 4.2, each $f_{t}$ has a decomposition

[TABLE]

for some coefficients $w_{r,e}^{(t)}$ , which we also keep track of. While we cannot ensure that $f_{1},f_{2},\dots$ are linearly independent (in fact, we are primarily interested in the case when $f_{1}=f_{2}=\dots=0$ ), we will ensure that for each $1\leq q\leq k^{\ell}$ , the (multi-)set of coefficient vectors $\left\{w^{(t)}\ \middle|\ q_{t}=q\right\}\subseteq\mathbb{R}^{2k^{\ell}}$ is linearly independent.

We start by setting for $1\leq t\leq k^{\ell}$ ,

[TABLE]

and accordingly $w^{(t)}_{r,e}=\mathbf{1}_{\{0\}}(e)$ ( $0\leq r<k^{\ell}$ , $e\in\{0,1\}$ ).

Suppose next that at a certain stage we have constructed $f_{1},f_{2},\dots,f_{v}$ and that for all $1\leq t\leq u$ we have ensured that $\Lambda_{i}f_{t}\in\operatorname{span}_{\mathbb{Q}}\{f_{1},f_{2},\dots,f_{v}\}$ for all $i\in\Sigma_{k}$ . (Initially, $v=k^{\ell}$ and $u=0$ .) If $u=v$ then $\operatorname{span}_{\mathbb{Q}}\{f_{1},f_{2},\dots,f_{v}\}$ is a subset of $\mathcal{M}$ that is closed under $\Lambda_{i}$ ( $i\in\Sigma_{k}$ ) and under multiplication by $\mathbf{1}_{k^{\ell}\mathbb{N}_{0}+q}$ ( $0\leq q\leq k^{\ell}$ ), hence $\operatorname{span}_{\mathbb{Q}}\{f_{1},f_{2},\dots,f_{v}\}=\mathcal{M}$ and the construction is complete.

Let us next consider the case when $u<v$ . Put $q=q_{u+1}$ , $g=f_{u+1}$ and $w=w^{(u+1)}$ . Recall that the only value of $i$ for which $\Lambda_{i}g$ could be non-zero is $i=q\bmod{k}$ . If $q=0$ then $g=g(0)\mathbf{1}_{\{0\}}$ . Hence, either $g(0)\neq 0$ , in which case $a$ is not noncorrelated and we are done; or $g(0)=0$ , in which case $g=0$ and so $\Lambda_{i}g=0$ as well. Suppose now that $1\leq q\leq k^{\ell}$ . Applying Lemma 4.4, we obtain a representation of $\Lambda_{i}g$ in the form

[TABLE]

where the ranges of summation are given by $0\leq q^{\prime}\leq k^{\ell}$ , $0\leq r^{\prime}<k^{\ell}$ and $0\leq e^{\prime}\leq 1$ , and the coefficients $w^{\prime}$ are given by explicit formulae coming from (29). Bearing in mind that $\mathbf{1}_{k^{\ell}\mathbb{N}_{0}}=\mathbf{1}_{k^{\ell}\mathbb{N}}+\mathbf{1}_{\{0\}}$ , we find the decomposition

[TABLE]

where the coefficients $w^{\prime\prime}_{q^{\prime},r^{\prime},e^{\prime}}$ are given by:

[TABLE]

For each $q^{\prime}$ , we append $g^{\prime}_{q^{\prime}}$ to the list $f_{1},f_{2},\dots,f_{v}$ if (and only if)

[TABLE]

If (38) holds then we also record that $g^{\prime}_{q^{\prime}}\in\mathcal{M}_{q^{\prime}}$ (that is, we append $q^{\prime}$ to the list $q_{1},q_{2},\dots,q_{v}$ ) and that the decomposition of $g^{\prime}_{q^{\prime}}$ as the sum of basis sequences is given by (36) (that is, we append $w^{\prime\prime}_{q^{\prime}}$ to the list $w^{(1)},w^{(2)},\dots,w^{(v)}$ ). Each time a new sequence is added, $v$ increases by $1$ and after all $q^{\prime}$ have been processed, $u$ increases by $1$ .

The linear independence condition (38) ensures that for each $1\leq q\leq k^{\ell}$ , there are at most $2k^{\ell}$ values of $t$ with $q_{t}=q$ , and hence the construction needs to terminate after a bounded number of steps. As a result, we either find, for some $t\geq 1$ , a sequence $f_{t}\in\mathcal{M}$ with $f_{t}(0)\neq 0$ (in which case $a$ is not noncorrelated) or we construct a finite list of sequences $f_{1},f_{2},\dots,f_{N}\in\mathcal{M}$ that spans $\mathcal{M}$ and satisfies $f_{t}(0)=0$ for all $1\leq t\leq N$ (in which case $a$ is noncorrelated). In either case, we are able to determine whether $a$ is noncorrelated.
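The step that decides whether a new coefficient vector is appended amounts to an incremental linear independence test over $\mathbb{Q}$ . The Python sketch below shows one generic way to perform it, using exact Gaussian elimination with rational arithmetic; it is a substitute technique chosen for illustration and is not claimed to be the procedure used in the implementation accompanying the paper. The class name `IncrementalBasis` is hypothetical.

```python
from fractions import Fraction


class IncrementalBasis:
    """Maintains a linearly independent family of rational vectors in
    echelon form and accepts a new vector only if it enlarges the span."""

    def __init__(self):
        self.rows = []  # pairs (pivot index, row normalised to 1 at the pivot)

    def try_add(self, vector):
        v = [Fraction(x) for x in vector]
        # Reduce against the stored rows in insertion order; each stored row
        # vanishes at the pivots of all rows stored before it.
        for pivot, row in self.rows:
            if v[pivot] != 0:
                factor = v[pivot]
                v = [x - factor * y for x, y in zip(v, row)]
        for idx, x in enumerate(v):
            if x != 0:
                self.rows.append((idx, [y / x for y in v]))
                return True   # independent of the stored vectors: keep it
        return False          # already in the span: discard


basis = IncrementalBasis()
for w in ([1, 2, 3], [0, 1, 1], [1, 3, 4], [0, 0, 5]):
    print(w, basis.try_add(w))
```

In the setting of the construction one such basis would be kept for each value of $q$ , with vectors of dimension $2k^{\ell}$ , so that at most $2k^{\ell}$ vectors are ever accepted per value of $q$ .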

4.5. Complexity

We now provide quantitative estimates for the amount of computational power needed to verify if the pattern sequence $a$ is noncorrelated using the method described above. Throughout, we treat $k$ as fixed, and hence are interested in the regime $\ell\to\infty$ . It will be convenient to introduce, for a function $F\colon\mathbb{N}\to\mathbb{R}_{>0}$ , the shorthand $\widetilde{O}(F(\ell))$ to denote $O(\ell^{O(1)}F(\ell))$ . Thus, for instance, addition or multiplication of two integers of size $O(k^{\ell})$ can be performed using $\widetilde{O}(1)$ operations.

At several points, we need to compute the values of $a(n)$ where $n=O(k^{\ell})$ . For a word $w\in\Sigma_{k}^{*}$ with length $\left|w\right|\leq\ell$ , computing $\#(w,n)$ directly from the definition requires $\widetilde{O}(1)$ operations. Since $\left|A\right|\leq k^{\ell}$ , the values $\#(A,n)$ and $a(n)$ can be computed in time $\widetilde{O}(k^{\ell})$ . Consequently, we can also compute $h(n)$ in time $\widetilde{O}(k^{\ell})$ .

Following the steps in subsection 4.3, we compute $\gamma^{(r)}(1)$ for all $0\leq r<k^{\ell}$ . It takes $\widetilde{O}(k^{\ell})$ operations to write the values of $r$ ( $0\leq r<k^{\ell}$ ) in an order consistent with $\nu(r)$ . Note that each of the formulae (30), (31), (32) produces the corresponding value of $\gamma^{(r)}(1)$ using $\widetilde{O}(1)$ arithmetic operations on rational numbers. One can also check by a simple inductive argument that all denominators and numerators that appear in these computations are bounded by $O(k^{\ell})$ , and hence each arithmetic operation takes only $\widetilde{O}(1)$ basic operations. We also note that all the denominators take the form $(k\pm 1)k^{\alpha}$ .

We next proceed to the computation of the sequences $f_{t}$ ( $t=1,2,3,\dots$ ) in subsection 4.4. Strictly speaking, we compute the sequence $w^{(t)}$ , which uniquely determines $f_{t}$ via (33), and the auxiliary sequence $q_{t}$ . For $t\leq k^{\ell}$ , the explicit formula (34) allows us to compute $w^{(t)}$ and $q_{t}$ with $\widetilde{O}(k^{2\ell})$ operations (note that $w^{(t)}=\big(w^{(t)}_{r,e}\big)_{r,e}$ has $k^{2\ell}$ entries, so this is the least number of operations possible).

Let us now consider the amount of computation required to compute $f_{t}$ for $t>k^{\ell}$ . Consider any $u,v$ , as in the iterative procedure in the second half of subsection 4.4. We note that the application of Lemma 4.4 used to compute $w^{\prime}$ in (35) requires no more than $\widetilde{O}(k^{3\ell})$ arithmetic operations (for each of $O(k^{\ell})$ summands in the decomposition of $g$ , we substitute a sum of size $O(k^{2\ell})$ ). Once $w^{\prime}$ is computed, it only takes $\widetilde{O}(k^{2\ell})$ operations to compute $w^{\prime\prime}$ . Then, for each of $O(k^{\ell})$ values of $q^{\prime}$ , in order to verify if $g^{\prime}_{q^{\prime}}$ should be appended to the list $f_{1},f_{2},\dots$ , we need to verify if the corresponding vector of coefficients belongs to a certain linear subspace of $\mathbb{R}^{2k^{\ell}}$ , see (38). Keeping track of how much the complexity increases in each step of the construction, we see that for each $t>k^{\ell}$ , the entries of $w^{(t)}$ are rational numbers whose numerators are $\widetilde{O}(k^{3t})$ , and whose denominators are $O(k^{t})$ and divide $(k^{2}-1)k^{\alpha}$ for some integer $\alpha$ . Thus, in (38) we may scale all of the relevant vectors by a factor of $(k^{2}-1)k^{O(u)}$ , leaving us with the task of verifying if an integer-valued vector belongs to the span of other integer-valued vectors. The latter task is well-known to have polynomial complexity (with respect to dimensions and lengths of representations of entries), see e.g. [BCS97, Chpt. 16]. Hence, for each $q^{\prime}$ , in order to decide if $g^{\prime}_{q^{\prime}}$ should be appended, we perform $\widetilde{O}(k^{O(\ell)})=k^{O(\ell)}$ operations. Consequently, the number of operations needed to process the step corresponding to the index $u$ is $k^{O(\ell)}$ .

Because of the linear independence conditions discussed at the end of subsection 4.4, the total number of the sequences $f_{1},f_{2},\dots$ we construct is at most $2k^{2\ell+1}$ . It follows that in total, we perform at most $k^{O(\ell)}$ operations.

5. Dilation-invariant sequences

We now turn to the classification of dilation-invariant pattern sequences. Throughout, let $A\subseteq\Sigma_{k}^{*}$ be a set of patterns with no leading or trailing zeros, and let $a=a_{A}$ be the corresponding pattern sequence. We also retain the notation from Section 4, specifically the coefficients $\gamma_{r}$ defined in (25). We let $\ell=\max_{v\in A}\left|v\right|$ denote the length of $a$ , and we assume that $\ell\geq 2$ .

The following condition turns out to be closely connected to the question of whether $a$ is noncorrelated:

[TABLE]

Above, using the standard notation from semigroup theory, for a word $u\in\Sigma_{k}^{*}$ and a set $X\subseteq\Sigma_{k}^{*}$ , we let $Xu^{-1}:=\left\{v\in\Sigma_{k}^{*}\ \middle|\ vu\in X\right\}$ .

Remark 5.1.

The condition ( $\dagger$ ) can be stated in simpler terms when $k=2$ . Then, necessarily, $\{i_{0},i_{1}\}=\{\mathtt{0},\mathtt{1}\}$ and since $A$ has no trailing zeros, $A\mathtt{0}^{-1}=\emptyset$ . Hence, ( $\dagger$ ) says that $\left|A(u\mathtt{1})^{-1}\right|=1$ for all $u\in\Sigma_{k}^{\ell-2}$ . Because all patterns in $A$ have length $\leq\ell$ , $A(u\mathtt{1})^{-1}\subseteq\{\mathtt{0},\mathtt{1}\}$ ; and because $A$ has no leading zeros, $\mathtt{0}\not\in A(u\mathtt{1})^{-1}$ . Thus, ( $\dagger$ ) reduces to the statement that $\mathtt{1}u\mathtt{1}\in A$ for all $u\in\Sigma_{k}^{\ell-2}$ , that is, $\mathtt{1}\Sigma_{2}^{\ell-2}\mathtt{1}\subseteq A$ . This is precisely the assumption that appears in Theorem C.
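In the binary case the reduced form of the condition is straightforward to test mechanically. The following Python sketch checks the inclusion $\mathtt{1}\Sigma_{2}^{\ell-2}\mathtt{1}\subseteq A$ for a set of binary patterns given as strings; the function name is hypothetical, and the input is assumed to have no leading or trailing zeros and length $\ell\geq 2$ , as in the standing assumptions of this section.

```python
from itertools import product


def satisfies_dagger_binary(A):
    """Check that every word 1u1 with u in {0,1}^(ell-2) belongs to A,
    where ell is the maximal length of a pattern in A."""
    ell = max(len(v) for v in A)
    if ell < 2:
        raise ValueError("the condition is only considered for ell >= 2")
    return all("1" + "".join(u) + "1" in A
               for u in product("01", repeat=ell - 2))


print(satisfies_dagger_binary({"11"}))                # Rudin-Shapiro pattern
print(satisfies_dagger_binary({"101"}))               # 111 is missing
print(satisfies_dagger_binary({"101", "111", "11"}))  # shorter patterns are allowed
```

Note that the test only constrains the patterns of maximal length; patterns of length $<\ell$ may be present or absent freely.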

Remark 5.2.

For general $k\geq 2$ , it is not a priori clear if there exists a set of patterns $A$ such that ( $\dagger$ ) holds. Fix $u\in\Sigma_{k}^{\ell-2}$ and consider the matrix $M=\big(M_{i,j}^{(u)}\big)_{i,j=0}^{k-1}$ where $M_{i,j}^{(u)}=-1$ if $iuj\in A$ and $M_{i,j}^{(u)}=+1$ otherwise. Then ( $\dagger$ ) says that $M^{\mathrm{T}}M=kI$ , where $I$ denotes the identity matrix, meaning that $M$ is a Hadamard matrix. Additionally, $M^{(u)}_{i,j}=+1$ if $i=0$ or $j=0$ , meaning that $M$ is normalized. Conversely, given any normalized Hadamard matrix $M^{\prime}$ , one can easily reconstruct $A$ so that $M=M^{\prime}$ for each choice of $u\in\Sigma_{k}^{\ell-2}$ . Thus, it is possible to satisfy the condition ( $\dagger$ ) if and only if there is at least one Hadamard matrix of dimension $k$ .

The question of existence of Hadamard matrices of a given dimension has long been investigated. They are easily constructed when $k$ is a power of $2$ through a tensor-power construction. More generally, given Hadamard matrices of dimensions $k$ and $k^{\prime}$ one can construct a Hadamard matrix of dimension $k\cdot k^{\prime}$ . It is conjectured that Hadamard matrices exist for $k=1,2$ and all $k$ divisible by $4$ . So far, this has been confirmed for $k<668$ . See e.g. [CD07, Chpt. V] for further discussion.
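For $k$ a power of $2$ , the tensor-power construction mentioned above is the classical Sylvester doubling, and the correspondence between normalized Hadamard matrices and pattern sets from Remark 5.2 is easy to make concrete. The Python sketch below is illustrative only: the function names are hypothetical, patterns are tuples of digits, and `patterns_from_hadamard` produces only the patterns of length exactly $\ell$ , using the same matrix for every $u$ .

```python
from itertools import product


def sylvester(t):
    """Hadamard matrix of dimension 2**t via the doubling H -> [[H, H], [H, -H]]."""
    H = [[1]]
    for _ in range(t):
        H = [row + row for row in H] + [row + [-x for x in row] for row in H]
    return H


def is_hadamard(M):
    """Check that the entries are +-1 and that M^T M = k I."""
    k = len(M)
    entries_ok = all(x in (1, -1) for row in M for x in row)
    return entries_ok and all(
        sum(M[r][i] * M[r][j] for r in range(k)) == (k if i == j else 0)
        for i in range(k) for j in range(k))


def is_normalized(M):
    """First row and first column consist of +1 only."""
    return all(x == 1 for x in M[0]) and all(row[0] == 1 for row in M)


def patterns_from_hadamard(M, ell):
    """Patterns i u j of length ell with M[i][j] = -1, for every u in Sigma_k^(ell-2)."""
    k = len(M)
    return {(i,) + u + (j,)
            for u in product(range(k), repeat=ell - 2)
            for i in range(k) for j in range(k)
            if M[i][j] == -1}


H = sylvester(2)
print(is_hadamard(H), is_normalized(H))
print(sorted(patterns_from_hadamard(sylvester(1), 3)))
```

Because a normalized matrix has $+1$ in its first row and column, the resulting patterns automatically have no leading or trailing zeros; for $k=2$ one recovers the set $\mathtt{1}\Sigma_{2}^{\ell-2}\mathtt{1}$ .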

The main goal of this section is to prove a slightly more general variant of Theorem C. The second part of this theorem asserts that if $a$ is noncorrelated, $k=2$ and $\ell\leq 5$ then ( $\dagger$ ) holds. This is verified by exhaustive search (code available from the author), using the methods developed in Section 4. The remaining part of Theorem C follows from the following result, whose proof will occupy the remainder of this section.

Proposition 5.3.

Suppose that ( $\dagger$ ) holds. Then the sequence $a$ is noncorrelated.

From this point onwards, assume that ( $\dagger$ ) holds. Proceeding along similar lines as in Lemma 3.3 (or Section 4.3), we will compute $\gamma_{r}(m)$ for small values of $m\in\mathbb{N}_{0}$ ( $0\leq r<k^{\ell}$ ). The following lemma is the main consequence of ( $\dagger$ ) that we use.

Lemma 5.4.

Let $u\in\Sigma_{k}^{\ell-2}$ and $j_{0},j_{1}\in\Sigma_{k}$ , $j_{0}\neq j_{1}$ . Then

[TABLE]

Proof.

Multiplying by $a\left([uj_{0}]_{k}\right)a\left([uj_{1}]_{k}\right)$ , we see that (39) is equivalent to

[TABLE]

For each pattern $v$ in $A$ of length $<\ell$ and each $i\in\Sigma_{k}$ , considering the different positions where $v$ can appear, one can check that

[TABLE]

Conversely, if $v\in A$ and $\left|v\right|=\ell$ then

[TABLE]

since $\left|uj_{0}\right|,\left|uj_{1}\right|<\ell$ , and for each $i\in\Sigma_{k}$

[TABLE]

Substituting the above identities into the sum on the left-hand side of (39) and applying ( $\dagger$ ) we conclude that

[TABLE]

Lemma 5.5.

Let $0\leq r<k^{\ell}$ and $m\geq 0$ . Put $j=r\bmod{k}$ . Then

[TABLE]

Proof.

Let us write $m=km^{\prime}+i$ with $m^{\prime}\geq 0$ and $i\in\Sigma_{k}$ . Then by Lemma 4.4 (or, equivalently, by Lemma 3.2) we have

[TABLE]

where as usual $r^{\prime}\in k^{\ell-1}\Sigma_{k}+\left\lfloor r/k\right\rfloor$ and $e^{\prime}=\left\lfloor(i+j)/k\right\rfloor\in\{0,1\}$ . We consider several different cases.

Case 0: $m=0$ . It follows directly from the definition of $\gamma_{r}$ that

[TABLE]

Case 1: $m\neq 0$ and $j+m<k$ . Applying (41) and noticing that $i=m$ , $m^{\prime}=0$ , and $e^{\prime}=0$ , we obtain

[TABLE]

where the second equality holds because $\left\lfloor r/k\right\rfloor=\left\lfloor(r+m)/k\right\rfloor$ .

In all of the remaining cases, we will show that $\gamma_{r}(m)=0$ . We start with the simplest situation where $e^{\prime}=1$ .

Case 2: $m=1$ and $j+m\geq k$ , meaning that $j=k-1$ . Let $\nu(r)$ denote the first position where a digit distinct from $k-1$ appears in the expansion of $r$ , allowing $\nu(r)=\alpha$ if $r=k^{\alpha}-1$ . By (41),

[TABLE]

If $\nu(r)=1$ then from the previously considered cases and Lemma 5.4 it follows that

[TABLE]

If $1<\nu(r)<\ell$ then $\nu(r^{\prime})=\nu(r)-1$ for all $r^{\prime}$ that enter the sum (43). Hence, reasoning by induction on $\nu(r)$ we conclude that $\gamma_{r}(1)=0$ . Finally, if $\nu(r)=\ell$ then $r=k^{\ell}-1$ , and $\nu(r^{\prime})=\ell-1$ for all $r^{\prime}$ that appear in the sum (43) except for $r^{\prime}=r$ . It follows that

[TABLE]

which is only possible if $\gamma_{k^{\ell}-1}(1)=0$ .

Case 3: $2\leq m<k$ and $j+m\geq k$ . By (41) and Case 2,

[TABLE]

Case 4: $k\leq m<k^{2}$ . By (41),

[TABLE]

Let $j^{\prime}:=\left\lfloor r/k\right\rfloor\bmod k$ and $i^{\prime}:=m^{\prime}+e^{\prime}$ . Note that $r^{\prime}\bmod{k}=j^{\prime}$ for all $r^{\prime}$ in the sum in (45), where we are using the fact that $\ell\geq 2$ . We have several subcases to consider. If $j^{\prime}+i^{\prime}<k$ then

[TABLE]

by Cases 0 and 1 and Lemma 5.4. If $j^{\prime}+i^{\prime}\geq k$ while $i^{\prime}<k$ (i.e. $m^{\prime}\neq k-1$ or $e^{\prime}\neq 1$ ) then $\gamma_{r^{\prime}}(m^{\prime}+e^{\prime})=0$ for all $r^{\prime}$ by Cases 2 and 3, and consequently also $\gamma_{r}(m)=0$ . Finally, if $i^{\prime}=k$ (i.e. $m^{\prime}=k-1$ and $e^{\prime}=1$ ) then

[TABLE]

by the previously considered subcases.

Case 5: $m\geq k^{2}$ . We reason by induction on $m$ . By (41) and the inductive assumption,

[TABLE]

since $k\leq m^{\prime}+e^{\prime}<m$ . ∎
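The case distinction in the proof above can be summarised in a few lines of Python. This is a sketch under the assumption, suggested by Case 1 but not restated in this part of the text, that the recurrence (41) uses the decomposition $m=km^{\prime}+i$ with $i\in\Sigma_{k}$ together with $j=r\bmod k$ and $e^{\prime}=\left\lfloor(i+j)/k\right\rfloor$. The function names `split_shift` and `case_of` are hypothetical.

```python
def split_shift(r: int, m: int, k: int) -> tuple[int, int, int, int]:
    """Return (j, i, m', e') where j = r mod k, m = k*m' + i with 0 <= i < k,
    and e' = floor((i + j) / k) is the carry (assumed form of (41))."""
    j = r % k
    m_prime, i = divmod(m, k)
    e_prime = (i + j) // k
    return j, i, m_prime, e_prime


def case_of(r: int, m: int, k: int) -> int:
    """Number (0 to 5) of the case in the proof that handles the pair (r, m)."""
    j = r % k
    if m == 0:
        return 0
    if m < k:
        if j + m < k:
            return 1                   # no carry out of the last digit
        return 2 if m == 1 else 3      # carry, so j + m >= k
    return 4 if m < k * k else 5
```

Every pair $(r,m)$ falls into exactly one branch, so the six cases are exhaustive. For $m\geq k^{2}$ the output of `split_shift` also makes it easy to test the inequality $k\leq m^{\prime}+e^{\prime}<m$ that drives the induction in Case 5.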

Now that we have computed the values of the coefficients $\gamma_{r}(m)$ , the remainder of the argument is straightforward.

Proof of Proposition 5.3.

We need to show that

[TABLE]

for all $m\geq 1$ . If $m\geq k$ there is nothing to prove since $\gamma_{r}(m)=0$ . Suppose now that $1\leq m<k$ . We may write an arbitrary $0\leq r<k^{\ell}$ in the form $r=k^{\ell-1}i+ks+j$ where $i,j\in\Sigma_{k}$ and $0\leq s<k^{\ell-2}$ . Then, $\gamma_{r}(m)=0$ if $j+m\geq k$ and $\gamma_{r}(m)=a(r)a(r+m)$ otherwise. It follows that

[TABLE]

where the inner-most sum vanishes by Lemma 5.4. ∎
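The decomposition $r=k^{\ell-1}i+ks+j$ used in this proof simply separates the leading digit, the middle block and the last digit of $r$ in base $k$. A minimal Python sketch follows, assuming $\ell\geq 2$ and $\Sigma_{k}=\{0,\dots,k-1\}$; the helper names `decompose` and `contributing_residues` are illustrative only.

```python
def decompose(r: int, k: int, ell: int) -> tuple[int, int, int]:
    """Write r = k**(ell-1) * i + k * s + j with i, j in {0, ..., k-1}
    and 0 <= s < k**(ell-2); returns (i, s, j)."""
    if not (ell >= 2 and 0 <= r < k ** ell):
        raise ValueError("need ell >= 2 and 0 <= r < k**ell")
    i, rest = divmod(r, k ** (ell - 1))  # leading digit and remainder
    s, j = divmod(rest, k)               # middle block and last digit
    return i, s, j


def contributing_residues(m: int, k: int, ell: int) -> list[int]:
    """Residues r with j + m < k, i.e. those for which the proof gives
    gamma_r(m) = a(r) a(r + m) rather than 0 (intended for 1 <= m < k)."""
    return [r for r in range(k ** ell) if decompose(r, k, ell)[2] + m < k]
```

For a fixed shift $1\leq m<k$ exactly $(k-m)k^{\ell-1}$ residues survive, namely those whose last digit satisfies $j<k-m$, and the remaining sum over the leading digit $i$ is the one that the proof disposes of via Lemma 5.4.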

Remark 5.6.

Let $a^{\prime}\colon\mathbb{N}_{0}\to\{+1,-1\}$ be a sequence such that $a^{\prime}/a$ is $k^{\ell-1}$ -periodic. Then $a^{\prime}$ is a pattern sequence by Lemma 2.2. Defining $\gamma^{\prime}$ and $\gamma_{r}^{\prime}$ in analogy to $\gamma$ and $\gamma_{r}$ , with $a^{\prime}$ in place of $a$ , a direct computation shows for all $m\geq 0$ and $0\leq r<k^{\ell}$ that

[TABLE]

It follows that $\gamma_{r}^{\prime}(m)=0$ for all $m\geq k$ . In particular, $\gamma^{\prime}(m)=0$ for all $m\geq k$ .

We check by exhaustive search that all noncorrelated binary pattern sequences of length $\leq 4$ can arise as $a^{\prime}$ in the construction outlined above. It seems plausible that the same holds for all lengths. If this is the case, and if Conjecture 1.1 holds true, then the task of verifying whether a given binary pattern sequence $b^{\prime}$ is noncorrelated can be split into two independent steps. First, check if the dilation-invariant sequence $b$ obtained from $b^{\prime}$ in Lemma 2.9 satisfies ($\dagger$); if not, then $b^{\prime}$ is not noncorrelated. (For the sake of simplicity, we work under the additional assumption that $b$ and $b^{\prime}$ have equal lengths, which is not true in general.) Second, check if the $\pm$ signs in (the analogue of) (47) align in a way that ensures $\gamma_{b^{\prime}}(1)=0$ . While the condition from the first step is quite conceptual, it appears that the second step relies mostly on arithmetic coincidence. This would provide an intuitive explanation for why the results in the dilation-invariant case are considerably more concise.

Bibliography

[AL91] J.-P. Allouche and P. Liardet. Generalized Rudin-Shapiro sequences. Acta Arith., 60(1):1–27, 1991.
[AS92] J.-P. Allouche and J. Shallit. The ring of $k$-regular sequences. Theoret. Comput. Sci., 98(2):163–197, 1992.
[AS99] J.-P. Allouche and J. Shallit. The ubiquitous Prouhet-Thue-Morse sequence. In Sequences and their applications (Singapore, 1998), Springer Ser. Discrete Math. Theor. Comput. Sci., pages 1–16. Springer, London, 1999.
[AS03a] J.-P. Allouche and J. Shallit. Automatic sequences. Cambridge University Press, Cambridge, 2003. Theory, applications, generalizations.
[AS03b] J.-P. Allouche and J. Shallit. The ring of $k$-regular sequences. II. Theoret. Comput. Sci., 307(1):3–29, 2003. Words.
[BCM89] D. W. Boyd, J. Cook, and P. Morton. On sequences of $\pm 1$'s defined by binary patterns. Dissertationes Math. (Rozprawy Mat.), 283:64, 1989.
[BCS97] P. Bürgisser, M. Clausen, and M. A. Shokrollahi. Algebraic complexity theory, volume 315 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, 1997. With the collaboration of Thomas Lickteig.
[CD07] C. J. Colbourn and J. H. Dinitz, editors. Handbook of combinatorial designs. Discrete Mathematics and its Applications (Boca Raton). Chapman & Hall/CRC, Boca Raton, FL, second edition, 2007.
