Highly nonlinear functions over finite fields

Kai-Uwe Schmidt

arXiv:1906.11678·math.CO·September 17, 2019·Finite Fields Their Appl.

Highly nonlinear functions over finite fields

Kai-Uwe Schmidt

PDF

TL;DR

This paper proves a long-standing conjecture about the maximum Hamming distance of functions from finite fields to finite fields, extending previous results and determining the asymptotic behavior of Reed-Muller codes.

Contribution

It generalizes the Patterson-Wiedemann conjecture to all finite fields, using advanced number theory and probabilistic methods, and determines the asymptotic covering radius.

Findings

01

Proves the conjecture for most finite fields unconditionally.

02

Establishes the asymptotic maximum distance for functions as dimension grows.

03

Determines the asymptotic covering radius of Reed-Muller codes.

Abstract

We consider a generalisation of a conjecture by Patterson and Wiedemann from 1983 on the Hamming distance of a function from $F_{q}^{n}$ to $F_{q}$ to the set of affine functions from $F_{q}^{n}$ to $F_{q}$ . We prove the conjecture for each $q$ such that the characteristic of $F_{q}$ lies in a subset of the primes with density $1$ and we prove the conjecture for all $q$ by assuming the generalised Riemann hypothesis. Roughly speaking, we show the existence of functions for which the distance to the affine functions is maximised when $n$ tends to infinity. This also determines the asymptotic behaviour of the covering radius of the $[q^{n}, n + 1]$ Reed-Muller code over $F_{q}$ and so answers a question raised by Leducq in 2013. Our results extend the case $q = 2$ , which was recently proved by the author and which corresponds to the original conjecture by…

Equations200

d (g, h) = # {y \in F_{q}^{n} : g (y) \neq = h (y)} .

d (g, h) = # {y \in F_{q}^{n} : g (y) \neq = h (y)} .

N (g) = h min d (g, h),

N (g) = h min d (g, h),

μ_{q} (n) = \frac{q ^{n - 1} ( q - 1 ) - ρ _{q} ( n )}{q ^{n /2 - 1}} .

μ_{q} (n) = \frac{q ^{n - 1} ( q - 1 ) - ρ _{q} ( n )}{q ^{n /2 - 1}} .

1 \leq μ_{q} (n + 2 k) \leq μ_{q} (n)

1 \leq μ_{q} (n + 2 k) \leq μ_{q} (n)

1 \leq μ_{q} (n) \leq q

1 \leq μ_{q} (n) \leq q

μ_{2} (n) \leq \frac{27}{32} 2 = 1.19 \dots for each n \geq 15

μ_{2} (n) \leq \frac{27}{32} 2 = 1.19 \dots for each n \geq 15

μ_{2} (n) \leq \frac{7}{8} 2 = 1.23 \dots for each n \geq 9 .

μ_{2} (n) \leq \frac{7}{8} 2 = 1.23 \dots for each n \geq 9 .

n \to \infty lim μ_{2} (n) = 1

n \to \infty lim μ_{2} (n) = 1

μ_{3} (n) \leq \frac{2}{3} 3 = 1.15 \dots for each n \geq 3 .

μ_{3} (n) \leq \frac{2}{3} 3 = 1.15 \dots for each n \geq 3 .

\begin{array}[]{c||ccccccccccccccc}p&2&3&5&7&11&13&17&19&23&29&31&37&41&43&47\\ \hline\cr r&7&23&11&31&7&23&19&31&7&23&11&7&23&19&11\end{array}

\begin{array}[]{c||ccccccccccccccc}p&2&3&5&7&11&13&17&19&23&29&31&37&41&43&47\\ \hline\cr r&7&23&11&31&7&23&19&31&7&23&11&7&23&19&11\end{array}

\frac{ϕ ( ϕ ( r ^{2} ))}{ϕ ( r ^{2} )} = \frac{ϕ ( r - 1 )}{r},

\frac{ϕ ( ϕ ( r ^{2} ))}{ϕ ( r ^{2} )} = \frac{ϕ ( r - 1 )}{r},

d_{i} = d_{i - 1} + \frac{ϕ ( r _{i} - 1 )}{r _{i}} - d_{i - 1} \cdot \frac{ϕ ( r _{i} - 1 )}{r _{i}}

d_{i} = d_{i - 1} + \frac{ϕ ( r _{i} - 1 )}{r _{i}} - d_{i - 1} \cdot \frac{ϕ ( r _{i} - 1 )}{r _{i}}

μ (g) = \frac{q ^{n - 1} ( q - 1 ) - N ( g )}{q ^{n /2 - 1}},

μ (g) = \frac{q ^{n - 1} ( q - 1 ) - N ( g )}{q ^{n /2 - 1}},

μ_{q} (n) = g min μ (g),

μ_{q} (n) = g min μ (g),

q (q - 1) ⌊ \frac{v}{q ( q - 1 )} ⌋

q (q - 1) ⌊ \frac{v}{q ( q - 1 )} ⌋

f (y) = {f_{T} (y) f_{S} (y) for y \in T for y \in S,

f (y) = {f_{T} (y) f_{S} (y) for y \in T for y \in S,

f_{T} (λa y) = f_{T} (y) for each λ \in F_{q}^{*}, each a \in H, and each y \in T .

f_{T} (λa y) = f_{T} (y) for each λ \in F_{q}^{*}, each a \in H, and each y \in T .

μ (f) \leq 1 + 309 q^{5/2} \frac{lo g ( 2 q ^{2} v )}{v} .

μ (f) \leq 1 + 309 q^{5/2} \frac{lo g ( 2 q ^{2} v )}{v} .

ord_{v} (p) = \frac{1}{2} ϕ (v) = \frac{1}{2} (r - 1) r^{e - 1} .

ord_{v} (p) = \frac{1}{2} ϕ (v) = \frac{1}{2} (r - 1) r^{e - 1} .

\frac{A}{2} (1 - (- 1)^{\frac{p - 1}{2}} \frac{1}{p ^{2} - p - 1})

\frac{A}{2} (1 - (- 1)^{\frac{p - 1}{2}} \frac{1}{p ^{2} - p - 1})

A = r prime \prod (1 - \frac{1}{r ( r - 1 )}) = 0.373955 \dots

A = r prime \prod (1 - \frac{1}{r ( r - 1 )}) = 0.373955 \dots

Tr_{K / F} (y) = σ \in Gal (K / F) \sum σ (y)

Tr_{K / F} (y) = σ \in Gal (K / F) \sum σ (y)

η (y) = exp (2 π i Tr_{F_{q} / F_{p}} (y) / p)

η (y) = exp (2 π i Tr_{F_{q} / F_{p}} (y) / p)

ψ (y) = η (Tr_{F_{q^{n}} / F_{q}} (y))

ψ (y) = η (Tr_{F_{q^{n}} / F_{q}} (y))

g (a, λ) = \frac{1}{q ^{n /2}} y \in F_{q^{n}} \sum η (λ g (y)) \overline{ψ (a y)}

g (a, λ) = \frac{1}{q ^{n /2}} y \in F_{q^{n}} \sum η (λ g (y)) \overline{ψ (a y)}

μ (g) = a \in F_{q^{n}} max b \in F_{q} max λ \in F_{q}^{*} \sum \overline{η (λb)} g (λa, λ) .

μ (g) = a \in F_{q^{n}} max b \in F_{q} max λ \in F_{q}^{*} \sum \overline{η (λb)} g (λa, λ) .

\frac{1}{q} λ \in F_{q} \sum η (λ z) = {10 for z = 0 otherwise .

\frac{1}{q} λ \in F_{q} \sum η (λ z) = {10 for z = 0 otherwise .

d (g, h)

d (g, h)

= q^{n - 1} (q - 1) - \frac{1}{q} λ \in F_{q}^{*} \sum y \in F_{q^{n}} \sum η (λ (g (y) - h (y))) .

h_{a, b} (y) = Tr_{F_{q^{n}} / F_{q}} (a y) + b .

h_{a, b} (y) = Tr_{F_{q^{n}} / F_{q}} (a y) + b .

d (g, h_{a, b})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Highly nonlinear functions over finite fields

Kai-Uwe Schmidt

Department of Mathematics, Paderborn University, Warburger Str. 100, 33098 Paderborn, Germany.

[email protected]

(Date: 16 September 2019)

Abstract.

We consider a generalisation of a conjecture by Patterson and Wiedemann from 1983 on the Hamming distance of a function from $\mathbb{F}_{q}^{n}$ to $\mathbb{F}_{q}$ to the set of affine functions from $\mathbb{F}_{q}^{n}$ to $\mathbb{F}_{q}$ . We prove the conjecture for each $q$ such that the characteristic of $\mathbb{F}_{q}$ lies in a subset of the primes with density $1$ and we prove the conjecture for all $q$ by assuming the generalised Riemann hypothesis. Roughly speaking, we show the existence of functions for which the distance to the affine functions is maximised when $n$ tends to infinity. This also determines the asymptotic behaviour of the covering radius of the $[q^{n},n+1]$ Reed-Muller code over $\mathbb{F}_{q}$ and so answers a question raised by Leducq in 2013. Our results extend the case $q=2$ , which was recently proved by the author and which corresponds to the original conjecture by Patterson and Wiedemann. Our proof combines evaluations of Gauss sums in the semiprimitive case, probabilistic arguments, and methods from discrepancy theory.

2010 Mathematics Subject Classification:

Primary: 05D40; Secondary: 94B05

1. Introduction and results

The Hamming distance of two functions $g,h:\mathbb{F}_{q}^{n}\to\mathbb{F}_{q}$ is

[TABLE]

We define the nonlinearity of $g:\mathbb{F}_{q}^{n}\to\mathbb{F}_{q}$ to be

[TABLE]

where the minimum is over all $q^{n+1}$ affine functions $h$ from $\mathbb{F}_{q}^{n}$ to $\mathbb{F}_{q}$ . We are interested in functions with largest nonlinearity. Accordingly define $\rho_{q}(n)$ to be the maximum of $N(g)$ over all functions $g$ from $\mathbb{F}_{q}^{n}$ to $\mathbb{F}_{q}$ .

The number $\rho_{2}(n)$ equals the covering radius of binary Reed-Muller code of order one $R_{2}(1,n)$ [6] and in general $\rho_{q}(n)$ is the covering radius of the appropriate generalisation $R_{q}(1,n)$ over $\mathbb{F}_{q}$ [10]. The determination of the covering radius of $R_{q}(1,n)$ appears to be one of the most mysterious problems in coding theory [17], [10]. We refer to [7] for background on Reed-Muller codes over $\mathbb{F}_{q}$ and to [2] for background on the covering radius of codes in general and its combinatorial and geometric significance.

It is convenient to use the normalisation

[TABLE]

It is known that

[TABLE]

for all prime powers $q$ and all positive integers $n$ and $k$ . This was proved in [6] for $q=2$ and in [10, Proposition 11 and Lemma 19] for all $q$ . It is not difficult to see that $\mu_{q}(2)=1$ and so $\mu_{q}(n)=1$ for all even $n$ , as shown in [6, Corollary 1] for $q=2$ and [10, Corollary 13] for all $q$ .

We are interested in the case that $n$ is odd. It is readily verified [10, p. 1594] that $\mu_{q}(1)=\sqrt{q}$ and therefore

[TABLE]

for all prime powers $q$ and all positive integers $n$ . It is known that $\mu_{2}(n)=\sqrt{2}$ for each $n\in\{3,5,7\}$ [13]. Patterson and Wiedemann [15] improved the upper bound in (3) for $q=2$ to

[TABLE]

and, more recently, Kavut and Yücel [8] showed that

[TABLE]

A famous conjecture by Patterson and Wiedemann [15] asserts that

[TABLE]

and this conjecture was recently proved in [16].

This paper concerns the case that $q>2$ . Leducq [10] herself was able to improve the upper bound in (3) for $q=3$ , by showing that $\mu_{3}(3)=\tfrac{2}{3}\sqrt{3}$ and so

[TABLE]

This suggests that for $q>2$ a similar phenomenon occurs as in the case $q=2$ and indeed we prove a corresponding result for many values of $q$ .

Theorem 1.1.

Let $q$ be a power of a prime $p$ and suppose that there is another prime $r>3$ such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{2}$ . Then $\lim_{n\to\infty}\mu_{q}(n)=1$ .

We list possible primes $r$ satisfying the conditions of Theorem 1.1 for the first 15 primes $p$ :

[TABLE]

For each prime $r$ , there are $\phi(\phi(r^{2}))$ primitive roots modulo $r^{2}$ and by Dirichlet’s theorem on primes in arithmetic progressions, each of the corresponding $\phi(\phi(r^{2}))$ congruence classes modulo $r^{2}$ contains a fraction of $1/\phi(r^{2})$ of all primes. Hence, by taking a prime $r>3$ with $r\equiv 3\pmod{4}$ , the condition of Theorem 1.1 is satisfied for all $p$ in a subset of the primes with density

[TABLE]

where $\phi$ is Euler’s totient function. For example, for $2/7$ of all primes $p$ , we can take $r=7$ in Theorem 1.1.

It is known from [20] and [3] that there are infinitely many primes of the form $r=2\ell+1$ , where $\ell\geq 3$ is an odd number with at most three prime factors. Let $r_{k}$ be the $k$ -th prime of this form. By the Chinese Remainder Theorem, the density of primes $p$ such that the condition in Theorem 1.1 is satisfied for one of the primes $r_{1},\dots,r_{k}$ is $d_{k}$ , where $d_{k}$ can be recursively defined by $d_{1}=\phi(r_{1}-1)/r_{1}$ and

[TABLE]

for all $i\geq 2$ . Since $r_{k}-1$ has a bounded number of prime factors, $\phi(r_{k}-1)/r_{k}$ is bounded from below by some positive number and hence we have $\lim_{k\to\infty}d_{k}=1$ . We therefore obtain the following corollary of Theorem 1.1.

Corollary 1.2.

We have $\lim_{n\to\infty}\mu_{q}(n)=1$ for all powers $q$ of a prime $p$ lying in a subset of the primes with density $1$ .

We shall see that the conclusion of Corollary 1.2 can be proved for all prime powers $q$ if one can show that, for each prime $p$ , there are infinitely many primes $r\equiv 3\pmod{4}$ such that $-p$ is a primitive root modulo $r$ . This is known to be true conditionally under the Generalised Riemann Hypothesis (GRH) and gives the following result.

Theorem 1.3.

Assume GRH. Then we have $\lim_{n\to\infty}\mu_{q}(n)=1$ for all prime powers $q$ .

For the proof of our results we use a semiprobabilistic construction. We present this construction in the next section (Proposition 2.1) and then show how our main results follow from this result. The proof that this construction gives the desired properties uses methods from number theory and discrepancy theory and the details are contained in Sections 3 and 4. The overall structure of the proof is based on the idea of [16] to prove Theorem 1.1 for $q=2$ . However, in the general case, several additional ideas are crucially involved.

2. Proof overview

For a function $g:\mathbb{F}_{q}^{n}\to\mathbb{F}_{q}$ , we define the normalisation

[TABLE]

where $N(g)$ is the nonlinearity of $g$ , given in (1). Hence

[TABLE]

where the minimum is over all functions $g$ from $\mathbb{F}_{q}^{n}$ to $\mathbb{F}_{q}$ . For every $\epsilon>0$ , we shall identify functions $f:\mathbb{F}_{q}^{n}\to\mathbb{F}_{q}$ , which satisfy $\mu(f)\leq 1+\epsilon$ when $n$ is sufficiently large. The construction is semiprobabilistic; it mimics the partial spread construction of so-called bent functions [4], but leaves some freedom, which will bring in probabilistic methods in the proof of our main results.

Henceforth we identify $\mathbb{F}_{q}^{n}$ with the field $\mathbb{F}_{q^{n}}$ . Let $H$ be a (multiplicative) subgroup of $\mathbb{F}_{q^{n}}^{*}$ of index $v$ . Let $T$ be a union of

[TABLE]

cosets of $H$ such that, if the coset $aH$ is contained in $T$ , then the coset $\lambda aH$ is contained in $T$ for each $\lambda\in\mathbb{F}_{q}^{*}$ . Put $S=\mathbb{F}_{q^{n}}\setminus T$ . Note that $v$ is not divisible by $q$ and so $S\setminus\{0\}$ is a union of at least $1$ and at most $q^{2}-q-1$ cosets of $H$ . We consider functions $f:\mathbb{F}_{q^{n}}\to\mathbb{F}_{q}$ of the form

[TABLE]

where $f_{T}$ is a function from $T$ to $\mathbb{F}_{q}$ and $f_{S}$ is a function from $S$ to $\mathbb{F}_{q}$ . The function $f_{T}$ is defined such that $f_{T}$ takes on every value of $\mathbb{F}_{q}$ equally often and such that

[TABLE]

That is, $f_{T}$ is constant on the cosets of $\mathbb{F}_{q}^{*}$ and also constant on the cosets of $H$ . The function $f_{S}$ will be determined later.

Recall that $\operatorname{ord}_{m}(a)$ for integers $m$ and $a$ with $m>0$ and $\gcd(a,m)=1$ is the smallest positive integer $t$ such that $m\mid a^{t}-1$ . Note that, if we fix $v$ , then for every multiple $n$ of $\operatorname{ord}_{v}(q)$ , there exists a subgroup of $\mathbb{F}_{q^{n}}^{*}$ of index $v$ . In particular, if $p$ is the characteristic of $\mathbb{F}_{q}$ , then $\operatorname{ord}_{v}(q)$ divides $\operatorname{ord}_{v}(p)$ , and so such a subgroup exists for every multiple $n$ of $\operatorname{ord}_{v}(p)$ .

Proposition 2.1.

Let $e$ be a positive integer, let $p$ be the characteristic of $\mathbb{F}_{q}$ , and suppose that $r>3$ is another prime such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{e}$ . Put $v=r^{e}$ . Then there is an odd multiple $n$ of $\operatorname{ord}_{v}(p)$ and a function $f_{S}$ such that the function $f$ defined in (5) satisfies

[TABLE]

Remark.

With the notation as in Proposition 2.1, we have that $-1$ is a nonsquare modulo $v$ , which implies that

[TABLE]

Hence $\operatorname{ord}_{v}(p)$ is odd. Therefore $f$ is a function on an extension of $\mathbb{F}_{q}$ of odd degree.

Before we prove Proposition 2.1 we shall first deduce Theorems 1.1 and 1.3 from Proposition 2.1. Recall from elementary number theory (see [14, p. 102], for example) that the condition in Theorem 1.1 implies that $-p$ is a primitive root modulo $r^{e}$ for all positive integers $e$ . We can therefore take $e$ , and hence $v$ , in Proposition 2.1 arbitrarily large. Using (2) and $\mu_{q}(2)=1$ , we then obtain Theorem 1.1.

To deduce Theorem 1.3, we use the following special case of a result by Moree [12].

Proposition 2.2 ([12, Theorem 1.3]).

Assume GRH. Let $p$ be a prime. Then the density of primes $r\equiv 3\pmod{4}$ such that $-p$ is a primitive root modulo $r$ is

[TABLE]

for odd $p$ and $A/2$ for $p=2$ , where

[TABLE]

is Artin’s constant

Now for fixed $q$ , Proposition 2.2 implies, conditional on GRH, the existence of infinitely many primes $r$ for which we can apply Proposition 2.1 with $e=1$ . Using again (2) and $\mu_{q}(2)=1$ , we then obtain Theorem 1.3.

To prove Proposition 2.1, we shall turn the problem of estimating the nonlinearity of a function into a problem of estimating certain character sums. Recall that, for a finite field extension $K/F$ , the trace function $\operatorname{Tr}_{K/F}:K\to F$ is given by

[TABLE]

for each $y\in K$ . We define $\eta$ and $\psi$ to be the canonical additive characters of $\mathbb{F}_{q}$ and $\mathbb{F}_{q^{n}}$ , respectively. Denoting by $p$ the characteristic of $\mathbb{F}_{q}$ , we have

[TABLE]

for each $y\in\mathbb{F}_{q}$ and

[TABLE]

for each $y\in\mathbb{F}_{q^{n}}$ .

The Fourier transform of a function $g:\mathbb{F}_{q^{n}}\to\mathbb{F}_{q}$ is defined to be the function $\widehat{g}:\mathbb{F}_{q^{n}}\times\mathbb{F}_{q}\to\mathbb{C}$ given by

[TABLE]

for each $a\in\mathbb{F}_{q^{n}}$ and each $\lambda\in\mathbb{F}_{q}$ .

The following lemma gives the relationship between the nonlinearity of a function and its Fourier transform.

Lemma 2.3.

For every function $g:\mathbb{F}_{q^{n}}\to\mathbb{F}_{q}$ we have

[TABLE]

Proof.

For every $z\in\mathbb{F}_{q}$ , we have

[TABLE]

Therefore, for every function $h:\mathbb{F}_{q^{n}}\to\mathbb{F}_{q}$ , we have

[TABLE]

Now notice that the affine functions from $\mathbb{F}_{q^{n}}$ to $\mathbb{F}_{q}$ are precisely the $q^{n+1}$ functions $h_{a,b}$ for $a\in\mathbb{F}_{q^{n}}$ and $b\in\mathbb{F}_{q}$ , given by

[TABLE]

Therefore

[TABLE]

and the lemma follows from the definition (1) of the nonlinearity of $g$ and the normalisation (4). ∎

The strategy for our proof of Proposition 2.1 is to apply Lemma 2.3 to the function $f$ appearing in Proposition 2.1. We then bound the contributions to $\widehat{f}(a,\lambda)$ coming from $f_{T}$ and $f_{S}$ separately. Accordingly we define

[TABLE]

so that $\widehat{f}(a,\lambda)=\widehat{f}_{T}(a,\lambda)+\widehat{f}_{S}(a,\lambda)$ for all $a\in\mathbb{F}_{q^{n}}$ and all $\lambda\in\mathbb{F}_{q}$ . Proposition 2.1 will then follow in a straightforward way from Lemma 2.3 and the forthcoming Propositions 3.6 and 4.2.

3. The function $f_{T}$

Recall that $H$ is a subgroup of $\mathbb{F}_{q^{n}}^{*}$ of index $v$ and $T$ is a union of cosets of $\mathbb{F}_{q}^{*}$ and also a union of cosets of $H$ . By definition, the function $f_{T}:T\to\mathbb{F}_{q}$ takes on every value of $\mathbb{F}_{q}$ equally often and is constant on cosets of $\mathbb{F}_{q}^{*}$ and constant on cosets of $H$ , as given in (6).

For a multiplicative character $\chi$ of $\mathbb{F}_{q^{n}}$ , the Gauss sum $G(\chi)$ is defined to be

[TABLE]

where as before $\psi$ is the canonical additive character of $\mathbb{F}_{q^{n}}$ . It is well known that $\lvert G(\chi)\rvert=q^{n/2}$ if $\chi$ is nontrivial (which means that $\chi(y)\neq 1$ for some $y\in\mathbb{F}_{q^{n}}^{*}$ ) [11, Theorem 5.11].

Our starting point for the analysis of $\widehat{f}_{T}$ is the following lemma.

Lemma 3.1.

Let $\epsilon>0$ and suppose that, for all nontrivial multiplicative characters $\chi$ of $\mathbb{F}_{q^{n}}$ of order dividing $v$ , we have

[TABLE]

Then we have

[TABLE]

for all $a\in\mathbb{F}_{q^{n}}$ and all $b\in\mathbb{F}_{q}$ .

Proof.

Since $f_{T}$ takes on every value of $\mathbb{F}_{q}$ equally often, we have $\widehat{f}_{T}(0,\lambda)=0$ for each $\lambda\in\mathbb{F}_{q}^{*}$ . Hence we may assume that $a\in\mathbb{F}_{q^{n}}^{*}$ . Let $R$ be a set of representatives of the cosets of $H$ belonging to $T$ . For the moment fix $\lambda\in\mathbb{F}_{q}^{*}$ . Then we have

[TABLE]

where $\mathbbm{1}_{H}$ is the indicator of $H$ on $\mathbb{F}_{q^{n}}$ , so that

[TABLE]

Let $\chi$ be a multiplicative character of $\mathbb{F}_{q^{n}}$ of order $v$ . Then

[TABLE]

and for all $c\in\mathbb{F}_{q^{n}}^{*}$ we have

[TABLE]

Substitute into (9) to obtain

[TABLE]

Now write $G(\chi^{j})=q^{n/2}(-1+\gamma_{j})$ , so that $\lvert\gamma_{j}\rvert\leq\epsilon$ for all $j\in\{1,\dots,v-1\}$ by our assumption. Since $\lambda\in\mathbb{F}_{q}^{*}$ and so

[TABLE]

by the definition of $f_{T}$ , we obtain

[TABLE]

where

[TABLE]

From (10) we find that

[TABLE]

Since $f_{T}$ is constant on cosets of $H$ by definition (6), we find that

[TABLE]

Since $a^{-1}\in T$ if and only if $(\lambda a)^{-1}\in T$ and since $f_{T}$ is constant on cosets of $\mathbb{F}_{q}^{*}$ by definition (6), we obtain

[TABLE]

Hence, for all $b\in\mathbb{F}_{q}$ , we have

[TABLE]

On the other hand, by the triangle inequality we can bound $\lvert E(a,\lambda)\rvert$ by $\epsilon v$ for all $\lambda\in\mathbb{F}_{q}^{*}$ and therefore obtain by the triangle inequality

[TABLE]

as required. ∎

The following explicit evaluation of certain Gauss sums [9, Proposition 4.2] (see also [21, Theorem 4.1]) will help us to control the error term in Lemma 3.1.

Lemma 3.2 ([9, Proposition 4.2]).

Let $d$ be a positive integer, let $p$ be a prime, and suppose that $r>3$ is another prime such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{d}$ . Write $k=\phi(r^{d})/2$ , let $\tau$ be a multiplicative character of $\mathbb{F}_{p^{k}}$ of order $r^{d}$ , and let $h$ be the class number of $\mathbb{Q}(\sqrt{-r})$ . Then

[TABLE]

*where $a$ and $b$ are integers satisfying $a,b\not\equiv 0\pmod{p}$ , $a^{2}+b^{2}r=4p^{h}$ , and $ap^{(k-h)/2}\equiv-2\pmod{r}$ . *

Recall that for a finite field extension $K/F$ , the norm function $\operatorname{N}_{K/F}:K\to F$ is defined by

[TABLE]

for each $y\in K$ . Every multiplicative character $\tau$ of $\mathbb{F}_{q}$ can be lifted to a multiplicative character $\chi$ of $\mathbb{F}_{q^{s}}$ by defining

[TABLE]

for each $y\in\mathbb{F}_{q^{s}}$ . Note that, if $d$ is a divisor of $q-1$ , then this lifting is an isomorphism between the character subgroups of order $d$ of $\mathbb{F}_{q}^{*}$ and $\mathbb{F}_{q^{s}}^{*}$ .

The well known Davenport-Hasse Theorem gives the relationship between the two Gauss sums $G(\tau)$ and $G(\chi)$ .

Lemma 3.3 ([11, Theorem 5.14]).

Let $\tau$ be a multiplicative character of $\mathbb{F}_{q}$ and suppose that $\tau$ is lifted to a multiplicative character $\chi$ of $\mathbb{F}_{q^{s}}$ . Then

[TABLE]

Now we obtain the following lemma as a corollary to Lemma 3.2.

Lemma 3.4.

Let $e$ and $d$ be integers satisfying $1\leq d\leq e$ and let $p$ be the characteristic of $\mathbb{F}_{q}$ . Suppose that $r>3$ is another prime such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{e}$ . Write $m=\phi(r^{e})/2$ and $q=p^{t}$ and let $h$ be the class number of $\mathbb{Q}(\sqrt{-r})$ . Then there are nonzero integers $a$ and $b$ such that

[TABLE]

for all multiplicative characters $\chi$ of $\mathbb{F}_{q^{m}}$ of order $r^{d}$ , where the sign can depend on $\chi$ .

Proof.

Note that $-p$ is also a primitive root modulo $p^{d}$ . Write $k=\phi(q^{d})/2$ and let $\tau$ be the multiplicative character of $\mathbb{F}_{p^{k}}$ of order $r^{d}$ such that $\chi$ is the lifted character of $\tau$ . Lemma 3.2 implies that there are nonzero integers $a$ and $b$ such that

[TABLE]

where the sign can depend on $\chi$ . By Lemma 3.3 we have

[TABLE]

and the lemma follows since $m/k=\phi(r^{e})/\phi(r^{d})=r^{e-d}$ . ∎

The next lemma gives the desired control for the error term in Lemma 3.1.

Lemma 3.5.

Let $e$ be a positive integer and let $p$ be the characteristic of $\mathbb{F}_{q}$ . Suppose that $r>3$ is another prime such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{e}$ . Write $m=\phi(r^{e})/2$ and let $\epsilon>0$ . Then there is an infinite set $I$ of odd positive integers such that, for all $s\in I$ and all nontrivial multiplicative characters $\chi$ of $\mathbb{F}_{q^{sm}}$ of order dividing $r^{e}$ , we have

[TABLE]

Here, $\arg(\xi)\in(-\pi,\pi]$ is the principal angle of a nonzero complex number $\xi$ .

Proof.

Let $\tau$ be a multiplicative character of $\mathbb{F}_{q^{m}}$ of order $r^{e}$ . Since $r>3$ , the units in the ring of algebraic integers of $\mathbb{Q}(\sqrt{-r})$ are $\pm 1$ , so that $\pm 1$ are the only roots of unity in $\mathbb{Q}(\sqrt{-r})$ . It then follows from Lemma 3.4 that $G(\tau)/q^{m/2}$ is not a root of unity. Therefore Weyl’s uniform distribution theorem [19, Satz 2] implies that $([G(\tau)/q^{m/2}]^{2i})_{i\in\mathbb{N}}$ , and therefore also $(G(\tau)/q^{m/2}]^{2i+1})_{i\in\mathbb{N}},$ is uniformly distributed on the complex unit circle. Hence there is an infinite set $I$ of odd positive integers such that

[TABLE]

for all $s\in I$ .

Let $s\in I$ and lift $\tau$ to a multiplicative character $\tau^{\prime}$ to $\mathbb{F}_{q^{sm}}$ . Then $\tau^{\prime}$ has order $r^{e}$ and Lemma 3.3 implies $G(\tau^{\prime})=G(\tau)^{s}$ , so that

[TABLE]

Now let $\chi$ be a multiplicative character of $\mathbb{F}_{q^{sm}}$ of order $r^{d}$ , where $1\leq d\leq e$ . Then by Lemma 3.4 we have

[TABLE]

which completes the proof. ∎

We are now in a position to deduce the following result, which controls $\widehat{f}_{T}$ and gives our first desired ingredient for the proof of Proposition 2.1.

Proposition 3.6.

Let $e$ be a positive integer and let $p$ be the characteristic of $\mathbb{F}_{q}$ . Suppose that $r>3$ is another prime such that $r\equiv 3\pmod{4}$ and $-p$ is a primitive root modulo $r^{e}$ . Put $v=r^{e}$ and let $\epsilon>0$ . Then there are infinitely many odd multiples $n$ of $\operatorname{ord}_{v}(p)$ such that the function $f_{T}$ satisfies

[TABLE]

for all $a\in\mathbb{F}_{q^{n}}$ and all $b\in\mathbb{F}_{q}$ .

Proof.

Write $m=\phi(v)/2$ and note that $m=\operatorname{ord}_{v}(p)$ . Letting $\epsilon>0$ , Lemma 3.5 implies that there is an infinite set $I$ of odd positive integers such that

[TABLE]

for all $s\in I$ and all nontrivial multiplicative characters $\chi$ of $\mathbb{F}_{q^{sm}}$ of order dividing $v$ . The desired result then follows from Lemma 3.1. ∎

We remark that in Proposition 3.6 the conclusion holds for infinitely many $n$ , which is stronger than what is needed to prove Proposition 2.1.

4. The function $f_{S}$

This section concerns the existence of an appropriate function $f_{S}:S\to\mathbb{F}_{q}$ . We shall use the following result that might be also of independent interest in discrepancy theory.

Theorem 4.1.

Let $K\geq 2$ be an integer and let $\mathfrak{F}$ be a family of $M$ subsets of a finite set $X$ with $\lvert X\rvert=N$ and $M\geq N$ . Then, for all sufficiently large $N$ , there exists a partition $\{Z_{1},Z_{2},\dots,Z_{K}\}$ of $X$ such that

[TABLE]

for each $i\in\{1,2,\dots,K\}$ .

The constant in Theorem 4.1 can certainly be improved by a more careful analysis. We note that Doerr and Srivastav [5, Theorem 3.15] proved a result similar to Theorem 4.1. However, compared to the proof of [5, Theorem 3.15], our proof of Theorem 4.1 is completely different and considerably simpler, although both proofs are based on Lemma 4.3 below.

Before we prove Theorem 4.1, we deduce the following result for the existence of an appropriate function $f_{S}$ , which gives our second desired ingredient for the proof of Proposition 2.1. Recall that $S$ is a subset of $\mathbb{F}_{q^{n}}$ such that $S\setminus\{0\}$ contains at least $1$ and at most $q^{2}-q-1$ cosets of a subgroup of $\mathbb{F}_{q^{n}}^{*}$ of index $v$ . Therefore

[TABLE]

Proposition 4.2.

For fixed $v$ and all sufficiently large $n$ , there is a function $f_{S}:S\to\mathbb{F}_{q}$ such that

[TABLE]

for all $a\in\mathbb{F}_{q^{n}}$ and all $\lambda\in\mathbb{F}_{q}^{*}$ .

Proof.

For each $a\in\mathbb{F}_{q^{n}}$ and each $z\in\mathbb{F}_{q}$ , define

[TABLE]

From Theorem 4.1 we find that, for all sufficiently large $\lvert S\rvert$ , there exists a partition $\{Z_{1},Z_{2},\dots,Z_{q}\}$ of $S$ such that

[TABLE]

for all $a,z,k$ . Henceforth suppose that $\lvert S\rvert$ is large enough so that this last estimate holds. For $\mathbb{F}_{q}=\{z_{1},z_{2},\dots,z_{q}\}$ , define $f_{S}:S\to\mathbb{F}_{q}$ by $f_{S}(y)=z_{k}$ for $y\in Z_{k}$ . Let $\eta$ be the canonical additive character of $\mathbb{F}_{q}$ and let $\lambda\in\mathbb{F}_{q}^{*}$ . From (14) we find that

[TABLE]

for all $a,z$ . Since $\sum_{c\in\mathbb{F}_{q}}\eta(\lambda c)=0$ , we obtain

[TABLE]

for all $a,z$ . We have

[TABLE]

using (7) and (13). Therefore by the triangle inequality and (15) we obtain

[TABLE]

and using (12), we can obtain the required estimate. ∎

In the remainder of this section we prove Theorem 4.1. We need a classical result from discrepancy theory due to Spencer [18], which we quote in the following specialised form.

Lemma 4.3 ([18, Theorem 7]).

Let $\mathfrak{F}$ be a family of $M$ subsets of a finite set $X$ with $\lvert X\rvert=N$ and $M\geq N$ and let $\delta$ be a real number. Then, for all sufficiently large $N$ , there exists $h:X\to\{-\delta,\delta\}$ such that

[TABLE]

We shall deduce the following result from Lemma 4.3 using an idea of Beck [1].

Lemma 4.4.

Let $\mathfrak{F}$ be a family of $M$ subsets of a finite set $X$ with $\lvert X\rvert=N$ and $M\geq N$ and let $\theta\in[0,1]$ . Then, for all sufficiently large $N$ , there exists a subset $Z$ of $X$ such that

[TABLE]

Proof.

We may assume that $\theta\in[0,\tfrac{1}{2}]$ ; otherwise we replace $Z$ by its complement in $X$ . The case $\theta=0$ is trivial since we can take $Z$ to be the empty set.

Now assume first that $\theta=\tfrac{1}{2}$ . Let $h:X\to\{-1,1\}$ be a function identified in Lemma 4.3 for $\delta=1$ . Put

[TABLE]

Then by Lemma 4.3 we have, for all sufficiently large $N$ ,

[TABLE]

and so

[TABLE]

as required.

Henceforth assume that $\theta\in(0,\tfrac{1}{2})$ . Let $\alpha$ be a real number such that

[TABLE]

and let $\Delta$ be the triangle with vertices

[TABLE]

The triangle $\Delta$ can be decomposed into four triangles that are congruent to $2^{-1}\Delta$ . By iterating this decomposition, we have the chain of partitions

[TABLE]

where, for each $i\in\{1,2,\dots,4^{k}\}$ , the triangle $\Delta(k,i)$ is congruent to $2^{-k}\Delta$ . Let $t$ be a natural number to be determined later. Then we have

[TABLE]

for some sequence $i_{1},i_{2},\dots,i_{t}$ . It will be convenient to write $\Delta=\Delta(0,1)$ and $i_{0}=1$ .

We now construct functions $h_{0},h_{1},\dots,h_{t}:X\to\mathbb{C}$ such that $h_{k}(y)$ is a vertex of $\Delta(k,i_{k})$ for each $y\in X$ . For each $y\in X$ , let $h_{t}(y)$ be a vertex of the small triangle $\Delta(t,i_{t})$ with minimum absolute value. Since the diameter of $\Delta$ ist at most $2$ , the diameter of $\Delta(t,i_{t})$ ist at most $2^{-t+1}$ , and so we have

[TABLE]

for each $y\in X$ . Therefore

[TABLE]

Now let $k\in\{1,2,\dots,t\}$ and suppose that $h_{k}(y)$ is a vertex of $\Delta(k,i_{k})$ for each $y\in X$ . Then, for each $y\in X$ , the point $h_{k}(y)$ is either a vertex of $\Delta(k-1,i_{k-1})$ or is a midpoint between two vertices of $\Delta(k-1,i_{k-1})$ . We set $h_{k-1}(y)=h_{k}(y)$ for all $y\in X$ , except for those $y\in X$ corresponding to the latter case. The remaining values of $h_{k-1}(y)$ are rounded to one of the neighbouring vertices of $\Delta(k-1,i_{k-1})$ using Lemma 4.3. Since the diameter of $\Delta(k-1,i_{k-1})$ is at most $2^{-k+2}$ , we have for all sufficiently large $N$ ,

[TABLE]

Hence by the triangle inequality we have, for all sufficiently large $N$ ,

[TABLE]

Applying the triangle inequality once more, we obtain from (16), for all sufficiently large $N$ ,

[TABLE]

by choosing $t$ large enough. Now $h_{0}(y)$ is a vertex of $\Delta$ for each $y\in X$ . Put

[TABLE]

Let $Y\in\mathfrak{F}$ be fixed and assume that $N$ is large enough, so that (18) holds. By considering the real part of the summation on the left hand side of (18), we obtain

[TABLE]

Equivalently we have

[TABLE]

Since $\cos\alpha<0$ and

[TABLE]

we conclude that $Z$ has the required property. ∎

It remains to prove Theorem 4.1.

Proof of Theorem 4.1.

It will be useful to work with the family of subsets $\mathfrak{F}^{\prime}=\mathfrak{F}\cup\{X\}$ of $X$ , so that $\lvert\mathfrak{F}^{\prime}\rvert\leq M+1$ .

First apply Lemma 4.4 with $\theta=(1/K)\lfloor K/2\rfloor$ to infer the existence of a subset $A$ of $X$ such that $A$ intersects each $Y\in\mathfrak{F}^{\prime}$ in roughly $\theta\lvert Y\rvert$ elements. Then the complement $B$ of $A$ intersects each $Y\in\mathfrak{F}^{\prime}$ in roughly $(1-\theta)\lvert Y\rvert$ elements. The problem is now reduced because it remains to partition $A$ into $\lfloor K/2\rfloor$ subsets and $B$ into $\lceil K/2\rceil$ subsets. If necessary, we apply Lemma 4.4 to the families of subsets $\mathfrak{F}^{\prime}$ restricted to $A$ and $B$ and then proceed iteratively, so that in each step Lemma 4.4 is applied with some $\theta\in[1/3,1/2]$ , until we obtain a partition $\{Z_{1},Z_{2},\dots,Z_{K}\}$ of $X$ such that each $Z_{i}$ intersects each $Y\in\mathfrak{F}^{\prime}$ in roughly $\lvert Y\rvert/K$ elements.

We now give a quantitative analysis. For every $Z\in\{Z_{1},Z_{2},\dots,Z_{K}\}$ , there are subsets $W_{0},W_{1},\dots,W_{s}$ (with $K\leq 2^{s}<2K$ ) of $X$ satisfying

[TABLE]

and numbers $\mu_{1},\dots,\mu_{s}\in[1/3,2/3]$ satisfying $\mu_{1}\cdots\mu_{s}=1/K$ such that

[TABLE]

for each $i\in\{1,2,\dots,s\}$ , each $Y\in\mathfrak{F}^{\prime}$ , and all sufficiently large $N$ . By the triangle inequality we have

[TABLE]

for each $j\in\{1,2,\dots,s\}$ and each $Y\in\mathfrak{F}^{\prime}$ . In particular, by taking $Y=X$ we obtain from (19) and (20) that

[TABLE]

for each $j\in\{1,2,\dots,s-1\}$ and all sufficiently large $N$ (with room to spare). Since $\lvert W_{0}\rvert=N$ , these estimates also hold for $j=0$ , and so substitution into (19) gives

[TABLE]

for each $i\in\{1,2,\dots,s\}$ , each $Y\in\mathfrak{F}^{\prime}$ , and all sufficiently large $N$ . From (20) with $j=s$ we then find that

[TABLE]

for all $Y\in\mathfrak{F}^{\prime}$ and all sufficiently large $N$ , where we have used that $1/3\leq\mu_{i}\leq 2/3$ for all $i$ . The series equals $\sqrt{3}/(\sqrt{3}-\sqrt{2})$ , from which the claimed bound can be obtained. ∎

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. Beck. Flat polynomials on the unit circle—note on a problem of Littlewood. Bull. London Math. Soc. , 23(3):269–277, 1991.
2[2] G. Cohen, I. Honkala, S. Litsyn, and A. Lobstein. Covering codes , volume 54 of North-Holland Mathematical Library . North-Holland Publishing Co., Amsterdam, 1997.
3[3] M. D. Coleman. On the equation b 1 p − b 2 P 2 = b 3 subscript 𝑏 1 𝑝 subscript 𝑏 2 subscript 𝑃 2 subscript 𝑏 3 b_{1}p-b_{2}P_{2}=b_{3} . J. Reine Angew. Math. , 403:1–66, 1990.
4[4] J. F. Dillon. Elementary Hadamard difference sets . Pro Quest LLC, Ann Arbor, MI, 1974. Thesis (Ph.D.)–University of Maryland, College Park.
5[5] B. Doerr and A. Srivastav. Multicolour discrepancies. Combin. Probab. Comput. , 12(4):365–399, 2003.
6[6] T. Helleseth, T. Kløve, and J. Mykkeltveit. On the covering radius of binary codes. IEEE Trans. Inform. Theory , 24(5):627–628, 1978.
7[7] T. Kasami, S. Lin, and W. W. Peterson. New generalizations of the Reed-Muller codes. I. Primitive codes. IEEE Trans. Inform. Theory , IT-14:189–199, 1968.
8[8] S. Kavut and M. D. Yücel. 9-variable Boolean functions with nonlinearity 242 in the generalized rotation symmetric class. Inform. and Comput. , 208(4):341–350, 2010.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Highly nonlinear functions over finite fields

Abstract.

2010 Mathematics Subject Classification:

1. Introduction and results

Theorem 1.1**.**

Corollary 1.2**.**

Theorem 1.3**.**

2. Proof overview

Proposition 2.1**.**

Remark**.**

Proposition 2.2** ([12, Theorem 1.3]).**

Lemma 2.3**.**

Proof.

3. The function fTf_{T}fT​

Lemma 3.1**.**

Proof.

Lemma 3.2** ([9, Proposition 4.2]).**

Lemma 3.3** ([11, Theorem 5.14]).**

Lemma 3.4**.**

Proof.

Lemma 3.5**.**

Proof.

Proposition 3.6**.**

Proof.

4. The function fSf_{S}fS​

Theorem 4.1**.**

Proposition 4.2**.**

Proof.

Lemma 4.3** ([18, Theorem 7]).**

Lemma 4.4**.**

Proof.

Proof of Theorem 4.1.

Theorem 1.1.

Corollary 1.2.

Theorem 1.3.

Proposition 2.1.

Remark.

Proposition 2.2 ([12, Theorem 1.3]).

Lemma 2.3.

3. The function $f_{T}$

Lemma 3.1.

Lemma 3.2 ([9, Proposition 4.2]).

Lemma 3.3 ([11, Theorem 5.14]).

Lemma 3.4.

Lemma 3.5.

Proposition 3.6.

4. The function $f_{S}$

Theorem 4.1.

Proposition 4.2.

Lemma 4.3 ([18, Theorem 7]).

Lemma 4.4.