Highly nonlinear functions over finite fields
Kai-Uwe Schmidt

TL;DR
This paper proves a long-standing conjecture about the maximum Hamming distance of functions from finite fields to finite fields, extending previous results and determining the asymptotic behavior of Reed-Muller codes.
Contribution
It generalizes the Patterson-Wiedemann conjecture to all finite fields, using advanced number theory and probabilistic methods, and determines the asymptotic covering radius.
Findings
Proves the conjecture for most finite fields unconditionally.
Establishes the asymptotic maximum distance for functions as dimension grows.
Determines the asymptotic covering radius of Reed-Muller codes.
Abstract
We consider a generalisation of a conjecture by Patterson and Wiedemann from 1983 on the Hamming distance of a function from to to the set of affine functions from to . We prove the conjecture for each such that the characteristic of lies in a subset of the primes with density and we prove the conjecture for all by assuming the generalised Riemann hypothesis. Roughly speaking, we show the existence of functions for which the distance to the affine functions is maximised when tends to infinity. This also determines the asymptotic behaviour of the covering radius of the Reed-Muller code over and so answers a question raised by Leducq in 2013. Our results extend the case , which was recently proved by the author and which corresponds to the original conjecture by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Highly nonlinear functions over finite fields
Kai-Uwe Schmidt
Department of Mathematics, Paderborn University, Warburger Str. 100, 33098 Paderborn, Germany.
(Date: 16 September 2019)
Abstract.
We consider a generalisation of a conjecture by Patterson and Wiedemann from 1983 on the Hamming distance of a function from to to the set of affine functions from to . We prove the conjecture for each such that the characteristic of lies in a subset of the primes with density and we prove the conjecture for all by assuming the generalised Riemann hypothesis. Roughly speaking, we show the existence of functions for which the distance to the affine functions is maximised when tends to infinity. This also determines the asymptotic behaviour of the covering radius of the Reed-Muller code over and so answers a question raised by Leducq in 2013. Our results extend the case , which was recently proved by the author and which corresponds to the original conjecture by Patterson and Wiedemann. Our proof combines evaluations of Gauss sums in the semiprimitive case, probabilistic arguments, and methods from discrepancy theory.
2010 Mathematics Subject Classification:
Primary: 05D40; Secondary: 94B05
1. Introduction and results
The Hamming distance of two functions is
[TABLE]
We define the nonlinearity of to be
[TABLE]
where the minimum is over all affine functions from to . We are interested in functions with largest nonlinearity. Accordingly define to be the maximum of over all functions from to .
The number equals the covering radius of binary Reed-Muller code of order one [6] and in general is the covering radius of the appropriate generalisation over [10]. The determination of the covering radius of appears to be one of the most mysterious problems in coding theory [17], [10]. We refer to [7] for background on Reed-Muller codes over and to [2] for background on the covering radius of codes in general and its combinatorial and geometric significance.
It is convenient to use the normalisation
[TABLE]
It is known that
[TABLE]
for all prime powers and all positive integers and . This was proved in [6] for and in [10, Proposition 11 and Lemma 19] for all . It is not difficult to see that and so for all even , as shown in [6, Corollary 1] for and [10, Corollary 13] for all .
We are interested in the case that is odd. It is readily verified [10, p. 1594] that and therefore
[TABLE]
for all prime powers and all positive integers . It is known that for each [13]. Patterson and Wiedemann [15] improved the upper bound in (3) for to
[TABLE]
and, more recently, Kavut and Yücel [8] showed that
[TABLE]
A famous conjecture by Patterson and Wiedemann [15] asserts that
[TABLE]
and this conjecture was recently proved in [16].
This paper concerns the case that . Leducq [10] herself was able to improve the upper bound in (3) for , by showing that and so
[TABLE]
This suggests that for a similar phenomenon occurs as in the case and indeed we prove a corresponding result for many values of .
Theorem 1.1**.**
Let be a power of a prime and suppose that there is another prime such that and is a primitive root modulo . Then .
We list possible primes satisfying the conditions of Theorem 1.1 for the first 15 primes :
[TABLE]
For each prime , there are primitive roots modulo and by Dirichlet’s theorem on primes in arithmetic progressions, each of the corresponding congruence classes modulo contains a fraction of of all primes. Hence, by taking a prime with , the condition of Theorem 1.1 is satisfied for all in a subset of the primes with density
[TABLE]
where is Euler’s totient function. For example, for of all primes , we can take in Theorem 1.1.
It is known from [20] and [3] that there are infinitely many primes of the form , where is an odd number with at most three prime factors. Let be the -th prime of this form. By the Chinese Remainder Theorem, the density of primes such that the condition in Theorem 1.1 is satisfied for one of the primes is , where can be recursively defined by and
[TABLE]
for all . Since has a bounded number of prime factors, is bounded from below by some positive number and hence we have . We therefore obtain the following corollary of Theorem 1.1.
Corollary 1.2**.**
We have for all powers of a prime lying in a subset of the primes with density .
We shall see that the conclusion of Corollary 1.2 can be proved for all prime powers if one can show that, for each prime , there are infinitely many primes such that is a primitive root modulo . This is known to be true conditionally under the Generalised Riemann Hypothesis (GRH) and gives the following result.
Theorem 1.3**.**
Assume GRH. Then we have for all prime powers .
For the proof of our results we use a semiprobabilistic construction. We present this construction in the next section (Proposition 2.1) and then show how our main results follow from this result. The proof that this construction gives the desired properties uses methods from number theory and discrepancy theory and the details are contained in Sections 3 and 4. The overall structure of the proof is based on the idea of [16] to prove Theorem 1.1 for . However, in the general case, several additional ideas are crucially involved.
2. Proof overview
For a function , we define the normalisation
[TABLE]
where is the nonlinearity of , given in (1). Hence
[TABLE]
where the minimum is over all functions from to . For every , we shall identify functions , which satisfy when is sufficiently large. The construction is semiprobabilistic; it mimics the partial spread construction of so-called bent functions [4], but leaves some freedom, which will bring in probabilistic methods in the proof of our main results.
Henceforth we identify with the field . Let be a (multiplicative) subgroup of of index . Let be a union of
[TABLE]
cosets of such that, if the coset is contained in , then the coset is contained in for each . Put . Note that is not divisible by and so is a union of at least and at most cosets of . We consider functions of the form
[TABLE]
where is a function from to and is a function from to . The function is defined such that takes on every value of equally often and such that
[TABLE]
That is, is constant on the cosets of and also constant on the cosets of . The function will be determined later.
Recall that for integers and with and is the smallest positive integer such that . Note that, if we fix , then for every multiple of , there exists a subgroup of of index . In particular, if is the characteristic of , then divides , and so such a subgroup exists for every multiple of .
Proposition 2.1**.**
Let be a positive integer, let be the characteristic of , and suppose that is another prime such that and is a primitive root modulo . Put . Then there is an odd multiple of and a function such that the function defined in (5) satisfies
[TABLE]
Remark**.**
With the notation as in Proposition 2.1, we have that is a nonsquare modulo , which implies that
[TABLE]
Hence is odd. Therefore is a function on an extension of of odd degree.
Before we prove Proposition 2.1 we shall first deduce Theorems 1.1 and 1.3 from Proposition 2.1. Recall from elementary number theory (see [14, p. 102], for example) that the condition in Theorem 1.1 implies that is a primitive root modulo for all positive integers . We can therefore take , and hence , in Proposition 2.1 arbitrarily large. Using (2) and , we then obtain Theorem 1.1.
To deduce Theorem 1.3, we use the following special case of a result by Moree [12].
Proposition 2.2** ([12, Theorem 1.3]).**
Assume GRH. Let be a prime. Then the density of primes such that is a primitive root modulo is
[TABLE]
for odd and for , where
[TABLE]
is Artin’s constant
Now for fixed , Proposition 2.2 implies, conditional on GRH, the existence of infinitely many primes for which we can apply Proposition 2.1 with . Using again (2) and , we then obtain Theorem 1.3.
To prove Proposition 2.1, we shall turn the problem of estimating the nonlinearity of a function into a problem of estimating certain character sums. Recall that, for a finite field extension , the trace function is given by
[TABLE]
for each . We define and to be the canonical additive characters of and , respectively. Denoting by the characteristic of , we have
[TABLE]
for each and
[TABLE]
for each .
The Fourier transform of a function is defined to be the function given by
[TABLE]
for each and each .
The following lemma gives the relationship between the nonlinearity of a function and its Fourier transform.
Lemma 2.3**.**
For every function we have
[TABLE]
Proof.
For every , we have
[TABLE]
Therefore, for every function , we have
[TABLE]
Now notice that the affine functions from to are precisely the functions for and , given by
[TABLE]
Therefore
[TABLE]
and the lemma follows from the definition (1) of the nonlinearity of and the normalisation (4). ∎
The strategy for our proof of Proposition 2.1 is to apply Lemma 2.3 to the function appearing in Proposition 2.1. We then bound the contributions to coming from and separately. Accordingly we define
[TABLE]
so that for all and all . Proposition 2.1 will then follow in a straightforward way from Lemma 2.3 and the forthcoming Propositions 3.6 and 4.2.
3. The function
Recall that is a subgroup of of index and is a union of cosets of and also a union of cosets of . By definition, the function takes on every value of equally often and is constant on cosets of and constant on cosets of , as given in (6).
For a multiplicative character of , the Gauss sum is defined to be
[TABLE]
where as before is the canonical additive character of . It is well known that if is nontrivial (which means that for some ) [11, Theorem 5.11].
Our starting point for the analysis of is the following lemma.
Lemma 3.1**.**
Let and suppose that, for all nontrivial multiplicative characters of of order dividing , we have
[TABLE]
Then we have
[TABLE]
for all and all .
Proof.
Since takes on every value of equally often, we have for each . Hence we may assume that . Let be a set of representatives of the cosets of belonging to . For the moment fix . Then we have
[TABLE]
where is the indicator of on , so that
[TABLE]
Let be a multiplicative character of of order . Then
[TABLE]
and for all we have
[TABLE]
Substitute into (9) to obtain
[TABLE]
Now write , so that for all by our assumption. Since and so
[TABLE]
by the definition of , we obtain
[TABLE]
where
[TABLE]
From (10) we find that
[TABLE]
Since is constant on cosets of by definition (6), we find that
[TABLE]
Since if and only if and since is constant on cosets of by definition (6), we obtain
[TABLE]
Hence, for all , we have
[TABLE]
On the other hand, by the triangle inequality we can bound by for all and therefore obtain by the triangle inequality
[TABLE]
as required. ∎
The following explicit evaluation of certain Gauss sums [9, Proposition 4.2] (see also [21, Theorem 4.1]) will help us to control the error term in Lemma 3.1.
Lemma 3.2** ([9, Proposition 4.2]).**
Let be a positive integer, let be a prime, and suppose that is another prime such that and is a primitive root modulo . Write , let be a multiplicative character of of order , and let be the class number of . Then
[TABLE]
*where and are integers satisfying , , and . *
Recall that for a finite field extension , the norm function is defined by
[TABLE]
for each . Every multiplicative character of can be lifted to a multiplicative character of by defining
[TABLE]
for each . Note that, if is a divisor of , then this lifting is an isomorphism between the character subgroups of order of and .
The well known Davenport-Hasse Theorem gives the relationship between the two Gauss sums and .
Lemma 3.3** ([11, Theorem 5.14]).**
Let be a multiplicative character of and suppose that is lifted to a multiplicative character of . Then
[TABLE]
Now we obtain the following lemma as a corollary to Lemma 3.2.
Lemma 3.4**.**
Let and be integers satisfying and let be the characteristic of . Suppose that is another prime such that and is a primitive root modulo . Write and and let be the class number of . Then there are nonzero integers and such that
[TABLE]
for all multiplicative characters of of order , where the sign can depend on .
Proof.
Note that is also a primitive root modulo . Write and let be the multiplicative character of of order such that is the lifted character of . Lemma 3.2 implies that there are nonzero integers and such that
[TABLE]
where the sign can depend on . By Lemma 3.3 we have
[TABLE]
and the lemma follows since . ∎
The next lemma gives the desired control for the error term in Lemma 3.1.
Lemma 3.5**.**
Let be a positive integer and let be the characteristic of . Suppose that is another prime such that and is a primitive root modulo . Write and let . Then there is an infinite set of odd positive integers such that, for all and all nontrivial multiplicative characters of of order dividing , we have
[TABLE]
Here, is the principal angle of a nonzero complex number .
Proof.
Let be a multiplicative character of of order . Since , the units in the ring of algebraic integers of are , so that are the only roots of unity in . It then follows from Lemma 3.4 that is not a root of unity. Therefore Weyl’s uniform distribution theorem [19, Satz 2] implies that , and therefore also is uniformly distributed on the complex unit circle. Hence there is an infinite set of odd positive integers such that
[TABLE]
for all .
Let and lift to a multiplicative character to . Then has order and Lemma 3.3 implies , so that
[TABLE]
Now let be a multiplicative character of of order , where . Then by Lemma 3.4 we have
[TABLE]
which completes the proof. ∎
We are now in a position to deduce the following result, which controls and gives our first desired ingredient for the proof of Proposition 2.1.
Proposition 3.6**.**
Let be a positive integer and let be the characteristic of . Suppose that is another prime such that and is a primitive root modulo . Put and let . Then there are infinitely many odd multiples of such that the function satisfies
[TABLE]
for all and all .
Proof.
Write and note that . Letting , Lemma 3.5 implies that there is an infinite set of odd positive integers such that
[TABLE]
for all and all nontrivial multiplicative characters of of order dividing . The desired result then follows from Lemma 3.1. ∎
We remark that in Proposition 3.6 the conclusion holds for infinitely many , which is stronger than what is needed to prove Proposition 2.1.
4. The function
This section concerns the existence of an appropriate function . We shall use the following result that might be also of independent interest in discrepancy theory.
Theorem 4.1**.**
Let be an integer and let be a family of subsets of a finite set with and . Then, for all sufficiently large , there exists a partition of such that
[TABLE]
for each .
The constant in Theorem 4.1 can certainly be improved by a more careful analysis. We note that Doerr and Srivastav [5, Theorem 3.15] proved a result similar to Theorem 4.1. However, compared to the proof of [5, Theorem 3.15], our proof of Theorem 4.1 is completely different and considerably simpler, although both proofs are based on Lemma 4.3 below.
Before we prove Theorem 4.1, we deduce the following result for the existence of an appropriate function , which gives our second desired ingredient for the proof of Proposition 2.1. Recall that is a subset of such that contains at least and at most cosets of a subgroup of of index . Therefore
[TABLE]
Proposition 4.2**.**
For fixed and all sufficiently large , there is a function such that
[TABLE]
for all and all .
Proof.
For each and each , define
[TABLE]
From Theorem 4.1 we find that, for all sufficiently large , there exists a partition of such that
[TABLE]
for all . Henceforth suppose that is large enough so that this last estimate holds. For , define by for . Let be the canonical additive character of and let . From (14) we find that
[TABLE]
for all . Since , we obtain
[TABLE]
for all . We have
[TABLE]
using (7) and (13). Therefore by the triangle inequality and (15) we obtain
[TABLE]
and using (12), we can obtain the required estimate. ∎
In the remainder of this section we prove Theorem 4.1. We need a classical result from discrepancy theory due to Spencer [18], which we quote in the following specialised form.
Lemma 4.3** ([18, Theorem 7]).**
Let be a family of subsets of a finite set with and and let be a real number. Then, for all sufficiently large , there exists such that
[TABLE]
We shall deduce the following result from Lemma 4.3 using an idea of Beck [1].
Lemma 4.4**.**
Let be a family of subsets of a finite set with and and let . Then, for all sufficiently large , there exists a subset of such that
[TABLE]
Proof.
We may assume that ; otherwise we replace by its complement in . The case is trivial since we can take to be the empty set.
Now assume first that . Let be a function identified in Lemma 4.3 for . Put
[TABLE]
Then by Lemma 4.3 we have, for all sufficiently large ,
[TABLE]
and so
[TABLE]
as required.
Henceforth assume that . Let be a real number such that
[TABLE]
and let be the triangle with vertices
[TABLE]
The triangle can be decomposed into four triangles that are congruent to . By iterating this decomposition, we have the chain of partitions
[TABLE]
where, for each , the triangle is congruent to . Let be a natural number to be determined later. Then we have
[TABLE]
for some sequence . It will be convenient to write and .
We now construct functions such that is a vertex of for each . For each , let be a vertex of the small triangle with minimum absolute value. Since the diameter of ist at most , the diameter of ist at most , and so we have
[TABLE]
for each . Therefore
[TABLE]
Now let and suppose that is a vertex of for each . Then, for each , the point is either a vertex of or is a midpoint between two vertices of . We set for all , except for those corresponding to the latter case. The remaining values of are rounded to one of the neighbouring vertices of using Lemma 4.3. Since the diameter of is at most , we have for all sufficiently large ,
[TABLE]
Hence by the triangle inequality we have, for all sufficiently large ,
[TABLE]
Applying the triangle inequality once more, we obtain from (16), for all sufficiently large ,
[TABLE]
by choosing large enough. Now is a vertex of for each . Put
[TABLE]
Let be fixed and assume that is large enough, so that (18) holds. By considering the real part of the summation on the left hand side of (18), we obtain
[TABLE]
Equivalently we have
[TABLE]
Since and
[TABLE]
we conclude that has the required property. ∎
It remains to prove Theorem 4.1.
Proof of Theorem 4.1.
It will be useful to work with the family of subsets of , so that .
First apply Lemma 4.4 with to infer the existence of a subset of such that intersects each in roughly elements. Then the complement of intersects each in roughly elements. The problem is now reduced because it remains to partition into subsets and into subsets. If necessary, we apply Lemma 4.4 to the families of subsets restricted to and and then proceed iteratively, so that in each step Lemma 4.4 is applied with some , until we obtain a partition of such that each intersects each in roughly elements.
We now give a quantitative analysis. For every , there are subsets (with ) of satisfying
[TABLE]
and numbers satisfying such that
[TABLE]
for each , each , and all sufficiently large . By the triangle inequality we have
[TABLE]
for each and each . In particular, by taking we obtain from (19) and (20) that
[TABLE]
for each and all sufficiently large (with room to spare). Since , these estimates also hold for , and so substitution into (19) gives
[TABLE]
for each , each , and all sufficiently large . From (20) with we then find that
[TABLE]
for all and all sufficiently large , where we have used that for all . The series equals , from which the claimed bound can be obtained. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Beck. Flat polynomials on the unit circle—note on a problem of Littlewood. Bull. London Math. Soc. , 23(3):269–277, 1991.
- 2[2] G. Cohen, I. Honkala, S. Litsyn, and A. Lobstein. Covering codes , volume 54 of North-Holland Mathematical Library . North-Holland Publishing Co., Amsterdam, 1997.
- 3[3] M. D. Coleman. On the equation b 1 p − b 2 P 2 = b 3 subscript 𝑏 1 𝑝 subscript 𝑏 2 subscript 𝑃 2 subscript 𝑏 3 b_{1}p-b_{2}P_{2}=b_{3} . J. Reine Angew. Math. , 403:1–66, 1990.
- 4[4] J. F. Dillon. Elementary Hadamard difference sets . Pro Quest LLC, Ann Arbor, MI, 1974. Thesis (Ph.D.)–University of Maryland, College Park.
- 5[5] B. Doerr and A. Srivastav. Multicolour discrepancies. Combin. Probab. Comput. , 12(4):365–399, 2003.
- 6[6] T. Helleseth, T. Kløve, and J. Mykkeltveit. On the covering radius of binary codes. IEEE Trans. Inform. Theory , 24(5):627–628, 1978.
- 7[7] T. Kasami, S. Lin, and W. W. Peterson. New generalizations of the Reed-Muller codes. I. Primitive codes. IEEE Trans. Inform. Theory , IT-14:189–199, 1968.
- 8[8] S. Kavut and M. D. Yücel. 9-variable Boolean functions with nonlinearity 242 in the generalized rotation symmetric class. Inform. and Comput. , 208(4):341–350, 2010.
