Exceptional biases in counting primes over functions fields
Alexandre Bailleul, Lucile Devin, Daniel Keliher, Wanlin Li

TL;DR
This paper investigates the rarity of certain biases in prime distributions over function fields, showing they become negligible as the size of the finite field grows, using advanced sieve and geometric methods.
Contribution
It introduces new bounds demonstrating the vanishing probability of three types of prime biases in large finite fields, improving previous results by Kowalski.
Findings
Biases occur with probability tending to zero as q increases
New bounds improve upon Kowalski's earlier results
Uses advanced sieve methods and arithmetic geometry techniques
Abstract
We study how often exceptional configurations of irreducible polynomials over finite fields occur in the context of prime number races and Chebyshev's bias. In particular, we show that three types of biases, which we call "complete bias", "lower order bias" and "reversed bias", occur with probability going to zero among the family of all squarefree monic polynomials of a given degree in as , a power of a fixed prime, goes to infinity. The bounds given improve on a previous result of Kowalski, who studied a similar question along particular -parameter families of reducible polynomials. The tools used are the large sieve for Frobenius developed by Kowalski, an improvement of it due to Perret-Gentil and considerations from the theory of linear recurrence sequences and arithmetic geometry.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCoding theory and cryptography · Analytic Number Theory Research · Algebraic Geometry and Number Theory
Exceptional biases in counting primes over function fields
Alexandre Bailleul
ENS Paris-Saclay, Centre Borelli, UMR 9010, 91190 Gif-sur-Yvette, France
,
Lucile Devin
Univ. Littoral Côte d’Opale, UR 2597 LMPA, Laboratoire de Mathématiques Pures et Appliquées Joseph Liouville, F-62100 Calais, France
,
Daniel Keliher
University of Georgia, Department of Mathematics, 200 D. W. Brooks Drive, Athens, GA 30602, USA
and
Wanlin Li
Washington University in St. Louis, Department of Mathematics and Statistics, One Brookings Drive, St. Louis, MO 63130, USA
Abstract.
We study how often exceptional configurations of irreducible polynomials over finite fields occur in the context of prime number races and Chebyshev’s bias. In particular, we show that three types of biases, which we call “complete bias”, “lower order bias” and “reversed bias”, occur with probability going to zero among the family of all squarefree monic polynomials of a given degree in as , a power of a fixed prime, goes to infinity. The bounds given improve on a previous result of Kowalski, who studied a similar question along particular 1-parameter families of reducible polynomials. The tools used are the large sieve for Frobenius developed by Kowalski, an improvement of it due to Perret-Gentil and considerations from the theory of linear recurrence sequences and arithmetic geometry.
1. Introduction
Chebyshev’s bias is the phenomenon that there are more prime numbers of the form than of the form in initial intervals of for most values of (more precisely, the set of such admits a logarithmic density of around ). More generally, primes which are congruent to a fixed non-square residue class modulo an integer are more numerous than those which are congruent to a given square residue class modulo in initial intervals of . The origin of this phenomenon was explained by Rubinstein and Sarnak in [RS].
The analogue of Chebyshev’s bias over function fields was first considered by Cha in [Cha2008] to study inequities in the distribution of irreducible polynomials in residue classes of , and later by Cha and Im in [ChaIm2011] in function field extensions. As in the classical archimedean case of [RS], a central hypothesis is a linear independence hypothesis which will be called throughout. If the arguments of the non-trivial inverse zeros (of non-negative imaginary parts) of the underlying -functions are of the form , then claims that the ’s, together with , are linearly independent over . A consequence of is that Chebyshev’s bias favours non-square residue classes rather than square residue classes in the distribution of primes. See [RS]*page 185 (where it is called ) for the archimedean case, and [Cha2008]*page 1366 for the function field case. For a survey on prime number races over , see [GranvilleMartin].
Over and number fields, exceptional biases have been studied in the literature, notably in a series of papers by Ford and Konyagin [Ford_Konyagin_2002, Ford_Konyagin_2003] and [FordKonyaginLamzouri]. Fiorilli and Martin [FiorilliMartin], under both the Generalized Riemann Hypothesis and , list the largest possible biases in the prime number race between quadratic residues and non-quadratic residues. In number field extensions, Bailleul produced infinite families of examples exhibiting a reversed bias in [Bailleul1], conditionally on a suitable linear independence hypothesis. As for unconditional results, Fiorilli and Jouve constructed infinite families exhibiting a complete bias in [FiorilliJouve].
The state of affairs in the function field setting is rather different. For instance, over , there are a few known counterexamples to (see [Cha2008]*Section 5, [DevinMeng]*Section 3, [Dupuyetal]*Section 7, [Sedrati]*Section 10), which can lead to what we call “exceptional biases”, for example favouring square residue classes rather than non-square residue classes (“reversed bias”), or having more non-square residue classes than square residue classes 100% of the time (“complete bias”). In [CFJ2016], Cha, Fiorilli and Jouve give examples of exceptional biases in Mazur’s race related to counting points on elliptic curves. They prove also the genericity of for certain families in this context in [CFJ2017].
In this paper, we investigate three types of exceptional biases. For those types of biases, we establish more precise necessary conditions than negation of for them to hold, and we show that they happen very rarely.
In order to state our results more precisely, we need to introduce some notation. When is a power of a prime and , we let
[TABLE]
For , let denote the unique primitive quadratic character modulo , and
[TABLE]
Note that when is irreducible then this is, up to a positive factor, the difference between the number of irreducible square residues modulo of degree and those which are non-square residues. We also denote by the hyperelliptic curve defined over as the smooth projective model of the curve with affine equation .
In [Kowalski2010], Kowalski showed that, in a precise quantitative sense (see formula (1.1) below), the hypothesis is generically true for the zeta functions of hyperelliptic curves of the form , where of even degree is fixed and is a parameter such that , as . This implies that for most of the parameters , the counting function is biased towards negative values and changes sign infinitely many times. This behavior is expected to hold for generically among because of .
Our main results are the following four bounds, which improve Kowalski’s result. The terms “complete bias”, “lower order bias”, and “reversed bias” are defined, respectively, in Definitions 2.2, 2.4, and 2.6.
Theorem 1.1**.**
Let be an odd prime number, a power of and . We write and .
- (1)
We have
[TABLE]
where . 2. (2)
If is a square then, we have
[TABLE]
and otherwise. 3. (3)
We have
[TABLE] 4. (4)
We have
[TABLE]
where .
To prove this theorem, we follow closely Kowalski’s method based on the large sieve for Frobenius developed in [Kowalskibook] (and improved by Perret-Gentil [Perret-GentilANT]). The theorem above should be compared to Kowalski’s bound (1.1), which we now state.
Theorem 1.2** ([Kowalski2010]*Proposition 1.1).**
Let be an integer, and let be a squarefree monic polynomial of degree . Let be an odd prime such that does not divide the discriminant of , and let be the open subset of the affine -line where . Consider the algebraic family of smooth projective hyperelliptic curves of genus given as the smooth projective models of the curves with affine equations
[TABLE]
Then for any extension we have
[TABLE]
where and .
Remark 1.3**.**
The bound stated in [Kowalski2010] is a bit larger, the exponent of is simply , but Kowalski gave this better exponent in [Kowalskibook]*Theorem 8.15, for the more general condition that the Galois group of the zeta function of is not maximal. It is indeed more general since if there exists a non-trivial linear relation between and the arguments of the roots of the zeta function, hence a multiplicative relation between those roots, then its Galois group is not maximal since this relation cannot be preserved by every allowed permutations of the roots. However, note there is a typo in the bound stated in [Kowalskibook]*Theorem 8.15: the exponent there reads with , coming from the larger contribution of p.181, but we can actually only get . The count is detailed in [Kowalski2006]*Lemma 7.3 iii) but the author is counting each symplectic polynomial with a given factorization twice, hence a missing factor. The proof of Lemma 7.7 fixes this.
The bounds in Theorem 1.1 improve Kowalski’s bound (1.1) in two aspects. First, the space of parameters is larger than Kowalski’s. While he obtains his bound for families of polynomials of a very specific shape, our bound applies to all monic squarefree polynomials of a given degree. It should be noted that our method would allow us to prove the same bounds as in Theorem 1.1 but along Kowalski’s family of curves in Theorem 1.2, independently of , by using the large sieve estimate [Kowalskibook]*Corollary 8.10 instead of Proposition 2.22 of this paper. Moreover, the exponents for in the bounds 1.1.2 and 1.1.3 are twice as small, while the exponent for in the last bound 1.1.4 is slightly better. Observe however that by passing to a multidimensional space of parameters, we lose the uniformity in in the bounds. Such a phenomenon was already present in [Kowalskibook]*Corollary 8.10 which results in a larger exponent of in the multidimensional case. In our case, the uniformity in is lost when applying the improved bound [Perret-GentilANT]*Theorem 5.14.(ii).(c).
For the first two properties considered in Theorem 1.1, inputs from arithmetic geometry give us better bounds for some restricted genera. Our first improvement is for genus or concerning the failure of .
Theorem 1.4**.**
Let be a prime number, a power of and . We write , so that . When , we have
[TABLE]
When , then we have
[TABLE]
In particular, in this more restricted setting, these bounds improve on 1.1 1 and a fortiori on 1.1 4. Note that the result for genus at most two comes from the fact that we completely understand the Frobenius eigenvalues for genus and hyperelliptic curves over . The reason is that all smooth projective curves of genus at most two are hyperelliptic, and the Torelli image of is dense in . Neither of the facts holds for higher genus.
Our last result is a bound for the bias dealt with in Theorem 1.1 2 which is uniform in the degree, at the expense of being worse in terms of for small .
Theorem 1.5**.**
If is a fixed prime power with . Then,
[TABLE]
In particular, this bound is better than the second bound of Theorem 1.1 in terms of as soon as . The underlying method coming from arithmetic geometry cannot deal with the conditions in 1.1 3 and 4 because they are concerned with multiple zeros of the zeta function of at once.
Outline of the paper
In Section 2 we set the notation and give preliminary results used in the rest of the paper. In particular, section 2.3 states some results about linear recurrent sequences, and section 2.4 is devoted to the proof of a large sieve statement, which is one important step in the proof of Theorem 1.1. In Section 3 we give a proof of the first item of Theorem 1.1 following Kowalski’s method and Theorem 1.4 by elementary methods. In Section 4 we derive conditions for a complete bias and prove the second item of Theorem 1.1 with the large sieve for Frobenius and Theorem 1.5 with arithmetic geometry. In Section 5 and 6 we derive conditions for a lower order bias and a reversed bias respectively and we prove the last two items of Theorem 1.1. Finally, in Section 7 we gather counting lemmas obtained using our large sieve result Proposition 2.22 that are used in the proofs of the different parts of Theorem 1.1.
Acknowledgements
This work was partially supported by the grant KAW 2019.0517 from the Knut and Alice Wallenberg Foundation (for LD). Part of this work was conducted while WL was in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring 2023 semester. The authors thank Florent Jouve, Jordan Ellenberg and Emmanuel Kowalski for helpful discussions. They also thank Régis de la Bretèche for organizing the elementary and analytic number theory seminar in IHP, Paris, where ideas used in this paper were born.
2. Preliminary results and notations
2.1. Notations and Definitions
We first provide notations for the rest of the paper. When , the projective curve with affine model is denoted by . Recall that has genus .
For , let be the primitive quadratic character modulo . We want to compare the number of degree irreducible polynomials over such that and those such that for varying . Define the Dirichlet -function associated to a Dirichlet character modulo as
[TABLE]
where and the sum and product above range over monic (resp. irreducible) polynomials of .
We now recall some properties of the -functions under consideration; see e.g. [Rosen2002]*Proposition 4.3, Theorem 5.9 for details. For a non-principal Dirichlet character , the Dirichlet -function is a polynomial in with integer coefficients and the zeta function of is a rational function in , which we denote by . Thanks to the deep work of Weil [Weil_RH], we know the analogue of the Riemann Hypothesis is satisfied for these zeta functions, that is their inverse zeros have absolute value . When is odd, then , and when is even, we have . In the following, we will mostly use the reciprocal polynomial
[TABLE]
which is monic, and its roots are the inverse zeros of .
In the following, we denote by the distinct inverse zeros of of norm , with multiplicity . We might forget the dependency in the character when only one character is considered and the notation stays clear from the context. We let be the number of distinct pairs of conjugate non-real zeros of . Since has real coefficients, after reordering, we can assume and we have for . Since is primitive, we have
[TABLE]
Using the explicit formula in [Cha2008]*Proposition 4.2, our object of study is the function
[TABLE]
Let be the opposite of the main sum of in (2.1); that is
[TABLE]
In the case the set is linearly independent over , which is expected to be the generic case, then \Delta_{f}(n)-\big{(}m_{0}(\chi_{f})+\tfrac{1}{2}\big{)} oscillates around zero and takes positive (resp. negative) values half of the time (i.e., for of positive integers ). Thus, is larger (resp. smaller) than its mean value for half of the positive integers . One deduces (see [Cha2008]*page 1366) that there is a bias in the distribution of the values of in the direction of positive values, i.e. coming back to we expect a bias towards negative values. Or in other terms, there are in general more irreducible polynomials of degree with than with .
Now, it can happen that the oscillating part does not distribute so well between positive and negative values. This is the case in the examples given in [Cha2008]*Section 5 and also for the different kinds of behaviors we consider in this paper.
Remark 2.1**.**
In this paper, we are studying the summatory function of a quadratic character over irreducible polynomials. Another “prime number race” of interest is the one between quadratic residues and non-quadratic residues. Observe that these are the same in the case is irreducible. In the case is not irreducible, one has to take into account the contribution of all quadratic (non-necessarily primitive) characters modulo , which makes the study more difficult. The general formula proved in [DevinMeng]*Proposition 5.2 is
[TABLE]
where denotes the set of quadratic residues modulo , denotes the set of non-quadratic residues modulo and is the set of quadratic characters modulo .
For a given , we define three kinds of “exceptional biases” as follows.
Definition 2.2**.**
[Complete bias] We say that exhibits a complete bias if for almost all . That is,
[TABLE]
Remark 2.3**.**
[ vs. ] In particular, if exhibits a complete bias, then exists and is equal to , but the converse need not hold. Note that the above definition may not cover all the cases for which : it may happen that for a positive proportion of and then for those , the sign of is determined by the sign of the error term and necessitate further study. In the next definition, we define the case of “lower order bias” below to characterize this possibility.
Definition 2.4**.**
[Lower order bias] We say that exhibits a lower order bias if for a positive proportion of . That is,
[TABLE]
Remark 2.5**.**
The condition of having a lower order bias is close to the condition “ties have positive density”, as introduced by Martin and Ng in [MartinNg2020] in the context of prime number races.
Finally, the last type of exceptional bias we are going to study is a direct incompatibility with the expectation that is negative for more than of integers .
Definition 2.6**.**
[Reversed bias] We say that exhibits a reversed bias if for more than half of the . That is,
[TABLE]
Remark 2.7**.**
- (1)
In Section 2.3, we will show the three densities in Definitions 2.2,2.4,2.6 exist, see Corollaries 2.15 and 2.17. 3. (2)
Note that both a lower order bias and a reversed bias may occur simultaneously, but that is the only possible combination of two exceptional biases.
Remark 2.8**.**
Observe that we could also (as in [Cha2008, DevinMeng]) count irreducible polynomials of degree instead of degree . In this case, the functions replacing and take the following more complicated forms:
[TABLE]
where the sums are over .
We cannot adapt most of our proofs for those quantities. For instance, the maximal value of such a sum is not easy to determine, and we’ll make frequent use of the maximum values in Section 4.1 (e.g. the proof of Proposition 4.2 to see why maximal values are relevant to us). However, we have for example , where represents the main sum in above, and so if is large enough compared to , the sign of is the sign of . In particular, under the right conditions, a complete bias and a reversed bias in the “degree ” setting one gets from studying ), implies a similar bias in the “degree ” setting one gets from studying ). Note also that the difference between counting irreducible polynomials of degree equal to and counting those of degree at most is analogous to the difference between counting prime number in intervals of the form and those in intervals of the form .
2.2. Properties of limiting distributions
To study the densities involved in the definitions 2.2, 2.4, and 2.6, we will use the notion of limiting distribution, which we define as follows.
Definition 2.9**.**
Let be a real function, we say that admits a limiting distribution if there exists a probability measure on Borel sets in such that for any bounded continuous function on , we have
[TABLE]
We call the limiting distribution of the function .
The function defined as Equation 2.3 is quasi-periodic, and we can apply the Kronecker-Weyl equidistribution theorem (see e.g. [Hum]*Lemma 2.7 and [Bailleul2]*Theorem 2.2) to prove the following proposition ([DevinMeng]*Proposition 2.1).
Proposition 2.10**.**
The function admits a limiting distribution with mean value and variance
[TABLE]
Moreover, the measure has support in
[TABLE]
The next lemma will be used to study reversed bias.
Lemma 2.11**.**
The distribution in Proposition 2.10 is symmetric with respect to if and only if there is no relation
[TABLE]
with and .
Proof.
Denote by the closure of the -parameter group in the -dimensional torus . We first remark that by Pontryagin duality, for any , if and only if for every character , one has . Therefore, we just need to show that is symmetric with respect to if and only if , since if and only if is even.
By the Kronecker–Weyl Equidistribution Theorem (see for example [DevinMeng]*Lemma 2.2), is a subtorus of and we have, for any continuous function ,
[TABLE]
where is the normalized Haar measure on . Then is the push-forward measure of through
[TABLE]
for any bounded continuous functions .
Now, is symmetric with respect to if and only if, for every continuous function , one has
[TABLE]
Observe that
[TABLE]
So, if , using the fact that the Haar measure is translation-invariant, we deduce that is symmetric with respect to .
On the other hand, assume . Then as is closed, and there exists such that for each one has111See Lemma 4.8 for an explicit bound.
[TABLE]
Let be a non-zero, non-negative function, supported in an interval of length around . Then
[TABLE]
while
[TABLE]
In particular, we deduce that is not symmetric with respect to . ∎
2.3. Results about linear recurrence sequences
We are interested in the positivity and zero-sets of the quantities defined in 2.3. One of the key insight is that those quantities are linear recurrence sequences which will imply the limits in Definitions 2.2, 2.4, and 2.6 exist as shown in Corollaries 2.15, 2.17.
Definition 2.12**.**
A linear recurrence sequence of order is a sequence such that there exist satisfying
[TABLE]
for all . We define its zero-set as .
It is classical that any linear recurrence sequence can be expressed in a generalized power sum form and that, conversely, any generalized power sum satisfies a linear recurrence relation.
Lemma 2.13**.**
Let , then the sequence is a linear recurrence sequence.
Proof.
Let be the reversed zeta function of the curve with , and let be the primitive quadratic character modulo and be the genus of . The roots of are which are of the form with some of them possibly . Then, the conclusion follows from [RecSeq]*page 3. ∎
Note that Lemma 2.13 is a well-known fact that follows directly from the rationality of the -function. It is not a particularity of hyperelliptic curves. We stated and proved the result here, as this is the first time it is used in the context of studying Chebyshev’s bias.
It turns out one can characterize the zero-set of such a linear recurrence sequence following the Skolem-Mahler-Lech theorem, which is stated below. A very short proof over (the Skolem case), which is the case of interest for us, using -adic analysis, is given in [RecSeq]*Theorem 2.1.
Theorem 2.14** (Skolem-Mahler-Lech, [RecSeq]*Theorem 2.1).**
Assume is a linear recurrence sequence over a field of characteristic zero. Then its zero-set is the union of a finite set and a finite number of arithmetic progressions.
This allows us to show that the density in the Definition 2.4 of a lower order bias always exists.
Corollary 2.15**.**
The density in Definition 2.4 for lower order bias exists.
Proof.
By Lemma 2.13, is a linear recurrence sequence. Its zero-set is a finite union of arithmetic progressions and a finite set following Theorem 2.14, therefore it admits a natural density. ∎
Another useful fact is the following result, showing that the densities considered for complete biases and reversed biases exist.
Theorem 2.16** ([BeGer]*Theorem 1).**
Let be a linear recurrence sequence of real numbers. Then its positivity set admits a natural density.
Corollary 2.17**.**
The densities and in Definitions 2.2 and 2.6 exist.
For certain kinds of linear recurrence sequences, called non-degenerate linear recurrence sequences, we know their zero-sets are finite. We introduce the following more general terminology for the character inspired by [RecSeq]*Section 1.1.9 because it will be an important condition to study in the proofs of (3) and (4) in Theorem 1.1.
Definition 2.18**.**
We say that is non-degenerate when none of , for , and none of , for , is a root of unity.
Using Definition 2.18, we prove the following Lemma which will be of important use in the study of lower order bias in Section 5.
Lemma 2.19**.**
Assume is non-degenerate as in Definition 2.18. Then the zero-set of is finite.
Proof.
By [RecSeq]*page 25, a non-degenerate linear recurrence sequence, that is, a sequence whose characteristic roots satisfy that no is a root of unity for , takes a given value only finitely many times. In our case however, the characteristic roots are , but also and because of the terms and in , and obviously is a root of unity. But it is easily seen that \Big{(}\Delta_{f}(2n)-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)-\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)\Big{)}_{n\geq 0} and \Big{(}\Delta_{f}(2n+1)-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)+\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)\Big{)}_{n\geq 0} are linear recurrence sequences ([RecSeq]*Theorem 1.1 and [RecSeq]*Theorem 1.3), and when is non-degenerate according to Definition 2.18, then those are non-degenerate as linear recurrence sequences. In particular, they respectively take the values and a finite number of times, which proves that vanishes a finite number of times. ∎
Remark 2.20**.**
In the non-degenerate case, we could replace the densities in Definitions 2.2 and 2.6 by the corresponding densities for since they exist and coincide with the ones about in that case following the fact that the density in Definition 2.4 is zero.
2.4. A large sieve statement
Let be the group of symplectic similitudes222This is sometimes called the general symplectic group and denoted as GSp in . It contains matrices such that there exists a scalar , called the multiplicator of , satisfying with When is a symplectic similitude with multiplicator , we say that is -symplectic. In this paper, following [Kowalskibook]*page 158 but with a reversed convention, we call -symplectic, any monic polynomial of even degree satisfying
[TABLE]
In particular, for the polynomial as defined in (2.1) is -symplectic.
Let us first state the result Theorem 2.21 for a general setting, using Perret-Gentil’s improvement of Kowalski’s large sieve for Frobenius [Perret-GentilANT]*Theorem 5.14.(ii).(c) and later apply it to our setting in Proposition 2.22.
The theorem is given for a general smooth affine geometrically connected algebraic variety of dimension over . We assume that has a compactification where it is the complement of a divisor with normal crossing. We denote by a geometric generic point of .
Let us fix a set of primes different from and of density . We study a family of lisse sheaves of -vector spaces on , corresponding to continuous homomorphisms , for that arise from a compatible system (as in [Kowalskibook]*Definition 8.7). Then for , we denote and .
Theorem 2.21**.**
Let be a prime number and be a power of . For each fix a conjugacy invariant subset, in the coset .
Then, for any and for any which is a power of , one has
[TABLE]
with a constant that depends only on and on the family (in particular not on , but certainly on ),
[TABLE]
where is the set of squarefree integers whose prime factors are all in , , and when one can take .
Proof.
We are in the setting of [Kowalskibook]*Chapter 8, following the ideas and notations of loc. cit. It follows from [Kowalskibook]*Proposition 2.9 as in [Kowalskibook]*Corollary 8.10 that
[TABLE]
where is as defined in (2.4) and is the large sieve constant. As in the proof of [Kowalskibook]*Proposition 8.8 we obtain that
[TABLE]
with
[TABLE]
where is the lisse sheaf corresponding to the representation as defined in [Kowalskibook]*(3.8), and is the sum of all except the largest Betti numbers as defined in [Kowalskibook]*page 166. In [Perret-GentilANT]*Section 5D2, Perret-Gentil improves the bound on compared to the bound of [Kowalskibook]*Proposition 8.8 in the case of the complement of a divisor with normal crossing. He obtains
[TABLE]
where the implicit constant depends on and on the family (in particular not on , but certainly on and on ). Thus, we have
[TABLE]
To conclude, we use [Kowalskibook]*(8.13), and multiplicativity. In particular, representations of satisfy and . ∎
To improve on Kowalski’s bound (1.1) in Theorem 1.2, we are going to use the following large sieve result which follows from Theorem 2.21 applied to the variety of configurations, with the compatible system given by the action of the Frobenius.
Proposition 2.22**.**
Let be a prime number and be a power of . Let , be the configuration space of monic squarefree polynomials of degree and be the set of primes different from and .
*For each , the action of the Frobenius endomorphism on gives a representation for a geometric generic point and for all they form a compatible system (as in [Kowalskibook]Definition 8.7), with image equal to the set of -symplectic similitudes following the work of Hall [Hall].
For every , let be a conjugacy invariant subset such that the multiplicator of every element of is .
Then, one has
[TABLE]
where the implicit constant depends only on and , we can take , is the set of squarefree integers whose prime factors are all in , and .
Proof.
We are in the setting of Theorem 2.21 with of dimension . The variety is defined by the non-vanishing of the discriminant, it is thus a smooth affine geometrically connected algebraic variety which is the complement of a divisor with normal crossing ([EVW]*Lemma 7.6).
As in [Kowalskibook]*Section 8.6 for each , the sheaf corresponding to is a rank lisse sheaf of -modules on . Since the action of the Frobenius on is independent of , the representations arise from a compatible system. By [Hall]*Theorem 1.2 (attributed to Yu), the images of and of (arithmetic and geometric monodromy groups) are conjugate to for all .
Hence, the bound follows from Theorem 2.21, where we chose . ∎
Remark 2.23**.**
Note that for any finite set of primes , the result of Proposition 2.22 holds with the set replaced by , and the set replaced by the set of squarefree integers with prime factors in . This is used in the proof of Lemma 7.5.
3. Linear dependence
Kowalski’s Theorem 1.2 is concerned with one-parameter families of reducible squarefree polynomials. The large sieve result Proposition 2.22 above allows us, following Kowalski’s proof in [Kowalskibook], to get the exact same bound, but for the larger space of parameters .
Proof of Theorem 1.1.1..
We follow exactly the proof of [Kowalskibook]*Theorem 8.15 but instead of using [Kowalskibook]*Corollary 8.10, we use Proposition 2.22. Thus, we obtain
[TABLE]
where for ,
[TABLE]
and the sets are defined as in [Kowalskibook]*pages 179–180. In particular,
- (1)
is the set of matrices with multiplicator such that is irreducible, and [Kowalskibook]*page 181 gives . 2. (2)
is the set of matrices with multiplicator such that factors as a product of an irreducible quadratic polynomial and a product of irreducible polynomials of odd degree, which satisfy333a factor was forgotten in [Kowalskibook]*page 181. by Lemma 7.7 (with , , in the case is odd) and [Kowalskibook]*Lemma B.5. 3. (3)
is the set of matrices with multiplicator such that the polynomial defined by factors as a product of an irreducible quadratic polynomial and a product of irreducible polynomials of odd degree, and [Kowalskibook]*page 181 gives . 4. (4)
is the set of matrices with multiplicator such that the polynomial defined by has an irreducible factor of prime degree , and [Kowalskibook]*page 181 gives .
The final bound is the same (correcting into ), but the space of parameters is larger. The dependency on is lost in the proof of Theorem 2.21.∎
To prove Theorem 1.4 for the genus case, we will use the following result of Ahmadi and Shparlinski.
Theorem 3.1** ([AhmadiShpar]*Theorem 2).**
Let be a smooth projective curve of genus . If the Jacobian of is absolutely simple, then the zeta function of satisfies .
Proof of Theorem 1.4..
Let us first prove the bound when and assume for now that . Then is an elliptic curve, with two conjugate (possibly equal) Frobenius eigenvalues. The only way for to fail is that those eigenvalues are of the form with a root of unity, that is, has to be a supersingular elliptic curve. By [Silverman]*V Theorem 4.1.(c), there are such curves over , up to -isomorphism (recall that is a power of the prime number ). But two elliptic curves are isomorphic over if and only if they have the same -invariant ([Silverman]*III Proposition 1.4.(b) which holds in every characteristic). Let be a fixed supersingular elliptic curve defined over with -invariant , and let us write the -invariant of the elliptic curve . Then clearly is a non-zero polynomial equation in the coefficients of by the definition of the -invariant [Silverman]*page 42. It is indeed non-zero since there always exist a non-supersingular elliptic curve over ([Waterhouse]*Theorem 4.1). In particular, one has
[TABLE]
This yields
[TABLE]
and the result follows since in general . In the case where we assume that . Then is isomorphic to its Jacobian , and by [Cremona]*page 82, is given as the smooth projective model of the curve defined by the equation , and and are the quartic invariants defined in [Cremona]*pages 72–73. The -invariant of is then clearly a non-constant rational function in the coefficients of , and we conclude as in the case .
Assume now that . By Theorem 3.1, if fails for the zeta function of , then its Jacobian is not absolutely simple, i.e. it splits over a finite extension of . In particular, the Weil polynomial of is reducible. Calling the degree , one has , where is the Weil polynomial of , which is equal to ([CorSil]*VII. Corollary 11.4), and is a primitive -th root of unity. It easily implies that has roots , . Now, there are two possible cases. Either one of is a rational number (necessarily ), or there are two indices such that is a rational number (necessarily ). In particular, is degenerate according to Definition 2.18. We conclude by Lemma 7.3. ∎
4. Complete biases
4.1. Upper bounds for complete biases
To derive a necessary condition for exhibiting a complete bias, we will use the following simple inequality of Bhatia and Davis [BhatiaDavis]*Theorem 1 (the proof in [BhatiaDavis] is done for discrete random variables, but the general case works exactly the same).
Theorem 4.1** (Bhatia-Davis Inequality).**
Let be a bounded random variable such that almost-surely with mean and variance , then
[TABLE]
Proposition 4.2** (Necessary condition for complete bias).**
Let and assume that admits a complete bias. Then one of the following assertions is true.
- (1)
The distribution is symmetric with respect to its mean value and and in the case , the inequality is strict with more than half of the zeros equal to . 2. (2)
The distribution is not symmetric with respect to its mean value and .
In particular, this implies the following condition.
Corollary 4.3**.**
If admits a complete bias for , then is a square and .
Remark 4.4**.**
In the case of Dirichlet -functions over , it is a famous conjecture of Chowla[Chowla] that no such -function can vanish at . It is known that Artin -functions corresponding to number fields extensions can vanish at . Incidentally, this was used in [Bailleul1] to provide examples of reversed bias in this context. In the function field case, it was shown in [Li18]*Theorem 1.3 that for any there are infinitely many Dirichlet -functions over vanishing at , that is such that the corresponding Weil polynomial vanishes at . However it is expected that of those -functions do not vanish at for a fixed ([Li18]*Remark 1.4). If this were true, we would obtain the following result instead of Theorem 1.5: for every a power of an odd prime,
[TABLE]
Note also that by [ELS]*Corollary 1.6 there is no complete bias when is irreducible and does not divide the degree of . Indeed, in this case .
We can now prove our main results concerning upper bounds for complete bias using the necessary condition in Corollary 4.3.
Proof of Theorem 1.1.2..
The proof follows from applying Corollary 4.3 and Lemma 7.1. ∎
Proof of Theorem 1.5.
By [ELS]*Theorem 3.2, one has
[TABLE]
and so the bound follows from Corollary 4.3. ∎
We finally give the proof of our necessary condition for complete bias.
Proof of Proposition 4.2.
Suppose that the distribution is symmetric with respect to its mean value . We have , so this value is in . Indeed, let and be non-negative continuous and supported on , with . Then
[TABLE]
where \tilde{h}(a_{0},\dots,a_{r})=h\Big{(}m_{0}(\chi_{f})+\tfrac{1}{2}+(m_{\pi}(\chi_{f})+\tfrac{1}{2})e^{ia_{0}}+2\sum_{j=1}^{r}m_{\theta_{j}}(\chi_{f})\cos(a_{j})\Big{)} and is the Haar measure on the subtorus of generated by . Since , we get , which implies .
By symmetry, is also in , so it is non-negative.
In the case is not symmetric with respect to its mean value, we are interested in the behavior of
[TABLE]
By [Bailleul2]*Theorem 3.1, we have where are random variables whose distributions are the limiting distributions of and respectively. Since we are assuming complete bias, then yields .
We apply the Bhatia-Davis Inequality, Theorem 4.1, to the random variable . To do so, we need the maximum, minimum, mean, and variance of . To understand these, we group the by pairs such that when necessary. We have
[TABLE]
where we sum on (in particular ), and we define where (and if such a does not exist). This grouping of terms was made to simplify the computation of the variance below. From this expression we deduce
[TABLE]
By the assumption of complete bias, we have almost-surely. By the definition of , we have
[TABLE]
and
[TABLE]
By the Bhatia-Davis inequality (Theorem 4.1), we obtain
[TABLE]
This yields
[TABLE]
If every is zero, this means that for every integer , one has . Since exhibits a complete bias, this has to be positive, i.e. . If there is at least one non-zero , the inequality (4.2) also implies .
Finally, since and have distinct multiplicities as roots of , those must be rational, hence integers, and so must be a square. ∎
4.2. Examples of complete biases
In this section, we first give a sufficient condition for a complete bias, in the hope to use it to find examples of instances of such an exceptional behavior.
Lemma 4.5** (Sufficient condition for complete bias).**
Let . Write
[TABLE]
with of maximal degree, . Assume that one of the following assertions holds,
- (1)
we have and , or 2. (2)
we have and , and
- (a)
* admits a root whose angle is not in , or* 2. (b)
there exists satisfying and is odd, where are the angles of the roots of .
Then there is a complete bias with modulus .
One such example is where is a generator of over , in [DevinMeng]*Example 3 the authors show that .
Remark 4.6**.**
More generally, in [DevinMeng]*Proposition 3.1, based on Honda–Tate ideas (citing [Waterhouse]*Theorem 4.1), one can see that for each square, there exist of degree such that the -function of is . This gives one example satisfying Lemma 4.5 for each square.
Remark 4.7**.**
Note however that our sufficient condition for a complete bias Lemma 4.5 is more restrictive than simply vanishing at so we cannot use the lower bound from [Li18]*Theorem 1.3 to give infinitely many examples of complete bias for a fixed .
Proof of Lemma 4.5.
It suffices to prove that under these conditions, we have for almost all , where is defined in (2.3). We order the zeros of so that the first ones correspond to the zeros of , with multiplicities. Then, for all we have
[TABLE]
and
[TABLE]
Since for all and , the conditions in case 1 imply that for all . In the case the conditions of 2a are satisfied, we have for almost all , since, up to reordering, we can assume that which yields for almost all . This concludes the proof in the case 2a. In the case 2b, it follows from Lemma 4.8 that for all , where and this concludes the proof. ∎
We conclude this section by proving a technical lemma that was used in the proof of the sufficient condition (Lemma 4.5).
Lemma 4.8**.**
Let and assume that there exists satisfying and is odd. Then, for all , we have In particular,
[TABLE]
Proof.
Recall that . For each , let be an integer that satisfies this minimum. We have
[TABLE]
Now, suppose that , then we have
[TABLE]
This concludes the proof. ∎
5. Lower order biases
5.1. Upper bound
Our reflections on linear recurrence sequences from Section 2.3 give a good understanding on lower order bias. In particular, the contraposition of Lemma 2.19 yields the following necessary condition for a lower order bias.
Proposition 5.1** (Necessary condition for lower order bias).**
If admits a lower order bias, then is degenerate (see Definition 2.18).
This lemma implies that for to admit a lower order bias, the Jacobian of the curve is either non-ordinary or geometrically admitting an isogenous factor of order at least .
Using this lemma and an application of the large sieve from Proposition 2.22, we obtain the proof of Theorem 1.1.3.
Proof of Theorem 1.1.3.
The proof follows from applying Proposition 5.1 and Lemma 7.3. ∎
5.2. A sufficient condition for lower order bias and examples
Lemma 5.2** (Sufficient condition for lower order bias).**
Let . Suppose that , then for all , in particular, there is a lower order bias with modulus .
Proof.
Assume that the roots of with positive imaginary parts are labelled so that their arguments are . Since , the multiplicity of equals to that of . For and , one has , whence
[TABLE]
Further,
[TABLE]
The above together give for all . This is sufficient to deduce that there is a lower order bias with modulus . ∎
One such example is which is irreducible and the -function of is which is even with inverse roots , , where
[TABLE]
Moreover, using [Calcut]*page 17, we see that has argument unrelated to .
Remark 5.3**.**
Using [HNR]*Table 1.2 and the sufficient condition, we can give several examples for each that have a lower order bias. Namely the authors show that the polynomial with is the Weil polynomial of the Jacobian of a hyperelliptic curve of genus 2 if , and is not a square. Since the Weil polynomial of the Jacobian of such a curve is equal to the corresponding ([CorSil]*VII. Corollary 11.4), such exhibit lower order biases.
Remark 5.4**.**
The condition of Lemma 5.2 gives rise to the following question: Fix a finite field , how many hyperelliptic curves admit even Frobenius characteristic polynomials? If is such a curve, then has its Frobenius characteristic polynomial being a perfect square. This question is closely related to counting curves/characters whose -functions are perfect squares.
6. Reversed biases
6.1. Upper bound
Let us first give a necessary condition for a reversed bias.
Proposition 6.1** (Necessary condition for a reversed bias).**
If there is a reversed bias with modulus then
- •
either there exist satisfying and is odd, where the are angles of zeros of .
- •
or (in particular, is a square).
Proof.
Suppose admits a reversed bias. Then the distribution is not symmetric with respect to its mean value . So, from Lemma 2.11, there exists such that and .
If is even, we get the first condition. Otherwise, assume that all relation between the ’s, satisfy is even. Then we deduce from Lemma 2.11, that the limiting distribution of the functions and are symmetric with respect to their mean values, which are and . If the probability that one of the two functions is negative is larger than , then at least one of the mean values has to be negative. ∎
Here is a translation of our necessary condition in terms of the Galois group of over , which is more convenient to use in the large sieve. Recall that for , is a subgroup of , itself a subgroup of (see [Kowalskibook]*page 249). In the following, we will consider that acts on , the set of indices of the roots . The fact that means that if then for all .
Lemma 6.2**.**
Let . Assume that there exist such that and . Then at least one of the following conditions hold:
- (1)
* is not separable.* 2. (2)
* is degenerate (in the sense of Definition 2.18).* 3. (3)
* does not act transitively on the set of pairs ;* 4. (4)
For every , does not contain the transposition , and for every pair , with , does not contain the -cycle .
Proof.
Assume that none of the first three items are satisfied. Let us fix , then for every , there exist such that . From the multiplicative relation
[TABLE]
with , we apply and taking the product over all ’s we obtain another multiplicative relation of the form
[TABLE]
where . In particular . This being true for each , by taking a suitable product of large powers of expressions of the form 6.1, we deduce that there exists such that
[TABLE]
Let . If , then we apply it to the relation 6.2 and taking a quotient we get . This is a contradiction because and is not a root of unity since is non-degenerate.
Now, let . If we get , and similarly by applying its inverse , we get . Combining the two relations, we obtain . But at least one among and is non-zero, since the ’s are non-zero, and as before, this shows that we cannot have . ∎
Lemma 6.3**.**
Let be a -symplectic polynomial of degree with roots . If does not act transitively on the pairs then defined by is reducible.
Proof.
Notice the roots of are the . Every element of are restrictions of elements of to the splitting field of . Now if is irreducible over , then acts transitively on the set . Thus, if , there exists such that . But for some so we have which implies , which means or , and acts transitively on the set of pairs . ∎
We can finally prove the last part of our main theorem.
Proof of Theorem 1.1.4.
The proof follows by using the necessary conditions of Proposition 6.1. We obtain a bound for the second condition by the same argument as in Lemma 7.1. For the first condition of Proposition 6.1, we use Lemma 6.2 and bound the conditions and by Lemma 7.3. The third item is bounded by using Lemma 6.3 and Lemma 7.4. Finally, the bound obtained with the condition on the Galois group, which is the largest contribution, is dealt with by Lemma 7.5. ∎
6.2. Examples
In the hope of finding examples of reversed bias in the sense of Definition 2.6 we estimated
[TABLE]
where is the primitive quadratic character modulo for small genera and small finite fields . In particular, for fixed we computed for many values of , e.g. all . We found no clear candidate curves which exhibited a ”strong” reversed bias amongst with a prime less than and as well as among those curves with and .
Remark 6.4**.**
We can still provide an infinite family of examples exhibiting a reversed bias. Indeed, when is a square the polynomial is the -function of a hyperelliptic curve of genus according to [HNR]. For such a curve , we have which is -periodic and takes positive values and negative values; explicitly, it takes the values .
Cha’s example ([Cha2008]*Example 5.3) corresponds to a reversed bias, however Cha is counting polynomials with degree less than instead of polynomials of degree equal to (see Remark 2.8). We verified that this example does not meet our criterion of being a reversed bias with our way of counting polynomials, but it exhibits a lower order bias because is -periodic, takes positive values (at ), negative values (at ) and is zero otherwise.
7. A few counting lemmas
Using the large sieve statement Proposition 2.22, we will now prove important intermediate counting lemmas that are used to establish our upper bounds for exceptional biases. Recall that is a power of the prime , and for any , is the genus of the curve for any the set of monic squarefree polynomials in of degree .
Lemma 7.1**.**
We have
[TABLE]
where .
Proof.
We first remark that the set is empty when is not a square, because in that case, and are conjugate algebraic numbers, so they must have the same multiplicity as roots of a polynomial with integer coefficients such as . We will prove our bound by showing that when is a square, we have
[TABLE]
For every (recall that is simply the set of primes different from and ), we introduce the set of -symplectic matrices for which is not an eigenvalue. From [Kowalskibook]*Lemma B.5 (due to Chavdarov) we have
[TABLE]
Since the set of symplectic -polynomials of degree in has dimension , and that the condition of vanishing at one point is a linear equation of the coefficients, we have
[TABLE]
We deduce that there exist a constant depending on such that
[TABLE]
Therefore, for , we have
[TABLE]
The desired bound then follows from Proposition 2.22 by summing only over primes in . ∎
Remark 7.2**.**
We could improve the bound above by not restricting to the sum over primes, but we decided not to pursue this here, as we expect the improvement will only be on the power of .
The following lemma will allow us to reduce our counting to the case of non-degenerate characters (as in Definition 2.18) and simple roots of .
Lemma 7.3**.**
We have
[TABLE]
where .
Proof.
Let satisfy the above condition, that is is degenerate or has a multiple root in . Then there exist such that is a root of unity, we denote its order (one can take in the case of a multiple root ). We first remark that and are algebraic integers of degree at most , so clearly is an algebraic number of degree at most , and so .
Since , it means that the polynomial has a multiple root. This implies that its discriminant is [math]. Now, is a polynomial with integer coefficients in the coefficients of since it is the resultant of and its derivative. Moreover, those coefficients are symmetric polynomials in the ’s, and in particular in the ’s. By the fundamental theorem of symmetric polynomials, this is a polynomial expression in the elementary symmetric polynomials in the ’s, which are precisely the coefficients of .
We have shown that satisfies a certain integral polynomial equation, i.e. there exists a polynomial such that, if are the coefficients of , then one has . Since there are at most finitely many such that , we get a universal relation
[TABLE]
such that if is degenerate or has a multiple root, then .
Moreover, when is large enough, we know that is non-zero since by Kowalski’s result (Theorem 1.2), there exists a polynomial monic of degree such that satisfies LI, and in particular, none of its quotients of roots is a root of unity, and for that polynomial, one has .
So the equation defines a hypersurface in the set of -symplectic polynomials of fixed degree, and we have
[TABLE]
See also [Kowalskibook]*Theorem B.6. The end of the proof is completely similar to the end of the proof of Lemma 7.1. ∎
In the next lemma, denotes the “real Weil polynomial” attached to , defined by the relation
[TABLE]
Lemma 7.4**.**
We have
[TABLE]
where .
Proof.
We use Proposition 2.22 with the set
[TABLE]
Since if a monic polynomial is reducible, none of its reduction modulo a prime can be irreducible, we have
[TABLE]
where . There are monic irreducible polynomials of degree with coefficients in . As is a bijection from the set of -symplectic polynomials in of degree to the set of monic polynomials of degree in , we deduce from [Kowalskibook]*Lemma B.5 (similarly to Lemma 7.1) that
[TABLE]
We conclude using the estimation of the sum from a theorem of Lau and Wu [Kowalskibook]Theorem G.2 applied the same way as Kowalski in [Kowalskibook](8.24):
[TABLE]
from which we deduce the stated bound. ∎
The last counting lemma is about polynomials such that does not contain certain permutations. Recall from the discussion above Lemma 6.2 that acts on .
Lemma 7.5**.**
We have
[TABLE]
where .
Proof.
First, we may assume that is separable, since the announced bound is worse than that of 7.3.
We are once again going to use the large sieve bound coming from Proposition 2.22 but the set of prime numbers used in the large sieve has to be modified a bit here because of Lemma 7.7: we take to be the set of prime numbers different from and and larger than (see Remark 2.23). This only induces a further dependency on in the implied constants, but doesn’t modify the final bound.
For every , we consider be the set of -symplectic matrices such that the characteristic polynomial admits a factorization either as a quadratic irreducible polynomial multiplied by distinct irreducible polynomials of odd degree, or as a quartic irreducible polynomial multiplied by distinct irreducible polynomials of odd degree. Indeed, if is separable but the Galois group does not contain a transposition nor a -cycle (when seen as a subgroup of ), then for any (see [Jacobson]*Theorem 4.37).
Therefore, we need to count the symplectic polynomials with such factorizations to be able to conclude as above. For , we let .
In the case is even, we use Lemma 7.7 with (, ) and with (, , ) to get
[TABLE]
In the case is odd we use Lemma 7.7 with (, , ) with (, , ), and with (, ) to get
[TABLE]
In both cases we have , so we obtain the announced bound in the same way as in the proof of Proposition 7.4. ∎
Remark 7.6**.**
In the proof above of Lemma 7.5 one could expand the application of Lemma 7.7 to add more terms to the lower bound of to gain marginal improvements. The additional condition in the lemma and its application above with delivers our improvement over Kowalski’s bound (1.1).
Lemma 7.7**.**
Let be two integers, and let be a prime number. Write and let , , be integers such that . Let be the set of -symplectic squarefree polynomials which factor as a product , where is an irreducible -symplectic polynomial of degree , each is a product of distinct irreducible monic polynomials of degree , and is the -reciprocal of . Then, we have
[TABLE]
Proof.
First observe that for any -symplectic polynomial , one has , in particular, for all , one has . We appeal to [Kowalski2006]*Lemma 7.3 (ii), which gives that the count of irreducible symplectic polynomials of degree is larger than (see also [DDS]*Lemma 3 which can be adapted to the case of -symplectic polynomials). The irreducible factors of odd degree of a symplectic polynomial come in pairs , uniquely determined by either of its elements. So it suffices to count polynomials of degree that are products of distinct odd degree irreducible polynomials. By [Kowalskibook]*Lemma B.1 there are polynomials with given factorization (as in the statement of the lemma).
For each polynomial with factorization type , for each factor , , we have made a choice of which element of the pair to include. There are such choices for each .
We just need to remove from the final count the monic -symplectic polynomials that have multiple roots, as counted in the proof of Lemma 7.3, there are at most such polynomials, so this does not change the main term. ∎
References
