Exceptional biases in counting primes over functions fields

Alexandre Bailleul; Lucile Devin; Daniel Keliher; Wanlin Li

arXiv:2302.13665·math.NT·March 5, 2024

Exceptional biases in counting primes over functions fields

Alexandre Bailleul, Lucile Devin, Daniel Keliher, Wanlin Li

PDF

Open Access

TL;DR

This paper investigates the rarity of certain biases in prime distributions over function fields, showing they become negligible as the size of the finite field grows, using advanced sieve and geometric methods.

Contribution

It introduces new bounds demonstrating the vanishing probability of three types of prime biases in large finite fields, improving previous results by Kowalski.

Findings

01

Biases occur with probability tending to zero as q increases

02

New bounds improve upon Kowalski's earlier results

03

Uses advanced sieve methods and arithmetic geometry techniques

Abstract

We study how often exceptional configurations of irreducible polynomials over finite fields occur in the context of prime number races and Chebyshev's bias. In particular, we show that three types of biases, which we call "complete bias", "lower order bias" and "reversed bias", occur with probability going to zero among the family of all squarefree monic polynomials of a given degree in $F_{q} [x]$ as $q$ , a power of a fixed prime, goes to infinity. The bounds given improve on a previous result of Kowalski, who studied a similar question along particular $1$ -parameter families of reducible polynomials. The tools used are the large sieve for Frobenius developed by Kowalski, an improvement of it due to Perret-Gentil and considerations from the theory of linear recurrence sequences and arithmetic geometry.

Equations199

H_{n} (F_{q}) = {f \in F_{q} [x] ∣ f is monic, squarefree, de g f = n} .

H_{n} (F_{q}) = {f \in F_{q} [x] ∣ f is monic, squarefree, de g f = n} .

Π (n; χ_{f})

Π (n; χ_{f})

\displaystyle-\#\{h\in{\mathbb{F}}_{q}[x]\mid h\text{ is irreducible, }\deg h=n\text{ and }\chi_{f}(h)=-1\}\Big{)}.

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪_{p, g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ}

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪_{p, g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ}

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a complete bias} ≪_{p, g} q^{- \frac{1}{A}} lo g q,

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a complete bias} ≪_{p, g} q^{- \frac{1}{A}} lo g q,

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a lower order bias} ≪_{p, g} q^{- \frac{1}{A}} lo g q .

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a lower order bias} ≪_{p, g} q^{- \frac{1}{A}} lo g q .

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a reversed bias} ≪_{p, g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ^{'}},

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a reversed bias} ≪_{p, g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ^{'}},

C_{t} : y^{2} = f (x) (x - t), for t \in U .

C_{t} : y^{2} = f (x) (x - t), for t \in U .

\frac{1}{∣ U ( F _{q} ) ∣} # {t \in U (F_{q}) ∣ The zeta function of C_{t} does not satisfy LI} ≪_{g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ},

\frac{1}{∣ U ( F _{q} ) ∣} # {t \in U (F_{q}) ∣ The zeta function of C_{t} does not satisfy LI} ≪_{g} q^{- \frac{1}{2 A}} (lo g q)^{1 - δ},

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪ \frac{p}{q} .

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪ \frac{p}{q} .

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪_{p} q^{- \frac{1}{12}} lo g q .

\frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ The zeta function of C_{f} does not satisfy LI} ≪_{p} q^{- \frac{1}{12}} lo g q .

n \geq 3 sup \frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a complete bias} ≪ \frac{1}{q ^{1/276}} .

n \geq 3 sup \frac{1}{∣ H _{n} ( F _{q} ) ∣} # {f \in H_{n} (F_{q}) ∣ Π (n; χ_{f}) exhibits a complete bias} ≪ \frac{1}{q ^{1/276}} .

L (s, χ) = a monic \sum \frac{χ ( a )}{∣ a ∣ ^{s}} = P irreducible \prod (1 - \frac{χ ( P )}{∣ P ∣ ^{s}})^{- 1},

L (s, χ) = a monic \sum \frac{χ ( a )}{∣ a ∣ ^{s}} = P irreducible \prod (1 - \frac{χ ( P )}{∣ P ∣ ^{s}})^{- 1},

P_{f} (T) = T^{2 g} Z_{f} (T^{- 1}),

P_{f} (T) = T^{2 g} Z_{f} (T^{- 1}),

m_{0} (χ_{f}) + m_{π} (χ_{f}) + 2 j = 1 \sum r m_{θ_{j}} (χ_{f}) = 2 g .

m_{0} (χ_{f}) + m_{π} (χ_{f}) + 2 j = 1 \sum r m_{θ_{j}} (χ_{f}) = 2 g .

Π (n; χ_{f})

Π (n; χ_{f})

\displaystyle\quad-\#\{h\in{\mathbb{F}}_{q}[x]\mid h\text{ is irreducible , }\deg h=n\text{ and }\chi_{f}(h)=-1\}\Big{)}

= \frac{n}{q ^{n /2}} d e g h = n h irreducible \sum χ_{f} (h)

= - (m_{0} (χ_{f}) + \frac{1}{2}) - (m_{π} (χ_{f}) + \frac{1}{2}) (- 1)^{n} - θ_{j} \neq = 0, π \sum m_{θ_{j}} (χ_{f}) e^{in θ_{j} (χ_{f})} + O_{f} (q^{- \frac{n}{6}}) .

Δ_{f} (n) = (m_{0} (χ_{f}) + \frac{1}{2}) + (m_{π} (χ_{f}) + \frac{1}{2}) (- 1)^{n} + θ_{j} \neq = 0, π \sum m_{θ_{j}} (χ_{f}) e^{in θ_{j} (χ_{f})} .

Δ_{f} (n) = (m_{0} (χ_{f}) + \frac{1}{2}) + (m_{π} (χ_{f}) + \frac{1}{2}) (- 1)^{n} + θ_{j} \neq = 0, π \sum m_{θ_{j}} (χ_{f}) e^{in θ_{j} (χ_{f})} .

Π (n; f, □, ⊠) :=

Π (n; f, □, ⊠) :=

\displaystyle-\frac{1}{\lvert\boxtimes\rvert}\lvert\{h\in{\mathbb{F}}_{q}[x]\mid h\text{ monic irreducible, }\deg{h}=n,h\bmod f\in\boxtimes\}\rvert\Big{)}

=

\displaystyle\hskip 56.9055pt+O_{f}\left(q^{-\frac{n}{6}}\right)\Bigg{\}},

dens (Δ_{f} > 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) > 0} = 1.

dens (Δ_{f} > 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) > 0} = 1.

dens (Δ_{f} (n) = 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) = 0} > 0.

dens (Δ_{f} (n) = 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) = 0} > 0.

dens (Δ_{f} (n) < 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) < 0} > \frac{1}{2} .

dens (Δ_{f} (n) < 0) := X \to + \infty lim \frac{1}{X} n \leq X \sum 1_{Δ_{f} (n) < 0} > \frac{1}{2} .

Π (\leq n; f, □, ⊠)

Π (\leq n; f, □, ⊠)

\displaystyle-\frac{1}{\lvert\boxtimes\rvert}\lvert\{h\in{\mathbb{F}}_{q}[x]\mid h\text{ monic irreducible, }\deg{h}\leq n,h\bmod f\in\boxtimes\}\rvert\Big{)}

\displaystyle=\ \frac{-1}{\lvert\boxtimes\rvert}\Bigg{\{}\sum_{\chi\in X_{f}^{\text{quad}}}\Bigg{(}\ \left(m_{0}(\chi)+\tfrac{1}{2}\right)\frac{\sqrt{q}}{\sqrt{q}-1}+\left(m_{\pi}(\chi)+\tfrac{1}{2}\right)\frac{\sqrt{q}}{\sqrt{q}+1}(-1)^{n}

\displaystyle+\sum_{\theta_{j}\neq 0,\pi}m_{\theta_{j}}(\chi)\frac{\sqrt{q}e^{i\theta_{j}(\chi)}}{\sqrt{q}e^{i\theta_{j}(\chi)}-1}e^{in\theta_{j}(\chi)}\Bigg{)}+O_{f}\left(q^{-\frac{n}{6}}\right)\Bigg{\}};

Π (\leq n; χ_{f}) :=

=

- θ_{j} \neq = 0, π \sum m_{θ_{j}} (χ_{f}) \frac{q e ^{i θ_{j} (χ_{f})}}{q e ^{i θ_{j} (χ_{f})} - 1} e^{in θ_{j} (χ_{f})} + O_{f} (q^{- \frac{n}{6}}),

Y \to \infty lim \frac{1}{Y} n \leq Y \sum h (D (n)) = \int_{R} h (t) d μ (t) .

Y \to \infty lim \frac{1}{Y} n \leq Y \sum h (D (n)) = \int_{R} h (t) d μ (t) .

\big{(}m_{\pi}(\chi_{f}\big{)}+\tfrac{1}{2})^{2}+\frac{1}{2}\sum\limits_{j=1}^{r}m_{\theta_{j}}(\chi_{f})^{2}.

\big{(}m_{\pi}(\chi_{f}\big{)}+\tfrac{1}{2})^{2}+\frac{1}{2}\sum\limits_{j=1}^{r}m_{\theta_{j}}(\chi_{f})^{2}.

[m_{0} (χ_{f}) - m_{π} (χ_{f}) - 2 j = 1 \sum r m_{θ_{j}} (χ_{f}), m_{0} (χ_{f}) + m_{π} (f) + 1 + 2 j = 1 \sum r m_{θ_{j}} (χ_{f})] .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCoding theory and cryptography · Analytic Number Theory Research · Algebraic Geometry and Number Theory

Full text

Exceptional biases in counting primes over function fields

Alexandre Bailleul

ENS Paris-Saclay, Centre Borelli, UMR 9010, 91190 Gif-sur-Yvette, France

[email protected]

,

Lucile Devin

Univ. Littoral Côte d’Opale, UR 2597 LMPA, Laboratoire de Mathématiques Pures et Appliquées Joseph Liouville, F-62100 Calais, France

[email protected]

,

Daniel Keliher

University of Georgia, Department of Mathematics, 200 D. W. Brooks Drive, Athens, GA 30602, USA

[email protected]

and

Wanlin Li

Washington University in St. Louis, Department of Mathematics and Statistics, One Brookings Drive, St. Louis, MO 63130, USA

[email protected]

Abstract.

We study how often exceptional configurations of irreducible polynomials over finite fields occur in the context of prime number races and Chebyshev’s bias. In particular, we show that three types of biases, which we call “complete bias”, “lower order bias” and “reversed bias”, occur with probability going to zero among the family of all squarefree monic polynomials of a given degree in ${\mathbb{F}}_{q}[x]$ as $q$ , a power of a fixed prime, goes to infinity. The bounds given improve on a previous result of Kowalski, who studied a similar question along particular 1-parameter families of reducible polynomials. The tools used are the large sieve for Frobenius developed by Kowalski, an improvement of it due to Perret-Gentil and considerations from the theory of linear recurrence sequences and arithmetic geometry.

1. Introduction

Chebyshev’s bias is the phenomenon that there are more prime numbers of the form $4n+3$ than of the form $4n+1$ in initial intervals $\llbracket 2,x\rrbracket$ of $\mathbb{N}$ for most values of $x$ (more precisely, the set of such $x$ admits a logarithmic density of around $99.59\%$ ). More generally, primes which are congruent to a fixed non-square residue class modulo an integer $q$ are more numerous than those which are congruent to a given square residue class modulo $q$ in initial intervals of $\mathbb{N}$ . The origin of this phenomenon was explained by Rubinstein and Sarnak in [RS].

The analogue of Chebyshev’s bias over function fields was first considered by Cha in [Cha2008] to study inequities in the distribution of irreducible polynomials in residue classes of ${\mathbb{F}}_{q}[x]$ , and later by Cha and Im in [ChaIm2011] in function field extensions. As in the classical archimedean case of [RS], a central hypothesis is a linear independence hypothesis which will be called $\mathrm{LI}$ throughout. If the arguments of the non-trivial inverse zeros (of non-negative imaginary parts) of the underlying $L$ -functions are of the form $\sqrt{q}e^{i\theta}$ , then $\mathrm{LI}$ claims that the $\theta$ ’s, together with $\pi$ , are linearly independent over ${\mathbb{Q}}$ . A consequence of $\mathrm{LI}$ is that Chebyshev’s bias favours non-square residue classes rather than square residue classes in the distribution of primes. See [RS]*page 185 (where it is called $\mathrm{GSH}$ ) for the archimedean case, and [Cha2008]*page 1366 for the function field case. For a survey on prime number races over ${\mathbb{Q}}$ , see [GranvilleMartin].

Over ${\mathbb{Q}}$ and number fields, exceptional biases have been studied in the literature, notably in a series of papers by Ford and Konyagin [Ford_Konyagin_2002, Ford_Konyagin_2003] and [FordKonyaginLamzouri]. Fiorilli and Martin [FiorilliMartin], under both the Generalized Riemann Hypothesis and $\mathrm{LI}$ , list the largest possible biases in the prime number race between quadratic residues and non-quadratic residues. In number field extensions, Bailleul produced infinite families of examples exhibiting a reversed bias in [Bailleul1], conditionally on a suitable linear independence hypothesis. As for unconditional results, Fiorilli and Jouve constructed infinite families exhibiting a complete bias in [FiorilliJouve].

The state of affairs in the function field setting is rather different. For instance, over $\mathbb{F}_{q}[x]$ , there are a few known counterexamples to $\mathrm{LI}$ (see [Cha2008]*Section 5, [DevinMeng]*Section 3, [Dupuyetal]*Section 7, [Sedrati]*Section 10), which can lead to what we call “exceptional biases”, for example favouring square residue classes rather than non-square residue classes (“reversed bias”), or having more non-square residue classes than square residue classes 100% of the time (“complete bias”). In [CFJ2016], Cha, Fiorilli and Jouve give examples of exceptional biases in Mazur’s race related to counting points on elliptic curves. They prove also the genericity of $\mathrm{LI}$ for certain families in this context in [CFJ2017].

In this paper, we investigate three types of exceptional biases. For those types of biases, we establish more precise necessary conditions than negation of $\mathrm{LI}$ for them to hold, and we show that they happen very rarely.

In order to state our results more precisely, we need to introduce some notation. When $q$ is a power of a prime $p$ and $n\geq 1$ , we let

[TABLE]

For $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , let $\chi_{f}$ denote the unique primitive quadratic character modulo $f$ , and

[TABLE]

Note that when $f$ is irreducible then this is, up to a positive factor, the difference between the number of irreducible square residues modulo $f$ of degree $n$ and those which are non-square residues. We also denote by $\mathcal{C}_{f}$ the hyperelliptic curve defined over ${\mathbb{F}}_{q}$ as the smooth projective model of the curve with affine equation $y^{2}=f(x)$ .

In [Kowalski2010], Kowalski showed that, in a precise quantitative sense (see formula (1.1) below), the $\mathrm{LI}$ hypothesis is generically true for the zeta functions of hyperelliptic curves of the form $\mathcal{C}_{g(x)(x-t)}$ , where $g\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ of even degree is fixed and $t\in{\mathbb{F}}_{q}$ is a parameter such that $g(t)\neq 0$ , as $q\to\infty$ . This implies that for most of the parameters $t$ , the counting function $\Pi(n;\chi_{g(x)(x-t)})$ is biased towards negative values and changes sign infinitely many times. This behavior is expected to hold for $\Pi(n;\chi_{f})$ generically among $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ because of $\mathrm{LI}$ .

Our main results are the following four bounds, which improve Kowalski’s result. The terms “complete bias”, “lower order bias”, and “reversed bias” are defined, respectively, in Definitions 2.2, 2.4, and 2.6.

Theorem 1.1.

Let $p$ be an odd prime number, $q$ a power of $p$ and $n\geq 1$ . We write $g=\left\lfloor\frac{n-1}{2}\right\rfloor$ and $A=2g^{2}+g+2$ .

(1)

We have

[TABLE]

where $1\geq\delta\underset{g\to+\infty}{\sim}\frac{1}{8g}$ . 2. (2)

If $q$ is a square then, we have

[TABLE]

and $\#\{f\in\mathcal{H}_{n}({\mathbb{F}}_{q})\mid\Pi(n;\chi_{f})\text{ exhibits a complete bias}\}=0$ otherwise. 3. (3)

We have

[TABLE] 4. (4)

We have

[TABLE]

where $1\geq\delta^{\prime}\underset{g\to+\infty}{\sim}\frac{7}{24g}$ .

To prove this theorem, we follow closely Kowalski’s method based on the large sieve for Frobenius developed in [Kowalskibook] (and improved by Perret-Gentil [Perret-GentilANT]). The theorem above should be compared to Kowalski’s bound (1.1), which we now state.

Theorem 1.2 ([Kowalski2010]*Proposition 1.1).

Let $g\geq 1$ be an integer, and let $f\in\mathbb{Z}[x]$ be a squarefree monic polynomial of degree $2g$ . Let $p$ be an odd prime such that $p$ does not divide the discriminant of $f$ , and let $U/{\mathbb{F}}_{p}$ be the open subset of the affine $t$ -line where $f(t)\neq 0$ . Consider the algebraic family $\mathcal{C}_{f}\to U$ of smooth projective hyperelliptic curves of genus $g$ given as the smooth projective models of the curves with affine equations

[TABLE]

Then for any extension ${\mathbb{F}}_{q}/{\mathbb{F}}_{p}$ we have

[TABLE]

where $A=2g^{2}+g+2$ and $1\geq\delta\underset{g\to+\infty}{\sim}\frac{1}{8g}$ .

Remark 1.3.

The bound stated in [Kowalski2010] is a bit larger, the exponent of $\log q$ is simply $1$ , but Kowalski gave this better exponent in [Kowalskibook]*Theorem 8.15, for the more general condition that the Galois group of the zeta function of $\mathcal{C}_{t}$ is not maximal. It is indeed more general since if there exists a non-trivial linear relation between $\pi$ and the arguments of the roots of the zeta function, hence a multiplicative relation between those roots, then its Galois group is not maximal since this relation cannot be preserved by every allowed permutations of the roots. However, note there is a typo in the bound stated in [Kowalskibook]*Theorem 8.15: the exponent there reads $1-\delta$ with $\delta\underset{g\to+\infty}{\sim}\frac{1}{4g}$ , coming from the larger contribution of $\delta_{2}\geq\frac{1}{4g}$ p.181, but we can actually only get $\delta_{2}\geq\frac{1}{8g}$ . The count is detailed in [Kowalski2006]*Lemma 7.3 iii) but the author is counting each symplectic polynomial with a given factorization twice, hence a missing $\tfrac{1}{2}$ factor. The proof of Lemma 7.7 fixes this.

The bounds in Theorem 1.1 improve Kowalski’s bound (1.1) in two aspects. First, the space of parameters is larger than Kowalski’s. While he obtains his bound for families of polynomials of a very specific shape, our bound applies to all monic squarefree polynomials of a given degree. It should be noted that our method would allow us to prove the same bounds as in Theorem 1.1 but along Kowalski’s family of curves in Theorem 1.2, independently of $p$ , by using the large sieve estimate [Kowalskibook]*Corollary 8.10 instead of Proposition 2.22 of this paper. Moreover, the exponents for $q$ in the bounds 1.1.2 and 1.1.3 are twice as small, while the exponent for $\log q$ in the last bound 1.1.4 is slightly better. Observe however that by passing to a multidimensional space of parameters, we lose the uniformity in $p$ in the bounds. Such a phenomenon was already present in [Kowalskibook]*Corollary 8.10 which results in a larger exponent of $q$ in the multidimensional case. In our case, the uniformity in $p$ is lost when applying the improved bound [Perret-GentilANT]*Theorem 5.14.(ii).(c).

For the first two properties considered in Theorem 1.1, inputs from arithmetic geometry give us better bounds for some restricted genera. Our first improvement is for genus $1$ or $2$ concerning the failure of $\mathrm{LI}$ .

Theorem 1.4.

Let $p\neq 2,3$ be a prime number, $q$ a power of $p$ and $3\leq n\leq 6$ . We write $g=\left\lfloor\frac{n-1}{2}\right\rfloor$ , so that $1\leq g\leq 2$ . When $g=1$ , we have

[TABLE]

When $g=2$ , then we have

[TABLE]

In particular, in this more restricted setting, these bounds improve on 1.1 1 and a fortiori on 1.1 4. Note that the result for genus at most two comes from the fact that we completely understand the Frobenius eigenvalues for genus $1$ and $2$ hyperelliptic curves over $\overline{\mathbb{F}}_{p}$ . The reason is that all smooth projective curves of genus at most two are hyperelliptic, and the Torelli image of $\mathcal{M}_{2}$ is dense in $\mathcal{A}_{2}$ . Neither of the facts holds for higher genus.

Our last result is a bound for the bias dealt with in Theorem 1.1 2 which is uniform in the degree, at the expense of being worse in terms of $q$ for small $g$ .

Theorem 1.5.

If $q=p^{e}$ is a fixed prime power with $2\mid e$ . Then,

[TABLE]

In particular, this bound is better than the second bound of Theorem 1.1 in terms of $q$ as soon as $g\geq 12$ . The underlying method coming from arithmetic geometry cannot deal with the conditions in 1.1 3 and 4 because they are concerned with multiple zeros of the zeta function of $\mathcal{C}_{f}$ at once.

Outline of the paper

In Section 2 we set the notation and give preliminary results used in the rest of the paper. In particular, section 2.3 states some results about linear recurrent sequences, and section 2.4 is devoted to the proof of a large sieve statement, which is one important step in the proof of Theorem 1.1. In Section 3 we give a proof of the first item of Theorem 1.1 following Kowalski’s method and Theorem 1.4 by elementary methods. In Section 4 we derive conditions for a complete bias and prove the second item of Theorem 1.1 with the large sieve for Frobenius and Theorem 1.5 with arithmetic geometry. In Section 5 and 6 we derive conditions for a lower order bias and a reversed bias respectively and we prove the last two items of Theorem 1.1. Finally, in Section 7 we gather counting lemmas obtained using our large sieve result Proposition 2.22 that are used in the proofs of the different parts of Theorem 1.1.

Acknowledgements

This work was partially supported by the grant KAW 2019.0517 from the Knut and Alice Wallenberg Foundation (for LD). Part of this work was conducted while WL was in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring 2023 semester. The authors thank Florent Jouve, Jordan Ellenberg and Emmanuel Kowalski for helpful discussions. They also thank Régis de la Bretèche for organizing the elementary and analytic number theory seminar in IHP, Paris, where ideas used in this paper were born.

2. Preliminary results and notations

2.1. Notations and Definitions

We first provide notations for the rest of the paper. When $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , the projective curve with affine model $y^{2}=f(x)$ is denoted by $\mathcal{C}_{f}$ . Recall that $\mathcal{C}_{f}$ has genus $g=\left\lfloor\frac{n-1}{2}\right\rfloor$ .

For $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , let $\chi_{f}$ be the primitive quadratic character modulo $f$ . We want to compare the number of degree $n$ irreducible polynomials $P$ over ${\mathbb{F}}_{q}$ such that $\chi_{f}(P)=1$ and those such that $\chi_{f}(P)=-1$ for varying $n$ . Define the Dirichlet $L$ -function associated to a Dirichlet character $\chi$ modulo $f$ as

[TABLE]

where $\lvert a\rvert=q^{\deg a}$ and the sum and product above range over monic (resp. irreducible) polynomials of ${\mathbb{F}}_{q}[x]$ .

We now recall some properties of the $L$ -functions under consideration; see e.g. [Rosen2002]*Proposition 4.3, Theorem 5.9 for details. For a non-principal Dirichlet character $\chi$ , the Dirichlet $L$ -function $L(s,\chi)$ is a polynomial $\mathcal{L}(u,\chi)$ in $u:=q^{-s}$ with integer coefficients and the zeta function of $\mathcal{C}_{f}$ is a rational function in $u$ , which we denote by $\zeta(\mathcal{C}_{f},u)=\frac{Z_{f}(u)}{(1-u)(1-qu)}$ . Thanks to the deep work of Weil [Weil_RH], we know the analogue of the Riemann Hypothesis is satisfied for these zeta functions, that is their inverse zeros have absolute value $\sqrt{q}$ . When $n$ is odd, then $\mathcal{L}(u,\chi_{f})=Z_{f}(u)$ , and when $n$ is even, we have $\mathcal{L}(u,\chi_{f})=Z_{f}(u)(1-u)$ . In the following, we will mostly use the reciprocal polynomial

[TABLE]

which is monic, and its roots are the inverse zeros of $Z_{f}$ .

In the following, we denote by $\alpha_{j}(\chi)=\sqrt{q}e^{i\theta_{j}(\chi)}$ the distinct inverse zeros of $\mathcal{L}(u,\chi)$ of norm $\sqrt{q}$ , with multiplicity $m_{\theta_{j}}(\chi)$ . We might forget the dependency in the character $\chi$ when only one character is considered and the notation stays clear from the context. We let $r$ be the number of distinct pairs of conjugate non-real zeros of $\mathcal{L}(u,\chi_{f})$ . Since $\mathcal{L}(u,\chi_{f})$ has real coefficients, after reordering, we can assume $\theta_{j+r}(\chi_{f})=-\theta_{j}(\chi_{f})$ and we have $m_{\theta_{j}}(\chi_{f})=m_{-\theta_{j}}(\chi_{f})$ for $1\leq j\leq r$ . Since $\chi_{f}$ is primitive, we have

[TABLE]

Using the explicit formula in [Cha2008]*Proposition 4.2, our object of study is the function

[TABLE]

Let $\Delta_{f}(n)$ be the opposite of the main sum of $\Pi(n;\chi_{f})$ in (2.1); that is

[TABLE]

In the case the set $\{\theta_{1}(\chi_{f}),\dots,\theta_{r}(\chi_{f})\}\cup\{\pi\}$ is linearly independent over $\mathbb{Q}$ , which is expected to be the generic case, then $\Delta_{f}(n)-\big{(}m_{0}(\chi_{f})+\tfrac{1}{2}\big{)}$ oscillates around zero and takes positive (resp. negative) values half of the time (i.e., for $50\%$ of positive integers $n$ ). Thus, $\Delta_{f}$ is larger (resp. smaller) than its mean value $m_{0}(\chi_{f})+\tfrac{1}{2}$ for half of the positive integers $n$ . One deduces (see [Cha2008]*page 1366) that there is a bias in the distribution of the values of $\Delta_{f}$ in the direction of positive values, i.e. coming back to $\Pi(n;\chi_{f})$ we expect a bias towards negative values. Or in other terms, there are in general more irreducible polynomials $P$ of degree $n$ with $\chi_{f}(P)=-1$ than with $\chi_{f}(P)=1$ .

Now, it can happen that the oscillating part does not distribute so well between positive and negative values. This is the case in the examples given in [Cha2008]*Section 5 and also for the different kinds of behaviors we consider in this paper.

Remark 2.1.

In this paper, we are studying the summatory function of a quadratic character over irreducible polynomials. Another “prime number race” of interest is the one between quadratic residues and non-quadratic residues. Observe that these are the same in the case $f$ is irreducible. In the case $f$ is not irreducible, one has to take into account the contribution of all quadratic (non-necessarily primitive) characters modulo $f$ , which makes the study more difficult. The general formula proved in [DevinMeng]*Proposition 5.2 is

[TABLE]

where $\square$ denotes the set of quadratic residues modulo $f$ , $\boxtimes$ denotes the set of non-quadratic residues modulo $f$ and $X_{f}^{\text{quad}}$ is the set of quadratic characters modulo $f$ .

For a given $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , we define three kinds of “exceptional biases” as follows.

Definition 2.2.

[Complete bias] We say that $\Pi(n,\chi_{f})$ exhibits a complete bias if $\Delta_{f}(n)>0$ for almost all $n$ . That is,

[TABLE]

Remark 2.3.

[ $\Pi(n,\chi_{f})$ vs. $\Delta(n,\chi_{f})$ ] In particular, if $\Pi(n,\chi_{f})$ exhibits a complete bias, then $\mathrm{dens}(\Pi(n,\chi_{f})<0)$ exists and is equal to $1$ , but the converse need not hold. Note that the above definition may not cover all the cases for which $\overline{\mathrm{dens}}(\Pi(n,\chi_{f})<0)=1$ : it may happen that $\Delta_{f}(n)=0$ for a positive proportion of $n$ and then for those $n$ , the sign of $\Pi(n,\chi_{f})$ is determined by the sign of the error term $O_{f}\left(q^{-\frac{n}{6}}\right)$ and necessitate further study. In the next definition, we define the case of “lower order bias” below to characterize this possibility.

Definition 2.4.

[Lower order bias] We say that $\Pi(n,\chi_{f})$ exhibits a lower order bias if $\Delta_{f}(n)=0$ for a positive proportion of $n$ . That is,

[TABLE]

Remark 2.5.

The condition of having a lower order bias is close to the condition “ties have positive density”, as introduced by Martin and Ng in [MartinNg2020] in the context of prime number races.

Finally, the last type of exceptional bias we are going to study is a direct incompatibility with the expectation that $\Pi(n,\chi_{f})$ is negative for more than $50\%$ of integers $n$ .

Definition 2.6.

[Reversed bias] We say that $\Pi(n,\chi_{f})$ exhibits a reversed bias if $\Delta_{f}(n)<0$ for more than half of the $n$ . That is,

[TABLE]

Remark 2.7.

(1)

In Section 2.3, we will show the three densities in Definitions 2.2,2.4,2.6 exist, see Corollaries 2.15 and 2.17. 3. (2)

Note that both a lower order bias and a reversed bias may occur simultaneously, but that is the only possible combination of two exceptional biases.

Remark 2.8.

Observe that we could also (as in [Cha2008, DevinMeng]) count irreducible polynomials of degree $\leq n$ instead of degree $=n$ . In this case, the functions replacing $\Pi(n;f,\square,\boxtimes)$ and $\Pi(n,\chi_{f})$ take the following more complicated forms:

[TABLE]

where the sums are over $\{h\in{\mathbb{F}}_{q}[x]\mid h\text{ monic irreducible, }\deg{h}\leq n\}$ .

We cannot adapt most of our proofs for those quantities. For instance, the maximal value of such a sum is not easy to determine, and we’ll make frequent use of the maximum values in Section 4.1 (e.g. the proof of Proposition 4.2 to see why maximal values are relevant to us). However, we have for example $\Delta(\leq n;f,\square,\boxtimes)=\Delta(n;f,\square,\boxtimes)+O\left(\frac{\sum_{\theta}m_{\theta}(\chi_{f})}{\sqrt{q}}\right)$ , where $\Delta(\leq n;f,\square,\boxtimes)$ represents the main sum in $\Pi(\leq n;f,\square,\boxtimes)$ above, and so if $q$ is large enough compared to $\sum_{\theta}m_{\theta}(\chi_{f})$ , the sign of $\Delta(\leq n;f,\square,\boxtimes)$ is the sign of $\Delta(n;f,\square,\boxtimes)$ . In particular, under the right conditions, a complete bias and a reversed bias in the “degree $=n$ ” setting one gets from studying $\Pi(n;\chi_{f}$ ), implies a similar bias in the “degree $\leq n$ ” setting one gets from studying $\Pi(\leq n;\chi_{f}$ ). Note also that the difference between counting irreducible polynomials of degree equal to $n$ and counting those of degree at most $n$ is analogous to the difference between counting prime number in intervals of the form $[X,2X]$ and those in intervals of the form $[2,X]$ .

2.2. Properties of limiting distributions

To study the densities involved in the definitions 2.2, 2.4, and 2.6, we will use the notion of limiting distribution, which we define as follows.

Definition 2.9.

Let $D:\mathbb{N}\rightarrow\mathbb{R}$ be a real function, we say that $D$ admits a limiting distribution if there exists a probability measure $\mu$ on Borel sets in $\mathbb{R}$ such that for any bounded continuous function $h$ on $\mathbb{R}$ , we have

[TABLE]

We call $\mu$ the limiting distribution of the function $D$ .

The function $\Delta_{f}$ defined as Equation 2.3 is quasi-periodic, and we can apply the Kronecker-Weyl equidistribution theorem (see e.g. [Hum]*Lemma 2.7 and [Bailleul2]*Theorem 2.2) to prove the following proposition ([DevinMeng]*Proposition 2.1).

Proposition 2.10.

The function $\Delta_{f}$ admits a limiting distribution $\mu_{\Delta_{f}}$ with mean value $m_{0}(\chi_{f})+\frac{1}{2}$ and variance

[TABLE]

Moreover, the measure $\mu_{\Delta_{f}}$ has support in

[TABLE]

The next lemma will be used to study reversed bias.

Lemma 2.11.

The distribution $\mu_{\Delta_{f}}$ in Proposition 2.10 is symmetric with respect to $m_{0}(\chi_{f})+\frac{1}{2}$ if and only if there is no relation

[TABLE]

with $k_{0},\dots,k_{r}\in\mathbb{Z}$ and $k_{0}+\sum_{j=1}^{r}k_{j}\equiv 1\mod 2$ .

Proof.

Denote by $A(\Delta_{f})$ the closure of the $1$ -parameter group $H:=\{n(\pi,\theta_{1},\ldots,\theta_{r}):n\in\mathbb{Z}\}/(2\pi\mathbb{Z})^{r+1}$ in the $(r+1)$ -dimensional torus $\mathbb{T}^{r+1}:=(\mathbb{R}/2\pi\mathbb{Z})^{r+1}$ . We first remark that by Pontryagin duality, for any $\underline{z}\in\mathbb{T}^{r+1}$ , $\underline{z}\in A(\Delta_{f})$ if and only if for every character $\underline{k}=(k_{0},\dots,k_{r})\in H^{\bot}\subset\mathbb{Z}^{r+1}$ , one has $\underline{k}(\underline{z})=k_{0}z_{0}+\dots+k_{r}z_{r}=0$ . Therefore, we just need to show that $\mu_{\Delta_{f}}$ is symmetric with respect to $m_{0}(\chi_{f})+\tfrac{1}{2}$ if and only if $(\pi,\dots,\pi)\in A(\Delta_{f})$ , since $\underline{k}(\pi,\dots,\pi)=0$ if and only if $\sum_{i=0}^{r}k_{i}$ is even.

By the Kronecker–Weyl Equidistribution Theorem (see for example [DevinMeng]*Lemma 2.2), $A(\Delta_{f})$ is a subtorus of $\mathbb{T}^{r}$ and we have, for any continuous function $h:\mathbb{T}^{r}\rightarrow\mathbb{C}$ ,

[TABLE]

where $\omega_{A(\Delta_{f})}$ is the normalized Haar measure on $A(\Delta_{f})$ . Then $\mu_{\Delta_{f}}$ is the push-forward measure of $\omega_{A(\Delta_{f})}$ through

[TABLE]

for any bounded continuous functions $h:{\mathbb{R}}\to{\mathbb{R}}$ .

Now, $\mu_{\Delta_{f}}$ is symmetric with respect to $m_{0}(\chi_{f})+\frac{1}{2}$ if and only if, for every continuous function $h$ , one has

[TABLE]

Observe that

[TABLE]

So, if $(\pi,\dots,\pi)\in{A(\Delta_{f})}$ , using the fact that the Haar measure is translation-invariant, we deduce that $\mu_{\Delta_{f}}$ is symmetric with respect to $m_{0}(\chi_{f})+\frac{1}{2}$ .

On the other hand, assume $(\pi,\dots,\pi)\notin A(\Delta_{f})$ . Then as $A(\Delta_{f})$ is closed, and $m_{\pi}(\chi_{f}),m_{\theta_{j}}(\chi_{f})\geq 0$ there exists $\epsilon>0$ such that for each $a\in A(\Delta_{f})$ one has111See Lemma 4.8 for an explicit bound.

[TABLE]

Let $h_{\epsilon}$ be a non-zero, non-negative function, supported in an interval of length $\epsilon$ around $m_{0}(\chi_{f})-m_{\pi}(\chi_{f})-2\sum_{j=1}^{r}m_{\theta_{j}}(\chi_{f})$ . Then

[TABLE]

while

[TABLE]

In particular, we deduce that $\mu_{\Delta_{f}}$ is not symmetric with respect to $m_{0}(\chi_{f})$ . ∎

2.3. Results about linear recurrence sequences

We are interested in the positivity and zero-sets of the quantities $\Delta_{f}(n)$ defined in 2.3. One of the key insight is that those quantities are linear recurrence sequences which will imply the limits in Definitions 2.2, 2.4, and 2.6 exist as shown in Corollaries 2.15, 2.17.

Definition 2.12.

A linear recurrence sequence of order $k\in\mathbb{Z}_{>0}$ is a sequence $(a_{n})_{n\in\mathbb{N}}$ such that there exist $u_{0},\dots,u_{k-1}\in{\mathbb{C}}$ satisfying

[TABLE]

for all $n\in\mathbb{N}$ . We define its zero-set as $\{n\in\mathbb{Z}_{>0}\mid a_{n}=0\}$ .

It is classical that any linear recurrence sequence can be expressed in a generalized power sum form and that, conversely, any generalized power sum satisfies a linear recurrence relation.

Lemma 2.13.

Let $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , then the sequence $\Delta_{f}$ is a linear recurrence sequence.

Proof.

Let $P_{f}$ be the reversed zeta function of the curve $\mathcal{C}_{f}:y^{2}=f(x)$ with $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , and let $\chi_{f}$ be the primitive quadratic character modulo $f$ and $g$ be the genus of $\mathcal{C}_{f}$ . The roots of $P_{f}$ are $\alpha_{1},\dots,\alpha_{2g}$ which are of the form $\sqrt{q}e^{i\theta_{i}(\chi_{f})}$ with some of them possibly $\pm\sqrt{q}$ . Then, the conclusion follows from [RecSeq]*page 3. ∎

Note that Lemma 2.13 is a well-known fact that follows directly from the rationality of the $L$ -function. It is not a particularity of hyperelliptic curves. We stated and proved the result here, as this is the first time it is used in the context of studying Chebyshev’s bias.

It turns out one can characterize the zero-set of such a linear recurrence sequence following the Skolem-Mahler-Lech theorem, which is stated below. A very short proof over ${\mathbb{Q}}$ (the Skolem case), which is the case of interest for us, using $p$ -adic analysis, is given in [RecSeq]*Theorem 2.1.

Theorem 2.14 (Skolem-Mahler-Lech, [RecSeq]*Theorem 2.1).

Assume $(a_{n})_{n\in\mathbb{N}}$ is a linear recurrence sequence over a field of characteristic zero. Then its zero-set is the union of a finite set and a finite number of arithmetic progressions.

This allows us to show that the density in the Definition 2.4 of a lower order bias always exists.

Corollary 2.15.

The density $\mathrm{dens}(\Delta_{f}(n)=0)$ in Definition 2.4 for lower order bias exists.

Proof.

By Lemma 2.13, $\Delta_{f}$ is a linear recurrence sequence. Its zero-set is a finite union of arithmetic progressions and a finite set following Theorem 2.14, therefore it admits a natural density. ∎

Another useful fact is the following result, showing that the densities considered for complete biases and reversed biases exist.

Theorem 2.16 ([BeGer]*Theorem 1).

Let $(a_{n})_{n\in\mathbb{N}}$ be a linear recurrence sequence of real numbers. Then its positivity set $\{n\in\mathbb{N}\mid a_{n}>0\}$ admits a natural density.

Corollary 2.17.

The densities $\mathrm{dens}(\Delta_{f}>0)$ and $\mathrm{dens}(\Delta_{f}<0)$ in Definitions 2.2 and 2.6 exist.

For certain kinds of linear recurrence sequences, called non-degenerate linear recurrence sequences, we know their zero-sets are finite. We introduce the following more general terminology for the character $\chi_{f}$ inspired by [RecSeq]*Section 1.1.9 because it will be an important condition to study in the proofs of (3) and (4) in Theorem 1.1.

Definition 2.18.

We say that $\chi_{f}$ is non-degenerate when none of $\frac{\alpha_{i}}{\alpha_{j}}$ , for $1\leq i\neq j\leq r$ , and none of $\frac{\overline{\alpha_{i}}}{\alpha_{j}}$ , for $1\leq i,j\leq r$ , is a root of unity.

Using Definition 2.18, we prove the following Lemma which will be of important use in the study of lower order bias in Section 5.

Lemma 2.19.

Assume $\chi_{f}$ is non-degenerate as in Definition 2.18. Then the zero-set of $\Delta_{f}(n)$ is finite.

Proof.

By [RecSeq]*page 25, a non-degenerate linear recurrence sequence, that is, a sequence whose characteristic roots $\beta_{1},\dots,\beta_{d}$ satisfy that no $\frac{\beta_{i}}{\beta_{j}}$ is a root of unity for $i\neq j$ , takes a given value only finitely many times. In our case however, the characteristic roots are $\frac{\alpha_{1}}{\sqrt{q}},\dots,\frac{\alpha_{r}}{\sqrt{q}},\frac{\overline{\alpha_{1}}}{\sqrt{q}},\dots,\frac{\overline{\alpha_{r}}}{\sqrt{q}}$ , but also $1$ and $-1$ because of the terms $m_{0}(\chi_{f})+\frac{1}{2}$ and $\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)(-1)^{n}$ in $\Delta_{f}(n)$ , and obviously $\frac{1}{-1}$ is a root of unity. But it is easily seen that $\Big{(}\Delta_{f}(2n)-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)-\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)\Big{)}_{n\geq 0}$ and $\Big{(}\Delta_{f}(2n+1)-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)+\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)\Big{)}_{n\geq 0}$ are linear recurrence sequences ([RecSeq]*Theorem 1.1 and [RecSeq]*Theorem 1.3), and when $\chi_{f}$ is non-degenerate according to Definition 2.18, then those are non-degenerate as linear recurrence sequences. In particular, they respectively take the values $-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)-\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)$ and $-\left(m_{0}(\chi_{f})+\frac{1}{2}\right)+\left(m_{\pi}(\chi_{f})+\frac{1}{2}\right)$ a finite number of times, which proves that $\Delta_{f}(n)$ vanishes a finite number of times. ∎

Remark 2.20.

In the non-degenerate case, we could replace the densities in Definitions 2.2 and 2.6 by the corresponding densities for $\Pi(n;\chi_{f})$ since they exist and coincide with the ones about $\Delta_{f}$ in that case following the fact that the density $\mathrm{dens}(\Delta_{f}(n)=0)$ in Definition 2.4 is zero.

2.4. A large sieve statement

Let $\operatorname{CSp}_{2g}({\mathbb{F}}_{\ell})$ be the group of symplectic similitudes222This is sometimes called the general symplectic group and denoted as GSp in $\operatorname{GL}_{2g}({\mathbb{F}}_{\ell})$ . It contains matrices $M\in\operatorname{GL}_{2g}({\mathbb{F}}_{\ell})$ such that there exists a scalar $m\in{\mathbb{F}}_{\ell}^{*}$ , called the multiplicator of $M$ , satisfying $M^{\top}JM=mJ$ with $J=\begin{pmatrix}0&I_{g}\\ -I_{g}&0\end{pmatrix}.$ When $M$ is a symplectic similitude with multiplicator $m$ , we say that $M$ is $m$ -symplectic. In this paper, following [Kowalskibook]*page 158 but with a reversed convention, we call $m$ -symplectic, any monic polynomial $P$ of even degree $2g$ satisfying

[TABLE]

In particular, for $f\in\mathcal{H}_{n}(\mathbb{F}_{q})$ the polynomial $P_{f}$ as defined in (2.1) is $q$ -symplectic.

Let us first state the result Theorem 2.21 for a general setting, using Perret-Gentil’s improvement of Kowalski’s large sieve for Frobenius [Perret-GentilANT]*Theorem 5.14.(ii).(c) and later apply it to our setting in Proposition 2.22.

The theorem is given for a general $U/\mathbb{F}_{p}$ smooth affine geometrically connected algebraic variety of dimension $d\geq 1$ over $\mathbb{F}_{p}$ . We assume that $U$ has a compactification where it is the complement of a divisor with normal crossing. We denote by $\bar{\eta}$ a geometric generic point of $U$ .

Let us fix $\Lambda$ a set of primes different from $2$ and $p$ of density $1$ . We study a family $\mathcal{F}_{\ell}$ of lisse sheaves of $\mathbb{F}_{\ell}$ -vector spaces on $U$ , corresponding to continuous homomorphisms $\rho_{\ell}:\pi_{1}(U,\bar{\eta})\rightarrow\operatorname{GL}_{r}(\mathbb{F}_{\ell})$ , for $\ell\in\Lambda$ that arise from a compatible system (as in [Kowalskibook]*Definition 8.7). Then for $\ell\in\Lambda$ , we denote $G_{\ell}=\rho_{\ell}(\pi_{1}(U,\bar{\eta}))$ and $G_{\ell}^{\mathrm{geo}}=\rho_{\ell}(\pi_{1}(U_{\overline{\mathbb{F}}_{q}},\bar{\eta}))$ .

Theorem 2.21.

Let $p$ be a prime number and $q>1$ be a power of $p$ . For each $\ell\in\Lambda$ fix $\Omega_{\ell}\subset G_{\ell}$ a conjugacy invariant subset, in the coset $\rho_{\ell}(\operatorname{Frob}_{f,q})G_{\ell}^{\mathrm{geo}}$ .

Then, for any $L\geq 1$ and for any $q$ which is a power of $p$ , one has

[TABLE]

with $C=C(U_{\overline{\mathbb{F}}_{q}},\{\rho_{\ell}\}_{\ell\in\Lambda})$ a constant that depends only on $U_{\overline{\mathbb{F}}_{q}}$ and on the family $\{\rho_{\ell}\}_{\ell\in\Lambda}$ (in particular not on $q$ , but certainly on $d$ ),

[TABLE]

where $\mathcal{L}$ is the set of squarefree integers whose prime factors are all in $\Lambda$ , $\psi(m):=\prod_{\ell\mid m}(\ell+1)$ , and when $G_{\ell}^{\mathrm{geo}}=\operatorname{Sp}(2g,\mathbb{F}_{\ell})$ one can take $A=2g^{2}+g+2$ .

Proof.

We are in the setting of [Kowalskibook]*Chapter 8, following the ideas and notations of loc. cit. It follows from [Kowalskibook]*Proposition 2.9 as in [Kowalskibook]*Corollary 8.10 that

[TABLE]

where $H$ is as defined in (2.4) and $\Delta$ is the large sieve constant. As in the proof of [Kowalskibook]*Proposition 8.8 we obtain that

[TABLE]

with

[TABLE]

where $\mathcal{W}(\pi,\tau)$ is the lisse sheaf corresponding to the representation $[\pi,\bar{\tau}]$ as defined in [Kowalskibook]*(3.8), and $\sigma^{\prime}_{c}$ is the sum of all except the largest Betti numbers as defined in [Kowalskibook]*page 166. In [Perret-GentilANT]*Section 5D2, Perret-Gentil improves the bound on $\sigma^{\prime}_{c}(U_{\overline{\mathbb{F}}_{q}},\mathcal{W}(\pi,\tau))$ compared to the bound of [Kowalskibook]*Proposition 8.8 in the case of the complement of a divisor with normal crossing. He obtains

[TABLE]

where the implicit constant depends on $U_{\overline{\mathbb{F}}_{q}}$ and on the family $\{\rho_{\ell}\}_{\ell\in\Lambda}$ (in particular not on $q$ , but certainly on $d$ and on $p$ ). Thus, we have

[TABLE]

To conclude, we use [Kowalskibook]*(8.13), and multiplicativity. In particular, representations of $\operatorname{Sp}(2g,\mathbb{F}_{\ell})$ satisfy $\dim\pi\leq(\ell+1)^{g^{2}}$ and $\sum_{\pi\in\Pi_{\ell}^{*}}\dim\pi\leq(\ell+1)^{g^{2}+g+1}$ . ∎

To improve on Kowalski’s bound (1.1) in Theorem 1.2, we are going to use the following large sieve result which follows from Theorem 2.21 applied to the variety of configurations, with the compatible system given by the action of the Frobenius.

Proposition 2.22.

Let $p$ be a prime number and $q>1$ be a power of $p$ . Let $n\geq 2$ , $\mathscr{H}_{n}$ be the configuration space of monic squarefree polynomials of degree $n$ and $\Lambda$ be the set of primes different from $2$ and $p$ .

*For each $\ell\in\Lambda$ , the action of the Frobenius endomorphism $\operatorname{Frob}_{f,q}$ on $\textup{H}^{1}_{\text{\'{e}t}}(\mathcal{C}_{f},\mathbb{Z}_{\ell})$ gives a representation $\rho_{\ell}:\pi_{1}(\mathscr{H}_{n},\bar{\eta})\rightarrow\operatorname{GL}_{2g}({\mathbb{F}}_{\ell})$ for $\bar{\eta}$ a geometric generic point and for all $\ell\in\Lambda$ they form a compatible system (as in [Kowalskibook]Definition 8.7), with image equal to the set of $q$ -symplectic similitudes following the work of Hall [Hall].

For every $\ell\in\Lambda$ , let $\Omega_{\ell}\subset\operatorname{CSp}_{2g}({\mathbb{F}}_{\ell})$ be a conjugacy invariant subset such that the multiplicator of every element of $\Omega_{\ell}$ is $q$ .

Then, one has

[TABLE]

where the implicit constant depends only on $n$ and $p$ , we can take $A=2g^{2}+g+2$ , $\mathcal{L}$ is the set of squarefree integers whose prime factors are all in $\Lambda$ , and $\psi(m)=\prod_{\ell\mid m}(\ell+1)$ .

Proof.

We are in the setting of Theorem 2.21 with $U=\mathscr{H}_{n}$ of dimension $n\geq 2$ . The variety $\mathscr{H}_{n}\subset\mathbb{A}^{n}$ is defined by the non-vanishing of the discriminant, it is thus a smooth affine geometrically connected algebraic variety which is the complement of a divisor with normal crossing ([EVW]*Lemma 7.6).

As in [Kowalskibook]*Section 8.6 for each $\ell\neq 2,p$ , the sheaf $\mathcal{F}_{\ell}$ corresponding to $\rho_{\ell}$ is a rank $2g$ lisse sheaf of ${\mathbb{F}}_{\ell}$ -modules on $\mathscr{H}_{n}$ . Since the action of the Frobenius on $H^{1}(C,\mathbb{Z}_{\ell})$ is independent of $\ell$ , the representations $\rho_{\ell}$ arise from a compatible system. By [Hall]*Theorem 1.2 (attributed to Yu), the images of $\pi_{1}(\mathscr{H}_{n},\bar{\eta})$ and of $\pi_{1}(\overline{\mathscr{H}}_{n},\bar{\eta})$ (arithmetic and geometric monodromy groups) are conjugate to $\operatorname{Sp}_{2g}({\mathbb{F}}_{\ell})$ for all $\ell\neq 2,p$ .

Hence, the bound follows from Theorem 2.21, where we chose $L+1=q^{\frac{1}{2A}}$ . ∎

Remark 2.23.

Note that for any finite set of primes $S$ , the result of Proposition 2.22 holds with the set $\Lambda$ replaced by $\Lambda^{\prime}=\Lambda\setminus S$ , and the set $\mathcal{L}$ replaced by the set $\mathcal{L}^{\prime}$ of squarefree integers with prime factors in $\Lambda^{\prime}$ . This is used in the proof of Lemma 7.5.

3. Linear dependence

Kowalski’s Theorem 1.2 is concerned with one-parameter families of reducible squarefree polynomials. The large sieve result Proposition 2.22 above allows us, following Kowalski’s proof in [Kowalskibook], to get the exact same bound, but for the larger space of parameters $\mathcal{H}_{n}({\mathbb{F}}_{q})$ .

Proof of Theorem 1.1.1..

We follow exactly the proof of [Kowalskibook]*Theorem 8.15 but instead of using [Kowalskibook]*Corollary 8.10, we use Proposition 2.22. Thus, we obtain

[TABLE]

where for $i=1,\dots 4$ ,

[TABLE]

and the sets $\Omega_{i,\ell}$ are defined as in [Kowalskibook]*pages 179–180. In particular,

(1)

$\Omega_{1,\ell}$ is the set of matrices $M\in\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ with multiplicator $q$ such that $\chi_{M}(X)$ is irreducible, and [Kowalskibook]*page 181 gives $\frac{\lvert\Omega_{1,\ell}\rvert}{\lvert\operatorname{Sp}({\mathbb{F}}_{\ell})\rvert}\geq\frac{1}{2g}$ . 2. (2)

$\Omega_{2,\ell}$ is the set of matrices $M\in\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ with multiplicator $q$ such that $\chi_{M}(X)$ factors as a product of an irreducible quadratic polynomial and a product of irreducible polynomials of odd degree, which satisfy333a factor $\frac{1}{2}$ was forgotten in [Kowalskibook]*page 181. $\frac{\lvert\Omega_{2,\ell}\rvert}{\lvert\operatorname{Sp}({\mathbb{F}}_{\ell})\rvert}\geq\frac{1}{8g}$ by Lemma 7.7 (with $k=1$ , $n_{0}=1$ , $n_{\frac{g-3}{2}}=1$ in the case $g$ is odd) and [Kowalskibook]*Lemma B.5. 3. (3)

$\Omega_{3,\ell}$ is the set of matrices $M\in\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ with multiplicator $q$ such that the polynomial $h$ defined by $\chi_{M}(X)=X^{g}h(X+qX^{-1})$ factors as a product of an irreducible quadratic polynomial and a product of irreducible polynomials of odd degree, and [Kowalskibook]*page 181 gives $\frac{\lvert\Omega_{3,\ell}\rvert}{\lvert\operatorname{Sp}({\mathbb{F}}_{\ell})\rvert}\underset{g\to+\infty}{\sim}\frac{\log 2}{\log g}$ . 4. (4)

$\Omega_{4,\ell}$ is the set of matrices $M\in\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ with multiplicator $q$ such that the polynomial $h$ defined by $\chi_{M}(X)=X^{g}h(X+qX^{-1})$ has an irreducible factor of prime degree $>\tfrac{g}{2}$ , and [Kowalskibook]*page 181 gives $\frac{\lvert\Omega_{4,\ell}\rvert}{\lvert\operatorname{Sp}({\mathbb{F}}_{\ell})\rvert}\underset{g\to+\infty}{\sim}\frac{1}{\sqrt{2\pi g}}$ .

The final bound is the same (correcting $\delta_{2}\geq(4g)^{-1}$ into $\delta_{2}\geq(8g)^{-1}$ ), but the space of parameters $\mathcal{H}_{n}({\mathbb{F}}_{q})$ is larger. The dependency on $p$ is lost in the proof of Theorem 2.21.∎

To prove Theorem 1.4 for the genus $2$ case, we will use the following result of Ahmadi and Shparlinski.

Theorem 3.1 ([AhmadiShpar]*Theorem 2).

Let $\mathcal{C}$ be a smooth projective curve of genus $2$ . If the Jacobian of $\mathcal{C}$ is absolutely simple, then the zeta function of $\mathcal{C}$ satisfies $\mathrm{LI}$ .

Proof of Theorem 1.4..

Let us first prove the bound when $g=1$ and assume for now that $\deg f=3$ . Then $\mathcal{C}_{f}$ is an elliptic curve, with two conjugate (possibly equal) Frobenius eigenvalues. The only way for $\mathrm{LI}$ to fail is that those eigenvalues are of the form $\sqrt{q}\zeta$ with $\zeta$ a root of unity, that is, $\mathcal{C}_{f}$ has to be a supersingular elliptic curve. By [Silverman]*V Theorem 4.1.(c), there are $\ll p$ such curves over ${\mathbb{F}}_{q}$ , up to $\overline{{\mathbb{F}}}_{q}$ -isomorphism (recall that $q$ is a power of the prime number $p$ ). But two elliptic curves are isomorphic over $\overline{{\mathbb{F}}}_{q}$ if and only if they have the same $j$ -invariant ([Silverman]*III Proposition 1.4.(b) which holds in every characteristic). Let $E$ be a fixed supersingular elliptic curve defined over ${\mathbb{F}}_{q}$ with $j$ -invariant $j$ , and let us write $j(f)$ the $j$ -invariant of the elliptic curve $\mathcal{C}_{f}$ . Then clearly $j(f)=j$ is a non-zero polynomial equation in the $\deg f$ coefficients of $f$ by the definition of the $j$ -invariant [Silverman]*page 42. It is indeed non-zero since there always exist a non-supersingular elliptic curve over ${\mathbb{F}}_{q}$ ([Waterhouse]*Theorem 4.1). In particular, one has

[TABLE]

This yields

[TABLE]

and the result follows since in general $|\mathcal{H}_{n}({\mathbb{F}}_{q})|=q^{n}-q^{n-1}$ . In the case where $\deg f=4$ we assume that $p\neq 2,3$ . Then $\mathcal{C}_{f}$ is isomorphic to its Jacobian $J_{f}$ , and by [Cremona]*page 82, $J_{f}$ is given as the smooth projective model of the curve defined by the equation $y^{2}=x^{3}-27Ix-27J$ , and $I$ and $J$ are the quartic invariants defined in [Cremona]*pages 72–73. The $j$ -invariant of $J_{f}$ is then clearly a non-constant rational function in the coefficients of $f$ , and we conclude as in the case $\deg f=3$ .

Assume now that $g=2$ . By Theorem 3.1, if $\mathrm{LI}$ fails for the zeta function of $\mathcal{C}_{f}$ , then its Jacobian $J_{f}$ is not absolutely simple, i.e. it splits over a finite extension $\mathbb{K}$ of ${\mathbb{F}}_{q}$ . In particular, the Weil polynomial $W_{f,\mathbb{K}}$ of $J_{f}/\mathbb{K}$ is reducible. Calling $d$ the degree $[\mathbb{K}:{\mathbb{F}}_{q}]$ , one has $W_{f,\mathbb{K}}(X^{d})=\prod_{k=0}^{d-1}W_{f}(\zeta_{d}^{k}X)=\prod_{k=0}^{d-1}P_{f}(\zeta_{d}^{k}X)$ , where $W_{f}$ is the Weil polynomial of $J_{f}/{\mathbb{F}}_{q}$ , which is equal to $P_{f}$ ([CorSil]*VII. Corollary 11.4), and $\zeta_{d}$ is a primitive $d$ -th root of unity. It easily implies that $W_{f,\mathbb{K}}$ has roots $\alpha_{j}(\chi_{f})^{d}$ , $j\in\{1,\dots,4\}$ . Now, there are two possible cases. Either one of $\alpha_{i}(\chi_{f})^{d}$ is a rational number (necessarily $\pm q^{d/2}$ ), or there are two indices $i\neq j\in\{1,\dots,4\}$ such that $\alpha_{i}(\chi_{f})^{d}\alpha_{j}(\chi_{f})^{d}$ is a rational number (necessarily $\pm q^{d}$ ). In particular, $\chi_{f}$ is degenerate according to Definition 2.18. We conclude by Lemma 7.3. ∎

4. Complete biases

4.1. Upper bounds for complete biases

To derive a necessary condition for exhibiting a complete bias, we will use the following simple inequality of Bhatia and Davis [BhatiaDavis]*Theorem 1 (the proof in [BhatiaDavis] is done for discrete random variables, but the general case works exactly the same).

Theorem 4.1 (Bhatia-Davis Inequality).

Let $X$ be a bounded random variable such that $a\leq X\leq b$ almost-surely with mean $\mu$ and variance $\sigma^{2}$ , then

[TABLE]

Proposition 4.2 (Necessary condition for complete bias).

Let $f\in{\mathbb{F}}_{q}[x]$ and assume that $\Pi(n;\chi_{f})$ admits a complete bias. Then one of the following assertions is true.

(1)

The distribution $\mu_{\Delta_{f}}$ is symmetric with respect to its mean value and $m_{0}(\chi_{f})\geq m_{\pi}(\chi_{f})+2\sum_{j=1}^{r}m_{j}(\chi_{f})$ and in the case $r=0$ , the inequality is strict with more than half of the zeros equal to $\sqrt{q}$ . 2. (2)

The distribution $\mu_{\Delta_{f}}$ is not symmetric with respect to its mean value and $m_{0}(\chi_{f})>m_{\pi}(\chi_{f})$ .

In particular, this implies the following condition.

Corollary 4.3.

If $\Pi(n;\chi_{f})$ admits a complete bias for $f\in\mathbb{F}_{q}[x]$ , then $q$ is a square and $L(\frac{1}{2},\chi_{f})=0$ .

Remark 4.4.

In the case of Dirichlet $L$ -functions over ${\mathbb{Q}}$ , it is a famous conjecture of Chowla[Chowla] that no such $L$ -function can vanish at $\tfrac{1}{2}$ . It is known that Artin $L$ -functions corresponding to number fields extensions can vanish at $\tfrac{1}{2}$ . Incidentally, this was used in [Bailleul1] to provide examples of reversed bias in this context. In the function field case, it was shown in [Li18]*Theorem 1.3 that for any $q$ there are infinitely many Dirichlet $L$ -functions over ${\mathbb{F}}_{q}(x)$ vanishing at $\tfrac{1}{2}$ , that is such that the corresponding Weil polynomial vanishes at $\sqrt{q}$ . However it is expected that $100\%$ of those $L$ -functions do not vanish at $\tfrac{1}{2}$ for a fixed $q$ ([Li18]*Remark 1.4). If this were true, we would obtain the following result instead of Theorem 1.5: for every $q$ a power of an odd prime,

[TABLE]

Note also that by [ELS]*Corollary 1.6 there is no complete bias when $f$ is irreducible and $4$ does not divide the degree of $f$ . Indeed, in this case $L(\frac{1}{2},\chi_{f})\neq 0$ .

We can now prove our main results concerning upper bounds for complete bias using the necessary condition in Corollary 4.3.

Proof of Theorem 1.1.2..

The proof follows from applying Corollary 4.3 and Lemma 7.1. ∎

Proof of Theorem 1.5.

By [ELS]*Theorem 3.2, one has

[TABLE]

and so the bound follows from Corollary 4.3. ∎

We finally give the proof of our necessary condition for complete bias.

Proof of Proposition 4.2.

Suppose that the distribution $\mu_{\Delta_{f}}$ is symmetric with respect to its mean value $m_{0}(\chi_{f})+\frac{1}{2}$ . We have $\Delta_{f}(0)=m_{0}(\chi_{f})+m_{\pi}(\chi_{f})+1+2\sum_{j=1}^{r}m_{j}(\chi_{f})$ , so this value is in $\mathrm{supp}\mu_{\Delta_{f}}$ . Indeed, let $\varepsilon>0$ and $h:{\mathbb{R}}\to{\mathbb{R}}$ be non-negative continuous and supported on $[\Delta_{f}(0)-\varepsilon,\Delta_{f}(0)+\varepsilon]$ , with $h(\Delta_{f}(0))>0$ . Then

[TABLE]

where $\tilde{h}(a_{0},\dots,a_{r})=h\Big{(}m_{0}(\chi_{f})+\tfrac{1}{2}+(m_{\pi}(\chi_{f})+\tfrac{1}{2})e^{ia_{0}}+2\sum_{j=1}^{r}m_{\theta_{j}}(\chi_{f})\cos(a_{j})\Big{)}$ and $\mathop{}\!\mathrm{d}\omega_{A(\Delta_{f})}$ is the Haar measure on the subtorus $A(\Delta_{f})$ of $\mathbb{T}^{r+1}$ generated by $(\pi,\theta_{1},\dots,\theta_{r})$ . Since $h(\Delta_{f}(0))=\tilde{h}(0,\dots,0)>0$ , we get $\int_{{\mathbb{R}}}h(t)\mathop{}\!\mathrm{d}\mu_{\Delta_{f}}(t)>0$ , which implies $\Delta_{f}(0)\in\mathrm{supp}\mu_{\Delta_{f}}$ .

By symmetry, $2(m_{0}(\chi_{f})+\frac{1}{2})-(m_{0}(\chi_{f})+m_{\pi}(\chi_{f})+1+2\sum_{j=1}^{r}m_{j}(\chi_{f}))$ is also in $\mathrm{supp}\mu_{\Delta_{f}}$ , so it is non-negative.

In the case $\mu_{\Delta_{f}}$ is not symmetric with respect to its mean value, we are interested in the behavior of

[TABLE]

By [Bailleul2]*Theorem 3.1, we have $\text{dens}(\Delta_{f}>0)\leq\frac{1}{2}\mathbb{P}(Y_{0}\geq 0)+\frac{1}{2}\mathbb{P}(Y_{1}\geq 0)$ where $Y_{0},Y_{1}$ are random variables whose distributions are the limiting distributions of $\Delta_{f}(2\cdot)$ and $\Delta_{f}(2\cdot+1)$ respectively. Since we are assuming complete bias, then $\text{dens}(\Delta_{f}>0)=1$ yields $\mathbb{P}(Y_{0}\geq 0)=\mathbb{P}(Y_{1}\geq 0)=1$ .

We apply the Bhatia-Davis Inequality, Theorem 4.1, to the random variable $Y_{1}$ . To do so, we need the maximum, minimum, mean, and variance of $Y_{1}$ . To understand these, we group the $\theta_{j}$ by pairs such that $\theta_{j^{\prime}}=\pi-\theta_{j}$ when necessary. We have

[TABLE]

where we sum on $\{\theta_{1},\dots,\theta_{r^{\prime}}\}=\{\theta_{1},\dots,\theta_{r}\}\setminus\{\theta_{j}\mid\exists k\leq j,\theta_{j}=\pi-\theta_{k}\}$ (in particular $\frac{\pi}{2}\notin\{\theta_{1},\dots,\theta_{r^{\prime}}\}$ ), and we define $m_{j}^{\prime}(\chi_{f})=m_{j}(\chi_{f})-m_{k(j)}(\chi_{f})$ where $\theta_{k(j)}=\pi-\theta_{j}$ (and $m_{k(j)}(\chi_{f})=0$ if such a $\theta_{k(j)}$ does not exist). This grouping of terms was made to simplify the computation of the variance below. From this expression we deduce

[TABLE]

By the assumption of complete bias, we have $Y_{1}\geq 0$ almost-surely. By the definition of $Y_{1}$ , we have

[TABLE]

and

[TABLE]

By the Bhatia-Davis inequality (Theorem 4.1), we obtain

[TABLE]

This yields

[TABLE]

If every $m_{j}^{\prime}(\chi_{f})$ is zero, this means that for every integer $n$ , one has $\Delta_{f}(2n+1)=m_{0}(\chi_{f})-m_{\pi}(\chi_{f})$ . Since $\Pi(n;\chi_{f})$ exhibits a complete bias, this has to be positive, i.e. $m_{0}(\chi_{f})>m_{\pi}(\chi_{f})$ . If there is at least one non-zero $m_{j}^{\prime}(\chi_{f})$ , the inequality (4.2) also implies $m_{0}(\chi_{f})>m_{\pi}(\chi_{f})$ .

Finally, since $\sqrt{q}$ and $-\sqrt{q}$ have distinct multiplicities as roots of $P_{f}\in\mathbb{Z}[T]$ , those must be rational, hence integers, and so $q$ must be a square. ∎

4.2. Examples of complete biases

In this section, we first give a sufficient condition for a complete bias, in the hope to use it to find examples of instances of such an exceptional behavior.

Lemma 4.5 (Sufficient condition for complete bias).

Let $f\in\mathbb{F}_{q}[x]$ . Write

[TABLE]

with $L_{2}(-u)=L_{2}(u)$ of maximal degree, $\deg L_{i}=d_{i}$ . Assume that one of the following assertions holds,

(1)

we have $m_{0}>m_{\pi}+d_{1}$ and $m_{0}+m_{\pi}+1>d_{1}+d_{2}$ , or 2. (2)

we have $m_{0}\geq m_{\pi}+d_{1}$ and $m_{0}+m_{\pi}+1\geq d_{1}+d_{2}$ , and

(a)

$L_{1}$ * admits a root whose angle is not in $\mathbb{Q}\pi$ , or* 2. (b)

there exists $k_{1},\dots,k_{d_{1}}\in\mathbb{Z}$ satisfying $\sum_{i=1}^{d_{1}}k_{i}\theta_{i}\equiv 0\pmod{2\pi}$ and $\sum_{i=1}^{d_{1}}k_{i}$ is odd, where $\theta_{1},\dots,\theta_{d_{1}}$ are the angles of the roots of $L_{1}$ .

Then there is a complete bias with modulus $f$ .

One such example is $f=t^{4}+2t^{3}+2t+a^{7}\in{\mathbb{F}}_{9}[t]$ where $a$ is a generator of ${\mathbb{F}}_{9}$ over ${\mathbb{F}}_{3}$ , in [DevinMeng]*Example 3 the authors show that $P_{f}(u)=(u-3)^{2}$ .

Remark 4.6.

More generally, in [DevinMeng]*Proposition 3.1, based on Honda–Tate ideas (citing [Waterhouse]*Theorem 4.1), one can see that for each $q$ square, there exist $f\in{\mathbb{F}}_{q}[x]$ of degree $3$ such that the $L$ -function of $\chi_{f}$ is $(1-\sqrt{q}u)^{2}$ . This gives one example satisfying Lemma 4.5 for each $q$ square.

Remark 4.7.

Note however that our sufficient condition for a complete bias Lemma 4.5 is more restrictive than simply vanishing at $\tfrac{1}{2}$ so we cannot use the lower bound from [Li18]*Theorem 1.3 to give infinitely many examples of complete bias for a fixed $q$ .

Proof of Lemma 4.5.

It suffices to prove that under these conditions, we have $\Delta_{f}(n)>0$ for almost all $n$ , where $\Delta_{f}$ is defined in (2.3). We order the zeros of $P_{f}$ so that the first ones correspond to the zeros of $L_{1}$ , with multiplicities. Then, for all $n$ we have

[TABLE]

and

[TABLE]

Since $\cos(\theta_{j}n)\geq-1$ for all $j$ and $n$ , the conditions in case 1 imply that $\Delta_{f}(n)>0$ for all $n$ . In the case the conditions of 2a are satisfied, we have $\sum_{j=1}^{d_{1}}\cos(\theta_{j}n)>-d_{1}$ for almost all $n$ , since, up to reordering, we can assume that $\theta_{1}\notin\mathbb{Q}\pi$ which yields $\cos(\theta_{1}n)>-1$ for almost all $n$ . This concludes the proof in the case 2a. In the case 2b, it follows from Lemma 4.8 that $\sum_{j=1}^{d_{1}}\cos(\theta_{j}n)\geq-d_{1}+1+\cos(\pi(1-\tfrac{1}{\kappa}))>-d_{1}$ for all $n$ , where $\kappa=\sum_{i=1}^{d_{1}}\lvert k_{i}\rvert$ and this concludes the proof. ∎

We conclude this section by proving a technical lemma that was used in the proof of the sufficient condition (Lemma 4.5).

Lemma 4.8.

Let $\gamma_{1},\dots,\gamma_{N}\in(0,\pi)$ and assume that there exists $k_{1},\dots,k_{N}\in\mathbb{Z}$ satisfying $\sum_{i=1}^{N}k_{i}\gamma_{i}\equiv 0\pmod{2\pi}$ and $\sum_{i=1}^{N}k_{i}$ is odd. Then, for all $\ell\in\mathbb{Z}$ , we have $\max_{1\leq i\leq N}\lVert\ell\gamma_{i}-\pi\rVert_{2\pi}\geq\frac{\pi}{\sum_{i=1}^{N}\lvert k_{i}\rvert}.$ In particular,

[TABLE]

Proof.

Recall that $\lVert\ell\gamma_{i}-\pi\rVert_{2\pi}=\min_{n\in\mathbb{Z}}\lvert\ell\gamma_{i}-(2n+1)\pi\rvert$ . For each $i$ , let $n_{i}\in\mathbb{Z}$ be an integer that satisfies this minimum. We have

[TABLE]

Now, suppose that $\lVert\gamma-\pi\rVert_{2\pi}\geq\frac{\pi}{\kappa}$ , then we have

[TABLE]

This concludes the proof. ∎

5. Lower order biases

5.1. Upper bound

Our reflections on linear recurrence sequences from Section 2.3 give a good understanding on lower order bias. In particular, the contraposition of Lemma 2.19 yields the following necessary condition for a lower order bias.

Proposition 5.1 (Necessary condition for lower order bias).

If $\Pi(n;\chi_{f})$ admits a lower order bias, then $\chi_{f}$ is degenerate (see Definition 2.18).

This lemma implies that for $\Pi(n;\chi_{f})$ to admit a lower order bias, the Jacobian of the curve $C_{f}:y^{2}=f(x)$ is either non-ordinary or geometrically admitting an isogenous factor of order at least $2$ .

Using this lemma and an application of the large sieve from Proposition 2.22, we obtain the proof of Theorem 1.1.3.

Proof of Theorem 1.1.3.

The proof follows from applying Proposition 5.1 and Lemma 7.3. ∎

5.2. A sufficient condition for lower order bias and examples

Lemma 5.2 (Sufficient condition for lower order bias).

Let $f\in\mathbb{F}_{q}[x]$ . Suppose that $P_{f}(u)=P_{f}(-u)$ , then $\Delta_{f}(2n+1)=0$ for all $n$ , in particular, there is a lower order bias with modulus $f$ .

Proof.

Assume that the roots of $P_{f}$ with positive imaginary parts are labelled so that their arguments are $\theta_{1},...,\theta_{t},\pi-\theta_{1},...,\pi-\theta_{t}$ . Since $P_{f}(u)=P_{f}(-u)$ , the multiplicity of $\theta_{i}$ equals to that of $\pi-\theta_{i}$ . For $n\in\mathbb{N}$ and $1\leq j\leq t$ , one has $\cos((\pi-\theta_{j})(2n+1))=-\cos(\theta_{j}(2n+1))$ , whence

[TABLE]

Further,

[TABLE]

The above together give $\Delta_{f}(2n+1)=0$ for all $n\in\mathbb{N}$ . This is sufficient to deduce that there is a lower order bias with modulus $f$ . ∎

One such example is $f=t^{6}+2t^{3}+5\in{\mathbb{F}}_{23}[t]$ which is irreducible and the $L$ -function of $\chi_{f}$ is $1-29u^{2}+23^{2}u^{4}$ which is even with $4$ inverse roots $\pm\alpha$ , $\pm\overline{\alpha}$ , where

[TABLE]

Moreover, using [Calcut]*page 17, we see that $\alpha$ has argument unrelated to $\pi$ .

Remark 5.3.

Using [HNR]*Table 1.2 and the sufficient condition, we can give several examples for each $q$ that have a lower order bias. Namely the authors show that the polynomial $X^{4}-bX^{2}+q^{2}$ with $b=2q\cos(2\theta)$ is the Weil polynomial of the Jacobian of a hyperelliptic curve of genus 2 if $b\in\mathbb{Z}$ , $b\neq q,2q,2q-1,2q-2$ and $b+2q$ is not a square. Since the Weil polynomial of the Jacobian of such a curve is equal to the corresponding $P_{f}$ ([CorSil]*VII. Corollary 11.4), such $f$ exhibit lower order biases.

Remark 5.4.

The condition of Lemma 5.2 gives rise to the following question: Fix a finite field $\mathbb{F}_{q}$ , how many hyperelliptic curves admit even Frobenius characteristic polynomials? If $C$ is such a curve, then $C\otimes\mathbb{F}_{q^{2}}$ has its Frobenius characteristic polynomial being a perfect square. This question is closely related to counting curves/characters whose $L$ -functions are perfect squares.

6. Reversed biases

6.1. Upper bound

Let us first give a necessary condition for a reversed bias.

Proposition 6.1 (Necessary condition for a reversed bias).

If there is a reversed bias with modulus $f$ then

•

either there exist $k_{1},\dots,k_{g}\in\mathbf{Z}$ satisfying $\sum_{i=1}^{g}k_{i}\theta_{i}\equiv 0\pmod{2\pi}$ and $\sum_{i=1}^{g}k_{i}$ is odd, where the $\theta_{i}$ are angles of zeros of $P_{f}$ .

•

or $m_{0}<m_{\pi}$ (in particular, $q$ is a square).

Proof.

Suppose $f$ admits a reversed bias. Then the distribution $\mu_{f}$ is not symmetric with respect to its mean value $m_{0}+\frac{1}{2}\geq 0$ . So, from Lemma 2.11, there exists $k_{0},k_{1},\dots,k_{r}$ such that $k_{0}+\sum_{j=1}^{r}k_{j}\equiv 1\pmod{2}$ and $k_{0}\pi+\sum_{j=1}^{r}k_{j}\theta_{j}\equiv 0\pmod{2\pi}$ .

If $k_{0}$ is even, we get the first condition. Otherwise, assume that all relation between the $\theta_{j}$ ’s, $\sum_{i=1}^{g}k_{i}\theta_{i}\equiv 0\pmod{2\pi}$ satisfy $\sum_{i=1}^{g}k_{i}$ is even. Then we deduce from Lemma 2.11, that the limiting distribution of the functions $\Delta(2\cdot)$ and $\Delta(2\cdot+1)$ are symmetric with respect to their mean values, which are $m_{0}+m_{\pi}+1$ and $m_{0}-m_{\pi}$ . If the probability that one of the two functions is negative is larger than $\frac{1}{2}$ , then at least one of the mean values has to be negative. ∎

Here is a translation of our necessary condition in terms of the Galois group of $P_{f}$ over $\mathbb{Q}$ , which is more convenient to use in the large sieve. Recall that for $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ , $\mathrm{Gal}_{\mathbb{Q}}(P_{f})$ is a subgroup of $W_{2g}=\mathfrak{S}_{g}\ltimes\left(\mathbb{Z}/2\mathbb{Z}\right)^{g}$ , itself a subgroup of $\mathfrak{S}_{2g}$ (see [Kowalskibook]*page 249). In the following, we will consider that $\mathrm{Gal}(P_{f})\subset\mathfrak{S}_{2g}$ acts on $\{-g,\dots,-1,1,\dots,g\}$ , the set of indices of the roots $\alpha_{1},\dots,\alpha_{g},\alpha_{-1}=\overline{\alpha_{1}},\dots,\alpha_{-g}=\overline{\alpha_{g}}$ . The fact that $\mathrm{Gal}(P_{f})\subset W_{2g}$ means that if $\sigma\in\mathrm{Gal}(P_{f})$ then $\sigma(-i)=-\sigma(i)$ for all $i\in\{-g,\dots,-1,1,\dots,g\}$ .

Lemma 6.2.

Let $f\in\mathcal{H}_{n}(\mathbb{F}_{q})$ . Assume that there exist $k_{1},\dots,k_{g}\in\mathbb{Z}$ such that $k_{1}\theta_{1}+\dots+k_{g}\theta_{g}\equiv 0\text{ mod }2\pi$ and $k_{1}+\dots+k_{g}\equiv 1\pmod{2}$ . Then at least one of the following conditions hold:

(1)

$P_{f}$ * is not separable.* 2. (2)

$\chi_{f}$ * is degenerate (in the sense of Definition 2.18).* 3. (3)

$\mathrm{Gal}_{{\mathbb{Q}}}(P_{f})$ * does not act transitively on the set of pairs $\{\{1,-1\},\dots,\{g,-g\}\}$ ;* 4. (4)

For every $i\in\{1,\dots,g\}$ , $\mathrm{Gal}(P_{f})$ does not contain the transposition $(i\,-i)$ , and for every pair $\{i,j\}$ , with $i\neq j\in\{1,\dots,g\}$ , $\mathrm{Gal}_{{\mathbb{Q}}}(P_{f})$ does not contain the $4$ -cycle $(i\,j\,-i\,-j)$ .

Proof.

Assume that none of the first three items are satisfied. Let us fix $i\in\{1,\dots,g\}$ , then for every $j\in\{1,\dots,g\}\setminus\{i\}$ , there exist $\sigma_{j}\in\mathrm{Gal}(P_{f})$ such that $\sigma_{j}(j)\in\pm i$ . From the multiplicative relation

[TABLE]

with $\sum_{j=1}^{g}k_{j}\equiv 1\pmod{2}$ , we apply $\sigma_{j}$ and taking the product over all $j$ ’s we obtain another multiplicative relation of the form

[TABLE]

where $S_{i,i}=\sum\pm k_{j}\equiv 1\pmod{2}$ . In particular $S_{i,i}\neq 0$ . This being true for each $i\in\{1,\dots,g\}$ , by taking a suitable product of large powers of expressions of the form 6.1, we deduce that there exists $S_{1},\dots,S_{g}\in\mathbb{Z}\setminus\{0\}$ such that

[TABLE]

Let $i\in\{1,\dots,g\}$ . If $(i\,-i)\in\mathrm{Gal}_{{\mathbb{Q}}}(P_{f})$ , then we apply it to the relation 6.2 and taking a quotient we get $\left(\frac{\alpha_{i}}{\sqrt{q}}\right)^{2S_{i}}=1$ . This is a contradiction because $S_{i}\neq 0$ and $\frac{\alpha_{i}}{\sqrt{q}}$ is not a root of unity since $\chi_{f}$ is non-degenerate.

Now, let $i\neq j\in\{1,\dots,g\}$ . If $(i\,j\,-i\,-j)\in\mathrm{Gal}_{{\mathbb{Q}}}(P_{f})$ we get $\left(\frac{\alpha_{i}}{\sqrt{q}}\right)^{S_{i}+S_{j}}\left(\frac{\alpha_{j}}{\sqrt{q}}\right)^{S_{j}-S_{i}}=1$ , and similarly by applying its inverse $(i\,-j\,-i\,j)=(i\,j\,-i\,-j)^{3}$ , we get $\left(\frac{\alpha_{i}}{\sqrt{q}}\right)^{S_{i}-S_{j}}\left(\frac{\alpha_{j}}{\sqrt{q}}\right)^{S_{j}+S_{i}}=1$ . Combining the two relations, we obtain $\left(\frac{\alpha_{j}}{\sqrt{q}}\right)^{(S_{j}+S_{i})^{2}+(S_{j}-S_{i})^{2}}=1$ . But at least one among $S_{i}+S_{j}$ and $S_{i}-S_{j}$ is non-zero, since the $S_{i}$ ’s are non-zero, and as before, this shows that we cannot have $(i\,j\,-i\,-j)\in\mathrm{Gal}_{{\mathbb{Q}}}(P_{f})$ . ∎

Lemma 6.3.

Let $P\in\mathbb{Q}[T]$ be a $q$ -symplectic polynomial of degree $2g$ with roots $\alpha_{1},\overline{\alpha_{1}},\dots,\alpha_{g},\overline{\alpha_{g}}$ . If $\operatorname{Gal}_{{\mathbb{Q}}}(P)$ does not act transitively on the pairs $\{\alpha_{1},\overline{\alpha_{1}}\},\dots,\{\alpha_{g},\overline{\alpha_{g}}\}$ then $h_{P}$ defined by $P(T)=T^{g}h_{P}(T+qT^{-1})$ is reducible.

Proof.

Notice the roots of $h_{P}$ are the $\alpha_{i}+\overline{\alpha_{i}}$ . Every element of $\operatorname{Gal}_{{\mathbb{Q}}}(h_{P})$ are restrictions of elements of $\operatorname{Gal}_{{\mathbb{Q}}}(P)$ to the splitting field of $h_{P}$ . Now if $h_{P}$ is irreducible over $\mathbb{Q}$ , then $\operatorname{Gal}(h_{P})$ acts transitively on the set $\{\alpha_{i}+\overline{\alpha_{i}}\mid i=1,\dots,g\}$ . Thus, if $i\neq j\in\{1,\dots,g\}$ , there exists $\sigma\in\operatorname{Gal}(P)$ such that $\sigma(\alpha_{i}+\overline{\alpha_{i}})=\alpha_{j}+\overline{\alpha_{j}}$ . But $\sigma(\alpha_{i})=\alpha_{k}$ for some $k\in\{1,\dots,2g\}$ so we have $\sqrt{q}\cos(\theta_{k})=\sqrt{q}\cos(\theta_{j})$ which implies $\theta_{k}=\pm\theta_{j}$ , which means $\sigma(\alpha_{i})=\alpha_{j}$ or $\sigma(\alpha_{i})=\overline{\alpha_{j}}$ , and $\operatorname{Gal}_{{\mathbb{Q}}}(P)$ acts transitively on the set of pairs $\{\alpha_{i},\overline{\alpha_{i}}\}$ . ∎

We can finally prove the last part of our main theorem.

Proof of Theorem 1.1.4.

The proof follows by using the necessary conditions of Proposition 6.1. We obtain a bound for the second condition by the same argument as in Lemma 7.1. For the first condition of Proposition 6.1, we use Lemma 6.2 and bound the conditions $1.$ and $2.$ by Lemma 7.3. The third item is bounded by using Lemma 6.3 and Lemma 7.4. Finally, the bound obtained with the condition on the Galois group, which is the largest contribution, is dealt with by Lemma 7.5. ∎

6.2. Examples

In the hope of finding examples of reversed bias in the sense of Definition 2.6 we estimated

[TABLE]

where $\chi_{f}$ is the primitive quadratic character modulo $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ for small genera $g=\left\lfloor\frac{n-1}{2}\right\rfloor$ and small finite fields $\mathbb{F}_{q}$ . In particular, for fixed $f(x)\in\mathbb{F}_{q}[x]$ we computed $\Delta_{f}(n)$ for many values of $n$ , e.g. all $0\leq n\leq 1000$ . We found no clear candidate curves which exhibited a ”strong” reversed bias amongst $\mathcal{C}_{f}/\mathbb{F}_{q}$ with $q$ a prime less than $11$ and $\deg f(x)\leq 6$ as well as among those curves with $\deg f(x)\leq 8$ and $q=3$ .

Remark 6.4.

We can still provide an infinite family of examples exhibiting a reversed bias. Indeed, when $q$ is a square the polynomial $(1-u\sqrt{q}+u^{2}q)^{2}$ is the $L$ -function of a hyperelliptic curve of genus $2$ according to [HNR]. For such a curve $\mathcal{C}_{f}$ , we have $\Delta_{f}(n)=\frac{1}{2}+\frac{(-1)^{n}}{2}+4\cos(\tfrac{2\pi}{3}n)$ which is $6$ -periodic and takes $2$ positive values and $4$ negative values; explicitly, it takes the values $5,-2,-1,4,-1,-2$ .

Cha’s example ([Cha2008]*Example 5.3) corresponds to a reversed bias, however Cha is counting polynomials with degree less than $n$ instead of polynomials of degree equal to $n$ (see Remark 2.8). We verified that this example does not meet our criterion of being a reversed bias with our way of counting polynomials, but it exhibits a lower order bias because $\Delta_{f}(n)$ is $10$ -periodic, takes $3$ positive values (at $n\in\{0,1,9\}$ ), $2$ negative values (at $n\in\{3,7$ ) and is zero otherwise.

7. A few counting lemmas

Using the large sieve statement Proposition 2.22, we will now prove important intermediate counting lemmas that are used to establish our upper bounds for exceptional biases. Recall that $q$ is a power of the prime $p$ , and for any $n\geq 2$ , $g=\left\lfloor\frac{n-1}{2}\right\rfloor$ is the genus of the curve $\mathcal{C}_{f}$ for any $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ the set of monic squarefree polynomials in ${\mathbb{F}}_{q}[x]$ of degree $n$ .

Lemma 7.1.

We have

[TABLE]

where $A=2g^{2}+g+2$ .

Proof.

We first remark that the set $\left\{f\in\mathcal{H}_{n}(\mathbb{F}_{q})\mid m_{0}(\chi_{f})>m_{\pi}(\chi_{f})\right\}$ is empty when $q$ is not a square, because in that case, $\sqrt{q}$ and $-\sqrt{q}$ are conjugate algebraic numbers, so they must have the same multiplicity as roots of a polynomial with integer coefficients such as $P_{f}$ . We will prove our bound by showing that when $q$ is a square, we have

[TABLE]

For every $\ell\in\Lambda$ (recall that $\Lambda$ is simply the set of primes different from $2$ and $p$ ), we introduce the set $\Omega_{5,\ell}\subset\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ of $q$ -symplectic matrices for which $\sqrt{q}$ is not an eigenvalue. From [Kowalskibook]*Lemma B.5 (due to Chavdarov) we have

[TABLE]

Since the set of symplectic $q$ -polynomials of degree $2g$ in ${\mathbb{F}}_{\ell}[T]$ has dimension $g$ , and that the condition of vanishing at one point is a linear equation of the coefficients, we have

[TABLE]

We deduce that there exist a constant $C_{g}$ depending on $g$ such that

[TABLE]

Therefore, for $A=2g^{2}+g+2$ , we have

[TABLE]

The desired bound then follows from Proposition 2.22 by summing only over primes in $\Lambda$ . ∎

Remark 7.2.

We could improve the bound above by not restricting to the sum over primes, but we decided not to pursue this here, as we expect the improvement will only be on the power of $\log q$ .

The following lemma will allow us to reduce our counting to the case of non-degenerate characters $\chi_{f}$ (as in Definition 2.18) and simple roots of $P_{f}$ .

Lemma 7.3.

We have

[TABLE]

where $A=2g^{2}+g+2$ .

Proof.

Let $f$ satisfy the above condition, that is $\chi_{f}$ is degenerate or $P_{f}$ has a multiple root in $\mathbb{C}$ . Then there exist $1\leq i\neq j\leq 2g$ such that $\frac{\alpha_{i}}{\alpha_{j}}$ is a root of unity, we denote $d$ its order (one can take $d=1$ in the case of a multiple root $\alpha_{i}=\alpha_{j}$ ). We first remark that $\alpha_{i}$ and $\alpha_{j}$ are algebraic integers of degree at most $2g$ , so clearly $\frac{\alpha_{i}}{\alpha_{j}}$ is an algebraic number of degree at most $4g^{2}$ , and so $\varphi(d)\leq 4g^{2}$ .

Since $\alpha_{i}^{d}=\alpha_{j}^{d}$ , it means that the polynomial $P_{f,(d)}=\prod_{i=1}^{2g}(X-\alpha_{i}^{d})$ has a multiple root. This implies that its discriminant is [math]. Now, $\mathrm{disc}(P_{f,(d)})$ is a polynomial with integer coefficients in the coefficients of $P_{f,(d)}$ since it is the resultant of $P_{f,(d)}$ and its derivative. Moreover, those coefficients are symmetric polynomials in the $\alpha_{k}^{d}$ ’s, and in particular in the $\alpha_{k}$ ’s. By the fundamental theorem of symmetric polynomials, this is a polynomial expression in the elementary symmetric polynomials in the $\alpha_{k}$ ’s, which are precisely the coefficients of $P_{f}$ .

We have shown that $P_{f}$ satisfies a certain integral polynomial equation, i.e. there exists a polynomial $Q_{g,d}\in\mathbb{Z}[X_{1},\dots,X_{2g}]$ such that, if $a_{0},\dots,a_{2g-1}$ are the coefficients of $P_{f}$ , then one has $Q_{g,d}(a_{0},\dots,a_{2g-1})=0$ . Since there are at most finitely many $d$ such that $\varphi(d)\leq 4g^{2}$ , we get a universal relation

[TABLE]

such that if $\chi_{f}$ is degenerate or $P_{f}$ has a multiple root, then $Q_{g}(a_{0},\dots,a_{2g-1})=0$ .

Moreover, when $q$ is large enough, we know that $Q_{g}$ is non-zero since by Kowalski’s result (Theorem 1.2), there exists a polynomial $h\in{\mathbb{F}}_{q}[x]$ monic of degree $n$ such that $P_{h}(T)=T^{2g}+\dots+b_{1}T+b_{0}$ satisfies LI, and in particular, none of its quotients of roots is a root of unity, and for that polynomial, one has $Q(b_{0},\dots,b_{2g-1})\neq 0$ .

So the equation $Q_{g}=0$ defines a hypersurface in the set of $q$ -symplectic polynomials of fixed degree, and we have

[TABLE]

See also [Kowalskibook]*Theorem B.6. The end of the proof is completely similar to the end of the proof of Lemma 7.1. ∎

In the next lemma, $h_{P_{f}}$ denotes the “real Weil polynomial” attached to $\mathcal{C}_{f}$ , defined by the relation

[TABLE]

Lemma 7.4.

We have

[TABLE]

where $A=2g^{2}+g+2$ .

Proof.

We use Proposition 2.22 with the set

[TABLE]

Since if a monic polynomial is reducible, none of its reduction modulo a prime can be irreducible, we have

[TABLE]

where $\delta_{6,\ell}=\frac{\lvert\Omega_{6,\ell}\rvert}{\lvert\operatorname{Sp}_{2g}({\mathbb{F}}_{\ell})\rvert}$ . There are $\frac{1}{g}\ell^{g}(1+O_{g}(\tfrac{1}{\ell}))$ monic irreducible polynomials of degree $g$ with coefficients in $\mathbb{F}_{\ell}$ . As $P\mapsto h_{P}$ is a bijection from the set of $q$ -symplectic polynomials in ${\mathbb{F}}_{\ell}[T]$ of degree $2g$ to the set of monic polynomials of degree $g$ in ${\mathbb{F}}_{\ell}[T]$ , we deduce from [Kowalskibook]*Lemma B.5 (similarly to Lemma 7.1) that

[TABLE]

We conclude using the estimation of the sum from a theorem of Lau and Wu [Kowalskibook]Theorem G.2 applied the same way as Kowalski in [Kowalskibook](8.24):

[TABLE]

from which we deduce the stated bound. ∎

The last counting lemma is about polynomials $f\in\mathcal{H}_{n}({\mathbb{F}}_{q})$ such that $\mathrm{Gal}_{\mathbb{Q}}(P_{f})$ does not contain certain permutations. Recall from the discussion above Lemma 6.2 that $\mathrm{Gal}_{\mathbf{Q}}(P_{f})$ acts on $\{-g,\dots,-1,1,\dots,g\}$ .

Lemma 7.5.

We have

[TABLE]

where $A=2g^{2}+g+2$ .

Proof.

First, we may assume that $P_{f}$ is separable, since the announced bound is worse than that of 7.3.

We are once again going to use the large sieve bound coming from Proposition 2.22 but the set $\Lambda$ of prime numbers used in the large sieve has to be modified a bit here because of Lemma 7.7: we take $\Lambda$ to be the set of prime numbers different from $2$ and $p$ and larger than $4g^{2}$ (see Remark 2.23). This only induces a further dependency on $g$ in the implied constants, but doesn’t modify the final bound.

For every $\ell\in\Lambda$ , we consider $\Omega_{7,\ell}$ be the set of $q$ -symplectic matrices $M\in\operatorname{CSp}_{2g}(\mathbb{F}_{\ell})$ such that the characteristic polynomial $\chi_{M}$ admits a factorization either as a quadratic irreducible polynomial multiplied by distinct irreducible polynomials of odd degree, or as a quartic irreducible polynomial multiplied by distinct irreducible polynomials of odd degree. Indeed, if $P_{f}$ is separable but the Galois group $\mathrm{Gal}(P_{f})$ does not contain a transposition nor a $4$ -cycle (when seen as a subgroup of $\mathfrak{S}_{2g}$ ), then $\rho_{\ell}(\operatorname{Frob}_{f,q})\notin\Omega_{7,\ell}$ for any $\ell$ (see [Jacobson]*Theorem 4.37).

Therefore, we need to count the symplectic polynomials with such factorizations to be able to conclude as above. For $\ell\in\Lambda$ , we let $\delta_{7,\ell}=\frac{\lvert\Omega_{7,\ell}\rvert}{\lvert\operatorname{Sp}_{2g}({\mathbb{F}}_{\ell})\rvert}$ .

In the case $g$ is even, we use Lemma 7.7 with ( $k=1$ , $n_{\frac{g-2}{2}}=1$ ) and with ( $k=2$ , $n_{\frac{g-4}{2}}=1$ , $n_{0}=1$ ) to get

[TABLE]

In the case $g$ is odd we use Lemma 7.7 with ( $k=1$ , $n_{\frac{g-3}{2}}=1$ , $n_{0}=1$ ) with ( $k=1$ , $n_{\frac{g-5}{2}}=1$ , $n_{1}=1$ ), and with ( $k=2$ , $n_{\frac{g-3}{2}}=1$ ) to get

[TABLE]

In both cases we have $\delta_{7,\ell}\geq\frac{7}{24g}+O(\ell^{-\frac{1}{2}})$ , so we obtain the announced bound in the same way as in the proof of Proposition 7.4. ∎

Remark 7.6.

In the proof above of Lemma 7.5 one could expand the application of Lemma 7.7 to add more terms to the lower bound of $\delta_{7,\ell}$ to gain marginal improvements. The additional condition in the lemma and its application above with $k=2$ delivers our improvement over Kowalski’s bound (1.1).

Lemma 7.7.

Let $0\leq k\leq g$ be two integers, and let $\ell>4g^{2}$ be a prime number. Write $r=\lfloor\frac{g-k-1}{2}\rfloor$ and let $n_{i}$ , $1\leq i\leq r$ , be integers such that $g=k+n_{0}+3n_{1}+5n_{2}+\dots+(2r+1)n_{r}$ . Let $\omega_{k,\ell}(\underline{n})$ be the set of $q$ -symplectic squarefree polynomials $P\in\mathbb{F}_{\ell}[T]$ which factor as a product $P=Q_{2k}R_{0}\tilde{R_{0}}R_{1}\tilde{R_{1}}\dots R_{r}\tilde{R_{r}}$ , where $Q_{2k}$ is an irreducible $q$ -symplectic polynomial of degree $2k$ , each $R_{i}$ is a product of $n_{i}$ distinct irreducible monic polynomials of degree $2i+1$ , and $\tilde{R_{i}}=\frac{T^{(2i+1)n_{i}}}{R_{i}(0)}R_{i}\left(\frac{q}{T}\right)$ is the $q$ -reciprocal of $R_{i}$ . Then, we have

[TABLE]

Proof.

First observe that for any $q$ -symplectic polynomial $P\in{\mathbb{F}}_{\ell}[T]$ , one has $P(0)=q^{\deg P/2}\neq 0$ , in particular, for all $R\mid P$ , one has $R(0)\neq 0$ . We appeal to [Kowalski2006]*Lemma 7.3 (ii), which gives that the count of irreducible symplectic polynomials of degree $2k$ is larger than $\frac{1}{2k}\ell^{k}-O(\ell^{k-1})$ (see also [DDS]*Lemma 3 which can be adapted to the case of $q$ -symplectic polynomials). The irreducible factors of odd degree of a symplectic polynomial come in pairs $\{R(T),\tilde{R}(T)=\frac{T^{\deg R}}{R(0)}R\left(\frac{q}{T}\right)\}$ , uniquely determined by either of its elements. So it suffices to count polynomials of degree $g-k$ that are products of distinct odd degree irreducible polynomials. By [Kowalskibook]*Lemma B.1 there are $\prod_{i=0}^{r}\frac{1}{(2i+1)^{n_{i}}n_{i}!}\ell^{g-k}-O(\ell^{g-k-\frac{1}{2}})$ polynomials with given factorization $R_{0}R_{1}\dots R_{r}$ (as in the statement of the lemma).

For each polynomial with factorization type $R_{0}R_{1}\dots R_{r}$ , for each factor $R_{i}$ , $0\leq i\leq r$ , we have made a choice of which element of the pair $\{R(T),\tilde{R}(T)\}$ to include. There are $2^{n_{i}}$ such choices for each $i$ .

We just need to remove from the final count the monic $q$ -symplectic polynomials that have multiple roots, as counted in the proof of Lemma 7.3, there are at most $O(\ell^{g-1})$ such polynomials, so this does not change the main term. ∎

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Exceptional biases in counting primes over function fields

Abstract.

1. Introduction

Theorem 1.1**.**

Theorem 1.2** ([Kowalski2010]*Proposition 1.1).**

Remark 1.3**.**

Theorem 1.4**.**

Theorem 1.5**.**

Outline of the paper

Acknowledgements

2. Preliminary results and notations

2.1. Notations and Definitions

Remark 2.1**.**

Definition 2.2**.**

Remark 2.3**.**

Definition 2.4**.**

Remark 2.5**.**

Definition 2.6**.**

Remark 2.7**.**

Remark 2.8**.**

2.2. Properties of limiting distributions

Definition 2.9**.**

Proposition 2.10**.**

Lemma 2.11**.**

Proof.

2.3. Results about linear recurrence sequences

Definition 2.12**.**

Lemma 2.13**.**

Proof.

Theorem 2.14** (Skolem-Mahler-Lech, [RecSeq]*Theorem 2.1).**

Corollary 2.15**.**

Proof.

Theorem 2.16** ([BeGer]*Theorem 1).**

Corollary 2.17**.**

Definition 2.18**.**

Lemma 2.19**.**

Proof.

Remark 2.20**.**

2.4. A large sieve statement

Theorem 2.21**.**

Proof.

Proposition 2.22**.**

Proof.

Remark 2.23**.**

3. Linear dependence

Proof of Theorem 1.1.1..

Theorem 3.1** ([AhmadiShpar]*Theorem 2).**

Proof of Theorem 1.4..

4. Complete biases

4.1. Upper bounds for complete biases

Theorem 4.1** (Bhatia-Davis Inequality).**

Proposition 4.2** (Necessary condition for complete bias).**

Corollary 4.3**.**

Remark 4.4**.**

Proof of Theorem 1.1.2..

Proof of Theorem 1.5.

Proof of Proposition 4.2.

4.2. Examples of complete biases

Lemma 4.5** (Sufficient condition for complete bias).**

Remark 4.6**.**

Remark 4.7**.**

Proof of Lemma 4.5.

Lemma 4.8**.**

Proof.

5. Lower order biases

5.1. Upper bound

Proposition 5.1** (Necessary condition for lower order bias).**

Proof of Theorem 1.1.3.

5.2. A sufficient condition for lower order bias and examples

Lemma 5.2** (Sufficient condition for lower order bias).**

Proof.

Remark 5.3**.**

Remark 5.4**.**

Theorem 1.1.

Theorem 1.2 ([Kowalski2010]*Proposition 1.1).

Remark 1.3.

Theorem 1.4.

Theorem 1.5.

Remark 2.1.

Definition 2.2.

Remark 2.3.

Definition 2.4.

Remark 2.5.

Definition 2.6.

Remark 2.7.

Remark 2.8.

Definition 2.9.

Proposition 2.10.

Lemma 2.11.

Definition 2.12.

Lemma 2.13.

Theorem 2.14 (Skolem-Mahler-Lech, [RecSeq]*Theorem 2.1).

Corollary 2.15.

Theorem 2.16 ([BeGer]*Theorem 1).

Corollary 2.17.

Definition 2.18.

Lemma 2.19.

Remark 2.20.

Theorem 2.21.

Proposition 2.22.

Remark 2.23.

Theorem 3.1 ([AhmadiShpar]*Theorem 2).

Theorem 4.1 (Bhatia-Davis Inequality).

Proposition 4.2 (Necessary condition for complete bias).

Corollary 4.3.

Remark 4.4.

Lemma 4.5 (Sufficient condition for complete bias).

Remark 4.6.

Remark 4.7.

Lemma 4.8.

Proposition 5.1 (Necessary condition for lower order bias).

Lemma 5.2 (Sufficient condition for lower order bias).

Remark 5.3.

Remark 5.4.

Proposition 6.1 (Necessary condition for a reversed bias).

Lemma 6.2.

Lemma 6.3.

Remark 6.4.

Lemma 7.1.

Remark 7.2.

Lemma 7.3.

Lemma 7.4.

Lemma 7.5.

Remark 7.6.

Lemma 7.7.