Angles of Gaussian primes

Ze'ev Rudnick; Ezra Waxman

arXiv:1705.07498·math.NT·October 2, 2018


TL;DR

This paper investigates the distribution and variance of angles associated with Gaussian primes, extending classical results and proposing a conjecture supported by a function field analogue and random matrix models.

Contribution

It introduces a conjecture for the variance of angles of Gaussian primes in short arcs, supported by a proven analogue in function fields and connections to random matrix theory.

Findings

01. Angles are uniformly distributed as primes vary.

02. A conjecture for the variance in short arcs is proposed.

03. Asymptotic form of the variance is proved in the function field case.

Abstract

Fermat showed that every prime $p=1\bmod 4$ is a sum of two squares: $p=a^{2}+b^{2}$. To any of the $8$ possible representations $(a,b)$ we associate an angle whose tangent is the ratio $b/a$. In 1919 Hecke showed that these angles are uniformly distributed as $p$ varies, and in the 1950's Kubilius proved uniform distribution in somewhat short arcs. We study fine scale statistics of these angles, in particular the variance of the number of such angles in a short arc. We present a conjecture for this variance, motivated both by a random matrix model, and by a function field analogue of this problem, for which we prove an asymptotic form for the corresponding variance.

Equations

$$\mathbb{S}^{1}_{\mathbb{Q}}=\{z\in{\mathbb{Q}}(i):\operatorname{Norm}(z)=1\}={\mathbb{Q}}(i)\cap S^{1}.$$

$$u(\alpha):=\Big(\frac{\alpha}{\bar{\alpha}}\Big)^{2}\in\mathbb{S}^{1}_{\mathbb{Q}}$$

$$\frac{\#\{\operatorname{Norm}\mathfrak{p}\leq x:\theta_{\mathfrak{p}}\in I\}}{\#\{\operatorname{Norm}\mathfrak{p}\leq x\}}\sim\frac{|I|}{\pi/2},\quad x\to\infty$$

$$N:=\#\{\mathfrak{p}\text{ prime}:\operatorname{Norm}\mathfrak{p}\leq x\}\sim\frac{x}{\log x},$$

$$\operatorname{Sect}(\theta,x)=\{z\in{\mathbb{C}}:\operatorname{Norm}(z)=z\bar{z}\leq x,\ \arg(z)\in I_{K}(\theta)\}$$

$$\mathcal{N}_{K,x}(\theta)=\#\{\mathfrak{p}\text{ prime},\ \operatorname{Norm}\mathfrak{p}\leq x,\ \theta_{\mathfrak{p}}\in I_{K}(\theta)\}$$

$$\left\langle\mathcal{N}_{K,x}\right\rangle:=\int_{0}^{\pi/2}\mathcal{N}_{K,x}(\theta)\frac{d\theta}{\pi/2}=\frac{N}{K}.$$

$$\operatorname{Var}(\mathcal{N}_{K,x})=\int_{0}^{\pi/2}\Big|\mathcal{N}_{K,x}-\left\langle\mathcal{N}_{K,x}\right\rangle\Big|^{2}\frac{d\theta}{\pi/2}.$$

$$\operatorname{Var}(\mathcal{N}_{K,x})\sim\frac{N}{K},\quad N=o(K).$$

$$\operatorname{Var}(\mathcal{N}_{K,x})\sim\frac{N}{K}\min\Big(1,2\frac{\log K}{\log N}\Big).$$

$$P(T)=A(T)^{2}+TB(T)^{2}$$

$$P=\mathfrak{p}\cdot\tilde{\mathfrak{p}}=(A+\sqrt{-T}B)(A-\sqrt{-T}B)$$

$$N=\frac{q^{\nu}}{\nu}+O\Big(\frac{q^{\nu/2}}{\nu}\Big)$$

$$\sigma:S\mapsto-S,\quad\sigma(f)(S)=f(-S),$$

$$\operatorname{Norm}:\mathbb{F}_{q}[[S]]^{\times}\to\mathbb{F}_{q}[[T]]^{\times},\quad\operatorname{Norm}(f)=f(S)f(-S).$$

$$\mathbb{S}^{1}:=\{g\in\mathbb{F}_{q}[[S]]^{\times}:g(0)=1,\ \operatorname{Norm}(g)=1\}$$

$$\operatorname{Sect}(u;k)=\{v\in\mathbb{S}^{1}:|v-u|\leq q^{-k}\}.$$

$$\mathbb{S}^{1}_{k}=\{f\in\mathbb{F}_{q}[S]/(S^{k}):f(0)=1,\ \operatorname{Norm}(f):=f(-S)f(S)=1\bmod S^{k}\}$$

$$K:=\#\mathbb{S}^{1}_{k}=q^{\kappa},$$

$$\kappa:=\Big\lfloor\frac{k}{2}\Big\rfloor,\quad\text{so that}\quad k=\begin{cases}2\kappa+1,&k\text{ odd},\\2\kappa,&k\text{ even}.\end{cases}$$

$$U:f\mapsto\sqrt{\frac{f}{\sigma(f)}}.$$

$$\mathcal{N}_{k,\nu}(u):=\#\{(\mathfrak{p})\text{ prime},\ \mathfrak{p}(0)\neq 0:\deg\mathfrak{p}=\nu,\ U(\mathfrak{p})\in\operatorname{Sect}(u,k)\}.$$

$$\left\langle\mathcal{N}_{k,\nu}\right\rangle:=\frac{1}{q^{\kappa}}\sum_{u\in\mathbb{S}^{1}_{k}}\mathcal{N}_{k,\nu}(u)=\frac{N}{K}\sim\frac{q^{\nu}/\nu}{q^{\kappa}}.$$

$$\mathcal{N}_{k,\nu}(u)=\frac{N}{K}+O(q^{\nu/2})$$

$$\operatorname{Var}(\mathcal{N}_{k,\nu}):=\frac{1}{q^{\kappa}}\sum_{u\in\mathbb{S}^{1}_{k}}\Big|\mathcal{N}_{k,\nu}-\left\langle\mathcal{N}_{k,\nu}\right\rangle\Big|^{2}.$$

$$\operatorname{Var}(\mathcal{N}_{k,\nu})\sim\frac{q^{\nu-\kappa}}{\nu^{2}}\times\begin{cases}2\kappa-2,&\nu\geq 2\kappa-2,\\\nu-1+\eta(\nu),&\kappa\leq\nu\leq 2\kappa-2.\end{cases}$$

$$\frac{\operatorname{Var}(\mathcal{N}_{k,\nu})}{N/K}\sim\begin{cases}2\frac{\log_{q}K}{\log_{q}N}-\frac{2}{\log_{q}N},&\log_{q}K\leq\frac{1}{2}\log_{q}N+1,\\1+\frac{\eta(\log_{q}N)-1}{\log_{q}N},&\frac{1}{2}\log_{q}N+1\leq\log_{q}K\leq\log_{q}N.\end{cases}$$

$$\frac{\operatorname{Var}(\mathcal{N}_{K,N})}{N/K}\sim\min\Big(1,2\frac{\log K}{\log N}\Big)$$

$$\theta_{\mathfrak{a}}\gg\frac{1}{\sqrt{\operatorname{Norm}\mathfrak{a}}}.$$

$$|\theta_{\mathfrak{p}}-\theta_{\mathfrak{q}}|\geq\frac{1}{\sqrt{\operatorname{Norm}\mathfrak{p}\operatorname{Norm}\mathfrak{q}}}.$$


Full text

Angles of Gaussian primes

Ze'ev Rudnick and Ezra Waxman

Raymond and Beverly Sackler School of Mathematical Sciences, Tel Aviv University, Tel Aviv 69978, Israel



1 Introduction
1.1 Angles of Gaussian primes
1.2 The number variance
1.3 A function field analogue
2 Repulsion between angles
2.1 Repulsion and its consequences
2.2 Deviations from randomness
2.3 The variance in the trivial regime
3 Almost all sectors contain an angle
3.1 A smooth count
3.2 Variance in the trivial regime
3.3 An upper bound
4 Relation to zeros of Hecke L-functions
4.1 Hecke characters and their L-functions
4.2 An Explicit Formula
4.3 Primes vs prime powers
4.4 Proof of Theorem 3.2
5 A random matrix theory model
5.1 The model
5.2 Proof of Proposition 5.3
6 A function field model
6.1 The group of sectors
6.2 Super-even characters and their L-functions
6.3 A weighted count
6.4 The variance of $\Psi_{k,\nu}$
6.5 Proof of Theorem 6.7
6.6 Relation between variance of $\mathcal{N}_{k,\nu}$ and $\Psi_{k,\nu}$
6.7 Proof of Lemma 6.11

1. Introduction

1.1. Angles of Gaussian primes

An odd prime $p$ is a sum of two squares if and only if $p=1\bmod 4$ , and in that case there are exactly $8$ representations. Each representation corresponds to a Gaussian integer $a+ib=\sqrt{p}e^{i\theta_{a,b}}$ . We wish to understand the statistics of the resulting angles.

It is useful to formulate the results in terms of prime ideals of the ring of Gaussian integers ${\mathbb{Z}}[i]$ , which is the ring of integers of the imaginary quadratic field ${\mathbb{Q}}(i)$ . The basic infrastructure that we need is complex conjugation $z\mapsto\bar{z}$ , the norm map $\operatorname{Norm}:{\mathbb{Q}}(i)^{\times}\to{\mathbb{Q}}^{\times}$ , $\operatorname{Norm}(z)=z\bar{z}$ , and the norm one elements

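$$\mathbb{S}^{1}_{\mathbb{Q}}=\{z\in{\mathbb{Q}}(i):\operatorname{Norm}(z)=1\}={\mathbb{Q}}(i)\cap S^{1}.$$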

For a Gaussian number $\alpha\in{\mathbb{Q}}(i)^{\times}$ , we have a direction vector given by

$$u(\alpha):=\Big(\frac{\alpha}{\bar{\alpha}}\Big)^{2}\in\mathbb{S}^{1}_{\mathbb{Q}}$$

so that $u(\alpha)=e^{4i\theta}$ , $\theta=\arg\alpha$ .

Let $\mathfrak{p}$ be a prime ideal in ${\mathbb{Z}}[i]$ . If $\mathfrak{p}=\langle\alpha\rangle$ is generated by the Gaussian integer $\alpha$ , we associate a direction vector $u(\mathfrak{p}):=u(\alpha)\in\mathbb{S}^{1}_{\mathbb{Q}}$ . Since all generators of the ideal differ by multiplication by a unit ${\mathbb{Z}}[i]^{\times}=\{\pm 1,\pm i\}$ , the direction vector $u(\mathfrak{p})=e^{i4\theta_{\mathfrak{p}}}$ is well-defined on ideals, while the angle $\theta_{\mathfrak{p}}$ is only defined modulo $\pi/2$ . We can choose $\theta_{\mathfrak{p}}$ to lie say in $[0,\pi/2)$ , corresponding to taking $\alpha=a+ib$ , with $a>0$ , $b\geq 0$ .
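As a concrete illustration of this normalisation, the following minimal sketch lists the angle $\theta_{\mathfrak{p}}\in[0,\pi/2)$ of every prime ideal of ${\mathbb{Z}}[i]$ of norm at most $x$. The function names `primes_up_to` and `gaussian_prime_angles` are hypothetical, and the brute-force search for $a^{2}+b^{2}=p$ is chosen for readability rather than speed.

```python
import math


def primes_up_to(n):
    """Sieve of Eratosthenes: all rational primes <= n (assumes n >= 2)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, math.isqrt(n) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [p for p in range(2, n + 1) if sieve[p]]


def gaussian_prime_angles(x):
    """Return (norm, theta) for each prime ideal of Z[i] with norm <= x.

    theta is the argument of the generator a+ib with a > 0, b >= 0,
    so it lies in [0, pi/2).
    """
    out = []
    for p in primes_up_to(x):
        if p == 2:
            out.append((2, math.pi / 4))        # ramified prime <1+i>
        elif p % 4 == 1:
            # split prime: search for a, b > 0 with a^2 + b^2 = p
            for b in range(1, math.isqrt(p) + 1):
                a = math.isqrt(p - b * b)
                if a * a + b * b == p:
                    break
            out.append((p, math.atan2(b, a)))   # ideal <a+ib>
            out.append((p, math.atan2(a, b)))   # conjugate ideal <a-ib> = <b+ia>
        elif p * p <= x:
            out.append((p * p, 0.0))            # inert prime <p>, norm p^2
    return out
```

For example, `gaussian_prime_angles(100)` covers the ramified prime, the two conjugate ideals above each of the eleven primes $p=1\bmod 4$ up to $100$, and the inert primes $3$ and $7$.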

Hecke [5] showed that as $\mathfrak{p}$ varies over prime ideals of ${\mathbb{Z}}[i]$ , the angles $\theta_{\mathfrak{p}}$ become uniformly distributed in $[0,\frac{\pi}{2})$ : For a fixed sector, defined by an interval $I\subseteq[0,\frac{\pi}{2})$ ,

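$$\frac{\#\{\operatorname{Norm}\mathfrak{p}\leq x:\theta_{\mathfrak{p}}\in I\}}{\#\{\operatorname{Norm}\mathfrak{p}\leq x\}}\sim\frac{|I|}{\pi/2},\quad x\to\infty\tag{1.1}$$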

where $|I|$ is the length of the interval $I$ .

The validity of (1.1) for shrinking sectors was studied by Kubilius and his school [11, 12, 10, 14, 15, 16], obtaining that (1.1) holds for any sector as long as $|I|>x^{-\delta}$ for some $1/4<\delta<1/2$ . See also [4] for existence of prime angles in somewhat smaller sectors without the full force of (1.1). Assuming the Generalized Riemann Hypothesis (GRH), we know that (1.1) holds for intervals with $\operatorname{length}(I)\gg x^{-1/2+o(1)}$ . This regime is the limit of what can be expected to hold for individual sectors, because it is easy to see that there are no Gaussian integers (let alone primes) in the sector $\{a,b>0:a^{2}+b^{2}\leq x,0<\arctan\frac{b}{a}<x^{-1/2}\}$ . Hence for smaller sectors we can only hope for a statistical theory, rather than individual results.
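The emptiness of the thin sector can be checked directly for small $x$. The sketch below (the helper name `lattice_points_in_thin_sector` is hypothetical) counts Gaussian integers $a+ib$ with $a,b>0$, $a^{2}+b^{2}\leq x$ and $0<\arctan\frac{b}{a}<x^{-1/2}$. The reason the count vanishes is that $b\geq 1$ forces $\sin\theta=b/\sqrt{a^{2}+b^{2}}\geq x^{-1/2}$, and $\theta>\sin\theta$ for $\theta>0$.

```python
import math


def lattice_points_in_thin_sector(x):
    """Count a+ib with a, b > 0, a^2 + b^2 <= x and arctan(b/a) < x**-0.5."""
    bound = x ** -0.5
    count = 0
    for a in range(1, math.isqrt(x) + 1):
        for b in range(1, math.isqrt(x - a * a) + 1):
            if math.atan2(b, a) < bound:
                count += 1
    return count
```

The comparison is done in floating point, so this is only a sanity check for moderate $x$, not a proof.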

To formulate the theory, we introduce some notation: Given $x\gg 1$ , let $N$ be the number of prime ideals $\mathfrak{p}\subset{\mathbb{Z}}[i]$ of norm at most $x$ :

$$N:=\#\{\mathfrak{p}\text{ prime}:\operatorname{Norm}\mathfrak{p}\leq x\}\sim\frac{x}{\log x},$$

where the asymptotic holds by the Prime Ideal Theorem for ${\mathbb{Q}}(i)$ . Given an interval $I_{K}(\theta)=[\theta-\frac{\pi}{4K},\theta+\frac{\pi}{4K}]$ of length $\pi/(2K)$ centered at $\theta$ , define a sector

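$$\operatorname{Sect}(\theta,x)=\{z\in{\mathbb{C}}:\operatorname{Norm}(z)=z\bar{z}\leq x,\ \arg(z)\in I_{K}(\theta)\}$$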

of radius $\sqrt{x}$ and opening angle defined by $I_{K}(\theta)$ .

Given $K\gg 1$ , we divide the interval $[0,\pi/2)$ into $K$ disjoint arcs $I_{K}(\theta_{1})$ , $\dots$ , $I_{K}(\theta_{K})$ of equal length, which in turn define $K$ disjoint sectors $\operatorname{Sect}(\theta_{j},x)$ , and study the number of prime angles falling into each such sector. If the sectors are too small, in the sense that the number $K$ of sectors is larger than the number $N$ of angles involved, then the typical such sector will not contain any Gaussian prime. We want to show that in the range $K\ll N^{1-\epsilon}$ , almost all sectors with opening angles of size $\approx 1/K$ contain at least one angle $\theta_{\mathfrak{p}}$ , $\operatorname{Norm}(\mathfrak{p})\leq x$ . We can do so assuming GRH (for the family of Hecke L-functions):

Theorem 1.1.

Assume GRH. Then almost all arcs of length $1/K$ contain at least one angle $\theta_{\mathfrak{p}}$ for a prime ideal with $\operatorname{Norm}(\mathfrak{p})\leq K(\log K)^{2+o(1)}$ .

Unconditionally, one may use zero-density theorems as in [16] to obtain a result with $\operatorname{Norm}(\mathfrak{p})<K^{2-\delta}$ for some small $\delta>0$ .

It is surprising that something like Theorem 1.1 does not seem to have been considered long ago. It has come up independently in the recent work of Ori Parzanchevski and Peter Sarnak [17].

1.2. The number variance

One way to obtain such an “almost-everywhere” result is by computing the variance of a suitable counting function. The study of the structure of the variance is the main point of this paper.

Let

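$$\mathcal{N}_{K,x}(\theta)=\#\{\mathfrak{p}\text{ prime},\ \operatorname{Norm}\mathfrak{p}\leq x,\ \theta_{\mathfrak{p}}\in I_{K}(\theta)\}\tag{1.2}$$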

be the number of angles $\theta_{\mathfrak{p}}$ in $I_{K}(\theta)$ .

The expected number is

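$$\left\langle\mathcal{N}_{K,x}\right\rangle:=\int_{0}^{\pi/2}\mathcal{N}_{K,x}(\theta)\frac{d\theta}{\pi/2}=\frac{N}{K}.$$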

We wish to study the number variance

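$$\operatorname{Var}(\mathcal{N}_{K,x})=\int_{0}^{\pi/2}\Big|\mathcal{N}_{K,x}-\left\langle\mathcal{N}_{K,x}\right\rangle\Big|^{2}\frac{d\theta}{\pi/2}.$$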
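A rough numerical version of these quantities can be sketched as follows. As a simplifying assumption, the average over all centres $\theta$ is replaced by an average over the $K$ disjoint arcs that tile $[0,\pi/2)$, so the output is a discretised stand-in for the number variance rather than the integral itself. All function names are hypothetical.

```python
import math


def prime_ideal_angles(x):
    """Angles in [0, pi/2) of all prime ideals of Z[i] with norm <= x."""
    angles = []
    for p in range(2, x + 1):
        if any(p % d == 0 for d in range(2, math.isqrt(p) + 1)):
            continue                        # p is composite
        if p == 2:
            angles.append(math.pi / 4)      # ramified prime <1+i>
        elif p % 4 == 3:
            if p * p <= x:
                angles.append(0.0)          # inert prime, norm p^2
        else:
            # split prime: a^2 + b^2 = p gives two conjugate ideals
            b = next(b for b in range(1, math.isqrt(p) + 1)
                     if math.isqrt(p - b * b) ** 2 == p - b * b)
            a = math.isqrt(p - b * b)
            angles += [math.atan2(b, a), math.atan2(a, b)]
    return angles


def arc_counts(angles, K):
    """Number of angles in each of K disjoint arcs of length pi/(2K)."""
    counts = [0] * K
    for t in angles:
        j = min(K - 1, int(t / (math.pi / 2) * K))
        counts[j] += 1
    return counts


def number_variance(counts):
    """Mean squared deviation of the arc counts from their mean N/K."""
    K = len(counts)
    mean = sum(counts) / K
    return sum((c - mean) ** 2 for c in counts) / K
```

With `counts = arc_counts(prime_ideal_angles(x), K)`, the mean of `counts` is exactly $N/K$, and `number_variance(counts)` can be compared with $N/K$ for various $K$.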

If $N=o(K)$ , then for almost all intervals, we do not have any angles $\theta_{\mathfrak{p}}$ in the interval $I_{K}(\theta)$ . We can easily compute the variance in this “trivial” regime:

$$\operatorname{Var}(\mathcal{N}_{K,x})\sim\frac{N}{K},\quad N=o(K).$$

For the interesting range, when $K\ll N^{1-\epsilon}$ , we expect:

Conjecture 1.2.

For $1\ll K\ll N^{1-o(1)}$

$$\operatorname{Var}(\mathcal{N}_{K,x})\sim\frac{N}{K}\min\Big(1,2\frac{\log K}{\log N}\Big).$$

For random angles ( $N$ uniform independent points in $[0,\pi/2)$ ), the variance would be $\sim N/K$ . Thus we expect the Gaussian angles to display a marked deviation from randomness, in that there is a crossover from purely random behaviour for very short intervals ( $K\gg N^{1/2}$ ), to a saturation for moderately short intervals ( $1\ll K\ll N^{1/2}$ ), where the variance is smaller than that of random angles, so one can say that they display some measure of rigidity. See Figure 1 for numerical evidence. For an explanation of the underlying rigidity present here and for other deviations from randomness, see §2.
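The crossover at $K\approx N^{1/2}$ is easy to tabulate. The helper below (a hypothetical name) simply evaluates the conjectured ratio of the variance to the random-model value $N/K$. It makes no claim about the true variance and is only meaningful in the range $1\ll K\ll N^{1-o(1)}$ of the conjecture.

```python
import math


def conjectured_variance_ratio(N, K):
    """Predicted Var / (N/K) from Conjecture 1.2: min(1, 2 log K / log N)."""
    return min(1.0, 2.0 * math.log(K) / math.log(N))
```

For instance, with $N=10^{8}$ the ratio is about $1/2$ at $K=10^{2}$ and saturates at $1$ once $K\geq 10^{4}=N^{1/2}$.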

A related saturation effect was previously observed by Bui, Keating and Smith [2], in the context of computing the variance of sums in short intervals of coefficients of a fixed L-function of higher degree.

One of our main goals is to justify Conjecture 1.2. In § 3 we define a suitably smoothed version of the counting function $\mathcal{N}_{K,x}$ and express the corresponding variance in terms of zeros of a family of Hecke L-functions. This enables us, in § 4, to use GRH to give an upper bound for this variance and consequently deduce the almost-everywhere result of Theorem 1.1. Moreover, in § 5 we go on to develop a suitable random matrix theory model of this result, which gives a result corresponding to Conjecture 1.2. We now turn to formulating a similar problem in a function field setting, where we can prove an analogue of Conjecture 1.2.

1.3. A function field analogue

Let $\mathbb{F}_{q}$ be a finite field of cardinality $q$ , from now on assumed to be odd. We want to write prime (irreducible monic) polynomials as

$$P(T)=A(T)^{2}+TB(T)^{2}\tag{1.3}$$

with $A,B\in\mathbb{F}_{q}[T]$ , which is equivalent to the constant term $P(0)$ being a square in $\mathbb{F}_{q}$ (see e.g. [1]). If additionally $P(0)\neq 0$ , then there are exactly four such representations, obtained from (1.3) by changing the signs of $A$ and $B$ . This decomposition gives a factorization in $\mathbb{F}_{q}[T][\sqrt{-T}]=\mathbb{F}_{q}[\sqrt{-T}]$ as

$$P=\mathfrak{p}\cdot\tilde{\mathfrak{p}}=(A+\sqrt{-T}B)(A-\sqrt{-T}B)$$

and the corresponding factorization of the ideal $(P)\subset\mathbb{F}_{q}[T]$ into a pair of conjugate prime ideals of $\mathbb{F}_{q}[\sqrt{-T}]$ . The number $N$ of such prime polynomials $\mathfrak{p}(\sqrt{-T})$ of degree $\nu$ with $\mathfrak{p}(0)\neq 0$ satisfies

$$N=\frac{q^{\nu}}{\nu}+O\Big(\frac{q^{\nu/2}}{\nu}\Big)$$

by the Prime Polynomial Theorem in $\mathbb{F}_{q}[\sqrt{-T}]$ .
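The identity behind this factorization, $(A+\sqrt{-T}B)(A-\sqrt{-T}B)=A^{2}+TB^{2}$, can be verified mechanically. The sketch below assumes $q=p$ is an odd prime, so that arithmetic is plain arithmetic modulo $p$, and represents polynomials as coefficient lists with the lowest degree first. All helper names are hypothetical. It works in the variable $S=\sqrt{-T}$, that is, with $T=-S^{2}$.

```python
def poly_mul(f, g, p):
    """Product of two coefficient lists modulo p."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % p
    return out


def poly_add(f, g, p):
    """Sum of two coefficient lists modulo p (padded to equal length)."""
    n = max(len(f), len(g))
    f = f + [0] * (n - len(f))
    g = g + [0] * (n - len(g))
    return [(a + b) % p for a, b in zip(f, g)]


def t_to_s(A, p):
    """Rewrite A(T) as a polynomial in S via T = -S^2."""
    out = [0] * (2 * len(A) - 1)
    for j, a in enumerate(A):
        out[2 * j] = (a * (-1) ** j) % p
    return out


def conjugate(f, p):
    """The involution S -> -S."""
    return [(c * (-1) ** i) % p for i, c in enumerate(f)]


def s_to_t(f, p):
    """Inverse of t_to_s for a polynomial containing only even powers of S."""
    assert all(c % p == 0 for c in f[1::2])
    return [(c * (-1) ** j) % p for j, c in enumerate(f[0::2])]


def norm_in_t(A, B, p):
    """(A + S B)(A - S B), returned as a polynomial in T."""
    f = poly_add(t_to_s(A, p), [0] + t_to_s(B, p), p)
    return s_to_t(poly_mul(f, conjugate(f, p), p), p)


def sum_of_squares_form(A, B, p):
    """A(T)^2 + T B(T)^2, computed directly in F_p[T]."""
    return poly_add(poly_mul(A, A, p), [0] + poly_mul(B, B, p), p)
```

With $p=3$, $A=1+T$ and $B=1$, both routes give $1+T^{2}$. In every case the constant term is $A(0)^{2}$, a square in $\mathbb{F}_{p}$, in line with the criterion on $P(0)$.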

Denote by $S=\sqrt{-T}$ and consider the quadratic extension $\mathbb{F}_{q}(T)(\sqrt{-T})=\mathbb{F}_{q}(S)$ , which is still rational (genus zero). Let $\mathbb{F}_{q}[[S]]$ be the ring of formal power series. It is equipped with the Galois involution

$$\sigma:S\mapsto-S,\quad\sigma(f)(S)=f(-S),$$

and the norm map

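$$\operatorname{Norm}:\mathbb{F}_{q}[[S]]^{\times}\to\mathbb{F}_{q}[[T]]^{\times},\quad\operatorname{Norm}(f)=f(S)f(-S).$$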

We denote

$$\mathbb{S}^{1}:=\{g\in\mathbb{F}_{q}[[S]]^{\times}:g(0)=1,\ \operatorname{Norm}(g)=1\}$$

the formal power series with constant term $1$ and unit norm. This is a group, which is our analogue of the unit circle. It is important to note that since $q$ is odd, Hensel’s Lemma tells us that the square map $u\mapsto u^{2}$ is an automorphism of $\mathbb{S}^{1}$ , and in particular each element of $\mathbb{S}^{1}$ admits a unique square root $\sqrt{u}$ .

We put an absolute value $|f|=q^{-\operatorname{ord}(f)}$ on $\mathbb{F}_{q}[[S]]$ , where $\operatorname{ord}(f)=\max(j:S^{j}\mid f)$ . We then divide $\mathbb{S}^{1}$ into “sectors”

$$\operatorname{Sect}(u;k)=\{v\in\mathbb{S}^{1}:|v-u|\leq q^{-k}\}.$$

We denote by

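$$\mathbb{S}^{1}_{k}=\{f\in\mathbb{F}_{q}[S]/(S^{k}):f(0)=1,\ \operatorname{Norm}(f):=f(-S)f(S)=1\bmod S^{k}\}$$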

the elements of unit norm and constant term unity in $\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}$ . The group $\mathbb{S}^{1}_{k}$ parameterizes the different sectors. The order of $\mathbb{S}^{1}_{k}$ is

$$K:=\#\mathbb{S}^{1}_{k}=q^{\kappa},$$

where

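$$\kappa:=\Big\lfloor\frac{k}{2}\Big\rfloor,\quad\text{so that}\quad k=\begin{cases}2\kappa+1,&k\text{ odd},\\2\kappa,&k\text{ even}.\end{cases}$$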
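The order $q^{\lfloor k/2\rfloor}$ of $\mathbb{S}^{1}_{k}$ can be confirmed by brute force for small parameters. The sketch below assumes $q=p$ is an odd prime and uses hypothetical helper names. It enumerates every $f\in\mathbb{F}_{p}[S]/(S^{k})$ with $f(0)=1$ and keeps those with $f(S)f(-S)=1\bmod S^{k}$.

```python
from itertools import product


def mul_mod(f, g, p, k):
    """Product in F_p[S]/(S^k); f and g are coefficient lists of length k."""
    out = [0] * k
    for i, a in enumerate(f):
        if a:
            for j, b in enumerate(g[: k - i]):
                out[i + j] = (out[i + j] + a * b) % p
    return out


def unit_norm_group_order(p, k):
    """Count f mod S^k with f(0) = 1 and f(S) f(-S) = 1 mod S^k."""
    one = [1] + [0] * (k - 1)
    count = 0
    for tail in product(range(p), repeat=k - 1):
        f = [1, *tail]
        f_conj = [(c * (-1) ** i) % p for i, c in enumerate(f)]  # f(-S)
        if mul_mod(f, f_conj, p, k) == one:
            count += 1
    return count
```

The enumeration visits $p^{k-1}$ candidates, so it is only practical for very small $p$ and $k$.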

We next want to define the notion of direction (essentially an angle) for any nonzero polynomial $f=A(T)+\sqrt{-T}B(T)\in\mathbb{F}_{q}[\sqrt{-T}]$ . To motivate the definition below, recall that for a nonzero complex number $\alpha=|\alpha|e^{i\theta}$ , we have $\alpha/\overline{\alpha}=e^{2i\theta}$ . To any nonzero $f\in\mathbb{F}_{q}[S]$ which is coprime to $S$ , we associate a norm-one element $U(f)\in\mathbb{S}^{1}$ via the map

$$U:f\mapsto\sqrt{\frac{f}{\sigma(f)}}.$$

Note that since $f(0)\neq 0$ , $f/\sigma(f)$ has constant term one, lies in $\mathbb{F}_{q}[[S]]$ , and has unit norm, that is $f/\sigma(f)\in\mathbb{S}^{1}$ , and hence $\sqrt{f/\sigma(f)}\in\mathbb{S}^{1}$ exists and is unique. Moreover, $U(cf)=U(f)$ for all scalars $c\in\mathbb{F}_{q}^{\times}$ , so that if $f\in\mathbb{F}_{q}[S]$ then $U(f)$ only depends on the ideal $(f)\subset\mathbb{F}_{q}[S]$ generated by $f$ .
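To make the direction map concrete, here is a minimal sketch that computes $\sqrt{f/\sigma(f)}$ modulo $S^{k}$ by solving for the power series coefficients one at a time. It assumes $q=p$ is an odd prime and Python 3.8 or later (for `pow(x, -1, p)`). The function names are hypothetical and the truncation order `k` is a parameter of the sketch.

```python
def pad(f, p, k):
    """Reduce coefficients modulo p and truncate or pad to length k."""
    return [c % p for c in (list(f) + [0] * k)[:k]]


def conjugate(f, p):
    """The involution S -> -S."""
    return [(c * (-1) ** i) % p for i, c in enumerate(f)]


def series_mul(f, g, p, k):
    """Product of two length-k series modulo S^k."""
    out = [0] * k
    for i, a in enumerate(f):
        for j, b in enumerate(g[: k - i]):
            out[i + j] = (out[i + j] + a * b) % p
    return out


def series_inv(f, p, k):
    """Inverse modulo S^k of a series with nonzero constant term."""
    inv0 = pow(f[0], -1, p)
    g = [inv0] + [0] * (k - 1)
    for n in range(1, k):
        s = sum(f[i] * g[n - i] for i in range(1, n + 1)) % p
        g[n] = (-s * inv0) % p
    return g


def series_sqrt(h, p, k):
    """The square root with constant term 1 of a series h with h[0] == 1."""
    half = pow(2, -1, p)        # exists because p is odd
    g = [1] + [0] * (k - 1)
    for n in range(1, k):
        s = sum(g[i] * g[n - i] for i in range(1, n)) % p
        g[n] = ((h[n] - s) * half) % p
    return g


def direction(f, p, k):
    """sqrt(f / sigma(f)) modulo S^k, for f with f(0) != 0 in F_p."""
    f = pad(f, p, k)
    ratio = series_mul(f, series_inv(conjugate(f, p), p, k), p, k)
    return series_sqrt(ratio, p, k)
```

For $f=1+2S+3S^{2}$ over $\mathbb{F}_{7}$, the output `u = direction([1, 2, 3], 7, 6)` satisfies $u\,\sigma(u)=1\bmod S^{6}$, and replacing $f$ by $3f$ returns the same `u`.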

We want to count the number of prime ideals $(\mathfrak{p})\subset\mathbb{F}_{q}[S]$ with $\mathfrak{p}(0)\neq 0$ , whose directions $U(\mathfrak{p})$ lie in a given sector. For $u\in\mathbb{S}^{1}$ , let

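$$\mathcal{N}_{k,\nu}(u):=\#\{(\mathfrak{p})\text{ prime},\ \mathfrak{p}(0)\neq 0:\deg\mathfrak{p}=\nu,\ U(\mathfrak{p})\in\operatorname{Sect}(u,k)\}.$$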

The mean value is clearly

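$$\left\langle\mathcal{N}_{k,\nu}\right\rangle:=\frac{1}{q^{\kappa}}\sum_{u\in\mathbb{S}^{1}_{k}}\mathcal{N}_{k,\nu}(u)=\frac{N}{K}\sim\frac{q^{\nu}/\nu}{q^{\kappa}}.$$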

For $k\leq\nu$ we can show (see Corollary 6.5) that as $q\to\infty$ ,

$$\mathcal{N}_{k,\nu}(u)=\frac{N}{K}+O(q^{\nu/2})$$

which gives an asymptotic result if $\kappa<\nu/2$ . For larger values of $\kappa$ , there are sectors which do not contain prime directions, as in the number field case, see Remark 6.6.

Our main result is the computation, in the large $q$ limit, of the number variance

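$$\operatorname{Var}(\mathcal{N}_{k,\nu}):=\frac{1}{q^{\kappa}}\sum_{u\in\mathbb{S}^{1}_{k}}\Big|\mathcal{N}_{k,\nu}-\left\langle\mathcal{N}_{k,\nu}\right\rangle\Big|^{2}.$$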

Theorem 1.3.

Assume that $\kappa\geq 3$ , or if $\kappa=2$ that $5\nmid q$ . Then as $q\to\infty$ ,

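$$\operatorname{Var}(\mathcal{N}_{k,\nu})\sim\frac{q^{\nu-\kappa}}{\nu^{2}}\times\begin{cases}2\kappa-2,&\nu\geq 2\kappa-2,\\\nu-1+\eta(\nu),&\kappa\leq\nu\leq 2\kappa-2,\end{cases}$$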

where $\eta(\nu)=1$ if $\nu$ is even, and [math] otherwise.

To compare it to our number field conjecture, here the number of sectors is $K=q^{\kappa}$ , the number of directions (the number of Gaussian prime ideals $\mathfrak{p}$ of degree $\nu$ ) is $N\sim q^{\nu}/\nu$ , so that the expected value is $N/K$ , and the variance satisfies, as $q\to\infty$ ,

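$$\frac{\operatorname{Var}(\mathcal{N}_{k,\nu})}{N/K}\sim\begin{cases}2\frac{\log_{q}K}{\log_{q}N}-\frac{2}{\log_{q}N},&\log_{q}K\leq\frac{1}{2}\log_{q}N+1,\\1+\frac{\eta(\log_{q}N)-1}{\log_{q}N},&\frac{1}{2}\log_{q}N+1\leq\log_{q}K\leq\log_{q}N.\end{cases}$$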

Our Conjecture 1.2 for the number-field variance is

$$\frac{\operatorname{Var}(\mathcal{N}_{K,N})}{N/K}\sim\min\Big(1,2\frac{\log K}{\log N}\Big)$$

which is analogous to the above.

Acknowledgments. We thank Steve Lester for his help at the beginning of the project, and Jon Keating, Corentin Perret-Gentil and Peter Sarnak for their comments.

The research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007-2013) / ERC grant agreement no. 320755.

2. Repulsion between angles

2.1. Repulsion and its consequences

Let $\mathfrak{a}$ be a nonzero ideal in ${\mathbb{Z}}[i]$ . If $\mathfrak{a}=\langle\alpha\rangle$ is generated by the Gaussian integer $\alpha$ , we associate a direction vector $u(\mathfrak{a}):=u(\alpha)\in\mathbb{S}^{1}_{\mathbb{Q}}$ . Since all generators of the ideal differ by multiplication by a unit ${\mathbb{Z}}[i]^{\times}=\{\pm 1,\pm i\}$ , the direction vector $u(\mathfrak{a})=e^{i4\theta_{\mathfrak{a}}}$ is well-defined on ideals, while the angle $\theta_{\mathfrak{a}}$ is only defined modulo $\pi/2$ . We can choose $\theta_{\mathfrak{a}}$ to lie, say, in $[0,\pi/2)$ , corresponding to taking $\alpha=a+ib$ , with $a>0$ , $b\geq 0$ . If $\mathfrak{a}=\langle\alpha\rangle$ for non-zero $\alpha\in{\mathbb{Z}}$ , then $\theta_{\mathfrak{a}}=0$ .

Lemma 2.1.

i) If $\theta_{\mathfrak{a}}\neq 0$ then

$$\theta_{\mathfrak{a}}\gg\frac{1}{\sqrt{\operatorname{Norm}\mathfrak{a}}}.$$

ii) If $\mathfrak{p}\neq\mathfrak{q}$ are ideals with distinct angles $\theta_{\mathfrak{p}}\neq\theta_{\mathfrak{q}}$ then

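$$|\theta_{\mathfrak{p}}-\theta_{\mathfrak{q}}|\geq\frac{1}{\sqrt{\operatorname{Norm}\mathfrak{p}\operatorname{Norm}\mathfrak{q}}}.$$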

Proof.

i) Write $\mathfrak{a}=\langle a+ib\rangle$ with $a,b>0$ . Then

[TABLE]

Since we may assume that $\theta_{\mathfrak{a}}\in(0,\pi/4)$ , we have $\tan\theta_{\mathfrak{a}}\leq\sqrt{2}\theta_{\mathfrak{a}}$ which gives our claim.

ii) Write $\mathfrak{p}=\langle a+ib\rangle$ , $\mathfrak{q}=\langle c+id\rangle$ , with $a,b>0$ and $c>0$ , $d\geq 0$ . Consider the triangle having vertices at the origin, $a+ib$ and $c+id$ . Since $\theta_{\mathfrak{p}}\neq\theta_{\mathfrak{q}}$ , its area is positive and being a lattice triangle, its area is at least $1/2$ .

On the other hand, its area is given in terms of the angle $\theta_{\mathfrak{p}}-\theta_{\mathfrak{q}}$ between the sides $a+ib$ and $c+id$ as

[TABLE]

Thus we find

[TABLE]

and hence

[TABLE]

∎
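The lattice-triangle argument applies to any two Gaussian integers with distinct directions, so part (ii) can be sanity-checked by brute force over all pairs of small norm. The sketch below uses hypothetical function names and takes angles in $[0,\pi/2)$ as in the proof. It tests the inequality $|\theta_{1}-\theta_{2}|\geq 1/\sqrt{\operatorname{Norm}_{1}\operatorname{Norm}_{2}}$ in floating point.

```python
import math
from itertools import combinations


def first_quadrant_points(x):
    """Gaussian integers a+ib with a > 0, b >= 0 and a^2 + b^2 <= x."""
    return [(a, b)
            for a in range(1, math.isqrt(x) + 1)
            for b in range(0, math.isqrt(x - a * a) + 1)]


def repulsion_holds(x):
    """Check the angle repulsion bound for all pairs with distinct angles."""
    for (a, b), (c, d) in combinations(first_quadrant_points(x), 2):
        if a * d == b * c:
            continue            # same direction, so the angles coincide
        gap = abs(math.atan2(b, a) - math.atan2(d, c))
        bound = 1 / math.sqrt((a * a + b * b) * (c * c + d * d))
        if gap < bound:
            return False
    return True
```

The check passes because twice the triangle area is $|ad-bc|\geq 1$ and equals $\sqrt{\operatorname{Norm}_{1}\operatorname{Norm}_{2}}\,\sin|\theta_{1}-\theta_{2}|$, while $\sin t<t$ for $t>0$.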

Lemma 2.1 implies that the interval $\{0<\theta<1/\sqrt{x}\}$ will contain no angles $\theta_{\mathfrak{p}}$ for $\operatorname{Norm}\mathfrak{p}\ll x$ , so that the number $\mathcal{N}_{K,x}$ of prime angles $\theta_{\mathfrak{p}}$ in this interval is zero. Hence we cannot expect an asymptotic formula $\mathcal{N}_{K,x}\sim N/K$ to hold for all intervals if $K\gg N^{1/2}$ , since such intervals are shorter than this gap, while it does hold (assuming GRH) for larger intervals. Theorem 1.1 guarantees that almost all intervals will contain angles if $K\ll N^{1-o(1)}$ .

2.2. Deviations from randomness

The existence of a “big hole” as above displays a striking deviation from randomness of the angles, when compared to $N$ random angles in $[0,\pi/2)$ . For these, the maximal gap is almost surely of order $\log N/N$ , while Lemma 2.1(i) guarantees a much larger gap, of size $N^{-1/2-o(1)}$ .

Another statistic which indicates that Gaussian angles behave differently than random points is the minimal spacing statistic: For $N$ random angles in $[0,\pi/2)$ as above, the smallest gap is almost surely of size $\approx 1/N^{2}$ [13]. In contrast, the minimal gap between the angles $\{\theta_{\mathfrak{p}}\neq 0:\operatorname{Norm}\mathfrak{p}\leq x\}$ is by Lemma 2.1

[TABLE]

which is much bigger than the random case.

2.3. The variance in the trivial regime

We want to study fluctuations in the number $\mathcal{N}_{K,x}$ of angles falling in “random” short intervals. Take the interval length $1/K=o(1/x)$ , so that the number $K$ of intervals is much larger than the number $N\sim x/\log x$ of angles: $N=o(K)$ . Then for almost all intervals, we do not have any angles $\theta_{\mathfrak{p}}$ in the interval $I_{K}(\theta)$ . Nonetheless we can compute the variance in this “trivial” regime.

Proposition 2.2.

If $x=o(K)$ then

[TABLE]

Proof.

We recall definition (1.2): Given an interval $I_{K}(\theta)=[\theta-\frac{\pi}{4K},\theta+\frac{\pi}{4K}]$ of length $\pi/(2K)$ centered at $\theta$ , let (abusing notation, we use the same symbol for the interval and its indicator function)

[TABLE]

be the number of prime angles $\theta_{\mathfrak{p}}$ in $I_{K}(\theta)$ . We will take the center $\theta$ of the interval to be random, that is uniform in $(0,\pi/2)$ .

We compute the second moment of $\mathcal{N}=\mathcal{N}_{K,x}$ using its definition

[TABLE]

where throughout we use

[TABLE]

The contribution of pairs of inert primes, where $\theta_{\mathfrak{p}}=0$ , $\mathfrak{p}=\langle p\rangle$ , $p=3\bmod 4$ , $\operatorname{Norm}\mathfrak{p}=p^{2}\leq x$ , is

[TABLE]

Note that $I_{K}^{2}=I_{K}$ and

[TABLE]

Moreover, the number of $p=3\bmod 4$ , $p\leq\sqrt{x}$ is $\ll\sqrt{x}/\log x$ . Hence the contribution of pairs of inert primes is $O\Big{(}\frac{x}{K(\log x)^{2}}\Big{)}$ .

If $\mathfrak{p}\neq\mathfrak{q}$ and at least one of $\mathfrak{p}$ , $\mathfrak{q}$ is not inert, so that $\theta_{\mathfrak{p}}\neq\theta_{\mathfrak{q}}$ , then Lemma 2.1 gives

[TABLE]

For the integral $\left\langle I_{K}(\theta_{\mathfrak{p}}-\theta)I_{K}(\theta_{\mathfrak{q}}-\theta)\right\rangle$ to be nonzero, it is necessary that there be some $\theta$ so that both $\theta_{\mathfrak{p}},\theta_{\mathfrak{q}}\in I_{K}(\theta)$ , which forces the distance between the two angles to be at most $\pi/(2K)$ :

[TABLE]

Hence if $x=o(K)$ then such off-diagonal pairs contribute nothing.

We conclude that the second moment of $\mathcal{N}_{K,x}$ is essentially given by the sum of the diagonal terms

[TABLE]

We can now compute the variance:

[TABLE]

Since $N=o(K)$ we find

[TABLE]

as claimed. ∎

3. Almost all sectors contain an angle

3.1. A smooth count

Our goal in this section is to prove Theorem 1.1, which claims, assuming GRH for the family of Hecke L-functions, that in the non-trivial range $K\ll X^{1-\epsilon}$ , almost all arcs of size $\approx 1/K$ contain at least one angle $\theta_{\mathfrak{p}}$ , $\operatorname{Norm}(\mathfrak{p})\leq X$ .

To count the number of angles $\theta_{\mathfrak{p}}$ lying in a short segment of $[0,\pi/2)$ , pick a window function $f\in C_{c}^{\infty}({\mathbb{R}})$ , which we take to be even and real valued, and for $K\gg 1$ define

[TABLE]

which is $\pi/2$ -periodic, and localized on a scale of $1/K$ . The Fourier expansion of $F_{K}$ is

[TABLE]

where the Fourier transform is normalized as $\widehat{f}(y)=\int_{-\infty}^{\infty}f(x)e^{-2\pi iyx}dx$ . Note that since $f$ is even and real valued, the same holds for $\widehat{f}$ .

Let $\Phi\in C_{c}^{\infty}(0,\infty)$ . Now set

[TABLE]

the sum over all prime ideals of ${\mathbb{Z}}[i]$ , which gives a smooth count of prime angles $\theta_{\mathfrak{p}}$ lying in a smooth window defined by $F_{K}$ around $\theta$ . We also define

[TABLE]

the sum over all powers of prime ideals, with the von Mangoldt function $\Lambda(\mathfrak{a})=\log\operatorname{Norm}(\mathfrak{p})$ if $\mathfrak{a}=\mathfrak{p}^{r}$ is a power of a prime ideal $\mathfrak{p}$ , and equal to zero otherwise.

We next compute the mean value.

Lemma 3.1.

The mean values of $\psi_{K,X}$ and $\psi_{K,X}^{\rm prime}$ are asymptotically

[TABLE]

Moreover,

[TABLE]

Proof.

The mean value is

[TABLE]

We can evaluate this using the Prime Ideal Theorem to obtain:

[TABLE]

and likewise for $\left\langle\psi_{K,X}^{\rm prime}\right\rangle$ . If in addition we use GRH, we obtain a remainder term of $O(\frac{X^{1/2}}{K})$ for both.

We bound the difference by

[TABLE]

which shows that the mean values are close. ∎

Note that the inert primes $\mathfrak{p}=\langle p\rangle$ give angle $\theta_{\mathfrak{p}}=0$ , but that $\operatorname{Norm}\mathfrak{p}=p^{2}$ so that in $\psi_{K,X}^{\rm prime}$ , we get a contribution of size $\sqrt{X}$ if $\theta\approx 0$ . This is significantly larger than the mean value if $K\gg X^{1/2}$ .

3.2. Variance in the trivial regime

The variance of $\psi_{K,X}^{\rm prime}$ in the trivial regime $X=o(K)$ is:

[TABLE]

where

[TABLE]

Indeed, if $X=o(K)$ then the same argument of repulsion between angles as in § 2.3 allows us to compute the second moment as asymptotically equal to the sum over the diagonal pairs

[TABLE]

By Parseval’s theorem, we have

[TABLE]

and

[TABLE]

by the Prime Ideal Theorem. This gives the second moment as

[TABLE]

and since $X=o(K)$ , we obtain (3.3) for $\operatorname{Var}(\psi_{K,X})$ . The argument for $\operatorname{Var}(\psi_{K,X}^{\rm prime})$ is identical.

3.3. An upper bound

We give an upper bound on the variance of $\psi_{K,X}^{\rm prime}$ in the non-trivial regime $K\ll X$ , assuming GRH.

Theorem 3.2.

Assume GRH. Then

[TABLE]

From this bound we easily deduce Theorem 1.1: We use Chebyshev’s inequality and Theorem 3.2 to deduce

[TABLE]

Taking $X=K(\log K)^{2+o(1)}$ we find that for almost all $\theta$ ,

[TABLE]

is nonzero. Therefore the sum defining $\psi_{K,X}^{\rm prime}$ is non-empty, and since it is a sum over prime ideals giving angles $\theta_{\mathfrak{p}}$ in the arc of length $\approx 1/K$ around $\theta$ , we find that for almost all $\theta$ , such arcs contain an angle $\theta_{\mathfrak{p}}$ for a prime ideal with $\operatorname{Norm}(\mathfrak{p})\leq X=K(\log K)^{2+o(1)}$ . ∎
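The Chebyshev step can be written out as follows. This is a hedged sketch and not the paper's displayed computation: it assumes that the bound of Theorem 3.2 has the shape $\operatorname{Var}(\psi_{K,X}^{\rm prime})\ll X(\log K)^{2}/K$ and that the mean value $\langle\psi_{K,X}^{\rm prime}\rangle$ is of exact order $X/K$ . Both shapes are assumptions made here for illustration, the constants are not tracked, and $\operatorname{meas}$ denotes the normalized measure $\frac{d\theta}{\pi/2}$ .

```latex
\operatorname{meas}\Big\{\theta:
  \big|\psi_{K,X}^{\rm prime}(\theta)-\langle\psi_{K,X}^{\rm prime}\rangle\big|
  \geq\tfrac{1}{2}\langle\psi_{K,X}^{\rm prime}\rangle\Big\}
\leq\frac{4\operatorname{Var}(\psi_{K,X}^{\rm prime})}
         {\langle\psi_{K,X}^{\rm prime}\rangle^{2}}
\ll\frac{X(\log K)^{2}/K}{(X/K)^{2}}
=\frac{K(\log K)^{2}}{X}.
```

Under these assumptions, the right-hand side tends to zero when $X=K(\log K)^{2+o(1)}$ with the $o(1)$ exponent decaying slowly enough, so outside a set of $\theta$ of vanishing measure $\psi_{K,X}^{\rm prime}(\theta)$ is at least half its mean and in particular nonzero.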

The proof of Theorem 3.2 will be presented in § 4.4.

4. Relation to zeros of Hecke L-functions

4.1. Hecke characters and their L-functions

The Hecke characters $\Xi_{k}(\alpha)=(\alpha/\bar{\alpha})^{2k}$ , $k\in{\mathbb{Z}}$ , give well defined functions on the ideals of ${\mathbb{Z}}[i]$ . In terms of the angles associated to ideals, we have $e^{i4k\theta_{\mathfrak{p}}}=\Xi_{k}(\mathfrak{p})$ .
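That $\Xi_{k}$ does not depend on the choice of generator can be checked numerically: multiplying $\alpha$ by a unit $u\in\{\pm 1,\pm i\}$ multiplies $\alpha/\bar{\alpha}$ by $u^{2}=\pm 1$ , which is killed by the even exponent $2k$ . The sketch below uses Python complex numbers and a hypothetical function name.

```python
import cmath

UNITS = (1, -1, 1j, -1j)   # the unit group of Z[i]


def hecke_character(alpha, k):
    """Xi_k(alpha) = (alpha / conj(alpha))**(2k) for nonzero complex alpha."""
    return (alpha / alpha.conjugate()) ** (2 * k)
```

For $\alpha=2+i$ the value agrees, up to rounding, with $e^{i4k\theta}$ for $\theta=\arg\alpha$ , and it is unchanged when $\alpha$ is replaced by $u\alpha$ .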

To each such character Hecke [5] associated its L-function

[TABLE]

Note that $L(s,\Xi_{k})=L(s,\Xi_{-k})$ . Hecke showed that if $k\neq 0$ , these functions have an analytic continuation to the entire complex plane, and satisfy a functional equation:

[TABLE]

The completed L-function $\xi_{k}(s)$ has all its zeros in the critical strip $0<\operatorname{Re}(s)<1$ (the non-trivial zeros of $L(s,\Xi_{k})$ ), and the Generalized Riemann Hypothesis asserts that they all lie on the critical line $\operatorname{Re}(s)=1/2$ . The growth of the number of nontrivial zeros of $L(s,\Xi_{k})$ in a fixed rectangle is

[TABLE]

in other words, the density of zeros is $\frac{\log|k|}{\pi}$ .

Lemma 4.1.

[TABLE]

and

[TABLE]

Proof.

Inserting the Fourier expansion (3.1) of $F_{K}$ gives

[TABLE]

Now note that $e^{i4k\theta_{\mathfrak{p}}}=\Xi_{k}(\mathfrak{p})$ is the Hecke character, to obtain (4.4). The same argument gives (4.3). ∎

The zero mode $k=0$ in (4.4) is the mean value (3.2). The same holds for $\psi_{K,X}$ .

4.2. An Explicit Formula

Proposition 4.2.

Let $\Phi\in C_{c}^{\infty}(0,\infty)$ , and

[TABLE]

be its Mellin transform. Then for $k\neq 0$ and $X\gg_{\Phi}1$ ,

[TABLE]

where the sum on the RHS is over all non-trivial zeros of $L(s,\Xi_{k})$ .

Proof.

We abbreviate $L_{k}(s):=L(s,\Xi_{k})$ . Using Mellin inversion $\Phi(x)=\frac{1}{2\pi i}\int_{\operatorname{Re}(s)=2}\tilde{\Phi}(s)x^{-s}ds$ we obtain

[TABLE]

In terms of the completed L-function $\xi_{k}(s)$ , the logarithmic derivative of $L(s,\Xi_{k})$ is

[TABLE]

Inserting into the above gives

[TABLE]

We shift the contour in the integral to $\operatorname{Re}(s)=-1$ , picking up the poles of $-\frac{\xi_{k}^{\prime}}{\xi_{k}}(s)$ , which are all simple poles with residue $-1$ at the non-trivial zeros of $L_{k}(s)$ , giving

[TABLE]

Changing variables $s\mapsto 1-s$ gives

[TABLE]

The functional equation (4.1) of $L(s,\Xi_{k})$ implies

[TABLE]

which gives

[TABLE]

Returning to the incomplete L-function gives

[TABLE]

By Mellin inversion,

[TABLE]

which vanishes for $X\gg 1$ as $\Phi$ is compactly supported in $(0,\infty)$ . Likewise,

[TABLE]

since each term vanishes for $X\gg 1$ (independently of $\mathfrak{a}$ , since $\operatorname{Norm}(\mathfrak{a})\geq 1$ ).

Collecting terms, we find

[TABLE]

as claimed. ∎

Lemma 4.3.

For $k\neq 0$ ,

[TABLE]

Proof.

Note that the integrand is analytic in $-2<\operatorname{Re}(s)<3$ , so we may shift the contour of integration to $\operatorname{Re}(s)=1/2$ . Let

[TABLE]

The integral is essentially $X^{1/2}$ times the Fourier transform $\widehat{h}_{k}(\log X)$ , that is

[TABLE]

We can estimate the derivatives of $h_{k}(t)$ by using Stirling’s formula and the rapid decay of $\tilde{\Phi}(\frac{1}{2}+it)$ as being bounded by

[TABLE]

Hence integration by parts shows that the Fourier transform of $h_{k}$ is bounded by

[TABLE]

which proves the Lemma. ∎

From Lemma 4.1, Proposition 4.2 and Lemma 4.3 we deduce:

Corollary 4.4.

Assume GRH. Then

[TABLE]

Averaging Corollary 4.4 over $\theta$ we find

Corollary 4.5.

Assume GRH. Then

[TABLE]

Corollary 4.6.

Assume GRH. Then

[TABLE]

Proof.

We use GRH to obtain $|X^{i\gamma_{k,n}}|=1$ so that

[TABLE]

We use a standard bound for the number of zeros of $L(s,\Xi_{k})$ in an interval (see [6, Proposition 5.7]):

[TABLE]

Note that $\tilde{\Phi}$ decays rapidly in vertical strips, say

[TABLE]

which together with (4.6) gives

[TABLE]

Inserting (4.7) into Corollary 4.5 gives

[TABLE]

as claimed. ∎

4.3. Primes vs prime powers

We pass from a sum over prime ideals to a sum over all prime powers:

Lemma 4.7.

Assume GRH. For $k\neq 0$ such that $\log|k|\ll\log X$ ,

[TABLE]

Proof.

We denote

[TABLE]

and

[TABLE]

Assuming GRH, we have

[TABLE]

Indeed, from the Explicit Formula (Proposition 4.2), Lemma 4.3 and GRH we have

[TABLE]

on using the density of zeros of $L(s,\Xi_{k})$ (4.2).

Next we crudely bound the contribution $\Sigma_{\geq 2}(X,k,\Phi)$ to $\Sigma_{\rm all}(X,k,\Phi)$ of the higher prime powers $\mathfrak{p}^{j}$ , $j\geq 2$ :

[TABLE]

Therefore we obtain a crude a priori bound on the contribution of primes:

[TABLE]

We now seek a more refined estimate. In the sum $\Sigma_{\rm all}(X,k,\Phi)$ over all prime powers, we separately treat the contributions of primes, of squares of primes, and of higher powers:

[TABLE]

where

[TABLE]

and

[TABLE]

By definition,

[TABLE]

where $\Phi_{2}(u)=\Phi(u^{2})$ . Therefore inputting the a priori bound (4.8) (which uses GRH to get cancellation) gives

[TABLE]

For the contribution of higher powers, we use

[TABLE]

Thus we obtain

[TABLE]

which gives us the result since $\log|k|\ll\log X$ . ∎

Lemma 4.8.

Assume GRH. Then

[TABLE]

Proof.

We use Lemma 4.1 to write

[TABLE]

The term $k=0$ is the difference between mean values, which by Lemma 3.1 is $O(X^{1/2}/K)$ . Hence

[TABLE]

say. Hence it suffices to show that $\left\langle I^{2}\right\rangle\ll X^{2/3}/K$ .

We have

[TABLE]

By Lemma 4.7, the sum over non-prime $\mathfrak{a}$ is $O(X^{1/3})$ (assuming $\log K\ll\log X$ ), and therefore

[TABLE]

as desired. ∎

4.4. Proof of Theorem 3.2

We want to show that

[TABLE]

where

[TABLE]

is the standard $L^{2}$ norm on $[0,\pi/2]$ .

Using the triangle inequality, we have

[TABLE]

By Lemma 4.8

[TABLE]

by Corollary 4.6,

[TABLE]

and by Lemma 3.1, the mean values are close:

[TABLE]

Thus we obtain

[TABLE]

hence

[TABLE]

which proves Theorem 3.2. ∎

5. A random matrix theory model

In this section we present a conjecture for the variance of the smooth count $\psi_{K,X}$ :

Conjecture 5.1.

[TABLE]

where

[TABLE]

Note that Conjecture 5.1 coincides with our result (3.3) in the trivial regime $K\gg X$ .

To recover Conjecture 1.2 from Conjecture 5.1, we can (at a heuristic level) pass to an actual count with sharp cutoffs: Take $f=\mathbf{1}_{[-1/2,1/2]}$ and $\Phi=\mathbf{1}_{(0,1]}$ , and replace the weight $\Lambda(\mathfrak{p})$ by $\log X$ throughout, and ignore the contribution of higher powers of primes.

We use Corollary 4.5 with $X=K^{\alpha}$ for $\alpha>0$ , and note that since $\widehat{f}$ is even, and $\xi_{-k}(s)=\xi_{k}(s)$ , we can pass to a sum over positive $k$ ’s, to obtain

[TABLE]

the inner sums being over all non-trivial zeros of $L(s,\Xi_{k})$ ; we have ignored the remainder term in Corollary 4.5, as it can be seen to be $o(X/K)$ by using (4.7).

Let

[TABLE]

and

[TABLE]

Since the density of zeros of $L(s,\Xi_{k})$ is about $\log|k|$ , the sum in $\mathcal{S}_{n}(\Xi_{k})$ is over $O(\log K)$ zeros.

Conjecture 5.1 is clearly implied by

Conjecture 5.2.

Fix $\alpha>0$ . Then as $K\to\infty$ ,

[TABLE]

5.1. The model

We model the sum $\mathcal{S}_{n}(\Xi_{k})$ by replacing the zeros of $L(s,\Xi_{k})$ by the eigenvalues of a fictitious $N\times N$ (diagonal) unitary matrix

[TABLE]

We may want to require that $U$ be symplectic (or orthogonal), in which case $N=2g$ is even and the eigenphases $\gamma_{j}$ will come in conjugate pairs $\gamma_{N-j}=-\gamma_{j}$ , $j=1,\dots,g$ .

We choose $N$ so that the density of angles, namely $N$ , matches the density of zeros of $L(s,\Xi_{k})$ by requiring

[TABLE]

We replace $\tilde{\Phi}(\frac{1}{2}+i\gamma)$ by a periodic function $w(\gamma)=w(\gamma+1)$ , to get a linear statistic

[TABLE]

Expanding $w(\gamma)=\sum_{\ell\in{\mathbb{Z}}}\widehat{w}(\ell)e^{2\pi i\ell\gamma}$ in a Fourier series we obtain

[TABLE]
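The passage from a linear statistic of the eigenphases to traces of powers of $U$ can be checked numerically. The following Python sketch is an illustration only: it takes a diagonal unitary matrix with eigenvalues $e^{2\pi i\gamma_{j}}$ and a trigonometric polynomial $w$ with finitely many Fourier coefficients, and compares $\sum_{j}w(\gamma_{j})e^{2\pi in\gamma_{j}}$ with $\sum_{\ell}\widehat{w}(\ell)\operatorname{tr}U^{n+\ell}$ . The twist by $n$ is an assumption standing in for the oscillating factor in the sum over zeros (with $n=0$ it is the plain Fourier expansion of $w$ ), and the function names and sample data are hypothetical.

```python
import cmath
import math

def trace_power(phases, m):
    """tr U^m for the diagonal unitary matrix with eigenvalues e^{2 pi i gamma_j}."""
    return sum(cmath.exp(2j * math.pi * m * gamma) for gamma in phases)

def linear_statistic_direct(phases, coeffs, n=0):
    """Evaluate sum_j w(gamma_j) e^{2 pi i n gamma_j} directly from the eigenphases.

    coeffs maps a frequency l to the Fourier coefficient w_hat(l).
    """
    total = 0
    for gamma in phases:
        w = sum(c * cmath.exp(2j * math.pi * l * gamma) for l, c in coeffs.items())
        total += w * cmath.exp(2j * math.pi * n * gamma)
    return total

def linear_statistic_fourier(phases, coeffs, n=0):
    """Evaluate the same quantity as sum_l w_hat(l) tr U^{n+l}."""
    return sum(c * trace_power(phases, n + l) for l, c in coeffs.items())

# Hypothetical sample data: four eigenphases and a three-term test function.
phases = [0.1, 0.37, 0.52, 0.9]
coeffs = {-1: 0.25, 0: 0.5, 1: 0.25}
print(linear_statistic_direct(phases, coeffs, n=3))
print(linear_statistic_fourier(phases, coeffs, n=3))
```

The two printed values agree up to rounding, since the identity is just an exchange of two finite sums.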

We obtain the following model for the sum (5.3):

[TABLE]

where the unitary matrices $U_{k}$ are picked uniformly and independently from a certain subgroup $G(N)\subseteq U(N)$ of unitary $N\times N$ matrices, $N\approx\frac{1}{\pi}\log K$ , say $G(N)=U(N)$ is the full unitary group, or the symplectic group $G(N)={\rm USp}(N)$ (possible only when $N$ is even).

We now replace the discrete average $\frac{2}{K}\sum_{k>0}\widehat{f}\left(\frac{k}{K}\right)^{2}H(U_{k})$ by the continuous average $c_{f}\int_{G(N)}H(U)dU$ with respect to the Haar probability measure on $G(N)$ , with $c_{f}$ chosen so that the two averages coincide when the test function $H(U)\equiv 1$ is constant, that is

[TABLE]

(recalling that $f$ is even and real valued). Therefore we model (5.3) by the matrix integral

[TABLE]

where $n\approx N$ grows linearly with the matrix size $N$ ; precisely, under the correspondence (5.4) and (5.2), $n\longleftrightarrow\frac{\alpha}{2}\frac{\log K}{\pi}$ , which is assumed to be an integer.

We claim that for all the classical groups ( $G=$ U, USp, O) under these conditions the answer is

Proposition 5.3.

For $G=$ $\rm U$ , $\rm USp$ , $\rm O$ , and $n\approx N$ , as $N\to\infty$

[TABLE]

Therefore we are led to Conjecture 5.2, once we understand the analogue of $\int_{0}^{1}|w(\gamma)|^{2}d\gamma$ . Recall that $w(\gamma)$ corresponded to $\tilde{\Phi}(\frac{1}{2}+i\gamma)$ , which we can write in terms of $\phi(t):=\Phi(e^{t})e^{t/2}$ as

[TABLE]

Hence $\int_{0}^{1}|w(\gamma)|^{2}d\gamma$ corresponds to

[TABLE]

Thus we obtain Conjecture 5.2

[TABLE]

5.2. Proof of Proposition 5.3

Proof.

We use the Fourier expansion (5.5) to obtain

[TABLE]

We trivially have $|\operatorname{tr}U^{m}|\leq N$ , and since $n\approx N$ and $\widehat{w}$ is rapidly decreasing, only the terms with say $m,m^{\prime}=n+O(\log N)$ contribute anything non-negligible. Thus

[TABLE]

The unitary case $G(N)=U(N)$ :

We use Dyson’s lemma [3]

[TABLE]

In particular only the diagonal terms contribute. In our case, $m,m^{\prime}\sim n$ are nonzero, hence we get

[TABLE]

Since $m$ varies very little around $n$ , we can replace $\min(|m|,N)$ by $\min(n,N)$ with negligible error to obtain

[TABLE]

by Plancherel.
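The second moment $\int_{U(N)}|\operatorname{tr}U^{m}|^{2}dU=\min(|m|,N)$ for $m\neq 0$ , which is the diagonal case used above, can be confirmed for small $N$ without random sampling. The sketch below is not part of the argument: it assumes the Weyl integration formula for $U(N)$ (eigenphase density proportional to the squared Vandermonde determinant), and uses the fact that the integrand is a trigonometric polynomial, so that an average over a sufficiently fine grid of roots of unity is exact. The function name is ours.

```python
import cmath
import math
from itertools import product

def unitary_trace_second_moment(N, m):
    """Haar average of |tr U^m|^2 over U(N), computed by Weyl integration.

    Each eigenvalue appears with frequency at most N - 1 + m in the integrand,
    so averaging over M-th roots of unity with M > N - 1 + m is exact.
    """
    M = N + m + 1
    roots = [cmath.exp(2j * math.pi * t / M) for t in range(M)]
    total = 0.0
    for z in product(roots, repeat=N):
        trace = sum(w ** m for w in z)
        vandermonde_sq = 1.0
        for a in range(N):
            for b in range(a + 1, N):
                vandermonde_sq *= abs(z[a] - z[b]) ** 2
        total += abs(trace) ** 2 * vandermonde_sq
    # 1/N! from the Weyl formula, 1/M^N from the grid average.
    return total / (math.factorial(N) * M ** N)

for N in (1, 2, 3):
    print(N, [round(unitary_trace_second_moment(N, m), 6) for m in range(1, 6)])
```

For each $N$ the printed values grow linearly in $m$ and then saturate at $N$ , which is the behaviour that produces $\min(n,N)$ in the unitary case.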

The symplectic case $G(N)={\rm USp}(2g)$ :

The expected values for the symplectic group ( $N=2g$ ) are [8, Lemma 2]

i) If $m=n$ then

[TABLE]

ii) If $1\leq m<n$

[TABLE]

and in particular, if $m\neq m^{\prime}$ (and neither is zero) then

[TABLE]

while for $m=m^{\prime}\neq 0$ we obtain

[TABLE]

so that

[TABLE]

The second term is $O(\log N)$ , while the first is as in the unitary case, so that again we recover

[TABLE]

For the orthogonal group $G(N)={\rm SO}(N)$ with $N$ even, we have the same result because (5.7), (5.8) are still valid (see [8, Lemma 2]). ∎

6. A function field model

6.1. The group of sectors

Our goal in this section is to formulate and prove an analogue of Conjecture 1.2 and of Conjecture 5.1 in the setting of the ring of polynomials over a finite field of $q$ elements ( $q$ odd), in the limit of large $q$ . Using the notation in the Introduction (Katz [7, §2] denotes $B^{\times}_{\rm even}=H_{k}$ , and $B^{\times}_{\rm odd}=\mathbb{S}^{1}_{k}$ ), we denote by

[TABLE]

the elements of unit norm and constant term $1$ in $\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}$ , and

[TABLE]

the subgroup of even polynomials.

Lemma 6.1.

[7, Lemma 2.1] i) We have a direct product decomposition

[TABLE]

ii) The order of $\mathbb{S}^{1}_{k}$ is

[TABLE]

where $\kappa:=k-1-\lfloor\frac{k-1}{2}\rfloor=\lfloor\frac{k}{2}\rfloor$ , so that

[TABLE]

Proof.

i) is stated in [7] for $k$ even, but the proof is valid for arbitrary $k\geq 1$ .

ii) The order of $H_{k}$ is

[TABLE]

since we can write any element of $H_{k}$ as

[TABLE]

and the number of such elements is clearly $(q-1)q^{\lfloor\frac{k-1}{2}\rfloor}$ . Since the order of $\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}$ is $(q-1)q^{k-1}$ , we obtain that the order of $\mathbb{S}^{1}_{k}$ is

[TABLE]

as claimed. ∎
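The orders appearing in Lemma 6.1 can be checked by exhaustive enumeration for small parameters. The following Python sketch is an illustration only. It assumes that $q$ is prime (so that $\mathbb{F}_{q}$ can be modelled by the integers mod $q$ ), that $\sigma$ acts by $S\mapsto-S$ , and that the norm of $f$ is $f\cdot\sigma(f)$ ; the helper names `sigma`, `mul` and `group_orders` are ours.

```python
from itertools import product

def sigma(f, q):
    """Apply S -> -S: negate the odd-degree coefficients of f modulo q."""
    return tuple((-a) % q if i % 2 else a for i, a in enumerate(f))

def mul(f, g, q, k):
    """Multiply two elements of F_q[S]/(S^k), given as coefficient tuples of length k."""
    h = [0] * k
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            if i + j < k:
                h[i + j] = (h[i + j] + a * b) % q
    return tuple(h)

def group_orders(q, k):
    """Return (#units, #H_k, #S^1_k) for F_q[S]/(S^k) by brute force."""
    one = (1,) + (0,) * (k - 1)
    # Units are exactly the elements with nonzero constant term.
    units = [f for f in product(range(q), repeat=k) if f[0] != 0]
    # H_k: units fixed by sigma, i.e. even polynomials.
    even = [f for f in units if sigma(f, q) == f]
    # S^1_k: constant term 1 and unit norm f * sigma(f) = 1.
    unit_norm = [f for f in units
                 if f[0] == 1 and mul(f, sigma(f, q), q, k) == one]
    return len(units), len(even), len(unit_norm)

for q in (3, 5):
    for k in range(1, 6):
        print(q, k, group_orders(q, k))
```

In each case the three counts can be compared with $(q-1)q^{k-1}$ , $(q-1)q^{\lfloor\frac{k-1}{2}\rfloor}$ and $q^{\lfloor\frac{k}{2}\rfloor}$ , and the product of the last two equals the first, as the direct product decomposition requires.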

We put an absolute value $|f|=q^{-\operatorname{ord}(f)}$ on $\mathbb{F}_{q}[[S]]$ , where $\operatorname{ord}(f)=\max\{j:S^{j}\mid f\}$ . We then divide $\mathbb{S}^{1}$ into “sectors”

[TABLE]

so that by definition, for $u,v\in\mathbb{S}^{1}\subset\mathbb{F}_{q}[[S]]$

[TABLE]

Consequently, the sectors $\operatorname{Sect}(u;k)$ are in bijection with the group $\mathbb{S}^{1}_{k}$ , and their number is

[TABLE]

Expanding in $\mathbb{F}_{q}[[S]]$ :

[TABLE]

and likewise for $v$ , we see that $v\in\operatorname{Sect}(u;k)$ is equivalent to

[TABLE]

We have a modular version of the homomorphism $U$ from (1.4)

[TABLE]

whose kernel is $H_{k}$ . Note that $f/\sigma(f)\in\mathbb{S}^{1}_{k}$ as it has unit norm and constant term $1$ , and in $\mathbb{S}^{1}_{k}$ the square root is well defined since the order $\#\mathbb{S}^{1}_{k}=q^{\kappa}$ is odd.

Lemma 6.2.

The homomorphism $U_{k}:\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}\to\mathbb{S}^{1}_{k}$ is surjective.

Proof.

The kernel of $U_{k}:\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}\to\mathbb{S}^{1}_{k}$ is $H_{k}$ because the kernel of $f\mapsto f/\sigma(f)$ is, by definition, $H_{k}$ , and the square root map is an automorphism of $\mathbb{S}^{1}_{k}$ . According to Lemma 6.1(i), the map is therefore onto. ∎
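The map $U_{k}$ can be computed explicitly for small parameters, which gives a direct check of Lemma 6.2. The sketch below is an illustration under stated assumptions: $q$ is prime, $\sigma$ acts by $S\mapsto-S$ , inverses are computed by raising to the power $(q-1)q^{k-1}-1$ , and the square root in the odd-order group $\mathbb{S}^{1}_{k}$ is computed as $x\mapsto x^{(q^{\kappa}+1)/2}$ with $\kappa=\lfloor k/2\rfloor$ . All function names are ours.

```python
from itertools import product

def sigma(f, q):
    """Apply S -> -S: negate the odd-degree coefficients of f modulo q."""
    return tuple((-a) % q if i % 2 else a for i, a in enumerate(f))

def mul(f, g, q, k):
    """Multiply two elements of F_q[S]/(S^k), given as coefficient tuples of length k."""
    h = [0] * k
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            if i + j < k:
                h[i + j] = (h[i + j] + a * b) % q
    return tuple(h)

def power(f, e, q, k):
    """Compute f^e in F_q[S]/(S^k) by repeated squaring."""
    result = (1,) + (0,) * (k - 1)
    while e > 0:
        if e & 1:
            result = mul(result, f, q, k)
        f = mul(f, f, q, k)
        e >>= 1
    return result

def U(f, q, k):
    """U_k(f) = sqrt(f / sigma(f)), the square root being taken in S^1_k."""
    unit_group_order = (q - 1) * q ** (k - 1)
    inverse_of_sigma_f = power(sigma(f, q), unit_group_order - 1, q, k)
    ratio = mul(f, inverse_of_sigma_f, q, k)
    s1_order = q ** (k // 2)  # odd, so squaring is a bijection on S^1_k
    return power(ratio, (s1_order + 1) // 2, q, k)

def image_and_kernel_sizes(q, k):
    """Return (#units, #image of U_k, #kernel of U_k) by brute force."""
    one = (1,) + (0,) * (k - 1)
    units = [f for f in product(range(q), repeat=k) if f[0] != 0]
    images = [U(f, q, k) for f in units]
    return len(units), len(set(images)), sum(1 for v in images if v == one)

print(image_and_kernel_sizes(3, 4))
print(image_and_kernel_sizes(5, 3))
```

The size of the image can be compared with $q^{\kappa}$ and the size of the kernel with the order of $H_{k}$ from Lemma 6.1.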

6.2. Super-even characters and their L-functions

A super-even character modulo $S^{k}$ is a Dirichlet character

[TABLE]

which is trivial on $H_{k}$ . In particular, $\Xi$ is even (trivial on the scalars $\mathbb{F}_{q}^{\times}$ ). These are the analogues of Hecke characters in § 4.1. The group of super-even characters mod $S^{k}$ is the character group of $\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}/H_{k}\simeq\mathbb{S}^{1}_{k}$ . Hence by general orthogonality relations for characters of a finite Abelian group, the super-even characters separate the cosets of $H_{k}$ , that is the elements of $\mathbb{S}^{1}_{k}$ .

Proposition 6.3.

For $f\in\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}$ , and $u\in\mathbb{S}^{1}_{k}$ , the following are equivalent:

(i) $U_{k}(f)\in\operatorname{Sect}(u;k)$

(ii) $U_{k}(f)=U_{k}(u)$

(iii) $f\cdot H_{k}=u\cdot H_{k}$

(iv) $\Xi(f)=\Xi(u)$ for all super-even characters $\Xi$ mod $S^{k}$ .

Proof.

For $u\in\mathbb{S}^{1}$ we have $U_{k}(u)=\sqrt{u/\sigma(u)}=\sqrt{u^{2}}=u\bmod S^{k}$ and so combining with (6.1) we find that $U_{k}(f)=U_{k}(u)$ is equivalent to $U_{k}(f)\in\operatorname{Sect}(u;k)$ .

According to Lemma 6.2, the map $U_{k}$ is onto. Therefore, since the kernel of $U_{k}$ is $H_{k}$ , we obtain that $U_{k}(f)=U_{k}(u)$ is equivalent to $f\cdot H_{k}=u\cdot H_{k}$ in $\Big{(}\mathbb{F}_{q}[S]/(S^{k})\Big{)}^{\times}$ .

Using the orthogonality relations for characters of $\mathbb{S}^{1}_{k}$ (super-even characters) we obtain the final equivalence. ∎

The Swan conductor of an even nontrivial character $\Xi$ mod $S^{k}$ is the maximal integer $d<k$ such that $\Xi$ is nontrivial on the subgroup

[TABLE]

Then $\Xi$ is a primitive character modulo $S^{d(\Xi)+1}$ . For a super-even character, the Swan conductor is necessarily odd, since super-even characters are automatically trivial on $\Gamma_{d}$ for $d$ even.

Let $\Xi$ be a nontrivial even character modulo $S^{k}$ . The L-function associated to $\Xi$ is:

[TABLE]

which for nontrivial even $\Xi$ is a polynomial in $z$ of degree exactly $d(\Xi)$ (the Swan conductor of $\Xi$ ), including a trivial zero at $z=1$ . Thus we write for any non-trivial super-even character

[TABLE]

for a unitary matrix $\Theta_{\Xi}\in U(N)$ ( $N=d(\Xi)-1$ ).

For any nontrivial super-even character mod $S^{k}$ , let

[TABLE]

be the sum over all monic polynomials of degree $\nu$ , with $\Lambda(f)$ being the von Mangoldt function. The Explicit Formula (obtained by comparing the logarithmic derivative of (6.2) and (6.3), see e.g. [9]) shows that for nontrivial super-even $\Xi$ , the sum over prime powers $\Psi(\nu;\Xi)$ is a sum over zeros of the L-function associated to $\Xi$ :

[TABLE]

6.3. A weighted count

We introduce a weighted count in terms of the von Mangoldt function on $\mathbb{F}_{q}[S]$ , defined as $\Lambda(f)=\deg\mathfrak{p}$ if $f=c\mathfrak{p}^{j}$ for some prime $\mathfrak{p}\in\mathbb{F}_{q}[S]$ and $j\geq 1$ and scalar $c\in\mathbb{F}_{q}^{\times}$ , and $\Lambda(f)=0$ otherwise. Set

[TABLE]

the sum over monic $f\in\mathbb{F}_{q}[S]$ with $\deg f=\nu$ and $f(0)\neq 0$ .

We want to average over all directions $u\in\mathbb{S}^{1}_{k}$ . The mean value is

[TABLE]

By definition, the sum is just the sum over all monic $f\in M_{\nu}$ (with $f(0)\neq 0$ ), that is

[TABLE]

by the Prime Polynomial Theorem in $\mathbb{F}_{q}[S]$ .
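The identity behind this step, namely that $\Lambda$ summed over all monic polynomials of degree $\nu$ equals $q^{\nu}$ , can be verified from the classical count of monic irreducible polynomials. The Python sketch below is an illustration; it uses Gauss's formula $\pi_{q}(d)=\frac{1}{d}\sum_{e\mid d}\mu(d/e)q^{e}$ for the number of monic irreducibles of degree $d$ , and the function names are ours.

```python
def mobius(n):
    """Moebius function mu(n) by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n has a square factor
            result = -result
        p += 1
    if n > 1:
        result = -result
    return result

def divisors(n):
    """All positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]

def count_monic_irreducibles(q, d):
    """Number of monic irreducible polynomials of degree d over F_q (Gauss's formula)."""
    return sum(mobius(d // e) * q ** e for e in divisors(d)) // d

def von_mangoldt_sum(q, nu):
    """Sum of Lambda(f) over monic f of degree nu.

    A prime P of degree d dividing nu contributes Lambda(P^(nu/d)) = d.
    """
    return sum(d * count_monic_irreducibles(q, d) for d in divisors(nu))

for nu in range(1, 7):
    print(nu, von_mangoldt_sum(3, nu), 3 ** nu)
```

Imposing $f(0)\neq 0$ removes only the powers of the prime $S$ , that is the single term $f=S^{\nu}$ with $\Lambda(S^{\nu})=1$ , so under that restriction the total is $q^{\nu}-1$ .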

We use Proposition 6.3 to pick out prime powers lying in a given sector, and obtain a formula for the sum $\Psi_{k,\nu}(u)$ in terms of super-even characters.

Lemma 6.4.

[TABLE]

the sum being over all nontrivial super-even characters mod $S^{k}$ .

Proof.

From Proposition 6.3 and the orthogonality relations we find

[TABLE]

which gives

[TABLE]

with the sum over all monic $f\in\mathbb{F}_{q}[S]$ of degree $\nu$ . Hence

[TABLE]

The contribution of the trivial character $\Xi_{0}$ is

[TABLE]

Inserting the Explicit Formula (6.4) gives

[TABLE]

on using the orthogonality relations in the form

[TABLE]

∎

We use $|\operatorname{tr}\Theta_{\Xi}^{\nu}|\leq 2\kappa-2$ for $\Xi\neq\Xi_{0}$ to obtain

Corollary 6.5.

As $q\to\infty$ ,

[TABLE]

Hence for $\kappa<\nu/2$ , we obtain an asymptotic formula.

By a standard argument, this implies that $\mathcal{N}_{k,\nu}(u)=N/K+O(q^{\nu/2})$ .

Remark 6.6.

Note that for $\kappa>\nu/2$ , it is no longer necessarily the case that $\Psi_{k,\nu}(u)\sim\frac{q^{\nu}}{q^{\kappa}}$ ; in fact, there may not be any polynomials $g\in\mathbb{F}_{q}[S]$ of degree $\deg g=\nu<2\kappa$ with direction $U(g)\in\operatorname{Sect}(u;k)$ . As an example, assume that $k-1$ is odd, and take

[TABLE]

and suppose that $\deg g=\nu<2\kappa\leq k-1$ satisfies

[TABLE]

By Proposition 6.3, this is equivalent to $g\in(1+2S^{k-1})H_{k}$ . Reducing modulo $S^{k-1}$ gives $g\in H_{k-1}$ , so that $g(-S)=g(S)\bmod S^{k-1}$ . But $\deg g<k-1$ hence $g(-S)=g(S)$ , that is $g$ is an even polynomial, hence $U(g)=1$ . But then $U(g)=1\notin\operatorname{Sect}(1+2S^{k-1};k)$ , a contradiction.

6.4. The variance of $\Psi_{k,\nu}$

The variance of $\Psi_{k,\nu}$ is

[TABLE]

Theorem 6.7.

Assume $q$ is odd, and $\kappa\geq 3$ , or that $\kappa=2$ and additionally $5\nmid q$ . Then as $q\to\infty$ ,

[TABLE]

In other words, if we denote by $X=q^{\nu}$ the number of monic polynomials of degree $\nu$ , then

[TABLE]

This is to be compared with Conjecture 5.1. Note that the range $\nu<\kappa$ is the “trivial regime”, where there are more sectors than directions; in that case the result is elementary, but of little interest.

Lemma 6.8.

[TABLE]

the sum over all nontrivial super-even characters mod $S^{k}$ .

Proof.

Inserting (6.5) we find

[TABLE]

We use the orthogonality relations in the group of super-even characters, which is the character group of $\mathbb{S}^{1}_{k}$ :

[TABLE]

This gives

[TABLE]

Set $c(u)=\delta(u,1)-\frac{1}{q^{\kappa}}$ . From Lemma 6.4 we obtain, on denoting by $\left\langle\bullet\right\rangle_{\mathbb{S}^{1}}$ the average over all $u\in\mathbb{S}^{1}_{k}$ , that

[TABLE]

Using the orthogonality relations, the averages over $u\in\mathbb{S}^{1}$ are

[TABLE]

since $\Xi\neq\Xi_{0}$ , and

[TABLE]

Substituting into our formula gives

[TABLE]

Finally we use $|\operatorname{tr}\Theta_{\Xi}^{\nu}|\leq 2\kappa-2$ for $\Xi\neq\Xi_{0}$ to get our claim. ∎

Hence we get an inequality (for all $\kappa$ and $\nu$ )

Corollary 6.9.

[TABLE]

This is analogous to Theorem 3.2. To do better, we invoke an equidistribution result for the zeros of these L-functions.

6.5. Proof of Theorem 6.7

We use Lemma 6.8. We separate the characters according to their Swan conductor, which is necessarily an odd integer $d(\Xi)<k$ , whose maximal value is $2\kappa-1$ (recall $k=2\kappa$ or $2\kappa+1$ ). Characters with such maximal conductor make up all primitive super-even characters modulo $S^{2\kappa}$ . As in [9], the contribution of characters with smaller Swan conductor $d(\Xi)<2\kappa-1$ is negligible, and up to lower order terms one finds

[TABLE]

the average over all primitive super-even characters modulo $S^{2\kappa}$ .

Katz [7, Theorem 5.1] showed that for any sequence of odd $q\to\infty$ (in [7, Theorem 5.1], $q$ is allowed to be even for $2\kappa-2\geq 6$ ), the Frobenii

[TABLE]

become uniformly distributed in the unitary symplectic group ${\rm USp}(2\kappa-2)$ provided $2\kappa-2\geq 4$ , and that the same holds for $2\kappa-2=2$ if the $q$ are co-prime to $10$ (i.e. the characteristic of $\mathbb{F}_{q}$ is not $2$ or $5$ ). Katz’s equidistribution theorem allows us to replace the average over primitive super-even characters in (6.6) by the corresponding continuous average over the unitary symplectic group ${\rm USp}(2\kappa-2)$ , to get

[TABLE]

The matrix integral equals, for $\nu>0$ [8, Lemma 2],

[TABLE]

where $\eta(\nu)=1$ for $\nu$ even, and equals $0$ for $\nu$ odd. This proves Theorem 6.7.
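The matrix integral over ${\rm USp}(2g)$ (here $g=\kappa-1$ ) can be evaluated exactly for small $g$ as a numerical cross-check. The sketch below is not taken from the paper. It assumes the Weyl integration formula for ${\rm USp}(2g)$ , with eigenvalues $e^{\pm i\theta_{j}}$ and density proportional to $\prod_{j}\sin^{2}\theta_{j}\prod_{j<l}(2\cos\theta_{j}-2\cos\theta_{l})^{2}$ on $[0,\pi]^{g}$ , and it compares the result with the closed form as we recall it from [8, Lemma 2]: $n+\eta(n)$ for $1\leq n\leq g$ , $n-1+\eta(n)$ for $g<n\leq 2g$ , and $2g$ for $n>2g$ , taking $\eta(n)=1$ for even $n$ and $0$ for odd $n$ . That recalled formula and all function names are assumptions of this illustration.

```python
import math
from itertools import product

def eta(n):
    """Parity indicator: 1 for even n, 0 for odd n (assumed convention)."""
    return 1 if n % 2 == 0 else 0

def usp_trace_second_moment(g, n):
    """Haar average of |tr U^n|^2 over USp(2g), computed by Weyl integration.

    The integrand is an even trigonometric polynomial of frequency at most
    2n + 2g in each angle, so a uniform grid with more points than that on
    the full circle integrates it exactly.
    """
    M = 2 * n + 2 * g + 1
    angles = [2 * math.pi * t / M for t in range(M)]
    total = 0.0
    for theta in product(angles, repeat=g):
        weight = 1.0
        for a in range(g):
            weight *= math.sin(theta[a]) ** 2
            for b in range(a + 1, g):
                weight *= (2 * math.cos(theta[a]) - 2 * math.cos(theta[b])) ** 2
        trace = sum(2 * math.cos(n * t) for t in theta)
        total += weight * trace ** 2
    # Normalisation 2^g / g! makes the constant function integrate to 1.
    return 2 ** g * total / (math.factorial(g) * M ** g)

def recalled_closed_form(g, n):
    """Closed form for the same integral, as recalled from [8, Lemma 2]."""
    if n <= g:
        return n + eta(n)
    if n <= 2 * g:
        return n - 1 + eta(n)
    return 2 * g

for g in (1, 2):
    for n in range(1, 6):
        print(g, n, round(usp_trace_second_moment(g, n), 6), recalled_closed_form(g, n))
```

For $g=1$ the group is ${\rm SU}(2)$ and the three cases can also be confirmed by hand from $\frac{2}{\pi}\int_{0}^{\pi}4\cos^{2}(n\theta)\sin^{2}\theta\,d\theta$ .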

6.6. Relation between variance of $\mathcal{N}_{k,\nu}$ and $\Psi_{k,\nu}$

We can now proceed to prove Theorem 1.3, which follows from Theorem 6.7 once we establish the following relation between the variance of $\mathcal{N}_{k,\nu}$ and of $\Psi_{k,\nu}$ :

Proposition 6.10.

Under the conditions of Theorem 6.7,

[TABLE]

as $q\to\infty$ .

Let $\mathbf{1}_{\operatorname{Sect}(u;k)}$ be the indicator function of the sector $\operatorname{Sect}(u;k)$ . We write

[TABLE]

with the sums over monic polynomials, where

[TABLE]

We subtract the expected value of $\Psi$ , which is

[TABLE]

where we write $\left\langle\bullet\right\rangle$ for the average over all sectors $u\in\mathbb{S}^{1}_{k}$ . Compare this with the expected value of $\mathcal{N}=\mathcal{N}_{k,\nu}$ , which is

[TABLE]

by the Prime Polynomial Theorem. Therefore

[TABLE]

We claim that the mean square of $R$ is bounded by

Lemma 6.11.

[TABLE]

This bound is certainly negligible compared to the variance of $\Psi_{k,\nu}$ , which by Theorem 6.7 is of order $q^{\nu-\kappa}$ . Using (6.7) gives

[TABLE]

and we obtain

[TABLE]

Hence by Theorem 6.7

[TABLE]

as $q\to\infty$ .

6.7. Proof of Lemma 6.11

To prove Lemma 6.11 we write

[TABLE]

We compute

[TABLE]

By Proposition 6.3, the condition $U(f)=U(g)\bmod S^{k}$ is equivalent to $\Xi(f)=\Xi(g)$ for all super-even characters modulo $S^{k}$ , that is

[TABLE]

Therefore

[TABLE]

where

[TABLE]

We will show below that if $\Xi=1$ , then

[TABLE]

and if $\Xi\neq 1$ , then

[TABLE]

Assuming (6.9) and (6.10), we use the expansion (6.8) for $\left\langle R^{2}\right\rangle$ , and insert the bounds (6.9) for $\Xi=1$ , and (6.10) for $\Xi\neq 1$ to obtain

[TABLE]

proving Lemma 6.11.

It remains to prove (6.9) and (6.10). We set

[TABLE]

so that

[TABLE]

The trivial bound for $A(\nu,\Xi)$ is

[TABLE]

This gives (6.9), because

[TABLE]

since the largest divisor $\delta\mid\nu$ which is smaller than $\nu$ is not larger than $\nu/2$ .

If $\Xi\neq 1$ then we have a better bound:

[TABLE]

Indeed, write $A(\nu,\Xi)=\Psi(\nu,\Xi)-B(\nu,\Xi)$ , and then use the trivial bound (6.9): $|B(\nu,\Xi)|\ll q^{\nu/2}$ and (6.4): $|\Psi(\nu,\Xi)|\ll q^{\nu/2}$ , to obtain (6.12).

Next, we use the expansion (6.11) of $B(\nu,\Xi)$ to write

[TABLE]

To bound the contribution of divisors $\delta$ with $\Xi^{\nu/\delta}=1$ , note that the order of $\Xi$ divides $\#\mathbb{S}^{1}=q^{\kappa}$ , so that if $\Xi\neq 1$ but $\Xi^{\nu/\delta}=1$ then necessarily $p\mid\nu/\delta$ , where $q=p^{r}$ with $p$ an odd prime (since $q$ is odd). Hence using the trivial bound $A(\delta,1)\leq q^{\delta}$ gives

[TABLE]

Now if $p\mid\frac{\nu}{\delta}$ , then $\delta\mid\frac{\nu}{p}$ so $\delta\leq\frac{\nu}{p}$ , and we obtain

[TABLE]

We bound the contribution of divisors $\delta$ with $\Xi^{\nu/\delta}\neq 1$ , using (6.12), by

[TABLE]

again using that the largest divisor $\delta\mid\nu$ which is smaller than $\nu$ is not larger than $\nu/2$ . Thus we find that for $\Xi\neq 1$ ,

[TABLE]

which proves (6.10) since $p\geq 3$ .

Bibliography


[1] L. Bary-Soroker, Y. Smilansky, A. Wolf, On the Function Field Analogue of Landau’s Theorem on Sums of Squares, Finite Fields Appl. 39 (2016), 195–215.
[2] H. M. Bui, J. P. Keating and D. J. Smith, On the variance of sums of arithmetic functions over primes in short intervals and pair correlation for L-functions in the Selberg class, J. Lond. Math. Soc. (2) 94 (2016), no. 1, 161–185.
[3] F. J. Dyson, Statistical theory of the energy levels of complex systems, I, II and III, J. Math. Phys. 3 (1962), 140–175.
[4] G. Harman and P. A. Lewis, Gaussian primes in narrow sectors, Mathematika 48 (2001), no. 1-2, 119–135 (2003).
[5] E. Hecke, Eine neue Art von Zetafunktionen und ihre Beziehungen zur Verteilung der Primzahlen. I, Math. Z. 1 (1918), 357–376. II, Math. Z. 6 (1920), 11–51.
[6] H. Iwaniec and E. Kowalski, Analytic number theory, American Mathematical Society Colloquium Publications, 53, American Mathematical Society, Providence, RI, 2004.
[7] N. M. Katz, Witt Vectors and a Question of Rudnick and Waxman, Int. Math. Res. Not. IMRN, Vol. 2016, No. 00, pp. 1–36, doi:10.1093/imrn/rnw130.
[8] J. P. Keating and B. E. Odgers, Symmetry transitions in random matrix theory & L-functions, Comm. Math. Phys. 281 (2008), no. 2, 499–528.
