The shape of quadratic Gauss paths

Justine Dell; Djordje Mili\'cevi\'c

arXiv:2508.21707·math.NT·September 1, 2025

The shape of quadratic Gauss paths

Justine Dell, Djordje Mili\'cevi\'c

PDF

Open Access

TL;DR

This paper studies the distribution of quadratic Gauss paths, showing they converge to a specific random Fourier series and providing a classification of their limiting shapes, explaining their notable visual features.

Contribution

It introduces a new probabilistic description of quadratic Gauss paths and characterizes their limiting behavior as the parameter grows large.

Findings

01

Quadratic Gauss paths converge in law to a random Fourier series.

02

The paper provides a classification of the limiting shapes of these paths.

03

It establishes convergence in probability for the ensemble of paths.

Abstract

We consider the distribution of quadratic Gauss paths, polygonal paths joining partial sums of quadratic Gauss sums to square-free fundamental discriminant moduli in a dyadic range [Q,2Q]. We prove that this striking ensemble converges in law, as Q->\infty, to a random Fourier series we explicitly describe, and we prove a convergence in probability result and a classification result for the limiting shapes that explain the visually remarkable properties of these Gauss paths.

Equations740

G(c)=\frac{1}{\sqrt{c}}\sum_{m=1}^{c}\Big{(}\frac{m}{c}\Big{)}e^{2\pi im/c}.

G(c)=\frac{1}{\sqrt{c}}\sum_{m=1}^{c}\Big{(}\frac{m}{c}\Big{)}e^{2\pi im/c}.

D_{Q} \to C^{0} ([0, 1], C), c \mapsto G (\cdot; c)

D_{Q} \to C^{0} ([0, 1], C), c \mapsto G (\cdot; c)

λ_{p} (1) = \frac{p}{2 ( p + 1 )}, λ_{p} (- 1) = \frac{p}{2 ( p + 1 )}, λ_{p} (0) = \frac{1}{p + 1} .

λ_{p} (1) = \frac{p}{2 ( p + 1 )}, λ_{p} (- 1) = \frac{p}{2 ( p + 1 )}, λ_{p} (0) = \frac{1}{p + 1} .

λ_{2} (1) = λ_{2} (- 1) = \frac{1}{2} .

λ_{2} (1) = λ_{2} (- 1) = \frac{1}{2} .

X_{m} = X_{p_{1}}^{a_{1}} \dots X_{p_{k}}^{a_{k}} .

X_{m} = X_{p_{1}}^{a_{1}} \dots X_{p_{k}}^{a_{k}} .

G^{*} (t) := n \neq = - 1, 0 \sum X_{n} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t

G^{*} (t) := n \neq = - 1, 0 \sum X_{n} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t

\mathcal{D}_{Q,\bm{\epsilon}_{Z}}=\Big{\{}n\in\mathcal{D}_{Q}:\Big{(}\frac{p}{n}\Big{)}=\epsilon_{p}\text{ for every }p\leqslant Z\Big{\}}

\mathcal{D}_{Q,\bm{\epsilon}_{Z}}=\Big{\{}n\in\mathcal{D}_{Q}:\Big{(}\frac{p}{n}\Big{)}=\epsilon_{p}\text{ for every }p\leqslant Z\Big{\}}

X_{m, ϵ_{Z}} = p^{a_{p}} ∥ m p ⩽ Z \prod ϵ_{p}^{a_{p}} p^{a_{p}} ∥ m p > Z \prod X_{p}^{a_{p}} .

X_{m, ϵ_{Z}} = p^{a_{p}} ∥ m p ⩽ Z \prod ϵ_{p}^{a_{p}} p^{a_{p}} ∥ m p > Z \prod X_{p}^{a_{p}} .

G_{ϵ_{Z}}^{*} (t) = n \neq = - 1, 0 \sum X_{n, ϵ_{Z}} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t

G_{ϵ_{Z}}^{*} (t) = n \neq = - 1, 0 \sum X_{n, ϵ_{Z}} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t

G_{ϵ_{Z}}^{♯} (t) = E (G_{ϵ_{Z}}^{*} (t)) = n \neq = - 1, 0 p ∣ n \Rightarrow p ⩽ Z \sum p^{a_{p}} ∥ n \prod ϵ_{p}^{a_{p}} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t .

G_{ϵ_{Z}}^{♯} (t) = E (G_{ϵ_{Z}}^{*} (t)) = n \neq = - 1, 0 p ∣ n \Rightarrow p ⩽ Z \sum p^{a_{p}} ∥ n \prod ϵ_{p}^{a_{p}} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t .

\includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics

\includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics \includegraphics

t \to t_{0} \pm lim \frac{G _{ϵ_{Z}}^{♯} ( t ) - G _{ϵ_{Z}}^{♯} ( t _{0} )}{e ( t _{0} ) ( t - t _{0} )} = \pm \infty or \mp \infty

t \to t_{0} \pm lim \frac{G _{ϵ_{Z}}^{♯} ( t ) - G _{ϵ_{Z}}^{♯} ( t _{0} )}{e ( t _{0} ) ( t - t _{0} )} = \pm \infty or \mp \infty

\mathbb{P}\Big{(}\|G_{\bm{\epsilon}_{Z}}^{\ast}-G_{\bm{\epsilon}_{Z}}^{\sharp}\|_{\infty}\geqslant\delta\Big{)}\to 0\quad(Z\to\infty),

\mathbb{P}\Big{(}\|G_{\bm{\epsilon}_{Z}}^{\ast}-G_{\bm{\epsilon}_{Z}}^{\sharp}\|_{\infty}\geqslant\delta\Big{)}\to 0\quad(Z\to\infty),

\lim_{Z\to\infty}\limsup_{Q\to\infty}\mathbb{P}\Big{(}\|G_{Q,\bm{\epsilon}_{Z}}-G_{\bm{\epsilon}_{Z}}^{\sharp}\|_{\infty}\geqslant\delta\Big{)}=0.

\lim_{Z\to\infty}\limsup_{Q\to\infty}\mathbb{P}\Big{(}\|G_{Q,\bm{\epsilon}_{Z}}-G_{\bm{\epsilon}_{Z}}^{\sharp}\|_{\infty}\geqslant\delta\Big{)}=0.

\mathcal{M}_{\bm{m},\bm{n}}(X)=\mathbb{E}\bigg{(}\prod_{i=1}^{k}\overline{X_{i}}^{m_{i}}X_{i}^{n_{i}}\bigg{)}.

\mathcal{M}_{\bm{m},\bm{n}}(X)=\mathbb{E}\bigg{(}\prod_{i=1}^{k}\overline{X_{i}}^{m_{i}}X_{i}^{n_{i}}\bigg{)}.

\sum\sum_{m, n \in Z_{⩾ 0}^{k}} M_{m, n} (X) \frac{z _{1}^{m_{1}} z _{1}^{'} ^{n_{1}} \dots z _{k}^{m_{k}} z _{k}^{'} ^{n_{k}}}{m _{1} ! n _{1} ! \dots m _{k} ! n _{k} !}

\sum\sum_{m, n \in Z_{⩾ 0}^{k}} M_{m, n} (X) \frac{z _{1}^{m_{1}} z _{1}^{'} ^{n_{1}} \dots z _{k}^{m_{k}} z _{k}^{'} ^{n_{k}}}{m _{1} ! n _{1} ! \dots m _{k} ! n _{k} !}

M_{n} (m, n) \to M (m, n) (n \to \infty),

M_{n} (m, n) \to M (m, n) (n \to \infty),

E (∣ X_{n} (t) - X_{n} (s) ∣^{α}) ≪ ∣ s - t ∣^{1 + δ},

E (∣ X_{n} (t) - X_{n} (s) ∣^{α}) ≪ ∣ s - t ∣^{1 + δ},

\mathop{\sum\nolimits^{\ast}}_{m\leqslant M}\bigg{|}\mathop{\sum\nolimits^{\ast}}_{n\leqslant N}a_{n}\bigg{(}\frac{n}{m}\bigg{)}\bigg{|}^{2}\ll_{\epsilon}(MN)^{\epsilon}(M+N)\mathop{\sum\nolimits^{\ast}}_{n\leqslant N}|a_{n}|^{2},

\mathop{\sum\nolimits^{\ast}}_{m\leqslant M}\bigg{|}\mathop{\sum\nolimits^{\ast}}_{n\leqslant N}a_{n}\bigg{(}\frac{n}{m}\bigg{)}\bigg{|}^{2}\ll_{\epsilon}(MN)^{\epsilon}(M+N)\mathop{\sum\nolimits^{\ast}}_{n\leqslant N}|a_{n}|^{2},

N < n ⩽ M + N \sum χ (n) ≪_{ε} N^{1/2} c^{3/16 + ε} .

N < n ⩽ M + N \sum χ (n) ≪_{ε} N^{1/2} c^{3/16 + ε} .

n \in I \sum χ (n) e^{2 π i α n} ≪_{κ} ∣ I ∣ q^{- δ} .

n \in I \sum χ (n) e^{2 π i α n} ≪_{κ} ∣ I ∣ q^{- δ} .

∣ b_{0} + b_{1} lo g p_{1} + \dots + b_{k} lo g p_{k} ∣ ⩾ \frac{1}{( e ∥ b ∥ _{\infty} ) ^{C}} .

∣ b_{0} + b_{1} lo g p_{1} + \dots + b_{k} lo g p_{k} ∣ ⩾ \frac{1}{( e ∥ b ∥ _{\infty} ) ^{C}} .

G^{*} (t) = y \to \infty lim n \neq = - 1, 0 P^{+} (∣ n ∣) ⩽ y \sum X_{n} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t .

G^{*} (t) = y \to \infty lim n \neq = - 1, 0 P^{+} (∣ n ∣) ⩽ y \sum X_{n} (\frac{e (( n + 1 ) t ) - 1}{2 iπ ( n + 1 )}) + t .

A_{y_{1}} := {n \in N : P^{-} (n) > y_{1}}, A_{y_{1}}^{y_{2}} := {n \in N : y_{2} ⩾ P^{+} (n) ⩾ P^{-} (n) > y_{1}},

A_{y_{1}} := {n \in N : P^{-} (n) > y_{1}}, A_{y_{1}}^{y_{2}} := {n \in N : y_{2} ⩾ P^{+} (n) ⩾ P^{-} (n) > y_{1}},

A (N) := A \cap (N / e, N],

A (N) := A \cap (N / e, N],

S_{A} := t \in [0, 1] max n \in A \sum \frac{e ( n t )}{n} X_{n}, S_{A} (E) := t \in [0, 1] max n \in A \sum \frac{e ( n t )}{n} ϵ_{n} .

S_{A} := t \in [0, 1] max n \in A \sum \frac{e ( n t )}{n} X_{n}, S_{A} (E) := t \in [0, 1] max n \in A \sum \frac{e ( n t )}{n} ϵ_{n} .

E (S_{A_{y_{1}}^{y_{2}}}^{2 k}) ≪ y_{1}^{- k /21} .

E (S_{A_{y_{1}}^{y_{2}}}^{2 k}) ≪ y_{1}^{- k /21} .

d_{k} (n) := n_{1} \dots n_{k} = n \sum 1, d_{k} (n; N) := n_{1} \dots n_{k} = n n_{i} \in A_{y_{1}} (N) \sum 1.

d_{k} (n) := n_{1} \dots n_{k} = n \sum 1, d_{k} (n; N) := n_{1} \dots n_{k} = n n_{i} \in A_{y_{1}} (N) \sum 1.

P^{-} (n) > y \sum \frac{d _{k} ( n ) ^{2}}{n ^{2 σ}} ≪ e^{O (k / l o g k)} .

P^{-} (n) > y \sum \frac{d _{k} ( n ) ^{2}}{n ^{2 σ}} ≪ e^{O (k / l o g k)} .

S_{A_{y_{1}}^{y_{2}}} (E) ⩽ j > l o g y_{1} \sum S_{A_{y_{1}}^{y_{2}} (e^{j})} (E) .

S_{A_{y_{1}}^{y_{2}}} (E) ⩽ j > l o g y_{1} \sum S_{A_{y_{1}}^{y_{2}} (e^{j})} (E) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Analytic Number Theory Research · Geometry and complex manifolds

Full text

The shape of quadratic Gauss paths

Justine Dell

University of California San Diego (UCSD), Department of Mathematics, 9500 Gilman Drive #0112, La Jolla, CA 92093, USA

[email protected]

and

Djordje Milićević

Bryn Mawr College, Department of Mathematics, 101 North Merion Avenue, Bryn Mawr, PA 19010, USA Institute for Advanced Study, 1 Einstein Drive, Princeton, NJ 08540, USA [email protected]

Abstract.

We consider the distribution of quadratic Gauss paths, polygonal paths joining partial sums of quadratic Gauss sums to square-free fundamental discriminant moduli in a dyadic range $[Q,2Q]$ . We prove that this striking ensemble converges in law, as $Q\to\infty$ , to a random Fourier series we explicitly describe, and we prove a convergence in probability result and a classification result for the limiting shapes that explain the visually remarkable properties of these Gauss paths.

Key words and phrases:

Legendre symbol, Gauss sums, short character sums, random Fourier series, probability in Banach spaces, shapes of exponential sum paths

2020 Mathematics Subject Classification:

11L05 Primary, 11L40, 11N64, 60F17, 60G17, 60G50 Secondary

Research supported in part by the National Science Foundation Grant DMS-1903301, the Simons Foundation Award MPS-TSM-00008085, and by the Charles Simonyi Endowment (D.M.).

1. Introduction

1.1. Gauss paths

Cancellation in exponential sums is a major player across analytic number theory. A fascinating insight into its chaotic formation is provided by polygonal paths joining the consecutive partial sums, which have been studied at least since the work of Lehmer [Leh76] and Loxton [Lox83, Lox85] on exponential sums with quadratic and other analytically defined smoothly varying phases. The pioneering work of Kowalski–Sawin [KS16] and subsequent papers including [RR18, RRS20, MZ23, Hus22, HL24] extended this paradigm and studied the distribution of paths arising from oscillatory sums of arithmetic origin, such as Kloosterman and character sums.

A natural family of character paths arises from the normalized quadratic Gauss sums to square-free fundamental discriminant moduli:

[TABLE]

We term the polygonal paths joining the partial sums of $G(c)$ Gauss paths. Having perhaps been conditioned to expect arithmetically defined sums such as Kloosterman sums to exhibit fractal-like behavior seen in Figure 1, the pictures of several (mostly randomly chosen) Gauss paths to large moduli shown in Figure 2 might take one for a surprise.

Pictures don’t lie, of course, and provide additional food for thought when, after even just a modest amount of experimentation, stunningly similar pictures start showing up; see Figure 3.

What is going on here? We can observe very long stretches in which the Legendre symbol $\big{(}\frac{m}{c}\big{)}$ has a definite (statistical) preference for one of the $\pm 1$ signs over the other, and it is easy to believe that this behavior is guided by $c$ falling in certain residue classes to some small moduli. But why would only a few small moduli seemingly matter, and what is the deal with the sharp reversals?

1.2. Limiting distribution

Let $\mathcal{D}$ denote the set of positive, square-free integers $c$ for which $c\equiv 1\bmod{4}$ . We now formally define the quadratic Gauss path $G(t;c)$ for $c\in\mathcal{D}$ . For $t={j}/{(c-1)}$ for some $j\in[0,c-1]\cap\mathbb{Z}$ , we let $G(t;c)=g_{j}$ , where $g_{j}=c^{-1/2}\sum_{m=1}^{j}\left(\frac{m}{c}\right)e_{c}(m)$ is the $j$ th partial sum of (1.1). For ${(j-1)}/{(c-1)}<t<{j}/{(c-1)}$ ( $j\in[1,c-1]\cap\mathbb{Z}$ ), we obtain $G(t;c)$ by linearly interpolating between $g_{j}$ and $g_{j+1}$ . Then, $G(\cdot;c):[0,1]\to\mathbb{C}$ is a continuous function which maps $t\mapsto G(t;c)$ .

For every $Q\geqslant 3$ , we may consider the sample space $\mathcal{D}_{Q}:=[Q,2Q]\cap\mathcal{D}$ with the uniform probability measure $m_{Q}$ , and the map

[TABLE]

can be viewed as a $C^{0}([0,1],\mathbb{C})$ -valued random variable $G_{Q}$ on this probability space. We collect some relevant background on probability in Banach spaces in §2.1 for convenient reference. Note that $|\mathcal{D}_{Q}|\sim Q/(3\zeta(2))$ ; see (4.2).

Our first main result, Theorem 1.1, establishes that the random variables $(G_{Q})$ converge in law, as $Q\to\infty$ , to a specific $C^{0}([0,1],\mathbb{C})$ -valued random variable $G^{\ast}$ , which we are about to describe. It incorporates completely multiplicative random variables $X_{n}$ , which are the same as those used in the probabilistic model for the Jacobi symbols $(d/n)$ (as $d$ ranges through all fundamental discriminants) described in [GS03]. For every odd prime $p$ , we let $X_{p}$ be the random variable which takes the value 0 with probability $1/(p+1)$ and takes the values 1 and $-1$ each with probability $p/2(p+1)$ ; in other words, it is the identity random variable on the sample space $\{0,1,-1\}$ equipped with the measure $\lambda_{p}$ defined by

[TABLE]

For $p=2$ , we let $X_{2}$ be the usual Bernoulli random variable; that is, the identity random variable on $\{1,-1\}$ with the measure

[TABLE]

Let $(X_{p})$ be a sequence of independent random variables of laws $\lambda_{p}$ , and let $X_{m}$ be completely multiplicative random variables defined, for $m=\pm p_{1}^{a_{1}}\cdots p_{k}^{a_{k}}$ , as

[TABLE]

Theorem 1.1.

Let $(X_{m})$ be a completely multiplicative sequence of random variables of law given by (1.2)–(1.4).

(1)

The random Fourier series

[TABLE]

converges almost surely to a continuous function and so defines a $C^{0}([0,1],\mathbb{C})$ -valued random variable $G^{\ast}$ . 2. (2)

The sequence of random variables $(G_{Q})$ converges in law to $G^{\ast}$ as $Q\to\infty$ .

1.3. Convergence in probability and the atlas of shapes

We now turn our attention to the visually observed sensitivity of the Gauss paths $G(\cdot;c)$ on congruence properties of $c$ to small moduli. To this end, for a parameter $Z\geqslant 1$ , and let $\bm{\epsilon}_{Z}=(\epsilon_{p})_{p\leqslant Z}$ denote any fixed choice of $\epsilon_{p}\in\{-1,0,1\}$ over primes $p\leqslant Z$ (with $\epsilon_{2}\in\{-1,1\}$ ).

Now, on the one hand, we may consider the sample space

[TABLE]

equipped with the uniform probability measure and the $C^{0}([0,1],\mathbb{C})$ -valued random variable $G_{Q,\bm{\epsilon}_{Z}}:c\mapsto G(\cdot;c)$ ; in other words, the random variable $G_{Q,\bm{\epsilon}_{Z}}$ is obtained from $G_{Q}$ by conditioning on the event that $(p/c)=\epsilon_{p}$ for all $p\leqslant Z$ . On the other hand, letting $(X_{p})_{p>Z}$ be a sequence of independent random variables of laws $\lambda_{p}$ as in (1.2), we may consider the sequence of completely multiplicative random variables defined as

[TABLE]

In other words, the random variables $X_{m,\bm{\epsilon}_{Z}}$ are obtained as in (1.4), but changing the law of $\lambda_{p}$ for $p\leqslant Z$ to the delta mass on $\epsilon_{p}$ . Then we may consider the random Fourier series

[TABLE]

and the deterministic Fourier series

[TABLE]

The series defining $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ is akin to the classical everywhere continuous, nowhere differentiable Weierstrass function. A small sample consisting of all possible shapes $G^{\sharp}_{\bm{\epsilon}_{5}}$ is shown in Figure 4. The following Theorem 1.2 formalizes the empirical observation that Gauss paths tend to be strongly aligned close to the shapes in the ensemble of the deterministic paths $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ (according to $c\in\mathcal{D}_{Q,\bm{\epsilon}_{Z}}$ as in (1.6) for an increasingly large $Z$ ), which we may correspondingly term the atlas of shapes of quadratic Gauss paths.

Theorem 1.2.

Let $Z\geqslant 1$ , and let $\bm{\epsilon}_{Z}=(\epsilon_{p})_{p\leqslant Z}$ denote any fixed choice of $\epsilon_{p}\in\{-1,0,1\}$ over primes $p\leqslant Z$ , with $\epsilon_{2}\in\{-1,1\}$ .

(1)

The random Fourier series $G^{\ast}_{\bm{\epsilon}_{Z}}(t)$ defined in (1.7) converges a.s. to a continuous function and defines a $C^{0}([0,1],\mathbb{C})$ -valued random variable $G^{\ast}_{\bm{\epsilon}_{Z}}$ . Moreover, the sequence of random variables $(G_{Q,\bm{\epsilon}_{Z}})$ defined in (1.6) converges in law to $G^{\ast}_{\bm{\epsilon}_{Z}}$ as $Q\to\infty$ . 2. (2)

The deterministic Fourier series $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ converges absolutely and uniformly to a continuous function. If $\epsilon_{p}=-1$ for at least two $p\leqslant Z$ , this deterministic path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ satisfies

[TABLE]

on an everywhere dense set of points $t_{0}\in[0,1]\cap\mathbb{Q}$ satisfying (6.7) and described in Proposition 6.5. If $\epsilon_{p}=-1$ for exactly one $p\leqslant Z$ , the same holds for $t_{0}=a/q$ satisfying the additional condition (6.27); see Remark 1. 3. (3)

For every $\delta>0$ ,

[TABLE]

as well as

[TABLE]

Our argument in fact provides an explicit upper bound for the exceptional probabilities in item (3) (after $\limsup_{Q\to\infty}$ in (1.9)) of size $\ll\exp(-\delta^{2}Z^{1/8})$ for $Z\geqslant Z_{1}(\delta)$ (see (7.6) and (7.7)). We chose not to optimize this rate, which is already quite rapidly decreasing in $Z$ and thus suggests an explanation for why the experimentally observed shapes (which of course are just a finite and presumably random sample) appear to fall into very few classes dictated by congruence classes modulo several small primes. It bears emphasizing that Theorem 1.2 does not claim that the paths $G(\cdot;c)$ must be uniformly close to $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ for all $c\in\mathcal{D}_{Q,\bm{\epsilon}_{Z}}$ , nor can such a statement be true. A path $G(\cdot;c)$ will be substantially away from $G_{\bm{\epsilon}_{Z}}^{\sharp}$ as long as the values of $(p/c)$ over $p>Z$ exhibit a significant bias, and the same is true for a sample of the random path $G^{\ast}_{\bm{\epsilon}_{Z}}$ if the corresponding sample of $(X_{p})_{p>Z}$ is biased. The point is that such an event is rare. Figures 5 and 6 illustrate Theorem 1.2 and these points. We also point the reader to Remarks 1 and 2 and the accompanying Figures 7 and 8, which illustrate some delicate phenomena for limiting paths $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ when all (or all but one) $\epsilon_{p}=1$ .

1.4. Acknowledgement

This paper grew out of the first author’s 2022 thesis, advised by the second author. We were both inspired by and indebted to the existing literature on Kloosterman paths [KS16, MZ23] and on character paths in the unitary ensemble [Hus22]. Some of the ingredients in establishing the convergence in law also appear in the work of Hussain and Lamzouri on Legendre paths [HL24], although our main focus on the striking sharp reversals and congruence-guided classification of Gauss paths is very different.

1.5. Organization of the paper

We collect some preliminaries on probability theory and estimates on character sums in section 2. The convergence and properties of the limiting random variable, including the proof of Theorem 1.1 (1) is studied in section 3. We establish convergence in the sense of finite distributions of $(G_{Q})\to G^{\ast}$ using the method of moments in section 4, and then prove the convergence in law claim of Theorem 1.1(2) in section 5. In section 6, we describe the local asymptotics of the limiting shapes $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ at rational points $t_{0}\in[0,1]\cap\mathbb{Q}$ and then produce a collection of rational points at which $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ exhibits a cusp. Finally, we prove Theorem 1.2 in section 7.

1.6. Notation

As is common in analytic number theory, for $z\in\mathbb{C}$ we write $e(z)=e^{2\pi iz}$ . We write $f=\mathrm{O}(g)$ or $f\ll g$ (using the notations interchangeably) if there exists a constant $C$ , independent of all parameters not explicitly indicated with a subscript, such that $|f|\leqslant Cg$ . We also write $f\asymp g$ if $f\ll g$ and $g\ll f$ , and $f\sim g$ if $\lim(f/g)=1$ , where the direction of the limit is either indicated or clear from the context. We denote by $\epsilon>0$ a positive value, which may differ from line to line but may in each case be taken to be as small as desired.

2. Preliminaries

2.1. Probability in Banach Spaces

In this section, we collect some definitions and facts pertaining to probability in Banach spaces.

For general facts about Banach spaces and probability, we refer to [LQ18, Preliminary Chapter, Chapters 1 and 4]. In particular, for an arbitrary probability space $(\Omega,\mathcal{A},\mathbb{P})$ , a separable Banach space $E$ , and the Borel $\sigma$ -algebra $\mathcal{B}$ on $E$ , an $E$ -valued random variable is a map $X:\Omega\to E$ that is $(\mathcal{A}$ – $\mathcal{B})$ -measurable (in other words, for any $B\in\mathcal{B}$ , $X^{-1}(B)\in\mathcal{A}$ ).

We will be particularly interested in the separable Banach space $C^{0}([0,1],\mathbb{C})$ of complex-valued continuous functions on $[0,1]$ equipped with the sup-norm. We can define several notions of convergence of random variables on this space. We closely follow the exposition in [RR18, Appendix A] and [MZ23, §2.1], and refer to [Kow21, §B.11] for proofs.

Definition 2.1 (Convergence of Random Variables).

Let $E$ be a Banach space and $(X_{n})$ be a sequence of $E$ -valued random variables on the probability spaces $(\Omega_{n},\mathcal{A}_{n},\mathbb{P}_{n})$ . Let $X$ be an $E$ -valued random variable on the probability space $(\Omega,\mathcal{A},\mathbb{P})$ .

(1)

If each $(\Omega_{n},\mathcal{A}_{n},\mathbb{P}_{n})=(\Omega,\mathcal{A},\mathbb{P})$ , we say that $(X_{n})$ converges to $X$ almost surely if $\mathbb{P}(\{\omega\in\Omega:\lim_{n\to\infty}X_{n}(\omega)=X(\omega)\})=1$ . 2. (2)

We say that $(X_{n})$ converges in law to $X$ if for every continuous and bounded map $\varphi:E\to\mathbb{C}$ , the sequence $\left(\mathbb{E}(\varphi(X_{n}))\right)$ converges to $\mathbb{E}(\varphi(X))$ . 3. (3)

If $E=C^{0}([0,1],\mathbb{C})$ , we say that $(X_{n})$ converges to $X$ in the sense of finite distributions if, for all $k\geqslant 1$ and all $k$ -tuples $(t_{1},t_{2},\dots,t_{k})$ where $0\leqslant t_{1}<\dots<t_{k}\leqslant 1$ , the sequence of $\mathbb{C}^{k}$ -valued random vectors $(X_{n}(t_{1}),\dots,X_{n}(t_{k}))$ converges in law to the random vector $(X(t_{1}),\dots,X(t_{k}))$ . 4. (4)

If $E$ is separable, we say that the sequence $(X_{n})$ is tight if, for all $\epsilon>0$ , there exists a compact subset $K\subseteq E$ such that, for all $n\geqslant 1$ , $\mathbb{P}_{n}(\{X_{n}\in K\})\geqslant 1-\epsilon$ .

Here, when $E=C^{0}([0,1],\mathbb{C})$ and $t\in[0,1]$ , we denote by $X(t)$ the complex-valued random variable which is the evaluation of the random function $X$ at the point $t$ , that is, $X(t)=e_{t}\circ X$ , where $e_{t}:C^{0}([0,1],\mathbb{C})\to\mathbb{C}$ is the evaluation map.

In order to prove convergence in the sense of finite distributions, we can use the method of moments. Recall that, for a $\mathbb{C}^{k}$ -valued random vector $X=(X_{1},X_{2},\dots,X_{k})$ and any $k$ -tuples $\bm{m}=(m_{1},\dots,m_{k})$ and $\bm{n}=(n_{1},\dots,n_{k})$ in $\mathbb{Z}_{\geqslant 0}^{k}$ , the complex moments of $(X_{1},X_{2},\dots,X_{k})$ are defined as

[TABLE]

The random variable $X$ is said to be mild if there exists a $\delta>0$ such that the power series

[TABLE]

converges in the disk $\{(z_{1},z_{1}^{\prime},\dots,z_{k},z_{k}^{\prime})\in\mathbb{C}^{2k}:|z_{i}|,|z_{i}^{\prime}|\leqslant\delta\}$ . A real-valued random variable $X$ is said to be $\sigma^{2}/$ -sub-Gaussian if, for every $t\in\mathbb{R}$ , $\mathbb{E}(e^{tX})\leqslant e^{\sigma^{2}t^{2}/2}$ , and a complex-valued $X=Y+iZ$ is said to be $\sigma^{2}/2$ -sub-Gaussian if $Y$ and $Z$ are. The sum $X=X_{1}+X_{2}$ of two independent $\sigma_{i}^{2}/2$ -sub-Gaussian complex-valued random variables is $(\sigma_{1}^{2}+\sigma_{2}^{2})/2$ -sub-Gaussian. Moreover, a $\mathbb{C}^{k}$ -valued random variable $X=(X_{1},\dots,X_{k})$ whose components are all sub-Gaussian is automatically mild. See [Kow21, §B.5,§B.8] for details.

Proposition 2.2 (Method of moments, [Kow21, Theorem B.5.5(2)]).

Let $(X_{n})$ be a sequence of $\mathbb{C}^{k}$ -valued random vectors, and let $X$ be a mild $\mathbb{C}^{k}$ -valued random vector. If, for any two $k$ -tuples $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ , their corresponding complex moments $\mathcal{M}_{n}(\bm{m},\bm{n})$ and $\mathcal{M}(\bm{m},\bm{n})$ satisfy

[TABLE]

then $(X_{n})$ converges to $X$ in law.

The notion of tightness provides an effective way to upgrade the convergence in the sense of finite distributions of a sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables to convergence in law via the following two propositions.

Proposition 2.3 (Prokhorov’s Criterion).

Let $(X_{n})_{n=1}^{\infty}$ and $X$ be $C^{0}([0,1],\mathbb{C})$ -valued random variables. If the sequence $(X_{n})$ is tight and converges to $X$ in the sense of finite distributions, then $(X_{n})$ converges to $X$ in law.

Proposition 2.4 (Kolmogorov’s Tightness Criterion).

Let $(X_{n})$ be a sequence of $C^{0}([0,1],\allowbreak\mathbb{C})$ -valued random variables. If there exist $\alpha,\delta>0$ such that for all $0\leqslant s,t\leqslant 1$ and $n\geqslant 1$ , we have

[TABLE]

then $(X_{n})$ is tight.

2.2. Character sums and quadratic large sieve

In this section, we recall several ingredients from classical analytic number theory. One of them is Heath-Brown’s quadratic large-sieve inequality.

Proposition 2.5 ([HB95, Theorem 1]).

For any $M,N\in\mathbb{N}$ and $a_{1},\dots,a_{N}\in\mathbb{C}$ ,

[TABLE]

with $\mathop{\sum\nolimits^{\ast}}$ denoting the sum over positive odd square-free values.

For our proof of tightness, we will also require the $r=2$ and $r=4$ cases of Burgess’ classical bound for short character sums. Here we note that what we really require of the second bound is that it is nontrivial in the range $N\ll c^{1/3-\delta}$ for some small $\delta>0$ .

Proposition 2.6 ([IK04, Theorem 12.6]).

For every primitive character of any conductor $c>1$ and any $M$ and $N\geqslant 1$ ,

[TABLE]

Proposition 2.7 ([Cha14, Theorem 1’, special case]).

For every $\kappa>0$ , there exists a constant $\delta=\delta_{\kappa}>0$ such that, for every $\alpha\in\mathbb{R}$ , every multiplicative character $\chi$ to any square-free modulus $q$ , and every interval $I\subset[1,q]$ of size $|I|>q^{1/4+\kappa}$ ,

[TABLE]

In fact, [Cha14, Theorem 1’] allows an arbitrary degree $d$ polynomial phase, explicates the $d$ -dependence in the implied constant, and implies that any $\delta_{\kappa}<c\kappa^{2}d^{-2}$ ( $c$ an absolute constant) is admissible. The above estimate is all we need since $\kappa\in(0,\frac{1}{12})$ (that is, $\frac{1}{4}+\kappa\in(\frac{1}{4},\frac{1}{3})$ ) will eventually be fixed. We note that more precise versions of Proposition 2.7 are available in [Ker14, HBP15] in various degrees of generality of $q$ and $|I|$ .

2.3. Linear forms in logarithms

In our investigation of the limiting shapes of Gauss sums in section 6, we will make use of the following classical and powerful theorem of Baker on linear forms in logarithms.

Proposition 2.8 ([Bak77, Theorem 1]).

For every finite set of distinct primes $S=\{p_{1},\dots,p_{k}\}$ , there exists an (effectively computable) constant $C=C(S)>0$ depending on $S$ only such that, for every $\mathbf{0}\neq\mathbf{b}=(b_{0},b_{1},\dots,b_{k})\in\mathbb{Z}^{k+1}$ ,

[TABLE]

3. The limiting random variable

In this section, we establish the properties of the random Fourier series $G^{\ast}(t)$ defined by (1.5), where we recall that $(X_{n})$ is a completely multiplicative sequence of random variables of law given by (1.4) and (1.2). We often use $\epsilon_{m}$ to denote a particular value of $X_{m}$ and $\mathcal{E}$ to denote any single infinite choice of $\epsilon_{p}\in\{-1,0,1\}$ for each prime $p$ (with $\epsilon_{2}\in\{-1,1\}$ ) and of the accompanying multiplicatively determined $\epsilon_{n}=\epsilon_{p_{1}}^{a_{1}}\dots\epsilon_{p_{k}}^{a_{k}}$ ( $|n|=p_{1}^{a_{1}}\dots p_{k}^{a_{k}}$ ).

Let $P^{-}(n)$ and $P^{+}(n)$ denote the smallest and largest prime divisors of $n$ , respectively. Following [Hus22, §2], we first consider the limit

[TABLE]

Of course, this has the same summands as (1.5), but in a different order that is more strongly attuned to the multiplicative nature of the random coefficients $X_{n}$ . In §3.2, we will show that this is in fact equivalent to defining $G^{*}(t)$ as the limit of partial sums, as in (1.5), and prove some important properties of this random variable. In the first subsection §3.1, we focus on proving the following result.

Proposition 3.1.

Let $X_{n}$ be multiplicative random variables, with $X_{p}$ for each prime $p$ independent and distributed according to the probability measure $\lambda_{p}$ defined in (1.2). Then, the random Fourier series $\widetilde{G^{\ast}}(t)$ in (3.1) is almost surely the Fourier series of a continuous function.

3.1. Arithmetic convergence

In this section, we study in detail the convergence of the random series $\widetilde{G^{\ast}}(t)$ in (3.1) and prove Proposition 3.1.

We denote, for $y_{2}>y_{1}\geqslant 1$ ,

[TABLE]

and, for every subset $A\subseteq\mathbb{N}$ and $N\in\mathbb{N}$ ,

[TABLE]

as well as

[TABLE]

The following result an analog of [BGGK18, Proposition 5.2], with a number of details adjusted to suit our case.

Proposition 3.2.

Let $k\geqslant 3$ be an integer and let $y_{2}>y_{1}\geqslant k^{3}$ be real numbers. With notations as in (3.2) and (3.3), we have

[TABLE]

For positive integers $n,k,N\in\mathbb{N}$ , we define

[TABLE]

Our proof of Proposition 3.2 makes use of the following lemma to control sums over rough integers.

Lemma 3.3 ([BGGK18, Lemma 5.4]).

Let $\epsilon\in(0,1]$ , and let $k\geqslant 2$ be an integer. For $\sigma\geqslant(2+\epsilon)/(2+2\epsilon)$ and $y\geqslant k^{1+\epsilon}$ , we have

[TABLE]

Proof of Proposition 3.2.

We begin by noting that the series defining $S_{A_{y_{1}}^{y_{2}}}$ converges comfortably (and uniformly across all samples $\mathcal{E}$ of the random coefficients $(X_{n})$ ) and open with a simple decomposition estimate

[TABLE]

Using Hölder’s Inequality with $a_{j}=1/j^{2}$ , $b_{j}=j^{2}S_{A_{y_{1}}^{y_{2}}(e^{j})}(\mathcal{E})$ , $p=2k/(2k-1)$ , and $q=2k$ gives

[TABLE]

via a short calculation with integral comparison. Thus, in order to find a bound for $\mathbb{E}(S_{A_{y_{1}}^{y_{2}}}^{2k})$ , it suffices to first bound $\mathbb{E}(S_{A_{y_{1}}^{y_{2}}(N)}^{2k})$ for $N\geqslant y_{1}$ .

Note that, for any $R\in\mathbb{N}$ , we may write

[TABLE]

Taking $R=\lfloor N^{21/20}\rfloor$ and applying convexity of $x\mapsto x^{2k}$ , we obtain

[TABLE]

Therefore, it suffices to bound

[TABLE]

Denoting

[TABLE]

and expanding, $S_{N,r,y_{1},y_{2}}$ equals

[TABLE]

Notice that $\mathbb{E}(X_{mn})=0$ when $mn$ is not a square and $|\mathbb{E}(X_{mn})|\leqslant 1$ in any case. Moreover, in (3.6), the terms which will survive are those for which $mn$ is a square, $m=uf^{2}$ and $n=ug^{2}$ for $f,g\in\mathbb{N}$ and $u$ squarefree. Finally, note that $|\widetilde{d_{k}}(n;N)|\leqslant d_{k}(n;N)$ . Therefore, we have

[TABLE]

Since $udf\geqslant(N/e)^{k}$ , we further obtain

[TABLE]

by using the elementary inequality $d_{k}(mn)\leqslant d_{k}(m)d_{k}(n)$ and dropping various conditions. We then use Lemma 3.3 with $\epsilon=1$ to get, for $y_{1}\geqslant k^{2}$ ,

[TABLE]

Plugging this into (3.5) we get

[TABLE]

since $k\geqslant 3$ . Returning to (3.4), we have

[TABLE]

Keeping in mind that since $y_{1}\geqslant k^{3}\geqslant 27$ , $(\log y_{1}+1+n)/\log y_{1}\leqslant(2+n)$ , and so

[TABLE]

We now prove that $\widetilde{G^{\ast}}(t)$ defined in (3.1) is almost surely the Fourier series of a continuous function. We use similar methods as in [Hus22, §2]. We may rewrite every sample as

[TABLE]

Since the latter series converges absolutely and uniformly, it converges to a continuous function. Hence it suffices to show that

[TABLE]

converges almost surely to a continuous function. We remark that this passage is, in fact, valid in any order of summation.

Define

[TABLE]

Then it is clear, by comparison with the absolutely convergent series $\sum_{P^{+}(n)\leqslant y}(4/n)=4\prod_{p\leqslant y}(1-p^{-1})^{-1}$ , that, for every fixed $y>0$ , every sample $S_{y}(\mathcal{E};t)$ converges absolutely and uniformly to a continuous function. Since $S_{y}(\mathcal{E};t)$ defines a continuous function for any $y$ and any choice of $\mathcal{E}$ , it suffices to show that the sequence $(S_{y})_{y}$ almost surely converges uniformly, as this will allow us to conclude that $\lim_{y\to\infty}S_{y}(t)$ is almost surely a continuous function. We do this using Cauchy’s Criterion for uniform convergence.

Define

[TABLE]

The series defining $R_{y_{1},y_{2}}(t)$ converges absolutely and uniformly and may therefore be rearranged at will. Using the multiplicativity of the $X_{n}$ ’s we may thus write

[TABLE]

The following lemma is an analog of [Hus22, Lemma 2.1], whose proof we closely follow.

Lemma 3.4.

For every $\delta>0$ , there exists a $y_{0}(\delta)>0$ such that for every $y_{2}>y_{1}\geqslant y_{0}(\delta)$ , we have

[TABLE]

Proof.

As before, we let $\epsilon_{m}$ be a value of the random variable $X_{m}$ , and let $\mathcal{E}$ denote the choice of a value $\epsilon_{p}$ for each $X_{p}$ . Notice that

[TABLE]

where $S_{A_{y_{1}}^{y_{2}}}(\mathcal{E})$ is defined as in (3.3). Thus,

[TABLE]

for some absolute constant $C>0$ , by the classical evaluation in [MV07, Theorem 2.7]. We also recall that, by Proposition 3.2, we have for $k\geqslant 3$ and $y_{1}\geqslant k^{3}$ the bound

[TABLE]

Let $\delta(y_{1})=\delta/(4e^{\gamma}\log y_{1}+C)$ , and let

[TABLE]

Then, for suitably sufficiently large $y_{1}\geqslant y_{0}(\delta)$ , the conditions $k\geqslant 3$ , $y_{1}\geqslant k^{3}$ and $\delta(y_{1})>y_{1}^{-1/11}$ are satisfied, hence

[TABLE]

Thus, we have

[TABLE]

for sufficiently large $y_{1}\geqslant y_{0}(\delta)$ (adjusting the value of $y_{0}(\delta)$ if needed). ∎

We now prove Proposition 3.1.

Proof.

First, we claim that, for every $\delta>0$ ,

[TABLE]

holds for all sufficiently large $y_{1}>y_{0}(\delta)$ . Indeed, let $C>0$ be the constant provided by Lemma 3.4. Let $\delta^{\prime}=\min(\delta,1)$ , $z_{1}=\max(y_{1},\lceil(\delta^{\prime}/2)^{-C}\rceil)$ , and, for $n\geqslant 2$ , let

[TABLE]

This choice ensures that $z_{n}\geqslant\lceil(\delta^{\prime}2^{-n-1})^{-C}\rceil$ and $z_{n+1}-z_{n}\asymp z_{1}(\delta^{\prime}2^{-n-1})^{-C}$ , and hence by Lemma 3.4

[TABLE]

From this it follows that, for every sufficiently large $y_{1}>y_{0}(\delta)$ ,

[TABLE]

as claimed.

Now, let $\delta>0$ be arbitrary. The bound (3.15) combined with $R_{y_{1},y_{2}}=S_{y_{2}}-S_{y_{1}}$ shows that

[TABLE]

By the Borel–Cantelli Lemma (see, for example, [LQ18, Proposition I.2]), this implies that almost surely only finitely many events on the left-hand side occur; in other words, almost surely

[TABLE]

holds for all sufficiently large $y_{1}$ . But this means exactly that the sequence $(S_{y})$ almost surely converges uniformly. Since each function $S_{y}$ is continuous, their a.s. uniform limit $\widetilde{G^{\ast}}(t)=\lim_{y\to\infty}S_{y}(t)$ in (3.1) is also a.s. a continuous function. ∎

3.2. Properties of the limiting random variable

We defined $\widetilde{G^{\ast}}(t)$ in (3.1) as the limit of sum over $\{n\neq-1,0:P^{+}(|n|)\leqslant y\}$ . Due to the uniform convergence we just established, $\widetilde{G^{\ast}}(t)$ defines almost surely a continuous function such that the $n$ th Fourier coefficient of $\widetilde{G^{\ast}}(t)-t$ (for $n\neq 0,1$ ) is precisely $X_{n}/2\pi in=\mathrm{O}(1/n)$ .

Now, it is true that, for every $f\in C(\mathbb{R}/\mathbb{Z})$ such that $\hat{f}(n)=\textnormal{O}(1/n)$ , the partial sums of its Fourier series $S_{n}(f)$ converge uniformly to $f$ . For completeness, we reproduce the argument, which we learned from Ullrich [Ull18]. Fix for now an arbitrary $\epsilon>0$ , let $\psi_{\epsilon}\in C_{c}^{\infty}(\mathbb{R})$ be a “trapezoidal” bump function satisfying $0\leqslant\psi_{\epsilon}\leqslant 1$ , $\psi_{\epsilon}(x)=1$ for $x\in[-1,1]$ and $\psi_{\epsilon}(x)=0$ for $|x|\geqslant 1+\epsilon$ , and let $F_{\epsilon}$ be its Fourier transform. For $n\in\mathbb{N}$ , consider the Schwartz class function $\delta_{n}F_{\epsilon}(t):=nF_{\epsilon}(nt)$ and its periodization $K_{n,\epsilon}\in C^{\infty}(\mathbb{R}/\mathbb{Z})$ defined by $K_{n,\epsilon}(t)=\sum_{k\in\mathbb{Z}}\delta_{n}F_{\epsilon}(t+2\pi k)$ . On the one hand, using the uniform continuity of $f$ , the integrability of $F_{\epsilon}$ , and unfolding, we have that

[TABLE]

for sufficiently small $\delta>0$ and then sufficiently large $n\in\mathbb{N}$ , uniformly in $x\in\mathbb{R}/\mathbb{Z}$ . On the other hand, $K_{n,\epsilon}$ is the trigonometric polynomial $K_{n,\epsilon}(t)=\sum_{j\in\mathbb{Z}}\psi_{\epsilon}(j/n)e(nt)$ , and so by the trivial bounds we have that

[TABLE]

uniformly in $x\in\mathbb{R}/\mathbb{Z}$ . Thus $\|S_{n}(f)-f\|_{\infty}\ll\epsilon$ for sufficiently large $n\geqslant n_{0}(\epsilon)\in\mathbb{N}$ , which precisely establishes that $S_{n}(f)\,\rightrightarrows\,f$ on $\mathbb{R}/\mathbb{Z}$ .

Using the just established fact, the almost surely continuous function $\widetilde{G^{\ast}}(t)-t$ equals a.s. the limit of the usual partial sums as in (1.5) and the two definitions will be equivalent, that is,

[TABLE]

Therefore, we use these definitions interchangeably throughout the paper.

Now, let $\Omega=\prod_{p}\Omega_{p}$ denote the probability space underlying the sequence $(X_{p})$ of independent random variables; that is, $\Omega_{p}=\{0,1,-1\}$ equipped with the measure $\lambda_{p}$ appearing in (1.2) for odd prime $p$ , and $\Omega_{2}=\{-1,1\}$ with $\lambda_{2}$ as in (1.2). Then, for every $h\in\mathbb{Z}\setminus\{0\}$ we have

[TABLE]

We also formally set $\eta(0)=0$ .

The a.s. convergent series $G^{\ast}(t)$ defines a $C^{0}([0,1],\mathbb{C})$ -valued random variable on $\Omega$ . The main result of this section, the following Lemma 3.5, shows that, for arbitrary fixed $k\in\mathbb{Z}_{\geqslant 0}$ and $\bm{t}=(t_{1},\dots,t_{k})\in[0,1]^{k}$ , the $\mathbb{C}^{k}$ -valued random variable $(G^{\ast}(t_{1}),\dots,G^{\ast}(t_{k}))$ has complex moments of all orders $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ :

[TABLE]

and provides an exact evaluation for these orders. To state the result precisely, the following notations are convenient. Let $\mathcal{H}^{\ast}_{\bm{m},\bm{n}}$ denote the set of all $k$ -tuples of vectors $\vec{\bm{h}}=(\vec{h}_{1},\dots,\vec{h}_{k})$ , with each individual $\vec{h}_{j}=(h_{j,1},\dots,h_{j,n_{j}},h_{j,n_{j}+1},\dots,h_{j,n_{j}+m_{j}})\in\mathbb{Z}^{n_{j}+m_{j}}$ . For every $\vec{\bm{h}}\in\mathcal{H}^{\ast}_{\bm{m},\bm{n}}$ and $\bm{t}\in[0,1]^{k}$ , define

[TABLE]

where, for every $h\in\mathbb{Z}$ and $t\in[0,1]$ ,

[TABLE]

Then we have the following result.

Lemma 3.5.

For every $t\in[0,1]$ , the random variable $G^{\ast}(t)\in\bigcap_{p<\infty}L^{p}(\Omega)$ . Moreover, for every $k\in\mathbb{Z}_{\geqslant 0}$ and every $\bm{t}=(t_{1},\dots,t_{k})\in[0,1]^{k}$ , the $\mathbb{C}^{k}$ -valued random variable $G^{\ast}(\bm{t}):=((G^{\ast}(t_{1}),\dots,G^{\ast}(t_{k}))$ has complex moments $\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})$ as in (3.18) of all orders $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ , given by the absolutely convergent sum

[TABLE]

with $\beta(\vec{\bm{h}};\bm{t})$ and $\eta(H(\vec{\bm{h}}))$ as in (3.17) and (3.19). These moments satisfy $\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})\leqslant C^{m+n}$ for a suitable absolute $C>0$ , and so $G^{\ast}(\bm{t})$ is a mild random variable.

Proof.

Returning to (3.8), we denote

[TABLE]

with the terms corresponding to $n=-1$ formally interpreted as $t$ . In this paragraph, we verify that all results of the previous section remain valid with $S^{\ast}_{y}(t)$ , $S^{\ast}_{y}(\mathcal{E};t)$ , and analogously defined $R^{\ast}_{y_{1},y_{2}}(t)$ in place of $S(t)$ , $S(\mathcal{E};t)$ and $R_{y_{1},y_{2}}(t)$ . Indeed, for any choice of $|\delta_{n}|\leqslant 1$ , we may repeat the full proof of Proposition 3.2 with the quantities

[TABLE]

instead of (3.3), with the only substantive change in the proof being that $\tilde{d}_{k}(n;N)$ needs to be replaced with

[TABLE]

This satisfies the estimate $|\tilde{d}_{k}^{\ast}(n;N)|\leqslant e^{\textnormal{O}(k)}d_{k}(n;N)$ , which is all that is needed for the proof. As in (3.10), $S^{\ast}_{y}(\mathcal{E};t)$ converges absolutely and uniformly to a continuous function. In place of (3.12), we have the decomposition

[TABLE]

Therefore, denoting $\delta^{\prime}(m)=1/n$ , Lemma 3.4 remains valid for $R^{\ast}_{y_{1},y_{2}}$ as stated, with the key estimate (3.13) in the proof replaced by

[TABLE]

for which the newly adjusted Proposition 3.2 provides $\mathbb{E}(S_{A_{y_{1}}^{y_{2}},\bm{\delta}^{\prime}}^{\ast})\ll y_{1}^{-k/21}$ , and the rest of the proof is unchanged.

Using the estimate (3.15) and the fact that $S_{z}^{\ast}\,\rightrightarrows\,\widetilde{G^{\ast}}$ (uniformly in $t$ , as $z\to\infty$ ) almost surely, we conclude that

[TABLE]

holds for all sufficiently large $y$ . Letting

[TABLE]

we have that $(E_{n})$ form a (non-strictly) increasing sequence of events with $\mathbb{P}(E_{n})\geqslant 1-C\exp(-n^{1/7}/4)$ , so that $\chi_{E_{n}}\,\nearrow\,1$ a.s. In particular, for every fixed $t\in[0,1]$ , we have by the Monotone Convergence Theorem for every $p\geqslant 0$

[TABLE]

Now, for every $n,p\in\mathbb{N}$ , we have by rapid convergence

[TABLE]

Now, for every prime $q$ , we have by a simple combinatorial argument that $d_{2p}(q^{k})=\binom{2p+k-1}{k}$ , and so

[TABLE]

whence

[TABLE]

Inserting this into (3.21) and (3.20) completes the proof of

[TABLE]

where the values of $p\not\in 2\mathbb{Z}_{\geqslant 0}$ are covered by an interpolation argument.

We also claim that $S^{\ast}_{n}(t)\to\widetilde{G^{\ast}}(t)$ in $L^{p}(\Omega)$ . We proceed by a similar argument. Let $\varepsilon>0$ and $n\in\mathbb{N}$ be arbitrary, and denote

[TABLE]

Then the same argument using (3.15) as above shows that $(E_{m,\varepsilon})_{m}$ form a (non-strictly) increasing sequence of events with $\mathbb{P}(E_{m,\varepsilon})\geqslant 1-C\exp(-\varepsilon^{2}m^{1/7})$ , so that $\chi_{E_{m,\varepsilon}}\,\nearrow\,1$ a.s. as $m\to\infty$ , and thus for every $p\geqslant 1$

[TABLE]

Now, for $m>n$ we have, arguing as in (3.21) and below, that

[TABLE]

Inserting this into the previous estimate, we conclude that

[TABLE]

Executing the limits as $\varepsilon\to 0$ and $n\to\infty$ (in either order), we conclude that indeed

[TABLE]

The same claim is true for all real values $p\geqslant 1$ by interpolation.

Finally, we turn to the complex moments $\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})$ , which are all finite by Hölder’s inequality. By writing $\widetilde{G^{\ast}}(t_{i})^{k}=(\widetilde{G^{\ast}}(t_{i})^{k}-S^{\ast}_{n}(t_{i})^{k})+S^{\ast}_{n}(t_{i})^{k}$ ( $k\in\{m_{i},n_{i}\}$ ) and expanding the products and complex conjugates, we may write

[TABLE]

Since $\widetilde{G^{\ast}},S^{\ast}_{n}\in\bigcap_{p<\infty}L^{p}(\Omega)$ , in fact with $\|S^{\ast}_{n}\|_{p}=\mathrm{O}_{p}(1)$ uniformly in $n$ in light of (3.21), taking expectations on both sides, factoring the differences of powers, and applying Hölder’s inequality and (3.25), we conclude that

[TABLE]

But this final expectation is straightforward to evaluate; indeed, denoting by $\mathcal{H}^{\ast\ast}_{\bm{m},\bm{n}}$ the set of all tuples $\vec{\bm{h}}=(\vec{h}_{1},\dots,\vec{h}_{k})$ with $\vec{h}_{j}=(h_{j,1},\dots,h_{j,n_{j}},h_{j,n_{j}+1},\dots,h_{j,n_{j}+m_{j}})$ and each $h_{j,\ell}\in\mathbb{Z}\setminus\{0\}$ , and $\Pi(\vec{\bm{h}})=\prod_{j=1}^{k}\prod_{\ell=1}^{n_{j}+m_{j}}h_{j,\ell}$ ,

[TABLE]

Since, denoting $u=\sum_{i=1}^{k}m_{i}+\sum_{i=1}^{k}n_{i}$ , we comfortably have absolute convergence

[TABLE]

this implies the announced evaluation

[TABLE]

4. Computing the moments

In this section, we compute asymptotically the complex moments of $G_{Q}(\bm{t})$ , which are given by

[TABLE]

where $k$ is a positive integer, $\bm{t}=(t_{1},\dots,t_{k})$ is a $k$ -tuple in $[0,1]^{k}$ , and $\bm{n}=(n_{1},\dots,n_{k})$ and $\bm{m}=(m_{1},\dots,m_{k})$ are $k$ -tuples of non-negative integers. Additionally, we denote $m=\sum_{i=1}^{k}m_{i}$ and $n=\sum_{i=1}^{k}n_{i}$ . Specifically we will prove the following evaluation.

Proposition 4.1.

For every positive integer $k$ and all $k$ -tuples $\bm{t}\in[0,1]^{k}$ and $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ , the complex moments $\mathcal{M}_{Q}(\bm{t};\bm{m},\bm{n})$ of $G_{Q}(\bm{t})$ in (4.1) satisfy

[TABLE]

where $\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})$ are the corresponding complex moments of $G^{\ast}(\bm{t})$ in (3.18).

Corollary 4.2.

The sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables $(G_{Q})$ converges in the sense of finite distributions to $G^{\ast}$ as $Q\to\infty$ .

As a preliminary step, we compute by simple sieving that

[TABLE]

so that the uniform probability measure $m_{Q}$ on $\mathcal{D}_{Q}$ satisfies

[TABLE]

4.1. Reduction steps

We also consider slightly different functions $\widetilde{G}(\cdot,c):[0,1]\to\mathbb{C}$ defined by

[TABLE]

The functions $\widetilde{G}(\cdot,c)$ are discontinuous but they agree with the Gauss paths $G(\cdot,c)$ at the points $t=j/(c-1)$ $(0\leqslant j\leqslant c-1)$ and (as we will quickly see) stay very close to them, while being technically easier to work with. Indeed, we have the following expansion.

Lemma 4.3.

We have

[TABLE]

and

[TABLE]

Proof.

This follows by the completion method, analogously to the case of Kloosterman paths [KS16, RR18, MZ23] and discussed in some detail in [Kow21, Chapter 6]. Indeed, the Parseval identity for the discrete Fourier transform $f(x)=\left(\frac{x}{c}\right)e_{c}(x)$ and $g(x)=1_{1\leqslant x\leqslant j}(x)$ (where $j=\lfloor(c-1)t\rfloor$ ) gives

[TABLE]

and we compute

[TABLE]

This proves the first identity. Now, inserting the classical bound for $h\neq 0$ (see, for example, [Mon94, Chapter 3])

[TABLE]

where $||h/c||$ is the distance between $h/c$ and the nearest integer, and bounding Gauss sums individually, we get

[TABLE]

For every $\bm{t}\in[0,1]^{k}$ we may consider the complex moments of $\widetilde{G}(\cdot,c)$ given by

[TABLE]

That the moments $\mathcal{M}_{Q}(\bm{t};\bm{m},\bm{n})$ and $\widetilde{\mathcal{M}_{Q}}(\bm{t};\bm{m},\bm{n})$ are very close will follow directly from the following lemma.

Lemma 4.4.

We have

[TABLE]

Proof.

We first note that

[TABLE]

We now use the bounds

[TABLE]

The first of these follows immediately from the definitions of $G(t;c)$ and $\widetilde{G}(t;c)$ , and the second one follows from the first one and Lemma 4.3. Using these bounds we conclude that

[TABLE]

Consequently, we have for every choice of $\Sigma_{1},\Sigma_{2}\subseteq\{1,\dots,k\}$ with $\Sigma_{1}\cup\Sigma_{2}\neq\emptyset$

[TABLE]

and the claim follows. ∎

Having proven Lemma 4.4, it is clear that

[TABLE]

so that we may from now on focus on an asymptotic evaluation of $\widetilde{\mathcal{M}_{Q}}(\bm{t};\bm{m},\bm{n})$ .

We first rewrite the first conclusion of Lemma 4.3 as

[TABLE]

Inserting the expansion (4.8) into the definition (4.5) and expanding, we write

[TABLE]

Here, $\vec{h}_{j}$ ranges over all $(n_{j}+m_{j})$ -tuples $\vec{h}_{j}=(h_{j,1},\dots,h_{j,n_{j}},h_{j,n_{j}+1},\dots,h_{j,n_{j}+m_{j}})$ , where each $h_{j,\ell}\in(-c/2,c/2)$ , and

[TABLE]

Now, write $\mathcal{H}_{c}$ for the set of all such $k$ -tuples $\vec{\bm{h}}=(\vec{h}_{1},\dots,\vec{h}_{k})$ and denote

[TABLE]

recalling also the notations $H(\vec{\bm{h}})$ and $\beta(\vec{\bm{h}};\bm{t})$ for every $\vec{\bm{h}}\in\mathcal{H}\supseteq\mathcal{H}_{c}$ from (3.19). Recalling that $G(1-h_{j,\ell},c)=\big{(}\frac{1-h_{j,\ell}}{c}\big{)}\sqrt{c}$ and using multiplicativity of Jacobi symbols, we get

[TABLE]

Next, we show that $\alpha_{c}(\vec{\bm{h}};\bm{t})$ may be replaced with $\beta(\vec{\bm{h}};\bm{t})$ , an expression independent of $c$ , at the cost of a negligible error. We begin with the following elementary lemma, for an independent proof of which we refer to [KS16, Section 2] (where the condition that $c=p$ is a prime is clearly immaterial). We will provide a different argument, which also sets the stage for the proof of the next Lemma 4.6.

Lemma 4.5.

For $|h|<c/2$ , we have

[TABLE]

Proof.

We may write

[TABLE]

where

[TABLE]

Now, if $h=0$ , the statement of the lemma is trivially true. Otherwise, noting that $F_{t}(h,c)$ does not depend on $x$ , and inserting the classical bound (4.4), we conclude that, for $|h|<c/2$ ,

[TABLE]

Lemma 4.6.

For $|h|<c/2$ and $t\in[0,1]$ ,

[TABLE]

Proof.

On the one hand, by completion (that is essentially by the Pólya–Vinogradov inequality), we have as in the proof of Lemma 4.3 for every $x\in\mathbb{Z}$ , $\xi\geqslant 0$ the bound

[TABLE]

On the other hand, we may insert the representation for $f(h)$ from the proof of Lemma 4.5, noting that, moreover, $F_{t}(\cdot,c)$ and $G_{t}(\cdot,c)$ define functions of a continuous real variable that satisfy

[TABLE]

Moreover, denoting $x_{t}^{+}=\lfloor(c-1)t\rfloor+1$ , we may write

[TABLE]

Using summation by parts, this leads to

[TABLE]

The contributions from $-c/2<h<0$ are estimated analogously, and the $h=0$ term is trivially admissible. ∎

This leads to the following.

Lemma 4.7.

We have

[TABLE]

Proof.

Let $\Sigma_{k,\bm{m},\bm{n}}$ be the set of all pairs $(j,\ell)$ such that $1\leqslant j\leqslant k$ and $1\leqslant\ell\leqslant n_{j}+m_{j}$ . By writing $\alpha_{c}(h_{j,\ell};t_{j})/\sqrt{c}=(\alpha_{c}(h_{j,\ell},t_{j})/\sqrt{c}-\beta(h_{j,\ell};t_{j}))+\beta(h_{j,\ell};t_{j})$ and expanding the product, may write

[TABLE]

where

[TABLE]

and similarly for $(\alpha_{c}(h_{j,\ell},t_{j})/\sqrt{c}-\beta(h_{j,\ell};t_{j})^{\ast})$ .

Using this expansion, we have that

[TABLE]

In this expression, we bound the sums over $h_{j,\ell}$ corresponding to $(j,\ell)\in\Sigma_{k,\bm{m},\bm{n}}\setminus\Sigma$ trivially, using $\beta(h;t)\ll 1/(1+|h|)$ , and those corresponding to $(j,\ell)\in\Sigma$ using Lemma 4.6. This gives the announced estimate. ∎

We would now like to change the order of summation, so that the $c$ -sum is inside of the $\vec{\bm{h}}$ sums. We now show that the limits of summation can indeed be changed from $|h_{j,\ell}|<c/2$ to the range $|h_{j,\ell}|<Q/2$ independent of $c$ , while only introducing negligible error.

Lemma 4.8.

Let $\mathcal{H}^{\prime}$ be the set of all $k$ -tuples $\vec{\bm{h}}\!=\!(\vec{h}_{1},\dots,\vec{h}_{k})$ of vectors $\vec{h}_{j}\!=\!(h_{j,\ell})_{1\leqslant\ell\leqslant n_{j}+m_{j}}$ satisfying $|h_{j,\ell}|<Q/2$ . Then,

[TABLE]

Proof.

Arguing as in the proof of Lemma 4.7, we find that, for every $c\in[Q,2Q]\cap\mathcal{D}$ ,

[TABLE]

We claim that

[TABLE]

The first of these bounds is trivial from $\beta(h;t)\ll 1/(1+|h|)$ .

For the second bound, we argue analogously as in the proof of Lemma 4.6. Recall the notation and the Pólya–Vinogradov bound for $A_{x}(\xi)$ from (4.11), and denote $x_{t}^{\ast}=\lfloor ct\rfloor$ . Then, we may rewrite the contribution of $c/2<h<Q/2$ to the second sum as

[TABLE]

where

[TABLE]

Then summation by parts indeed yields

[TABLE]

The terms with $-Q/2<h<-c/2$ are estimated analogously. Putting everything together, summing over all $c\in[Q,2Q]\cap\mathcal{D}$ with weights $m_{Q}(c)$ , and invoking Lemma 4.7, we obtain

[TABLE]

Since the range of summation in $\mathcal{H}^{\prime}$ does not depend on $c$ , we may exchange the order of summation, and the lemma follows. ∎

4.2. Isolating the main term and proofs of main results

In view of Lemma 4.8, we may write

[TABLE]

where

[TABLE]

The following subsection will be devoted to the proofs of the following two lemmata.

Lemma 4.9.

[TABLE]

Lemma 4.10.

[TABLE]

Taking Lemmata 4.9 and 4.10 for granted, we are now ready for the proofs of the main results of this section.

Proof of Proposition 4.1.

Proposition 4.1 follows immediately by combining (4.7), (4.14), and Lemmata 4.9 and 4.10. ∎

Proof of Corollary 4.2.

For every $\bm{t}=(t_{1},\dots,t_{k})\in[0,1]^{k}$ , the $\mathbb{C}^{k}$ -valued random variable $G^{\ast}(\bm{t})=(G^{\ast}(t_{1}),\dots,G^{\ast}(t_{k}))$ is mild by Lemma 3.5. According to Proposition 2.2, the convergence in law of $G_{Q}(\bm{t})$ to $G^{\ast}(\bm{t})$ as $Q\to\infty$ may be verified by checking that the corresponding moments satisfy $\mathcal{M}_{Q}(\bm{t};\bm{m},\bm{n})\to\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})$ as $Q\to\infty$ for every $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ . But this follows immediately (in a strong form) from Proposition 4.1. ∎

4.3. Square and non-square contributions

In this subsection, we prove Lemmata 4.9 and 4.10.

Proof of Lemma 4.9.

Adapting the sieving argument of (4.2), we find that for every $0\neq H=Q^{\mathrm{O}(1)}$ ,

[TABLE]

Inserting this into the definition of $\widetilde{\mathcal{M}_{Q}^{0}}(\bm{t};\bm{m},\bm{n})$ yields

[TABLE]

We then estimate, arguing as in (3.27) and (3.24),

[TABLE]

as well as

[TABLE]

Putting everything together and invoking the evaluation of $\mathcal{M}^{\ast}(\bm{t};\bm{m},\bm{n})$ from Lemma 3.5 completes the proof. ∎

Proof of Lemma 4.10.

We begin by noting the (direct) upper bound

[TABLE]

Therefore, grouping the terms indexed by $\vec{\bm{h}}\in\mathcal{H}^{\prime}$ according to the values of $\square\neq|H(\vec{\bm{h}})|\leqslant Q^{\ast}:=(Q/2+1)^{m+n}$ , we have

[TABLE]

where, by the divisor bound,

[TABLE]

Now, sieving for the conditions $c\in\mathcal{D}$ and $(c,k)=1$ , we find that

[TABLE]

Since $d>1$ is square-free, $(d/c)$ is a non-principal character of conductor $\asymp d$ , we find by the Pólya–Vinogradov inequality and straightforward estimates that

[TABLE]

Therefore

[TABLE]

On the other hand, using the Cauchy–Schwarz inequality followed by Heath-Brown’s quadratic large sieve (Proposition 2.5), we find that

[TABLE]

Therefore

[TABLE]

Making the optimal choice $D=Q^{2/3}$ we conclude that

[TABLE]

We note that we did not try to optimize the final exponent of power savings in Lemma 4.10 and that the character sums in the proof can surely be treated more delicately. We opted for brevity since the present power savings suffice for us.

5. Convergence in law

The goal of this section is to prove the following statement, which will in turn be used to verify the tightness of the sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables $(G_{Q})$ as $Q\to\infty$ .

Proposition 5.1.

For every real $\alpha>2$ , there exists a $\delta=\delta(\alpha)>0$ such that for every $Q\geqslant 1$ and every $0\leqslant s,t\leqslant 1$ ,

[TABLE]

We will prove Proposition 5.1 in §5.3, along with the corollaries for the tightness and convergence in law of the sequence $(G_{Q})$ , after laying the ground work in §5.1 and §5.2. It will be seen that we can, in fact, choose $\delta(\alpha)=\min(\delta_{1}(\alpha-2),\delta_{2})$ for some two constants $\delta_{1},\delta_{2}>0$ , which may in principle be explicated; in particular, any $\delta_{2}<\frac{1}{3}$ is allowable with a corresponding suitable $\delta_{1}>0$ .

5.1. Preparatory lemmata

The first principal arithmetic input into the proof of tightness is the following lemma, which is a simple variation of the Burgess-like bound for short mixed character sums (Proposition 2.7).

Lemma 5.2.

For every $\kappa<\frac{3}{4}$ , there exists a $\delta=\delta_{\kappa}>0$ such that for every primitive character of any square-free conductor $c>1$ and any $M$ and every $1\leqslant N\leqslant c^{\kappa}$ ,

[TABLE]

Proof.

We begin with an application of completion, as in Lemma 4.3, obtaining

[TABLE]

where $\epsilon_{\chi}$ is the sign of the Gauss sum for the character $\chi$ . Denote the two sums above (including the factor $\epsilon_{\chi}/c^{1/2}$ ) by $S_{1}$ and $S_{2}$ . On the one hand, by exchanging the order of summation and applying Proposition 2.7 we have

[TABLE]

On the other hand, denoting

[TABLE]

we have by integration by parts that

[TABLE]

Now, the conclusion of Proposition 2.7 implies, for $|I|>c^{1/4+\kappa}$ , that

[TABLE]

simply because the complete sum over all $n\bmod c$ is of size at most $\mathrm{O}(c^{1/2})$ . This implies that

[TABLE]

The proof is complete. ∎

The following variation of Heath-Brown’s quadratic large sieve is convenient and probably known, but we could not locate a ready reference.

Lemma 5.3.

For any $M\in\mathbb{N}$ , real numbers $0<s<t$ , and complex numbers $(a_{n})_{sM<n\leqslant 2tM}$ ,

[TABLE]

Proof.

Clearly we may assume with out loss of generality that $1/(2M)\leqslant s<t$ , and then we may also assume that $s<t\leqslant 2s$ , as the general case follows by splitting $[s,t]$ into $\mathrm{O}(1+\log(t/s))$ intervals of this form.

Now, for every $0\leqslant k\leqslant\log_{2}M+1$ , consider the collection $\mathcal{I}_{k}$ of intervals of length $|t-s|M/2^{k}$ intersecting $[sM,2tM]$ . For every $M<m\leqslant 2M$ , we split the interval $(sm,tm]$ into $\mathrm{O}(\log M)$ such intervals (at most two for each value of $k$ ), using the obvious greedy algorithm, and we estimate the inner sum in (5.1) using the Cauchy–Schwarz inequality.

Now, each specific interval $I\in\mathcal{I}_{k}$ appears in at most $\mathrm{O}(|I|/s+1)$ such decompositions. For, indeed, for $I$ to appear in the decomposition of $[sm,tm]$ , one of the points $sm$ or $tm$ must appear in the interval $\tilde{I}$ centered at the midpoint of $I$ and of length $3|I|$ ; but this forces $m$ to lie in the union $\tilde{J}(I)=\tilde{I}/s\cup\tilde{I}/t$ and, in particular, determines $m$ to within the stated number of choices. Putting this together, we have the estimate

[TABLE]

Estimating the inner double sum using Heath-Brown’s quadratic large sieve (Proposition 2.5), we have that

[TABLE]

This completes the proof. ∎

Lemma 5.3 allows us to prove the following estimate, which will be crucial in our estimates.

Lemma 5.4.

For every $0\leqslant s<t\leqslant 1$ with $|t-s|\geqslant 1/(2Q)$ , we have

[TABLE]

Proof.

Clearly we may assume that $s\geqslant 1/(2Q)$ without loss of generality. For $\tau\in[s,t]$ , denote

[TABLE]

By Lemma 5.3, we have the estimate

[TABLE]

where the final estimate follows by separately considering the cases $|t-s|\leqslant s$ and $|t-s|>s$ and keeping in mind that $s\geqslant 1/(2Q)$ .

Now, by summation by parts and the Cauchy–Schwarz inequality, we have

[TABLE]

Moreover, by the integral Minkowski’s inequality,

[TABLE]

Upon applying (5.2) in the two previous displays, we conclude that

[TABLE]

as desired. ∎

5.2. Estimates according to ranges

he following two lemmata are purely analytical in nature.

Lemma 5.5.

If $\alpha>0$ and

[TABLE]

then

[TABLE]

Lemma 5.6.

If $\alpha\geqslant 1$ and

[TABLE]

then

[TABLE]

These statements are analogues of [RRS20, Lemma 4.2] and [RRS20, Lemma 4.3] (as well as of the corresponding statements in [MZ23]). The proofs are essentially verbatim, and so we omit them for brevity, the only notable adaptation being the insertion of (4.6) in place of [RRS20, (5)].

Lemma 5.7.

For every $\kappa>\frac{1}{4}$ , there exists a $\delta=\delta_{\kappa}>0$ such that, if $\alpha\geqslant 2$ and if

[TABLE]

for some $\lambda\geqslant\kappa$ , then

[TABLE]

Proof.

Without loss of generality, let $s<t$ . By the very definition of $\widetilde{G}(\cdot;c)$ ,

[TABLE]

where we incur the harmless error term for no other reason than notational simplicity. Thus, using Lemma 5.6 and recalling the condition that $|t-s|\gg 1/Q$ , we first have that

[TABLE]

Using Lemmata 5.2 and 5.4 we conclude that

[TABLE]

The statement of the lemma follows upon inputing the condition $Q\ll|t-s|^{-1/\lambda}$ , and setting $\min(\frac{1}{2},\delta/\lambda)$ as the value of $\delta$ . ∎

Lemma 5.8.

For every even integer $\alpha\geqslant 2$ , if

[TABLE]

for some $\lambda<\frac{1}{3}$ , then

[TABLE]

Proof.

First off, in analogy with (4.5), we denote

[TABLE]

In other words, the moment $\widetilde{\mathcal{M}^{\ast}_{Q}}([s,t];\alpha)$ is constructed in exactly the same fashion as $\widetilde{\mathcal{M}_{Q}}(\bm{t};\bm{m},\bm{n})$ , with $k=1$ and $m_{1}=n_{1}=\alpha/2$ , but with $\widetilde{G}(t;c)-\widetilde{G}(s;c)$ in place of each $\widetilde{G}(t_{1};c)$ . In yet other words, we have a finite (with length and coefficients depending only on the fixed value of $\alpha$ ) expansion

[TABLE]

Applying Lemma 4.8, decomposition (4.14), and Lemma 4.10 to this expansion, we have

[TABLE]

where $\mathcal{H}^{\prime}$ is the set of all pairs $\vec{\bm{h}}=(\vec{h}_{1},\vec{h}_{2})$ of vectors $\vec{h}_{j}=(h_{j,\ell})_{1\leqslant\ell\leqslant\alpha/2}$ satisfying $|h_{j,\ell}|<Q/2$ , and

[TABLE]

But then a moment’s reflection shows that in fact

[TABLE]

The same result can be arrived at by following the evaluation of $\widetilde{\mathcal{M}_{Q}}(\bm{t};\bm{m},\bm{n})$ in section 4, with $k=1$ and $m_{1}=n_{1}=\alpha/2$ and with $\widetilde{G}(t;c)-\widetilde{G}(s;c)$ in place of each $\widetilde{G}(t_{1};c)$ .

Now, the expectation occurring in this evaluation may be further estimated as

[TABLE]

bounding $|\beta(h;t)-\beta(h;s)|$ by interpolating between the obvious estimates $|\beta(h;t)-\beta(h;s)|\ll\min(|t-s|,1/(1+|h|))$ . The statement of the lemma follows upon inputing the condition $Q\gg|t-s|^{-1/\lambda}$ and recalling that $m_{Q}(c)\asymp 1/Q$ . ∎

5.3. Proof of tightness

We are now ready for the main proofs of this section.

Proof of Proposition 5.1.

Fix an arbitrary $\frac{1}{4}<\kappa=\lambda<\frac{1}{3}$ , and let $\delta=\delta_{\kappa}>0$ be as in the statement of Lemma 5.7. Applying Lemma 5.5, 5.7, or 5.8 according to the size of $|t-s|$ , we conclude that, for every even integer $\alpha\geqslant 2$ and every $t,s\in[0,1]$ ,

[TABLE]

with

[TABLE]

This completes the proof of Proposition 5.1 when $\alpha>2$ is an even integer; the claim for other values of $\alpha>2$ follows by interpolation. We also see that we can take $\delta(\alpha)=\min(\delta_{1}(\alpha-2),\delta_{2})$ with (in principle) explicit $\delta_{1},\delta_{2}>0$ , and we may obtain any $\delta_{2}<\frac{1}{3}$ by taking $\kappa=\lambda$ sufficiently close to $\frac{1}{4}$ . ∎

Using Kolmogorov’s Tightness Criterion (Proposition 2.4), Proposition 5.1 immediately implies the following statement.

Corollary 5.9.

The sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables $(G_{Q})$ is tight as $Q\to\infty$ .

Combining Corollaries 4.2 and 5.9 and applying Prokhorov’s Criterion (Proposition 2.3), we then obtain the following capstone statement.

Corollary 5.10.

The sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables $(G_{Q})$ converges in law to $G^{\ast}$ as $Q\to\infty$ .

6. The atlas of shapes

In this section, we consider the local behavior of the limiting shapes $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ at rational points $t_{0}\in[0,1]\cap\mathbb{Q}$ . We prove the general first-order asymptotics in §6.1 and a refined two-term asymptotic expansion in §6.2. These asymptotics relate the local behavior of $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ at $t_{0}=a/q$ to certain complete exponential sums modulo $q$ , which we study in detail in §6.3. In §6.4, we combine all these conclusions and prove Proposition 6.5, which provides a collection of rational points $t_{0}=a/q$ at which the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ has a cusp and explains the striking sharp reversals observed in the introduction (Figures 2–4).

6.1. Local behavior of limiting shapes

We begin by writing for short

[TABLE]

We record the simple estimates

[TABLE]

Now, fix a $t_{0}=a/b\in[0,1]\cap\mathbb{Q}$ with $(a,b)=1$ , and write

[TABLE]

By absolute convergence, we may rewrite

[TABLE]

It will be convenient to write

[TABLE]

as well as $\mathcal{N}_{\bm{\epsilon}_{Z}}^{+}=\mathcal{N}_{\bm{\epsilon}_{Z}}\cap\mathbb{N}$ , $\mathcal{N}_{\bm{\epsilon}_{Z}}^{+}[d]=\mathcal{N}_{\bm{\epsilon}_{Z}}[d]\cap\mathbb{N}$ . Finally, for a modulus $m\in\mathbb{N}$ , we will also consider the normalized complete exponential sums

[TABLE]

The connection between the exponential sums (6.5) and the local behavior of the paths $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ and their decomposition (6.3) is given by the following lemma.

Lemma 6.1.

For every $t_{0}=a/b\in[0,1]\cap\mathbb{Q}$ as in (6.2), and for every $d\mid b_{Z}$ , the function $G^{\sharp}_{\bm{\epsilon}_{Z}}[d]$ defined in (6.3) satisfies, for $t\in[0,1]$ ,

[TABLE]

where $\mathcal{P}_{\bm{\epsilon}_{Z}}[d]$ , $s^{\ast}(a/(b/d);\bm{\epsilon}_{Z})$ , and $c_{\bm{\epsilon}_{Z}}[d]\neq 0$ are as in (6.4), (6.5), and (6.6).

Proof.

For a $K>0$ to be chosen suitably large later, we may write

[TABLE]

where, separating the terms in (6.3) according to whether $n=dn^{\prime}\leqslant dK$ or $n>dK$ , splitting the summands into dyadic ranges of the form $n^{\prime}\in[K^{\prime},2K^{\prime}]$ (and denoting dyadic summations over $K^{\prime}=2^{k^{\prime}}$ , $k^{\prime}\in\mathbb{Z}_{\geqslant 0}$ by $\sum^{\mathrm{dy}}$ ), and estimating using (6.1) and the Mean Value Theorem,

[TABLE]

Now, $n^{\prime}\in\mathcal{N}_{\bm{\epsilon}_{Z}}^{+}[d]$ if and only if $p\mid n^{\prime}\,\Rightarrow\,p\in\mathcal{P}_{\bm{\epsilon}_{Z}}[d]$ . Moreover, denoting by $B=[0,2\varphi(b/d)-1)^{\mathcal{P}_{\bm{\epsilon}_{Z}}[d]}$ the cube with edges indexed by $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}[d]$ and all edge lengths $2\varphi(b/d)$ and

[TABLE]

we have that $|\mathcal{N}_{\bm{\epsilon}_{Z}}^{+}[d]\cap[1,K]|=|\mathcal{M}_{\bm{\epsilon}_{Z}}[d](K)|$ as well as

[TABLE]

Putting everything together, we have proved that

[TABLE]

where

[TABLE]

The claim of the lemma follows upon choosing $K=|t-t_{0}|^{-1}$ to balance the error terms. ∎

In view of Lemma 6.1, the behavior of the deterministic path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ close to various rational points is guided at first by whether

[TABLE]

or not. In particular, close to a rational point $t_{0}=a/b\in\mathbb{Q}$ , the deterministic path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ splits as in (6.3) as the sum of components $G^{\sharp}_{\bm{\epsilon}_{Z}}[d](t)$ over $d\mid b_{Z}$ , with the component $G^{\sharp}_{\bm{\epsilon}_{Z}}[d]$ having a logarithmic singularity of order $|\log|t-t_{0}||^{|\mathcal{P}_{\bm{\epsilon}_{Z}}[d]|}$ as $t\to t_{0}$ whenever $\mathop{\mathrm{Re}}s^{\ast}(a/(b/d);\bm{\epsilon}_{Z})\neq 0$ . The property of $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ having a logarithmic singularity at $t_{0}=a/b$ should not be confused with this path having a cusp at $t_{0}$ , which is substantially more delicate and studied in §6.2.

The nonvanishing condition (6.7) is immediate to numerically check for every specific pair $(a/b,\bm{\epsilon}_{Z})$ , and in §6.3 we offer a more detailed analysis of this fascinating question in more generality. In particular, if $\mathop{\mathrm{Re}}s^{\ast}(a/b^{Z};\bm{\epsilon}_{Z})\neq 0$ , then the full path has a logarithmic singularity of order $|\log|t-t_{0}||^{|\mathcal{P}_{\bm{\epsilon}_{Z}}|}$ ; if $\mathop{\mathrm{Re}}s^{\ast}(a/b^{Z};\bm{\epsilon}_{Z})=0$ but $\mathop{\mathrm{Re}}s^{\ast}(a/(b^{Z}(b_{Z}/d));\bm{\epsilon}_{Z})\neq 0$ for some $d\mid b_{Z}$ , then the component $G^{\sharp}_{\bm{\epsilon}_{Z}}[d](t)$ has a logarithmic singularity of lesser severity as $t\to t_{0}$ , which may or may not be inherited by the full path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ .

6.2. Finer local information

Already Lemma 6.1 clearly shows that the local behavior of $G^{\sharp}_{\bm{\epsilon}_{Z}}$ near $t_{0}=a/b$ is, at the first order, guided by whether $\mathop{\mathrm{Re}}s^{\ast}(a/b^{Z};\bm{\epsilon}_{Z})\neq 0$ or not. This condition is analyzed in more detail in §6.3. In the case of nonvanishing, we see that

[TABLE]

according to the sign of $\mathop{\mathrm{Re}}s^{\ast}(a/b^{Z};\bm{\epsilon}_{Z})\in\mathbb{R}_{\neq 0}$ , indicating (in contrast to the conclusion of Theorem 1.2(2)) that the function $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ is not differentiable at $t=t_{0}=a/b$ but the path $G^{\sharp}_{\bm{\epsilon}_{Z}}$ appears smooth around the point $G^{\sharp}_{\bm{\epsilon}_{Z}}(t_{0})$ , in the sense that all of its Dini quotients vanish (to an arbitrarily high degree on the logarithmic scale). Such points are clearly observable (but not specifically marked) in Figure 5; check, for example, neighborhoods of $t_{0}=0,\frac{1}{2},1$ .

To identify points $t_{0}=a/b$ where the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ exhibits cusp behavior, we need to be able to consider the cases where $\mathop{\mathrm{Re}}s^{\ast}(a/b^{Z};\bm{\epsilon}_{Z})=0$ and analyze lower-order local behavior. From now on, throughout the rest of Section 6, we consider the case $b_{Z}=d=1$ , so that $a/b^{Z}=a/b$ .

Fix once and for all an even nonnegative function $\phi\in C_{c}^{\infty}(\mathbb{R})$ such that $\phi(x)=1$ for $x\in[-1,1]$ and $\phi(x)=0$ for $x\not\in[-2,2]$ . By a familiar repeated integration by parts argument, for every $m\in\mathbb{N}$ , the Mellin transform $\widetilde{\phi}(s)$ has a meromorphic continuation to $\mathop{\mathrm{Re}}(s)>-m$ given by

[TABLE]

In particular, we have the asymptotic expansions

[TABLE]

for every $\ell>0$ , as well as the uniform bounds

[TABLE]

For a large parameter $K>0$ , to be suitably chosen later, we consider the (finite!) sum

[TABLE]

Using the absolutely and uniformly convergent Taylor series expansion for $e(n\tau)$ , we have

[TABLE]

where

[TABLE]

Denoting as usual $G(\chi)=\sum^{\ast}_{x\bmod b}\chi(x)e(x/b)$ the unnormalized Gauss sum of a (not necessarily primitive) character $\chi$ modulo $b$ and using the discrete and archimedean Mellin transforms, we find that, for any $\sigma>k$ ,

[TABLE]

The function $\gamma(\chi,s)=\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}(1-\epsilon_{p}\chi(p)/p^{s})^{-1}$ continues to a meromorphic function with a pole at $s=0$ of order at most $|\mathcal{P}_{\bm{\epsilon}_{Z}}|$ and an asymptotic expansion

[TABLE]

where $P_{\bm{\epsilon}_{Z}}=1/{\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}(\log p)}$ . In addition to the pole at $s=0$ , we encounter in the evaluation of $I_{K,k}(\phi)$ poles whenever

[TABLE]

and these poles are simple and pairwise distinct except possibly for the pole at $s=k$ . When $k\neq 0$ , we also encounter a distinct pole of $\widetilde{\phi}(s)$ at $s=0$ ; this then accounts for all the poles of the integrand in $\mathop{\mathrm{Re}}s>-1$ . We will show that the total contributions of these poles converge absolutely and that the contour of integration in (6.11) may be shifted to $\mathop{\mathrm{Re}}s=-\delta$ for a suitable $0<\delta<1$ , and we will write

[TABLE]

We begin by evaluating

[TABLE]

The total contribution of these residues to $I_{K,k}(\phi)$ in (6.11) equals

[TABLE]

where $\iota_{k,\nu}(\phi,b)$ are arithmetic functions given by

[TABLE]

and we note that $j_{\nu,k}(b)$ depends only on $k\bmod 2$ .

We proceed to estimate each of the remaining contributions $I_{K,k}^{j}(\phi,b)$ ( $1\leqslant j\leqslant 3$ ). At each of the poles $s_{k,p,\ell,m}\neq k$ , we have for $q\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ , $q\neq p$ , that

[TABLE]

for a fixed constant $C$ depending on $Z$ only, by using Baker’s theorem on linear forms in logarithms (Proposition 2.8). Therefore, also using the uniform bound (6.9) for $\widetilde{\phi}(s)$ , the total contribution of these poles is at most

[TABLE]

by choosing $N\in(C^{\prime}+2,C^{\prime}+3]$ . We emphasize that this is only a preliminary bound, whose primary role is to ensure absolute convergence; we will be estimating the combined contributions of all $I_{K,k}^{1}(\phi,b)$ far more delicately. Using the same Proposition 2.8, we also see that the total contributions of the integrals over the horizontal segments $[\delta+it,\sigma+it]$ may be bounded by $\ll_{Z,b}(\sigma-\delta)(2K)^{k}/(2+|t|)^{C^{\prime}-N}\to 0$ over a suitable sequence of $|t|\to\infty$ , and thus we may indeed shift the vertical contour past the line $\mathop{\mathrm{Re}}s=k$ .

When $k\neq 0$ , we also collect the contribution from the simple pole at $s=0$ equal to

[TABLE]

Finally, using the uniform bound (6.9), the total contribution of the remaining integrals over $\mathop{\mathrm{Re}}s=-\delta$ is easily seen to be

[TABLE]

where $C=\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}p$ .

We now return to (6.10) and compute the summands in the decomposition

[TABLE]

corresponding to the total contributions of the four summands in (6.13). Using (6.14), we compute

[TABLE]

where

[TABLE]

and

[TABLE]

with the kernels $\psi_{0}$ and $\psi_{1}$ given by

[TABLE]

The total contribution of terms with the factor $j_{\nu_{2},1}(b)$ to the sum in (6.17) is $j_{\nu_{2},1}(b)$ times

[TABLE]

where $P^{1}_{\nu}(L)$ is the degree $\nu$ polynomial given by the absolutely convergent integral

[TABLE]

Similarly, the total contribution of terms with the factor $j_{\nu_{2},0}(b)$ is $j_{\nu_{2},0}(b)$ times

[TABLE]

by a little calculation using integration by parts, where $P^{0}_{\nu}(L)$ is the degree $\nu$ polynomial given by the conditionally convergent improper integral

[TABLE]

Indeed, the error term we encounter in the final line equals

[TABLE]

Putting everything together into (6.17), we find that

[TABLE]

Next, we proceed to estimate $I_{K}^{1}(\phi)$ , the combined contribution of the poles at $s=s_{k,p,\ell,\chi}\neq k$ . Using (6.8) with any $m\geqslant 1$ , we find that, for every $s_{k,p,\ell,\chi}=k+i\gamma$ , $k\geqslant 0$ , $\gamma=\gamma_{p,\ell,\chi}\neq 0$ ,

[TABLE]

Using the elementary integral expression for the beta function and substituting into (6.13), we find that

[TABLE]

where

[TABLE]

and

[TABLE]

where $\widetilde{\psi_{0}}=\psi_{0}+1$ and $\widetilde{\psi_{1}}=\psi_{1}$ , with the kernels $\psi_{0}$ and $\psi_{1}$ as in (6.18). Using the integration by parts in the $x$ -variable $N$ times, we find that

[TABLE]

For clarity, we note that this calculation can be performed equally well with any $m\geqslant 1$ (even with $m=1$ ); if we choose $m$ sufficiently large, then a similar conclusion can be reached by integration by parts in the $y$ -variable $N\leqslant m-1$ times. Inserting these estimates above and using Baker’s bound as in (6.16), we find that

[TABLE]

by choosing $N\in(C^{\prime}+2,C^{\prime}+3]$ and $C^{\prime\prime}=C^{\prime}+2+\mathbf{1}_{|t-t_{0}|K\geqslant 1}$ .

Finally, we address the contributions of $I_{K}^{2}(\phi)$ and $I_{K}^{3}(\phi)$ . Indeed, $I_{K}^{2}(\phi)$ is a smooth function given by the absolutely and convergent series

[TABLE]

and

[TABLE]

where we may choose, say, $\delta=\frac{1}{2}$ for simplicity.

In total, we have proved that

[TABLE]

Critically, this yields a nontrivial asymptotic when $K$ is appreciably larger than $1/|t-t_{0}|$ . Now, arguing as in the proof of Lemma 6.1, we can also estimate

[TABLE]

Choosing

[TABLE]

we conclude that

[TABLE]

where $\ell=|\log|t-t_{0}||$ , and the constants $c^{+}(t_{0})$ , $c^{\prime}(t_{0})$ , and $c^{-}(t_{0})$ (which also depend on $\bm{\epsilon}_{Z}$ ) may be read off from (6.12), (6.15), (6.20), and (6.19) as

[TABLE]

where we also used the classical evaluations

[TABLE]

kept in mind from (6.6) the notation $c_{\bm{\epsilon}_{Z}}=c_{\bm{\epsilon}_{Z}}[1]=1/(|\mathcal{P}_{\bm{\epsilon}_{Z}}|!\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}(\log p))=P_{\bm{\epsilon}_{Z}}/|\mathcal{P}_{\bm{\epsilon}_{Z}}|!$ and additionally denoted

[TABLE]

noted that $\ell=-\log|t-t_{0}|$ in the ranges in which the leading terms in (6.21) are dominant, and set, for $\varepsilon\in\{\pm\}=\{\pm 1\}$ and $q\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ ,

[TABLE]

For future reference we record the following simple lemma. In particular, it confirms that (as it must be) the leading constants in (6.21) and Lemma 6.1 match. To simplify the notation, we write $\bm{\epsilon}_{Z}^{q-}=\bm{\epsilon}_{Z}\mathbf{1}_{\{q\}^{c}}$ for the sequence $(\epsilon_{p}\delta_{p\neq q})_{p\leqslant Z}$ , and we introduce the related exponential sum

[TABLE]

where $\{x\}$ is the familiar sawtooth function defined by $\{x\}=x-\lfloor x\rfloor-1/2$ .

Lemma 6.2.

For every $t_{0}=a/b\in[0,1]\cap\mathbb{Q}$ with $(a,b)=(\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}p,b)=1$ , the constants $c^{+}(t_{0}),c^{\prime}(t_{0}),c^{-}(t_{0})\in\mathbb{R}$ shown in (6.22) satisfy

[TABLE]

Proof.

From orthogonality of characters, we have that

[TABLE]

The first two statements of the lemma follows by substituting this expression into (6.22).

On the other hand, by using the geometric series expansion and l’Hôpital’s rule, we can evaluate

[TABLE]

Using this evaluation, we can then argue as above that

[TABLE]

and the third statement again follows by invoking (6.22). ∎

Putting everything together completes the proof of the following proposition, the crowning achievement of this subsection.

Proposition 6.3.

There exists a $\delta>0$ , depending only on $Z$ , such that, for every $t_{0}=a/b\in[0,1]\cap\mathbb{Q}$ with $(a,b)=1$ and $(\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}p,b)=1$ , the function $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ satisfies

[TABLE]

for certain constants $c^{+}(t_{0}),c^{\prime}(t_{0}),c^{-}(t_{0})\in\mathbb{R}$ shown in (6.22) and Lemma 6.2 and $\ell=|\log|t-t_{0}||$ .

6.3. Nonvanishing and multiplicativity of exponential sums

The exponential sums $s^{\ast}(a/b;\bm{\epsilon}_{Z})$ are related (though not always in a straightforward fashion) to the generalized Gauss power sums, defined for any rational number $a/q$ with $(a,q)=1$ , $d\mid\varphi(q)$ , and $\iota\in\{0,1\}$ as

[TABLE]

We summarize this relationship in the following lemma. To succinctly state the multiplicativity property of the sums $s^{\ast}(a/m;\bm{\epsilon}_{Z})$ , we introduce for $\delta\mid\varphi(m)$ and $m\mid m^{\prime}$ slightly more general sums

[TABLE]

so that $s^{\ast}(a/m,\bm{\epsilon}_{Z})=s(a/m,\bm{\epsilon}_{Z})[1,m]$ .

Lemma 6.4.

(1)

For $q=p^{k}$ for an odd prime $p$ and $k\geqslant 1$ ,

[TABLE]

where $\iota^{\prime}=1$ if $\iota=1$ and $2\nmid k$ , and $\iota^{\prime}=0$ otherwise. In particular, for $q=p$ :

•

$\sigma_{d}^{\iota}(a/p)\neq 0$ * for $d\mid(p-1)/(1+\iota)$ ,*

•

$\sigma_{d}^{\iota}(a/p)\in\mathbb{R}$ * whenever $-1\in(\mathbb{Z}/p\mathbb{Z})^{\times d(1+\iota)}$ ,*

•

$\sigma_{d}^{1}(a/p)\in i\mathbb{R}$ * whenever $2\nmid d$ and $p\equiv 3\bmod 4$ ,*

•

for $d=1$ , $\sigma_{1}^{0}(a/p)=-1$ and $\sigma_{1}^{1}(a/p)=(a/p)\sqrt{p}G(p)$ . 2. (2)

If $2\nmid\mathop{\mathrm{ord}}_{m}p$ for some $p\leqslant Z$ with $p\nmid m$ and $\epsilon_{p}=-1$ , then $s^{\ast}(a/m;\bm{\epsilon}_{Z})=0$ . Further, if $m=q=\mathtt{p}^{k}$ for an odd prime $\mathtt{p}$ and $k\geqslant 2$ , $s^{\ast}(a/q;\bm{\epsilon}_{Z})=0$ unless $\mathop{\mathrm{ord}}_{q}p\mid(\mathtt{p}-1)$ for all $p\leqslant Z$ with $\epsilon_{p}\neq 0$ . 3. (3)

For $m=m_{1}m_{2}$ with $(m_{1},m_{2})=1$ and $\delta=(\varphi(m_{1}),\varphi(m_{2}))$ ,

[TABLE]

where $m_{i}\overline{m_{i}}\equiv 1\pmod{m_{3-i}}$ and

[TABLE] 4. (4)

For $q$ an odd prime power, let (noting that then $\mathcal{P}_{\bm{\epsilon}_{Z}}\subseteq(\mathbb{Z}/q\mathbb{Z})^{\times d_{q}}$ )

[TABLE]

Then,

[TABLE]

Proof.

Item (1) is essentially elementary (and probably well known). For $k=1$ , we may write

[TABLE]

If $\iota=1$ , then changing variables $x\mapsto xg^{(p-1)/d}$ (for $g$ an arbitrary primitive root modulo $p$ ) shows that $\sigma_{d}^{1}(a/p)=0$ unless $2\mid(p-1)/d$ , that is, $d\mid(p-1)/2$ .

Now, the root of unity $\zeta_{p}=e(1/p)$ generates the cyclotomic field $\mathbb{Q}(\zeta_{p})$ , in which $\varpi_{p}=1-\zeta_{p}$ is a prime of absolute norm $p\sim\varpi_{p}^{p-1}$ . From $\zeta_{p}\equiv 1\pmod{\varpi_{p}}$ we conclude that

[TABLE]

and thus in particular $\sigma_{d}^{0}(a/p)\neq 0$ . For $\iota=1$ and $d\mid(p-1)/2$ , we have by a slightly more involved argument that

[TABLE]

from which we conclude that

[TABLE]

so in particular $\sigma_{d}^{1}(a/p)\neq 0$ . As for the realness claim, if $\iota=1$ and $-1\equiv y^{2d}\bmod p$ , we see by making a change of variable $x\mapsto xy^{2}$ that

[TABLE]

The case of $\iota=0$ and $-1\in(\mathbb{Z}/p\mathbb{Z})^{\times d}$ is similar (even easier). That $\sigma_{d}^{1}(a/p)=-\overline{\sigma_{d}^{1}(a/p)}$ when $2\nmid d$ and $p\equiv 3\bmod 4$ follows by a change of variable $x\mapsto-x$ . Finally, the claims identifying $\sigma_{1}^{\iota}(a/p)$ as the Ramanujan and Gauss sums modulo $p$ are immediate.

For $k\geqslant 2$ , we may write $d=p^{\kappa}\delta$ for some $0\leqslant\kappa\leqslant k-1$ and $\delta\mid(p-1)$ . Denoting by $g$ a (fixed but otherwise arbitrary) primitive root modulo $p^{k}$ , $g^{p^{\kappa}}$ is a primitive root modulo $p^{k-\kappa}$ , from which it is easy to see that

[TABLE]

When $\kappa=k-1$ , we find that $\sigma_{d}^{\iota}(a/q)$ equals $p^{k-1}\sigma_{\delta}^{\iota^{\prime}}(a/p)$ and thus doesn’t vanish by what we already proved. On other hand, for $\kappa\leqslant k-2$ , it follows from the $p$ -adic method of stationary phase (see, for example, [MZ23, Lemma 1]) that the above sum vanishes, since no summands satisfy the stationary phase condition $\delta\cdot ax^{\delta-1}\equiv 0\pmod{p^{\lfloor(k-\kappa)/2\rfloor}}$ .

We proceed to item (2). If $2\nmid\mathop{\mathrm{ord}}_{m}p$ for some $p\leqslant Z$ with $p\nmid m$ and $\epsilon_{p}=-1$ , then we see by shifting variables $m_{p}\mapsto m_{p}+\mathop{\mathrm{ord}}_{m}p$ in (6.5) that $s^{\ast}(a/m;\bm{\epsilon}_{Z})=0$ . Now, let $m=\mathtt{p}^{k}$ for some $k\geqslant 2$ , and write $\delta=\mathop{\mathrm{ord}}_{\mathtt{p}}p$ , $p^{\delta}=1+\mathtt{p}^{e}f$ for some $p\leqslant Z$ with $\epsilon_{p}\neq 0$ , $p\neq\mathtt{p}$ , $e\geqslant 1$ , $\mathtt{p}\nmid f$ . If $2\nmid\delta$ and $\epsilon_{p}=-1$ , then, noting that $p^{\delta\mathtt{p}^{k-e}}\equiv 1\pmod{\mathtt{p}^{k}}$ and changing variables $m_{p}\mapsto m_{p}+\delta\mathtt{p}^{k-e}$ in (6.5), we again see that $s^{\ast}(a/m;\bm{\epsilon}_{Z})=0$ ; thus, from now on we may assume that $2\mid\delta$ or $\epsilon_{p}=1$ . If $e\leqslant k-2$ , then for $1\leqslant\kappa\leqslant k-e$ we have that

[TABLE]

Using the above with any $\kappa\geqslant k/2-e$ and applying the $\mathtt{p}$ -adic method of stationary phase (see [MZ23, Lemma 1] and note that $\epsilon_{p}^{m_{p}+\delta\mathtt{p}^{\kappa}t}=\epsilon_{p}^{m_{p}}$ ), we conclude that the summation in (6.5) may be restricted to $m_{p}$ satisfying $\mathtt{p}^{k-\kappa}\mid f_{1}(m_{p})$ ; thus, picking any $\max(1,k/2-e)\leqslant\kappa\leqslant k-e-1$ we conclude that $s^{\ast}(a/m;\bm{\epsilon}_{Z})=0$ . If $e=k-1$ , only a minor tweak is needed:

[TABLE]

whence by shifting $m_{p}\mapsto m_{p}+(\mathtt{p}-1)t$ in (6.5) we analogously find that

[TABLE]

In conclusion, $s^{\ast}(a/\mathtt{p}^{k};\bm{\epsilon}_{Z})=0$ unless $p^{\delta}\equiv 1\pmod{\mathtt{p}^{k}}$ , which is to say that $\mathop{\mathrm{ord}}_{q}p\mid(\mathtt{p}-1)$ , and this condition must hold for every $p\leqslant Z$ with $\epsilon_{p}\neq 0$ .

Item (3) is a direct consequence of the Chinese Remainder Theorem. Indeed, the value of each summand in (6.5) depends only on $m_{p}$ modulo $2\varphi(m_{1})\varphi(m_{2})/\delta$ . Writing every $0\leqslant m_{p}<2\varphi(m_{1})\varphi(m_{2})/\delta$ as

[TABLE]

where $0\leqslant\mu_{p}<2\delta$ and $0\leqslant k_{pi}<\varphi(m_{i})/\delta$ and inverses are modulo $\varphi(m_{i})/\delta$ , we have that

[TABLE]

where $u_{i}=\overline{\varphi(m_{3-i})/\delta}(\varphi(m_{3-i})/\delta)$ are independent of $k_{pi}$ and satisfy $(u_{i},\varphi(m_{i})/\delta)=1$ . Thus,

[TABLE]

the claim follows immediately from this upon summing $e(a\bm{p}_{Z}^{\delta^{\prime}\bm{m}}/m)\bm{\epsilon}_{Z}^{\bm{m}}$ over $0\leqslant\mu_{p}<2\delta$ and $0\leqslant k_{pi}<\varphi(m_{i})/\delta$ , which encounters $(1/\delta)s(a/m;\bm{\epsilon}_{Z})[\delta^{\prime},m^{\prime}]$ and the product of $\bm{\epsilon}_{Z}^{\bm{\mu}}$ with two sums $(1/2\delta)s(a\overline{m_{3-i}}\bm{p}_{Z}^{\delta^{\prime}\bm{\mu}}/m_{i};|\bm{\epsilon}_{Z}|)[2\delta\delta^{\prime},m^{\prime}]$ .

Finally, we turn our attention to item (4). Fixing an arbitrary primitive root $g$ modulo $q$ and writing $p=g^{k_{p}}$ , $\epsilon_{p}=(-1)^{\varepsilon_{p}}$ , we have that $d_{q}=\gcd[(k_{p},\varphi(q))_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}]$ , and, by definition, $(2\varphi(q))^{|\mathcal{P}_{\bm{\epsilon}_{Z}}|}s^{\ast}(a/q;\bm{\epsilon}_{Z})$ equals

[TABLE]

where, for every $0\leqslant k<\varphi(q)$ with $d_{q}\mid k$ , we denote by $(m_{p}^{\circ}(k))$ an arbitrary particular solution of the congruence $\sum^{\ast}_{p\leqslant Z}k_{p}m_{p}^{\circ}(k)\equiv k\bmod\varphi(q)$ . The indexing set in the latter sum forms an additive group modulo $2\varphi(q)$ (a disjoint union of $2^{|\mathcal{P}_{\bm{\epsilon}_{Z}}|}$ additive groups modulo $\varphi(q)$ ) of combined order $M:=(2\varphi(q))^{|\mathcal{P}_{\bm{\epsilon}_{Z}}|-1}\cdot 2d_{q}$ , and the sum equals $M$ or [math] according to whether the implication

[TABLE]

holds or not. Let $2^{\varphi_{q}}\mathrel{\|}\varphi(q)$ ; then, by adjusting the values of $m_{p}$ modulo $\varphi(q)/2^{\varphi_{q}}$ using the Chinese Remainder Theorem, we see that the above implication holds if and only if we have a valid implication

[TABLE]

Further, denote $2^{\delta_{q}}\mathrel{\|}d_{q}$ , so that $\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}=\{p\leqslant Z:2^{\delta_{q}}\mathrel{\|}k_{p}\}\neq\emptyset$ . If $\delta_{q}=\varphi_{q}$ , the above can clearly hold only if all $\varepsilon_{p}=0$ ; otherwise, by dividing through the first congruence by $2^{\delta_{q}}$ and adjusting the values of $(m_{p})_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}}$ by even amounts, the above implication holds if and only if

[TABLE]

The latter plainly holds if and only if there exists a $\varepsilon\in\{0,1\}$ such that $\varepsilon_{p}=\varepsilon$ for all $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}$ and $\varepsilon_{p}=0$ for all $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}\setminus\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}$ , and in this case

[TABLE]

6.4. Locating a dense set of singularities

In this subsection, we use our results from §§6.1–6.3 to prove the following culminating proposition of Section 6, which provides a collection of points $t_{0}\in[0,1]\cap\mathbb{Q}$ at which the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ has a cusp.

Proposition 6.5.

Assume that $\bm{\epsilon}_{Z}$ is not identically zero or one. Let $q>Z$ be an odd prime such that $q\equiv 3\bmod 4$ and

[TABLE]

Then, for every $1\leqslant a\leqslant q-1$ ,

[TABLE]

and, for $t_{0}=a/q$ , the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ satisfies

[TABLE]

where $c_{\pm}(t_{0})\in\mathbb{R}$ are given explicitly in (6.28).

The curve $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ has a cusp at $t=t_{0}$ as long as $c_{+}(t_{0})c_{-}(t_{0})<0$ . Specifically:

(1)

If $\epsilon_{p}=-1$ for at least two $p\leqslant Z$ , then (6.26) holds with

[TABLE]

with $c^{-}(t_{0})=1/(|\mathcal{P}_{\bm{\epsilon}_{Z}}|-1)!\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}(\log p))\cdot\mathop{\mathrm{Im}}s^{\ast}(a/q;\bm{\epsilon}_{Z})\in\mathbb{R}_{\neq 0}$ , and the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ has a cusp at $t=t_{0}$ . In particular, the set of such $t_{0}=a/q\in[0,1]\cap\mathbb{Q}$ (over different values of $q$ ) is everywhere dense in $[0,1]$ . 2. (2)

If $\epsilon_{p}=-1$ for exactly one $p=p_{1}\leqslant Z$ and the residues of all $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}\setminus\{p_{1}\}$ modulo $q$ generate all quadratic residues modulo $q$ :

[TABLE]

then (6.26) holds with

[TABLE]

where $c^{-}(t_{0})\in\mathbb{R}_{\neq 0}$ is as above, $\delta_{p_{1},t_{0}}=(2/\pi)(a/p_{1})\log p_{1}/\sqrt{p_{1}}$ satisfies $|\delta_{p_{1},t_{0}}|<1/2$ , and the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ has a cusp at $t=t_{0}$ .

Proof.

For any $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ such that $\epsilon_{p}=-1$ , the congruence conditions on $q$ imply (fixing an arbitrary primitive root $g$ modulo $q$ ) that $p\equiv g^{k_{p}}\bmod q$ for some $2\nmid k_{p}$ , whence $\varphi(q)/\mathop{\mathrm{ord}}_{q}p$ is odd and a fortiori $d_{q}$ is odd as well. In the notation of Lemma 6.4, this, in turn, implies that, for $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ , $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{+}(q)$ if and only if $\epsilon_{p}=1$ , whence according to Lemma 6.4, items (1) and (4),

[TABLE]

From Proposition 6.3 and Lemma 6.2, we have that

[TABLE]

where $\ell=|\log|t-t_{0}||$ and

[TABLE]

This completes the proof of (6.26).

Recall the condition that $\mathcal{P}^{-}_{\bm{\epsilon}_{Z}}(q)=\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}=\{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}:\epsilon_{p}=-1\}\neq\emptyset$ . Now, if $|\mathcal{P}^{-}_{\bm{\epsilon}_{Z}}|\geqslant 2$ , then, for every $p_{1}\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ , there exists a $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}\setminus\{p_{1}\}$ , and by shifting variables in (6.24) by $m_{p}\mapsto m_{p}+\varphi(q)/2$ and recalling that $p^{\varphi(q)/2}\equiv-1\bmod q$ , we see that $\widetilde{s}_{p_{1}}(a/q;\bm{\epsilon}_{Z})\in i\mathbb{R}$ ; since this conclusion holds for every $p_{1}\in\mathcal{P}_{\bm{\epsilon}_{Z}}$ , we conclude that $c^{\prime}(t_{0})=0$ and (6.26) follows with $c_{+}(t_{0})=c_{-}(t_{0})=c^{-}(t_{0})$ . Moreover, it follows from quadratic reciprocity and Dirichlet’s theorem on primes in arithmetic progressions that there are infinitely many primes such that $q\equiv 3\bmod 4$ and $(q/p)=\epsilon_{p}$ for all $p\leqslant Z$ with $\epsilon_{p}\neq 0$ , whence the set of the corresponding fractions $t_{0}=a/q$ is dense in $[0,1]$ . This settles the case (1).

If $\mathcal{P}_{\bm{\epsilon}_{Z}}^{-}=\{p_{1}\}$ , the situation is more complicated: the same change of variables $m_{p}\mapsto m_{p}+\varphi(q)/2$ in (6.24) still shows that $\widetilde{s}_{p}(a/q;\bm{\epsilon}_{Z})\in i\mathbb{R}$ for all $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}\setminus\{p_{1}\}$ , while

[TABLE]

and

[TABLE]

One final simplification is possible, as follows. The group generated by the residues of all $p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{+}$ modulo $q$ is of the form $(\mathbb{Z}/q\mathbb{Z})^{\times 2d_{1}}$ , where $2d_{1}=\varphi(q)/\mathop{\mathrm{lcm}}(\mathop{\mathrm{ord}}_{q}p:p\in\mathcal{P}_{\bm{\epsilon}_{Z}}^{+})$ . If we denote by $2\Delta>0$ the smallest positive exponent such that $p_{1}^{2\Delta}\in(\mathbb{Z}/q\mathbb{Z})^{\times 2d_{1}}$ , say $p_{1}^{2\Delta}=\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}p^{m_{p}^{1}}$ , then $\Delta\mid(\varphi(q)/2)$ and a change of variables

[TABLE]

shows that the summation in (6.29) may be restricted to $\varphi(q)/2-\Delta\leqslant m_{p_{1}}<\varphi(q)/2+\Delta$ , since the contributions of the terms outside this range cancel out. The same argument as in the proof of Lemma 6.4(4) then shows that

[TABLE]

where $\iota_{p_{1}}(x^{d})=\iota_{p_{1}}(p_{1}^{\delta}y^{2d_{1}})=(-1)^{\lfloor\delta/\Delta\rfloor}$ for the well-defined value of $\delta\pmod{2\Delta}$ in a decomposition $x^{d}\equiv p_{1}^{\delta}y^{2d_{1}}\pmod{b}$ .

All of the above applies whenever $\mathcal{P}^{-}_{\bm{\epsilon}_{Z}}=\{p_{1}\}$ . To reach the conclusion that, for some $t_{0}=a/q$ with $(a/q)=1$ , $c_{+}(t_{0})c_{-}(t_{0})<0$ holds (whence the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ would have a cusp at $t=t_{0}$ ), it is thus necessary and sufficient to verify that the constants $c^{\prime}(t_{0})$ and $c^{-}(t_{0})$ as explicated in (6.28) and (6.30) satisfy

[TABLE]

We do not know how to verify or fully characterize the set of points $t_{0}$ where this fascinating condition is satisfied. However, under the condition (6.27), $d_{1}=\Delta=d=1$ , and the sums $\mathop{\mathrm{Im}}s^{\ast}(a/q;\bm{\epsilon}_{Z})$ and $\mathop{\mathrm{Re}}\widetilde{s}_{p_{1}}(a/q;\bm{\epsilon}_{Z})$ are essentially the quadratic Gauss and Ramanujan sums:

[TABLE]

Indeed, the former follows from Lemma 6.4; the latter is clear from the above expressions, or directly from the expression (6.23) for $s_{p_{1}}^{+}(1/b,\bm{\epsilon}_{Z})=2\mathop{\mathrm{Re}}\widetilde{s}_{p_{1}}(a/b;\bm{\epsilon}_{Z})$ , in which only the prinicipal character $\chi=\chi_{0}$ contributes $(1/\varphi(q))\overline{G(\chi_{0})}$ . This completes the proof of (2). ∎

*Remark 1**.*

The condition (6.27), which pertains to the case when exactly all but one $\epsilon_{p}=1$ , appears similar in spirit to (and perhaps in some ways weaker than) Artin’s primitive root conjecture, so it is perhaps reasonable to expect that it is satisfied for infinitely many primes $q\equiv 3\bmod 4$ in the fixed arithmetic progression described by (6.25), which would give an everywhere dense set of corresponding points $t_{0}=a/q$ ; we stop short of stating this as a formal conjecture. Unconditionally, using Schmidt’s estimates on complete character sums [Sch76, Theorem II.2C’] and Poisson summation, one can argue that, for every sufficiently large odd prime $q$ and every $2\nmid d\mid(q-1)$ ,

[TABLE]

for at least one $a\in I$ in every sufficiently large interval $I$ ; for example, we were able to prove that this holds in the mean square average over all $a\in I$ for $|I|\geqslant 2\sqrt{d}q^{1/4}$ . This alone shows that $\pi|\mathop{\mathrm{Im}}s^{\ast}(a/q;\bm{\epsilon}_{Z})|$ in (6.31) is often rather large (that is, of expected size), but a sufficiently good complementary upper bound on $2|\mathop{\mathrm{Re}}\widetilde{s}_{p_{1}}(a/q;\bm{\epsilon}_{Z})|$ is also needed, and in the case $d_{1}=\Delta=d=1$ of (6.27) this is guaranteed by reduction to the Ramanujan sum.

*Remark 2**.*

An interesting situation arises when all $\epsilon_{p}=1$ . Consider the specific case when $Z=5$ , $\bm{\epsilon}_{5}=(\epsilon_{2},\epsilon_{3},\epsilon_{5})=(1,1,1)$ . As in other cases, pictures strongly suggest that the path $G^{\sharp}_{\bm{\epsilon}_{5}}$ has an everywhere dense set of cusps, with the most visually prominent ones at many of the points $a/71$ ( $1\leqslant a\leqslant 70$ ); see Figure 7, which appears to be in parallel with the situation of Figure 5 (in which $\bm{\epsilon}_{5}=(1,1,-1)$ ). That these appear at the denominator $q=71$ is not a coincidence in light of

[TABLE]

Nevertheless, it is not difficult to verify that both 2 and 3 are of multiplicative order 35 modulo 71, so they generate (already each one of them generates) the subgroup $(\mathbb{Z}/71\mathbb{Z})^{\times 2}$ , whence in this case

[TABLE]

Thus, $\mathop{\mathrm{Re}}s^{\ast}(a/q;\bm{\epsilon}_{Z})\neq 0$ , so already in light of Lemma 6.1, the left and right slopes of $G^{\sharp}_{\bm{\epsilon}_{Z}}$ agree at $t_{0}=a/q$ , and the path has no cusp at any of these points, in apparent contradiction with the very convincing Figure 7. What is going on here?

The answer is that the apparent “cusps” are effects of lower-order terms which, in fact, disappear as $t$ gets really close to $t_{0}$ . Indeed, according to Proposition 6.3 and a quick calculation of constants using Lemma 6.2 (in which all $\widetilde{s}_{p}(a/b;\bm{\epsilon}_{Z})=0$ ), the leading two terms in the asymptotic expansion for $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)-G^{\sharp}_{\bm{\epsilon}_{Z}}(t_{0})$ are given by

[TABLE]

where $c^{\prime+}_{\bm{\epsilon}_{5}}=-(\log 2\pi+\gamma-1)+\log(30)/2\approx 0{.}286$ and

[TABLE]

as $t\to t_{0}\pm$ (or $t\to t_{0}\mp$ , according to the value of $(a/71)$ ). Thus, while the leading term indeed dominates for extremely small $|t-t_{0}|$ , the secondary term is substantially dominant for $\ell$ of moderate size, say up to $\ell\leqslant 30$ , that is up to around $|t-t_{0}|\leqslant 10^{-15}$ , and explains the illusion of cusp behavior. In truth, the path exhibits a transition to a “smooth” behavior through $t=t_{0}$ , which can also be observed upon zooming in to the appropriate scale; see Figure 8.

7. Classifying quadratic Gauss paths

Having prepared the ground with establishing the properties of the limiting shapes $G^{\sharp}_{\bm{\epsilon}_{Z}}$ in §§6.1–6.3, in this section we complete the proof of our main result on the atlas of shapes of Gauss paths, Theorem 1.2.

Proof of Theorem 1.2 (1).

The arguments of section 3 proceed analogously with the random series $G^{\ast}_{\bm{\epsilon}_{Z}}(t)$ and its coefficients $(X_{n,\bm{\epsilon}_{Z}})$ in place of $G^{\ast}(t)$ and $(X_{n})$ .

Indeed, the sets $A_{y_{1}}$ and $A_{y_{1}}^{y_{2}}$ of (3.2) and along with them the sums $S_{A_{y_{1}}}$ , $S_{A_{y_{1}}^{y_{2}}}$ , $S_{A_{y_{1}}}(\mathcal{E})$ , and $S_{A_{y_{1}}^{y_{2}}}(\mathcal{E})$ of (3.3) are insensitive to the values of $(\bm{\epsilon}_{Z})$ as soon as $y_{1}\geqslant Z$ ; hence the statement and proof of Proposition 3.2 remain the same as long as $y_{2}>y_{1}\geqslant\max(k^{3},Z)$ .

Turning to the proof of Proposition 3.1, the sums $S_{y}(t)$ and $S_{y}(\mathcal{E},t)$ of (3.10) are affected by the change of $(X_{n})$ to $(X_{n,\bm{\epsilon}_{Z}})$ , and so are the sums $R_{y_{1},y_{2}}(t)$ and $R_{y_{1},y_{2}}(\mathcal{E};t)$ of (3.11), the dependence in $R_{y_{1},y_{2}}(t)$ in (3.12) being that, in the outer sum over $n\neq 0$ with $P^{+}(|n|)\leqslant y_{1}$ , the coefficients $X_{n}$ are replaced by $X_{n,\bm{\epsilon}_{Z}}$ . Since $|X_{n,\bm{\epsilon}_{Z}}|\leqslant 1$ , the key estimate on $\|R_{y_{1},y_{2}}(\mathcal{E};t)\|_{\infty}$ in (3.14) remains as stated for $y_{1}\geqslant Z$ , and the rest of the proof of Lemma 3.4 and Proposition 3.1 remains exactly the same with the additionally assumption that $y_{1}\geqslant Z$ and correspondingly replacing $y_{0}(\delta)$ with $\max(y_{0}(\delta),Z)$ .

This shows that, indeed, the random Fourier series $\widetilde{G^{\ast}_{\bm{\epsilon}_{Z}}}(t)$ almost surely uniformly converges and defines a continuous function such that the $n$ th Fourier coefficient of $\widetilde{G^{\ast}_{\bm{\epsilon}_{Z}}}(t)-t$ (for $n\neq 0,1$ ) is precisely $X_{n,\bm{\epsilon}_{Z}}/2\pi in$ ; then, it follows as in (3.16) that the original random Fourier series $G^{\ast}_{\bm{\epsilon}_{Z}}(t)$ converges a.s. and that $G^{\ast}_{\bm{\epsilon}_{Z}}(t)=\widetilde{G^{\ast}_{\bm{\epsilon}_{Z}}}(t)$ a.s.

Turning to the computation of the moments, we may write every $h\in\mathbb{Z}\setminus\{0\}$ as $h=h_{Z}h^{Z}$ as in (6.2); then, in place of (3.17) we have the evaluations

[TABLE]

For every $\bm{t}=(t_{1},\dots,t_{k})\in[0,1]^{k}$ and $\bm{m},\bm{n}\in\mathbb{Z}_{\geqslant 0}^{k}$ , the evaluation of the moments $\mathcal{M}^{\ast}_{\bm{\epsilon}_{Z}}(\bm{t};\bm{m},\bm{n})$ (defined analogously to (3.18)) of the $\mathbb{C}^{k}$ -valued random variable $G^{\ast}_{\bm{\epsilon}_{Z}}(\bm{t})=(G^{\ast}_{\bm{\epsilon}_{Z}}(t_{1}),\dots,G^{\ast}_{\bm{\epsilon}_{Z}}(t_{k}))$ proceeds analogously to the proof of Lemma 3.5, with $\eta_{Z}(h)$ in place of $\eta(h)$ . In the evaluation of $\mathbb{E}(|S_{n,\bm{\epsilon}_{Z}}(t)|^{2p})$ in (3.21), all terms with $|(h_{1}\cdots h_{2p})^{Z}|=\square$ contribute, and consequently we may estimate

[TABLE]

also using (3.22). For the same reason, we encounter the dependence of the implied constants on $Z$ in (3.23), (3.24) and below, (3.26) and below, and (3.27). The proof otherwise runs verbatim the same (with $\eta_{Z}(h)$ and the condition $|h^{Z}|=\square$ in place of $\eta(h)$ and $|h|=\square$ ), and we obtain the evaluation

[TABLE]

In particular, our discussion above shows that $\mathcal{M}^{\ast}_{\bm{\epsilon}_{Z}}(\bm{t};\bm{m},\bm{n})\leqslant C_{Z}^{m+n}$ for some $C_{Z}>0$ depending only on $Z$ (in fact, $C_{Z}=e^{\mathrm{O}(\log\log Z)}$ is admissible), and so $G^{\ast}_{\bm{\epsilon}_{Z}}(\bm{t})$ is a mild random variable.

We then proceed to follow section 4 and demonstrate the analogue of Proposition 4.1, that the complex moments $\mathcal{M}_{Q,\bm{\epsilon}_{Z}}(\bm{t};\bm{m},\bm{n})$ , defined as in (4.1) with $G_{Q,\bm{\epsilon}_{Z}}(t_{i})$ in place of $G_{Q}(t_{i})$ , satisfy

[TABLE]

Denoting $\lambda^{\prime}_{p}(\pm 1)=(1-1/p)/2$ and $\lambda^{\prime}_{p}(0)=1/p$ for $p>2$ and $\lambda_{2}^{\prime}(\pm 1)=1/2$ , we first confirm by sieving as in (4.2) that

[TABLE]

where $\mathcal{N}^{Z}=\{n\in\mathbb{N}:p\mid n\,\Rightarrow\,p>Z\}$ and

[TABLE]

Lemmata 4.3–4.6 are valid for all values of $c$ . In the rest of §4.1, we only need to replace $m_{Q}(c)$ with $m_{Q}(\bm{\epsilon}_{Z})$ and all sums over $c\in\mathcal{D}_{Q}$ by $c\in\mathcal{D}_{Q,\bm{\epsilon}_{Z}}$ , in particular when replacing the modified moment $\widetilde{\mathcal{M}_{Q}}(\bm{t};\bm{m},\bm{n})$ defined in (4.5) with the analogously defined moment $\widetilde{\mathcal{M}_{Q,\bm{\epsilon}_{Z}}}(\bm{t};\bm{m},\bm{n})$ ; with these changes, the rest of §4.1 is valid verbatim (with even the implied constants independent of $(\bm{\epsilon}_{Z})$ ). In place of (4.14) we obtain

[TABLE]

Adjusting the proof of Lemma 4.9 as in (7.4), we find that for $0\neq H=Q^{\textnormal{O}_{Z}(1)}$

[TABLE]

Estimating tails as in (4.15) and (7.1), including

[TABLE]

and keeping in mind the evaluation (7.2), we finally conclude

[TABLE]

We estimate the off-diagonal contributions as in the proof of Lemma 4.10, with the basic estimate after grouping $\vec{\bm{h}}\in\mathcal{H}^{\prime}$ according to the values of $|H(\vec{\bm{h}})^{Z}|\neq\square$ and $H(\vec{\bm{h}})_{Z}$ with $|H(\vec{\bm{h}})|\leqslant Q^{\ast}$ being

[TABLE]

The rest of the argument proceeds completely analogously, additionally restricting the sieving variables to $\alpha,\delta\in\mathcal{N}^{Z}$ , and using the Pólya–Vinogradov inequality for non-principal characters $(dd_{Z}/c)$ (for various $d_{Z}\mid\prod_{p\leqslant Z}p$ ) of conductor $\asymp_{Z}d$ to additionally detect congruence conditions $(p/c)=\epsilon_{p}\neq 0$ , and we obtain

[TABLE]

Putting everything together completes the proof of (7.3) and along with it the convergence of $(G_{Q,\bm{\epsilon}_{Z}})\to G^{\ast}_{\bm{\epsilon}_{Z}}$ in the sense of finite distributions as $Q\to\infty$ .

Moreover, the sequence of $C^{0}([0,1],\mathbb{C})$ -valued random variables $(G_{Q,\bm{\epsilon}_{Z}})$ is tight at $Q\to\infty$ by Kolmogorov’s Tightness Criterion, because in light of (7.4) we have in the situation of Proposition 5.1 that a fortiori

[TABLE]

(Here, we profit from the fact that $|\mathcal{D}_{Q,\bm{\epsilon}_{Z}}|\asymp_{Z}|\mathcal{D}_{Q}|$ to execute this bootstrap argument, but, alternatively, it takes only minimal changes to sequentially adapt the arguments of section 5 to the family $\mathcal{D}_{Q,\bm{\epsilon}_{Z}}$ , with implied constants depending on $Z$ .) As in §5.3, using Prokhorov’s Criterion we conclude that, indeed, $(G_{Q,\bm{\epsilon}_{Z}})\to G^{\ast}_{\bm{\epsilon}_{Z}}$ in law as $Q\to\infty$ . ∎

Proof of Theorem 1.2 (2).

The deterministic Fourier series $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ converges absolutely and uniformly by comparison with the absolutely convergent series $\sum_{n\in\mathcal{N}_{\bm{\epsilon}_{Z}}}(1/n)\leqslant\prod_{p\in\mathcal{P}_{\bm{\epsilon}_{Z}}}(1+1/p)$ ; along with its individual summands, its sum is therefore also a continuous function. The remaining statements about the everywhere dense set of points $t_{0}\in[0,1]\cap\mathbb{Q}$ at which the path $G^{\sharp}_{\bm{\epsilon}_{Z}}(t)$ (for $\bm{\epsilon}_{Z}$ not identically zero) has a cusp follow from Proposition 6.5. ∎

Proof of Theorem 1.2 (3).

We begin by recalling the beginning of the proof of Lemma 3.5, where we established that all results of §3.1, including crucially Proposition 3.2 and Lemma 3.4, remain valid for

[TABLE]

and $R^{\ast}_{y_{1},y_{2}}(t)=S_{y_{2}}^{\ast}(t)-S^{\ast}_{y_{1}}(t)$ . Further, denote

[TABLE]

As already discussed in the proof of item (1), for $y_{1}\geqslant\max(y_{0}(\delta),Z)$ , Proposition 3.2 is literally unchanged when replacing $X_{n}$ by $X_{n,\bm{\epsilon}_{Z}}$ because $X_{n}=X_{n,\bm{\epsilon}_{Z}}$ whenever $P^{-}(n)>Z$ ; the same is true for the statement of Lemma 3.4, because the proof only additionally requires that $|X_{n}|,|X_{n,\bm{\epsilon}_{Z}}|\leqslant 1$ for $P^{+}(n)\leqslant Z$ . Hence Proposition 3.1 and its proof remain valid for $S^{\ast}_{y,\bm{\epsilon}_{Z}}$ as well.

As in (3.8), we have that

[TABLE]

where we have already verified that the limit converges almost surely. Now, as in (3.15), for $y_{1}\geqslant\max(y_{0}(\delta),Z)$ we have that

[TABLE]

Using this for $y_{1}=Z\geqslant y_{0}(\delta)$ , we have that outside an event of probability $\ll\exp(-\delta^{2}y_{1}^{1/7})$ , $\|S^{\ast}_{y_{2},\bm{\epsilon}_{Z}}-S^{\ast}_{Z,\bm{\epsilon}_{Z}}\|_{\infty}\leqslant\delta$ for all $y_{2}>Z$ ; taking limits as $y_{2}\to\infty$ and invoking (7.5) we conclude that

[TABLE]

Finally, for any $Z\geqslant y_{0}(\delta)$ , we may consider the bounded continuous function $\varphi:C^{0}([0,1],\mathbb{C})\to\mathbb{C}$ defined by

[TABLE]

Since $(G_{Q,\bm{\epsilon}_{Z}})\to G^{\ast}_{\bm{\epsilon}_{Z}}$ in law as $Q\to\infty$ , for sufficiently large $Q\geqslant Q_{0}(\delta,\varepsilon,\bm{\epsilon})$ we have that

[TABLE]

Since $\|G^{\ast}_{\bm{\epsilon}_{Z}}-G^{\sharp}_{\bm{\epsilon}_{Z}}\|_{\infty}\leqslant\delta$ outside an event of probability $\ll\exp(-\delta^{2}Z^{1/7})$ , we have using Chebyshev’s inequality that, for all $Q\geqslant Q_{0}(\delta,\varepsilon,\bm{\epsilon})$ ,

[TABLE]

Therefore

[TABLE]

for all sufficiently large $Z\geqslant Z_{1}(\delta)$ . This completes the proof. ∎

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Bak 77] A. Baker, The theory of linear forms in logarithms , Transcendence theory: advances and applications (Proc. Conf., Univ. Cambridge, Cambridge, 1976), Academic Press, London-New York, 1977, pp. 1–27. MR 498417
2[BGGK 18] Jonathan Bober, Leo Goldmakher, Andrew Granville, and Dimitris Koukoulopoulos, The frequency and the structure of large character sums , J. Eur. Math. Soc. (JEMS) 20 (2018), no. 7, 1759–1818. MR 3807313
3[Cha 14] Mei-Chu Chang, Short character sums for composite moduli , J. Anal. Math. 123 (2014), 1–33. MR 3233573
4[GS 03] A. Granville and K. Soundararajan, The distribution of values of L ( 1 , χ d ) L(1,\chi_{d}) , Geom. Funct. Anal. 13 (2003), no. 5, 992–1028. MR 2024414
5[HB 95] D. R. Heath-Brown, A mean value estimate for real character sums , Acta Arith. 72 (1995), no. 3, 235–275. MR 1347489
6[HBP 15] D. R. Heath-Brown and L. B. Pierce, Burgess bounds for short mixed character sums , J. Lond. Math. Soc. (2) 91 (2015), no. 3, 693–708. MR 3355121
7[HL 24] Ayesha Hussain and Youness Lamzouri, The limiting distribution of Legendre paths , J. Éc. polytech. Math. 11 (2024), 589–611. MR 4767013
8[Hus 22] Ayesha Hussain, The limiting distribution of character sums , Int. Math. Res. Not. IMRN (2022), no. 20, 16292–16326. MR 4498175

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The shape of quadratic Gauss paths

Abstract.

Key words and phrases:

2020 Mathematics Subject Classification:

Contents

1. Introduction

1.1. Gauss paths

1.2. Limiting distribution

Theorem 1.1**.**

1.3. Convergence in probability and the atlas of shapes

Theorem 1.2**.**

1.4. Acknowledgement

1.5. Organization of the paper

1.6. Notation

2. Preliminaries

2.1. Probability in Banach Spaces

Definition 2.1** (Convergence of Random Variables).**

Proposition 2.2** (Method of moments, [Kow21, Theorem B.5.5(2)]).**

Proposition 2.3** (Prokhorov’s Criterion).**

Proposition 2.4** (Kolmogorov’s Tightness Criterion).**

2.2. Character sums and quadratic large sieve

Proposition 2.5** ([HB95, Theorem 1]).**

Proposition 2.6** ([IK04, Theorem 12.6]).**

Proposition 2.7** ([Cha14, Theorem 1’, special case]).**

2.3. Linear forms in logarithms

Proposition 2.8** ([Bak77, Theorem 1]).**

3. The limiting random variable

Proposition 3.1**.**

3.1. Arithmetic convergence

Proposition 3.2**.**

Lemma 3.3** ([BGGK18, Lemma 5.4]).**

Proof of Proposition 3.2.

Lemma 3.4**.**

Proof.

Proof.

3.2. Properties of the limiting random variable

Lemma 3.5**.**

Proof.

4. Computing the moments

Proposition 4.1**.**

Corollary 4.2**.**

4.1. Reduction steps

Lemma 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

Lemma 4.5**.**

Proof.

Lemma 4.6**.**

Proof.

Lemma 4.7**.**

Proof.

Lemma 4.8**.**

Proof.

4.2. Isolating the main term and proofs of main results

Lemma 4.9**.**

Lemma 4.10**.**

Proof of Proposition 4.1.

Proof of Corollary 4.2.

4.3. Square and non-square contributions

Proof of Lemma 4.9.

Proof of Lemma 4.10.

5. Convergence in law

Proposition 5.1**.**

5.1. Preparatory lemmata

Lemma 5.2**.**

Proof.

Lemma 5.3**.**

Proof.

Lemma 5.4**.**

Proof.

5.2. Estimates according to ranges

Lemma 5.5**.**

Theorem 1.1.

Theorem 1.2.

Definition 2.1 (Convergence of Random Variables).

Proposition 2.2 (Method of moments, [Kow21, Theorem B.5.5(2)]).

Proposition 2.3 (Prokhorov’s Criterion).

Proposition 2.4 (Kolmogorov’s Tightness Criterion).

Proposition 2.5 ([HB95, Theorem 1]).

Proposition 2.6 ([IK04, Theorem 12.6]).

Proposition 2.7 ([Cha14, Theorem 1’, special case]).

Proposition 2.8 ([Bak77, Theorem 1]).

Proposition 3.1.

Proposition 3.2.

Lemma 3.3 ([BGGK18, Lemma 5.4]).

Lemma 3.4.

Lemma 3.5.

Proposition 4.1.

Corollary 4.2.

Lemma 4.3.

Lemma 4.4.

Lemma 4.5.

Lemma 4.6.

Lemma 4.7.

Lemma 4.8.

Lemma 4.9.

Lemma 4.10.

Proposition 5.1.

Lemma 5.2.

Lemma 5.3.

Lemma 5.4.

Lemma 5.5.

Lemma 5.6.

Lemma 5.7.

Lemma 5.8.

Corollary 5.9.

Corollary 5.10.

Lemma 6.1.

Lemma 6.2.

Proposition 6.3.

Lemma 6.4.

Proposition 6.5.

*Remark 1**.*

*Remark 2**.*