The shape of quadratic Gauss paths
Justine Dell, Djordje Mili\'cevi\'c

TL;DR
This paper studies the distribution of quadratic Gauss paths, showing they converge to a specific random Fourier series and providing a classification of their limiting shapes, explaining their notable visual features.
Contribution
It introduces a new probabilistic description of quadratic Gauss paths and characterizes their limiting behavior as the parameter grows large.
Findings
Quadratic Gauss paths converge in law to a random Fourier series.
The paper provides a classification of the limiting shapes of these paths.
It establishes convergence in probability for the ensemble of paths.
Abstract
We consider the distribution of quadratic Gauss paths, polygonal paths joining partial sums of quadratic Gauss sums to square-free fundamental discriminant moduli in a dyadic range [Q,2Q]. We prove that this striking ensemble converges in law, as Q->\infty, to a random Fourier series we explicitly describe, and we prove a convergence in probability result and a classification result for the limiting shapes that explain the visually remarkable properties of these Gauss paths.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Analytic Number Theory Research · Geometry and complex manifolds
The shape of quadratic Gauss paths
Justine Dell
University of California San Diego (UCSD), Department of Mathematics, 9500 Gilman Drive #0112, La Jolla, CA 92093, USA
and
Djordje Milićević
Bryn Mawr College, Department of Mathematics, 101 North Merion Avenue, Bryn Mawr, PA 19010, USA Institute for Advanced Study, 1 Einstein Drive, Princeton, NJ 08540, USA [email protected]
Abstract.
We consider the distribution of quadratic Gauss paths, polygonal paths joining partial sums of quadratic Gauss sums to square-free fundamental discriminant moduli in a dyadic range . We prove that this striking ensemble converges in law, as , to a random Fourier series we explicitly describe, and we prove a convergence in probability result and a classification result for the limiting shapes that explain the visually remarkable properties of these Gauss paths.
Key words and phrases:
Legendre symbol, Gauss sums, short character sums, random Fourier series, probability in Banach spaces, shapes of exponential sum paths
2020 Mathematics Subject Classification:
11L05 Primary, 11L40, 11N64, 60F17, 60G17, 60G50 Secondary
Research supported in part by the National Science Foundation Grant DMS-1903301, the Simons Foundation Award MPS-TSM-00008085, and by the Charles Simonyi Endowment (D.M.).
Contents
- 1 Introduction
- 2 Preliminaries
- 3 The limiting random variable
- 4 Computing the moments
- 5 Convergence in law
- 6 The atlas of shapes
- 7 Classifying quadratic Gauss paths
1. Introduction
1.1. Gauss paths
Cancellation in exponential sums is a major player across analytic number theory. A fascinating insight into its chaotic formation is provided by polygonal paths joining the consecutive partial sums, which have been studied at least since the work of Lehmer [Leh76] and Loxton [Lox83, Lox85] on exponential sums with quadratic and other analytically defined smoothly varying phases. The pioneering work of Kowalski–Sawin [KS16] and subsequent papers including [RR18, RRS20, MZ23, Hus22, HL24] extended this paradigm and studied the distribution of paths arising from oscillatory sums of arithmetic origin, such as Kloosterman and character sums.
A natural family of character paths arises from the normalized quadratic Gauss sums to square-free fundamental discriminant moduli:
[TABLE]
We term the polygonal paths joining the partial sums of Gauss paths. Having perhaps been conditioned to expect arithmetically defined sums such as Kloosterman sums to exhibit fractal-like behavior seen in Figure 1, the pictures of several (mostly randomly chosen) Gauss paths to large moduli shown in Figure 2 might take one for a surprise.
Pictures don’t lie, of course, and provide additional food for thought when, after even just a modest amount of experimentation, stunningly similar pictures start showing up; see Figure 3.
What is going on here? We can observe very long stretches in which the Legendre symbol \big{(}\frac{m}{c}\big{)} has a definite (statistical) preference for one of the signs over the other, and it is easy to believe that this behavior is guided by falling in certain residue classes to some small moduli. But why would only a few small moduli seemingly matter, and what is the deal with the sharp reversals?
1.2. Limiting distribution
Let denote the set of positive, square-free integers for which . We now formally define the quadratic Gauss path for . For for some , we let , where is the th partial sum of (1.1). For (), we obtain by linearly interpolating between and . Then, is a continuous function which maps .
For every , we may consider the sample space with the uniform probability measure , and the map
[TABLE]
can be viewed as a -valued random variable on this probability space. We collect some relevant background on probability in Banach spaces in §2.1 for convenient reference. Note that ; see (4.2).
Our first main result, Theorem 1.1, establishes that the random variables converge in law, as , to a specific -valued random variable , which we are about to describe. It incorporates completely multiplicative random variables , which are the same as those used in the probabilistic model for the Jacobi symbols (as ranges through all fundamental discriminants) described in [GS03]. For every odd prime , we let be the random variable which takes the value 0 with probability and takes the values 1 and each with probability ; in other words, it is the identity random variable on the sample space equipped with the measure defined by
[TABLE]
For , we let be the usual Bernoulli random variable; that is, the identity random variable on with the measure
[TABLE]
Let be a sequence of independent random variables of laws , and let be completely multiplicative random variables defined, for , as
[TABLE]
Theorem 1.1**.**
Let be a completely multiplicative sequence of random variables of law given by (1.2)–(1.4).
- (1)
The random Fourier series
[TABLE]
converges almost surely to a continuous function and so defines a -valued random variable . 2. (2)
The sequence of random variables converges in law to as .
1.3. Convergence in probability and the atlas of shapes
We now turn our attention to the visually observed sensitivity of the Gauss paths on congruence properties of to small moduli. To this end, for a parameter , and let denote any fixed choice of over primes (with ).
Now, on the one hand, we may consider the sample space
[TABLE]
equipped with the uniform probability measure and the -valued random variable ; in other words, the random variable is obtained from by conditioning on the event that for all . On the other hand, letting be a sequence of independent random variables of laws as in (1.2), we may consider the sequence of completely multiplicative random variables defined as
[TABLE]
In other words, the random variables are obtained as in (1.4), but changing the law of for to the delta mass on . Then we may consider the random Fourier series
[TABLE]
and the deterministic Fourier series
[TABLE]
The series defining is akin to the classical everywhere continuous, nowhere differentiable Weierstrass function. A small sample consisting of all possible shapes is shown in Figure 4. The following Theorem 1.2 formalizes the empirical observation that Gauss paths tend to be strongly aligned close to the shapes in the ensemble of the deterministic paths (according to as in (1.6) for an increasingly large ), which we may correspondingly term the atlas of shapes of quadratic Gauss paths.
Theorem 1.2**.**
Let , and let denote any fixed choice of over primes , with .
- (1)
The random Fourier series defined in (1.7) converges a.s. to a continuous function and defines a -valued random variable . Moreover, the sequence of random variables defined in (1.6) converges in law to as . 2. (2)
The deterministic Fourier series converges absolutely and uniformly to a continuous function. If for at least two , this deterministic path satisfies
[TABLE]
on an everywhere dense set of points satisfying (6.7) and described in Proposition 6.5. If for exactly one , the same holds for satisfying the additional condition (6.27); see Remark 1. 3. (3)
For every ,
[TABLE]
as well as
[TABLE]
Our argument in fact provides an explicit upper bound for the exceptional probabilities in item (3) (after in (1.9)) of size for (see (7.6) and (7.7)). We chose not to optimize this rate, which is already quite rapidly decreasing in and thus suggests an explanation for why the experimentally observed shapes (which of course are just a finite and presumably random sample) appear to fall into very few classes dictated by congruence classes modulo several small primes. It bears emphasizing that Theorem 1.2 does not claim that the paths must be uniformly close to for all , nor can such a statement be true. A path will be substantially away from as long as the values of over exhibit a significant bias, and the same is true for a sample of the random path if the corresponding sample of is biased. The point is that such an event is rare. Figures 5 and 6 illustrate Theorem 1.2 and these points. We also point the reader to Remarks 1 and 2 and the accompanying Figures 7 and 8, which illustrate some delicate phenomena for limiting paths when all (or all but one) .
1.4. Acknowledgement
This paper grew out of the first author’s 2022 thesis, advised by the second author. We were both inspired by and indebted to the existing literature on Kloosterman paths [KS16, MZ23] and on character paths in the unitary ensemble [Hus22]. Some of the ingredients in establishing the convergence in law also appear in the work of Hussain and Lamzouri on Legendre paths [HL24], although our main focus on the striking sharp reversals and congruence-guided classification of Gauss paths is very different.
1.5. Organization of the paper
We collect some preliminaries on probability theory and estimates on character sums in section 2. The convergence and properties of the limiting random variable, including the proof of Theorem 1.1 (1) is studied in section 3. We establish convergence in the sense of finite distributions of using the method of moments in section 4, and then prove the convergence in law claim of Theorem 1.1(2) in section 5. In section 6, we describe the local asymptotics of the limiting shapes at rational points and then produce a collection of rational points at which exhibits a cusp. Finally, we prove Theorem 1.2 in section 7.
1.6. Notation
As is common in analytic number theory, for we write . We write or (using the notations interchangeably) if there exists a constant , independent of all parameters not explicitly indicated with a subscript, such that . We also write if and , and if , where the direction of the limit is either indicated or clear from the context. We denote by a positive value, which may differ from line to line but may in each case be taken to be as small as desired.
2. Preliminaries
2.1. Probability in Banach Spaces
In this section, we collect some definitions and facts pertaining to probability in Banach spaces.
For general facts about Banach spaces and probability, we refer to [LQ18, Preliminary Chapter, Chapters 1 and 4]. In particular, for an arbitrary probability space , a separable Banach space , and the Borel -algebra on , an -valued random variable is a map that is –-measurable (in other words, for any , ).
We will be particularly interested in the separable Banach space of complex-valued continuous functions on equipped with the sup-norm. We can define several notions of convergence of random variables on this space. We closely follow the exposition in [RR18, Appendix A] and [MZ23, §2.1], and refer to [Kow21, §B.11] for proofs.
Definition 2.1** (Convergence of Random Variables).**
Let be a Banach space and be a sequence of -valued random variables on the probability spaces . Let be an -valued random variable on the probability space .
- (1)
If each , we say that converges to almost surely if . 2. (2)
We say that converges in law to if for every continuous and bounded map , the sequence converges to . 3. (3)
If , we say that converges to in the sense of finite distributions if, for all and all -tuples where , the sequence of -valued random vectors converges in law to the random vector . 4. (4)
If is separable, we say that the sequence is tight if, for all , there exists a compact subset such that, for all , .
Here, when and , we denote by the complex-valued random variable which is the evaluation of the random function at the point , that is, , where is the evaluation map.
In order to prove convergence in the sense of finite distributions, we can use the method of moments. Recall that, for a -valued random vector and any -tuples and in , the complex moments of are defined as
[TABLE]
The random variable is said to be mild if there exists a such that the power series
[TABLE]
converges in the disk . A real-valued random variable is said to be -sub-Gaussian if, for every , , and a complex-valued is said to be -sub-Gaussian if and are. The sum of two independent -sub-Gaussian complex-valued random variables is -sub-Gaussian. Moreover, a -valued random variable whose components are all sub-Gaussian is automatically mild. See [Kow21, §B.5,§B.8] for details.
Proposition 2.2** (Method of moments, [Kow21, Theorem B.5.5(2)]).**
Let be a sequence of -valued random vectors, and let be a mild -valued random vector. If, for any two -tuples , their corresponding complex moments and satisfy
[TABLE]
then converges to in law.
The notion of tightness provides an effective way to upgrade the convergence in the sense of finite distributions of a sequence of -valued random variables to convergence in law via the following two propositions.
Proposition 2.3** (Prokhorov’s Criterion).**
Let and be -valued random variables. If the sequence is tight and converges to in the sense of finite distributions, then converges to in law.
Proposition 2.4** (Kolmogorov’s Tightness Criterion).**
Let be a sequence of -valued random variables. If there exist such that for all and , we have
[TABLE]
then is tight.
2.2. Character sums and quadratic large sieve
In this section, we recall several ingredients from classical analytic number theory. One of them is Heath-Brown’s quadratic large-sieve inequality.
Proposition 2.5** ([HB95, Theorem 1]).**
For any and ,
[TABLE]
with denoting the sum over positive odd square-free values.
For our proof of tightness, we will also require the and cases of Burgess’ classical bound for short character sums. Here we note that what we really require of the second bound is that it is nontrivial in the range for some small .
Proposition 2.6** ([IK04, Theorem 12.6]).**
For every primitive character of any conductor and any and ,
[TABLE]
Proposition 2.7** ([Cha14, Theorem 1’, special case]).**
For every , there exists a constant such that, for every , every multiplicative character to any square-free modulus , and every interval of size ,
[TABLE]
In fact, [Cha14, Theorem 1’] allows an arbitrary degree polynomial phase, explicates the -dependence in the implied constant, and implies that any ( an absolute constant) is admissible. The above estimate is all we need since (that is, ) will eventually be fixed. We note that more precise versions of Proposition 2.7 are available in [Ker14, HBP15] in various degrees of generality of and .
2.3. Linear forms in logarithms
In our investigation of the limiting shapes of Gauss sums in section 6, we will make use of the following classical and powerful theorem of Baker on linear forms in logarithms.
Proposition 2.8** ([Bak77, Theorem 1]).**
For every finite set of distinct primes , there exists an (effectively computable) constant depending on only such that, for every ,
[TABLE]
3. The limiting random variable
In this section, we establish the properties of the random Fourier series defined by (1.5), where we recall that is a completely multiplicative sequence of random variables of law given by (1.4) and (1.2). We often use to denote a particular value of and to denote any single infinite choice of for each prime (with ) and of the accompanying multiplicatively determined ().
Let and denote the smallest and largest prime divisors of , respectively. Following [Hus22, §2], we first consider the limit
[TABLE]
Of course, this has the same summands as (1.5), but in a different order that is more strongly attuned to the multiplicative nature of the random coefficients . In §3.2, we will show that this is in fact equivalent to defining as the limit of partial sums, as in (1.5), and prove some important properties of this random variable. In the first subsection §3.1, we focus on proving the following result.
Proposition 3.1**.**
Let be multiplicative random variables, with for each prime independent and distributed according to the probability measure defined in (1.2). Then, the random Fourier series in (3.1) is almost surely the Fourier series of a continuous function.
3.1. Arithmetic convergence
In this section, we study in detail the convergence of the random series in (3.1) and prove Proposition 3.1.
We denote, for ,
[TABLE]
and, for every subset and ,
[TABLE]
as well as
[TABLE]
The following result an analog of [BGGK18, Proposition 5.2], with a number of details adjusted to suit our case.
Proposition 3.2**.**
Let be an integer and let be real numbers. With notations as in (3.2) and (3.3), we have
[TABLE]
For positive integers , we define
[TABLE]
Our proof of Proposition 3.2 makes use of the following lemma to control sums over rough integers.
Lemma 3.3** ([BGGK18, Lemma 5.4]).**
Let , and let be an integer. For and , we have
[TABLE]
Proof of Proposition 3.2.
We begin by noting that the series defining converges comfortably (and uniformly across all samples of the random coefficients ) and open with a simple decomposition estimate
[TABLE]
Using Hölder’s Inequality with , , , and gives
[TABLE]
via a short calculation with integral comparison. Thus, in order to find a bound for , it suffices to first bound for .
Note that, for any , we may write
[TABLE]
Taking and applying convexity of , we obtain
[TABLE]
Therefore, it suffices to bound
[TABLE]
Denoting
[TABLE]
and expanding, equals
[TABLE]
Notice that when is not a square and in any case. Moreover, in (3.6), the terms which will survive are those for which is a square, and for and squarefree. Finally, note that . Therefore, we have
[TABLE]
Since , we further obtain
[TABLE]
by using the elementary inequality and dropping various conditions. We then use Lemma 3.3 with to get, for ,
[TABLE]
Plugging this into (3.5) we get
[TABLE]
since . Returning to (3.4), we have
[TABLE]
Keeping in mind that since , , and so
[TABLE]
We now prove that defined in (3.1) is almost surely the Fourier series of a continuous function. We use similar methods as in [Hus22, §2]. We may rewrite every sample as
[TABLE]
Since the latter series converges absolutely and uniformly, it converges to a continuous function. Hence it suffices to show that
[TABLE]
converges almost surely to a continuous function. We remark that this passage is, in fact, valid in any order of summation.
Define
[TABLE]
Then it is clear, by comparison with the absolutely convergent series , that, for every fixed , every sample converges absolutely and uniformly to a continuous function. Since defines a continuous function for any and any choice of , it suffices to show that the sequence almost surely converges uniformly, as this will allow us to conclude that is almost surely a continuous function. We do this using Cauchy’s Criterion for uniform convergence.
Define
[TABLE]
The series defining converges absolutely and uniformly and may therefore be rearranged at will. Using the multiplicativity of the ’s we may thus write
[TABLE]
The following lemma is an analog of [Hus22, Lemma 2.1], whose proof we closely follow.
Lemma 3.4**.**
For every , there exists a such that for every , we have
[TABLE]
Proof.
As before, we let be a value of the random variable , and let denote the choice of a value for each . Notice that
[TABLE]
where is defined as in (3.3). Thus,
[TABLE]
for some absolute constant , by the classical evaluation in [MV07, Theorem 2.7]. We also recall that, by Proposition 3.2, we have for and the bound
[TABLE]
Let , and let
[TABLE]
Then, for suitably sufficiently large , the conditions , and are satisfied, hence
[TABLE]
Thus, we have
[TABLE]
for sufficiently large (adjusting the value of if needed). ∎
We now prove Proposition 3.1.
Proof.
First, we claim that, for every ,
[TABLE]
holds for all sufficiently large . Indeed, let be the constant provided by Lemma 3.4. Let , , and, for , let
[TABLE]
This choice ensures that and , and hence by Lemma 3.4
[TABLE]
From this it follows that, for every sufficiently large ,
[TABLE]
as claimed.
Now, let be arbitrary. The bound (3.15) combined with shows that
[TABLE]
By the Borel–Cantelli Lemma (see, for example, [LQ18, Proposition I.2]), this implies that almost surely only finitely many events on the left-hand side occur; in other words, almost surely
[TABLE]
holds for all sufficiently large . But this means exactly that the sequence almost surely converges uniformly. Since each function is continuous, their a.s. uniform limit in (3.1) is also a.s. a continuous function. ∎
3.2. Properties of the limiting random variable
We defined in (3.1) as the limit of sum over . Due to the uniform convergence we just established, defines almost surely a continuous function such that the th Fourier coefficient of (for ) is precisely .
Now, it is true that, for every such that , the partial sums of its Fourier series converge uniformly to . For completeness, we reproduce the argument, which we learned from Ullrich [Ull18]. Fix for now an arbitrary , let be a “trapezoidal” bump function satisfying , for and for , and let be its Fourier transform. For , consider the Schwartz class function and its periodization defined by . On the one hand, using the uniform continuity of , the integrability of , and unfolding, we have that
[TABLE]
for sufficiently small and then sufficiently large , uniformly in . On the other hand, is the trigonometric polynomial , and so by the trivial bounds we have that
[TABLE]
uniformly in . Thus for sufficiently large , which precisely establishes that on .
Using the just established fact, the almost surely continuous function equals a.s. the limit of the usual partial sums as in (1.5) and the two definitions will be equivalent, that is,
[TABLE]
Therefore, we use these definitions interchangeably throughout the paper.
Now, let denote the probability space underlying the sequence of independent random variables; that is, equipped with the measure appearing in (1.2) for odd prime , and with as in (1.2). Then, for every we have
[TABLE]
We also formally set .
The a.s. convergent series defines a -valued random variable on . The main result of this section, the following Lemma 3.5, shows that, for arbitrary fixed and , the -valued random variable has complex moments of all orders :
[TABLE]
and provides an exact evaluation for these orders. To state the result precisely, the following notations are convenient. Let denote the set of all -tuples of vectors , with each individual . For every and , define
[TABLE]
where, for every and ,
[TABLE]
Then we have the following result.
Lemma 3.5**.**
For every , the random variable . Moreover, for every and every , the -valued random variable has complex moments as in (3.18) of all orders , given by the absolutely convergent sum
[TABLE]
with and as in (3.17) and (3.19). These moments satisfy for a suitable absolute , and so is a mild random variable.
Proof.
Returning to (3.8), we denote
[TABLE]
with the terms corresponding to formally interpreted as . In this paragraph, we verify that all results of the previous section remain valid with , , and analogously defined in place of , and . Indeed, for any choice of , we may repeat the full proof of Proposition 3.2 with the quantities
[TABLE]
instead of (3.3), with the only substantive change in the proof being that needs to be replaced with
[TABLE]
This satisfies the estimate , which is all that is needed for the proof. As in (3.10), converges absolutely and uniformly to a continuous function. In place of (3.12), we have the decomposition
[TABLE]
Therefore, denoting , Lemma 3.4 remains valid for as stated, with the key estimate (3.13) in the proof replaced by
[TABLE]
for which the newly adjusted Proposition 3.2 provides , and the rest of the proof is unchanged.
Using the estimate (3.15) and the fact that (uniformly in , as ) almost surely, we conclude that
[TABLE]
holds for all sufficiently large . Letting
[TABLE]
we have that form a (non-strictly) increasing sequence of events with , so that a.s. In particular, for every fixed , we have by the Monotone Convergence Theorem for every
[TABLE]
Now, for every , we have by rapid convergence
[TABLE]
Now, for every prime , we have by a simple combinatorial argument that , and so
[TABLE]
whence
[TABLE]
Inserting this into (3.21) and (3.20) completes the proof of
[TABLE]
where the values of are covered by an interpolation argument.
We also claim that in . We proceed by a similar argument. Let and be arbitrary, and denote
[TABLE]
Then the same argument using (3.15) as above shows that form a (non-strictly) increasing sequence of events with , so that a.s. as , and thus for every
[TABLE]
Now, for we have, arguing as in (3.21) and below, that
[TABLE]
Inserting this into the previous estimate, we conclude that
[TABLE]
Executing the limits as and (in either order), we conclude that indeed
[TABLE]
The same claim is true for all real values by interpolation.
Finally, we turn to the complex moments , which are all finite by Hölder’s inequality. By writing () and expanding the products and complex conjugates, we may write
[TABLE]
Since , in fact with uniformly in in light of (3.21), taking expectations on both sides, factoring the differences of powers, and applying Hölder’s inequality and (3.25), we conclude that
[TABLE]
But this final expectation is straightforward to evaluate; indeed, denoting by the set of all tuples with and each , and ,
[TABLE]
Since, denoting , we comfortably have absolute convergence
[TABLE]
this implies the announced evaluation
[TABLE]
4. Computing the moments
In this section, we compute asymptotically the complex moments of , which are given by
[TABLE]
where is a positive integer, is a -tuple in , and and are -tuples of non-negative integers. Additionally, we denote and . Specifically we will prove the following evaluation.
Proposition 4.1**.**
For every positive integer and all -tuples and , the complex moments of in (4.1) satisfy
[TABLE]
where are the corresponding complex moments of in (3.18).
Corollary 4.2**.**
The sequence of -valued random variables converges in the sense of finite distributions to as .
As a preliminary step, we compute by simple sieving that
[TABLE]
so that the uniform probability measure on satisfies
[TABLE]
4.1. Reduction steps
We also consider slightly different functions defined by
[TABLE]
The functions are discontinuous but they agree with the Gauss paths at the points and (as we will quickly see) stay very close to them, while being technically easier to work with. Indeed, we have the following expansion.
Lemma 4.3**.**
We have
[TABLE]
and
[TABLE]
Proof.
This follows by the completion method, analogously to the case of Kloosterman paths [KS16, RR18, MZ23] and discussed in some detail in [Kow21, Chapter 6]. Indeed, the Parseval identity for the discrete Fourier transform and (where ) gives
[TABLE]
and we compute
[TABLE]
This proves the first identity. Now, inserting the classical bound for (see, for example, [Mon94, Chapter 3])
[TABLE]
where is the distance between and the nearest integer, and bounding Gauss sums individually, we get
[TABLE]
For every we may consider the complex moments of given by
[TABLE]
That the moments and are very close will follow directly from the following lemma.
Lemma 4.4**.**
We have
[TABLE]
Proof.
We first note that
[TABLE]
We now use the bounds
[TABLE]
The first of these follows immediately from the definitions of and , and the second one follows from the first one and Lemma 4.3. Using these bounds we conclude that
[TABLE]
Consequently, we have for every choice of with
[TABLE]
and the claim follows. ∎
Having proven Lemma 4.4, it is clear that
[TABLE]
so that we may from now on focus on an asymptotic evaluation of .
We first rewrite the first conclusion of Lemma 4.3 as
[TABLE]
Inserting the expansion (4.8) into the definition (4.5) and expanding, we write
[TABLE]
Here, ranges over all -tuples , where each , and
[TABLE]
Now, write for the set of all such -tuples and denote
[TABLE]
recalling also the notations and for every from (3.19). Recalling that G(1-h_{j,\ell},c)=\big{(}\frac{1-h_{j,\ell}}{c}\big{)}\sqrt{c} and using multiplicativity of Jacobi symbols, we get
[TABLE]
Next, we show that may be replaced with , an expression independent of , at the cost of a negligible error. We begin with the following elementary lemma, for an independent proof of which we refer to [KS16, Section 2] (where the condition that is a prime is clearly immaterial). We will provide a different argument, which also sets the stage for the proof of the next Lemma 4.6.
Lemma 4.5**.**
For , we have
[TABLE]
Proof.
We may write
[TABLE]
where
[TABLE]
Now, if , the statement of the lemma is trivially true. Otherwise, noting that does not depend on , and inserting the classical bound (4.4), we conclude that, for ,
[TABLE]
Lemma 4.6**.**
For and ,
[TABLE]
Proof.
On the one hand, by completion (that is essentially by the Pólya–Vinogradov inequality), we have as in the proof of Lemma 4.3 for every , the bound
[TABLE]
On the other hand, we may insert the representation for from the proof of Lemma 4.5, noting that, moreover, and define functions of a continuous real variable that satisfy
[TABLE]
Moreover, denoting , we may write
[TABLE]
Using summation by parts, this leads to
[TABLE]
The contributions from are estimated analogously, and the term is trivially admissible. ∎
This leads to the following.
Lemma 4.7**.**
We have
[TABLE]
Proof.
Let be the set of all pairs such that and . By writing and expanding the product, may write
[TABLE]
where
[TABLE]
and similarly for .
Using this expansion, we have that
[TABLE]
In this expression, we bound the sums over corresponding to trivially, using , and those corresponding to using Lemma 4.6. This gives the announced estimate. ∎
We would now like to change the order of summation, so that the -sum is inside of the sums. We now show that the limits of summation can indeed be changed from to the range independent of , while only introducing negligible error.
Lemma 4.8**.**
Let be the set of all -tuples of vectors satisfying . Then,
[TABLE]
Proof.
Arguing as in the proof of Lemma 4.7, we find that, for every ,
[TABLE]
We claim that
[TABLE]
The first of these bounds is trivial from .
For the second bound, we argue analogously as in the proof of Lemma 4.6. Recall the notation and the Pólya–Vinogradov bound for from (4.11), and denote . Then, we may rewrite the contribution of to the second sum as
[TABLE]
where
[TABLE]
Then summation by parts indeed yields
[TABLE]
The terms with are estimated analogously. Putting everything together, summing over all with weights , and invoking Lemma 4.7, we obtain
[TABLE]
Since the range of summation in does not depend on , we may exchange the order of summation, and the lemma follows. ∎
4.2. Isolating the main term and proofs of main results
In view of Lemma 4.8, we may write
[TABLE]
where
[TABLE]
The following subsection will be devoted to the proofs of the following two lemmata.
Lemma 4.9**.**
[TABLE]
Lemma 4.10**.**
[TABLE]
Taking Lemmata 4.9 and 4.10 for granted, we are now ready for the proofs of the main results of this section.
Proof of Proposition 4.1.
Proposition 4.1 follows immediately by combining (4.7), (4.14), and Lemmata 4.9 and 4.10. ∎
Proof of Corollary 4.2.
For every , the -valued random variable is mild by Lemma 3.5. According to Proposition 2.2, the convergence in law of to as may be verified by checking that the corresponding moments satisfy as for every . But this follows immediately (in a strong form) from Proposition 4.1. ∎
4.3. Square and non-square contributions
In this subsection, we prove Lemmata 4.9 and 4.10.
Proof of Lemma 4.9.
Adapting the sieving argument of (4.2), we find that for every ,
[TABLE]
Inserting this into the definition of yields
[TABLE]
We then estimate, arguing as in (3.27) and (3.24),
[TABLE]
as well as
[TABLE]
Putting everything together and invoking the evaluation of from Lemma 3.5 completes the proof. ∎
Proof of Lemma 4.10.
We begin by noting the (direct) upper bound
[TABLE]
Therefore, grouping the terms indexed by according to the values of , we have
[TABLE]
where, by the divisor bound,
[TABLE]
Now, sieving for the conditions and , we find that
[TABLE]
Since is square-free, is a non-principal character of conductor , we find by the Pólya–Vinogradov inequality and straightforward estimates that
[TABLE]
Therefore
[TABLE]
On the other hand, using the Cauchy–Schwarz inequality followed by Heath-Brown’s quadratic large sieve (Proposition 2.5), we find that
[TABLE]
Therefore
[TABLE]
Making the optimal choice we conclude that
[TABLE]
We note that we did not try to optimize the final exponent of power savings in Lemma 4.10 and that the character sums in the proof can surely be treated more delicately. We opted for brevity since the present power savings suffice for us.
5. Convergence in law
The goal of this section is to prove the following statement, which will in turn be used to verify the tightness of the sequence of -valued random variables as .
Proposition 5.1**.**
For every real , there exists a such that for every and every ,
[TABLE]
We will prove Proposition 5.1 in §5.3, along with the corollaries for the tightness and convergence in law of the sequence , after laying the ground work in §5.1 and §5.2. It will be seen that we can, in fact, choose for some two constants , which may in principle be explicated; in particular, any is allowable with a corresponding suitable .
5.1. Preparatory lemmata
The first principal arithmetic input into the proof of tightness is the following lemma, which is a simple variation of the Burgess-like bound for short mixed character sums (Proposition 2.7).
Lemma 5.2**.**
For every , there exists a such that for every primitive character of any square-free conductor and any and every ,
[TABLE]
Proof.
We begin with an application of completion, as in Lemma 4.3, obtaining
[TABLE]
where is the sign of the Gauss sum for the character . Denote the two sums above (including the factor ) by and . On the one hand, by exchanging the order of summation and applying Proposition 2.7 we have
[TABLE]
On the other hand, denoting
[TABLE]
we have by integration by parts that
[TABLE]
Now, the conclusion of Proposition 2.7 implies, for , that
[TABLE]
simply because the complete sum over all is of size at most . This implies that
[TABLE]
The proof is complete. ∎
The following variation of Heath-Brown’s quadratic large sieve is convenient and probably known, but we could not locate a ready reference.
Lemma 5.3**.**
For any , real numbers , and complex numbers ,
[TABLE]
Proof.
Clearly we may assume with out loss of generality that , and then we may also assume that , as the general case follows by splitting into intervals of this form.
Now, for every , consider the collection of intervals of length intersecting . For every , we split the interval into such intervals (at most two for each value of ), using the obvious greedy algorithm, and we estimate the inner sum in (5.1) using the Cauchy–Schwarz inequality.
Now, each specific interval appears in at most such decompositions. For, indeed, for to appear in the decomposition of , one of the points or must appear in the interval centered at the midpoint of and of length ; but this forces to lie in the union and, in particular, determines to within the stated number of choices. Putting this together, we have the estimate
[TABLE]
Estimating the inner double sum using Heath-Brown’s quadratic large sieve (Proposition 2.5), we have that
[TABLE]
This completes the proof. ∎
Lemma 5.3 allows us to prove the following estimate, which will be crucial in our estimates.
Lemma 5.4**.**
For every with , we have
[TABLE]
Proof.
Clearly we may assume that without loss of generality. For , denote
[TABLE]
By Lemma 5.3, we have the estimate
[TABLE]
where the final estimate follows by separately considering the cases and and keeping in mind that .
Now, by summation by parts and the Cauchy–Schwarz inequality, we have
[TABLE]
Moreover, by the integral Minkowski’s inequality,
[TABLE]
Upon applying (5.2) in the two previous displays, we conclude that
[TABLE]
as desired. ∎
5.2. Estimates according to ranges
he following two lemmata are purely analytical in nature.
Lemma 5.5**.**
If and
[TABLE]
then
[TABLE]
Lemma 5.6**.**
If and
[TABLE]
then
[TABLE]
These statements are analogues of [RRS20, Lemma 4.2] and [RRS20, Lemma 4.3] (as well as of the corresponding statements in [MZ23]). The proofs are essentially verbatim, and so we omit them for brevity, the only notable adaptation being the insertion of (4.6) in place of [RRS20, (5)].
Lemma 5.7**.**
For every , there exists a such that, if and if
[TABLE]
for some , then
[TABLE]
Proof.
Without loss of generality, let . By the very definition of ,
[TABLE]
where we incur the harmless error term for no other reason than notational simplicity. Thus, using Lemma 5.6 and recalling the condition that , we first have that
[TABLE]
Using Lemmata 5.2 and 5.4 we conclude that
[TABLE]
The statement of the lemma follows upon inputing the condition , and setting as the value of . ∎
Lemma 5.8**.**
For every even integer , if
[TABLE]
for some , then
[TABLE]
Proof.
First off, in analogy with (4.5), we denote
[TABLE]
In other words, the moment is constructed in exactly the same fashion as , with and , but with in place of each . In yet other words, we have a finite (with length and coefficients depending only on the fixed value of ) expansion
[TABLE]
Applying Lemma 4.8, decomposition (4.14), and Lemma 4.10 to this expansion, we have
[TABLE]
where is the set of all pairs of vectors satisfying , and
[TABLE]
But then a moment’s reflection shows that in fact
[TABLE]
The same result can be arrived at by following the evaluation of in section 4, with and and with in place of each .
Now, the expectation occurring in this evaluation may be further estimated as
[TABLE]
bounding by interpolating between the obvious estimates . The statement of the lemma follows upon inputing the condition and recalling that . ∎
5.3. Proof of tightness
We are now ready for the main proofs of this section.
Proof of Proposition 5.1.
Fix an arbitrary , and let be as in the statement of Lemma 5.7. Applying Lemma 5.5, 5.7, or 5.8 according to the size of , we conclude that, for every even integer and every ,
[TABLE]
with
[TABLE]
This completes the proof of Proposition 5.1 when is an even integer; the claim for other values of follows by interpolation. We also see that we can take with (in principle) explicit , and we may obtain any by taking sufficiently close to . ∎
Using Kolmogorov’s Tightness Criterion (Proposition 2.4), Proposition 5.1 immediately implies the following statement.
Corollary 5.9**.**
The sequence of -valued random variables is tight as .
Combining Corollaries 4.2 and 5.9 and applying Prokhorov’s Criterion (Proposition 2.3), we then obtain the following capstone statement.
Corollary 5.10**.**
The sequence of -valued random variables converges in law to as .
6. The atlas of shapes
In this section, we consider the local behavior of the limiting shapes at rational points . We prove the general first-order asymptotics in §6.1 and a refined two-term asymptotic expansion in §6.2. These asymptotics relate the local behavior of at to certain complete exponential sums modulo , which we study in detail in §6.3. In §6.4, we combine all these conclusions and prove Proposition 6.5, which provides a collection of rational points at which the path has a cusp and explains the striking sharp reversals observed in the introduction (Figures 2–4).
6.1. Local behavior of limiting shapes
We begin by writing for short
[TABLE]
We record the simple estimates
[TABLE]
Now, fix a with , and write
[TABLE]
By absolute convergence, we may rewrite
[TABLE]
It will be convenient to write
[TABLE]
as well as , . Finally, for a modulus , we will also consider the normalized complete exponential sums
[TABLE]
The connection between the exponential sums (6.5) and the local behavior of the paths and their decomposition (6.3) is given by the following lemma.
Lemma 6.1**.**
For every as in (6.2), and for every , the function defined in (6.3) satisfies, for ,
[TABLE]
where , , and are as in (6.4), (6.5), and (6.6).
Proof.
For a to be chosen suitably large later, we may write
[TABLE]
where, separating the terms in (6.3) according to whether or , splitting the summands into dyadic ranges of the form (and denoting dyadic summations over , by ), and estimating using (6.1) and the Mean Value Theorem,
[TABLE]
Now, if and only if . Moreover, denoting by the cube with edges indexed by and all edge lengths and
[TABLE]
we have that as well as
[TABLE]
Putting everything together, we have proved that
[TABLE]
where
[TABLE]
The claim of the lemma follows upon choosing to balance the error terms. ∎
In view of Lemma 6.1, the behavior of the deterministic path close to various rational points is guided at first by whether
[TABLE]
or not. In particular, close to a rational point , the deterministic path splits as in (6.3) as the sum of components over , with the component having a logarithmic singularity of order as whenever . The property of having a logarithmic singularity at should not be confused with this path having a cusp at , which is substantially more delicate and studied in §6.2.
The nonvanishing condition (6.7) is immediate to numerically check for every specific pair , and in §6.3 we offer a more detailed analysis of this fascinating question in more generality. In particular, if , then the full path has a logarithmic singularity of order ; if but for some , then the component has a logarithmic singularity of lesser severity as , which may or may not be inherited by the full path .
6.2. Finer local information
Already Lemma 6.1 clearly shows that the local behavior of near is, at the first order, guided by whether or not. This condition is analyzed in more detail in §6.3. In the case of nonvanishing, we see that
[TABLE]
according to the sign of , indicating (in contrast to the conclusion of Theorem 1.2(2)) that the function is not differentiable at but the path appears smooth around the point , in the sense that all of its Dini quotients vanish (to an arbitrarily high degree on the logarithmic scale). Such points are clearly observable (but not specifically marked) in Figure 5; check, for example, neighborhoods of .
To identify points where the path exhibits cusp behavior, we need to be able to consider the cases where and analyze lower-order local behavior. From now on, throughout the rest of Section 6, we consider the case , so that .
Fix once and for all an even nonnegative function such that for and for . By a familiar repeated integration by parts argument, for every , the Mellin transform has a meromorphic continuation to given by
[TABLE]
In particular, we have the asymptotic expansions
[TABLE]
for every , as well as the uniform bounds
[TABLE]
For a large parameter , to be suitably chosen later, we consider the (finite!) sum
[TABLE]
Using the absolutely and uniformly convergent Taylor series expansion for , we have
[TABLE]
where
[TABLE]
Denoting as usual the unnormalized Gauss sum of a (not necessarily primitive) character modulo and using the discrete and archimedean Mellin transforms, we find that, for any ,
[TABLE]
The function continues to a meromorphic function with a pole at of order at most and an asymptotic expansion
[TABLE]
where . In addition to the pole at , we encounter in the evaluation of poles whenever
[TABLE]
and these poles are simple and pairwise distinct except possibly for the pole at . When , we also encounter a distinct pole of at ; this then accounts for all the poles of the integrand in . We will show that the total contributions of these poles converge absolutely and that the contour of integration in (6.11) may be shifted to for a suitable , and we will write
[TABLE]
We begin by evaluating
[TABLE]
The total contribution of these residues to in (6.11) equals
[TABLE]
where are arithmetic functions given by
[TABLE]
and we note that depends only on .
We proceed to estimate each of the remaining contributions (). At each of the poles , we have for , , that
[TABLE]
for a fixed constant depending on only, by using Baker’s theorem on linear forms in logarithms (Proposition 2.8). Therefore, also using the uniform bound (6.9) for , the total contribution of these poles is at most
[TABLE]
by choosing . We emphasize that this is only a preliminary bound, whose primary role is to ensure absolute convergence; we will be estimating the combined contributions of all far more delicately. Using the same Proposition 2.8, we also see that the total contributions of the integrals over the horizontal segments may be bounded by over a suitable sequence of , and thus we may indeed shift the vertical contour past the line .
When , we also collect the contribution from the simple pole at equal to
[TABLE]
Finally, using the uniform bound (6.9), the total contribution of the remaining integrals over is easily seen to be
[TABLE]
where .
We now return to (6.10) and compute the summands in the decomposition
[TABLE]
corresponding to the total contributions of the four summands in (6.13). Using (6.14), we compute
[TABLE]
where
[TABLE]
and
[TABLE]
with the kernels and given by
[TABLE]
The total contribution of terms with the factor to the sum in (6.17) is times
[TABLE]
where is the degree polynomial given by the absolutely convergent integral
[TABLE]
Similarly, the total contribution of terms with the factor is times
[TABLE]
by a little calculation using integration by parts, where is the degree polynomial given by the conditionally convergent improper integral
[TABLE]
Indeed, the error term we encounter in the final line equals
[TABLE]
Putting everything together into (6.17), we find that
[TABLE]
Next, we proceed to estimate , the combined contribution of the poles at . Using (6.8) with any , we find that, for every , , ,
[TABLE]
Using the elementary integral expression for the beta function and substituting into (6.13), we find that
[TABLE]
where
[TABLE]
and
[TABLE]
where and , with the kernels and as in (6.18). Using the integration by parts in the -variable times, we find that
[TABLE]
For clarity, we note that this calculation can be performed equally well with any (even with ); if we choose sufficiently large, then a similar conclusion can be reached by integration by parts in the -variable times. Inserting these estimates above and using Baker’s bound as in (6.16), we find that
[TABLE]
by choosing and .
Finally, we address the contributions of and . Indeed, is a smooth function given by the absolutely and convergent series
[TABLE]
and
[TABLE]
where we may choose, say, for simplicity.
In total, we have proved that
[TABLE]
Critically, this yields a nontrivial asymptotic when is appreciably larger than . Now, arguing as in the proof of Lemma 6.1, we can also estimate
[TABLE]
Choosing
[TABLE]
we conclude that
[TABLE]
where , and the constants , , and (which also depend on ) may be read off from (6.12), (6.15), (6.20), and (6.19) as
[TABLE]
where we also used the classical evaluations
[TABLE]
kept in mind from (6.6) the notation and additionally denoted
[TABLE]
noted that in the ranges in which the leading terms in (6.21) are dominant, and set, for and ,
[TABLE]
For future reference we record the following simple lemma. In particular, it confirms that (as it must be) the leading constants in (6.21) and Lemma 6.1 match. To simplify the notation, we write for the sequence , and we introduce the related exponential sum
[TABLE]
where is the familiar sawtooth function defined by .
Lemma 6.2**.**
For every with , the constants shown in (6.22) satisfy
[TABLE]
Proof.
From orthogonality of characters, we have that
[TABLE]
The first two statements of the lemma follows by substituting this expression into (6.22).
On the other hand, by using the geometric series expansion and l’Hôpital’s rule, we can evaluate
[TABLE]
Using this evaluation, we can then argue as above that
[TABLE]
and the third statement again follows by invoking (6.22). ∎
Putting everything together completes the proof of the following proposition, the crowning achievement of this subsection.
Proposition 6.3**.**
There exists a , depending only on , such that, for every with and , the function satisfies
[TABLE]
for certain constants shown in (6.22) and Lemma 6.2 and .
6.3. Nonvanishing and multiplicativity of exponential sums
The exponential sums are related (though not always in a straightforward fashion) to the generalized Gauss power sums, defined for any rational number with , , and as
[TABLE]
We summarize this relationship in the following lemma. To succinctly state the multiplicativity property of the sums , we introduce for and slightly more general sums
[TABLE]
so that .
Lemma 6.4**.**
- (1)
For for an odd prime and ,
[TABLE]
where if and , and otherwise. In particular, for :
- •
* for ,*
- •
* whenever ,*
- •
* whenever and ,*
- •
for , and . 2. (2)
If for some with and , then . Further, if for an odd prime and , unless for all with . 3. (3)
For with and ,
[TABLE]
where and
[TABLE] 4. (4)
For an odd prime power, let (noting that then )
[TABLE]
Then,
[TABLE]
Proof.
Item (1) is essentially elementary (and probably well known). For , we may write
[TABLE]
If , then changing variables (for an arbitrary primitive root modulo ) shows that unless , that is, .
Now, the root of unity generates the cyclotomic field , in which is a prime of absolute norm . From we conclude that
[TABLE]
and thus in particular . For and , we have by a slightly more involved argument that
[TABLE]
from which we conclude that
[TABLE]
so in particular . As for the realness claim, if and , we see by making a change of variable that
[TABLE]
The case of and is similar (even easier). That when and follows by a change of variable . Finally, the claims identifying as the Ramanujan and Gauss sums modulo are immediate.
For , we may write for some and . Denoting by a (fixed but otherwise arbitrary) primitive root modulo , is a primitive root modulo , from which it is easy to see that
[TABLE]
When , we find that equals and thus doesn’t vanish by what we already proved. On other hand, for , it follows from the -adic method of stationary phase (see, for example, [MZ23, Lemma 1]) that the above sum vanishes, since no summands satisfy the stationary phase condition .
We proceed to item (2). If for some with and , then we see by shifting variables in (6.5) that . Now, let for some , and write , for some with , , , . If and , then, noting that and changing variables in (6.5), we again see that ; thus, from now on we may assume that or . If , then for we have that
[TABLE]
Using the above with any and applying the -adic method of stationary phase (see [MZ23, Lemma 1] and note that ), we conclude that the summation in (6.5) may be restricted to satisfying ; thus, picking any we conclude that . If , only a minor tweak is needed:
[TABLE]
whence by shifting in (6.5) we analogously find that
[TABLE]
In conclusion, unless , which is to say that , and this condition must hold for every with .
Item (3) is a direct consequence of the Chinese Remainder Theorem. Indeed, the value of each summand in (6.5) depends only on modulo . Writing every as
[TABLE]
where and and inverses are modulo , we have that
[TABLE]
where are independent of and satisfy . Thus,
[TABLE]
the claim follows immediately from this upon summing over and , which encounters and the product of with two sums .
Finally, we turn our attention to item (4). Fixing an arbitrary primitive root modulo and writing , , we have that , and, by definition, equals
[TABLE]
where, for every with , we denote by an arbitrary particular solution of the congruence . The indexing set in the latter sum forms an additive group modulo (a disjoint union of additive groups modulo ) of combined order , and the sum equals or [math] according to whether the implication
[TABLE]
holds or not. Let ; then, by adjusting the values of modulo using the Chinese Remainder Theorem, we see that the above implication holds if and only if we have a valid implication
[TABLE]
Further, denote , so that . If , the above can clearly hold only if all ; otherwise, by dividing through the first congruence by and adjusting the values of by even amounts, the above implication holds if and only if
[TABLE]
The latter plainly holds if and only if there exists a such that for all and for all , and in this case
[TABLE]
6.4. Locating a dense set of singularities
In this subsection, we use our results from §§6.1–6.3 to prove the following culminating proposition of Section 6, which provides a collection of points at which the path has a cusp.
Proposition 6.5**.**
Assume that is not identically zero or one. Let be an odd prime such that and
[TABLE]
Then, for every ,
[TABLE]
and, for , the path satisfies
[TABLE]
where are given explicitly in (6.28).
The curve has a cusp at as long as . Specifically:
- (1)
If for at least two , then (6.26) holds with
[TABLE]
with , and the path has a cusp at . In particular, the set of such (over different values of ) is everywhere dense in . 2. (2)
If for exactly one and the residues of all modulo generate all quadratic residues modulo :
[TABLE]
then (6.26) holds with
[TABLE]
where is as above, satisfies , and the path has a cusp at .
Proof.
For any such that , the congruence conditions on imply (fixing an arbitrary primitive root modulo ) that for some , whence is odd and a fortiori is odd as well. In the notation of Lemma 6.4, this, in turn, implies that, for , if and only if , whence according to Lemma 6.4, items (1) and (4),
[TABLE]
From Proposition 6.3 and Lemma 6.2, we have that
[TABLE]
where and
[TABLE]
This completes the proof of (6.26).
Recall the condition that . Now, if , then, for every , there exists a , and by shifting variables in (6.24) by and recalling that , we see that ; since this conclusion holds for every , we conclude that and (6.26) follows with . Moreover, it follows from quadratic reciprocity and Dirichlet’s theorem on primes in arithmetic progressions that there are infinitely many primes such that and for all with , whence the set of the corresponding fractions is dense in . This settles the case (1).
If , the situation is more complicated: the same change of variables in (6.24) still shows that for all , while
[TABLE]
and
[TABLE]
One final simplification is possible, as follows. The group generated by the residues of all modulo is of the form , where . If we denote by the smallest positive exponent such that , say , then and a change of variables
[TABLE]
shows that the summation in (6.29) may be restricted to , since the contributions of the terms outside this range cancel out. The same argument as in the proof of Lemma 6.4(4) then shows that
[TABLE]
where for the well-defined value of in a decomposition .
All of the above applies whenever . To reach the conclusion that, for some with , holds (whence the path would have a cusp at ), it is thus necessary and sufficient to verify that the constants and as explicated in (6.28) and (6.30) satisfy
[TABLE]
We do not know how to verify or fully characterize the set of points where this fascinating condition is satisfied. However, under the condition (6.27), , and the sums and are essentially the quadratic Gauss and Ramanujan sums:
[TABLE]
Indeed, the former follows from Lemma 6.4; the latter is clear from the above expressions, or directly from the expression (6.23) for , in which only the prinicipal character contributes . This completes the proof of (2). ∎
Remark 1*.*
The condition (6.27), which pertains to the case when exactly all but one , appears similar in spirit to (and perhaps in some ways weaker than) Artin’s primitive root conjecture, so it is perhaps reasonable to expect that it is satisfied for infinitely many primes in the fixed arithmetic progression described by (6.25), which would give an everywhere dense set of corresponding points ; we stop short of stating this as a formal conjecture. Unconditionally, using Schmidt’s estimates on complete character sums [Sch76, Theorem II.2C’] and Poisson summation, one can argue that, for every sufficiently large odd prime and every ,
[TABLE]
for at least one in every sufficiently large interval ; for example, we were able to prove that this holds in the mean square average over all for . This alone shows that in (6.31) is often rather large (that is, of expected size), but a sufficiently good complementary upper bound on is also needed, and in the case of (6.27) this is guaranteed by reduction to the Ramanujan sum.
Remark 2*.*
An interesting situation arises when all . Consider the specific case when , . As in other cases, pictures strongly suggest that the path has an everywhere dense set of cusps, with the most visually prominent ones at many of the points (); see Figure 7, which appears to be in parallel with the situation of Figure 5 (in which ). That these appear at the denominator is not a coincidence in light of
[TABLE]
Nevertheless, it is not difficult to verify that both 2 and 3 are of multiplicative order 35 modulo 71, so they generate (already each one of them generates) the subgroup , whence in this case
[TABLE]
Thus, , so already in light of Lemma 6.1, the left and right slopes of agree at , and the path has no cusp at any of these points, in apparent contradiction with the very convincing Figure 7. What is going on here?
The answer is that the apparent “cusps” are effects of lower-order terms which, in fact, disappear as gets really close to . Indeed, according to Proposition 6.3 and a quick calculation of constants using Lemma 6.2 (in which all ), the leading two terms in the asymptotic expansion for are given by
[TABLE]
where and
[TABLE]
as (or , according to the value of ). Thus, while the leading term indeed dominates for extremely small , the secondary term is substantially dominant for of moderate size, say up to , that is up to around , and explains the illusion of cusp behavior. In truth, the path exhibits a transition to a “smooth” behavior through , which can also be observed upon zooming in to the appropriate scale; see Figure 8.
7. Classifying quadratic Gauss paths
Having prepared the ground with establishing the properties of the limiting shapes in §§6.1–6.3, in this section we complete the proof of our main result on the atlas of shapes of Gauss paths, Theorem 1.2.
Proof of Theorem 1.2 (1).
The arguments of section 3 proceed analogously with the random series and its coefficients in place of and .
Indeed, the sets and of (3.2) and along with them the sums , , , and of (3.3) are insensitive to the values of as soon as ; hence the statement and proof of Proposition 3.2 remain the same as long as .
Turning to the proof of Proposition 3.1, the sums and of (3.10) are affected by the change of to , and so are the sums and of (3.11), the dependence in in (3.12) being that, in the outer sum over with , the coefficients are replaced by . Since , the key estimate on in (3.14) remains as stated for , and the rest of the proof of Lemma 3.4 and Proposition 3.1 remains exactly the same with the additionally assumption that and correspondingly replacing with .
This shows that, indeed, the random Fourier series almost surely uniformly converges and defines a continuous function such that the th Fourier coefficient of (for ) is precisely ; then, it follows as in (3.16) that the original random Fourier series converges a.s. and that a.s.
Turning to the computation of the moments, we may write every as as in (6.2); then, in place of (3.17) we have the evaluations
[TABLE]
For every and , the evaluation of the moments (defined analogously to (3.18)) of the -valued random variable proceeds analogously to the proof of Lemma 3.5, with in place of . In the evaluation of in (3.21), all terms with contribute, and consequently we may estimate
[TABLE]
also using (3.22). For the same reason, we encounter the dependence of the implied constants on in (3.23), (3.24) and below, (3.26) and below, and (3.27). The proof otherwise runs verbatim the same (with and the condition in place of and ), and we obtain the evaluation
[TABLE]
In particular, our discussion above shows that for some depending only on (in fact, is admissible), and so is a mild random variable.
We then proceed to follow section 4 and demonstrate the analogue of Proposition 4.1, that the complex moments , defined as in (4.1) with in place of , satisfy
[TABLE]
Denoting and for and , we first confirm by sieving as in (4.2) that
[TABLE]
where and
[TABLE]
Lemmata 4.3–4.6 are valid for all values of . In the rest of §4.1, we only need to replace with and all sums over by , in particular when replacing the modified moment defined in (4.5) with the analogously defined moment ; with these changes, the rest of §4.1 is valid verbatim (with even the implied constants independent of ). In place of (4.14) we obtain
[TABLE]
Adjusting the proof of Lemma 4.9 as in (7.4), we find that for
[TABLE]
Estimating tails as in (4.15) and (7.1), including
[TABLE]
and keeping in mind the evaluation (7.2), we finally conclude
[TABLE]
We estimate the off-diagonal contributions as in the proof of Lemma 4.10, with the basic estimate after grouping according to the values of and with being
[TABLE]
The rest of the argument proceeds completely analogously, additionally restricting the sieving variables to , and using the Pólya–Vinogradov inequality for non-principal characters (for various ) of conductor to additionally detect congruence conditions , and we obtain
[TABLE]
Putting everything together completes the proof of (7.3) and along with it the convergence of in the sense of finite distributions as .
Moreover, the sequence of -valued random variables is tight at by Kolmogorov’s Tightness Criterion, because in light of (7.4) we have in the situation of Proposition 5.1 that a fortiori
[TABLE]
(Here, we profit from the fact that to execute this bootstrap argument, but, alternatively, it takes only minimal changes to sequentially adapt the arguments of section 5 to the family , with implied constants depending on .) As in §5.3, using Prokhorov’s Criterion we conclude that, indeed, in law as . ∎
Proof of Theorem 1.2 (2).
The deterministic Fourier series converges absolutely and uniformly by comparison with the absolutely convergent series ; along with its individual summands, its sum is therefore also a continuous function. The remaining statements about the everywhere dense set of points at which the path (for not identically zero) has a cusp follow from Proposition 6.5. ∎
Proof of Theorem 1.2 (3).
We begin by recalling the beginning of the proof of Lemma 3.5, where we established that all results of §3.1, including crucially Proposition 3.2 and Lemma 3.4, remain valid for
[TABLE]
and . Further, denote
[TABLE]
As already discussed in the proof of item (1), for , Proposition 3.2 is literally unchanged when replacing by because whenever ; the same is true for the statement of Lemma 3.4, because the proof only additionally requires that for . Hence Proposition 3.1 and its proof remain valid for as well.
As in (3.8), we have that
[TABLE]
where we have already verified that the limit converges almost surely. Now, as in (3.15), for we have that
[TABLE]
Using this for , we have that outside an event of probability , for all ; taking limits as and invoking (7.5) we conclude that
[TABLE]
Finally, for any , we may consider the bounded continuous function defined by
[TABLE]
Since in law as , for sufficiently large we have that
[TABLE]
Since outside an event of probability , we have using Chebyshev’s inequality that, for all ,
[TABLE]
Therefore
[TABLE]
for all sufficiently large . This completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bak 77] A. Baker, The theory of linear forms in logarithms , Transcendence theory: advances and applications (Proc. Conf., Univ. Cambridge, Cambridge, 1976), Academic Press, London-New York, 1977, pp. 1–27. MR 498417
- 2[BGGK 18] Jonathan Bober, Leo Goldmakher, Andrew Granville, and Dimitris Koukoulopoulos, The frequency and the structure of large character sums , J. Eur. Math. Soc. (JEMS) 20 (2018), no. 7, 1759–1818. MR 3807313
- 3[Cha 14] Mei-Chu Chang, Short character sums for composite moduli , J. Anal. Math. 123 (2014), 1–33. MR 3233573
- 4[GS 03] A. Granville and K. Soundararajan, The distribution of values of L ( 1 , χ d ) L(1,\chi_{d}) , Geom. Funct. Anal. 13 (2003), no. 5, 992–1028. MR 2024414
- 5[HB 95] D. R. Heath-Brown, A mean value estimate for real character sums , Acta Arith. 72 (1995), no. 3, 235–275. MR 1347489
- 6[HBP 15] D. R. Heath-Brown and L. B. Pierce, Burgess bounds for short mixed character sums , J. Lond. Math. Soc. (2) 91 (2015), no. 3, 693–708. MR 3355121
- 7[HL 24] Ayesha Hussain and Youness Lamzouri, The limiting distribution of Legendre paths , J. Éc. polytech. Math. 11 (2024), 589–611. MR 4767013
- 8[Hus 22] Ayesha Hussain, The limiting distribution of character sums , Int. Math. Res. Not. IMRN (2022), no. 20, 16292–16326. MR 4498175
