Limit laws for rational continued fractions and value distribution of quantum modular forms
Sandro Bettin, Sary Drappeau

TL;DR
This paper establishes the limiting distributions of Birkhoff sums for rational numbers under the Gauss map, extending previous results, and applies these findings to demonstrate Gaussian and Cauchy distributions of values of quantum modular forms and related functions.
Contribution
It extends the understanding of distribution laws for cost functions along orbits of the Gauss map and applies these results to quantum modular forms, providing new proofs and generalizations.
Findings
Birkhoff sums converge to stable laws with power-saving error estimates.
Central values of the Esterman function follow a Gaussian distribution with large variance.
Dedekind sums converge to a Cauchy law, with proofs using dynamical methods.
Abstract
We study the limiting distributions of Birkhoff sums of a large class of cost functions (observables) evaluated along orbits, under the Gauss map, of rational numbers in ordered by denominators. We show convergence to a stable law in a general setting, by proving an estimate with power-saving error term for the associated characteristic function. This extends results of Baladi and Vall\'ee on Gaussian behaviour for costs of moderate growth. We apply our result to obtain the limiting distribution of values of several key examples of quantum modular forms. We show that central values of the Esterman function ( function of the divisor function twisted by an additive character) tend to have a Gaussian distribution, with a large variance. We give a dynamical, "trace formula free" proof that central modular symbols associated with a holomorphic cusp form for have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalytic Number Theory Research · Advanced Mathematical Identities · Advanced Algebra and Geometry
Limit laws for rational continued fractions and value distribution of quantum modular forms
S. Bettin
SB: Dipartimento di Matematica, Università di Genova, via Dodecaneso 35, 16146 Genova, Italy
and
S. Drappeau
SD: Aix Marseille Université, CNRS, I2M UMR 7373, 13453 Marseille, France
Abstract.
We study the limiting distributions of Birkhoff sums of a large class of cost functions (observables) evaluated along orbits, under the Gauss map, of rational numbers in ordered by denominators. We show convergence to a stable law in a general setting, by proving an estimate with power-saving error term for the associated characteristic function. This extends results of Baladi and Vallée on Gaussian behaviour for costs of moderate growth.
We apply our result to obtain the limiting distribution of values of several key examples of quantum modular forms. We obtain the Gaussian behaviour of central values of the Esterman function (), a problem for which known approaches based on Eisenstein series have been so far ineffective. We give a new proof, based on dynamical systems, that central modular symbols associated with a holomorphic cusp form for have a Gaussian distribution, and give the first proof of an estimate for their probabilities of large deviations. We also recover a result of Vardi on the convergence of Dedekind sums to a Cauchy law, using dynamical methods.
Key words and phrases:
Gauss map, continued fractions, modular forms, transfer operator, stable laws
2010 Mathematics Subject Classification:
11A55 (Primary); 37C30, 11F03, 11F67, 60F05 (Secondary)
1. Value distribution of quantum modular forms
Let be a cofinite Fuchsian subgroup, which acts of functions on by the weight- “slash operator”
[TABLE]
where is the Möbius transformation.
In their simplest guise, quantum modular forms, introduced by Zagier [Zag10] (see [Zag99] for early examples), denote the set of functions
[TABLE]
for some finite set , satisfying a form of modularity, in the purposely vague sense that for all , the function of defined by
[TABLE]
has some regularity property. Part of the research effort has focused on constructing examples in interrelated ways:
- –
generating series associated with combinatorial sequences: Fishburn matrices [BLR14], unimodal sequences [BFR15, KLL16], partition theory [NR17] (we note that in the latter, quantum modularity is actually a crucial tool for the asymptotic estimation of partition-related sequences),
- –
radial limits of modular objects (mock theta function, quasi-modular forms) defined on the hyperbolic disk [Zag01, CLR16, Fol14, BLR18, FOR13],
- –
Eichler integrals, periods of modular functions [BR16, BKM19]; state integrals involving the quantum dilogarithm function [LZ99, DGLZ09, GK15],
- –
describing the homology of spaces of cusp forms [BLZ15, CL16, BCD18],
- –
Kashaev knot invariants and Nahm sums [Zag10, GZ, BDb],
- –
correlations of the fractional part functions appearing as covariances in the Nyman-Beurling reformulation of the Riemann hypothesis [BC13a, BC13b, BM15, LZ19],
- –
Diophantine approximation and multifractal analysis [JM18, RR13].
In the present paper we are concerned with the following problem: given a quantum modular form , how do the multi-sets
[TABLE]
appropriately normalized, distribute as ? This topic is tightly related to weak limits of partial sums of certain arithmetic functions, which goes back to Hardy-Littlewood [HL14], and has been since then periodically revisited: we mention in particular the works [Wil33, JVH83, Mar99] on theta sums, and [Bet15, MR16] on cotangent sums. These works are all concerned with instances of QMF of non-zero weight.
Our interest in this question comes from the statistical study of additive twists central values of functions, which as we will see are weight-zero QMF related to the third item described above (periods of modular functions). We are aware of two occurences of this setting in the literature. The first is a result of Vardi [Var93] on the existence of a limiting distribution for Dedekind sums. The second is a recent result of Petridis and Risager [PR18] on the distribution of modular symbols, which was motivated by conjectures of Mazur-Rubin [MR16] and Stein [Ste15]. Both results exploit a close connection with Fourier analysis of the modular surface (the spectral analysis of the hyperbolic Laplacian). It is unclear how to extend these methods to more general QMF.
In the present paper we present an approach, based on dynamical systems and the spectral properties of a family of transfer operators, which allows us to answer the question (1.2) for essentially all level (i.e. ) and weight [math] QMF.
We denote and the two usual generators of , so that the associated period functions (1.1) with weight are given by
[TABLE]
Our result may be stated informally as follows.
Theorem 1.1**.**
Let be a weight [math] QMF for , in the sense that extends to a Hölder-continuous function on with some regular growth behaviour at [math], and . Then, up to a suitable renormalization, the multi-sets (1.2) become equidistributed according to a stable law, which is characterized by the growth of at [math].
The hypothesis are stated more precisely in Theorem 3.1 below. The restriction to and the assumption are made to clarify the statement, but are inessential (cf. footnote 2, page 2) and are natural in applications. However, the restriction to is important in our argument. What is required is that the action of on can be induced into an expanding Markov map (the Gauss map in the case ). The restriction to weight zero QMF is also natural, as the problem for non-zero weights is of different nature and typically simpler; see [BDc] for more details.
We will find in practice that any bound of the shape as () ensures convergence to a Gaussian law. A bound of type for some , will typically imply the convergence to a stable law of parameter .
Due to the relative weakness in its hypotheses, Theorem 1.1 applies to a wide class of QMF. In the next section, using Theorem 1.1, we answer the distribution question for several arithmetic invariants. We expect further applications to follow in the future.
2. Applications
For all , we endow the set
[TABLE]
with the uniform probability measure , and we denote the associated expectation,
[TABLE]
2.1. Central values of the Estermann function
Let
[TABLE]
denote the divisor function. The Estermann function, introduced in 1930 [Est30], is defined by
[TABLE]
for , and extended by analytic continuation otherwise. It was initially introduced in relation with the shifted divisor problem . Its functional equation still serves as a basic tool to derive Voronoï summation formulae, which are then used in conjunction with the circle method to study moments of -function and their arithmetics applications: we mention the proportion of critical zeroes of [Con89], the binary divisor problems [Mot94], and non-vanishing of central -values [Luo15].
We mention a further connection with moments of Dirichlet -functions . By [Bet16, Theorem 5], the twisted second moment of Dirichlet -functions satisfies
[TABLE]
for prime and . From this expression, we see that the second moment is essentially the fourth moment of Dirichlet -functions , whose full evaluation in [You11] lies at the threshold of current techniques of analytic number theory (see [BFK*+*17a, BFK*+*17b] for further work on this topic). This fits in the general problem of understanding the distribution of central values of -functions and their twists, which is a fundamental topic in analytic number theory [Sel92, CFK*+*05, Sou09, Har13, RS15]. Up to now, essentially all known results have been obtained by the moments method.
Using Theorem 1.1, we obtain the following Central Limit Theorem.
Theorem 2.1**.**
For all , , and all rectangle , we have
[TABLE]
where .
We will obtain this result as an application of Theorem 1.1, using the fact, proved in [Bet16], that is a weight-[math] QMF, the associated function (1.3) being a -Hölder continuous function on .
The simplicity with which we will deduce Theorem 2.1 from Theorem 1.1 contrasts with the fact that other methods appear to be completely ineffective with this problem. The moments method (and therefore also the approach of [Nor21] described in the following section) can not be applied, due to the presence of a negligible proportion of with abnormally large continued fraction coefficients, whose contribution dominates the integer moments of . In fact, all moments of and have recently been computed in [Bet19]: starting already from the second they grow faster that what is suggested by Theorem 2.1.
Another tentative approach to Theorem 2.1 consists in obtaining a limiting distribution result for , where ; is chosen at random and , and transfering these properties to its discrete counterpart , for , by Fourier expansion. This method is employed for the incomplete Gauss sum in [DA14], based on [Mar99]. In our setting, however, connecting the continuous and the discrete averages raises several additional difficulties, among which the problem of dealing with a divergent second moment as well as those coming from considering a central value rather than an object corresponding to an -value off the line. Also the incomplete Gauss sums are not Gaussian distributed, but rather are distributed as the push-forward measure by a theta series. This suggests a difference in nature between the two problems and, notwithstanding the technical difficulties, it suggests this method could not be adapted to our case.
It would be interesting to obtain a statement analogous to Theorem 2.1 for the values , however the identity (2.1) as stated holds only for prime; the corresponding identity for generic involves a multiplicative convolution, which is not clearly accountable for using the present method. However we still believe is distributed according to a normal law.
Conjecture 2.2**.**
As along primes, the multi-set
[TABLE]
become distributed according to a dilated centered Gaussian.
2.2. Modular symbols
Out of many possible ways, modular symbols [Man09] can be seen as elements of the space of linear forms on , the vector space of cusp forms of weight and level , spanned by the Shimura integrals
[TABLE]
for any , , , and where . (In this section and section 9.1 only, the letter will denote a holomorphic cusp form).
Up to an explicit factor, the value is also the special value of the analytic continuation of the -function
[TABLE]
where we write for . Being at the intersection of the geometric, modular and arithmetic aspects of , modular symbols received a considerable amount of interest, e.g. for computing with modular forms [Cre97, Ste07]. The central values of the Estermann function , which was the subject of the previous section, may be interpreted as a regularized modular symbol for the derivative of the Eisenstein series [Iwa02, chapter 3.5].
The symbol associated with the central value plays a particular rôle and is the focus of this section. Motivated by the -conjecture, the question of the size of modular symbols was initially asked by Goldfeld [Gol99] and later studied by Petridis and Risager [Pet02, PR04]. Interest in this question was recently revived by questions of Mazur, Rubin and Stein [MR16, Ste15], motivated in part by the growth of ranks of elliptic curves. In particular, for a fixed form of weight (and so ) and varying along rationals of reduced denominator , with , Mazur and Rubin predicted that
[TABLE]
becomes asymptotically distributed according to a suitably dilated normal law. To our knowledge, only the first and second moments have been computed [BFK*+*, Chapter 9].
The situation changes with an additional average over . Then the Central Limit Theorem for weight , level forms has been proved by Petridis and Risager [PR18], by using the spectral analysis of twisted Eisenstein series and the location of eigenvalues of the hyperbolic Laplacian.
Using Theorem 1.1, we will prove the following Central Limit Theorem for modular symbols associated with forms of level and arbitrary weight .
Theorem 2.3**.**
Let be even, and be fixed. Then for all and rectangle , we have
[TABLE]
with , where is the weight- Petersson norm of .
For example, this result applies for with being the discriminant modular form . The error term is optimal and uniform with respect to .
Using Theorem 1.1, we will also deduce the following statement, which can be interpreted as a Large Deviations Principle.
Theorem 2.4**.**
Let be fixed as in Theorem 2.3. For any , we have
[TABLE]
We will deduce Theorems 2.3 and 2.4 from Theorem 1.1 by showing that the map is a weight [math] QMF with the period function (1.3) being Hölder-continuous on . We will estimate all small exponential moments, from which the Large Deviations Principle will follow.
Theorem 2.3 is related to three works, two of which have been carried out around the same time as the present.
- –
The work of Petridis and Risager [PR18] was concerned with the case of weight forms, which is relevant to the conjectures of Mazur-Rubin and Stein (cusp forms associated with elliptic curves). Their method is based on the analysis of twisted Eisenstein series. Although the authors do not seem to mention it, their method is capable of obtaining a Large Deviations Principle. The assumption is however crucial in their approach, since only in this case the analogues of the function in (1.3) are constant (this translates into the independence of the multiplier system defined in [PR18, p. 7] with respect to ).
- –
Very recently, Nordentoft [Nor21] has obtained the Central Limit Theorem for arbitrary weight and level modular forms. Specializing to , this gives in particular an independent proof of Theorem 2.3. Similarly as [PR18], the spectral analysis of twisted Eisenstein series play a central rôle, however his argument is based on a completely different construction (due to the “independence with respect to ” obstruction mentioned above). As a result, it crucially relies on the consideration of moments, and falls short of establishing a Large Deviations Principle (Theorem 2.4).
- –
Also very recently, Lee and Sun [LS19] have independently obtained a proof of the main theorem of [PR18], for weight , by a method similar to the one we pursue here. This is achieved by considering a certain twisted version of the Gauss map, keeping track at each iteration of a coset in . On the other hand, for the period functions (the analogues of in (1.3)) are constant. The technical difficulties they are faced with are of very different nature from the ones we encounter here. By mixing the methods presented here with those of [LS19], it is plausible that the main results of [Nor21], for arbitrary weight and levels, could be recovered by dynamical methods.
The difference in the magnitude of the variance between Theorems 2.1 and 2.4 is noteworthy, and is due to the presence, or not, of a pole the associated -functions ( and respectively) at .
We have considered only the modular symbols at the central value . The behaviour for is very different and simpler: for instance, when , the series extends to a continuous function of , and the values become distributed according to the push-forward of the Lebesgue measure. We return to this question in more details in [BDc].
The original conjectures of Mazur-Rubin and Stein were concerned with the case of a single . There does not seem to be an effective way, with our method or those of [PR18, Nor21, LS19], to isolate rationals of fixed denominators, whence all results proven so far rely on the extra average over .
2.3. Kashaev invariants of the knot and sums of continued fraction coefficients
Our next application is motivated by a question in [Zag10]. To a knot and an integer , we associate the -colored Jones polynomial [Gar18, section 1], which is a Laurent polynomial defined by a combinatorial construction through a diagram representation of . For any root of unity , we define where is the order of . In [MM01], it was shown that the sequence is the Kashaev invariant of [Kas95] (the full function can be reconstructed by the action of the Galois group). In the case of the knot (or “figure-eight” knot), the simplest hyperbolic knot, we have explicitely [Zag10]
[TABLE]
Note that for each given , the sum is finite. In this case, Zagier’s modularity conjecture, stated precisely in [Zag10], predicts that is a weight [math] QMF: the difference
[TABLE]
which is depicted in [Zag10, Fig. 4] is expected to behave “nicely” with respect to , although not continuously. Conjecturally, we expect as , where and is the hyperbolic volume of ; Kashaev’s volume conjecture is the case , , which is known in this case [AH06].
A proof of Zagier’s conjecture for the knot has been announced by Garoufalidis and Zagier [GZ]. In [BDb], we obtained independently another proof, complemented by a reciprocity formula relative to a transformation of another kind, by which we deduced the following asymptotic estimate: for , denote
[TABLE]
the sum of continued fraction coefficients of . Then by [BDb], we have
[TABLE]
This is in accordance with the conjectured behaviour of as .
The map , suitably extended to , is a weight [math] QMF with associated period function (1.3) satisfying for . Using Theorem 1.1, we compute in Theorem 9.4 below the distribution of for all , extending work of [BV05a, BH08]. In particular, in the case we obtain the following result.
Theorem 2.5**.**
Let denote the cumulative distribution function of the stable law , that is where
[TABLE]
and let be the Euler–Mascheroni constant. Then, as ,
[TABLE]
Theorem 2.5 answers a question in [FVV], and echoes a result of Heinrich [Hei87] which obtained a similar convergence for Lebesgue-almost all . In particular, (2.6) implies that the generic time complexity of the substractive algorithm for the GCD is asymptotically on pairs of coprime numbers at most , as . This is in sharp contrast with the average time complexity, which is (the latter is known even with a single average over numerators, see [YK75]). Note that, contrary to the case of the Estermann function (Theorem 2.1), this discrepancy between the typical size and the average size is consistent with fact that the stable law has a divergent first moment.
Combined with (2.5), Theorem 2.5 implies the following law of large numbers for the values of .
Corollary 2.6**.**
As , we have
[TABLE]
for a proportion of fraction of denominators at most .
The typical size of among fractions with is therefore much less than that of (by (2.5)).
We expect a convergence in law analogous to (2.6) for the values , however the error term which we obtain in (2.5) is not precise enough to carry this out. This issue is discussed more precisely in [BDb], along with the case of other knots.
2.4. Dedekind sums
For all coprime integers and with , the Dedekind sum
[TABLE]
is a rational number of modulus at most . They appear naturally in the multiplier system attached to the Dedekind function; we refer to the monograph [RG72] for further properties and references. The value distribution of on average over rational has been studied from several points of view [Hic77, Var87, Bru90, Var93].
From Theorem 1.1, we will deduce a short proof of the following convergence to a Cauchy law, which is the main result of [Var93].
Theorem 2.7** ([Var93]).**
Uniformly for and , we have
[TABLE]
This statement will easily follow from Theorem 1.1 by noting that is a weight [math] QMF, with associated period function (1.3) being roughly .111Notice that we obtain a different distribution than in Theorem 2.5. This is due to the fact in that case the period relation is more precisely rather than . This last fact is a consequence of the reciprocity formula for [RG72, Chapter 2, Theorem 1].
The proof of [Var93] builds on the close connection between and the multiplier system associated to the function, which brings the problem to an analysis of twisted Poincaré series on the modular surface, which are in turn studied by means of the Kuznetsov trace formula. By contrast, our arguments are dynamical in nature, and use little arithmetic information beyond the group structure of .
3. Overview
3.1. Reduction to dynamical analysis
We now overview the arguments underlying Theorem 1.1. Suppose is such that the functions defined through (1.1) satisfy and extends to a continuous function on . To simplify the presentation, in this section only, we assume that is an even function.
The starting point is the remark that the action of and on can be induced into an expanding Markov map on , namely the Gauss map , where the braces denote the fractional part. More precisely, by periodicity and since is even, we have , which after iteration (Euclid’s algorithm) yields222Without the assumption a similar formula holds: we have in general where for , and the ensuing analysis still applies.
[TABLE]
where is minimal subject to .
This expression relates the values of to Birkhoff sums associated with the Gauss map . This map is known to have good mixing properties [CFS82, p. 174], by which we expect the sum (3.1) to behave like a sum of independent random variables. Precise limiting behaviour may be obtained through the study of spectral properties of transfer operators associated with : this is an important theme in smooth dynamics and stationary Markov chains [Doe40, For40, ITM50, Wir74, RE83, Bro96, AD01]. We refer in particular to [Bro96] and to the introduction of [AD01] for an extensive historical account and references. Among maps of the interval, the Gauss map has been particularly studied because of its link with the analysis [May76, Pol86, May91b, MV12] of geodesic flows on the surface (where is the upper-half plane).
A prominent example is given by [Bro96, Theorem 8.1]: suppose is of bounded variations and not of the shape , for some function of bounded variations and some constant , and let
[TABLE]
Then for some constants and , uniformly for ,
[TABLE]
as , where is chosen uniformly according to the Lebesgue measure. The implied constant may depend only on . This relies on the spectral analysis of perturbations of the Gauss-Kuzmin-Wirsing transfer operator
[TABLE]
We are interested in the case when is a rational chosen at random in , and the Birkhoff sum in (3.1) is over the full orbit, its length varying with . Denote
[TABLE]
In [Val00], Vallée has shown that the expectations of satisfy
[TABLE]
for functions which are constant on each interval () and under some growth condition at [math]. Here the number is absolute and the implied constant may depend on . The numbers depend only on , and in fact
[TABLE]
The quantity , and the measure , are essentially the projection of on the eigenspace of associated to its dominant eigenvalue . An important point is that this question was studied within the framework of dynamical methods. The argument uses the construction of a suitable generating series involving the quasi-inverse of twisted transfer operators
[TABLE]
for arbitrary large (by contrast with the continuous setting when was chosen uniformly at random in , which involves perturbations of a single fixed operator). This construction crucially relies on the fact that the denominator of can be detected by means of the Birkhoff sum . Earlier approaches [Hei69, Dix70], restricted to , involved number-theoretic methods based on bounds on algebraic exponential sums.
The approach of [Val00] was developed by Baladi and Vallée [BV05a], who proved that for constant on each interval of the shape and under a logarithmic growth condition , the Laplace transform satisfies the “quasi-powers expansion”
[TABLE]
for in some complex neighborhood of [math]; the holomorphic functions , the number and the implied constant may depend on . By Hwang’s theorem [Hwa96], this has a number of consequences in terms of the limiting distribution, and among them, an effective central limit theorem: if is real, non identically zero and as above, then for some numbers , and all we have
[TABLE]
The power-saving error term in (3.4) depends on proving a pole-free strip for the quasi-inverse of the twists (3.3), which can be viewed as a “quasi-Riemann hypothesis” for the generating function of interest333The same set of idea imply a zero-free strip for the Selberg zeta function of the full modular group [Nau05].. The approach of [BV05a] extends seminal work of Dolgopyat [Dol98] to the case of an expanding interval map with an infinite partition (see also [BV05b]).
Motivated by our application (3.1), we extend the methods of [BV05a] in two directions. In the first direction, we consider cost functions which are not necessarily constant by parts. We will require that can be extended to a Hölder continuous function, with some uniform exponent, on each interval .
The second direction we wish to consider is cost functions having a possibly divergent first or second moment, say , or , with the consequence that the limit law will be stable, but not necessarily Gaussian anymore [BV05a, p. 384]. This is a well-known theme in the theory of sums of independent random variables; see Chapter VI of [Lév25], or Chapters VI.1 and XVII.5-6 of [Fel71]. The corresponding phenomenon for sums of continued fractions coefficients in the continuous setting (Lebesgue-almost all ) has been investigated by elementary means by Lévy [Lév52] (see also [Hei87]), and later by transfer operator methods [GH88, GLJ93, Sze09, AD01]. In fact, this falls into the general “countable Markov-Gibbs” framework of [AD01], where it is refered to as the “distributional limit” problem. A large part of later work has focused on non-uniformly hyperbolic maps; we refer to the survey [Gou15] and the references therein.
We investigate the corresponding question in the discrete setting ( at random in ). We evaluate with effective, power-saving error terms the characteristic function
[TABLE]
for in a real neighborhood of [math], under hypotheses which essentially reduce to the boundedness of some positive absolute moment, for some .
3.2. Statement of the main distributional result
For technical reasons, we will work in a more general setting. For a parameter , a real interval and a normed space let denotes the set of functions such that the Hölder semi-norm
[TABLE]
is finite.
We let be the inverse branches of the Gauss map, and .
We fix an integer , and functions . We extend this definition by periodicity, letting for all , and we define a function on by letting and
[TABLE]
where we recall that
[TABLE]
For with , we define
[TABLE]
We make the following hypotheses:
- (1)
(-Hölder continuity) For each and , the function can be extended as a -Hölder continuous function on the interval . 2. (2)
(Norm -th moment) For each , we have
[TABLE] 3. (3)
(Hölder -th moment) For each , we have
[TABLE]
For all , we denote . Finally, let
[TABLE]
Theorem 3.1**.**
Let be given with , and satisfying the conditions (3.7)–(3.8). There exists , , and functions such that for all with , we have
[TABLE]
and
[TABLE]
If moreover , then there exists a real matrix such that
[TABLE]
with interpreted as a column vector and its transpose. The numbers and the implied constant depend at most on and on an upper-bound for the left-hand sides of (3.7)–(3.8).
Remark*.*
- –
It is important to note that the actual values of and only affect the statement up to the value of and the implied constants. In particular, if is on each interval and if there exists such that for , then (3.8) is satisfied with and any .
- –
In our applications, we will have for all in , uniformly in . In this situation, (3.7) is equivalent to .
- –
In typical applications, the quantity can be evaluated by standard methods, and the results relevant for the present paper are worked out in [BDa].
- –
The results of [BV05a] are stated in a formalism which includes the Gauss map as a special case. The most crucial assumption, at least as far as one is interested in power-saving error terms, is the “uniform non-integrability” assumption [BV05a, p.357] (see [Mor15] for a qualitative result, not using the UNI condition). In the present work, we do not use any more specific properties of the Gauss map; with suitable modifications, the arguments presented here apply to the centered and odd Euclidean algorithms [BV05a, Fig. 1] as well.
- –
A generalization in a different direction of Baladi-Vallée’s results, for maps associated to a reduction algorithm in congruence subgroups, has very recently and independently been obtained by Lee and Sun [LS19]444Another way to study levels would be by building an expanding map out of Atkin-Lehner homographies, but our attempts to construct such a map satisfying the UNI condition were not successful..
The main feature of Theorem 3.1 we use is the relative weakness of the hypotheses on . This is important in our applications, where often little is known on besides the regularity properties, and rough bounds on the Hölder norms. To obtain this uniformity, we systematically use Hölder spaces, not only because of the regularity of , but also in order to dampen the oscillations of (see (4.3) below). This is the main reason why arbitrarily small values of are admissible for Theorem 3.1.
The shape of the asymptotic estimates (3.9)–(3.11) are characteristic of infinitely divisible distribution [Fel71, Chapter XVII.2]. The informally stated Theorem 1.1 follows, in all the cases we consider, by an evaluation of and the Berry-Esseen inequality [Ten15, Theorem II.7.16], [FS09, Theorem IX.5].
The variety of situations we consider prevents us from making a clear and concise set of hypotheses on which would make Theorem 1.1 rigorous. Besides working out in details the applications, we restrict to illustrating the case and by the following Central Limit Theorem for the Birkhoff sums defined in (3.2), which recovers and extends in particular [BV05a, Theorem 3.(a)] (see also [BH08, Remark 1.3]). We recall that is the cumulative distribution function of the standard normal law.
Corollary 3.2**.**
Assume that the bound
[TABLE]
holds for some and . Suppose that is not of the form for a function and . Then, for some and
[TABLE]
we have
[TABLE]
uniformly in .
A variation on the argument shows that the milder hypothesis implies the estimate (3.13) with a qualitative error term as . For any , under the condition , a variation on the arguments also leads to an estimate with power-saving error term for .
Acknowledgments
This paper was partially written during a visit of of S. Bettin at the Aix-Marseille University and a visit of S. Drappeau at the University of Genova. The authors thank both Institution for the hospitality and Aix-Marseille University and INdAM for the financial support for these visits. The authors thank B. Vallée for help regarding the references, J. Marklof and B. Borda for discussions, and the anonymous referees for her or his remarks.
S. Bettin is member of the INdAM group GNAMPA and his work is partially supported by PRIN 2017 “Geometric, algebraic and analytic methods in arithmetic”.
Notations
For any function of two real or complex variables, and all , we let whenever this function is defined.
We recall that the semi-norm is defined in (3.5). The Landau symbol means that there is a constant for which whenever and are defined. The notation means . If the constant depends on a parameter, say , this is indicated in subscript, e.g. or .
4. Lemmas
4.1. Hölder constants
We compile here several facts we will use on the Hölder norms for .
- (1)
For , we have
[TABLE]
This follows by writing . 2. (2)
For and , we have and
[TABLE]
This follows by writing \frac{|f(g(x))-f(g(y))|}{|x-y|^{\kappa}}=\big{|}\frac{g(x)-g(y)}{x-y}\big{|}^{\kappa}\frac{|f(g(x))-f(g(y))|}{|g(x)-g(y)|^{\kappa}}. 3. (3)
For and real, we have
[TABLE]
This follows by writing . 4. (4)
For and , we have
[TABLE]
This follows by writing \frac{|f(x)-f(y)|}{|x-y|^{\kappa}}=|f(x)-f(y)|^{1-\kappa/\lambda}\big{(}\frac{|f(x)-f(y)|}{|x-y|^{\lambda}}\big{)}^{\kappa/\lambda}.
4.2. Oscillating integrals
We will require the following analogue of van der Corput’s lemma. We let
[TABLE]
Lemma 4.1**.**
Let . Assume that is with and that is monotonous on . Let . Then
[TABLE]
Proof.
The lemma is obtained by combining the methods of the usual van der Corput Lemma ([Ste93], Proposition VIII.2, p. 332), and the bound on Fourier coefficients of a Hölder continuous function ([SS03], ex. 15, p. 92); we restrict to mentioning the main steps. We change variables and let
[TABLE]
Let . We have
[TABLE]
Now, on the one hand,
[TABLE]
by (4.2) on the first term, and on the other hand,
[TABLE]
by monotonicity. We conclude using . ∎
5. Properties of the transfer operator
In this section and the following ones, all implied constants in the notations and may depend on , and an upper-bound for the values of (3.7)–(3.8). Additional dependences will be indicated in subscript.
Definition
Let
[TABLE]
where we recall that is the Hölder exponent of on each interval . For all and with , define an operator acting on by
[TABLE]
When , this is independent of , in which case we drop the notation . We abbreviate further
[TABLE]
Define the norm and the semi-norm
[TABLE]
Here we recall that . We equip with the norms
[TABLE]
For and , , and , let
[TABLE]
with . In what follows, we will often abbreviate
[TABLE]
and define
[TABLE]
Note that belongs to for all . Moreover, since whenever is differentiable at , for all , we have
[TABLE]
and therefore
[TABLE]
5.1. Properties at the central point
By [Bro96] (section 2.2, Proposition 4.1 and Theorem 4.2), along with the Ruelle-Perron-Frobenius theorem (see [May91a]), we have that the operator acting on is quasi-compact. It has as a simple eigenvalue, and no other eigenvalue of modulus . The projection associated with the eigenvalue is given by
[TABLE]
where is the Lebesgue measure. In particular, we have
[TABLE]
Since , we obtain the existence of a linear operator acting on , such that
[TABLE]
and additionally the spectral radius of satisfies and . We will use on several occasions that for continuous by parts,
[TABLE]
5.2. Dominant spectral properties
In the sequel, we will repeatedly use the following facts. We let .
- –
The bounded distortion property [BV05a, eq. (3.1)]: for all , we have with a uniform constant, in particular, independently of the depth of . This implies that for all and ,
[TABLE]
- –
For all , we have
[TABLE]
- –
The “contracting ratio” property of the inverse branches [BV05a, bound 2.4, fig. 1]: for some , we have the following uniform bound for all and :
[TABLE]
Lemma 5.1**.**
- –
For , we have
[TABLE]
- –
For all , there exists such the map is Lipschitz-continuous and decreasing, , and we have the bounds on operator norms
[TABLE]
In particular, for .
- –
For some , all and all with , we have
[TABLE]
The implied constant is absolute.
Proof.
- –
The first statement is trivial by the triangle inequality.
- –
By a direct computation, we have
[TABLE]
The properties we require of are readily verified.
- –
The third item follows from (5.4) with any fixed , by the definition of the spectral radius.
∎
5.3. Spectral gap at and
Lemma 5.2**.**
For , we have , and so similarly for .
Proof.
This is a well-studied phenomenon; see [PP90, Proposition 6.1]. Our exact statement for acting on a space of holomorphic functions is proved in [Val03, p. 476] (see also [Mor15, Proposition 3.2]), however an inspection of the proof shows that it actually holds for acting on , and therefore also on . ∎
5.4. Perturbation
Lemma 5.3**.**
For all , there exists such that for , the following holds.
- –
For and , we have
[TABLE]
- –
For and all , we have
[TABLE]
- –
For , we have
[TABLE]
Proof.
We assume , the general case being essentially identical.
- –
We have
[TABLE]
However, for all , by the bounded distortion property (5.6), we have
[TABLE]
with . By Hölder’s inequality, we deduce
[TABLE]
with . Picking ensures that , and with our hypothesis (3.7) and the bounded distortion property (5.6), we obtain
[TABLE]
Therefore both sums in (5.13) are bounded in terms of and only, and we deduce the claimed bound .
- –
Proceeding as above, we find
[TABLE]
For any , the last sum is finite, so that our statement follows for any fixed .
- –
Once again proceeding again as above, we have
[TABLE]
Letting , we insert the inequality . The resulting sum over being absolutely bounded, we deduce as required.∎
5.5. First estimate on
The following is a weak form of [BV05a, Lemma 2] (which is refered to, there, as a Lasota-Yorke type inequality). We recall that the Hölder exponent was defined in (5.1).
Lemma 5.4**.**
For all , there exists such that for , with , and , we have
[TABLE]
Proof.
Let . We write
[TABLE]
Splitting as a sum of differences, we obtain
[TABLE]
where
[TABLE]
This last equality follows from the fact that each is a homography associated with an element of with non-negative entries, , and so the supremum above is . Since by the chain rule, we deduce
[TABLE]
Note that by hypothesis, and we have by the chain rule. Moreover, for any value of , we may find at least one with , e.g. by composing repeatedly . Since , we deduce
[TABLE]
Next, by using the rules (4.2), (4.3), (4.4), the bounded distortion property , and simple computations, we obtain successively
[TABLE]
In the last line, we used the definition (5.1). Grouping these bounds using (4.1), we deduce
[TABLE]
since , (by the definition (5.1)), and by our hypothesis (3.8). ∎
6. Meromorphic continuation
Following [Val00], define the generating series
[TABLE]
where is the reduced denominator of .
Lemma 6.1**.**
For and , we have
[TABLE]
Proof.
This is a straightforward extension of the computations in [BV05a, eq. (2.17)]. ∎
The aim of this section is to show the meromorphic continuation of to a half-plane . This will will then be used in conjunction with Mellin transformation to prodive an estimate with power-saving for . We use different arguments according to the size of , which is refered to as the height in what follows.
6.1. Small height
The following lemma deals with in a neighborhood of [math]. In this case we use the spectral expansion of .
Lemma 6.2**.**
There exists , such that for all with , and , the operator acting on is quasi-compact, and for some , we have
[TABLE]
where is of rank 1, , , and . Moreover, for each such fixed , the operators , , and the eigenvalue depend analytically on .
Proof.
This is a direct application of e.g. Theorem 2.3 of [Klo19]; see also chapter IV.3 of [Kat95], and [BV05a, p.342]. ∎
6.2. Moderate height
The following lemma is concerned with of bounded size and away from [math]. The main tool is again perturbation theory.
Lemma 6.3**.**
For all with , there exists such that for all with , and , we have
[TABLE]
Proof.
By (5.12) and the triangle inequality, the map is continuous. By Lemma 5.2, we deduce that for some number , depending on and , we have the bound . By the perturbation bounds (5.10) and (5.11), we may pick small enough so that , and our claim follows. ∎
6.3. Large height
The following lemma deals with the case of large enough .
Lemma 6.4**.**
For some constants , whenever , with , and , the operator acting on has spectral radius , and
[TABLE]
For unbounded values of , perturbation theory is not effective, instead we adapt the arguments of Dolgopyat [Dol98] and Baladi-Vallée [BV05a], which exploit the cancellation due to the varying argument of . Compared with Baladi-Vallée’s setup, we make two modifications: we work with Hölder-continuous functions, rather than , and the cost function is not assumed to be constant on each interval of the partition.
6.3.1. Sums over branches
We will require two estimates involving sums over inverse branches on . Define, as in [BV05a, eq. (3.10)],
[TABLE]
Note that by the bounded distortion property, there exists such that
[TABLE]
for all . The following property is the statement that condition UNI.(a) of Baladi-Vallée [BV05a] holds for the Gauss map.
Lemma 6.5**.**
For some absolute constant , we have uniformly in , and that
[TABLE]
Proof.
See Lemmas 1 and 6 of [BV05a]; the main point is the construction of a dual dynamical system (Section 3.4 of [BV05a]) which encodes naturally the quantity , and satisfies the dominant spectral bound (5.9). The dual map of the Gauss map is in fact the Gauss map. ∎
Lemma 6.6**.**
Under the assumption (3.8), uniformly for all , we have
[TABLE]
We recall that .
Proof.
We decompose , where , and . We have
[TABLE]
so that
[TABLE]
The sums over and are uniformly bounded by (5.8). The sum over is finite by our hypothesis (3.8) and the triangle inequality (cf. 5.14). ∎
6.3.2. Bound on the norm
We recall that is an upper-bound for the norm provided in (5.8).
Lemma 6.7**.**
For some and , whenever , , and , we have
[TABLE]
Proof.
Changing to , to and taking conjugates if necessary, we may assume that . Define, for all , , so that
[TABLE]
We note that for all , by (4.2) and (4.3), we have
[TABLE]
For , let
[TABLE]
This defines a function in . Expanding the square, we find
[TABLE]
We have, for all , the trivial bound
[TABLE]
On the other hand, for all satisfying , we have from Lemma 4.1 the bound
[TABLE]
The norms are bounded, using (5.7), (4.1), (4.2) and (6.2), by
[TABLE]
We write , which implies the variant
[TABLE]
Next, for all (where we recall (6.1)), we have uniformly
[TABLE]
by Lemmas 6.5 and 6.6. Let be a parameter. We insert the bounds (6.5) and (6.6) in (6.3), (6.4), and we sum over . When , we use the trivial bound (6.3), otherwise we use (6.4). Using our bond on above, the symmetry , the fact that , and partial summation, we find
[TABLE]
Choosing , we obtain
[TABLE]
as claimed. ∎
6.3.3. Bound on the norm
Next, we transfer the bound relative to the invariant measure into an bound, following ideas of Dolgopyat [Dol98] adapted to this context by Baladi and Vallée [BV05a, section 3.3].
Lemma 6.8**.**
For some and , depending on and at most, whenever
[TABLE]
then letting , we have
[TABLE]
Proof.
By using the Cauchy–Shwarz inequality, as in [BV05a, Lemma 1], for all and , we have
[TABLE]
for some independent of . We use this with replaced by , with being any choice with and . By Lemma 6.7 and the bound (5.8), we deduce that for all small enough , if , then
[TABLE]
By choosing with , and then large enough and small enough in terms of , we may ensure that
[TABLE]
with and as claimed. ∎
6.3.4. Proof of Lemma 6.4
Iterating the bound of Lemma 5.4, and using (5.8), we have for small enough and all the bound
[TABLE]
for some . We replace by and use Lemmas 6.8 and 5.4. We find that for some constants , and , if is small enough and , then
[TABLE]
At the cost of choosing large enough and small enough in terms of the implied constants, we obtain
[TABLE]
By iterating, this bounds also holds for , , from which we deduce by Gelfand’s inequality that , and . Finally, from the bounds
[TABLE]
and , we get the claimed result.
6.4. Deduction of the meromorphic continuation
Proposition 6.9**.**
For some , and all , the function , initially only defined for , has a meromorphic continuation to the set
[TABLE]
with possible poles occuring only for and . The meromorphic continuation of is bounded uniformly in and by
[TABLE]
More precisely, for , the function
[TABLE]
has an analytic continuation to and , and is uniformly bounded there.
Proof.
By Lemma 6.1, we have for
[TABLE]
Then Lemma 6.2 yields the analytic continuation of (6.8) for for some . Then Lemma 6.4 ensures the existence of such that the sum over in (6.8) converges uniformly over compacts in the region and , and yields the bound (6.7). Finally, applying Lemma 6.3 with the values of and gives the same conclusion for . The conjunction of these three cases covers the whole half-plane . ∎
7. Asymptotic behaviour of the leading eigenvalue
In this section, we study the behaviour, for small and close to , of the leading eigenvalue . The estimates in this section will reduce the problem to the estimation as of the integral
[TABLE]
where we recall the notation (3.6). We recall the hypotheses (3.7), (3.8).
7.1. Perturbation theory and existence
Let
[TABLE]
Lemma 7.1**.**
For all small enough , there exists such that whenever and , we have
[TABLE]
and
[TABLE]
Proof.
Let denote an eigenfunction of associated with the eigenvalue . By Lemma 5.3 and [Klo19, Theorem 2.6, estimation of ], we have
[TABLE]
On the other hand, differentiating the eigenvalue equation and integrating with respect to the Lebesgue measure, we get
[TABLE]
Here we recall the notation (5.2). Setting , with and , gives . Using the bound (7.4), we get the approximation (7.2). ∎
Lemma 7.2**.**
For all , there exists and a unique function such that and, for ,
[TABLE]
Proof.
This follows from a general form of the implicit functions theorem, e.g. [Kum80, Theorem 1.1], whose hypotheses are satisfied by virtue of Lemma 7.1. ∎
In what follows we will not discuss the regularity of at each : we are only interested about its asymptotic behaviour around . We will use results on effective perturbation theory of linear operators, which have been worked out recently in [Klo19].
7.2. The sub-CLT case
We first focus on the case where we do not aim at extracting a term of order in the asymptotic expansion.
Lemma 7.3**.**
For , we have
[TABLE]
Proof.
By Theorem 2.6 of [Klo19], we have
[TABLE]
by (5.2). Since and , the first integral is . The second equals . For , we have
[TABLE]
by our hypothesis (3.7) and the triangle inequality (cf. (5.14)). Setting , we deduce by combining (7.5) and (7.6). Then another use of (7.5) yields our claimed estimate. ∎
7.3. The CLT case
Our next goal is to extract the term of order in the analysis above. We assume throughout that . In order to describe the order 2 coefficients, we introduce the following notation. Recalling (5.2), let
[TABLE]
The well-definedness of follows from our hypothesis . Note that is bounded on , because
[TABLE]
which is finite by (3.7).
Lemma 7.4**.**
If , then
[TABLE]
with
[TABLE]
Moreover, if , then
[TABLE]
with
[TABLE]
Remark*.*
With the definition (7.12), it is clear that the matrix is symmetric, positive semi-definite. It is definite if and only if the vectors span the whole space . By our hypothesis (3.7), we have whenever , so that the matrix is well-defined in this case. Similarly, the matrix is well-defined whenever , since in that case .
Proof.
We extend the computations of Lemma 7.3, using our hypothesis on to expand the quantity to order at and order at . Let
[TABLE]
We write . Letting , we obtain
[TABLE]
where . We use again Theorem 2.6 of [Klo19], getting
[TABLE]
By computations similar to (5.10), we have
[TABLE]
Note that the left-hand side can be written . We deduce
[TABLE]
Since and , we have where we recall the definition (7.7). Therefore, we have
[TABLE]
where denotes the transpose of the column vector , and C_{2,{\bm{\phi}}}=\int_{[0,1]}\big{(}{\bm{\psi}}\cdot{\mathbb{K}}[{\bm{\psi}}]^{T}\big{)}\xi\mathop{}\!\mathrm{d}\nu. This proves (7.9) with as claimed.
To prove (7.11), we note that by the hypothesis , the quantity (7.12) is well-defined. In order to expand it, we first note that, with the definition (7.8), that by construction
[TABLE]
since . Therefore
[TABLE]
By the property , we have
[TABLE]
Similarly, we have
[TABLE]
We deduce that, with the definition (7.12), we have
[TABLE]
On the other hand, expanding the squares in (7.10), we find
[TABLE]
The claimed formula (7.11) follows by the Taylor expansion with . ∎
8. Proof of Theorem 3.1
Recall that consists of the rationals in of denominators at most , and let
[TABLE]
Proposition 8.1**.**
For all , there exists such that for , we have
[TABLE]
Proof.
Recall that was defined in (8.1), so that . Let be a parameter, and be a smooth function satisfying
[TABLE]
Then by a trivial bound on the contribution of , we have
[TABLE]
By Perron’s formula, we have
[TABLE]
where the Mellin transform is defined for . We move the contour to the line . If are small enough, then by Proposition 6.9, we encounter exactly one pole, at . By Cauchy’s theorem, we deduce
[TABLE]
For some absolute constant , Proposition 6.9 yields the bound
[TABLE]
On the other hand, we have for by integration by parts and (8.2), so that
[TABLE]
Finally, we have by (6.8)
[TABLE]
where we abbreviated in the right-hand side. Again by [Klo19, Theorem 1.6, estimation of ] and Lemma 5.3, we have , and therefore
[TABLE]
for . The quantity was estimated in (7.2). Finally, we have by (8.2)
[TABLE]
Inserting these estimates in (8.6), we deduce
[TABLE]
Grouping this (8.3)(8.4) and (8.5), we conclude
[TABLE]
Our claim follows by optimizing and dividing by . ∎
Proof of Theorem 3.1.
We use Proposition 8.1 along with Lemmas 7.3–7.4 and the value (7.1). ∎
9. Applications
When using the Berry–Esseen inequality, we will require a separate treatment of very small values of in order to handle the error term in Theorem 3.1 (the argument described in [Gou15, Remark 3.8] is not readily adapted since part of this term originates from counting pairwise coprime numbers).
Lemma 9.1**.**
Suppose that the function satisfies (3.7) and (3.8). Then we have
[TABLE]
Proof.
For all , define . Then with the terminology of [BH08, p.750], the function has strong moments to order . Setting , we deduce by [BH08, Remark 1.2]. We conclude by taking expectations in the bound . ∎
Proof of Corollary 3.2.
In (3.11), we estimate the integral using [BDa, Proposition 2.1], which amounts here to integrating the Taylor expansion of at . From (7.12), and with the notation (3.12), we obtain
[TABLE]
where, with ,
[TABLE]
We recall that is related to by (7.8) (with ). It is clear that . If , then the integrand vanishes identically, and we would conclude that , contradicting our hypothesis. We use the Berry-Esseen theorem [Ten15, theorem II.7.16], [Fel71, equation XVI.(3.13)] with for some parameter to be chosen. Let
[TABLE]
When , we use Lemma 9.1 and a Taylor bound, getting
[TABLE]
When , we use Theorem 3.1, getting
[TABLE]
Finally, when , we get
[TABLE]
and so, for some small enough choice of , we have .
We deduce
[TABLE]
and therefore by the Berry-Esseen inequality
[TABLE]
as claimed. ∎
9.1. Central modular symbols
Let be a non-zero primitive Hecke eigencuspform of weight for with trivial multiplier. Note that is necessarily even and .
Define, for all integer and all , the modular symbol
[TABLE]
Lemma 9.2**.**
For , the function , initially defined over , can be extended to a bounded function in for any .
Proof.
By Deligne’s bound [Del74], we have . Therefore the sum is finite, and we deduce by Fubini’s theorem that for ,
[TABLE]
and the left-hand side is now defined for . By [Iwa97, Theorem 5.3], we have
[TABLE]
uniformly in . Let , , and for , . Then using (9.4) and partial summation, we obtain , and so
[TABLE]
as claimed, since . ∎
Lemma 9.3**.**
For any , some function , and all , we have
[TABLE]
Moreover, we have uniformly for .
We will actually only require the last bound for .
Proof.
For , define
[TABLE]
Using the modularity relation , and changing variables , we obtain
[TABLE]
and so the period polynomial of
[TABLE]
is indeed a polynomial in of degree at most .
Let now . As with , we have
[TABLE]
The second integral is as for any fixed , since is a cusp form. Thus, by the binomial formula, we obtain
[TABLE]
In the same way, since with , so that , we have
[TABLE]
With , reading the coefficients of on each side of the definition of , and since is even, we deduce
[TABLE]
[TABLE]
We single out the term . The function
[TABLE]
defines, by Lemma 9.2, a function in for all and . The value proves our claimed formula. Finally, for , by the rules (4.1), (4.2), (4.4) and -periodicity of , we have
[TABLE]
By the rule (4.2) again, and since for , we deduce that the same bound holds for . ∎
Proof of Theorems 2.3 and 2.4.
Iterating Lemma 9.3, we have for all ,
[TABLE]
Note that changing the coordinates of the set by an amount in (2.2) does not alter the right-hand side, so that we may replace by . For all , identifying with basis , we let
[TABLE]
where we recall that was defined in Theorem 2.3. We apply Theorem 3.1 with and . The hypothesis (3.7) is satisfied with , since the function is continuous on by Lemma 9.3 and therefore bounded. The hypothesis (3.8) is satisfied for any , by using Lemma 9.3 and noting that . Using (3.11) along with the expressions (7.11), (7.12), we obtain for some and real matrix the estimate
[TABLE]
To compute the variance, we appeal to the bound
[TABLE]
This can be proved by shifting to the setting of [BV05a], where the variable is extended to a complex neighborhood of the origin; the functions defined in (3.10) are, in this case, analytic in near the origin, by boundedness of . Then by e.g. [Bil95, th. 25.12] and (9.6), we find
[TABLE]
On the other hand, as , we have the following asymptotic formulae, where now is interpreted as a complex number:
[TABLE]
[TABLE]
[TABLE]
These statements can be proven (in a stronger form) by standard methods, using orthogonality of additive characters, the approximate functional equation [IK04, Theorem 5.3] and Rankin-Selberg theory [Iwa97, Chapter 13.6]. The value appears as , which is evaluated in [Iwa97, eq. (13.52)]. Note that the analogues of (9.8) and (9.9) with a single average over numerator have recently been computed in [BFK*+*]; in their result as stated, however, the denominator is assumed to be prime.
The equality (9.7) shows that . The equality (9.8) shows that the matrix is a multiple of the identity, and the equality (9.9) then shows that . Using the Berry–Esseen inequality, along with Lemma 9.1, concludes the proof of Theorem 2.3.
To justify Theorem 2.4, we use again the analyticity of defined in (3.10) in a neighborhood of the origin. Hypothesis of [Hwa96] is therefore satisfied in our case with replaced by , and (2.3) follows by [Hwa96, Theorem 1], taking . ∎
9.2. Central value of the Estermann function
Proof of Theorem 2.1.
The Estermann function corresponds to in notation of [Bet16]. For , it is the analytic continuation of
[TABLE]
initially defined for , evaluated at . We recall that is the number of divisors of . We use [Bet16, Lemma 10], noting that the quantity corresponds to . Therefore
[TABLE]
where
[TABLE]
and , which corresponds to in the notation of [Bet16, p. 6900], is bounded and continuous. By comparing the cases and of [Bet16, eq. (3.17)], we have
[TABLE]
where . For , the function belongs to for all . We deduce that the right-hand side of (9.10) defines, for all , a function in . By an argument identical to Lemma 9.3, we also have the bound for all , and similarly for . This validates the hypothesis (3.8) with any .
On the other hand, setting , we have
[TABLE]
and so the hypothesis (3.7) holds. We apply Theorem 3.1 with , , and the above given value . In the estimate (3.11) we evaluate the integrals by appealing to [BDa, Corollary 3.1]. We deduce that there is a constant such that, letting and
[TABLE]
we have
[TABLE]
We then obtain, by the Berry-Essen inequalities and the bound (9.1) for small frequencies, the statement of Theorem 2.1 up to the value of the expectation. We compute the expectation from the initial object, using the expression [IK04, eq. (3.2)] for Ramanujan sums. For with we have
[TABLE]
and so by analytic continuation
[TABLE]
We deduce that
[TABLE]
On the other hand, by (9.11), we have
[TABLE]
whereas exanding the exponential as in the definition of , by (9.12), we get
[TABLE]
Using the trivial bound and Hölder’s inequality, we find
[TABLE]
by the bound and [YK75, Theorem]. Setting , we obtain
[TABLE]
and so , by letting . ∎
9.3. Large moments of continued fractions expansions
For all and , we recall that was defined in (2.4). For , define
[TABLE]
and by continuity . Let
[TABLE]
be the probability distribution function of a stable law (see [ST94]), and
[TABLE]
Theorem 9.4**.**
Let and , and for define .
- (1)
If , then with for some , we have
[TABLE] 2. (2)
If , then with , we have
[TABLE] 3. (3)
If , then
[TABLE] 4. (4)
If , then letting denote the Euler constant,
[TABLE] 5. (5)
If , then
[TABLE]
In all four cases the implied constant depends at most on and .
Except for (9.15), we expect the error terms to be optimal up to an exponent . The estimate (9.17) is in accordance with results on the statistical distribution of [Hen91, CV11].
To prove Theorem 9.4, we note that with . The function satisfies the hypotheses of Theorem 3.1 with , , , and all small enough exponents .
9.3.1. Case
We wish to apply Corollary 3.2. To proceed, we need to show that is not of the shape . Suppose that it were, let and solve . Evaluating at yields as , a contradiction. Corollary 3.2 may be applied and yields (9.13).
9.3.2. Case
The estimates (3.9)–(3.11) hold, and the integral is evaluated in [BDa, Corollary 3.2]. With the notation and
[TABLE]
we find that for ,
[TABLE]
On the other hand, by (9.1), we have . Inserting these two bounds in the Berry-Esseen theorem [Ten15, theorem II.7.16] yields the claimed conclusion (9.14).
9.3.3. Case
We use the estimates (3.9)–(3.10). Define
[TABLE]
Then for , we obtain by [BDa, Corollary 3.3]
[TABLE]
and we may again conclude by the Berry-Esseen inequality.
9.3.4. Case
Assume first . Then we use the estimates (3.9)–(3.10). Define
[TABLE]
Then, by [BDa, Corollary 3.2], for we obtain
[TABLE]
and we may again conclude by the Berry-Esseen inequality and Lemma 9.1.
The case follows by identical computations, the shift by being accounted for by the linear term in the asymptotic evaluation of (3.11), as performed in [BDa, Corollary 3.2]
9.4. Dedekind sums
Proof of Theorem 2.7.
By [Hic77, Theorem 1], we have for the equality
[TABLE]
where and . Note that depends only on the parity of . Since changing by an amount does not affect the right-hand side of (2.7), we may replace by . Let
[TABLE]
Then Theorem 3.1 applies with , and the functions defined above, with . We use the expression (3.10) and refer to [BDa, Corollary 3.4] for the evaluation of the integral, obtaining
[TABLE]
and we conclude again by the Berry-Esseen inequality. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AD 01] J. Aaronson and M. Denker, Local limit theorems for partial sums of stationary sequences generated by Gibbs-Markov maps , Stochastics Dyn. 1 (2001), no. 2, 193–237.
- 2[AH 06] J. E. Andersen and S. K. Hansen, Asymptotics of the quantum invariants for surgeries on the figure 8 knot , J. Knot Theory Ramifications 15 (2006), no. 4, 479–548.
- 3[BH 08] V. Baladi and A. Hachemi, A local limit theorem with speed of convergence for Euclidean algorithms and diophantine costs , Ann. Inst. H. Poincaré Probab. Statist. 44 (2008), no. 4, 749–770.
- 4[BV 05a] V. Baladi and B. Vallée, Euclidean algorithms are Gaussian , J. Number Theory 110 (2005), no. 2, 331–386.
- 5[BV 05b] V. Baladi and B. Vallée, Exponential decay of correlations for surface semi-flows without finite Markov partitions , Proc. Amer. Math. Soc. 133 (2005), no. 3, 865–874.
- 6[BM 15] M. Balazard and B. Martin, Sur une équation fonctionnelle approchée due à J. R. Wilton , Mosc. Math. J. 15 (2015), no. 4, 629–652.
- 7[Bet 15] S. Bettin, On the distribution of a cotangent sum , Int. Math. Res. Not. IMRN (2015), no. 21, 11419–11432.
- 8[Bet 16] by same author, On the reciprocity law for the twisted second moment of Dirichlet L-functions , Trans. Amer. Math. Soc. 368 (2016), no. 10, 6887–6914.
