Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis
John Lewis, Don Zagier

TL;DR
This paper links the asymptotic behavior of cotangent sum matrices to the generalized Riemann hypothesis (GRH) for odd Dirichlet characters and explores their connection to quantum modular forms associated with Maass Eisenstein series.
Contribution
It establishes an equivalence between cotangent sum matrix asymptotics and the GRH, and connects this to quantum modular forms related to Maass Eisenstein series.
Findings
Asymptotic properties of cotangent sum matrices are equivalent to GRH for odd Dirichlet characters.
Connection between cotangent sums and quantum modular forms is demonstrated.
Provides a new perspective on GRH through quantum modular forms.
Abstract
We show that an asymptotic property of the determinants of certain matrices whose entries are finite sums of cotangents with rational arguments is equivalent to the GRH for odd Dirichlet characters. This is then connected to the existence of certain quantum modular forms related to Maass Eisenstein series.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algebra and Geometry · Advanced Mathematical Identities · Analytic Number Theory Research
Cotangent sums, quantum modular forms,
and the generalized Riemann hypothesis
John Lewis
and
Don Zagier
1. Introduction and main results
To answer immediately a question that the reader may be asking, we should say from the outset that the generalized Riemann hypothesis (henceforth, just GRH) of the title refers only to the -series associated to odd real Dirichlet characters and that we do not aim to prove it, but merely to show its equivalence to an asymptotic statement about the determinants of certain matrices whose entries are given by finite sums of cotangents of rational multiples of . To answer a second natural question, since it may seem unusual for an author to contribute to the proceedings volume of a conference partly intended to celebrate his own birthday, we should also say a few words about the genesis of this joint paper. Both the definition of the cotangent sums considered and their relation to GRH were found by the first author and presented in his talk at the conference in question. In the course of that talk he mentioned the homogeneity property , leading the second author to ask whether the function defined by might be related to a quantum modular form in the sense of [16]. The answer to this question turned out to be positive. Furthermore, and unexpectedly, this quantum modular aspect turned out also to be related to the earlier joint work of the authors [13] on the analogues of classical period polynomials for Maass wave forms. A glance at the title of the final section of this paper will show why it seemed natural, and even irresistible, to publish both parts of the story in these proceedings.
We now describe the main results of the paper in more detail. We denote by
[TABLE]
the primitive Dirichlet character of conductor 4 and its associated -series. (In Section 5 we will describe the generalization to other odd primitive real Dirichlet characters.) For positive integers and we define a real number by
[TABLE]
where the conditionally convergent sum is to be interpreted as the limit for of the sum over the rectangle , . These numbers are studied in more detail in §2, where we will see that they are algebraic numbers (in fact, algebraic integers) that can be expressed as finite sums of cotangents of odd multiples of or , a typical example being
[TABLE]
For each we define the two symmetric matrices
[TABLE]
which we will show later are positive definite, and define a function by
[TABLE]
The first main result of this paper is then the following.
Theorem 1**.**
The following two statements are equivalent.
- (a)
The function is unbounded.
- (b)
The -series has no zeros in the half-plane (GRH).
The proof of this theorem will be given in §3. The direction (a)(b) is fairly elementary, while the reverse direction uses functional analysis and techniques coming from old work of Beurling. The result that we actually prove (Theorem 3 in §3) is in fact stronger than Theorem 1 in both directions: if GRH holds, then is not merely unbounded, but actually tends to infinity as , and if has a zero with , then the function is bounded by a number depending explicitly on . Of course we believe that the two statements in Theorem 1 are both true rather than both false, the evidence being on the one hand the generally held belief in the validity of (b) and on the other hand the following plot of the function for , which suggests that tends to infinity and perhaps even grows roughly linearly with . (The straight-line fit shown in the picture is the graph of the function .)
Figure 1. Graph of the function
The second main result concerns the quantum modular nature of certain functions related to the coefficients . On the one hand, the numbers defined by (2) have the obvious homogeneity property and can be rewritten in the form
[TABLE]
On the other hand, by splitting the sums in (2) or (5) into two pieces depending on which of its two arguments realizes the “max” or “min”, we obtain the decompositions
[TABLE]
where the function and numbers are defined by
[TABLE]
(Here and later an asterisk on a summation sign means that terms with equality—in this case, those with —are to be counted with multiplicity 1/2.) We will show in Section 2 that (or its sum with 1/2 if is odd) is a sum of cotangents of odd multiples of and hence is an algebraic integer, refining the corresponding properties for given above.
We can now come to the quantum modular form property mentioned in the opening paragraph. The graph of the function on looks as follows:
Figure 2. Graph of the function
This graph suggests that the function is only naturally defined on and is not well-behaved as a function on , having a possibly dense set of discontinuities and certainly not differentiable at generic points. The graph of the function shown in Figure 3, on the other hand, suggest that it is everywhere continuous and possibly even differentiable except at rational points with odd numerator and denominator, where it seems to be left and right differentiable but with a discontinuous derivative. Our second main result, proved in Section 4, refutes the last of these statements and confirms the others.
Figure 3. Graph of the function
Theorem 2**.**
The function is discontinuous at all rational points with odd numerator and denominator, and is continuous but not left or right differentiable at other rational points. The function is continuous everywhere, is not left or right differentiable at rational points with odd numerator and denominator, but is differentiable at all other rational points.
The similarity between the continuity and differentiability statements for the two functions and in this theorem is not coincidental, since we will see in Section 4 that the derivative of is equal to .
We now discuss the “quantum modular form” aspect of Theorem 2. Note first that the function is periodic of period 4 and hence extends to a periodic function on all of . This function is even, so we could also extend from to by . One might then think of extending to negative values of the arguments by setting for all , but if we did that then for would exhibit oscillatory behavior of the same sort as . Instead, we extend to all of as an even function, i.e., we set for and extend this by continuity to . Then the relationship between and becomes
[TABLE]
On the other hand, the 4-periodicity property of can be strengthened to the statement
[TABLE]
saying that up to sign and up to a constant actually has period 2. Also, since is even, we can replace the term in (8) by . But the two matrices S=\bigl{(}\begin{smallmatrix}0&-1\\ 1&0\end{smallmatrix}\bigr{)} and T^{2}=\bigl{(}\begin{smallmatrix}1&2\\ 0&1\end{smallmatrix}\bigr{)} generate the subgroup of index 3 in the full modular group consisting of matrices congruent to \bigl{(}\begin{smallmatrix}1&0\\ 0&1\end{smallmatrix}\bigr{)} or \bigl{(}\begin{smallmatrix}0&1\\ 1&0\end{smallmatrix}\bigr{)} modulo 2 (this is the so-called “theta group,” under which the Jacobi theta function transforms like a modular form of weight 1/2), so equations (8) and (9) can be combined to the following statement:
Corollary**.**
The function satisfies the transformation property
[TABLE]
for every matrix \gamma=\bigl{(}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{)}\in\Gamma_{\vartheta}, where is the homomorphism sending both generators \bigl{(}\begin{smallmatrix}1&2\\ 0&1\end{smallmatrix}\bigr{)} and \bigl{(}\begin{smallmatrix}0&-1\\ 1&0\end{smallmatrix}\bigr{)} of to .
This corollary tells us precisely that the function is a quantum modular form in the sense of [16] (except that there is an absolute value sign here that was not present there). Recall that a “quantum modular form” is by definition a function on that does not extend nicely to but which modulo (the restriction from to of) better-behaved (here, continuous) functions on transforms like a modular form with respect to some subgroup of finite index of (here the theta group ) with some weight (here ) and character (here ). In Section 6 of this paper we will show that the quantum modularity property of is related to an actual modular form (specifically, a Maass Eisenstein series of eigenvalue ) on the group .
The authors would like to thank Steven Johnson (MIT) and Alexander Weisse (MPI) for invaluable help with the numerical aspects of this paper.
2. Properties of the numbers and
In this section we give an integral representation for the numbers that will be used later both to extend the function continuously from to and as the key to the proof in Section 3 of the relation between the numbers and the zeros of the Dirichlet -series . We will also prove the formulas for and as algebraic numbers (finite sums of cotangents).
The first step is to rewrite the definition of given in (7) more compactly in the form
[TABLE]
where is the function defined on by
[TABLE]
The function , like itself, is periodic of period 4, and we can use this property to extend it to all of . (Equivalently, we can extend it by and , or by observing that the summation condition in (11) can be replaced by for any integer divisible by 4, which works for all real .) The function is then the step function given by
[TABLE]
as pictured in Figure 4. Note that the sum in (10) converges (conditionally) for by summation by parts, because the function is then an odd periodic function of and hence has average value zero, so that the sums are bounded.
[math]1$$2$$3$$4$$5$$\tfrac{1}{2}$$1
Figure 4. Graph of the function
We use the bounded periodic function to prove the following formula for the :
Proposition 1**.**
The numbers defined by (2) have the integral representation
[TABLE]
Proof. From the definition of we find
[TABLE]
and since the second sum is identically zero for divisible by 4, equation (13) follows by letting tend to infinity in .
Our next result expresses as a finite sum of algebraic cotangent values indexed by lattice points in the rectangle satisfying a certain “selection rule.”
Proposition 2**.**
Let and be positive integers. Then the number defined by (2) is given as the following integral linear combination of cotangents of rational multiples of :
[TABLE]
Corollary**.**
The numbers are algebraic integers for all .
Using this proposition we can calculate any in closed form as an algebraic number. For example, the first matrix of values (cf. equation (3)) is given by
[TABLE]
Proof. Since is even and periodic of period 4, we can use Euler’s formula to rewrite the integral representation (13) as
[TABLE]
But from the definition of and as characteristic functions, it follows that the restriction of to is the characteristic function of the union of all intervals with as given in the proposition. This proves the proposition because . The corollary then follows because by elementary algebraic number theory the number is an algebraic integer for every rational number having an even denominator.
We next give the corresponding formula for the numbers , which is in fact simpler.
Proposition 3**.**
Let and be positive integers. Then the number defined by (7) is given as the following half-integral linear combination of cotangents of odd multiples of :
[TABLE]
Proof. Since the summand in (10) is an even function of , we can rewrite the sum as a sum over all non-zero integral values of , and then use the periodicity (with period at most ) of the factor together with Euler’s formula (with the sum being interpreted as a Cauchy principal value) to get
[TABLE]
as claimed.
Remark. One can also deduce the formula for in Proposition 2 from (14); this is left as an exercise to the reader. We also observe that the terms on the right-hand side of (14) (assuming, as we can, that and are coprime) are integral multiples of cotangents of odd multiples of , and hence are algebraic integers (in some cyclotomic field), except for the term if and are odd, which is half-integral because then and . Hence is an algebraic integer unless both and are odd, in which case it equals an algebraic integer plus , and the sum is always an algebraic integer.
3. The connection with the generalized Riemann hypothesis
In the first subsection of this section we outline the basic strategy for proving Theorem 1. The easier direction (a)(b) of this theorem is then proved in Subsection 3.2 and the more difficult converse direction in Subsection 3.3, using ideas from two classical papers by A. Beurling. As already stated in the introduction, we will actually prove the following stronger result:
Theorem 3**.**
Let be the -series defined in (1) and be the function defined in (4).
- (a)
If has no zeros with , then tends to infinity as tends to infinity.
- (b)
If for some with , then satisfies the bound
[TABLE]
3.1. The square-wave function and its multiplicative shifts
Set and let be the characteristic function of the interval and the function
[TABLE]
(and if is an odd integer, but we are working in ). Clearly both and belong to , since they are bounded with support in the interval . On the other hand, if we identify with the space of square-integrable functions on that vanish on , then we have bounded operators on defined by
[TABLE]
We set and for any and define subspaces and of by
[TABLE]
where denotes the linear span of a subset of a linear space. Finally, we denote by the distance from to the subspace , viz.
[TABLE]
The key connection is then given by the following proposition.
Proposition 4**.**
The distance is related to the number defined in (4) by
[TABLE]
The proof of this, which is a straightforward computation, will be given in §3.2 below. On the other hand, a quite elementary argument, also given in §3.2, proves the following
Proposition 5**.**
If with then for all .
Together these two results prove statement (b) of Theorem 3. The harder statement (a) follows from the two following results, proved in §3.2 and §3.3 respectively.
Proposition 6**.**
The space is dense in for any set that is dense in .
Proposition 7**.**
If has no zeros in , then is dense in .
The whole argument is summarized by the following diagram, where the flow of the double arrows shows that the six boxed statements are mutually equivalent.
\boxed{\sf{GRH}}$$\boxed{\xi\in\overline{\mathcal{V}_{\mathbb{Q}}}}$$\boxed{R(N)\text{ is unbounded}}$$\boxed{\overline{\mathcal{V}_{\mathbb{R}}}=L^{2}}$$\boxed{\overline{\mathcal{V}_{\mathcal{D}}}=L^{2}\text{ (\mathcal{D} dense)}}$$\boxed{\lim\nolimits_{N\to\infty}R(N)=\infty}Prop. 5Prop. 4Prop. 7Prop. 4Prop. 6trivialtrivial
(The implications labelled “Prop. 4” follow in one direction because each is contained in and in the other because if were bounded for some infinite sequence of integers then the set would be dense in but have a non-zero distance from .)
3.2. Proof of Propositions 4, 5 and 6
Recall from linear algebra that the distance from a vector in a (real) Hilbert space to a finite-dimensional vector space with basis is given by the formula , where is the Gram matrix with -entry (scalar product of the basis elements and ) and the augmented matrix obtained from by adding a row and column with entries and a diagonal entry . We apply this to the vector and the space defined as above as the span of the vectors (). (Here we mean the real rather than the complex span, i.e., we work in the real Hilbert space of real-valued square-integrable functions on .) We also denote by the span of and . We will show in a moment that the vectors and are linearly independent, so . We can then compute the entries of the Gram matrices as
[TABLE]
(the first is trivial and the second a special case of equation(21) below) and
[TABLE]
(combine equations (25) and (5)). Proposition 4 follows.
To find the connection with the -series and its complex zeros, we next consider the “complex monomial functions” on defined by
[TABLE]
Note the key fact that if and only if , and also that for and , we have = , where denotes the Mellin transform
[TABLE]
(Here we are again identifying with the space of functions supported on .) Then obviously
[TABLE]
and the Mellin transform of is almost equally easy to calculate using (12):
[TABLE]
the calculation being valid for all in since the integral then converges absolutely. One consequence of this is that the functions () are all linearly independent, since if some (finite) linear combination of them vanished then the vanishing of its Mellin transform would imply that the product of with a finite Dirichlet series vanished identically (first for , and then for all by analytic continuation). In particular, the space has dimension exactly . In fact, almost the same argument also shows that the functions and are linearly independent, since a linear relation among them would imply that the product of with a finite Dirichlet series was constant, which is obviously impossible. It follows that the extended space spanned by the functions () and has dimension , as claimed above.
The proof of the direction (a)(b) of Theorem 1 now follows easily. If there exists with and then by (21) we have for all . Since is finite-dimensional, the infimum in equation (18) is attained, so that is unbounded (and hence has lim inf equal to 0) if and only if there is a sequence of functions converging in to ; and this would contradict for all since .
This argument actually proves the stronger statement given in Proposition 5 and in part (b) of Theorem 3. Indeed, if is a zero of with , then on the one hand belongs to as already noted, with norm
[TABLE]
and on the other hand for any by virtue of (21). We apply this to the function attaining the minimum distance to . Then applying the Cauchy-Schwarz inequality to the scalar product we get (using (20))
[TABLE]
and (15) now follows immediately from (19).
Finally, we prove the Proposition 6 (“density lemma”). We first note that the norm tends to 0 as tends to , since by the calculation given above (for rational numbers, but true in general by the same argument) we have
[TABLE]
which tends to 0 as because is a continuous function. It follows immediately that if is dense in then the vector for any lies in the closure of , and since is spanned by such vectors, this shows that as claimed.
3.3. Proof of Proposition 7
It remains to show that implies that .
Let denote the Hardy space of the right half-plane , that is, the space of all holomorphic functions on that are square-integrable on the lines for all and with these norms bounded in , and then equipped with the Hilbert space norm . We quote some results from the standard analysis of these spaces, citing as we go the relevant sections in the books by Hoffman [11] and Garnett [8] for the details.
The Paley-Wiener theorem ([11] pp. 131–132) gives the fact that the Mellin transform is (up to scalar multiple) an isometry between the spaces and . Applying this to the function implies that its Mellin transform lies in the space , a fact which is also easy to see directly from the definition of . Next, we note that if we assume that holds, so that has no zeros in , then the factorization theorem for functions in the space ([11] pp. 132–133) applied to the function gives the implication
[TABLE]
Indeed, the factorization theorem represents any element of as the product of a “Blaschke product,” a “singular function,” and an “outer function.” Here we can omit the definitions of all three of these since the Blaschke product is defined as a product over the zeros of in and hence is equal to 1 if has no such zeros, and the singular function factor is also constant because both continues across the line and has slow rate of convergence to zero as on the positive reals (cf. 3.14 in [1] and Chapter II, Theorem 6.3, of [8]). This leaves only the outer function factor of , and here we can apply a corollary of a theorem of Beurling on shift-invariant subspaces in ([4], [11] pp. 99–101), which tells us that a function is outer if and only if the space
[TABLE]
is dense in . (Notice that is contained in because the definition of the norm implies that for , so .)
It follows that GRH implies that is dense in . We can now apply the inverse Mellin transform from to to obtain the isometrically equivalent picture in . An easy calculation shows that is the Mellin transform of for any , where
[TABLE]
and hence that the space is the Mellin transform of the space
[TABLE]
It is not actually obvious that preserves the space , but this follows from the Paley-Wiener theorem together with the fact just shown that multiplication by preserves . A more direct argument is that the operator is the adjoint of the averaging operator
[TABLE]
as is easily checked, and since is a bounded operator on (cf. [10], Example 5.4, p. 23), its adjoint is also bounded on . But also preserves the condition of having support in (0,1), so it is a bounded operator on .
As a consequence the implication (22) translates on the side to the statement
[TABLE]
Proposition 7 is then obtained by applying to the following lemma, in which for any denotes the span of the multiplicative translates ().
Lemma. For we have .
Proof. We first note that it suffices just to show that for all , because then for all by induction and the remark that . Since for all subspaces , we need to show that
[TABLE]
But since is the adjoint of and is the adjoint of , this statement just says that any function that is orthogonal to all with is also orthogonal to , and this is obvious because .
4. Analytic properties of and
In this section we study the analytic properties of and , discussing in particular the question of their extendability from to (which we answer completely only in the case of ) and describing the asymptotic properties in neighborhoods of rational points needed to establish Theorem 2 and its corollary (quantum modularity). We also discuss the relation of these two functions to the Dirichlet series . It should be mentioned in passing that much of the material in this section is similar in spirit to the papers [2] and [3] of Bettin and Conrey, with replaced by the trivial character and by the Riemann zeta function.
4.1. Sum and integral representations of and
Proposition 8 below gives a number of formulas for these two functions. However, the word “function” has a slighly different meaning in the two cases. We have already defined them both as functions on . Part (i) of the proposition says that extends continuously to , and parts (iv) and (v) give further properties of that function. But in the case of we do not know to what extent it can be defined as a function on (this question is discussed in Subsection 4.2, though not in great detail since this is not our main interest). However, it can certainly be defined as a distribution on , in fact even as the derivative of a continuous function, since by integrating (10) we get the absolutely convergent sum representation
[TABLE]
for its integral, where is the periodic continuous function on defined by for and . Parts (ii) and (iii) of the proposition are then to be interepreted in the sense of distributions, one giving the Fourier expansion of the even periodic distribution and the other its relation, already mentioned in the introduction, with the derivative of .
Proposition 8**.**
i The function extends continuously from to via the formula
[TABLE]
ii The distribution has the Fourier expansion
[TABLE]
where denotes the number of positive divisors of .
iii The function and the distribution are related by the equation
[TABLE]
iv The function is also given by the sum
[TABLE]
where denotes the modified cosine integral
[TABLE]
v The Mellin transform of is given by
[TABLE]
where is the analytic continuation of the -series defined in (1).
Proof. Formula (25) for is precisely the statement of Proposition 1 after an obvious change of variables, and the convergence and continuity of the integral for all are easy consequences of the facts that is supported on and is locally constant and bounded. To prove (ii), we insert the (standard) Fourier expansion
[TABLE]
of the even periodic function into the sum (10), combine the double sum into a single one, and use Leibniz’s formula . For (iii), we insert (11) into (25) to get
[TABLE]
from which equation (27) follows immediately by differentiating and using (10). (The easy justification of these steps in the sense of distributions by integrating against smooth test functions is left to the reader.) Finally, to prove (iv) we combine (27) and (6) to get
[TABLE]
and then insert the Fourier development (26) and integrate term-by-term. This gives (28) up to an additive term that can be eliminated by noting that is bounded. Note that the sum in (28) is uniformly and absolutely convergent since a simple integration-by-parts argument shows that as . Finally, for (v) we note first that the Mellin transform of is given according to equation (21) by for , and then use (25) to get
[TABLE]
for , as claimed. (The Mellin transform exists in this strip because as an easy consequence of (25).) In particular, extends meromorphically to all of and is invariant under . We also note that by the well-known functional equation of , equation (30) can be written in the alternative form
[TABLE]
We can then use this to give a second derivation of (28), not making use of the distribution , by first applying the Mellin inversion formula to write as an integral over a vertical line with and then shifting the path of integration to the right, picking up the term in (28) from the residue at and permitting us to use the convergent Dirichlet series representation for . (We omit the details of this calculation.)
4.2. The function on the real line
In this subsection we give two one-parameter generalizations of and discuss possible ways to define this function at irrational arguments.
We first generalize the definitions of and and the assertions of Proposition 8 to families depending on a complex parameter . Specifically, we can generalize (10) to
[TABLE]
which now converges absolutely. The same calculation as for then gives the Fourier expansion
[TABLE]
where is the Dirichlet -function defined in (1) and is the sum-of-divisors function . Similarly, we define for all with by
[TABLE]
where the equality of the two expressions is proved by the same calculation as for the proof of (25). Then and are related by
[TABLE]
by the same calculations as in the special case and one has an expansion like (28) but with replaced by . As long as all of the sums and integrals involved are absolutely convergent and all steps are justified.
A different possible way to regularize for is to define
[TABLE]
Using the identity \sum\limits_{n>0}\dfrac{d(n)}{n}\,x^{n}=\sum\limits_{k>0}\dfrac{1}{k}\,\log\Bigl{(}\dfrac{1}{1-x^{k}}\Bigr{)} and standard trigonometric identities, we find after a short calculation that
[TABLE]
Both this series and the original one defining converge exponentially fast for any positive , giving us another possible approach to the analytic properties of the limiting function .
Summarizing this discussion, we have at least five potential definitions of for :
Definition 1: Define by the series (10), if this sum converges.
Definition 2: Define by the Fourier series (26), if this series converges.
Definition 3: Define as the limit for of the series (33), if this limit exists.
Definition 4: Define as the limit in (36), if this limit exists.
Definition 5: Define as the limit of as tends to , if this limit exists.
We can then ask—but have not been able to answer—the question whether any or all of these definitions converge for irrational values of or, if they do, whether they give the same value. The last definition is in the sense the strongest one, since any definition of on or a subset of that agrees with the original definition on must coincide with the value in Definition 5 at any argument at which this function is continuous. In any case, we pose the explicit question:
Question: Are the five definitions given above convergent and equal to one another for all irrational values of , or for all belonging to some explicit set of measure 1 ?
It seems reasonable to expect the answer to the second question to be affirmative with the set of measure 1 being the complement of some set of irrational numbers having extremely good rational approximations, like the well-known “Brjuno numbers” in dynamics.
4.3. Asymptotic behavior of and near rational points
The results of Section 4.1 prove the continuity of , which is part of the statement of Theorem 2 in the Introduction. We now discuss the remaining statements there, concerning the behavior of and near rational points.
We begin by looking numerically at the asymptotic properties of near . Computing the values of C\bigl{(}1\pm\frac{1}{n}\bigr{)} for and using a numerical interpolation technique that is explained elsewhere (see e.g. [9]), we find empirically an expansion of the form
[TABLE]
with the first few coefficients having numerical values given by
[TABLE]
These numerics become clearer if we work with instead, where we find the simpler expansion
[TABLE]
with no odd powers of and with coefficients given numerically by
[TABLE]
which then gives the expansion (37) with , , and more generally for . Moreover, on calculating the next few values of numerically to high precision we are able to guess the closed formulas
[TABLE]
for the coefficients in (38), where is Euler’s constant and the th Bernoulli number. In fact, it is not difficult to prove (38) (and hence also (37)) with these values of by applying a twisted Euler-Maclaurin formula (giving the asymptotics as of sums over an interval of for a smooth function ) to the closed formula
[TABLE]
which follows easily from (14), though we will not carry this out here.
The “” terms in equations (37) and (38) already show that is not differentiable at and is not continuous at , but from these formulas one might imagine that the functions and are to both the right and the left of this point. However, this is not the case, because both equations (37) and (38) are valid only when is an integer and change in other cases. For instance, if tends to infinity in rather than , then has an expansion of the same form as (38) and with the same constants , but with replaced by , replaced by , etc.) This statement, which again can be proved using an appropriate twisted version of the Euler-Maclaurin formula, is a typical phenomenon of quantum modular forms.
If we look at the asymptotics of and as tends to any rational number with odd numerator and denominator, then we find a similar behavior, which is not surprising since any such is -equivalent to 1 and both and have transformation properties, modulo functions with better smoothness properties, under the action of . More precisely, if we write with and odd and coprime and complete \bigl{(}\begin{smallmatrix}a\\ c\end{smallmatrix}\bigr{)} to a matrix \bigl{(}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{)}\in\text{SL}(2,\mathbb{Z}), then we find asymptotic expansions
[TABLE]
as with , and similar expansions with other coefficients and if with for some fixed rational number .
Similar statements hold for tending to rational numbers with an even numerator or denominator, but now without the log term, the simplest case being
[TABLE]
where is the number of “up-down” permutations of (permutations where has sign for all ), which is also equal to the coefficient of in . (Note that for even, so that the coefficients here are closely related to those in (39).) The difference between the two cases is due to the fact that the action of on has two orbits (cusps), and comes from the relation that we will see in the final section between the quantum modularity properties of and and a specific Maass modular form on that is exponentially small at the cusp but has logarithmic growth at the cusp 1 .
5. Generalization to other odd Dirichlet characters
In this section we discuss the changes that are needed when the character defined in (1) is replaced by an arbitrary odd primitive Dirichlet character. Suppose that is the discriminant of an imaginary quadratic field (or equivalently, that is either square-free and congruent to 1 mod 4 or else equal to 4 times a square-free number not congruent to 1 mod 4), and let \chi=\chi_{D}=\bigl{(}\frac{D}{\cdot}\bigr{)} be the associated Dirichlet character, the case studied up to now corresponding to and . The Dirichlet -series is defined as for and by analytic continuation otherwise, and is equal to the quotient of the Dedekind zeta function of by the Riemann zeta function. Its value at is well-known to be non-zero and given by
[TABLE]
where is or if or and otherwise is the class number of .
We now define by the same formula (11) as before, with . It is still periodic (with period ) and even and hence bounded, but now has average value rather than 1/2 and also no longer takes on only the values 0 and 1, as one sees in the two following pictures of the graphs of this function for and .
[math]1$$2$$3$$4$$5$$1
[math]1$$2$$3$$4$$5$$6$$7$$8$$1$$2
Figure 5. Graphs of the functions for and
We define and for by the same formulas (10) and (8) as before, but we define and by (7) and (5) with a factor instead of in order to get integral linear combinations of cotangents. The Fourier expansions (31) and (26) are then replaced by
[TABLE]
and the formula corresponding to (14) now reads
[TABLE]
by the same calculations as before. Finally we also define the matrices and exactly as we did in the special case (except for replacing the lower right-hand entry in in (3) by ), and then from the scalar product calculations
[TABLE]
which are proved exactly as before, we deduce the same connection as in Theorems 1 and 3 between the unboundedness of the function and the Riemann hypothesis for .
The only real difference with the case is in the argument for quantum modularity. The function on is still even and periodic of period (and also anti-periodic up to a constant with period if is even), it again has discontinuities at infinitely many rational points by an argument similar to the one given in 4.3, and by the analog of Proposition 8 the function again has a continuous extension to and is therefore much better behaved analytically than , as illustrated by the following graphs of these functions for , which look qualitatively much like their counterparts in Figures 2 and 3.
Figure 6. Graph of the function for
Figure 7. Graph of the function for
The difference is that this longer suffices to prove the quantum modularity of because the subgroup of generated by the matrices (or if is even) and now no longer has finite index. Instead, we need a statement like the corollary to Theorem 2 with replaced by the congruence subgroup of (or by if is even, which is indeed if ). This statement is given in the following theorem.
Theorem 4**.**
Let and be as above. Then the function
[TABLE]
extends continuously to for all matrices \gamma=\bigl{(}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{)}\in\Gamma_{(D)}, where is the homomorphism mapping to and to .
This theorem is illustrated for and a typical element of in the following figure.
Figure 8. Graph of for , \gamma=\bigl{(}\begin{smallmatrix}4&-3\\ 3&-2\end{smallmatrix}\bigr{)}
We will deduce Theorem 4 as a consequence of the following proposition, which is a generalization to arbitrary elements of equation (32) for and .
Proposition 9**.**
Let with . Then
[TABLE]
To see that this implies the continuity, we note that we can rewrite the right-hand side by interchanging the summation and integration in the form (generalizing (25))
[TABLE]
for . The right-hand side of this formula is continuous because is piecewise continuous and bounded and has support in , while the second factor of the integrand is piecewise continuous and bounded by a constant times , as one sees easily by partial summation. Another argument is that (44) is equivalent to the equality of distributions
[TABLE]
(generalizing (27)) and we already know that the distribution is locally the derivative of a continuous function.
For the proof of (44) we will restrict to the case when the matrix belongs to , which is sufficient for the proof of Theorem 4 because is a cocycle and we already know the continuity of () for the special case representing the non-trivial coset of in . To avoid distracting case distinctions, we consider only the case and . The argument is the same in principle in all other cases, but we are actually working with ordered tuples of points on the 1-manifold , on which the group acts in an orientation-preserving way, and it is notationally simpler to fix the positions of the points occurring with respect to the point at infinity, so that we can work on instead. We may also assume (by replacing by if necessary, but then remembering that may be congruent to rather than modulo ) that . Finally, to be able to work with absolutely convergent sums and to reorder the terms freely, we will replace by the function () introduced in Section 4.2 and by the corresponding cocycle
[TABLE]
and only set at the end. The equation we have to prove then becomes
[TABLE]
We denote the last term (without the minus sign) by . Since the variable in the integral is always positive (because we are considering the case ), we can replace by its definition (11) and interchange the summation and integration to getf
[TABLE]
In the second sum we replace the vector by the vector . This does not change the product because is congruent to plus or minus the identity modulo the period of , changes the expression in the denominator to , and changes the inequalities to (which imply ). Hence
[TABLE]
completing the proof of equation (48) and hence of the proposition and theorem.
6. Modular forms are everywhere
As one can already see from the examples in the original article [16] where this notion was introduced, quantum modular forms are sometimes related to actual modular forms. These modular forms may be either holomorphic or Maass forms, with the quantum modular form in the latter case being related to the “periods” of Maass forms in the sense developed in [13] and [7]. The quantum modular form that we have been studying in this paper turns out to be of this latter type, with the associated Maass form being an Eisenstein series with eigenvalue 1/4 for the hyperbolic Laplace operator. In this final section we explain how this works, first for the case studied in the first four sections of this paper. In that case the relevant modular group is generated by a translation and an inversion, so that for both the period theory of the Maass form and the quantum modularity of one needs only the functional equation of the associated -series. Then at the end we indicate how the quantum modularity of for general follows from the full theory of periods of Maass forms.
We start with the case , so that is the character given by (1). The relevant modular form here is the Maass Eisenstein series
[TABLE]
where denotes the upper half-plane and the usual -Bessel function of order 0. This is an eigenfunction with eigenvalue 1/4 with respect to the hyperbolic Laplace operator \Delta=-y^{2}\,\bigl{(}\frac{\partial^{2}}{\partial x^{2}}\,+\,\frac{\partial^{2}}{\partial y^{2}}\bigr{)} and is a modular function with character for the group defined in Section 1, meaning that for all or, more explicitly, that
[TABLE]
To see this, we observe that is proportional to E\bigl{(}\frac{z+1}{4},\frac{1}{2}\bigr{)}-E\bigl{(}\frac{z-1}{4},\frac{1}{2}\bigr{)}, where is the usual non-holomorphic Eisenstein series of weight 0 and eigenvalue for the Laplace operator with respect to the full modular group . (This follows easily from the well-known Fourier expansion of as a linear combination of the three functions , , and .) The transformation equations (50) then follow from the -invariance of , the first one trivially since and the second by using the invariance of under \bigl{(}\begin{smallmatrix}1&0\\ \mp 4&1\end{smallmatrix}\bigr{)} to get
[TABLE]
We now associate to the periodic holomorphic function on (where ) having the same Fourier coefficients as , i.e.,
[TABLE]
Proposition 10**.**
The period function defined by
[TABLE]
extends holomorphically from to .
Proof. We follow the proof of the corresponding result given in Chapter 1 of [13] for Maass cusp forms on the full modular group . (See Theorem on p. 202 of [13], which also gives a converse statement characterizing cusp forms in terms of holomorphic functions on satisfying a certain functional equation.) That proof required only the functional equation of the -series associated to the Maass cusp form, which worked because the group is generated by the translation and the inversion , and can be applied here because has a similar structure. Our situation is also a little different because our function is an Eisenstein series rather than a cusp form, but since the Fourier expansion (49) of at infinity has no constant term this has no effect on the proof.
The argument in [13] was first to write the -series of the Maass form as the Mellin transform of the restriction of (or of its normal derivative in the case of an odd cusp form) to the imaginary axis, multiplied by a suitable gamma factor, and to deduce from this relationship and the -invariance of a functional equation for the -series. One then observed that the Mellin transforms of the restrictions to the positive or negative imaginary axis of both the associated periodic holomorphic function and the associated period function were also equal, up to different gamma factors, to the same -series, and the functional equation of this -series combined with Mellin inversion then led to a formula for that applied in all rather than just on . Here the first step, which would require the normal derivative since the restriction of the function (49) to vanishes, can be skipped since the -series is simply the square of the Dirichlet -series (1) and hence has a known functional equation. The Mellin transforms of the restrictions to and to the positive or negative imaginary axis are then given by
[TABLE]
[TABLE]
where in the last line we have used the functional equation of and standard identities for gamma functions. By the Mellin inversion formula we deduce that
[TABLE]
for any , and analytic continuation from to then gives the formula
[TABLE]
for all . But the factor in square brackets is bounded by a power of times for on the vertical line , so the integral on the right-hand side of this equation is absolutely convergent for all with , i.e., for all .
We now show how to connect the functions and to the functions and , respectively, and to deduce the continuity of – and hence the quantum modularity of – from Proposition 10 for the period function . The first step is easy, since from the Fourier expansions (51) and (26) and standard integral formulae we find the relation
[TABLE]
between the periodic function and the periodic distribution . (Here and in what follows the symbol denotes equality up to easily computed scalar factors whose values are irrelevant for the argument.) Replacing by in the integral, we obtain
[TABLE]
where we have used (27) to get the first equality and integration by parts for the second. But now replacing by and using the functional equation we find
[TABLE]
with the same proportionality constant as in (53), and adding these two equations gives the relation
[TABLE]
between the period function of the Maass wave form and the function . This establishes the desired connection between the continuity of , which expresses the quantum modularity of , and Proposition 10, which expresses the modularity of : in one direction, if we know that is continuous (and bounded by ), then (55) immediately gives the analytic continuation of from to ; and conversely, the inversion formula for the Stieltjes transform given in [14] lets us invert (55) to get
[TABLE]
where is any contour with endpoints at which encloses the origin, so that the continuity of is a direct consequence of the holomorphy of in .
We observe that the entire discussion given here applies in an essentially unchanged form to the more general functions and discussed in §4.2, with the Maass form (49) replaced by the form E\bigl{(}\frac{z+1}{4},\frac{s}{2}\bigr{)}-E\bigl{(}\frac{z-1}{4},\frac{s}{2}\bigr{)} with spectral parameter instead of .
Finally, we consider the case when the character in (1) is replaced by an arbitrary primitive odd Dirichlet character . Then all of the above calculations still go through: the function is defined by equation (49) with replaced by and by , which is again a Maass form (with spectral parameter and character on the group ) because it is proportional to \sum_{r\!\!\pmod{D}}\chi(r)E\bigl{(}\frac{z+r}{|D|},\frac{1}{2}\bigr{)}; the associated periodic function and period function are defined by (51) (with the new and with replaced by ) and (52) (with no change at all); and Proposition 10 remains true with the same proof. The difference, however, is that this proposition is no longer equivalent to the modularity of , but only to its invariance (up to sign) under the transformations and (or if is even), which in general generate a subgroup of of infinite order, as already discussed in Section 5. To get the full modularity (of this or any other potential Maass form on ), we need to generalize Proposition 10 to the statement that for any matrix \gamma=\bigl{(}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{)}\in\Gamma_{(D)} the function
[TABLE]
extends holomorphically from to if or if . This statement can be proved in several ways and can be linked by a discussion similar to the one above to the continuity property of stated in Theorem 4. But the simplest approach is to relate the functions directly to the invariant distribution associated to in the sense developed in [12], [13] and [7]. Specifically, this theory says that an eigenfunction of with spectral parameter and Fourier expansion is invariant (possibly with character) under the action of a Fuchsian group if and only if the associated distribution on is invariant (with the same character) with respect to the group action for \gamma=\bigl{(}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{)}\in\Gamma. (Here the word “distribution” must be interpreted correctly, namely, as a functional on the space of test functions that are smooth on and for which is smooth near .) In our case the distribution associated to is given by and is related to the distribution (41) by . But that means that if we differentiate the definition (43) of to get
[TABLE]
then the first two terms on the right cancel and we recover equation (46).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Balazard, E. Saias, The Nyman-Beurling equivalent form for the Riemann hypothesis. Expo. Math. 18 (2000), 131–138.
- 2[2] S. Bettin, J.B. Conrey, A reciprocity formula for a cotangent sum. Int. Math. Res. Not. IMRN 24 (2013) 5709–5726.
- 3[3] S. Bettin, J.B. Conrey, Period functions and cotangent sums. Algebra Number Theory 7 (2013), 215–242.
- 4[4] A. Beurling, On two problems concerning linear transformations in Hilbert space. Acta Math. 81 (1949), 239–255.
- 5[5] A. Beurling, A closure problem related to the Riemann zeta-function. Proc. PNAS 41 (1955), 312–314.
- 6[6] R. Bruggeman, J. Lewis and D. Zagier, Function theory related to the group P S L ( 2 , ℝ ) 𝑃 𝑆 𝐿 2 ℝ PSL(2,\mathbb{R}) . In From Fourier Analysis and Number Theory to Radon Transforms and Geometry. In memory of Leon Ehrenpreis , H.M. Farkas, R.C. Gunning, M.I. Knopp, and B.A. Taylor (eds.), Springer, Developments in Mathematics 28 (2013), 107–201.
- 7[7] R. Bruggeman, J. Lewis and D. Zagier, Period functions for Maass wave forms and cohomology . Mem. Amer. Math. Soc. 237 (2015), xii+132 pages.
- 8[8] J.B. Garnett, Bounded Analytic Functions , Springer 2007.
