Irrationality of values of L-functions of Dirichlet characters
St\'ephane Fischler (LMO)

TL;DR
This paper extends recent irrationality results for zeta values to L-functions of Dirichlet characters and Hurwitz zeta functions, using advanced approximation techniques and improving bounds on linearly independent values.
Contribution
It generalizes irrationality results to L-functions of Dirichlet characters and Hurwitz zeta functions, employing a novel approach based on Padé approximation and a generalized Shidlovsky's lemma.
Findings
Established lower bounds on the number of irrational L-values.
Proved the existence of many linearly independent L-values for fixed parity.
Improved previous bounds on linearly independent values without dependence on the character.
Abstract
In a recent paper with Sprang and Zudilin, the following result was proved: if is large enough in terms of , then at least values of the Riemann zeta function at odd integers between and are irrational. This improves on the Ball-Rivoal theorem, that provides only such irrational values -- but with a stronger property: they are linearly independent over the rationals.In the present paper we generalize this recent result to both -functions of Dirichlet characters and Hurwitz zeta function. The strategy is different and less elementary: the construction is related to a Pad\'e approximation problem, and a generalization of Shidlovsky's lemma is used to apply Siegel's linear independence criterion. We also improve the analogue of the Ball-Rivoal theorem in this setting: we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Irrationality of values of -functions of Dirichlet characters
Stéphane Fischler
Abstract
In a recent paper with Sprang and Zudilin, the following result was proved: if is large enough in terms of , then at least values of the Riemann zeta function at odd integers between and are irrational. This improves on the Ball-Rivoal theorem, that provides only such irrational values – but with a stronger property: they are linearly independent over the rationals.
In the present paper we generalize this recent result to both -functions of Dirichlet characters and Hurwitz zeta function. The strategy is different and less elementary: the construction is related to a Padé approximation problem, and a generalization of Shidlovsky’s lemma is used to apply Siegel’s linear independence criterion.
We also improve the analogue of the Ball-Rivoal theorem in this setting: we obtain linearly independent values with of a fixed parity, when is a Dirichlet character. The new point here is that the constant does not depend on .
MSC 2010: 11J72 (Primary); 11M06, 11M35, 33C20 (Secondary).
The purpose of this paper is to prove results of irrationality, or linear independence, of values of the Hurwitz function or -functions of Dirichlet characters. Both are generalizations of the Riemann function, so we begin with a quick survey of the main results in this setting.
When is even, is a non-zero rational number so that is transcendental. Apéry has proved [1] that is irrational, but there is no odd for which is known to be irrational. The next breakthrough is due to Ball-Rivoal [2, 20]:
[TABLE]
Here and throughout this introduction, denotes any sequence that tends to 0 as . In this paper we mention only asymptotic results (namely, as ) eventhough most results can be made explicit, and often refined, for small values of . At last we mention the following recent result [11]:
[TABLE]
for odd, .
The natural setting to generalize these results to values of the Hurwitz function or -functions of Dirichlet characters is the following. Let , and be such that for any . We assume that is not identically zero. Let , and be sufficiently large (in terms of and ). For consider the complex numbers
[TABLE]
If is a Dirichlet character mod then these are exactly the values of the associated -function.
The restriction on the parity of in (0.2) is needed in some cases to get rid of powers of . Indeed, if is a Dirichlet character then is either even (i.e., ) or odd (i.e., ), according to whether is equal to 1 or . If has the same parity as then is a non-zero algebraic number (see for instance [18, Chapter VII, §2]) so that the numbers for with this parity are linearly independent over . Moreover, for any periodic map which is either even or odd (and not identically zero), we also have when and have the same parity (see [12]). In these situations, we prove new results on the numbers (0.2) only when and have opposite parities.
An interesting case where (in general) is neither odd nor even is the following. Given we define by if , and otherwise. Then
[TABLE]
where is the Hurwitz function. Therefore the general setting (0.2) encompasses both values of the Hurwitz function and values of -functions of Dirichlet characters.
As far as we know, Apéry’s theorem has never been generalized in this direction; the first natural conjecture in this respect is probably that Catalan’s constant is irrational, where is the non-principal character mod 4. The Ball-Rivoal theorem has been generalized to the -function of this character by Rivoal and Zudilin [21]: they have proved (0.3) below with instead of , eventhough . In the general setting of (0.2), Nishimoto has generalized the Ball-Rivoal theorem as follows [19]:
[TABLE]
In the special case where (which includes the Hurwitz function but not -functions of non-principal Dirichlet characters), this lower bound appears already in Nash’ thesis [17]. The constant in Eq. (0.3) has been refined to in [8], provided is a Dirichlet character and is a multiple of 4. When is the non-principal character mod 4, this gives as a special case the lower bound of Rivoal and Zudilin [21].
Our first result is that one may replace the constant in Eq. (0.3) with , so that the lower bound is uniform in and is the same as for the Riemann function.
Theorem 1**.**
Let , and be such that for any . Assume that is not identically zero. Let , , and be sufficiently large (in terms of and ). Then
[TABLE]
Of course the same result holds without the restriction , but it is weaker and even trivial in some cases where is even or odd (as noticed above).
In another direction, we generalize the recent result (0.1) to this setting.
Theorem 2**.**
Let , and be such that for any . Assume that is not identically zero. Let be a finite-dimensional -vector space contained in , , , and be sufficiently large (in terms of , , and ). Then among the numbers with and , at least
[TABLE]
do not belong to .
Taking we obtain at least irrational values among the numbers . The dependence in is much better than in the lower bound of Theorem 1; however we obtain only numbers outside , and not -linearly independent numbers.
Before explaining the strategy used in the proofs of Theorems 1 and 2, we would like to state the two main special cases of Theorem 2 explicitly.
Corollary 1**.**
Let be a Dirichlet character; put is is odd, and if is even. Let be a finite-dimensional -vector space contained in . Let , and be sufficiently large (in terms of , , and ). Then among the numbers with and , at least do not belong to .
Corollary 2**.**
Let be a positive rational number, and . Let be a finite-dimensional -vector space contained in . Let , and be sufficiently large (in terms of , , and ). Then among the numbers
[TABLE]
with and , at least do not belong to .
Corollary 2 is new even for , i.e. for the Riemann function: it is a refinement of (0.1). We would like to emphasize the fact that the proof of [11] does not give this result for : a different approach is used here, proving linear independence and not only irrationality.
The proof of Theorems 1 and 2 is based on the strategy of [8]: we apply Siegel’s linear independence criterion using a general version of Shidlovsky’s lemma (namely Theorem 3, stated in §1.3 and proved in [8] following the approach of Bertrand-Beukers [4] and Bertrand [3]). This makes it necessary to relate the construction to a Padé approximation problem with essentially as many equations as the number of unknowns. In the present paper we adapt this strategy so as to include Sprang’s arithmetic lemma [23, Lemma 1.4] and the elimination trick of [24, 23, 11]. The proofs of Theorems 1 and 2 are essentially the same, except for the choice of parameters. It is also possible to prove other results of the same flavour (see Theorem 4 at the end of §3.2, which implies both Theorem 2 and – up to a multiplicative constant – Theorem 1).
Our construction contains as a special case the one used in [11] to prove (0.1). We prove this in §3.3; as a byproduct, we relate the construction of [11] to a Padé approximation problem with essentially as many equations as the number of unknowns.
The structure of this paper is as follows. We gather in Proposition 1 the output of the Diophantine construction (see §1.1), and prove it in §1. Then we deduce Theorems 1 and 2 from Proposition 1 in §3 using Siegel’s linear independence criterion (stated in §2).
1 Diophantine construction
In this section we gather the Diophantine part of the proof, namely the construction of linearly independent linear forms. We prove Proposition 1 stated in §1.1, from which we shall deduce in §3 the results stated in the introduction. The linear forms are constructed in §1.2 using series of hypergeometric type. We relate them in §1.4 to a Padé approximation problem, and then apply a general version of Shidlovsky’s lemma (stated in §1.3). At last, arithmetic and asymptotic properties are dealt with in §1.5.
1.1 Statement of the result
Let , , be positive integers such that . Let , and be such that for any . Assume that is not identically zero. Let ; put
[TABLE]
Let also
[TABLE]
Proposition 1**.**
There exists a constant , which depends only on and , with the following property. For any integer multiple of there exist integers , with and , such that:
For any sufficiently large, the subspace of spanned by the vectors , , is non-zero and does not depend on . 2.
For any and any we have as . 3.
For any we have, as :
[TABLE]
From now on, the symbols will be intended as . Since , these symbols can be made uniform with respect to .
The integers depend also implicitly on , , , , and . Their values for do not appear in the linear combinations of part , but they could be of interest in other settings. Another feature of this construction is that for , the integers depend only on , , , but not on or . Probably this could lead to variants of our results in the style of [14] or [7].
Remark 1**.**
In [8] a similar construction is made, where the matrix has rank for sufficiently large so that the subspace of part is equal to . In the present setting we make a different construction to incorporate Sprang’s arithmetic lemma (see §1.2 below), and the matrix we obtain has rank less than for some values of the parameters (see Remark 3 in §1.4): the subspace in Proposition 1 is not always equal to .
1.2 Construction of the linear forms
In this section we define the numbers of Proposition 1 (see Eq. (1.17)) and express the linear form of Eq. (1.2) in a more convenient way (see Lemma 1). We postpone until §1.5 the proof that .
As in §1.1 we let , , be positive integers such that . For any integer multiple of we let
[TABLE]
where is the Pochhammer symbol. Note that each factor of the denominator appears also in the numerator, so that the poles of this rational function only have order . This rational function is similar to that of [8], but central factors have been inserted in the numerator to apply Sprang’s arithmetic lemma (see Remark 2 below).
In this section we follow the proof of [8], except for Eq. (1.19) which is specific to the function we consider here. We let
[TABLE]
for with ; then both series are convergent since the degree of satisfies
[TABLE]
We let and for any we consider the (inverse) discrete Fourier transform of , defined by
[TABLE]
We also let
[TABLE]
The linear forms of Proposition 1 are given by the following lemma. The rational numbers will be constructed explicitly in the proof (see Eq. (1.17)), and we shall prove in §1.5 that they are integers.
Lemma 1**.**
For any there exist rational numbers , …, such that
[TABLE]
where is the -th derivative of a function .
Let us prove Lemma 1. The partial fraction expansion of reads
[TABLE]
with rational coefficients ; we consider
[TABLE]
Let for any , and define inductively by
[TABLE]
where for any . We let also111There is a misprint in the formula that gives in [8].
[TABLE]
[TABLE]
and define , for any by the recurrence relations
[TABLE]
[TABLE]
Then for any we have (as in [2, 10])
[TABLE]
[TABLE]
Since for any , Eq. (1.7) yields . This property is very important to us since we shall evaluate at -th roots of unity. To evaluate in the same way the rational functions for , we write
[TABLE]
with . Then Eqns. (1.12) and (1.13) yield
[TABLE]
[TABLE]
We may now define the coefficients for any by:
[TABLE]
where ; recall that with . Since , and are polynomials with rational coefficients, the numbers , …, are rational. We shall prove in Lemma 3 that they are integers, thanks to the factor . We also point out that is not defined for ; actually for the values of we are interested in (see (1.18) below).
To conclude the proof of Lemma 1, we shall evaluate Eqns. (1.15) and (1.16) at roots of unity. At the point 1 this is possible since, as in [8, §4.3],
[TABLE]
Now let , and be such that . Then Eqns. (1.12) to (1.16) hold, upon agreeing that the sums start at if ; this remark will be used below when is a -th root of unity.
Let be the right hand side of Eq. (1.6). Using (1.18) the definition (1.17) of yields
[TABLE]
Now Eq. (1.5) yields
[TABLE]
Therefore we have, since :
[TABLE]
Then Eqns. (1.15) and (1.16) yield, since , , and depend only on and is a -th root of unity:
[TABLE]
This concludes the proof of Lemma 1.
Remark 2**.**
The only difference here with the construction of [8] is that the rational function has been modified to incorporate Sprang’s arithmetic lemma [23, Lemma 1.4]. In our setting this choice of leads to the following additional property, that will be used in §1.4:
[TABLE]
To prove this property we notice that
[TABLE]
for any which is not a multiple of , the coefficient of is .
1.3 A general version of Shidlosvky’s lemma
Let be a positive integer, and . We fix222We shall check in §1.4 that the notation introduced in the present section is consistent with the one used earlier in this paper. and such that for any . Then with any solution of the differential system is associated a remainder defined by
[TABLE]
Let be a finite subset of , which may contain singularities of the differential system . For each , let be a family of solutions of such that the functions , , are -linearly independent and belong to the Nilsson class at (i.e., have a local expression at as linear combination of holomorphic functions, with coefficients involving complex powers of and integer powers of ). We agree that if , and define rational functions for and by
[TABLE]
These rational functions play an important role because they are used to differentiate the remainders (see [22, Chapter 3, §4]):
[TABLE]
The following result is proved in [8, Theorem 1.2].
Theorem 3**.**
There exists a positive constant , which depends only on and , with the following property. Assume that, for some :
The differential system has a basis of local solutions at with coordinates in for some positive real number .
We have
[TABLE]
for some with .
All rational functions , with and , are holomorphic at .
Then the matrix has rank at least .
In the special case where , , is analytic at [math], and is not a singularity of the differential system , this result was used by Shidlovsky to prove the Siegel-Shidlovsky theorem on values of -functions (see [22, Chapter 3, Lemma 10]). The functional part of Shidlovsky’s lemma has been generalized by Bertand-Beukers [4] to the case where , for any , and all functions are obtained by analytic continuation from a single one, analytic at all . Then Bertrand has allowed [3, Théorème 2] an arbitrary number of solutions at each , assuming that and the functions , , are analytic at . The proof [8] of Theorem 3 follows the approach of Bertand-Beukers and Bertrand, based on differential Galois theory.
An important feature of Theorem 3 is that may be a singularity of the differential system , and/or an element of . Both happen in the present paper, where (see §1.4 where Theorem 3 is applied to prove Lemma 2). If then so that Theorem 3 yields a matrix of maximal rank . On the other hand, if then the linearly independent linear combinations of the rows of the matrix corresponding to , , may vanish at : the lower bound is best possible. In the setting of §1.4 we have and so that Theorem 3 yields . Now (1.18) in the proof of Lemma 1 shows that for any (since because and are independent from whereas tends to with ). Therefore the matrix has rank equal to . Removing the first row, which is identically zero, yields a matrix of rank equal to the number of rows.
1.4 Padé approximation and application of Shidlovsky’s lemma
In this section we prove part of Proposition 1 for the numbers constructed in §1.2.
Lemma 2**.**
Let be defined by Eq. (1.17). Then there exists a positive constant (which depends only on and ) such that for any sufficiently large, the subspace of spanned by the vectors , , is non-zero and does not depend on .
The proof of Lemma 2 falls into 3 steps. To begin with, we construct a Padé approximation problem related to our construction, with essentially as many equations as the number of unknowns; notice that this problem is not the same as in [8], since the function used in the construction is different. Then we apply a general version of Shidlovsky’s lemma, namely Theorem 3 stated in §1.3. This provides a matrix with linearly independent rows. At last, we relate the numbers to by constructing a matrix such that . The point is that does not depend on (whereas and do). The subspace spanned by the columns of is the same as the one spanned by the columns of : it does not depend on .
Step 1: Construction of the Padé approximation problem.
We recall from §1.2 that
[TABLE]
[TABLE]
[TABLE]
Since for any , we have for any . Therefore letting
[TABLE]
for any , we have
[TABLE]
[TABLE]
Moreover, recall that ; Lemma 3 of [10] shows that
[TABLE]
Using again the fact that , we obtain for any :
[TABLE]
The new point here, with respect to [8], is that Eq. (1.19) in Remark 2 shows that does not depend on . Therefore Eq. (1.23) can be written as
[TABLE]
We have obtained a Padé approximation problem with unknowns, namely the coefficients of , for , and for . Eqns. (1.24), (1.25) and (1.26) amount to
[TABLE]
linear equations, where is the difference between the number of unknowns and the number of equations. If then : this is exactly the Padé approximation problem of [10, Theorem 1], which has a unique solution up to proportionality. However if then : we have solved a linear system with (slightly) more equations than the number of unknowns.
Step 2: Application of Shidlovsky’s lemma.
Let us introduce some notation to fit into the context of §1.3, and check the assumptions of Theorem 3. Let , and be the matrix of which the coefficients are given by:
[TABLE]
and all other coefficients are zero. We consider the following solutions of the differential system , with :
[TABLE]
[TABLE]
[TABLE]
where the coefficient in (resp. in ) is in position .
We let , , for , and , so that we have a solution for each , .
We also let (which is equal to for any ), and for any . Then we have polynomials , …, of degree at most , and with the notation of §1.3 the remainders associated with the local solutions , , , are exactly the functions that appear in the Padé approximation problem of Step 1: , , and for any .
Since is not the zero polynomial, we have for any ; the functions , …, (resp. , …, ) are -linearly independent (see [8, Lemma 2]).
Eqns. (1.24), (1.25) and (1.26) yield , and for any , so that
[TABLE]
here , and we recall that . As above, is exactly the difference between the number of unknowns and the number of equations in the Padé approximation problem of Step 1.
The definition (1.20) of is consistent with the one given (for ) by Eq. (1.7) in §1.2. We have , so that for sufficiently large where is the constant given by Theorem 3. Therefore (1.18) shows that and are holomorphic at for any . Eqns. (1.7), (1.10) and (1.11) imply that they are holomorphic at all other roots of unity. Now Eqns. (1.20), (1.10) and (1.11) yield
[TABLE]
for any , by induction on . Therefore all , with and , are holomorphic at 1.
We have checked all assumptions of Theorem 3 for sufficiently large: the matrix has rank at least . Now (1.18) implies for any , so that we may remove the first row: the matrix has rank , equal to its number of rows.
Step 3: Expression of in terms of and conclusion.
We shall now compute a matrix independent from such that ; recall that the coefficients and the matrix depend on .
To begin with, Eq. (1.14) with yields
[TABLE]
since . Therefore we have
[TABLE]
and the same relation holds with and . Using Eq. (1.27) we deduce that
[TABLE]
and also for since , and
[TABLE]
Therefore the definition (1.17) of can be translated as
[TABLE]
for any and any , where the coefficients are defined for and by
[TABLE]
Let us choose now the constant of Lemma 2; the same constant appears in Proposition 1. We take ; this constant depends only on and . We consider the matrices and . Then Eq. (1.29) means that
[TABLE]
Both and have coefficients in ; recall that the coefficients of are rational numbers, and we shall prove in §1.5 that they are integers. Let denote the subspace of spanned by the columns of . Now assume that is sufficiently large; then we have proved in Step 2 that the rows of are linearly independent. Therefore Eq. (1.31) shows that is equal to the subspace spanned by columns of the matrix . Since does not depend on , neither does : this concludes the proof of Lemma 2.
Remark 3**.**
Let us prove that in Lemma 2, the subspace is not always equal to (i.e., that the matrix may have rank less than its number of rows, namely ). Consider the case where and are even (so that ). Then the definition (1.30) of the matrix in Step 3 above yields for any , so that Eq. (1.29) implies for any : the matrix has a zero row. This phenomenon does not occur in [8]; it comes from the new property (1.19) obtained in Remark 2. Indeed a direct proof that can be obtained as follows, using Eqns. (1.17), (1.28), and (1.27) but not the matrix :
[TABLE]
1.5 Arithmetic and Asymptotic Properties
In this section we conclude the proof of Proposition 1 stated in §1.1, by proving parts and and the fact that the are integers. Recall that
[TABLE]
Lemma 3**.**
We have for any and any , and as :
[TABLE]
In this lemma and throughout this section, we denote by any sequence that tends to 0 as ; it usually depends also on , , , and (but the dependence in is not significant since is bounded by , which depends only on , , , ). We also recall that is the least common multiple of 1, 2, …, .
We shall prove two lemmas now; the deduction of Lemma 3 from these lemmas (using Lemma 1 proved in §1.2) is exactly the same as the proof of Proposition 1 in [8, §4.5]. Recall from §1.2 that
[TABLE]
Lemma 4**.**
For any and any we have
[TABLE]
[TABLE]
where is a sequence that tends to 0 as and may depend also on , , and .
**Proof **of Lemma 4: We follow the approach of [5] by letting
[TABLE]
Then the partial fraction expansion of
[TABLE]
can be obtained by multiplying those of , and using repeatedly the formulas and
[TABLE]
with . The denominator of comes both from this formula (and this contribution divides ) and from the denominators of the coefficients in the partial fraction expansions of , , (which belong to , so that accounts for this contribution). This concludes the proof of (1.32).
On the other hand, bounding from above the coefficients of the partial fraction expansions of , , yields
[TABLE]
where is a constant depending only on , , which can be made explicit (see [5] for details). Simplifying the products and using the bound valid when , one obtains
[TABLE]
This concludes the proof of Lemma 4.
The proof of the following lemma is inspired by that of [23, Lemma 1.4]. Recall that and are defined in Eqns. (1.8) and (1.9), and that
[TABLE]
Lemma 5**.**
The polynomials and have integer coefficients.
**Proof **of Lemma 5: Recall from Eq. (1.8) that
[TABLE]
Assume that does not have integer coefficients. Then there exists such that
[TABLE]
Let , which is an integer thanks to Lemma 4. Then we have
[TABLE]
If divides then is a positive integer less than or equal to , so that it divides : this contradicts the assumption . Therefore does not divide , so that .
Since there exists such that
[TABLE]
Now so that
[TABLE]
This rational number has negative -adic valuation for some prime number . Therefore there exist and , such that
[TABLE]
This implies
[TABLE]
so that . This is a contradiction since . This concludes the proof that ; the same proof works for .
2 Siegel’s linear independence criterion
The following criterion is based on Siegel’s ideas (see for instance [6, p. 81–82 and 215–216], [16, §3] or [15, Proposition 4.1]).
Proposition 2**.**
Let be complex numbers, not all zero. Let , and be a sequence of real numbers with limit . Let be an infinite subset of , , and for any let be a matrix with integer coefficients such that as with :
[TABLE]
[TABLE]
Assume also that the subspace of spanned by the columns of is non-zero and independent from (provided is large enough). Then we have
[TABLE]
The usual version of this criterion (see for instance [9, Theorem 4]) is the same statement, but the assumption on is replaced by the assumption that is invertible. The latter is stronger, since it is equivalent to asking for any . Indeed if then has rank : for each we may extract linearly independent columns of , and obtain an invertible matrix to which [9, Theorem 4] applies. The point is that we shall apply Proposition 2 to the matrices constructed in Proposition 1, and the subspace is not always equal to (see Remark 3 in §1.4).
Let us prove Proposition 2 now. Denote by the image of , assumed to be independent from (provided is large enough). Let . Permuting , …, if necessary, we may assume that a system of linear equations of is given by
[TABLE]
We point out that the coefficients are rational numbers because the matrices have integer coefficients. Since for any and any sufficiently large, Eq. (2.1) yields
[TABLE]
upon letting for . Moreover for any sufficiently large, we have and Eq. (2.1) shows that the last rows of are linear combinations of the first rows. Therefore the first rows are linearly independent: the matrix has rank . Accordingly for each there exist pairwise distinct integers , …, between 1 and such that the matrix is invertible. Using Eq. (2.2) we may apply the usual version of Siegel’s criterion (namely [9, Theorem 4]) to this matrix and deduce that
[TABLE]
Since for any , this concludes the proof of Proposition 2.
Remark 4**.**
The idea of applying the usual version of Siegel’s criterion to numbers defined as linear combinations of , …, appears also in [9] (see Proposition 2 in §6 and Eq. (9.1)). However the situation is different in that paper: the rows of the matrix (see Step 2 in §1.4 above) are linearly dependent, which is not the case here.
3 Deduction of Theorems 1 and 2 from Proposition 1
In this section we prove Theorems 1 and 2 stated in the introduction, and also a result that nearly contains both of them (namely Theorem 4 stated at the end of §3.2). At last, we show in §3.3 that the linear forms constructed in [11] are a special case of those studied here.
3.1 Proof of Theorem 1
Let , , , , and be as in the statement of Theorem 1; put . We consider the complex numbers given by:
[TABLE]
We apply Proposition 1 to each integer multiple of , and let for and . Then we apply Siegel’s linear independence criterion (namely Proposition 2 stated and proved in §2) with , and (so that ), where and are defined in §1.1; we take for the set of integer multiples of . Therefore we obtain
[TABLE]
Taking very large and equal to the integer part of concludes the proof of Theorem 1 since
[TABLE]
here the shift of in the lower bound comes from , …, that appear in Eq. (3.1).
3.2 Proof of Theorem 2
Let , , , and be as in the statement of Theorem 2. Let and be sufficiently large with respect to , , and . We denote by the product of all primes less than or equal to (such a product has asymptotically the largest possible number of divisors with respect to its size, see [13, Chapter XVIII, §1]). Then we have
[TABLE]
by the prime number theorem, i.e., . We take for the integer part of . At last, we let .
For any divisor of and any , let if divides , and otherwise. Since is -periodic we have for any .
We shall choose below an integer for each divisor of ; let
[TABLE]
We shall apply Proposition 1 to the -periodic function and obtain linear forms in the numbers
[TABLE]
by letting . Therefore we have
[TABLE]
Notice that has divisors, with
[TABLE]
Assume that the number of values (0.2) which do not belong to is less than . Let be integers such that if and , , then for some .
The homogeneous linear system
[TABLE]
has unknowns , where ranges through the set of divisors of , and equations. Therefore it has a non-zero integer solution .
At this point, the integers are chosen in [11] such that , using an elementary zero estimate (namely, a generalized Vandermonde determinant is non-zero). Here we do not need to make any such assumption: we just assume that for at least one . Indeed a (much more complicated) zero estimate is used in the present proof, namely Theorem 3.
Proposition 1 applies to the -periodic function defined above. Using also Siegel’s linear independence criterion as in §3.1 we obtain
[TABLE]
with
[TABLE]
as (recall that is the integer part of ).
On the other hand, the numbers that appear in the left hand side of (3.5) have the following properties:
, …, belong to .
For with , is zero if , and belongs to otherwise, as Eqns. (3.2) and (3.4) show.
Therefore we have
[TABLE]
Combining Eqns. (3.5) and (3.6) yields a contradiction provided is large enough. This concludes the proof of Theorem 2.
Since , the same proof (with replaced with to take Eq. (3.3) into account) provides the following refinement of Theorem 2.
Theorem 4**.**
Let , and be such that for any . Assume that is not identically zero. Let , , and be sufficiently large (in terms of and ). Let be a finite-dimensional -vector space contained in with . Then among the numbers with and , at least
[TABLE]
do not belong to .
Choosing , this refinement implies that the numbers with and are not all contained in such a subspace : they span a -vector space of dimension at least . Except for the multiplicative constant ( instead of ), Theorem 1 follows as a corollary of Theorem 4.
3.3 Connection to the proof of [11]
In this section we show that the linear forms used in [11] to prove (0.1) are a special case of those studied in the present paper (namely in the proof of Theorem 2 with , , and ). Accordingly they are related to the Padé approximation problem stated in §1.4, in which the number of equations is essentially equal to the number of unknowns.
We keep the notation of the proof of Theorem 2 in §3.2, with and for any . In particular is the product of all primes less than or equal to . For any divisor of we have if divides , and otherwise. The function satisfies
[TABLE]
Now let be an integer multiple of , and let be such that . Then the rational function satisfies the symmetry property of well-poised hypergeometric series:
[TABLE]
This is the key ingredient (since the Ball-Rivoal theorem) to get rid of even zeta values, when . In our approach where Nesterenko’s linear independence criterion is replaced with Siegel’s combined with Shidlovsky’s lemma, this property cannot be used in the same way because it is destroyed when considering for . Using both and in constructing the linear forms (see §1.2) makes it possible to overcome this difficulty (as in [8]). With this trick does not modify the linear forms we are interested in, since for any we have using Eqns. (1.3) and (3.8) and the fact that divides :
[TABLE]
We are now in position to express differently the linear forms constructed in part of Proposition 1 from the map given by Eq. (3.7), in the special case where , is a multiple of , , , and . Denote by this linear form. Then we have using Lemma 1 and Eqns. (3.9) and (3.7):
[TABLE]
In the last expression the sum on should have begun at , but this makes no difference since for any . Now let ; then we have
[TABLE]
Up to the normalizing factor these are exactly the linear forms used in [11] to prove (0.1). Indeed the following notation is used in [11] for and :
[TABLE]
[TABLE]
Now, up to the normalizing factor is equal to the rational function so that is equal to using Eq. (3.10).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Apéry – “Irrationalité de ζ ( 2 ) 𝜁 2 \zeta(2) et ζ ( 3 ) 𝜁 3 \zeta(3) ”, in Journées Arithmétiques (Luminy, 1978) , Astérisque, no. 61, 1979, p. 11–13.
- 2[2] K. Ball & T. Rivoal – “Irrationalité d’une infinité de valeurs de la fonction zêta aux entiers impairs”, Invent. Math. 146 (2001), no. 1, p. 193–207.
- 3[3] D. Bertrand – “Le théorème de Siegel-Shidlovsky revisité”, in Number theory, Analysis and Geometry: in memory of Serge Lang ( D. Goldfeld et al. , eds.), Springer, 2012, p. 51–67.
- 4[4] D. Bertrand & F. Beukers – “Équations différentielles linéaires et majorations de multiplicités”, Ann. Sci. École Norm. Sup. (4) 18 (1985), no. 1, p. 181–192.
- 5[5] P. Colmez – “Arithmétique de la fonction zêta”, in Journées mathématiques X-UPS 2002 , éditions de l’école Polytechnique, 2003, p. 37–164.
- 6[6] N. Fel’dman & Y. Nesterenko – Number theory IV, transcendental numbers , Encyclopaedia of Mathematical Sciences, no. 44, Springer, 1998, A.N. Parshin and I.R. Shafarevich, eds.
- 7[7] S. Fischler – “Distribution of irrational zeta values”, Bull. Soc. Math. France 145 (2017), no. 3, p. 381–409.
- 8[8] — , “Shidlovsky’s multiplicity estimate and irrationality of zeta values”, J. Austral. Math. Soc. 105 (2018), no. 2, p. 145–172.
