
TL;DR
This paper presents an analytic proof of the Borwein Conjecture, resolving a long-standing mathematical question using advanced mathematical techniques.
Contribution
It offers the first rigorous analytic proof of the Borwein Conjecture, advancing understanding in the field of mathematical analysis.
Findings
Proof confirms the validity of the Borwein Conjecture
Analytic methods effectively resolve the conjecture
Contributes to mathematical theory and proof techniques
Abstract
We provide a proof of the Borwein Conjecture using analytic methods.
| Sum | |||||
|---|---|---|---|---|---|
| 0.544 | 0.197 | 0.237 | 0.004 | 0.982 | |
| 0.046 | 0.266 | 0.008 | 0.864 | ||
| 0.046 | 0.266 | 0.008 | 0.864 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An analytic proof of the Borwein Conjecture
Chen Wang
University of Vienna
Abstract.
We provide a proof of the Borwein Conjecture using analytic methods.
This work is supported by the Austrian Science Fund (FWF) grant SFB F50 (F5009-N15).
1. Introduction
In 1990, Peter Borwein observed that the coefficients of the polynomials
[TABLE]
have a repeating sign pattern of . A more formalized version appears in an 1995 paper by Andrews [1]. Here, and in the sequel, we use the standard notation for -shifted factorials,
[TABLE]
Conjecture 1.1** (P. Borwein).**
Let the polynomials , and be defined by the relationship
[TABLE]
Then these polynomials have non-negative coefficients.
This statement is known as the Borwein Conjecture.
There have been many attempts to prove the Borwein Conjecture. Moreover, we find several variations and generalizations in the literature, see [1, 5, 6, 13, 14, 15], sometimes also conjecturally, sometimes with full or partial proofs. However, none of the proved variations and generalizations cover the original conjecture, Conjecture 1.1. It is fair to say that so far essentially two methods have been tried: bijective methods—such as in [6, 10], and basic hypergeometric methods—such as in [1, 5, 14]. Surprisingly though, it seems that nobody made an asymptotic attack on the conjecture. This may have to do with the fact that the “canonical” formulas for , and , namely (14.3)–(14.5), are entirely unsuitable for asymptotic approximation, see the corresponding remarks in Section 14. Nonetheless, it turns out that there are formulas for , and that are amenable to asymptotics, which appear already in Andrews’ paper [1], where the original conjecture appears for the first time in print.
Theorem 1.2** (Andrews, [1, Theorem 4.1]).**
Let , and be defined as in (1.1). Then we have the expansions
[TABLE]
As a matter of fact, after discussing these formulas briefly, Andrews says in [1] that “it might be possible to prove that has positive coefficients by establishing sufficiently tight bounds on the coefficients that arise term-by-term in (4.5)”, where Andrews’ (4.5) is our (1.2).
In the present paper, we follow Andrews’ advice. Our main discovery is that, in the sums (1.2)–(1.4), the first term, i.e, the term for , dominates all other terms. This makes these expressions superior to all other known expressions for the purpose of asymptotic estimations. We use analytic methods to bound the coefficients of , and away from [math] by expressing the coefficients as certain contour integrals and estimating these integrals. Section 2 contains the basic setting of our proof: it is explained how to break the contour integrals into a positive-valued main part and four error terms, thus reducing the Borwein Conjecture to the problem of obtaining sufficiently good upper bounds on the error terms.
After establishing some basic facts and fixing some parameters in Sections 3–7, we derive upper bounds for each of the error terms in Sections 8–11, which leads to a proof of the Borwein Conjecture for all in Section 12. Some auxiliary results of technical nature are stated and proved separately in an appendix. The cases where are directly verified by a computer calculation, see Section 13. We conclude our paper with Section 14, in which we recall in more detail the earlier mentioned variations and generalizations, and where we also comment on possible further implications of our analytic approach.
2. An outline of the proof
In this section, we provide a brief outline of our proof of the Borwein Conjecture.
First, we claim that non-negativity of the coefficients of already implies the complete Borwein Conjecture. Indeed, we have
[TABLE]
which proves the non-negativity of the coefficients of given the non-negativity of the coefficients of . On the other hand, the elementary recursive formula [1, Eq. (3.3)]
[TABLE]
allows us to get the non-negativity of the coefficients of inductively from the non-negativity of the coefficients of (and ). Therefore, from now on, we will concentrate on .
In Section 3, we start by writing (see (3.1))
[TABLE]
where is the -th summand in the expansion (1.3). We then decompose into the sum of two simpler polynomials, namely and , see (3.2), (3.3), and (3.4), so that
[TABLE]
The background of this decomposition is that the polynomials and are simpler to handle asymptotically. By summing over all , we define
[TABLE]
so that
[TABLE]
In particular, this decomposition shows that, to prove the non-negativity of the coefficients of , it suffices to prove the non-negativity of the coefficients of and separately. Some elementary properties about and are collected in Lemma 3.2. In particular, it turns out that is a palindromic polynomial, that is, , while is not. The latter is the reason that, in the subsequent discussion, we also need the reciprocal polynomial of , that is, .
The contents of Section 4 is a proof of non-negativity of the coefficients of in and for . It relies on results of Andrews in [1] and on a positivity result of Berkovich and Garvan from [4]. Thus, what remains to show, is non-negativity of the coefficients of in for , and an analogous result for and for .
For notational simplicity, we will use the notations and throughout this paper to refer to multiple families of polynomials. For example, a proposition that is true for for means the proposition is true for all three families of polynomials , and . We will also use the standard notation to represent the coefficient of in the polynomial .
Using Cauchy’s integral formula, the coefficient can be represented as the integral
[TABLE]
where is any contour about [math] with winding number 1. We will choose as a circle centred at [math] with radius for some , so that the integral becomes
[TABLE]
The exact choice of is related to the saddle point of . We will elaborate on this in Section 5. The appropriate choice for is a value smaller than but close to , see Lemma 5.1.
We use the expansions to write the integral (2.3) as
[TABLE]
Figure 1 illustrates the typical behaviour of on the circle . In particular, we can observe the following general features in the graph:
- •
the terms with smaller have a central peak at ;
- •
the central peak of for small looks like a translated-down version of the central peak for . Since Figure 1 is on a logarithmic scale, this suggests that the magnitude could be controlled by a constant factor times in a neighbourhood of ;
- •
for these terms, the parts outside the small neighbourhood of are very small compared to the peak value;
- •
when becomes larger, the central peak disappears. However, the graph suggests that all of (represented by the red curve in the graph) is at a lower location in the graph, indicating that could be controlled by a relatively small constant if is large.
Therefore, based on these heuristics, we choose two cut-offs and (to be determined in (7.1) and (7.2)), and distinguish the following parts of the integrands , for :
- •
The term primary peak refers to the part where and .
- •
The term secondary peaks refers to the parts where and .
- •
The term tails refers to the parts where and .
- •
Finally, the term remainders refers to the parts where .
Naturally, the integral (2.4) can be divided into four sub-integrals corresponding to the four parts above.
For all , we make the following observations concerning the four sub-integrals:
- •
The primary peak can be approximated by a Gaußian integral. More specifically, if we define
[TABLE]
then we have
[TABLE]
for large .
- •
The secondary peaks will be bounded above by a constant times the primary peak. We make the argument
[TABLE]
- •
The tails will be estimated relative to its corresponding (primary or secondary) peak. More specifically, for , we will construct families of polynomials with non-negative coefficients (see the paragraph before (6.5)), acting as uniform upper bounds for over the circle , satisfying the relations
[TABLE]
for all and all .
With the help of , the tail integrals can be bounded above by
[TABLE]
- •
The remainder will be directly controlled by the upper bounds . Namely, by (2.8), we have
[TABLE]
Our next step is to estimate the relative error in the approximation (2.6), and to bound the other parts of the integral relative to the (presumably) dominating part . Based on (2.6) and the inequalities (2.7), (2.9) and (2.10), we make the following definitions of the relative errors:
[TABLE]
It should be noted that only the first of these, , depends on , the parameter which keeps track of the monomial of which we are taking the coefficient in .
These definitions, along with the integral representation (2.4) and the inequalities (2.7), (2.9) and (2.10), imply that
[TABLE]
Therefore, once we have sufficiently good bounds on all these error terms so that their sum is smaller than 1, we can conclude that is indeed positive.
The primary peak error is estimated in Section 8, the secondary peaks are bounded in Section 9, Section 10 is devoted to bounding the remainders , and finally Section 11 treats the tails . All these estimations are valid for and respectively , and their combination shows that the Borwein Conjecture holds for , see Theorem 12.1 in Section 12. The cases where are disposed of by a (lengthy) computer calculation, the principles of which are explained in Section 13.
3. Decomposing
As we already explained in the introduction, the starting point of our proof of the Borwein Conjecture is Theorem 1.2, which provides certain expansions of the polynomials , and . Based on the expansion (1.3), we define the family of polynomials to be the summands in that expansion, so that
[TABLE]
The factor in turns out to be inconvenient, since our strategy is to bound quotients of successive terms. Therefore, we decompose it as
[TABLE]
This decomposition naturally extends to the family of polynomials via the following definitions:
[TABLE]
so that
[TABLE]
By summing over all , we define
[TABLE]
so that
[TABLE]
As we already indicated in the previous section, our estimations of the error terms for are only valid for , that is, only for “half of the coefficients”, see Section 5, and in particular Lemma 5.1 to which we shall constantly refer. While this is fine for — since is palindromic, proving bounds for the first half of the coefficients automatically means to also have proved analogous bounds for “the other half” — this is a problem for which is not palindromic. Here, we need to consider the reciprocal polynomial of , that is, , and also prove estimations for as defined in (2.11)–(2.14). It is a routine calculation from (3.5) that with
[TABLE]
we have
[TABLE]
Remark 3.1*.*
It is not hard to see that the functions , and , as defined above, are actually polynomials for all with . (For a proof of this fact, see the factorizations (6.1)–(6.4) and the related discussions in Section 6.) However, in the special case and , (3.2), (3.3) and (3.7) fail to give polynomials. Thus, we restrict the domain of the definitions (3.2), (3.3) and(3.7) to , and make alternative definitions in the “borderline case”:
[TABLE]
It is straightforward to see that, with these alternate definitions, and with the sums (3.5) and (3.8), we still have (3.6).
We collect some basic facts about these polynomials.
Lemma 3.2**.**
For , the polynomials and have the following properties:
- •
* is a palindromic polynomial, while and are reciprocal of each other. Therefore, it suffices to consider the coefficients for .*
- •
. Furthermore, we have for all .
- •
The terms in the expansions have a nice product form:
[TABLE]
- •
The expression (3.12) implies the following formula for as defined in (2.5):
[TABLE]
4. The first coefficients
In this section, we settle the non-negativity of the first coefficients of for by considering the limiting case.
To this end, we define
[TABLE]
for all . The following lemma is a direct consequence of (3.2), (3.3) and (3.7).
Lemma 4.1**.**
For all and all , we have
[TABLE]
This lemma says in particular that, for all , the non-negativity of implies the non-negativity of for . The series , with have indeed non-negative coefficients as we are going to show now.
Andrews proved in [1] that and have non-negative coefficients. Moreover, certain Rogers–Ramanujan type identities for modulus 9, first discovered by Bailey [3, p. 224], give product formulas for these two series.
Lemma 4.2** ([1, (4.3)–(4.4)]).**
The power series and have the closed form expressions
[TABLE]
where we use the short notation
[TABLE]
We proceed to deduce non-negativity results for the power series and from these forms. By taking the limit in equations (1.3) and (1.4), and in (3.2), (3.3) and (3.7), we see that
[TABLE]
An immediate conclusion is that also has non-negative coefficients. In order to prove analogous results for and , it suffices to show that has non-negative coefficients. To prove this claim, we write
[TABLE]
The non-negativity of the last factor follows from the following partition inequality, first proved by Berkovich and Garvan in [4], by taking , and .
Theorem 4.3** (Berkovich & Garvan, [4, Theorem 5.3]).**
Let and . Then the -series
[TABLE]
has non-negative coefficients if and only if and .
Thus we have proved that has non-negative coefficients for . Combined with Lemma 4.1, we have the following result concerning the first coefficients of .
Theorem 4.4**.**
For all , , and all , we have
[TABLE]
5. Locating the saddle point
The results of the last section show that it suffices to consider for . The purpose of this section is to describe our choice of the radius in (2.3), under the above restriction on .
Keeping in line with standard practice in analytic combinatorics (cf. [8]), our choice of the radius will be a saddle point of the function . It turns out that there is a unique saddle point on the positive real axis, and we have very tight bounds on the position of this point under the condition . These results will be proved in the following lemma. They are vital in our estimations of the error terms in Sections 8–11.
Lemma 5.1**.**
For all , all integers , and , the saddle point equation
[TABLE]
has a unique solution . Moreover, if , then we have where
[TABLE]
and is the maximum value of the function on .
Proof.
The equation (5.1) can be transformed into
[TABLE]
Let us write for the left-hand side. From the definition of the polynomials in (3.12), we have
[TABLE]
These functions attain the special values
[TABLE]
Moreover, all are increasing functions in since we have
[TABLE]
The existence and uniqueness of solution follows immediately.
It remains to prove the bounds on . Since is increasing, it suffices to show that and . The latter is true due to the second equation in (5.3). In order to see the former inequality, we argue as follows:
[TABLE]
6. The auxiliary polynomials
As mentioned in Section 2, we will construct families of polynomials satisfying (2.8). These polynomials are upper bounds for with respect to . On the way, we also show that , and are polynomials in , as claimed in Remark 3.1.
To this end, we first note that the inequality trivially holds if is a polynomial with non-negative coefficients. Therefore, we proceed to factor out such parts from the polynomials , and bound the cofactor from above by the triangle inequality. Due to the relationship , we will only explicitly write the factorization results for .
Using the definitions (3.2) and (3.3), we arrive at the factorizations
[TABLE]
Here, is the -binomial coefficient, defined by , which is known to be a polynomial in with non-negative coefficients.
We claim that the first factors in (6.1) and (6.2) are polynomials with non-negative coefficients if . For , this is because the -binomials and the polynomials
[TABLE]
have non-negative coefficients. In the case of , the factor is the -analogue of the Fuß–Catalan numbers (see, for example, Stump [12]), and it is also a polynomial with non-negative coefficients.
On the other hand, if , then the first factors in (6.1) and (6.2) will no longer be polynomials. In these cases, we make the alternate factorizations
[TABLE]
In each of the equalities (6.1)–(6.4), the first factor is a polynomial in with non-negative coefficients, and the second factor is a product of factors of the form since
[TABLE]
with the single exception of the factor in (6.2). In particular, all of the second factors in (6.1)–(6.4) are polynomials in , with some negative coefficients though.
We note the trivial fact that , as well as the slightly non-trivial fact that
[TABLE]
as long as .
Based on these facts, we define the polynomials to be the result of replacing by in the first parts of (6.1)–(6.4), replacing every factor in the second parts of (6.1)–(6.4) by a corresponding factor , and replacing in (6.2) by .
The immediate consequence of this definition are expressions for the quotients between successive ’s. We have
[TABLE]
for , as well as
[TABLE]
for . Moreover, we trivially have
[TABLE]
These relations will be used in the estimations of the tails and the remainders in Sections 10 and 11.
7. The cut-off values
In order to get a good balance among the error terms , two cut-offs — for the argument , and for the summation index — will be chosen as
[TABLE]
where is the value of the saddle point given by the unique solution to (5.1).
Remark 7.1*.*
One consequence of the choice (7.1) is that, whenever with and , we know that
[TABLE]
for all with . This means that the complex number belongs to the region
[TABLE]
Having done all the preparatory work, we now dive into the estimations for the error terms in the next few sections.
8. Bounding the primary peak error
We begin this section by introducing a general bound on the relative errors for the approximation of a function by a Gaußian.
Lemma 8.1**.**
Suppose that and satisfy for some . Let . Suppose further that . Then we have
[TABLE]
Proof.
Let . Taylor’s theorem implies that
[TABLE]
We split the integral as follows:
[TABLE]
Therefore we have
[TABLE]
The last integral is then bounded using Lemma A.2 by taking and . ∎
Lemma 8.2**.**
Suppose that is chosen as the saddle point as described in Lemma 5.1. Then, for all , we have
[TABLE]
where .
Proof.
Note that the choice of as the saddle point of ensures that the Taylor expansion of at has a vanishing linear term. Thus we can use Lemma 8.1 to bound the relative error . We define
[TABLE]
Lemma 8.1 immediately allows us to conclude
[TABLE]
provided that .
The subsequent arguments in this part exploit some inequalities for the quantities , and to verify the conditions of Lemma 8.1.
We start by establishing simpler bounds on them. For the sake of simplicity, we write and for and in the subsequent arguments.
The definition of implies that
[TABLE]
where .
Therefore, an upper bound for can be directly inferred from (A.3):
[TABLE]
On the other hand, (3.13) and the elementary inequality lead to the following bounds for :
[TABLE]
as well as
[TABLE]
where we use the inequality , as well as
[TABLE]
and
[TABLE]
which is a consequence of Lemma 5.1.
Having established the bounds above, we can establish some relationships among , , and .
The inequalities (8.2), (8.4) and (A.13) imply that
[TABLE]
We also infer from (8.2), (8.4) and (A.14) that
[TABLE]
where we used the numerical inequality .
Finally, to bound the complementary error function in (8.1), which is equivalent to bound from below, we invoke (8.4) and (A.11) to see that
[TABLE]
and therefore
[TABLE]
9. Bounding the secondary peaks
The error terms related to the secondary peaks concern the quotients . To bound these quotients from above, we look at the quotients of two consecutive polynomials.
[TABLE]
In order to bound these quotients from above, we introduce a general result that can be used to control the rational function in certain domains of the complex plane.
Lemma 9.1**.**
Suppose , and . Then for all such that , we have
[TABLE]
Proof.
We make use of the infinite products
[TABLE]
and
[TABLE]
We claim that under the assumptions of this lemma, we have
[TABLE]
from which the lemma follows after taking the product over all .
In order to prove this inequality, we write and , so that and . Now the absolute value can be written as
[TABLE]
and the inequality can be proved by the manipulation
[TABLE]
Lemma 9.2**.**
Suppose that and are as defined as in (5.2) and (7.1), respectively. Then, for all , all such that , and , we have
[TABLE]
and
[TABLE]
Proof.
We write so that and . Note that the conditions on imply the inequality
[TABLE]
We claim that the inequality
[TABLE]
holds for all and all . This can be proved by observing that
[TABLE]
if , and
[TABLE]
if . Therefore, (9.3) and (9.4) imply that satisfies the condition in Lemma 9.1 with . Lemma 9.1 now says that, for any where , we have
[TABLE]
We use (9.5) to bound various parts on the right-hand sides of (9.1) and (9.2). We have
[TABLE]
as well as
[TABLE]
for or .
It remains to bound the factor
[TABLE]
where or . Here we make use of (A.1) and (A.2) (recall that belongs to the region (7.3)) to conclude that
[TABLE]
These bounds allow us to obtain upper bounds for the first factor in the expression (2.12) of the error term .
Lemma 9.3**.**
Suppose , and that , and are as defined as in (5.2), (7.2) and (7.1), respectively. Then, for all , we have
[TABLE]
Proof.
We first make use of Lemma 9.2, and we notice that the condition and the choice of imply that . Therefore, the terms involving in Lemma 9.2 can be bounded above by
[TABLE]
This implies
[TABLE]
and
[TABLE]
for all with . The relationship (3.7) implies a similar inequality for , namely
[TABLE]
The bounds stated in the lemma can be obtained by noticing that
[TABLE]
and by estimating
[TABLE]
It remains to deal with the second factor in (2.12), namely
[TABLE]
The argument below is parallel to the one in Section 8. The main difference is that the integrand is an even function in , so the error term in the Taylor expansion is of order four instead of order three. We first present an analogue of Lemma 8.1.
Lemma 9.4**.**
Suppose that and that is an even function that satisfies for some . Let . Suppose further that . Then we have
[TABLE]
Proof.
Let . Taylor’s theorem implies that
[TABLE]
Similar to the proof of Lemma 8.1, we argue that
[TABLE]
The last integral is then bounded using Lemma A.3 by taking and . ∎
Lemma 9.5**.**
Suppose that , and is defined as in (7.1). Then we have
[TABLE]
where .
Proof.
Note that the integrand is an even function in , so we can use Lemma 9.4 to bound the integral. We define
[TABLE]
Lemma 8.1 immediately allows us to conclude
[TABLE]
provided that the condition is satisfied.
The subsequent arguments in this part exploit some inequalities for the quantities , and to verify the conditions of Lemma 9.4.
We start by establishing simpler bounds on them. For the sake of simplicity, we write and for and in the subsequent arguments.
The definition of implies that
[TABLE]
where .
Therefore, an upper bound for can be directly inferred from (A.4):
[TABLE]
On the other hand, we recall the upper and lower bounds on from (8.3) and (8.4). We establish some relationships among , , and .
The inequalities (9.7), (8.4) and (A.15) imply that
[TABLE]
Moreover, from (9.7), (8.4) and (A.16), we also infer that
[TABLE]
By combining Lemmas 9.3 and 9.5, we arrive at our bound for the error term .
Lemma 9.6**.**
For all and all with , we have
[TABLE]
10. Bounding the remainders
The reason we estimate the remainder parts before the tail is that certain results in this section, namely upper bounds for the ratios , will also be used in bounding the tails from above.
Lemma 10.1**.**
Suppose that , and . For all , we have
[TABLE]
On the other hand, for all , we have
[TABLE]
Proof.
We claim that the first factors in (6.5) and (6.7) do not exceed 1, and the first factors in (6.6) and (6.8) do not exceed 4. The claims about (6.5) and (6.7) are proved by using the inequality for , and the claims about (6.6) and (6.8) are proved by observing that
[TABLE]
as well as the inequality , valid for and .
The second factors in (6.5) and (6.7) can be estimated using the inequality , valid for all and . (This can be considered as a limiting form of Lemma 9.1 when .) In order to deal with the factor , we use the fact that the function is decreasing in if . This concludes the proof of the first part of the lemma.
We cannot directly use the same method for the second factors in (6.6) and (6.8) because, in each case, one exponent in the numerator, namely , would be larger than both exponents in the denominator. Instead, we argue that if and , then we have
[TABLE]
Insertion of specific values of from (6.6) and (6.8) into the above inequality concludes the proof. ∎
Lemma 10.2**.**
Suppose that , and that , and are as defined as in (5.2), (7.2) and (7.1), respectively. Then, for all , we have
[TABLE]
Proof.
Lemma 10.1 implies the following inequalities for and :
[TABLE]
We observe that the factor does not exceed for all , and it does not exceed for all . Therefore, the right-hand sides of (10.2) and (10.4) (where ) can be bounded above by
[TABLE]
Taking into account the inequality , we calculate
[TABLE]
Applying analogous arguments, and by using the inequality , we get
[TABLE]
Finally, using the fact that
[TABLE]
for , we conclude that
[TABLE]
and
[TABLE]
We finish the proof by using the condition in the above bounds, and by recalling (6.9) to draw a similar conclusion about . ∎
11. Bounding the tails
In order to bound the error term , we need bounds on as well as on . The results of previous section, along with (6.9), imply the inequalities
[TABLE]
We now turn our attention to the quotient .
The definitions of implies that
[TABLE]
The next lemma provides bounds on the factors on the right-hand side.
Lemma 11.1**.**
For all and , we have
[TABLE]
Proof.
It is straightforward to calculate
[TABLE]
where
We claim that
[TABLE]
The first inequality is proved by the algebraic manipulation
[TABLE]
while the second inequality can be obtained by taking in the inequality , which holds for all . ∎
An immediate consequence of Lemma 11.1 is the inequality
[TABLE]
In the lemma below, we provide an inequality concerning the last sum above.
Lemma 11.2**.**
Let such that , and . Then we have
[TABLE]
where
[TABLE]
Proof.
This sum has a closed form,
[TABLE]
We use the Cauchy–Schwarz inequality, observe that
[TABLE]
and
[TABLE]
to arrive at
[TABLE]
Comparing (11.6) with the claims of this lemma, we see that it suffices to prove that
[TABLE]
By routine manipulation, the above inequality is equivalent to
[TABLE]
Here, Lemma A.8 implies the inequality
[TABLE]
Comparing (11.7) and (11.8), we see that it remains to show that
[TABLE]
This is an immediate consequence of Lemma A.5 and the inequality
[TABLE]
Equation (11.10) can be directly verified for . If , we write , so that the inequality is equivalent to
[TABLE]
This follows finally from the fact that is decreasing on . ∎
Now we have all the tools in order to prove an upper bound for the quotient .
Proposition 11.3**.**
For all , , and , we have
[TABLE]
where , and
[TABLE]
Proof.
The condition ensures that . Therefore, Lemmas 11.1 and 11.2 imply that
[TABLE]
where we write and for simplicity of notation.
Now Lemma 11.2 allows us to do further estimation:
[TABLE]
where
[TABLE]
After substituting , we first note
[TABLE]
valid for . Then we use the fact that is decreasing with respect to if to estimate the factor in terms of :
[TABLE]
By exploiting the inequality
[TABLE]
for all and , and by taking
[TABLE]
we arrive at the expected result:
[TABLE]
where the last step uses the inequality , valid for and . ∎
In order to convert the above lemma into an upper bound for , we first note that is increasing with respect to . We estimate the integral in the definition of by making the substitution as in Proposition 11.3 and by splitting the integral at , as shown below:
[TABLE]
Suppose for now that and . By looking at various factors in the definition of , we observe that
[TABLE]
These observations enable us to conclude
[TABLE]
for all and . We define
[TABLE]
and obtain that
[TABLE]
At this point, we incorporate the factor in the definition of . We note that, using (8.3) and (A.12), we have
[TABLE]
In view of this upper bound, we prove some related monotonicity results.
Lemma 11.4**.**
Let be defined as in (11.11). For all and all , we have:
- •
The function
[TABLE]
is decreasing with respect to .
- •
If , then the function
[TABLE]
is also decreasing with respect to .
Proof.
By taking logarithmic derivatives with respect to , these claims are equivalent to the inequalities
[TABLE]
and
[TABLE]
In order to prove (11.16) and (11.17), we perform the following calculations:
- •
[TABLE]
- •
Again, Lemma A.6 imply that
[TABLE]
- •
We have
[TABLE]
therefore the right-hand side of (11.16) is at least .
- •
Since , we have
[TABLE]
and thus the right-hand side of (11.17) is at least . ∎
We are now ready to provide explicit upper bounds for .
Lemma 11.5**.**
Suppose that , and that , and are as defined as in (5.2), (7.2) and (7.1), respectively. Then, for all , we have
[TABLE]
Proof.
Making use of Lemmas 11.3 and 11.4 as well as of (11.12) and (11.13), and also noticing that is increasing with respect to , we infer
[TABLE]
Now we substitute and , and observe that and . Moreover, we use numerical integration to calculate
[TABLE]
Therefore, we infer that
[TABLE]
If this inequality is combined with (11.1) and (11.2), the proof is complete. ∎
12. Concluding the Proof
Having finally obtained upper bounds for all the error terms, we combine them to derive the main result of this paper.
Theorem 12.1**.**
The Borwein Conjecture is true for all .
Proof.
When , we can see that Thus, from Lemma 8.2 we infer
[TABLE]
Also, allows us to conclude that
[TABLE]
which results in explicit bounds for in Lemma 9.6,
[TABLE]
Remembering the estimations in Lemmas 10.2 and 11.5, we make a table of the upper bounds we have obtained so far:
From this table we can finally conclude that
[TABLE]
holds for all and , confirming the truth of the Borwein Conjecture in this range. ∎
13. Computer verification for
We have explicitly verified for all , and all and with and by using a computer. The program consists of calculating and checking the coefficients of by repeated polynomial multiplication, using the GMP library [9] for exact large-integer arithmetic. The computation was run at Johannes Kepler University in Linz, on a computer with 32 Intel Xeon processors at 2GHz (of which only 10 are used). The running time was 53 hours, and used up to 150 gigabytes of memory for storing all the coefficients.
14. Discussion
There are two more Borwein Conjectures mentioned in [1]: a “Second Borwein Conjecture” that also relates to modulus 3, and a “Third Borwein Conjecture” that relates to modulus 5.
Conjecture 14.1** (P. Borwein).**
Let the polynomials , and be defined by the relationship
[TABLE]
Then these polynomials have non-negative coefficients.
Conjecture 14.2** (P. Borwein).**
Let the polynomials , , , and be defined by the relationship
[TABLE]
Then these polynomials have non-negative coefficients.
Both these conjectures are still wide open. In particular, no reasonable formulas for the polynomials have been found so far. We remark that the comparison of (1.1) and (14.1) yields the relationship , so non-negativity for the coefficients of follows trivially from this paper.
Recall that for our proof we used the formulas for , and given in Theorem 1.2. As we mentioned, these formulas had apparently not caught much attention so far. It is rather a different type of formula that was found to be much more inspiring, namely (see [1, Theorem 3.1])
[TABLE]
where we used again the standard notation for -binomial coefficients. These are so much more imaginative because of their resemblance with a family of formulas appearing as generating functions for partitions with restricted hook differences in [2]. Andrews et al. had shown that
[TABLE]
is the generating function for certain partitions with restricted hook differences, with being non-negative integers satisfying and . Indeed, the generating function in (14.3) is the “special case” of (14.6) in which , , and . Similar observations hold for and . In other words, the result of Andrews et al. seems to produce a proof of the Borwein Conjecture, except for the small flaw that the choices of and are not integral, and thus not legal.
Bressoud [6] extended the mystery by making the following much more general conjecture.
Conjecture 14.3** (Bressoud [6, Conjecture 6]).**
Suppose that , and are positive rational numbers, and is a positive integer such that and are integers. If (with strict inequalities if ) and , then the polynomial
[TABLE]
has non-negative coefficients.
To this day, Bressoud’s conjecture has only been proved when (corresponding to the result of Andrews et al. [2] mentioned above), and some sporadic cases where the denominator of either or is a power of 2 (see [13, 14]). The connection to partitions with hook difference conditions lets one hope that a similar combinatorial interpretation may exist for the polynomials in the Borwein Conjecture, but to this day no such connection has been found.
Our approach for proving Theorem 12.1 has been analytic. The formulas that we just discussed, in particular the formulas (14.3)–(14.5) for , and , are unsuitable for asymptotic approximation. The reason is that each dominating term in the sums (14.3)–(14.5) has order , whereas the actual order of magnitude of , and is trivially bounded above by . In other words, in the sums (14.3)–(14.5), there is a huge amount of cancellations going on, which are seemingly impossible to control in order to find reasonable asymptotic estimates. As opposed to that, only the first term in the formulas in Theorem 1.2 contributes to the sum, as the other terms are asymptotically negligible, as we have shown.
We also mention the result of Li [11], which proves the positivity of the sum
[TABLE]
for all with , and furthermore establishes the asymptotics of this sum as . This result is in line with our estimation: the central coefficient of can be approximated by
[TABLE]
We are in fact very optimistic that our analytic approach will have further implications. It seems that it is possible to adapt our approach for a proof of Conjectures 14.1 and 14.2. It remains to be seen whether these ideas may also finally lead to a full proof of Bressoud’s Conjecture. Furthermore, we believe that they may also provide a basis for establishing open unimodality and log-concavity questions concerning polynomials given by products/quotients of factors of the form , as found for example in [7].
Acknowledgements
We are indebted to Prof. Manuel Kauers at JKU Linz for optimizing and running the verification program for .
Appendix: Auxiliary inequalities
This appendix contains several auxiliary inequalities used in the course of the proof. As their proofs are tedious, we put them here so as not to disturb the flow of the argument in the main text.
Lemma A.1**.**
For all such that and , we have
[TABLE]
Proof.
Let be the region
[TABLE]
All the rational functions on the left-hand sides of the inequalities are holomorphic on , so the maximum modulus principle means it suffices to prove the inequalities on the boundary
[TABLE]
The proof is done in a uniform way for all four rational functions (denoted by in the subsequent arguments): we choose uniformly distributed points on , and argue that
[TABLE]
holds for all , where denotes the convex hull of . Then we evaluate and use trivial upper bounds for to complete the proof. ∎
Lemma A.2**.**
Suppose that . Then we have
[TABLE]
Proof.
By substituting and , the claimed inequality is equivalent to
[TABLE]
Using a Taylor expansion of , we write the integral as a sum involving the lower incomplete gamma function :
[TABLE]
We denote the summand by
[TABLE]
and attempt to bound the summand from above.
- •
For , can be bounded above by .
- •
For , we first note that . This implies that the maximum value of on occurs at a point where .
By taking the derivative, we see that any such point satisfies
[TABLE]
By substituting this back into the expression for , we infer that
[TABLE]
Another derivative with respect to shows that this supremum occurs when , giving our final bound for :
[TABLE]
where the last step used Stirling’s approximation .
Directly using the upper bound (A.7), we get
[TABLE]
which is worse than what we claimed. Instead, we use (A.7) for the terms with , and conclude that
[TABLE]
As for the leftover terms where , we first give a crude bound for large by noticing that
[TABLE]
This inequality implies that if , then we have
[TABLE]
The interval is treated using the same method as in Lemma A.1, and the resulting upper bound is approximately .
Combining all the above arguments, we obtain
[TABLE]
Lemma A.3**.**
Suppose that . Then we have
[TABLE]
Proof.
By substituting and , the claimed inequality is equivalent to
[TABLE]
Using the Taylor expansion of , we write the integral as a sum involving the lower incomplete gamma function ,
[TABLE]
We denote the summand by
[TABLE]
and attempt to bound the summand from above. We first note that
[TABLE]
This means that the maximum value of on occurs at a point where .
By taking a derivative, we can see any such point satisfies
[TABLE]
substituting it back into the expression of , we are able to infer that
[TABLE]
Another derivative with respect to shows that this supremum occurs when , giving our final bound for :
[TABLE]
where the last step used Stirling’s approximation .
Similar to the proof of Lemma A.2, we use (A.10) on the terms with , and conclude that
[TABLE]
As for the term , we first note that
[TABLE]
therefore if . The interval is treated using the same method as in Lemma A.1, and the resulting upper bound is approximately .
Combining all the above arguments, we conclude that
[TABLE]
The next lemma deals with inequalities between sums of the form .
Lemma A.4**.**
For all and all , we have
[TABLE]
Proof.
For simplicity of notation, we use to denote the sum . The reader should observe that, for fixed , the sum can be evaluated into a rational function in by applying the binomial theorem.
The first inequality is proved by noticing that the coefficient is equal to for , and is negative for . Moreover, the sum of the coefficients is equal to
[TABLE]
So we have
[TABLE]
where is the sum of all positive coefficients in .
In order to prove the other inequalities, we give explicit formulas for the coefficients of the differences between both sides in those inequalities. More explicitly, after some tedious but routine calculations, we arrive at the following results:
- •
For (A.12), we have
[TABLE]
where for , and for .
- •
For (A.13), we have
[TABLE]
where , , for .
- •
For (A.14), we have
[TABLE]
where
[TABLE]
- •
For (A.15), we have
[TABLE]
where for , and for .
- •
For (A.16), we have
[TABLE]
where .
It is easy to see that , , , , and are non-negative. For the remaining ’s, we distinguish two cases:
- •
. Here we substitute with to see that
[TABLE]
and note that implies that every coefficient above is non-negative.
- •
. Similarly, we substitute to write
[TABLE]
In this case some of the coefficients (namely, the coefficients of , and of ) are negative. However, by exploiting the fact that and that
[TABLE]
we can still directly conclude that . ∎
Lemma A.5**.**
Let and . Then we have
[TABLE]
Proof.
Let . The lemma is equivalent to
[TABLE]
When , the difference between the two sides of (A.17) is
[TABLE]
We now proceed to prove that the left-hand side of (A.17) is increasing with respect to .
To this end, we calculate the derivative
[TABLE]
So it suffices to prove that
[TABLE]
or equivalently,
[TABLE]
Taking the logarithm on both sides, and defining and , we arrive at another equivalent form,
[TABLE]
The left-hand side can be written as a triple integral,
[TABLE]
and we conclude the proof by noting that
[TABLE]
The following two inequalities are used in the proof of Theorem 12.1.
Lemma A.6**.**
Suppose that , and . Then we have
[TABLE]
Proof.
Direct calculation reveals that
[TABLE]
Therefore, the lemma follows from the elementary inequality
[TABLE]
Lemma A.7**.**
Suppose that , and with . Then we have
[TABLE]
Proof.
First of all, the condition implies that , as well as .
Noting that the function is increasing with respect to , we conclude that
[TABLE]
Thus it suffices to prove that for . To this end, we prove that is decreasing on by calculating the derivative. We have
[TABLE]
where we exploit the convexity of the function . ∎
The final inequality in the appendix gives a simple rational lower bound for the Chebyshev polynomials of the first kind , defined by .
Lemma A.8**.**
For all and all , we have
[TABLE]
Proof.
If , then both sides are equal to . From now on we assume .
If , then we have
[TABLE]
If , then we write , so that
[TABLE]
The two sides of the inequalities can be rewritten as
[TABLE]
and
[TABLE]
Thus it suffices to prove that
[TABLE]
for all and .
For the last inequality, we make use of the well-known inequalities
[TABLE]
to conclude that
[TABLE]
The last factor is clearly decreasing with respect to when , so we proceed to find an upper bound for . We note that
[TABLE]
and after substituting we see that the derivative is equal to . So is decreasing with respect to . This implies that we always have
[TABLE]
Using this bound, we conclude that
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. E. Andrews , On a conjecture of Peter Borwein , J. Symbolic Comput., 20 (1995), pp. 487–501. Symbolic computation in combinatorics Δ 1 subscript Δ 1 \Delta_{1} (Ithaca, NY, 1993).
- 2[2] G. E. Andrews, R. J. Baxter, D. M. Bressoud, W. H. Burge, P. J. Forrester, and G. Viennot , Partitions with prescribed hook differences , European J. Combin., 8 (1987), pp. 341–350.
- 3[3] W. N. Bailey , Some identities in combinatory analysis , Proc. London Math. Soc. (2), 49 (1947), pp. 421–425.
- 4[4] A. Berkovich and F. G. Garvan , Dissecting the Stanley partition function , J. Combin. Theory Ser. A, 112 (2005), pp. 277–291.
- 5[5] A. Berkovich and S. O. Warnaar , Positivity preserving transformations for q 𝑞 q -binomial coefficients , Trans. Amer. Math. Soc., 357 (2005), pp. 2291–2351.
- 6[6] D. M. Bressoud , The Borwein conjecture and partitions with prescribed hook differences , Electron. J. Combin., 3 (1996), pp. Research Paper 4, approx. 14. The Foata Festschrift.
- 7[7] W. Y. C. Chen, C. J. Wang, and L. X. W. Wang , The limiting distribution of the coefficients of the q 𝑞 q -Catalan numbers , Proc. Amer. Math. Soc., 136 (2008), pp. 3759–3767.
- 8[8] P. Flajolet and R. Sedgewick , Analytic combinatorics , Cambridge University Press, Cambridge, 2009.
