Fermat's polygonal number theorem for repeated generalized polygonal numbers
Soumyarup Banerjee, Manav Batavia, Ben Kane, Muratzhan Kyranbay,, Dayoon Park, Sagnik Saha, Hiu Chun So, Piyush Varyani

TL;DR
This paper extends Fermat's polygonal number theorem to include repeated generalized polygonal numbers, establishing minimal and optimal bounds for representing all positive integers.
Contribution
It generalizes Fermat's theorem to repeated generalized polygonal numbers and determines minimal and optimal bounds for their representations.
Findings
Derived minimal number of generalized m-gonal numbers for all positive integers
Established bounds for representations with repeated generalized m-gonal numbers
Generalized Fermat's theorem to broader classes of polygonal numbers
Abstract
In this paper, we consider sums of generalized polygonal numbers with repeats, generalizing Fermat's polygonal number theorem which was proven by Cauchy. In particular, we obtain the minimal number of generalized -gonal numbers required to represent every positive integer and we furthermore generalize this result to obtain optimal bounds when many of the generalized -gonal numbers are repeated times, where is fixed.
| Interval with | Bounds on | |||
|---|---|---|---|---|
| , odd | ||||
| , even | ||||
| lower bound | |||
|---|---|---|---|
| for | |||
| all | |||
| all |
| Interval of | bound | |
|---|---|---|
| {2m-6,2m-5} | {2m-6,2m-5} | |
| Interval of | bound | |
|---|---|---|
| or | ||
| Interval of | bound | |
|---|---|---|
| or | ||
| or | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fermat’s polygonal number theorem for repeated generalized polygonal numbers
Soumyarup Banerjee
Department of Mathematics, University of Hong Kong, Pokfulam, Hong Kong
,
Manav Batavia
Department of Mathematics, Indian Institute of Technology Bombay, Powai, Mumbai, Maharashtra 400076, India
,
Ben Kane
Department of Mathematics, University of Hong Kong, Pokfulam, Hong Kong
,
Muratzhan Kyranbay
Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Kowloon, Hong Kong
,
Dayoon Park
Department of Mathematical Sciences, Seoul National University, Seoul 151-747, Republic of Korea
,
Sagnik Saha
Department of Mathematics, Indian Institute of Science Education and Research, Thiruvananthapuram, Vithura, Kerala 695551, India
,
Hiu Chun So
Department of Mathematics, University of Hong Kong, Pokfulam, Hong Kong
and
Piyush Varyani
Department of Mathematics, Indian Institute of Technology, Roorkee, Roorkee, Uttarakhand 247667, India
Abstract.
In this paper, we consider sums of generalized polygonal numbers with repeats, generalizing Fermat’s polygonal number theorem which was proven by Cauchy. In particular, we obtain the minimal number of generalized -gonal numbers required to represent every positive integer and we furthermore generalize this result to obtain optimal bounds when many of the generalized -gonal numbers are repeated times, where is fixed.
Key words and phrases:
Fermat’s polygonal number theorem, polygonal numbers, Diophantine equations, universal quadratic polynomials
2010 Mathematics Subject Classification:
11E12,11E25,11E08
The research presented here was conducted while the second, fourth, sixth, seventh, and eighth authors were undergraduate researcher assistants at the University of Hong Kong and they thank the university for its hospitality. The internship of the second author was additionally supported by the Hong Kong Indian Chamber of Commerce, who he thanks for their generous support. The research of the third author was supported by grants from the Research Grants Council of the Hong Kong SAR, China (project numbers HKU 17316416, 17301317, and 17303618).
1. Introduction
Fermat famously conjectured in 1638 that every positive integer may be written as the sum of at most -gonal numbers; that is, for (the -th -gonal number, where with ) there exists an such that
[TABLE]
for every ; we call a Diophantine equation which represents every positive integer universal. The case of Fermat’s claim was Lagrange’s celebrated four squares theorem, proven in 1770, Gauss famously proved the case, sometimes known as the Eureka Theorem, in 1796, and Cauchy finally resolved the general case in 1813 [5]. Guy [7] investigated the question of the optimality of Fermat’s polygonal number theorem. That is to say, for which is the sum
[TABLE]
universal? More generally, Guy [7] considered sums of the type (1.1) with more general inputs ( with is known as a generalized -gonal number) and used a simple argument based on the fact that the smallest generalized -gonal number other than [math] and is to show that for , while Cauchy’s theorem implies that the minimal choice satisfies . Comparison of Guy’s and Cauchy’s theorems hence leaves a small gap between the upper and lower bounds. In this paper, we ask where the true answer lies within this gap in the case of generalized -gonal numbers. For and , consider the sum ()
[TABLE]
One may think of this as a weighted sum of generalized polygonal numbers or as a sum of generalized polygonal numbers where the first generalized -gonal number is repeated times, the second is repeated times, and so on. Using this second interpretation, we see by Guy’s work [7] that if is universal, then ; an upper bound for is not clear, however. We consider the specific case when
[TABLE]
where is repeated times and is repeated times. Let denote the minimal for which (1.1) is universal when we more generally allow and similarly for and denote the optimal minimal choice for which the sum of generalized -gonal numbers defined in (1.2) is universal. Our main result is the following.
Theorem 1.1**.**
- (1)
For we have
[TABLE] 2. (2)
For we have
[TABLE] 3. (3)
We have
[TABLE]
Remarks*.*
- (1)
Using Guy’s argument, for sufficiently large (depending on ), if is universal, then one must have since otherwise the integers from to cannot all be represented by the form . Hence Theorem 1.1 (2) is optimal in the aspect. The restriction on is chosen so that we have at least variables which are not repeated. The cases hence require more delicate care and lead to weaker results in terms of the dependence on . Indeed, a more careful case-by-case checking shows that one may take for , for , and for , but we have chosen the weaker restrictions on appearing in Theorem 1.1 (3) in order to present the proof in a more systematic way. These improved lower bounds for form a theoretical limit on the extent to which the method in this paper may be applied; that is to say, reducing the bound on beyond the stated bounds , , , , and for , , , , and , respectively, would require a different method than the one presented in this paper (or at least a serious modification that likely depends on the choice of ) because we would not have enough variables to apply a crucial lemma that applies to the generic case. Motivated by this, the second, fourth, sixth, seventh, and eighth authors [1] have relaxed the conditions to in order to guarantee at least such variables for , thereby extending the method in this paper to compute without any restriction on or . 2. (2)
The second restriction in Theorem 1.1 (2) is somewhat artificial. Namely, if , then we have generalized -gonal numbers preceding the -times repeated generalized -gonal numbers, and the original terms are already universal by Theorem 1.1 (1). 3. (3)
The method used in this paper does not work for the cases in Theorem 1.1 (1). A certain modification of Lemma 2.2 might work for , but the case seems to require a different method because the dimension is too small to use a modification of Lemma 2.2. Together with K.-L. Kong, the first and sixth authors are investigating the usage of modular forms techniques to resolve these remaining cases.
The case in Theorem 1.1 (3) is exceptional both because and because the dependence on for is vastly different than the generic dependence on large in Theorem 1.1 (2). The primary reason for this is the fact that
[TABLE]
Because of this, it turns out that either or is not represented by for . Guy exploited a similar property for in order to obtain the lower bound .
This special behaviour of the integers or brings up an interesting discussion about general forms with arbitrary . Generalizing the diagonal case of the Conway–Schneeberger fifteen theorem, Liu and the third author [10] proved that there exists a unique minimal such that is universal if and only if it represents every . It was shown in [10] that , and this was improved by the fifth author and Kim [11], who showed that there exists an absolute constant such that . It is natural to wonder about the optimal choice of (perhaps only holding for sufficiently large). The case leads to the conclusion that unless , in which case .
Corollary 1.2**.**
If , then we have
[TABLE]
Remark*.*
Using techniques from the arithmetic theory of quadratic forms, the constant has been explicitly computed for some small . In particular, we have by Bosma and the third author [4], by the Conway–Schneeberger fifteen theorem [6, 2], by Ju [8], and by Ju and Oh [9]. In light of the work in [11] and the lower bound in Corollary 1.2, it may be interesting to systematically investigate other choices of in order to obtain an improvement on the lower bound for .
The paper is organized as follows. In Section 2 we give some helpful preliminary information about quadratic forms and quadratic polynomials. In Section 3 we prove Theorem 1.1 (1) and Theorem 1.1 (2), giving the stronger version of Fermat’s polygonal number theorem in the case and its generalization for large . Finally, in Section 4, we consider small choices of , for which a different technique is necessary, and the resulting bound for given in Corollary 1.2.
Acknowledgements
The authors thank Min-Joo Jang and Sudhir Pujahari for helpful conversations and the anonymous referee for a careful reading of the paper.
2. Preliminaries
The sums of polygonal numbers appearing in (1.2) are a special case of a natural class of functions known as quadratic polynomials. In order to define these, recall that a homogeneous polynomial of degree is known as a quadratic form. If whenever , then we call integer-valued, and it is moreover known as integral if the associated Gram matrix (i.e., the matrix for which ) has integer coefficients (warning: in different contexts, authors write , so one needs to be careful about a factor of whenever comparing in the literature). We call such a quadratic form positive-definite if it only attains non-negative values and vanishes if and only if . A totally-positive quadratic polynomial is a function of the form
[TABLE]
where is a positive-definite quadratic form, is a linear function defined over , and is a constant, such that for all and if and only if . We furthermore assume that attains integer values for .
For a totally-positive quadratic polynomial , we set
[TABLE]
Note that if is a quadratic form with associated Gram matrix , then for each matrix satisfying
[TABLE]
and each such that , we have
[TABLE]
We call an automorph of and set to be the number of automorphs of . The matrix is a special case of an isometry between two quadratic forms and ; we say that is isometric to over a ring if there exists such that , where and are the Gram matrices of and , respectively. The set of isometry classes of a given discriminant is finite
The first check for representations of by a quadratic polynomial is to test local conditions. Namely, if is not solvable with for some prime (or, equivalently, modulo for some ), then clearly is not solvable with . An integer is said to be locally represented if it is represented over for all primes . Minkowski began the study of the local-global principle; this asks for which locally-represented integers do global representations (representations over the integers in this setting) exist. Siegel defined a natural weighted average
[TABLE]
where the sum runs over all of the isometry classes of positive-definite quadratic forms which are isometric to over for all (the set of such forms is known as the genus of and is known as the class number of ). Siegel [12, 13] and Weil [15] then computed so-called local densities (roughly speaking, these “count” the number of representations over and vanish precisely when no such representations exist) to give an explicit formula for . We need only the following well-known special form of their results.
Theorem 2.1** (Siegel, Weil).**
We have that if and only if is locally represented. Moreover, if the class number of is one, then if and only if is locally represented.
The following lemma plays a crucial role in the proof of Theorem 1.1.
Lemma 2.2**.**
The sum represents every integer in the set .
Proof.
Consider in the hyperplane . For in this hyperplane, we have
[TABLE]
The quadratic form has class number one and represents every integer locally, and is hence universal by Theorem 2.1 (alternatively, one may simply use the -theorem of Bhargava and Hanke [3] and verify that every integer up to is represented by this quadratic form, and thus the form is universal).
∎
3. The extension of Fermat’s polygonal number theorem for and large
In this section, we prove parts (1) and (2) of Theorem 1.1, giving the generalization of Fermat’s polygonal number theorem answering Guy’s question and covering the generic case for .
3.1. The case
We next make use of Lemma 2.2 in order to prove Theorem 1.1 (1).
Proof of Theorem 1.1 (1).
The case was proven by Gauss, the case was proven by Lagrange, and Guy [7] uses Legendre’s classification of the integers which are sums of three squares to resolve the case. Guy also points out that the set of generalized hexagonal numbers is precisely the set of triangular numbers and hence the case follows from the case. The case is proven by Sun in [14, Theorem 1.1].
Now assume that . Since we know that by Guy’s work in [7], it suffices to prove that for every integer can be written as the sum of generalized -gonal numbers. Let be given and write it as
[TABLE]
with . By Lemma 2.2, every multiple of may be written as the sum of generalized -gonal numbers. Hence if may be written as a sum of generalized -gonal numbers, then we may choose for which and , yielding the claim. This is possible for and (because ).
It remains to consider the cases . We thus write with . For we may write
[TABLE]
from which we conclude that may be written as the sum of generalized -gonal numbers (again using Lemma 2.2). If (i.e., which is automatically true for ), then we see that is represented as long as . On the other hand, if , then we note that
[TABLE]
and write
[TABLE]
Using Lemma 2.2, may hence be written as the sum of generalized -gonal numbers as long as .
It remains to show that may be represented in the finitely many cases (resp. ) when (resp. ), with . First suppose that . For we write
[TABLE]
For (in particular, since and , this holds for ) we see that may be represented by using choices of and choices of , for which we need (using that and )
[TABLE]
variables. The result follows except for the case , , and , for which one may check by hand that may be written as the sum of 6 generalized -gonal numbers.
In the remaining cases, we have and . There are hence only a finite number of which need to be checked, and this may be done by hand.∎
Remark*.*
After reducing the proof to a check of finitely-many cases, we simply check the remaining cases by hand for and . One may instead drop the restriction (leaving arbitrary as a variable) and use (the following list is complete for because the sequence is increasing for )
[TABLE]
to systematically write (thinking of this as a polynomial in ) as a linear combination of the polynomials occurring in (3.1) for each choice of .
3.2. Inequalities for large
For and , we write in the form
[TABLE]
where and . In order to obtain an upper bound, we need the following extension of Lemma 2.2.
Lemma 3.1**.**
Suppose that . For and , the integer is represented by the sum of at most generalized -gonal numbers unless and .
Proof.
Using Lemma 2.2, we may represent with the first variables. If , then we may represent by taking for the remaining generalized -gonal numbers.
Now suppose that . We note that
[TABLE]
Hence in particular we have
[TABLE]
We may therefore rewrite (3.2) as
[TABLE]
and for we conclude that may be represented with the first variables.
It remains to show that is represented for the cases and . For all of these cases other than the exceptional cases
[TABLE]
we may use (3.1) (thinking of as a polynomial in ) to find a representation
[TABLE]
in variables. We encode the representations in a graph in the following manner. Write the numbers in rows and columns, where the position corresponds to . If we have a representation
[TABLE]
in variables and , then in the location of the graph we write to indicate that we have a representation of in variables where we take . One may then reconstruct the representation of by recursively working backwards through the graph; for example, if we have , then we obtain the representation by looking at and continuing recursively until we have . To summarize, one traverses backwards through the graph as follows:
[TABLE]
This yields the following graph encoding the representations (we add the unnecessary entries in order to include the representations of some integers in the th column)
[TABLE]
∎
For the exceptional cases and , we use the following lemma.
Lemma 3.2**.**
If , , and satisfies and , then
[TABLE]
may be represented by
[TABLE]
In particular, we may take .
Proof.
For some we have
[TABLE]
If , then
[TABLE]
is the sum of generalized -gonal numbers. Hence if the system of equations
[TABLE]
holds, then we are done. If , then since we may take . For the inequality implies that , and hence we may take in that case. Finally, if , then , so we may take in this case.
∎
We are now ready to obtain an upper bound for for large .
Proposition 3.3**.**
If , then we have .
Proof.
The claim is equivalent to proving that is universal for .
Since , we may represent with the -times repeated variables all having (i.e., ), and Lemma 3.1 implies that may be represented by the initial variables unless and .
We finally deal with the cases with and . If , then Lemma 3.2 implies that is represented.
It remains to resolve the case for and . In other words, we need to check the representations of the 10 integers . We write
[TABLE]
for some . This gives
[TABLE]
If , then we are done because . If , then we may write (note that because otherwise , which contradicts the assumption that )
[TABLE]
Noting that , we are done as long as
[TABLE]
with the last inequality coming from the fact that must be represented by generalized -gonal numbers. As in the proof of Lemma 3.2 this holds for some .
We finally deal with the case . Since in this case, we have and hence . In this case, we rewrite
[TABLE]
Writing with , we are done as long as
[TABLE]
with the last inequality coming from the fact that we must write as the sum of at most generalized -gonal numbers. Setting if and otherwise, we claim that satisfies the above system of inequalities. Since , the second inequality automatically holds. The first inequality
[TABLE]
holds because , with if and only if . The third inequality becomes
[TABLE]
Note that since , we have unless . In the exceptional case and , we have
[TABLE]
Otherwise, we have , , and , so we find that
[TABLE]
and the claim follows. ∎
We next use Guy’s argument to obtain a lower bound for .
Proposition 3.4**.**
We have .
Proof.
Following Guy [7], if is universal, then it must necessarily represent . We write
[TABLE]
By (3.1), we have , , and for , so any representation of may only contain . Thus (3.4) yields the inequality
[TABLE]
from which we conclude that
[TABLE]
This yields the claim. ∎
We are now ready to Prove Theorem 1.1 (2).
Proof of Theorem 1.1 (2).
The upper bound in Proposition 3.3 and the lower bound in Proposition 3.4 match unless . In the remaining case, we write and note that the claim is equivalent to proving that is universal for . Recall the presentation (3.2) of . By Lemma 3.1, if , then may be represented by the initial generalized -gonal numbers and we only require variables to represent , unless and . For and , we use Lemma 3.2 to see that is represented with unless , while for we use the splitting (3.3) (with ) with to obtain a representation.
For , rewrite
[TABLE]
Again using Lemma 3.1, we see that is represented by generalized -gonal numbers unless ( and ) or . We use (3.3) in the case . In the case of , we then rewrite
[TABLE]
In this case, Lemma 3.1 implies that is represented unless , while Lemma 3.2 with yields the claim for .
∎
4. Small choices of
In this section, we consider cases for small .
Proof of Theorem 1.1 (3).
We first assume that and . Note that since , any representation of by must have , or in other words , yielding the lower bound .
It remains to show that the form with is indeed universal. We present in the form
[TABLE]
with and (so precisely attains every residue modulo once). By Lemma 2.2, we may represent as a sum of the type . If , then we conclude that may be represented by .
We next consider . Choosing such that , we may rewrite (4.1) as
[TABLE]
Setting , the inequalities for imply that
[TABLE]
Since is an integer, we conclude that . For we are done by Lemma 2.2, and for we use Lemma 3.1 (choosing ) to conclude that may be written as the sum of at most (because we have ) generalized -gonal numbers unless .
We next consider the cases . In this case we write
[TABLE]
and, since , Lemma 3.1 (choosing ) implies that may be written as the sum of at most (because we have ) generalized -gonal numbers unless .
Note that . It remains to show that for each and and there exists a representation
[TABLE]
For each of the form (4.2), we claim that we may choose (with ) so that and
[TABLE]
Note that if we may choose in this way, then since
[TABLE]
is even and less than , we may write as a sum of at most twos, giving a representation of with variables. It remains to choose the first of the s appropriately.
We collect the choices of the set such that , and the corresponding bounds on in Table 4.1. The bounds on are proven, for example, for and by writing (using )
[TABLE]
Recalling that , we have (in general, it suffices to show that ), and we see that (4.3) holds.
We next consider the case. In this case, first note that Theorem 1.1 (1) implies that represents every element of . Taking , we get a representation of every positive integer.
We note that any representation of must have and hence for the congruence implies that so that
[TABLE]
Dividing by and using Guy’s argument [7] again, this implies that , or in other words .
For , we take and similarly note that any representation of
[TABLE]
must have , again implying that .
For , we take and similarly note that, since , any representation of
[TABLE]
must have , , and , from which we conclude that .
We note that any representation of must satisfy
[TABLE]
From this we can conclude that represents every positive integer up to except integers in
[TABLE]
By Lemma 2.2, with , and is represented by .
On the other hand, for each we write
[TABLE]
with and and we see that except for , in which case . Thus for every , Lemma 3.1 implies that is represented as times the sum of at most generalized -gonal numbers for . Using (3.1) we may check the smaller choices of directly. For the remaining case , we write
[TABLE]
Thus, using Lemma 3.1 to represent , for every we may represent as long as . There remain finitely many choices of for each and we check these as in the case. Now suppose that . We first obtain lower bounds for by using Guy’s argument [7] for the exceptional choices of in Table 4.2.
We define sets by
[TABLE]
The sets are precisely the integers less than which are represented by .
For each , we choose and with minimal such that
[TABLE]
By Lemma 2.2, we obtain a representation of with variables, taking for the last variables. If with as given in the statement of the theorem, then may be represented. We check in Tables 4.3, 4.4, and 4.5 that . For those , we rewrite
[TABLE]
Having chosen large enough so that , we see that for we have
[TABLE]
We then choose . If , then (4.4) gives a representation of with variables. On the other hand, if , then Lemma 2.2 may be employed to represent and we obtain a representation of in (4.4) with variables.
∎
We are now ready to conclude the corollary.
Proof of Corollary 1.2.
The proof of Theorem 1.1 (3) immediately implies Corollary 1.2 because either or is not represented by for , but one can see that every smaller integer is represented by . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Batavia, M. Kyranbay, S. Saha, H. C. So, and P. Varyani, Fermat’s polygonal number theorem with repeats , in preparation.
- 2[2] M. Bhargava, On the Conway–Schneeberger Fifteen Theorem , Contemp. Math. 272 (2000), 27–38.
- 3[3] M. Bhargava and J. Hanke, Universal quadratic forms and the 290 290 290 -theorem , Invent. Math., to appear.
- 4[4] W. Bosma and B. Kane, The triangular theorem of eight and representations by quadratic polynomials , Proc. Amer. Math. Soc. 141 (2013), 1473–1486.
- 5[5] A.-L. Cauchy, Démonstration du théorèm général de Fermat sur les nombres polygones , Mém. Sci. Math. Phys. Inst. France 14 (1813–1815), 177–220; Oeuvres complètes VI (1905), 320–353.
- 6[6] J. H. Conway, Universal quadratic forms and the fifteen theorem , Contemp. Math. 272 (2000), 23–26.
- 7[7] R. K. Guy, Every number is expressible as the sum of how many polygonal numbers? , Amer. Math. Monthly 101 (1994), 169–172.
- 8[8] J. Ju, Universal sums of generalized pentagonal numbers , Ramanujan J. 51 (2020), 479–494.
