Prime number conjectures from the Shapiro class structure
Hartosh Singh Bal, Gaurav Bhatnagar

TL;DR
This paper explores the height function related to Euler's totient iterates, extending Shapiro's work, and presents conjectures linking it to prime counting functions and prime numbers.
Contribution
It provides a formula for the height function, extends results on odd heights, and conjectures new relationships between height and prime number functions.
Findings
Extended results on the largest odd numbers at a height
Formulated conjectures relating height to prime counting functions
Presented computational evidence supporting these conjectures
Abstract
The height of , introduced by Pillai in 1929, is the smallest positive integer such that the th iterate of Euler's totient function at is . H. N. Shapiro (1943) studied the structure of the set of all numbers at a height. We state a formula for the height function due to Shapiro and use it to list steps to generate numbers at any height. This turns out to be a useful way to think of this construct. In particular, we extend some results of Shapiro regarding the largest odd numbers at a height. We present some theoretical and computational evidence to show that and its relatives are closely related to the important functions of number theory, namely and the th prime . We conjecture formulas for and in terms of the height function.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalytic Number Theory Research · Advanced Mathematical Identities · Advanced Mathematical Theories
**PRIME NUMBER CONJECTURES FROM
THE SHAPIRO CLASS STRUCTURE
Hartosh Singh Bal
**The Caravan, Jhandewalan Extn., New Delhi, India
**Gaurav Bhatnagar111Current address: School of Physical Sciences, Jawaharlal Nehru University, Delhi, India
2010 Mathematics Subject Classification: Primary 11A25; Secondary 11A41, 11N37
Keywords and phrases: Euler’s totient function, Iterated totient function, Prime numbers, Multiplicative Functions, Chebyshev’s theorem, Shapiro Classes
**Fakultät für Mathematik, Universität Wien, Vienna, Austria.
Abstract
The height of , introduced by Pillai in 1929, is the smallest positive integer such that the th iterate of Euler’s totient function at is . H. N. Shapiro (1943) studied the structure of the set of all numbers at a height. We state a formula for the height function due to Shapiro and use it to list steps to generate numbers at any height. This turns out to be a useful way to think of this construct. In particular, we extend some results of Shapiro regarding the largest odd numbers at a height. We present some theoretical and computational evidence to show that and its relatives are closely related to the important functions of number theory, namely and the th prime . We conjecture formulas for and in terms of the height function.
1 Introduction
The principal object of our investigation is a number theoretic function that we call the height function. It is defined as follows. Let , and
[TABLE]
for . Here denotes Euler’s totient function, the number of positive integers less than which are co-prime to . The first few values of are: . Let be the set of numbers at height , that is,
[TABLE]
We call the Shapiro classes in honor of Harold N. Shapiro [9] who studied these classes first. In this paper, we examine the Shapiro class structure, and show that the height function and its relatives are very closely related to the important functions of number theory, namely , the number of primes less than or equal to , and the th prime number.
Shapiro arrived at these classes by considering iterates of Euler’s totient function. For , we denote by
[TABLE]
the th iterate of . Then the height function can also be defined as follows. Let . For , let be the smallest number such that . The height function (with this definition) was studied first by Pillai [7]. The Shapiro classes (by other names) have been studied earlier by Shapiro [9], Erdős, Granville, Pomerance and Spiro [3], and others [2, 4, 5, 6]. We have mildly modified Shapiro’s formulation.
In Section 2 we find an alternative, inductive, approach to generate Shapiro classes. To do so, we require a formula for due to Shapiro. In Section 3, we apply our ideas to extend a theorem of Shapiro about the largest odd numbers at a given height. This rests upon a property that is obvious from our construction, but not observed earlier, regarding the largest possible prime number at a height. This, and results of Erdős et.al. [3], led us to look for number theoretic information from this structure.
Since is not an increasing function, we consider the sum of heights function, defined as: , and
[TABLE]
The structure consisting of Shapiro classes allows us to obtain number theoretic information quite easily. It appears that elementary techniques, such as those found in the textbooks of Apostol [1] and Shapiro [10], can be modified to express classical theorems in terms of functions related to the height function. We illustrate this idea in Section 4, by proving Chebyshev-type theorems, that is, inequalities for and in terms of and .
The function appears to be important to the theory of prime numbers. Consider, for example, the values of for . These are
[TABLE]
This pattern persists; consider Figure 1, a plot of and on the same set of axes. There is remarkable agreement of these graphs (up to ). From numerical computations (see Tables 1 and 5), it appears that is a better approximation to than , at-least until .
This ties up with a result/conjecture of Erdős, Granville, Pomerance and Spiro [3] and motivates the experimental work presented in Section 5.
2 Listing Shapiro classes
We begin our study of the height function and Shapiro classes. The objective of this section to build some useful intuition, by writing a set of rules to generate Shapiro classes inductively. Towards this end, we first prove a formula due to Shapiro for calculating the height of a function. As a corollary, we show the additive nature of the height function. We illustrate the usefulness of these rules by obtaining several elementary properties of the Shapiro classes.
It is instructive to compute the first few classes. Table 2 gives the first four Shapiro classes. We begin with in and in . Now since , we find that . So . Similarly, we see that has height , has height , and has height .
By computing the values of for a few more values, a few rules for finding members of the Shapiro class become evident. For example, consider the following rules for finding members of from . We have
- •
If is an odd number, then the height of is the same as .
- •
If is an even number, then .
- •
If is an odd number, then .
- •
If is a prime number, then . Thus is an even number at one height lower than .
We can use these rules to generate the members of . If we multiply all the even numbers of by , we obtain , , , , . Now multiplying all the odd numbers by , we find that , , are in . Next, consider , , , , . Of these, , , and , are primes, and are thus at height . Finally, the odd numbers already obtained are: . Multiplying them by , we find that and are also in . The reader may verify that we have obtained all the numbers at height .
The rules work to generate all the elements of from , but they are not comprehensive. The complete set of rules will appear shortly, as an application of a formula for .
Theorem 1** (Shapiro [9]).**
Let with prime factorization , where , , , , are distinct odd primes, and , . Then,
[TABLE]
Before giving a proof of this theorem, we obtain a corollary which indicates the additive nature of the height function.
Corollary 1** (Shapiro [9]).**
Let and be natural numbers with prime factorizations and where . Then we have
[TABLE]
Remark**.**
Shapiro [9] considered Corollary 1 as his fundamental theorem. The formula (3) follows immediately. The proof presented below is closely related to Shapiro’s proof of Corollary 1.
Proof.
Note that if both and are strictly positive, then
[TABLE]
as required. Similarly, the second part of the formula follows by considering the cases where only one of and is [math], and where both and are [math]. ∎
Proof of Theorem 1.
We prove this formula by induction on the number of primes in the prime factorization of . Remarkably, we will require the computation in the corollary to prove the theorem.
First we prove the formula for numbers of the form , by induction. Clearly, the formula holds for . Suppose it holds for . Then
[TABLE]
by the induction hypothesis. This proves the formula for powers of .
Next, we consider numbers of the form . Consider first the case . Again we will prove this formula by induction, this time on . It is easy to verify the formula for . Next suppose the formula is true for . For , we have
[TABLE]
as required.
Next we consider the case . In this case, we can obtain the formula from the previous case. Since , We have
[TABLE]
which is the required formula.
Next, as the induction hypothesis, we assume that the formula is valid for numbers of the form
[TABLE]
where are the first odd primes. We will show the formula for
[TABLE]
We first consider the case . Again we will prove this by induction on . Let
[TABLE]
We will first prove the formula for . Then,
[TABLE]
so
[TABLE]
The last equality follows by the argument in Corollary 1 applied to even numbers that have only , , , in their prime factorization. (By the induction hypothesis, we have assumed that (3) holds for such numbers, and so this argument can be used in our proof.) Thus we find that
[TABLE]
as required.
Next, we suppose the formula is true for . We will prove it for
[TABLE]
Again, let , , , be as before. Then we have
[TABLE]
which proves the formula for .
Finally, we prove the formula for and . Let
[TABLE]
Now
[TABLE]
where the last step is obtained as in the case. From here we obtain the formula by applying the induction hypothesis. ∎
We return to our set of rules to generate the numbers at a given height from the previous ones. Corollary 1 tells us what happens if we multiply a number by . Recall that the height of is . So if is any number, then . Thus to obtain numbers at height we have to multiply numbers at height by and the other prime at height , namely . In general, to obtain all the numbers at a height, we have to consider all the primes at lower heights. We are now in a position to create a comprehensive list of rules to generate .
Let denote the set of primes at height . That is, for , define
[TABLE]
For example:
[TABLE]
Theorem 2**.**
Suppose , , are known. The steps for obtaining , that is, the elements at height are as follows
Multiply each even element of by . 2. 2.
*Multiply each odd element of by . * 3. 3.
Multiply each element of by elements of , where 4. 4.
If is a prime where is an even number in , then . 5. 5.
Multiply each odd number obtained already in by .
Remark**.**
Shapiro considered a slightly different function , where and , for . So whereas . Theorem 2 is the primary reason why we have deviated from Shapiro’s formulation.
Proof.
The proof is a formalization of our earlier discussion. Consider the following statements.
If is even, then . 2. 2.
if , then . 3. 3.
if and then .
All these statements follow from Corollary 1.
Step 1 generates all even numbers of the form where , and is odd, i.e., numbers divisible by . The steps 2 and 3 will generate all composite, odd, numbers in .
Since , and for all odd primes is even, thus all prime numbers are obtained by adding to even numbers in . This explains Step 4.
It remains to generate numbers of the form where is odd. Since when is odd, we must multiply each odd number obtained by steps 1–4 by . This will generate all the elements of . ∎
Remark**.**
In Step 3, we need to multiply only those elements of that are odd and not divisible by .
Now it is easy to generate some more sets . The elements of the first few sets are given in Table 3. The prime numbers are given in bold.
Theorems 1 and 2 are very useful to think about the . We illustrate this in the following observations. These have been found previously by other authors.
Observations
The number comes at height . 2. 2.
The smallest even number at height is . To see this, consider the following argument.
An even number in can arise in two ways. If it is obtained by multiplying an element of by , it is bigger than or equal to by induction. The other possibility is that it is of the form , where is an odd number at height .
If is a prime number, then it is obtained by adding to an even number at height , so it is bigger than by induction. This implies .
Suppose is an odd, composite number with prime factorization
[TABLE]
with . Then by Theorem 1, implies that
[TABLE]
But by induction, as above, we must have for . Thus
[TABLE]
Again, this implies that . 3. 3.
Any odd number in is bigger than . This follows from the above argument. 4. 4.
If , then . This is what the two items above amount to; this was implicit in Pillai [7], and stated by Shapiro [9]. 5. 5.
(Shapiro [9]) The numbers at height that are less than are all odd. 6. 6.
(Shapiro [9]) The largest odd number at height is . This follows from induction and Theorem 2. 7. 7.
(Pillai [7]) The largest even number at height is . 8. 8.
(Pillai [7]) Since any number at height is between and , we have the inequalities:
[TABLE]
All the observations above were noted by previous authors. One can ask whether Theorem 2 gives any new information. Indeed, there is one very important observation missed by previous authors.
The largest prime at a level is less than or equal to . This is obvious from Step 4 of Theorem 2 and Pillai’s observation that the largest even number at height is .
In the next section, we show how this observation can be used to obtain information about the numbers appearing at the end of each class, thus extending some of Shapiro’s results.
3 On the Shapiro Class structure
The objective of this section is to illustrate the application of Theorems 1 and 2, by extending some results of Shapiro [9]. Our theorem in this section is a characterization of the last few numbers at each height.
As noted above, the largest prime at a level is less than or equal to . This upper bound is met for many . The smallest such examples are obtained when . The first few examples, corresponding to these values of , are . Primes of this kind (cf. OEIS [11, A003306]) play an important role in our theorem. Let denote the set of primes of this form, that is,
[TABLE]
To prove our main result, we require a useful proposition.
Proposition 1**.**
Let . Let be an odd, composite number not divisible by at height . Then,
[TABLE]
Before proving the proposition, we prove a special case, where is of the form or .
Lemma 1**.**
Let . Let and be (possibly the same) primes. Suppose , and . Then
[TABLE]
Proof.
Let and . Since and are not , we must have . Since and , we must have
[TABLE]
Now from Corollary 1, we see that . Now consider
[TABLE]
since implies that . This completes the proof of the lemma. ∎
Proof of Proposition 1.
We use induction on . For , the statement is vacuously true. Let . Let , and . Then , otherwise has to be divisible by . Further since . Thus by the induction hypothesis, we must have
[TABLE]
Of course, since is a prime, Now by using the same argument as in Lemma 1, we see that
[TABLE]
as required. ∎
Next, we determine all the numbers in the set
[TABLE]
These numbers are the largest elements at height .
Theorem 3**.**
Let . The set comprises all the numbers of the form , where
[TABLE]
where , with and .
Proof.
We let denote the set
[TABLE]
We want to show that .
First observe from (3) that if , then . Further, if , then
[TABLE]
This follows from
[TABLE]
Thus .
To show the converse, we apply an inductive argument using Theorem 2.
For , So let .
Observe that has only even numbers. This is because all odd numbers at height are less than or equal to , and .
Even numbers are obtained from Step 1 or Step 5 in Theorem 2. However, since there are no numbers in that are bigger than , none of the numbers in are obtained from Step 1. Thus all the numbers in are of the form , where is an odd number at height , and
[TABLE]
By Proposition 1, all odd composite numbers not divisible by are less than . Thus there are only two possibilities for .
is a prime of the form with , i.e., as required. 2. 2.
is divisible by .
In case is divisible by , it is obtained from Step 2, and there is an such that , with . So . By the induction hypothesis, , for some with . So
[TABLE]
and . This completes the proof of . ∎
To state our next result, we require some notation. Let be the elements of listed in increasing order, where , and . For example, the first few pairs are , , , and .
Corollary 2**.**
Let . Let , , be as above, with the largest such that . At height , the largest numbers, in decreasing order, are:
[TABLE]
Proof.
Let . Note that from (6), it follows that if , then . Thus , , are in decreasing order. ∎
This immediately implies a similar result for the largest odd numbers at a height.
Corollary 3**.**
Let . Let , , be as above, with the largest such that . At height , the largest odd numbers, in decreasing order, are:
[TABLE]
Remark**.**
Corollary 3 extends Theorems 10, 11 and their corollary from Shapiro [9, §5]. Shapiro considers only two primes in , namely and .
To summarize our work so far, we have found in Theorem 2 an alternative way of thinking about the Shapiro classes. We saw above how a rather obvious observation, about the largest possible prime in a class, can be used to obtain more information about the numbers that appear at a height.
At this point, we would like to venture a comment of a philosophical nature, motivated by another innocuous observation about primes in Shapiro classes.
Theorem 1 suggests that is a “measure of complexity” of a number. The prime numbers can be considered the “atoms” of numbers. A number is built from by successive multiplication by prime numbers, so the number of prime powers dividing a number says something about how complicated a number is. However, this construction does not distinguish between two primes. On the other hand, the Shapiro class structure naturally distinguishes between the primes. On looking at Table 3, we see that primes don’t come in order. For example, appears at height and at height . Thus, the height function gives a “measure of complexity” to each prime, and indeed, to each number. That is why we expect this construct will say something about prime numbers.
4 Chebyshev-type theorems
In this section, we explore one strategy to discover what Shapiro classes imply for prime numbers. The strategy is to study elementary methods explained in Shapiro [10, Chapter 9] and Apostol [1, Chapter 4], and express classical results using and . The objective of this preliminary investigation is to arrive at suitable functions that are related to the prime number functions and .
We derive results analogous to Chebyshev’s theorem, which states that there are constants such that, for ,
[TABLE]
According to Apostol [1, Theorem 4.6, (14) and (18)], we can take and , when is an even number.
We will use this result to provide an alternate formulation of Chebyshev’s theorem in terms of . In addition, we find inequalities for by modifying the proof of Apostol [1, Theorem 4.7] appropriately.
We begin with two preliminary lemmas.
Lemma 2**.**
For , we have
[TABLE]
Proof.
We use the following refinement of Stirling’s formula due to Robbins [8, (1) and (2)]:
[TABLE]
From (5) it is immediate that
[TABLE]
Thus, using (8), we obtain, for ,
[TABLE]
as required. ∎
We require one more lemma.
Lemma 3**.**
Let be such that . Let . Then
[TABLE]
Proof.
Since , we have
[TABLE]
where we have used Lemma 2. In this manner, we obtain:
[TABLE]
where . This proves the first inequality.
Since is such that , then in view of Observation 4 in §2, we must have
[TABLE]
The second inequality follows immediately from this. ∎
Theorem 4** (A Chebyshev-type Theorem).**
For , there are constants and such that
[TABLE]
Proof.
Let , and be such that . The inequalities (7) imply that there are constants and such that:
[TABLE]
Now using (9), we obtain
[TABLE]
This completes the proof of the theorem. ∎
Remarks**.**
By following the proof of (7) in Shapiro [10, Chapter 9] or Apostol [1, Chapter 4], we can obtain (7) in the special case when is a power of ; and from there, for all values of . 2. 2.
We can obtain values for and in the statement of Theorem 4 by taking and (or perhaps even better values, closer to ). However, the purpose here is to find a suitable function that can be related to . The function is evidently .
Theorem 5**.**
For , there are constants , and , such that
[TABLE]
Proof.
The proof is analogous to the proof of the inequalities
[TABLE]
given by Apostol [1, Theorem 4.7].
Let , so . Let be such that , and such that . Clearly, (since ).
We begin with the first inequality. The inequalities (10) imply that there is a constant such that
[TABLE]
or
[TABLE]
But by (9), so we obtain
[TABLE]
where .
For the second inequality, (10) implies that for some ,
[TABLE]
Next, using , we obtain
[TABLE]
so
[TABLE]
which yields,
[TABLE]
Taking logs, we find that
[TABLE]
Finally, we put all the above together, to find that
[TABLE]
for some constants and . This completes the proof. ∎
To summarize, we obtained two Chebyshev-type theorems, one for and the other for . Of course, the first such theorem came up in response to Gauss’ conjecture which said the constants and in (7) are both . The question arises: how good are these functions in approximating and ? These questions are considered in the next section.
5 (Conjectural) formulas for prime numbers
In this section we note some conjectural formulas that are motivated by Theorems 4 and 5 and present some computational evidence. We note here a particular constant that appears in our study:
[TABLE]
where is the Euler-Mascheroni constant.
A formula for
In view of Theorem 4, the first question we investigate is: If is approximately a constant multiple of , then what should that constant be? However, initial experiments on Sage [12] with various numerical guesses did not match the data as became large. However, on graphing the difference of with , the error term appears to be of the same type as itself. This leads to the following conjecture:
Conjecture 1**.**
Let be defined as
[TABLE]
Then , for a constant , where is (approximately)
[TABLE]
Notes
Figure 2 shows the graphs of and on the same set of axes for . The agreement is quite striking. 2. 2.
Figure 3 shows two curves (top) and (bottom) for to . It appears that is a better estimate than for “small” values of . 3. 3.
Table 4 compares the performance of with that of for some values of .
A formula for the prime
Given Theorem 5, one can ask how well is approximated by a constant times . It turns out that even with the constant equal to , the approximation is quite good. Indeed, it appears that
[TABLE]
Here the values of we have computed are until .
Notes
See Figure 1 mentioned in the introduction for a graph of and , for . 2. 2.
Table 1 indicates that is a better approximation to than . 3. 3.
Table 5 contains the values of and at some large random values of , and the relative error of the approximation. It appears that the relative error is increasing.
From the above, it seems that the approximation is slightly off. In view of Conjecture 1, we expect the following.
Conjecture 2**.**
Let denote the th prime. Then
[TABLE]
*for a constant , where . *
Remark**.**
Erdos et.al. [3] conjectured that there is a constant such that . These authors showed that a certain form of the Elliot–Halberstam conjecture implies their conjecture. Conjecture 2 implies that . Note that .
Professor Pomerance pointed out that (13) is inconsistent with Conjecture 1, and commented that it would be interesting to perform numerical computations to conjecture the value of . From Table 5, it appears that the number is bigger than . But we were only able to compute up to . We expect the limit to be larger.
Below, we briefly outline the steps to obtain the value of that follows from Conjecture 1. This motivates our statement of Conjecture 2.
Define now as . Take when . Then (12) can be written as
[TABLE]
On inverting this, we obtain the approximation
[TABLE]
Note that for any fixed this is a finite sum. Upon replacing by , we see that from Conjecture 1, we have
[TABLE]
Let denote the approximation to obtained from (14), that is,
[TABLE]
The computations (with ) in Table 6 indicate that is quite an accurate way to estimate the value of . This further supports Conjecture 1. Note that we may replace by to estimate .
Multiplying both sides of (14) by , and taking (formal) limits, we obtain
[TABLE]
which suggests that .
From the above we see that Conjecture 2 is consistent with Conjecture 1.
Prime gaps
We end this section with a few remarks about the prime gap. Let denote the prime gap, that is, Given that approximates , it is natural to ask whether is approximated by . On the other hand, the prime gap is notorious for its irregularity, and one cannot expect much in this regard. Nevertheless, it seems that on average, does quite a good job of approximating . Indeed, Table 7 gives a few values of the following function:
[TABLE]
In view of Conjecture 2, we expect the limit to be .
Acknowledgements
We thank Carl Pomerance (Dartmouth) and Manjil P. Saikia (Vienna) for useful comments. All numerical computations and graphics have been generated using Sage [12]. The research of the second author was partially supported by the Austrian Science Fund (FWF), grant F50-N15, in the framework of the Special Research Program “Algorithmic and Enumerative Combinatorics”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. M. Apostol, Introduction to Analytic Number Theory , Springer-Verlag, New York-Heidelberg, 1976.
- 2[2] P. A. Catlin, Concerning the iterated Φ Φ \Phi function, Amer. Math. Monthly 77 (1970), 60–61.
- 3[3] P. Erdős, A. Granville, C. Pomerance, and C. Spiro, On the normal behavior of the iterates of some arithmetic functions, in Analytic Number Theory (Allerton Park, IL, 1989) , volume 85 of Progr. Math. , pages 165–204, Birkhäuser Boston, Boston, MA, 1990.
- 4[4] W. H. Mills, Iteration of the ϕ italic-ϕ \phi function, Amer. Math. Monthly 50 (1943), 547–549.
- 5[5] T. D. Noe, Primes in classes of the iterated totient function, J. Integer Seq. , Article 08.1.2 11(1) (2008), 10 pp.
- 6[6] J. C. Parnami, On iterates of Euler’s ϕ italic-ϕ \phi -function, Amer. Math. Monthly 74 (1967), 967–968.
- 7[7] S. S. Pillai, On a function connected with ϕ ( n ) italic-ϕ 𝑛 \phi(n) , Bull. Amer. Math. Soc. 35(6) (1929), 837–841.
- 8[8] H. Robbins, A remark on Stirling’s formula, Amer. Math. Monthly 62 (1955) 26–29.
