On the distribution of Salem numbers
Friedrich G\"otze, Anna Gusakova

TL;DR
This paper develops asymptotic formulas for counting Salem numbers of fixed degree within certain bounds, providing explicit constants and functions, advancing understanding of their distribution in algebraic number theory.
Contribution
It introduces explicit asymptotic formulas for counting Salem numbers of fixed degree, including constants and density functions, and extends results to Salem numbers with bounded absolute value.
Findings
Derived asymptotic count formulas for Salem numbers of fixed degree
Provided explicit expressions for constants and density functions
Extended results to Salem numbers with bounded absolute value
Abstract
In this paper we study the problem of counting Salem numbers of fixed degree. Given a set of disjoint intervals , let denote the set of ordered -tuples of conjugate algebraic integers, such that is a Salem numbers of degree satisfying for some positive real number and . We derive the following asymptotic approximation \[ \# Sal_{m,k}(Q,I_1,\ldots,I_{k})=\omega_m\,Q^{m+1}\,\int\limits_{I_1}\ldots\int\limits_{I_{k}}\rho_{m,k}(\boldsymbol\theta)\rm d\boldsymbol\theta+O\left(Q^{m}\right),\quad Q\rightarrow\infty, \] providing explicit expressions for the constant and the function . Moreover we derive a similar asymptotic formula for the set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\label
key
On the distribution of Salem numbers
Friedrich Götze
Friedrich Götze: Faculty of Mathematics, Bielefeld University, P. O. Box 10 01 31, 33501 Bielefeld, Germany
and
Anna Gusakova
Anna Gusakova: Faculty of Mathematics, Bielefeld University, P. O. Box 10 01 31, 33501 Bielefeld, Germany
Abstract.
In this paper we study the problem of counting Salem numbers of fixed degree. Given a set of disjoint intervals , let denote the set of ordered -tuples of conjugate algebraic integers, such that is a Salem numbers of degree satisfying for some positive real number and . We derive the following asymptotic approximation
[TABLE]
providing explicit expressions for the constant and the function . Moreover we derive a similar asymptotic formula for the set of all Salem numbers of fixed degree and absolute value bounded by as .
Key words and phrases:
Salem number, counting formula, correlation function, Jacobi random matrix ensemble.
2010 Mathematics Subject Classification:
Primary, 11R06; secondary, 11P21, 60B20
The research of the first author has been supported by project SFB 1283 at Bielefeld University (Germany). The research of the second author has been supported by project IRTG 2235 at Bielefeld University (Germany).
1. Introduction
Problems concerning the distribution of algebraic numbers have a long history [26, 19, 20, 23, 22, 21]. Recall that an algebraic number is a complex number such that there exists an irreducible polynomial over with integer co-prime coefficients and positive leading coefficient such that . This polynomial is called minimal polynomial of the algebraic number and other roots of are called Galois conjugates of . Moreover if the leading coefficient of the minimal polynomial equals , is called algebraic integer.
For the distribution of algebraic numbers we consider sets of algebraic numbers with fixed degree . Usually this set will be countable and dense in . Hence we shall restrict the counting problem to finite subsets of depending on a real parameter and ask how the cardinality of this set changes as . As one of the many choices consider for example the set of algebraic numbers in with absolute multiplicative Weil height bounded by [22, 23], or the set of algebraic numbers in with naïve height bounded by and lying in some fixed set [25, 24].
In this paper we will consider similar questions for a special subset of algebraic integers, namely the so-called Salem numbers. Let us start with some definitions. A Salem number is a real algebraic integer such that all its Galois conjugates have absolute value less or equal to and at least one of them has absolute value equal to . Let denote a Galois conjugate of the Salem number lying on the complex unit circle , e.g. . Since and its complex conjugate are Galois conjugates we conclude that the minimal polynomial of a Salem number is self-reciprocal. Recall that a polynomial of degree is called self-reciprocal if
[TABLE]
Moreover the polynomial is of even degree , otherwise which contradicts to the irreducibility of . Thus, all Galois conjugates of Salem number (except for ) have absolute value and lie on the unit circle in the complex plane. We will denote them by . We shall use these two properties as a description of the set of Salem numbers.
Denote by the set of all Salem numbers of degree . Our aim is to describe the distribution of Salem numbers by considering some finite subsets of with given properties and investigating how the cardinality of those sets depend on the chosen parameters.
It should be mentioned that Salem numbers play an important role in many areas of mathematics, such as number theory, algebra and dynamical systems. For more details we refer to the papers [11, 12, 13, 14]. In particular the smallest Salem number is closely related to Lehmer’s conjecture [10].
1.1. Counting Salem numbers
Given some real and integer let us introduce the following finite subset
[TABLE]
which consists of all Salem numbers of degree lying in the interval . In this subsection we introduce an asymptotic formula for the cardinality of the set as .
Theorem 1.1**.**
For any integer we have
[TABLE]
where
[TABLE]
It should be mentioned that the set of Salem numbers is a subset of a more general class of algebraic integers, namely Perron numbers. A Perron number is a real algebraic integer such that all its Galois conjugates have absolute value less then . The problem of counting Perron numbers has been studied by F. Calegari and Z. Huang [2]. Denote by the set of all Perron numbers of degree and for some real define the following finite subset
[TABLE]
Then for any natural the following asymptotic approximation holds
[TABLE]
where
[TABLE]
The proof methods used in [2] do not apply to the case of Salem numbers, but we use some modification of those arguments to prove Theorem 1.1.
1.2. Salem numbers with given distribution of their Galois conjugates
In this subsection we consider a slightly more general problem.
Given some real , integer and disjoint intervals , denote by the set of ordered -tuples of conjugate algebraic integers, such that and for . As in the previous subsection we try to determine the cardinality of this set. Theorem 1.2 below provides an asymptotic formula for as tends to infinity.
Before we introduce our main result let us consider some additional notations and definitions. Let be a skew-symmetric matrix, which means that . The Pfaffian of is defined by
[TABLE]
where is the symmetric group of the dimension and is the signature of .Recall the following useful formula connecting Pfaffian and determinant of the matrix
[TABLE]
Furthermore, let us introduce the family of classical orthogonal polynomials, called Jacobi polynomials, via
[TABLE]
These polynomials are orthogonal to each other with respect to weight function on the interval . For more details we refer reader to [16] and Appendix B.
Theorem 1.2**.**
For any integer and any disjoint intervals , we have
[TABLE]
where is defined by (1). Moreover, the function can be written in the following form
[TABLE]
where
[TABLE]
and for we write
[TABLE]
where
[TABLE]
Remark 1**.**
According to the definition of the Pfaffian and the kernel it is easy to see that is a polynomial in and , . Moreover, it can be written as
[TABLE]
where is a polynomial.
Remark 2**.**
It should be also noted that the function defined by (4) - (10) coincides with the Kernel function of some random matrix ensemble, namely the Jacobi -ensemble with (see Appendix A and B).
In general, the formula for seems quite complicated, but it may be simplified for and .
Corollary 1.3**.**
For any integers we have
[TABLE]
and
[TABLE]
Corollary 1.3 immediately follows from the proof of Theorem 1.2 and equation (2).
Example**.**
As an example the following density functions may be written explicitly as
[TABLE]
with corresponding plots given in Figure 1.
The structure of the paper is the following. In Section 2 we present the proof of Theorem 1.1 and Theorem 1.2. Section 3 is devoted to the proof of auxiliary lemmas. In Appendix A and Appendix B we collected some facts about the distribution of the eigenvalues of random matrix ensembles needed for the proof of Theorem 1.2.
2. Proof of Theorem 1.1 and Theorem 1.2
In this section we present the proofs of Theorem 1.1 and Theorem 1.2 simultaneously since they differ in some details only.
Denote by the set of self-reciprocal monic polynomials of degree having roots lying on the unit circle and two positive real roots , bounded by . Moreover let denote the subclass of irreducible polynomials and let denote the subclass of reducible polynomials . By definition is the set of minimal polynomials of Salem numbers and thus
[TABLE]
Given a polynomial denote by a counting measure for the roots of lying on the unit circle
[TABLE]
where is the unit point mass in . Then, for any set of disjoint intervals and a polynomial the following quantity
[TABLE]
is the number of ordered -tuples such that
[TABLE]
Then, obviously,
[TABLE]
and our problem reduces to counting integral irreducible polynomials with a prescribed root distribution.
Our approach in this case will be to consider the set of polynomials with real instead of integer coefficients. Thus, identifying a polynomial of degree with the vector of its coefficients as a point in we transform the algebraic problem to the geometric problem of counting a number of integer points (points with integer coefficients) inside specific sets in .
Following this idea denote by a set of points such that the roots of polynomial
[TABLE]
have the following form
[TABLE]
for some . Moreover, denote by the set of points such that the polynomial satisfies the additional condition
[TABLE]
In order to simplify notation in the rest of the proof we will write instead of and will assume .
Denote by the volume of the set given by
[TABLE]
where denotes the indicator function of a set .
According to our convention every polynomial from represents an integer point in . Hence, the first step is to count the integer points in the set . We will show that for any integer and this number is asymptotically equal to the volume of as .
Approximating the number of integer points in a large set by the volume of this set is a classical approach. One of the earliest references we are aware of in this direction are due to Lipschitz [8] and Davenport [3]. In order to get a good estimate one needs to impose some regularity conditions on the boundary of . In accordance with [7, Definition 2.2], we say that the boundary of a set is of Lipschitz class if there exist maps satisfying a Lipschitz condition
[TABLE]
such that is covered by the images of the maps .
Lemma 2.1**.**
For any integer and the boundary of is of Lipschitz class for some fixed and independent of . Moreover
[TABLE]
where depends on , and only.
For the proof see Subsection 3.1.
Note that Lemma 2.1 allows us to estimate the number of all integer polynomials in satisfying the same conditions to the polynomials with real coefficients forming the set . But for our purpose we need to count the number of irreducible polynomials only. Thus, our next step is to show that the number of reducible polynomials in is relatively small and can be estimated by . The following lemma gives the asymptotic behavior of as .
Lemma 2.2**.**
For some depending on only we have
[TABLE]
The proof of this lemma is given in Subsection 3.2.
Finally, using Lemma 2.1 and Lemma 2.2 we get
[TABLE]
and from (11) and (14) with we conclude
[TABLE]
The last step of the proof is devoted to evaluation of and . Let us introduce the following representation of the points in terms of the roots (15) of a polynomial .
Lemma 2.3**.**
Given a polynomial
[TABLE]
with roots (15) we have
[TABLE]
where , and , .
The proof of this lemma is given in Subsection 3.3.
Let us consider the following simplex
[TABLE]
In Lemma 2.3 we defined a bijective map via (19) and (20) as
[TABLE]
Using this mapping we can rewrite (16) as follows
[TABLE]
where
[TABLE]
is the Jacobian of the map .
Lemma 2.4**.**
Given the map defined by (19) and (20) we have
[TABLE]
The evaluation of the Jacobian is postponed to Subsection 3.4.
Since is invariant with respect to permutations of , we may write
[TABLE]
Now using Lemma 2.4 together with the change of variables
[TABLE]
[TABLE]
where
[TABLE]
and denotes the -th elementary symmetric polynomial.
It is easy to see that the integrals in (24) are Selberg’s type integrals. In particular, taking for we obtain
[TABLE]
which is a special case of Selberg’s integral formula [15]
[TABLE]
for and . Thus, we conclude
[TABLE]
Substituting this into (23) leads to
[TABLE]
which together with (17) finishes the proof of Theorem 1.1.
The evaluation of
[TABLE]
is a bit more involved. Since it does not seem possible to derive compact representations for every separately, we shall look for representation of the whole sum instead.
First of all, using (16) and the definition of the sets we conclude
[TABLE]
Applying Lemma 2.3 and using the representation of via (12) and (13) we get
[TABLE]
and since our integrand is invariant with respect to permutations of we write
[TABLE]
From the identity
[TABLE]
and the fact that intervals are disjoint it follows
[TABLE]
Using again the invariance with respect to permutations of we conclude
[TABLE]
Applying Lemma 2.4 we finally obtain
[TABLE]
where
[TABLE]
with
[TABLE]
and
[TABLE]
Lemma 2.5**.**
For any integer and we have
[TABLE]
where is defined by (4) - (10).
The proof is given in Subsection 3.5.
Combining Lemma 2.5 with equations (26), (27) and (18) we finish the proof of Theorem 1.2.
3. Proofs of Lemmas
3.1. Proof of Lemma 2.1
Recall that denotes the set of points such that the roots of may be written as (15). Furthermore, recall that denotes the subset of points such that there are exactly tuples such that
[TABLE]
and , are the roots of . It is clear that for fixed and all sets are disjoint, only finite number, say , of them are non-empty and, moreover,
[TABLE]
Thus, the boundary of the set can be covered by and a set
[TABLE]
First of all recall that according to Lemma 2.3 there exists a bijective map defined by (19) and (20), where the simplex is defined by (21). This map defines a homeomorphism of manifolds with boundary, which means that , where
[TABLE]
and
[TABLE]
Moreover, according to the definition for any at least one of the equalities
[TABLE]
hold.
Let us construct maps as follows
[TABLE]
and for
[TABLE]
It is easy to see that all maps are Lipschitz continuous since they are continuously differentiable in a compact set. Moreover, from the arguments above it follows that
[TABLE]
Due to the definition of mapping we have for some constant depending on only and, thus, for the Lipschitz constant of the maps we conclude
[TABLE]
for some constant depending on only.
Finally we use a lattice point counting result by Widmer [1].
Theorem 3.1**.**
Let be a lattice in with successive minima . Let be a bounded set with boundary of Lipschitz class . Then is measurable, and, moreover,
[TABLE]
For the expression in the maximum is to be understood as 1.
Taking , , , and , and applying the theorem above we get
[TABLE]
which finishes the proof.
3.2. Proof of Lemma 2.2
Consider a reducible polynomial and assume that it can be written as a product of polynomials , such that , and . By definition of the polynomial is monic and has roots described by (15). Hence the polynomials and are monic as well, and all roots of one of them (say, ) are lying on the unit circle and the one (say, ) belongs to the set . Moreover, by Kronecker’s theorem [9] we conclude that the polynomial has to be a product of cyclotomic polynomials.
From the arguments above it follows that does not exceed the number of pairs of monic polynomials with integer coefficients such that , is a product of cyclotomic polynomials, and . It is easy to see that the number of polynomials does not exceed some constant , and from Lemma 2.1 and (25) it immediately follows that
[TABLE]
Hence, we conclude
[TABLE]
where the constant depends on only.
3.3. Proof of Lemma 2.3
Let us consider a polynomial with roots (15). Then
[TABLE]
All possible summands in the final expression consist of products of one of the summands of , or from every parentheses.
Consider the summand . The power appears in the following combinations only
[TABLE]
where and . Hence it follows that is the sum of the following terms
[TABLE]
which leads to the formula (20).
In the same manner consider the summand and note that the power appears in the following combinations only
[TABLE]
where and . Thus, is the sum of the following terms
[TABLE]
which leads to the formula (19).
3.4. Proof of Lemma 2.4
By definition the Jacobian is the absolute value of the determinant of the following matrix
[TABLE]
Hence,
[TABLE]
where
[TABLE]
We would like to show that
[TABLE]
We shall prove this formula by induction. First off all notice that the first and the second rows already have the required form, hence we may take them as base of the induction. Assume all rows up to -th row are of the form in (30). Consider the -th row, which has the form , where
[TABLE]
with the constants depending on , , and only. Multiplying the -th row by and subtracting it from the -th row for all we have
[TABLE]
and, since these transformations do not change the determinant of the matrix (30) holds for the -th row as well.
On the other hand completing the sums in the rows and applying successive row operations leaving the determinant invariant we finally arrive at
[TABLE]
The last determinant is the well-known Vandermonde determinant and, thus, we have
[TABLE]
Substituting this into (29) and taking into account that yields Lemma 2.4.
3.5. Proof of Lemma 2.5
Consider the expression
[TABLE]
Using change of variables , and the definition (28) of the function we write
[TABLE]
where
[TABLE]
Define the following function
[TABLE]
According to [17, Chapter 19] this function corresponds to the joint probability density function of the eigenvalues of the Jacobi random matrix ensemble with and weight function , (see Appendix A for the more details).
Then
[TABLE]
defines a -point correlation function (see (32)), which in this case can be written in the following form
[TABLE]
with the Kernel function is defined by (4) - (10) (see Apendix B).
Finally we write
[TABLE]
and combining this with (31) we finish the proof.
Acknowledgments
The authors are grateful to Arturas Dubickas for the suggestion of the problem and to Zakhar Kabluchko and Gernot Akemann for helpful discussions which improved the result.
Appendix A Random Matrices: General Theory
In this appendix we collect some basic facts about random matrices and the distribution of their eigenvalues. In view of the rather extensive literature on this subject we shall restrict ourselves to a very short review of results we need.
One way of defining a random matrix ensemble is to specify the joint probability density functions for its eigenvalues in the following form
[TABLE]
where is an in general complex parameter and the so called weight function can be chosen to suit the needs. The most well-studied cases are (real ensembles), (complex ensembles) and (quaternion ensembles).
One of the main objectives of Random Matrix Theory is to investigate the distribution of eigenvalues of different random matrix ensembles and particularly their limiting distribution when . For this purpose one needs to calculate -point correlation functions of eigenvalues [17, eq. (5.7.1)] defined by
[TABLE]
where the normalization constant is given by
[TABLE]
The eigenvalues of random matrix ensemble form a point process (see [18] for a definition and theory of point processes) and for two special types of point processes, namely the Determinantal point processes and the Pfaffian point processes the functions (32) will have a compact representation.
- (1)
In case of Determinantal point processes all -point correlation functions have the form
[TABLE] 2. (2)
In case of Pfaffian point processes all -point correlation functions have the form
[TABLE]
where is some special function satisfying the conditions described by [17, eq. (5.1.21), (5.1.22)]. The function is called a Kernel function.
It is known from random matrix theory that the case corresponds to Determinantal point processes and the case , correspond to Pfaffian point processes. Formulas for the kernel function may be derived using systems of orthogonal () and skew-orthogonal (, ) polynomials with the particular choice depending on the weight function . We skip the details referring the reader to [17, Chapter 5].
Appendix B Jacobi Ensemble with
This appendix is devoted to the particular random matrix ensemble used in the proof of Theorem 1.2, namely the Jacobi -ensemble [17, Chapter 19] with .
Consider the random matrix ensemble with the following joint probability density function for eigenvalues
[TABLE]
with the weight function
[TABLE]
This is a special case of more general class of weight functions
[TABLE]
which gives an ensemble corresponding to the Jacobi orthogonal polynomials defined by (3). The orthogonality property means
[TABLE]
where
[TABLE]
The Jacobi polynomials have been known for a long time and their properties are well studied [16].
As it was mentioned in Appendix A for one needs to construct the set of skew-orthogonal polynomials corresponding to the weight function (33). Skew orthogonality means that
[TABLE]
Thus, following the procedure described in [17, Chapter 19.2] and using the identities for Jacobi polynomials
[TABLE]
we get for
[TABLE]
and for
[TABLE]
Define the function
[TABLE]
Thus for we have
[TABLE]
and for we get
[TABLE]
Finally the Kernel function for the Jacobi ensemble with is defined via equations (19.2.22) - (19.2.28) in [17].
Note that in deriving (5) - (7) we have combined the equations (19.2.23) - (19.2.26) with (19.2.27) and (19.2.28). Moreover using the notation and equations (34) - (39) we arrive at (8) - (10). Furthermore, note that the representation of in (19.2.22) is given in quaternion form. In order to get a representation for Pfaffian point processes we have to use [17, Theorem 5.1.2].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Widmer Counting primitive points of bounded height. Trans. Amer. Math. Soc. , 362:4793–4829, 2010.
- 2[2] F. Calegari and Z. Huang Counting Perron numbers by absolute value. J. London Math. Soc. , (96):181–200, 2017.
- 3[3] H. Davenport. On a principle of Lipschitz. J. Lond. Math. Soc. , 26(3):179–183, 1951. Corrigendum: “On a principle of Lipschitz”, J. Lond. Math. Soc. 39 (1964), 580.
- 4[4] D. Masser and J. D. Vaaler. Counting algebraic numbers with large height I. In Diophantine Approximation , volume 16 of Dev. Math. , pages 237–243. Springer Wien New York, Vienna, 2008.
- 5[5] G. Kuba. On the distribution of reducible polynomials. Math. Slovaca. , 59(3):349–356, 2009.
- 6[6] V.V. Prasolov. Polynomials. vol. 11 of Algorithms and Computation in Mathematics, Springer, Berlin, 2004.
- 7[7] M. Widmer. Lipschitz class, narrow class, and counting lattice points. Proc. Amer. Math. Soc. , 140(2):677–689, 2012.
- 8[8] R. Lipschitz. Über die asymptotischen Gesetze von gewissen Gattungen zahlentheoretischer Funktionen. Monatsber. der Berliner Akademie , 1865:174–185, 1865.
