Counting fundamental solutions to the Pell equation with prescribed size
Ping Xi

TL;DR
This paper investigates the number of fundamental solutions to Pell equations with size constraints, providing new lower bounds by applying advanced exponential sum techniques, thus advancing understanding of Pell solutions' distribution.
Contribution
It introduces a novel application of the $q$-analogue of van der Corput method to algebraic exponential sums, improving bounds on Pell solutions compared to previous work.
Findings
Established new lower bounds for the count of Pell solutions
Applied the $q$-analogue of van der Corput method to algebraic sums
Enhanced previous results by Fouvry on Pell equation solutions
Abstract
The cardinality of the set of for which the fundamental solution of the Pell equation is less than with is studied and certain lower bounds are obtained, improving previous results of Fouvry by introducing the -analogue of van der Corput method to algebraic exponential sums with smooth moduli.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Counting fundamental solutions to the Pell equation with prescribed size
Ping Xi
School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, P. R. China
Abstract.
The cardinality of the set of for which the fundamental solution of the Pell equation is less than with is studied and certain lower bounds are obtained, improving previous results of Fouvry by introducing the -analogue of van der Corput method to algebraic exponential sums with smooth moduli.
Key words and phrases:
Pell equation, exponential sums, -analogue of van der Corput method
2010 Mathematics Subject Classification:
11D09, 11N37, 11L07, 11T23
1. Introduction and main results
Let be a non-square positive integer. The Pell equation is usually referred to
[TABLE]
to which the solution can be written in the usual form
[TABLE]
The classical Dirichlet Units Theorem asserts that the set of solutions to (1.1) is non-trivial and has the form
[TABLE]
where is called the fundamental solution of (1.1) and is given by
[TABLE]
Writing , we have , from which we deduce that and finally
[TABLE]
We are interested in counting the integers for which or is less than a fixed power of .
For and , define
[TABLE]
In his pioneer work, Hooley [Ho84] proved the following theorem.
Theorem A** (Hooley).**
Let satisfy . As , one has
[TABLE]
uniformly for
In the same paper, Hooley [Ho84, P. 110] also made the following conjecture.
Conjecture 1.1** (Hooley).**
For any given , we have
[TABLE]
where
[TABLE]
Fouvry [Fo16] made a first significant step towards Hooley’s conjecture in the case In fact, he proved the following theorem.
Theorem B** (Fouvry).**
As , one has
[TABLE]
and
[TABLE]
uniformly for
Moreover, Fouvry [Fo16] considered a conjectural estimation for short exponential sums.
Conjecture 1.2**.**
There exists an absolute , such that for any integer one has the inequality
[TABLE]
uniformly for any integers satisfying and , for any real number satisfying and
Assuming Conjecture 1.2, Fouvry [Fo16] derived the following stronger lower bounds.
Theorem C** (Fouvry).**
Assume that Conjecture 1.2 is true for some . As , one has
[TABLE]
and
[TABLE]
uniformly for
The first inequality coincides with Hooley’s Conjecture 1.1 when is slightly larger than On the other hand, Bourgain [Bou15] considered Conjecture 1.2 itself. In particular, he succeeded in saving a power of in the trivial bound for the sum involved. This allows him to improve the lower bound (1.2) by replacing the term with the term , where is some positive constant.
Bourgain’s improvement is of interest only if is quite close to and one should pay much more attention if every parameter is to be made effective. The aim of this paper is to give another improvement to Theorem B towards Conjecture 1.1.
Theorem 1.1**.**
Let . For any fixed we have
[TABLE]
and
[TABLE]
uniformly in , where is the Dickman function given by and
[TABLE]
Fix . One may check that is always positive for and monotonically increasing in . In particular, for we take such that and
[TABLE]
Moreover, for Hence we may conclude the following consequence.
Corollary 1.1**.**
Let . Uniformly for we have
[TABLE]
and
[TABLE]
The framework of the proof is based on [Fo16]. To make the paper clear, we will present the proof as complete as possible, but also with omitting some details that are not quite essential to understand the underlying ideas.
The key point of proving Theorem 1.1 is a variant of Conjecture 1.2 that can be proved unconditionally. More precisely, if one allows the moduli to be smooth numbers (integers free of large prime factors), it is possible to prove the existence of in Conjecture 1.2 as long as is not too small. The details can be referred to Theorem 3.1 and Section 4. We will adopt the -analogue of van der Corput method, which can be at least dated back to Heath-Brown [HB78] on the proof of Weyl-type subconvex bounds for Dirichlet -functions to well-factorable moduli. Instead of the -process in [HB78], we apply the -process by introducing a completion in the initial step. It is expected that one can do better on the exponential sums in Conjecture 1.2 if better factorizations of the moduli are imposed; see [WX16] for instance in the case of squarefree moduli. However, the improvements to Theorem 1.1 would be rather slight, since the density of smooth numbers decays rapidly when the size of their prime factors decreases.
As an extension to Theorem 1.1, one may consider
[TABLE]
for Conjecture 1.1 would yield asymptotics for while are of different prescribed sizes. A weaker statement would assert that, for any there exists a positive constant , such that
[TABLE]
for all large . This weaker statement was made unconditionally by Fouvry and Jouve [FJ12] whenever . It is expected that the arguments in this paper can enlarge the admissible range of .
Notation and convention
As usual, , denotes the Euler function and counts the number of distinct prime factors of . The variable is reserved for prime numbers. Denote by and the squarefree and squarefull parts of , respectively; namely,
[TABLE]
For a real number denote by its integral part and . From time to time, we use to denote the greatest common divisor of , and also to denote a tuple given by two coordinates; these will not cause confusions as one will see later. The symbol in summation reminds us to sum over primitive elements such that poles of the summand are avoided. For a function , its Fourier transform is defined as
[TABLE]
We use to denote a very small positive number, which might be different at each occurrence; we also write The convention means
Acknowledgements
I am very grateful to the referee for the valuable comments and suggestions. The work is supported in part by NSFC (No.11601413) and NSBRP (No. 2017JQ1016) of Shaanxi Province.
2. Fundamental transformations: after Hooley and Fouvry
We first make some fundamental transformations following the arguments of Hooley and Fouvry. For some conclusions, we omit the proof and the detailed arguments can be found in [Ho84] and [Fo16].
2.1. An initial transformation
First, we write
[TABLE]
where
[TABLE]
Here, is a function in , implicitly defined by the equation
[TABLE]
We have the following asymptotic characterization for The proof can be found in [Fo16, Lemma 2.1] and the subsequent remark.
Lemma 2.1**.**
Let The function is of -class and satisfies the inequalities
[TABLE]
and
[TABLE]
as
2.2. A first dissection of
We truncate the -sum in (2.1) at , and the contributions from and are respectively denoted by and Therefore,
[TABLE]
Accordingly, we may define and by introducing the extra restriction to the equation
As stated by Fouvry [Fo16, Formula (4.5)], we have
Lemma 2.2**.**
Let As we have
[TABLE]
Our task thus reduces to proving a lower bound for Put We then have
[TABLE]
Put . Thus is multiplicative and satisfies
[TABLE]
2.3. Analysis of
For , write , where is an odd integer. The choice of is unique for each . The Chinese remainder theorem implies
[TABLE]
In this way, one can establish a bijection between and Starting from (2.9), we decompose by
[TABLE]
where
[TABLE]
The task will be evaluating for all and This would require the following description of that allows us to create one more variable. This is Lemma 4.1 in [Fo16].
Lemma 2.3**.**
Let be a positive odd integer. Then there is a bijection between the set of coprime decompositions of
[TABLE]
and the set of roots of congruence
[TABLE]
Such a bijection can be defined by where is uniquely determined by the congruences and . In an equivalent manner, we have the congruence
[TABLE]
Here and .
With the help of Lemma 2.3, we may rewrite as
[TABLE]
where and correspond to the restrictions and , respectively. Since the treatments of and are similar, it suffices to study as presented in the next section.
We close this section by the following trivial equality:
[TABLE]
which is a consequence of the equivalence
[TABLE]
This allows us to transfer between and .
3. Lower bound for
In order to conclude the lower bound for , we now start the study of . Recall that
[TABLE]
We would like to drop the multiplicative constraints and sum over separately. To do so, we may introduce the following inequality
[TABLE]
where
- •
the summation is over all satisfying
[TABLE]
and being powers of ,
- •
the summation is over all satisfying
- •
we have defined
[TABLE]
Of course the condition can be dropped when The parameter is supposed to be fixed and the congruence conditions modulo are harmless. So to shorten the notations, we write Finally we shall not precise the dependence on of some -symbols, since we shall work with a finite number of values of . The case is typical and really reflects the difficulties of the method.
3.1. Reduction to exponential sums: after Fouvry
The congruence condition implies that , i.e., for some Since , then there is no such if is too large, for instance when
[TABLE]
Hence we can suppose
[TABLE]
otherwise
Since is odd, we deduce from (2.13) the equivalence
[TABLE]
with
[TABLE]
where and It follows that
[TABLE]
with The three terms on the RHS have completely different structures: the first one is constant, the second one changes very slowly when and vary, the third one oscillates a lot when varies with fixed.
For each fixed , we rewrite the sum as
[TABLE]
where and . As in [Fo16], we smooth the -sum via the following lemma.
Lemma 3.1**.**
For every there exists a smooth function which has the two properties
[TABLE]
and
[TABLE]
Let be a smooth function given as in Lemma 3.1. Hence
[TABLE]
By Poisson summation, the -sum becomes
[TABLE]
From integration by parts, we have for any . Note that
[TABLE]
The above sum over can be truncated to with , and the remaining contribution is at most Therefore,
[TABLE]
where and are used to denote contributions from and , respectively.
First,
[TABLE]
which is . It is also desirable to show that
[TABLE]
for some . By standard tools from analysis (see [Fo16] for details), it suffices to prove that
[TABLE]
where , . After transforming the -sum to a complete sum , where
[TABLE]
Fouvry [Fo16] evaluated in terms of classical Gauss sums and Jacobi symbols. He then arrived at a bilinear form involving Jacobi symbols, for which a celebrated estimate due to Heath-Brown [HB95] was applied. Amongst some other delicate arguments, Fouvry was able to prove (3.4) under the conditions
[TABLE]
in which case he obtained the lower bound
[TABLE]
To obtain a better lower bound for and thus for , it is natural to expect that (3.4) and (3.7) can hold in larger ranges of . However, it seems rather difficult when is quite close to since the -sum is too short in the sense of the Pólya-Vinogradov barrier. In fact, Bourgain [Bou15] managed to control the LHS in (3.5), but with a saving of a small power of rather than that of . This allows him to improve upon Fouvry when is rather close to in Theorem B.
In our subsequent argument, we will specialize with special structures in the original sum (3.3) before Poisson summation. More precisely, we will consider those consisting of only small prime factors, so that has good factorizations, which enable us to control the exponential sums in (3.5) even though is quite close to .
3.2. Lower bound of : smooth approach
A positive integer is said to be -smooth (or friable) if all prime factors of do not exceed . Let be a fixed number. If is -smooth, the inclusion-exclusion principle yields the existence of the divisor such that for any
We now restrict these in the RHS of (3.3) to -smooth numbers and put
[TABLE]
where with given as in Lemma 3.1. Following the similar arguments of Fouvry, we may derive that
[TABLE]
where
[TABLE]
and we expect to show that
[TABLE]
for some , for which it suffices to prove, for all , that
[TABLE]
We will prove
Theorem 3.1**.**
There exists some such that holds, provided that
[TABLE]
Put and . In addition to the restrictions , (3.6) requires , and we require in the particular case . In the following figure, the blue area shows what we can gain more than the previous approach (We are gaining relatively more as becomes closer to ).
The proof of Theorem 3.1 will be given in the next section. To see the advantage of our approach, one may consider the particular case , and our first restriction will reduce to ; however, the stronger restriction in (3.6) is required.
Therefore, we may obtain the lower bound
[TABLE]
subject to the restrictions in (3.11).
3.3. A weakened form of Theorem 1.1
Up to now, we have two lower bounds for , i.e., (3.7) and (3.12), subject to the restrictions in (3.6) and (3.11), respectively. In what follows, we will take into account all such admissible tuples , for which we appeal to (3.7) if (3.6) is satisfied, and appeal to (3.12) if (3.6) is not satisfied but (3.11) is valid. To this end, we define two sets of tuples
[TABLE]
and
[TABLE]
where is a sufficiently small positive number.
First, we may derive a lower bound for by inserting the inequality (3.7) or (3.12) to (3.1). A similar lower bound also holds for by symmetry. Therefore, we have
[TABLE]
where are also restricted to be powers of 2. Recall that
[TABLE]
We then obtain the lower bound
[TABLE]
where
[TABLE]
and
[TABLE]
Taking very large, very small, and letting tend to zero, we conclude from Lemma A.1 that
[TABLE]
uniformly for where
[TABLE]
Note that is what we have gained more than Fouvry [Fo16]. From Lemma A.3, we arrive at
[TABLE]
with
[TABLE]
One may check that
[TABLE]
as given in Theorem 1.1. Combining this asymptotic evaluation for with (3.13), we may conclude a lower bound for , from which and (2.8), (2.7), we get
[TABLE]
uniformly for
3.4. Concluding Theorem 1.1
To pass from a lower bound of to that of , it is natural to invoke the identity (2.14) and Theorem A. In fact, one can do a bit better following the arguments of Fouvry [Fo16] and show that the above lower bound (3.14) also hold for This will depend on a more elaborate study of the contribution from non-fundamental solutions. In other words, we would like to show that the non-fundamental solutions create negligible contributions to and
The following lemma is borrowed directly from [Fo16, Lemma 9.1].
Lemma 3.2**.**
Uniformly for and , one has
[TABLE]
To deal with the contribution of the non-fundamental solutions to , we also follow Fouvry. The above arguments which lead to (3.14) are essentially counting the number of of 5-tuples of integers satisfying
[TABLE]
[TABLE]
[TABLE]
as well as one of the following restrictions:
- •
;
- •
;
- •
;
- •
.
By introducing the extra constraint , we may also define . In fact, the above arguments yield
[TABLE]
which are true for every positive and for every More precisely, we have proved for every , and that
[TABLE]
with
Following the approach of Fouvry [Fo16], we can state without proof that
Lemma 3.3**.**
For every and every , one has
[TABLE]
uniformly for and
We are now ready to complete the proof of Theorem 1.1. In view of (3.15), we may write
[TABLE]
which are true for every and . From Lemmas 3.2 and 3.3, we obtain
[TABLE]
By (2.8), (3.16), and by choosing sufficiently large, sufficiently small, and letting tend to zero, we find the lower bound (3.14) holds definitely for This establishes (1.4).
The lower bounds for in Theorem 1.1 can be deduced from (1.2) by adding the contribution of the non-fundamental solutions, as it is shown by (2.14).
4. Estimate for triple exponential sums
We now prove Theorem 3.1. For the economy of the presentation, we only focus on the case and define
[TABLE]
We would like to show that
[TABLE]
for some while fall into the ranges in (3.11).
By Poisson summation, the -sum in (4.1) becomes
[TABLE]
From the Chinese remainder theorem, the sum over can be rewritten as
[TABLE]
where is an analogue of Kloosterman sums:
[TABLE]
Hence we may conclude that
[TABLE]
Note that and if is odd, we have . According to and , we split by
[TABLE]
where
[TABLE]
and
[TABLE]
Following the approach of Fouvry, one may express in terms of Jacobi symbols (see [Fo16, Lemma 6.2]) and then appeal to the bilinear estimate of Heath-Brown [HB95], getting
[TABLE]
which produces the second restriction in (3.6). Moreover, Fouvry proved that
[TABLE]
which produces the first restriction in (3.6).
Our task will be proving a stronger estimate for by virtue of the special structure of . More precisely, we shall prove that
[TABLE]
where will be chosen at our demand. This would at least require the following inequality as proved by Fouvry [Fo16]. In fact, Fouvry only considered those that are perfect squares, and his argument also applies to more general .
Lemma 4.1**.**
Let be an odd positive integer. Then we have
[TABLE]
As an extension to , we define another exponential sum
[TABLE]
where in summation reminds us to sum over primitive elements, i.e., We will need the following inequality.
Lemma 4.2**.**
For each odd , we have
[TABLE]
where
[TABLE]
The proof of Lemma 4.2 will be given in the last appendix.
We now start to prove Theorem 3.1. Due to the decay of , we may truncate the -sum in by so that
[TABLE]
Our project will be controlling the cancellations while summing over with . More precisely, we would like to estimate
[TABLE]
The contributions from negative and positive can be treated similarly, it thus suffices to consider
[TABLE]
which can be rewritten as
[TABLE]
Clearly, . By partial summation, it suffices to consider
[TABLE]
For each , we define by
[TABLE]
It follows that and . By virtue of the -analogue of van der Corput method, we will prove
Lemma 4.3**.**
With the above notation, we have
[TABLE]
Proof.
Lemma 4.3 is a trivial consequence of Lemma 4.1 if We now assume Denote by the characteristic function of the interval Thus,
[TABLE]
for any From the Chinese remainder theorem, we may write
[TABLE]
where we have used the fact that
For , we sum over , getting
[TABLE]
From Lemma 4.1 we find
[TABLE]
In view of the support of , the sum over is in fact restricted to By Cauchy inequality, we derive that
[TABLE]
Squaring out and switching summations, we get
[TABLE]
where is an interval, depending on , of length at most . For , we appeal to Lemma 4.1 to estimate the -sum trivially. For , reasonable cancellations in the -sum are expected. In fact, by completion, we have
[TABLE]
where
[TABLE]
and
[TABLE]
On one hand,
[TABLE]
Opening each by definition, the orthogonality of additive characters gives
[TABLE]
For , we have
[TABLE]
For , we would like to appeal to Lemma 4.2. To do so, we first derive from Lemma 4.2 that (we have since is a perfect square)
[TABLE]
This yields
[TABLE]
from which and (4.6) we obtain
[TABLE]
Note that Thus,
[TABLE]
We then conclude that
[TABLE]
which gives Lemma 4.3 immediately. ∎
In view of Lemma 4.3 and the discussions before it, we may derive from (4.5) that
[TABLE]
where . In view of the choice , Lemma A.5 yields
[TABLE]
from which and (4.3) we conclude that
[TABLE]
Choosing , we then have . Recalling the choice of , we then arrive at the expected estimate (4.2), provided that (3.11) holds.
Appendix A Mean values of arithmetic functions
A.1. Some basic asymptotics
The first part of the appendix is devoted to state several basic asymptotics.
Recall that denotes the number of solutions to the congruence equation As a multiplicative function, satisfies the evaluation (2.10). We are now ready to state the following averages.
Lemma A.1**.**
As , we have
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
Proof.
The first one can be derived from the evaluations of as given in (2.10). The other three asymptotics can be found in Fouvry [Fo16, Lemma 8.1, Lemma 8.2]. ∎
A.2. Smooth numbers
Denote by the set of -smooth numbers not exceeding . Write We now introduce the Dickman function by
[TABLE]
In the first several intervals, we have
[TABLE]
The following lemma is classical and shows is the density function of smooth numbers.
Lemma A.2**.**
Uniformly for , we have
[TABLE]
Proof.
See [Te95, P. 367, Theorem 6]. ∎
Lemma A.3**.**
Let be fixed. As , we have
[TABLE]
and
[TABLE]
Proof.
One can refer to [TW03], for instance, for some general theorems on the mean values of multiplicative functions over smooth numbers. In particular, one has
[TABLE]
The lemma then follows from the partial summation. ∎
Lemma A.4**.**
For all , we have
[TABLE]
Proof.
Note that is multiplicative. For we consider the Dirichet series
[TABLE]
By Euler product formula, we have
[TABLE]
where is holomorphic for and
[TABLE]
The lemma then follows from a routine application of Perron’s formula. ∎
The following inequality is a consequence of Rankin’s method, which is a stronger version of [Fo16, Lemma 7.2].
Lemma A.5**.**
For any , one has
[TABLE]
Proof.
Denote by the sum in question. First, we have
[TABLE]
Note that
[TABLE]
giving
[TABLE]
from which and Lemma A.4 it follows that
[TABLE]
By Rankin’s method, the last sum over is, for any ,
[TABLE]
by re-defining . We now get
[TABLE]
where the last step follows from Lemma A.4 together with partial summation and taking therein. ∎
Appendix B Estimate for
While invoking the ideas of -analogue of van der Corput method, we have transformed the original algebraic exponential sum to a new sum as given by (4.4). This appendix will be devoted to present an estimation for that suits well in our applications to Theorem 1.1. In fact, the job can be done for the general complete exponential sum
[TABLE]
Here with and being coprime in . The values of such that are excluded from summation. We define the degree of by
[TABLE]
If
[TABLE]
we then adopt the convention that
[TABLE]
for all
There are many known estimates for complete exponential sums in the literature. In [WX16, Theorem B.1], we obtained the following estimate for .
Theorem B.1**.**
Let For we have
[TABLE]
where and are given as in Lemma 4.2.
In fact, the case of being a prime is essentially due to Weil [We48], and one can refer to [Bom66] for a complete proof that suits quite well in our situation. Thanks to the Chinese remainder theorem, the evaluation of can be reduced to the case of prime power moduli , which can usually be treated following an elementary device; see [IK04, Section 12.3] for details. This is in fact the line of the proof in [WX16].
The upper bound in Theorem B.1 is complicated at first glance and it even gives a worse bound than the trivial one if is a constant function mod . However, it provides an essentially optimal bound if we have an extra average over , since does not oscillate too much on average.
Given we take
[TABLE]
with
[TABLE]
Moreover, we have
[TABLE]
Thus, for odd , we have
[TABLE]
[TABLE]
and
[TABLE]
Note that
From Theorem B.1, we may conclude the following estimate as given in Lemma 4.2.
Theorem B.2**.**
For each odd , we have
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bom 66] E. Bombieri, On exponential sums in finite fields, Amer. J. Math. 88 (1966), 71–105.
- 2[Bou 15] J. Bourgain, A remark on solutions of the Pell equation, Int. Math. Res. Not. 2015, 2841–2855.
- 3[Es 61] T. Estermann, On Kloosterman’s sum, Mathematika 8 (1961), 83–86.
- 4[Fo 16] É. Fouvry, On the size of the fundamental solution of the Pell equation, J. Reine Angew. Math. 717 (2016), 1–33.
- 5[FJ 12] É. Fouvry & F. Jouve, Fundamental solutions to Pell equation with prescribed size, Proc. Steklov Inst. Math. 276 (2012), 40–50.
- 6[HB 78] D. R. Heath-Brown, Hybrid bounds for Dirichlet L 𝐿 L -functions, Invent. Math. 47 (1978), 149–170.
- 7[HB 95] D. R. Heath-Brown, A mean value estimate for real character sums, Acta Arith. 72 (1995), 235–275.
- 8[Ho 84] C. Hooley, On the Pellian equation and the class number of indefinite binary quadratic forms, J. Reine Angew. Math. 353 (1984), 98–131.
