Free rational points on smooth hypersurfaces
Tim Browning, Will Sawin

TL;DR
This paper uses the Hardy-Littlewood circle method to count rational points of bounded height on smooth hypersurfaces, addressing a question posed by Peyre and focusing on low-degree cases.
Contribution
It introduces a novel application of the circle method to count 'sufficiently free' rational points on smooth hypersurfaces over the rationals.
Findings
Successfully counts rational points on certain hypersurfaces
Provides new bounds for the number of rational points
Addresses Peyre's question in specific cases
Abstract
Motivated by a recent question of Peyre, we apply the Hardy-Littlewood circle method to count "sufficiently free" rational points of bounded height on arbitrary smooth projective hypersurfaces of low degree that are defined over the rational numbers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Free rational points on smooth hypersurfaces
Tim Browning
IST Austria
Am Campus 1
3400 Klosterneuburg
Austria
and
Will Sawin
Columbia University
Department of Mathematics
2990 Broadway
New York
NY 10027
USA
Abstract.
Motivated by a recent question of Peyre, we apply the Hardy–Littlewood circle method to count “sufficiently free” rational points of bounded height on arbitrary smooth projective hypersurfaces of low degree that are defined over the rationals.
2010 Mathematics Subject Classification:
11P55 (11D45, 14G05)
Contents
- 1 Introduction
- 2 The geometry of numbers and the shape of lattices
- 3 Free rational points on hypersurfaces
- 4 Identification of the major arcs
- 5 Treatment of the minor arcs
1. Introduction
Let be a smooth hypersurface of degree , defined over the field of rational numbers. For , let , where is the usual exponential height function on . Thanks to the Hardy–Littlewood circle method and work of Birch [2], it follows that there exists a constant such that
[TABLE]
as , provided that . Here and is the Tamagawa measure on the space of adeles of . The asymptotic formula (1.1) provided one of the earliest pieces of evidence for the conjecture of Manin [7], and its refinement by Peyre [10], about the distribution of rational points on Fano varieties.
The purpose of this paper is to address a very recent question of Peyre [11] about the distribution of “sufficiently free” rational points of bounded height on . Peyre associates a measure of “freeness” to any and advocates the idea of only counting those rational points which satisfy , where is a function of decreasing to zero sufficiently slowly.111A similar question was asked by Ellenberg and Venkatesh in a 2015 private communication with the first author. (See [11, Def. 6.11] for a precise statement for arbitrary Fano varieties over arbitrary number fields.) Peyre’s function is defined in (3.5) using Arakelov geometry and the theory of slopes associated to the tangent bundle . Let
[TABLE]
In the setting of smooth hypersurfaces of low degree, Peyre predicts that for a suitable range of , should have the same asymptotic behaviour as the usual counting function , as . The following result confirms this for a range of that is independent of .
Theorem 1.1**.**
Let and let . Then there exists a constant such that for any
[TABLE]
there exists a further constant such that
[TABLE]
where is the expected leading constant.
Note that in our theorem the parameter is a constant, while in Peyre’s notion of freeness one takes tending to zero. Thus our result is stronger than necessary for Peyre’s formulation. We shall show in §3 that it suffices to work with a simpler freeness function that is defined in (3.4) in terms of the largest successive minimum of a certain associated lattice. Once this is achieved, the proof of Theorem 1.1 is guided by our investigation [4] of the analogous situation for smooth hypersurfaces over global fields of positive characteristic. We shall find that the role of the Riemann–Roch theorem in [4, §3] is replaced by the Poisson summation formula. After this the argument runs in close parallel to [4], apart from in one essential difference associated to primes of bad reduction for .
An interesting feature of our method is that it relies on counting integer solutions to the system of equations , where is the defining polynomial of . This is equivalent to counting integer points on the tangent bundle of the affine cone over . This suggests that it may be possible to bound the number of rational points of small freeness on a Fano variety by using asymptotics for the number of rational points on together with asymptotics for the number of integral points on the tangent bundle of .
Acknowledgements*.*
The authors are very grateful to the anonymous referee for numerous pertinent remarks. While working on this paper the first author was supported by EPRSC grant EP/P026710/1. The research was partially conducted during the period the second author served as a Clay Research Fellow, and partially conducted during the period he was supported by Dr. Max Rössler, the Walter Haefner Foundation and the ETH Zurich Foundation.
2. The geometry of numbers and the shape of lattices
Most of the facts that we record in this section are taken from the book by Cassels [5]. Recall that a lattice is a discrete additive subgroup of . Equivalently
[TABLE]
for a set of linearly independent vectors . The rank of is then and the determinant is , where is the matrix formed from the column vectors . For each let be the least such that contains at least linearly independent vectors of Euclidean length bounded by . The are the successive minimima of and they satisfy Furthermore, it follows from Minkowski’s second convex body theorem [5, §VIII.3.2] that
[TABLE]
where the implied constant depends only on . The dual lattice is defined to be
[TABLE]
This lattice has basis matrix and so and . Appealing to work of Banaszczyk [1, Thm. 2.1], it follows that
[TABLE]
for .
The following result is well-known and will prove instrumental in our work. A proof is given as a special case of work by Heath-Brown [8, Lemma 1].
Lemma 2.1**.**
For any vector the set is a lattice of dimension and determinant , where is the Euclidean norm on .
Given a lattice of rank it will be important to detect when the lattice is unusually skew, in the sense that the largest successive minimum is excessively large. To be precise, we seek a useful majorant for the indicator function
[TABLE]
This is achieved in the following simple result.
Lemma 2.2**.**
Let be a lattice of rank and let be the Gaussian function Then
[TABLE]
Proof.
Note that for all and . It follows from Poisson summation that
[TABLE]
since . Thus
[TABLE]
for any lattice . Moreover, according to (2.2), we have if . This means that there exists a non-zero vector such that . But then
[TABLE]
This implies that
[TABLE]
if , which thereby completes the proof of the lemma. ∎
3. Free rational points on hypersurfaces
Suppose that is a non-singular form of degree that defines the hypersurface . Any rational point has a representative vector such that and . The measure of freeness of that we shall use in our paper is phrased in terms of the “well-shapedness” of the associated lattice
[TABLE]
It follows from Lemma 2.1 that is a lattice of rank and determinant
[TABLE]
where is the Euclidean norm. Let be the absolute value of the discriminant of the non-singular polynomial . From the definition of the discriminant as the resultant of the forms , it follows that there exists and algebraic identities
[TABLE]
for , where each has integer coefficients. In particular
[TABLE]
Next we claim that
[TABLE]
for appropriate implied constants that depend only on . Since has degree and so its partial derivatives have degree , the upper bound is clear. To see the lower bound we note that for all , since is non-singular. Thus is nowhere vanishing on the unit sphere and so attains some minimum value , say, there. Thus we have in general because is homogeneous of degree . This establishes (3.3).
As we shall see shortly, Peyre defines a freeness function relative to the smallest slope on the tangent bundle . The measure of freeness that we shall work with is related to this, but it is phrased in terms of the relative size of the largest successive minimum of the lattice . To be precise, we set
[TABLE]
Then if and only if .
We gain some feeling for the behaviour of by recalling (2.1). Thus for “typical” one might expect the successive minima to have the same order of magnitude, for . If this were true it would follow from (2.1) that
[TABLE]
since by (3.3). Such satisfy , as . The following example shows a familiar situation in which the freeness function is unusually small.
Example*.*
Consider the case and of a smooth cubic surface . Let be a -line and define the associated rank lattice
[TABLE]
We claim that, for any , we have for all but finitely many . To see this we note that vanishes identically in for all . But then it follows that in which case we have . It now follows from (2.1) and (3.3) that
[TABLE]
This therefore yields and the claim.
We now explain how our freeness function (3.4) relates to that defined by Peyre [11, Déf. 4.11]. To begin with we can extend to a closed subscheme . A rational point gives a section of this scheme. Because this scheme is smooth of dimension , the pullback of its tangent bundle along is a rank free -module; i.e. a free lattice of rank . Fixing a Riemannian metric on gives a metric on this lattice. Peyre defines the freeness of as
[TABLE]
where is the logarithmic anticanonical height of and are the slopes defined by Bost. There are four main differences between Peyre’s definition and ours:
- (1)
Peyre includes a factor of in the numerator and the anticanonical height in the denominator instead of . 2. (2)
Peyre uses the notion of slopes instead of successive minima. The slopes of a lattice differ from minus the logarithms of its successive minima by . 3. (3)
Peyre works in a slightly different lattice, namely the tangent lattice instead of the perpendicular lattice to . These lattices are closely related, but not identical, and this discrepancy means that we only produce an inequality (instead of an identity) between the two notions of freeness. 4. (4)
Peyre defines the freeness to always be non-negative.
The relationship between the two notions of freeness is articulated in the following result.
Lemma 3.1**.**
For any we have
[TABLE]
Proof.
We first explain how to relate the tangent lattice to , and then why this leads to the stated inequality. We have an Euler exact sequence
[TABLE]
on . This induces an exact sequence
[TABLE]
We have and because and are rank one locally free sheaves, so their pullback along are rank one locally free sheaves on , which are all isomorphic to . The map between them is multiplication by , so the tangent lattice of is the quotient lattice . We claim that the induced metric on this is the renormalized metric
[TABLE]
Formally this arises from the twist, but we can see this explicitly since the natural isomorphism between and the tangent space to at depends on the scaling of the vector and not just on its equivalence class in . The Arakelov metric on the tangent bundle of projective space must depend continuously on a point in projective space. To make it do so, we divide by .
Calculating is now relatively easy. Consider the exact sequence
[TABLE]
where the second map represents dot product with . We can realise as the kernel of dotting with in , with no further renormalization necessary. Invoking some basic properties of slopes, we deduce that
[TABLE]
Indeed, the first step uses the fact that, when we divide the metric of a lattice by , we add to each slope of the lattice, which is clear from the definition [11, Déf 4.4] and is a special case of [3, Lemma 4.2]. The second step uses the fact that the minimum slope of a quotient lattice is at least the minimum slope of the original lattice, which is immediate from the definition of the minimum slope as a minimum over quotients of the lattice in [3, p. 195] and the equivalence of Bost’s minimum slope and the last slope in Peyre’s ordering. The last step uses [11, Remarque 4.7(b)].
The inequality
[TABLE]
immediately follows, since . We therefore have
[TABLE]
except if where the middle inequality fails, but this happens for only finitely many and we can handle it by assuming that the constant in the term is sufficiently large. ∎
Returning to (1.2), we can now make sense of the counting function
[TABLE]
for any , where
[TABLE]
The first term is handled by (1.1), since Moreover, in view of Lemma 3.1 and the fact that , we have
[TABLE]
where the presence of the term is needed to account for the low height points. Hence
[TABLE]
on taking into account the action of the units on . We require an upper bound for which is for an appropriate , and which is valid for as wide a range of as possible.
To handle it will be convenient to break the range for into dyadic intervals. Thus
[TABLE]
say. Appealing to Lemma 2.2, we deduce that
[TABLE]
where . In what follows it will be convenient to write when is represented by a vector . In view of (3.2) we have , so that
[TABLE]
We shall use this formula to extend the definition of to all (not necessarily primitive) vectors .
Let us write to denote the inequalities . In order to treat we begin by analysing the term
[TABLE]
It is clear that if and only if , for any . Hence, an application of Möbius inversion yields
[TABLE]
where
[TABLE]
Our plan is to define a set of “major arcs” for the interval whose integral matches the expected main term from (3.7).
4. Identification of the major arcs
Our identification of the major arcs follows the path that was paved in [4, §4]. Henceforth all implied constants will be allowed to depend on . It will be convenient to set
[TABLE]
where is the parameter occurring in (3.9). Since , it follows from Poisson summation that
[TABLE]
for any and any . Let us use to denote the distance to the nearest integer. We observe that
[TABLE]
for any and any . Hence it is not hard to see that
[TABLE]
for any . Led by this we make the following definition.
Definition 4.1** (Major arcs).**
For any we set
[TABLE]
where is a sufficiently large constant that only depends on .
The following result is concerned with the size of the exponential sum when belongs to this set of major arcs.
Lemma 4.2**.**
Let , let with , and let for coprime integers such that and . Then
[TABLE]
if and , with otherwise.
Proof.
Let . Then
[TABLE]
for any , provided that is large enough. Next, we see that
[TABLE]
if . Thus it follows from (4.1) that if and . Alternatively, if we have and then clearly
[TABLE]
Finally, if for some then there exists a non-zero integer such that , whence
[TABLE]
for . This shows that in this case, as required to complete the proof of the lemma. ∎
The following result is concerned with the evaluation of the integral of over the major arcs.
Lemma 4.3**.**
Let and assume that . Then
[TABLE]
where
[TABLE]
Proof.
Let us set throughout the proof. We define the modified major arcs to be the set of for which and . We claim that these modified major arcs are non-overlapping. To see this we suppose that . Then we may assume without loss of generality that . But then it follows that
[TABLE]
Assuming that is sufficiently large, this implies that , which thereby establishes the claim. (In fact it is not hard to check that the major arcs are also disjoint provided that .)
An application of Lemma 4.2 yields
[TABLE]
for any , where
[TABLE]
Since , it is clear that
[TABLE]
where is as in the statement of the lemma.
Next, we observe that
[TABLE]
Moroever,
[TABLE]
vanishes unless , which implies that
[TABLE]
Appealing to (3.3) and using the fact that , the right hand side is at most , if the constant is taken to be sufficiently large in Definition 4.1. Hence we conclude that , which in turn implies that and thus that . Because the integrand is nonnegative, the integral over this restricted interval is at most , so that
[TABLE]
Putting everything together yields the statement of the lemma. ∎
It is now time to return to our expression (3.9) for . First, sticking with the notation and , we deduce from Lemma 4.3 that
[TABLE]
where
[TABLE]
for any . If then . Furthermore, if then (3.1) yields
[TABLE]
whence in fact . Moreover, we have
[TABLE]
Assume now that . Then it follows from (1.1) that
[TABLE]
Thus
[TABLE]
for any , where
[TABLE]
Here the exponent arises from summing the savings over . Reintroducing the sum over , we now see that the overall contribution to from the set of major arcs is
[TABLE]
on taking sufficiently large.
Putting and bringing everything together in (3.7), it now follows that
[TABLE]
We may detect the equation in the way most familiar to practitioners of the Hardy–Littlewood circle method. On doing so, we are led to the following result, which summarises our discussion of the major arcs.
Lemma 4.4**.**
Let . For any let , where is given by Definition 4.1. Then there exists such that
[TABLE]
where if is given by (3.8) then
[TABLE]
Proof.
The only thing that requires comment is the truncation from to . But since the trivial bound yields
[TABLE]
Hence the tail of the -summation makes a satisfactory contribution. ∎
5. Treatment of the minor arcs
We begin with a technical result from the geometry of numbers, which generalises the “shrinking lemma” that is due to Davenport [6, Lemma 12.6], and which one recovers by taking in the following result.
Lemma 5.1**.**
Let be a symmetric matrix with entries in . Let , let and let . Let be the number of such that and . Then
[TABLE]
where the implied constant depends only on .
Proof.
We may assume that , since the left hand side is when . Define the matrix
[TABLE]
so that
[TABLE]
Let denote the successive minima of the lattice corresponding to and let be the successive minima of the dual lattice corresponding to . Then (2.2) implies that for . Since the lattices are equal up to left and right multiplication by a matrix in , we must have
[TABLE]
for all . Taking we deduce that
Since , the quantity is equal to the number of vectors in the lattice corresponding to whose first entries form a vector of Euclidean norm and whose last entries are individually . Thus it is bounded below by the number of vectors with Euclidean norm , and bounded above by the corresponding number with Euclidean norm . On the other hand, is bounded below by the number of vectors in the lattice corresponding to with norm and above by the corresponding number with norm . It therefore follows from Davenport [6, Lemma 12.4] that
[TABLE]
where the implied constants depend only on . Dividing term by term, we see that each contributes at most and each contributes at most . Thus the total contribution is at most , as claimed in the statement of the lemma. ∎
The second technical result required is a simple Diophantine approximation result due to Heath-Brown [9, Lemma 2.3].
Lemma 5.2**.**
Let . Let such that and let , with coprime integers and , such that . Assume that
[TABLE]
Then .
The statement of this result requires the assumption that and are coprime, which isn’t formally stated in [9, Lemma 2.3] but is implicit in the proof.
We now have the tools in place to study our exponential sum on the minor arcs. Let us set
[TABLE]
as previously. We want to study
[TABLE]
for , where is given by (3.8) and
[TABLE]
Let us write . Then we have
[TABLE]
whence
[TABLE]
In this section an important role will be played by the multilinear forms
[TABLE]
for , where are the symmetric coefficients such that
[TABLE]
In what follows we shall write to denote the vector . Since is non-singular, it follows from [2, Lemmas 3.1 and 3.3] that
[TABLE]
for any . Next, for given and , let
[TABLE]
It follows from applications of Lemma 5.1 that
[TABLE]
for any .
Returning to the expression for in (5.2), we start by removing the factor via the observation that depends only on mod . Letting for compactness of notation, we break the sum into residue classes mod , getting
[TABLE]
where
[TABLE]
We may write
[TABLE]
where
[TABLE]
and
[TABLE]
Note that has degree .
We shall estimate via Weyl differencing, as in Birch [2]. Let
[TABLE]
and
[TABLE]
Then, for any , we have
[TABLE]
where
[TABLE]
We shall produce two estimates for . In the first we take , which eliminates the effect of the lower degree term and leads to a family of linear exponential sums that depend on the Diophantine approximation properties of alone. Alternatively, we take . After a further application of Cauchy–Schwarz, one brings the -sum inside, thereby bringing the Diophantine properties of into play.
By Dirichlet’s approximation theorem there exist and such that
[TABLE]
with
[TABLE]
The following is our first bound for and only involves the Diophantine approximation properties of .
Lemma 5.3**.**
Assume that is such that (5.8) holds and put
[TABLE]
Then
[TABLE]
Proof.
Taking in (5.7), we first note that
[TABLE]
for some polynomial that doesn’t depend on . It follows that
[TABLE]
for any and any absolute constant . Exploiting (3.3), it readily follows from multi-dimensional partial summation that
[TABLE]
In the standard way (cf. the proof of [6, Lemma 13.2]) one finds that
[TABLE]
in the notation of (5.4). Applying (5.5) we obtain
[TABLE]
for any . By choosing to satisfy
[TABLE]
for appropriate implied constants depending on , we can make Lemma 5.2 applicable. We then deduce from (5.3) that
[TABLE]
since (5.8) holds. It follows that
[TABLE]
with as in the statement of the lemma. Substituting this into (5.6), we conclude the proof of the lemma by summing trivially over and the finitely many possible values of . ∎
We now turn to our alternative estimate for , which is obtained by exploiting the Diophantine approximation properties of By Dirichlet’s approximation theorem there exist and such that
[TABLE]
with
[TABLE]
We shall prove the following result, which operates under the assumption that and are not too lopsided.
Lemma 5.4**.**
Assume that is such that (5.9) holds and put
[TABLE]
Assume that . Then
[TABLE]
Proof.
This time we begin through an application of Hölder’s inequality in (5.6). Recalling that , we deduce that
[TABLE]
where
[TABLE]
Taking in (5.7), it follows that
[TABLE]
At this point we carry out a further differencing operation to conclude that
[TABLE]
There exists a polynomial that doesn’t depend on and polynomials that don’t depend on such that
[TABLE]
where may depend on . We may now interchange the order of summation and execute the sum over , noting that for . Hence, in the usual way, we conclude that
[TABLE]
where denotes the number of for which and
[TABLE]
for . Note that is linear when viewed as a polynomial in . For fixed , given a single satisfying the inequality, for all other solutions we will have
[TABLE]
where . Thus we may replace by to remove the constant term . Doing so leads to the conclusion that is at most the number of vectors for which and , with
[TABLE]
for . We conclude that
[TABLE]
in the notation of (5.4).
Next, on appealing to (5.5), we deduce that
[TABLE]
for any . Assume that . If we choose so that
[TABLE]
for appropriate implied constants that depend only on , then we can make Lemma 5.2 applicable. In the light of (5.3), this leads to the conclusion that
[TABLE]
since . Thus
[TABLE]
where
[TABLE]
Assuming that (5.9) holds and , it follows that
[TABLE]
whence finally
[TABLE]
We deduce by summing over the finitely many possible values of that
[TABLE]
The lemma follows since . ∎
We now have everything in place to complete the estimation of via Lemma 4.4. For the moment we continue to adopt the notation (5.1) for and . Since in Lemma 4.4 we may assume that in Lemma 5.4. Given , let denote the overall contribution to the integral
[TABLE]
from and such that
[TABLE]
Then it follows that from Lemmas 5.3 and 5.4 that
[TABLE]
By invoking Dirichlet’s approximation theorem twice, as in (5.8) and (5.9), we see that we are only interested in such that
[TABLE]
Furthermore, since belongs to the minor arcs it follows from Definition 4.1 that unless
[TABLE]
Since there are possible dyadic values for that can contribute, we get an estimate for the minor arc integral by taking a maximum of (5.10) over all satisfying these inequalities.
Taking , with
[TABLE]
and then taking , we deduce from (5.10) that
[TABLE]
But and Hence
[TABLE]
Note that the exponent of in the denominator is strictly positive precisely when . Recalling that and are given by (5.1) we insert this argument into Lemma 4.4 to deduce that
[TABLE]
for some , provided that is sufficiently small in terms of and . This completes the proof of Theorem 1.1, on summing over dyadic intervals in (3.6).
We can get an explicit value of the constant as follows. Since in (5.1), we see that
[TABLE]
gives a power saving as soon as Recalling that and multiplying both sides by , this condition becomes
[TABLE]
or
[TABLE]
Thus we may take in Theorem 1.1 by letting converge to [math]. Note that for fixed we have as .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. Banaszczyk, New bounds in some transference theorems in the geometry of numbers. Math. Annalen 296 (1993), 625–635.
- 2[2] B.J. Birch, Forms in many variables. Proc. Roy. Soc. Ser. A 265 (1961/62), 245–263.
- 3[3] J.B. Bost, Algebraic leaves of algebraic foliations over number fields. Publications mathématiques de l’IHÉS 93 (2001), 161–221.
- 4[4] T.D. Browning and W. Sawin, Free rational curves on low degree hypersurfaces and the circle method. Submitted , 2018. ( ar Xiv:1810.06882 )
- 5[5] J.W.S. Cassels, Introduction to the geometry of numbers . Springer-Verlag, 1971.
- 6[6] H. Davenport, Analytic methods for Diophantine equations and Diophantine inequalities . 2nd ed., edited by T.D. Browning, Camb. Univ. Press, 2005.
- 7[7] J. Franke, Y.I. Manin and Y. Tschinkel, Rational points of bounded height on Fano varieties. Invent. Math. 95 (1989), 421–435.
- 8[8] D.R. Heath-Brown, Diophantine approximation with square-free numbers. Math. Zeit. 187 (1984), 335–344.
