Binary quartic forms with bounded invariants and small Galois groups

Cindy Tsang; Stanley Yao Xiao

arXiv:1702.07407·math.NT·November 13, 2019

Binary quartic forms with bounded invariants and small Galois groups

Cindy Tsang, Stanley Yao Xiao

PDF

TL;DR

This paper classifies integral irreducible binary quartic forms with Galois groups as subgroups of the dihedral group of order eight, organizing them into families linked to quadratic forms and counting their equivalence classes.

Contribution

It introduces a classification of such quartic forms based on quadratic form families and provides enumeration of their equivalence classes.

Findings

01

Forms are organized into families indexed by quadratic forms.

02

Enumeration of GL_2(Z)-equivalence classes for fixed quadratic forms.

03

Connection between quartic forms and quadratic form discriminants.

Abstract

In this paper, we consider integral and irreducible binary quartic forms whose Galois group is isomorphic to a subgroup of the dihedral group of order eight. We first show that the set of all such forms is a union of families indexed by integral binary quadratic forms $f (x, y)$ of non-zero discriminant. Then, we shall enumerate the $GL_{2} (Z)$ -equivalence classes of all such forms associated to a fixed $f (x, y)$ .

Equations606

F (x, y) = a_{4} x^{4} + a_{3} x^{3} y + a_{2} x^{2} y^{2} + a_{1} x y^{3} + a_{0} y^{4},

F (x, y) = a_{4} x^{4} + a_{3} x^{3} y + a_{2} x^{2} y^{2} + a_{1} x y^{3} + a_{0} y^{4},

I (F)

I (F)

J (F)

H_{BS} (F) = max {∣ I (F) ∣^{3}, J (F)^{2} /4} .

H_{BS} (F) = max {∣ I (F) ∣^{3}, J (F)^{2} /4} .

N_{Z} (X)

N_{Z} (X)

\displaystyle\hskip 85.35826pt\mbox{quartic forms $F$ such that $H_{\mathrm{\tiny BS}}(F)\leq X$}\},

N_{\mathbb{Z}}(X)=\frac{44\zeta(2)}{135}X^{5/6}+O_{\epsilon}\left(X^{3/4+\epsilon}\right)\mbox{ for any $\epsilon>0$}.

N_{\mathbb{Z}}(X)=\frac{44\zeta(2)}{135}X^{5/6}+O_{\epsilon}\left(X^{3/4+\epsilon}\right)\mbox{ for any $\epsilon>0$}.

S_{4}

S_{4}

A_{4}

D_{4}

C_{4}

V_{4}

Q_{F} (x) = x^{3} - 3 I (F) x + J (F) .

Q_{F} (x) = x^{3} - 3 I (F) x + J (F) .

\operatorname{Gal}(F)\mbox{ is small if and only if ${\mathcal{Q}}_{F}(x)$ is reducible}.

\operatorname{Gal}(F)\mbox{ is small if and only if ${\mathcal{Q}}_{F}(x)$ is reducible}.

ξ_{T} (x, y) = \frac{1}{det ( T ) ^{d e g ξ /2}} ξ (t_{1} x + t_{2} y, t_{3} x + t_{4} y) \mbox f or T = (t_{1} t_{3} t_{2} t_{4}) .

ξ_{T} (x, y) = \frac{1}{det ( T ) ^{d e g ξ /2}} ξ (t_{1} x + t_{2} y, t_{3} x + t_{4} y) \mbox f or T = (t_{1} t_{3} t_{2} t_{4}) .

M_{f} = (β - 2 α 2 γ - β)

M_{f} = (β - 2 α 2 γ - β)

{T \in GL_{2} (R) : T \mbox i s n o t a sc a l a r m u l t i pl eo f I_{2 \times 2} and F_{T} = F}

{T \in GL_{2} (R) : T \mbox i s n o t a sc a l a r m u l t i pl eo f I_{2 \times 2} and F_{T} = F}

V_{R, f}

V_{R, f}

V_{Z, f}

V_{R, f}^{0} = {F \in V_{R, f} : Δ (F) \neq = 0} and V_{Z, f}^{0} = {F \in V_{Z, f} : Δ (F) \neq = 0} .

V_{R, f}^{0} = {F \in V_{R, f} : Δ (F) \neq = 0} and V_{Z, f}^{0} = {F \in V_{Z, f} : Δ (F) \neq = 0} .

L_{f} (F) = ω_{f} (F) and K_{f} (F) = - ω_{f}^{'} (F) ω_{f}^{''} (F) .

L_{f} (F) = ω_{f} (F) and K_{f} (F) = - ω_{f}^{'} (F) ω_{f}^{''} (F) .

H_{f} (F) = max {L_{f} (F)^{2}, ∣ K_{f} (F) ∣} .

H_{f} (F) = max {L_{f} (F)^{2}, ∣ K_{f} (F) ∣} .

x^{3} - I (F) x + J (F) = (x - ω_{f} (F)) (x - ω_{f}^{'} (F)) (x - ω_{f}^{''} (F)),

x^{3} - I (F) x + J (F) = (x - ω_{f} (F)) (x - ω_{f}^{'} (F)) (x - ω_{f}^{''} (F)),

3 I (F) = L_{f} (F)^{2} + K_{f} (F) and J (F) = L_{f} (F) K_{f} (F),

3 I (F) = L_{f} (F)^{2} + K_{f} (F) and J (F) = L_{f} (F) K_{f} (F),

(H_{f} (F) /10)^{3} \leq H_{BS} (F) \leq H_{f} (F)^{3} .

(H_{f} (F) /10)^{3} \leq H_{BS} (F) \leq H_{f} (F)^{3} .

Δ (F) = \frac{4 I ( F ) ^{3} - J ( F ) ^{2}}{27} = (\frac{L _{f} ( F ) ^{2} + 4 K _{f} ( F )}{9}) (\frac{2 L _{f} ( F ) ^{2} - K _{f} ( F )}{9})^{2},

Δ (F) = \frac{4 I ( F ) ^{3} - J ( F ) ^{2}}{27} = (\frac{L _{f} ( F ) ^{2} + 4 K _{f} ( F )}{9}) (\frac{2 L _{f} ( F ) ^{2} - K _{f} ( F )}{9})^{2},

H_{f_{T}} (F_{T}) = H_{f} (F),

H_{f_{T}} (F_{T}) = H_{f} (F),

V_{R, f} ⟶ V_{R, f_{T}}; F \mapsto F_{T},

V_{R, f} ⟶ V_{R, f_{T}}; F \mapsto F_{T},

V_{Z}^{sm, †} = {F \in V_{Z}^{sm} : Gal (F) \neq ≃ V_{4}} .

V_{Z}^{sm, †} = {F \in V_{Z}^{sm} : Gal (F) \neq ≃ V_{4}} .

V_{Z}^{sm}

V_{Z}^{sm}

V_{Z}^{sm, †}

H (F) = H_{f} (F) .

H (F) = H_{f} (F) .

N_{Z}^{†} (X)

N_{Z}^{†} (X)

N_{Z, f}^{†} (X)

N_{Z}^{†} (X) = f \in F \sum N_{Z, f}^{†} (X),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Binary quartic forms with bounded invariants

and small Galois groups

Cindy (Sin Yi) Tsang

Yau Mathematical Sciences Center

Tsinghua University

Beijing, P. R. China

[email protected]

and

Stanley Yao Xiao

Mathematical Institute

University of Oxford

Andrew Wiles Building

Radcliffe Observatory Quarter

Woodstock Road

Oxford

OX2 6GG

[email protected]

Abstract.

In this paper, we consider integral and irreducible binary quartic forms whose Galois group is isomorphic to a subgroup of the dihedral group of order eight. We first show that the set of all such forms is a union of families indexed by integral binary quadratic forms $f(x,y)$ of non-zero discriminant. Then, we shall enumerate the $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence classes of all such forms associated to a fixed $f(x,y)$ .

1 Introduction
2 Characterization of forms with small Galois groups
3 Basic properties of forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant
4 Parametrizing forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant
5 Definition of a bounded semi-algebraic set
6 Error estimates and the main theorem
7 Acknowledgments

1. Introduction

The problem of enumerating $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence classes of integral and irreducible binary forms of a fixed degree has a long history. The quadratic and cubic cases were solved in [16, 22] and [12, 13], respectively, where the forms are ordered by the natural height, namely the discriminant $\Delta(-)$ . The quartic case turns out to be much more challenging because while the ring of polynomial invariants for both binary quadratic and cubic forms is generated by $\Delta(-)$ as an algebra, that for binary quartic forms is generated by two independent invariants, usually denoted by $I(-)$ and $J(-)$ . For

[TABLE]

they are given by the explicit formulae

[TABLE]

which are of degrees two and three, respectively. In [4], instead of using the discriminant, Bhargava and Shankar introduced the height function

[TABLE]

For $X>0$ , let us define

[TABLE]

where $[-]$ denotes $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence class. In [4], they proved that

[TABLE]

This is the first result ever obtained, and as far as we know, the only known result in the literature, for the quartic case.

1.1. Set-up and notation

In this paper, we shall also be interested in the quartic case, but only the integral and irreducible binary quartic forms $F$ with small Galois group $\operatorname{Gal}(F)$ , which is defined to be the Galois group of the splitting field of $F(x,1)$ over ${\mathbb{Q}}$ . We know that $\operatorname{Gal}(F)$ is isomorphic to one of the following:

[TABLE]

We shall say that $\operatorname{Gal}(F)$ is small if it is isomorphic to $D_{4},C_{4}$ , or $V_{4}$ . Recall that the cubic resolvent of $F$ is defined by

[TABLE]

Then, equivalently, we have the classical characterization that for irreducible $F$

[TABLE]

It turns out that whether $\operatorname{Gal}(F)$ is small or not may also be characterized in terms of binary quadratic forms and the following so-called twisted action of $\operatorname{GL}_{2}({\mathbb{R}})$ .

Given a complex binary form $\xi(x,y)$ , let $\operatorname{GL}_{2}({\mathbb{R}})$ act on it via

[TABLE]

Observe that this is only an action up to sign when $\deg\xi$ is odd, in the sense that for $T_{1},T_{2}\in\operatorname{GL}_{2}({\mathbb{R}})$ , we only have $\xi_{T_{1}T_{2}}=\pm(\xi_{T_{1}})_{T_{2}}$ in general. Now, given a real binary quadratic form $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ with $\Delta(f)\neq 0$ , write

[TABLE]

for its associated matrix in $\operatorname{GL}_{2}({\mathbb{R}})$ . Its action on binary quartic forms clearly remain unchanged if we scale $f(x,y)$ by a constant in ${\mathbb{R}}^{\times}$ . In [27], the second-named author proved that for any real binary quartic form $F$ with $\Delta(F)\neq 0$ , elements of

[TABLE]

all arise from binary quadratic forms in this way; see Proposition 2.1. Recall that an integral binary quadratic form is called primitive if its coefficients are coprime. Using this result from [27], in Section 2, we shall first show that:

Theorem 1.1.

Let $F$ be an integral binary quartic form with $\Delta(F)\neq 0$ . Then, the following are equivalent.

(1)

${\mathcal{Q}}_{F}(x)$ * is reducible.* 2. (2)

$F_{T}=F$ * for some $T\in\operatorname{GL}_{2}({\mathbb{Q}})$ which is not a scalar multiple of $I_{2\times 2}$ .* 3. (3)

$F_{M_{f}}=F$ * for an integral and primitive binary quadratic form $f$ with $\Delta(f)\neq 0$ .*

Moreover, in the case that ${\mathcal{Q}}_{F}(x)$ is reducible:

(a)

If $\Delta(F)\neq\square$ , then there is a unique such $f$ up to sign. 2. (b)

If $\Delta(F)=\square$ , then there are exactly three such $f$ up to sign, among which one is definite and two are indefinite.

Given a real binary quadratic form $f(x,y)$ with $\Delta(f)\neq 0$ , let us further make the following definitions. First put

[TABLE]

Clearly $V_{{\mathbb{R}},f}$ is a vector space over ${\mathbb{R}}$ and $V_{{\mathbb{Z}},f}$ a lattice over ${\mathbb{Z}}$ . A straightforward calculation shows that $\dim_{{\mathbb{R}}}V_{{\mathbb{R}},f}$ is three; see (3.1) and (3.2) below. Also, put

[TABLE]

For $F\in V_{{\mathbb{R}},f}^{0}$ , we shall define two new invariants as follows. As we shall see in (2.3), there is a unique root $\omega_{f}(F)$ of ${\mathcal{Q}}_{F}(x)$ corresponding to $f$ . Let $\omega^{\prime}_{f}(F),\omega^{\prime\prime}_{f}(F)$ denote the other two roots of ${\mathcal{Q}}_{F}(x)$ and define

[TABLE]

By Proposition 3.2 below, they have degrees one and two, respectively, in the coefficients of $F$ . Following (1.2), let us define the height of $F$ associated to $f$ by

[TABLE]

This is comparable to the height (1.2) because by comparing coefficients in

[TABLE]

we easily deduce the relations

[TABLE]

which in turn imply that

[TABLE]

Let us note that

[TABLE]

where the first equality is well-known, and the second equality holds by (1.5). Also, our height $H_{f}(-)$ is an invariant in the sense that for any $T\in\operatorname{GL}_{2}({\mathbb{R}})$ , we have

[TABLE]

as shown in Proposition 3.1 below. This implies that the map

[TABLE]

which is a well-defined bijection because $M_{f_{T}}=T^{-1}M_{f}T$ , is height-preserving when restricted to the forms of non-zero discriminant.

Now, let us return to the integral and irreducible binary quartic forms with small Galois group. Write $V_{{\mathbb{Z}}}^{\mathrm{\tiny sm}}$ for the set of all such forms and set

[TABLE]

By Theorem 1.1, we know that

[TABLE]

where $\mathfrak{F}^{*}$ denotes the set of all integral and primitive binary quadratic forms of non-zero discriminant, up to sign. In particular, given $F\in V_{{\mathbb{Z}}}^{\mathrm{\tiny sm},\dagger}$ , there is a unique $f\in\mathfrak{F}^{*}$ such that $F\in V_{{\mathbb{Z}},f}^{0}$ , and we may define the height of $F$ by setting

[TABLE]

For $X>0$ , let us define

[TABLE]

Then, by (1.8) and (1.9), we have

[TABLE]

where $\mathfrak{F}$ denotes a set of representatives of the $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence classes on $\mathfrak{F}^{*}$ . In Theorem 1.2, which is our main result, for $f\in\mathfrak{F}^{*}$ , we shall determine the asymptotic formula for $N_{{\mathbb{Z}},f}^{\dagger}(X)$ . In fact, we shall consider the finer counts

[TABLE]

and show that the latter two are negligible compared to $N_{{\mathbb{Z}},f}^{(D_{4})}(X)$ . This means that most of the forms in $V_{{\mathbb{Z}}}^{\mathrm{\tiny sm}}\cap V_{{\mathbb{Z}},f}^{0}$ have Galois group isomorphic to $D_{4}$ . However, all of our error estimates depend upon $f$ . Currently, we do not know how to control them in a uniform way, and so we are unable to obtain an asymptotic formula for $N_{{\mathbb{Z}}}^{\dagger}(X)$ by summing over $f\in\mathfrak{F}$ .

Finally, let us explain, for each $f\in\mathfrak{F}^{*}$ , how counting forms in $V_{\mathbb{Z}}^{\mathrm{\tiny sm}}\cap V_{{\mathbb{Z}},f}^{0}$ may be reduced to counting lattice points. Write $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ with $\alpha,\beta,\gamma\in{\mathbb{Z}}$ . By (3.1) and (3.2), the set $V_{{\mathbb{R}},f}$ is a vector space isomorphic to ${\mathbb{R}}^{3}$ via

[TABLE]

Recall that the subset $V_{{\mathbb{Z}},f}$ has the structure of a rank-three ${\mathbb{Z}}$ -lattice, which may be identified with the lattices

[TABLE]

in ${\mathbb{Z}}^{3}$ . Let us mention here that we shall use the isomorphism

[TABLE]

Thus, the problem is reduced to counting points in $\Lambda_{f,1}$ or $\Lambda_{f,2}$ , and then sieving out those which come from reducible forms. In turn, counting lattice points amounts to computing certain volumes by a result of Davenport [11]; see Proposition 5.1.

1.2. Statement of the main theorem

It is clear that we may choose the set $\mathfrak{F}$ of representatives to be such that for all $f\in\mathfrak{F}$ , the $x^{2}$ -coefficient is positive, and

[TABLE]

when $f$ is reducible. Let $\sim$ denote $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence. Then, our main result is:

Theorem 1.2.

Let $f(x,y)$ be an integral and primitive binary quadratic form of non-zero discriminant and with positive $x^{2}$ -coefficient. Write $D_{f}=|\Delta(f)|$ , and put

[TABLE]

(a)

Suppose that $f$ is positive definite. Then, we have

[TABLE]

where

[TABLE] 2. (b)

Suppose that $f$ is reducible and that $f$ has the shape (1.11). Then, we have

[TABLE]

where

[TABLE] 3. (c)

Suppose that $f$ is indefinite and irreducible. Define $t_{D_{f}}\in{\mathbb{R}}$ to be such that $e^{t_{D_{f}}}$ is the fundamental unit of the quadratic order ${\mathbb{Z}}[(D_{f}+\sqrt{D_{f}})/2]$ , or equivalently

[TABLE]

where $(u_{D_{f}},v_{D_{f}})\in{\mathbb{N}}^{2}$ is the least solution to $x^{2}-D_{f}y^{2}=\pm 4$ . Then, we have

[TABLE]

where

[TABLE] 4. (d)

In all three cases, for any $\epsilon>0$ , we have

[TABLE]

and also

[TABLE]

Notice that the error terms in Theorem 1.2 depend upon $f$ . Hence, we are unable to obtain an asymptotic formula for $N_{{\mathbb{Z}}}^{\dagger}(X)$ by summing over $f\in\mathfrak{F}$ . However, there are only three $f\in\mathfrak{F}$ that need to be considered if we restrict to the forms in

[TABLE]

This is because by Proposition 2.1 below, such a matrix $T$ must be of the shape $M_{f}$ or $M_{f}/2$ up to sign, where $f\in\mathfrak{F}^{*}$ . From (1.9), we then deduce that

[TABLE]

For $X>0$ , let us put

[TABLE]

Then, by (1.8) and the above discussion, we have

[TABLE]

where we may take

[TABLE]

whose discriminants are $-4,1$ , and $4$ , respectively. It follows that:

Corollary 1.3.

We have

[TABLE]

Proof.

Theorem 1.2 implies that

[TABLE]

Summing these terms up then yields the claim. ∎

Finally, as a consequence of the proof of Theorem 1.2, we also have:

Theorem 1.4.

Let $D=\beta^{2}+4\alpha^{2}$ , where $\alpha,\beta\in{\mathbb{N}}$ are coprime and $D$ is not a square. Then, the negative Pell’s equation $x^{2}-Dy^{2}=-4$ has integer solutions if and only if the integral binary quadratic form $\alpha x^{2}+\beta xy-\alpha y^{2}$ is $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to a form of the shape $ax^{2}+bxy+cy^{2}$ with $a$ dividing $b$ .

We now discuss some potential applications of our Theorem 1.2 and Corollary 1.3.

First, it is natural to ask whether the asymptotic formula (1.3), which was proven using Proposition 5.1, admits a secondary main term. From the arguments in [4], we see that the error term arising from volumes of the lower dimensional projections in Proposition 5.1 is only of order $O(X^{3/4})$ . Thus, possibly $X^{3/4}$ is the order of a second main term, but it is dominated by another error term coming from

[TABLE]

In particular, it was shown in [4, Lemma 2.4] that

[TABLE]

Our Corollary 1.3 removes this obstacle, because

[TABLE]

by (1.6) and Theorem 1.2 (d), whence we have

[TABLE]

This improvement potentially allows one to prove a secondary main term for (1.3) by using similar methods from [5], where it was shown that the counting theorem in [14] for cubic fields has a secondary main term of order $X^{5/6}$ ; this latter fact was proven independently in [23] as well.

Next, integral binary quartic forms are closely related to quartic orders, and maximal irreducible quartic orders may be regarded as quartic fields. More generally, by the construction of Birch-Merriman [7] or Nakagawa [20], any integral binary form $F$ gives rise to a ${\mathbb{Z}}$ -order $Q_{F}$ whose rank is the degree of $F$ , where $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence class of $F$ corresponds to isomorphism class of $Q_{F}$ . By [15], it is well-known that all cubic orders come from integral binary cubic forms, which enabled the enumeration of cubic orders having a non-trivial automorphism as well as cubic fields by their discriminant; see [6] and [14], respectively. But this is not true for orders of higher rank. Parametrizations of quartic and quintic orders were given by Bhargava in his seminal work [2] and [3]. In [25], Wood further showed that the quartic orders arising from integral binary quartic forms are exactly those having a monogenic cubic resolvent; see [2] for the definition. This implies that the forms in

[TABLE]

correspond to quartic $D_{4}$ -, $C_{4}$ -, and $V_{4}$ -fields whose ring of integers has a monogenic cubic resolvent. In our upcoming paper [24], we shall enumerate $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence classes of forms in $V_{{\mathbb{Z}}}^{\mathrm{\tiny sm},\star}$ with respect to a height corresponding to the conductor of fields, as motivated by [1]. In fact, we shall that show that

[TABLE]

Thus, our counting theorem in [24] may be regarded as a refinement and an extension of Corollary 1.3 above.

Last but not least, binary quartic forms are connected to elliptic curves as well. In particular, any integral binary quartic form $F$ gives rise to an elliptic curve

[TABLE]

defined over ${\mathbb{Q}}$ . In [4], Bhargava and Shankar applied (1.3) as well as a parametrization of 2-Selmer groups due to Birch and Swinnerton-Dyer to show that the average rank of elliptic curves over ${\mathbb{Q}}$ , when ordered by a naive height analogous to (1.2), is at most $3/2$ . This result is remarkable in that it is the first to show, unconditional on the BSD-conjecture and the Grand Riemann Hypothesis, boundedness of the average rank of large families of elliptic curves over ${\mathbb{Q}}$ . Conditional bounds were obtained by Brumer [8], Heath-Brown [17], and Young [26] previously. Now, the relations in (1.5) imply that for $F\in V_{{\mathbb{Z}}}^{\mathrm{\tiny sm}}\cap V_{{\mathbb{Z}},f}^{0}$ with $f\in\mathfrak{F}^{*}$ , we have

[TABLE]

which has a rational $2$ -torsion point. Hence, our Theorem 1.2 potentially allows one to study arithmetic properties of elliptic curves with $2$ -torsion over ${\mathbb{Q}}$ . Let us remark that unlike a large family of elliptic curves over ${\mathbb{Q}}$ , in the sense of [4, Section 3], the family consisting of those curves with a rational $2$ -torsion exhibits a rather peculiar behaviour. Indeed, Klagsbrun and Lemke-Oliver [19] proved that the average size of the 2-Selmer groups in this family is unbounded, and they conjectured an asymptotic growth rate. One might be able to obtain such an asymptotic growth rate using our Theorem 1.2 and a sieve that detects local solubility; this line of inquiry is pursued in an upcoming paper due to D. Kane and Z. Klagsbrun.

2. Characterization of forms with small Galois groups

2.1. Cremona covariants

Let $F$ be a real binary quartic form with $\Delta(F)\neq 0$ . As Cremona defined in [10], we have three quadratic covariants $\mathfrak{C}_{F,\omega}(x,y)$ , each of which is associated to a root $\omega$ of ${\mathcal{Q}}_{F}(x)$ ; see [27, Subsection 4.2] for the explicit definition. They satisfy the syzygy

[TABLE]

where $F_{4}$ is the Hessian covariant of $F$ and is given by

[TABLE]

We shall label the roots $\omega_{1}(F),\omega_{2}(F),\omega_{3}(F)$ of ${\mathcal{Q}}_{F}(x)$ such that

[TABLE]

where $\mathfrak{C}_{F,i}(x,y)$ is defined as in [27, (4.6)]. Then, from (2.1) and the explicit expressions for $\mathfrak{C}_{F,\omega}(x,y)$ given in [27], we have the following observations:

(1)

For $\omega=\omega_{1}(F)$ , the binary quadratic form $\mathfrak{C}_{F,\omega}(x,y)$ has real coefficients. 2. (2)

For $\omega=\omega_{2}(F),\omega_{3}(F)$ , we have:

$\bullet$ If $\Delta(F)>0$ , then $\lambda_{\omega}\cdot\mathfrak{C}_{F,\omega}(x,y)$ has real coefficients for some $\lambda_{\omega}\in\{1,\sqrt{-1}\}$ .

$\bullet$ If $\Delta(F)<0$ , then $\lambda\cdot\mathfrak{C}_{F,\omega}(x,y)$ does not have real coefficients for all $\lambda\in{\mathbb{C}}^{\times}$ .

Also, it is easy to check that

[TABLE]

We shall require the following result by the second-named author in [27].

Proposition 2.1.

Let $F$ be a real binary quartic form with $\Delta(F)\neq 0$ . Then, a set of representatives for the quotient group

[TABLE]

is given by

[TABLE]

Furthermore, the quadratic forms $\mathfrak{C}_{F,\omega_{1}(F)}(x,y),\mathfrak{C}_{F,\omega_{2}(F)}(x,y)$ , and $\mathfrak{C}_{F,\omega_{3}(F)}(x,y)$ , are pairwise non-proportional over ${\mathbb{C}}^{\times}$ .

Proof.

For the first statement, see [27, Proposition 4.6]. As for the second statement, since $\mathfrak{C}_{F,\omega_{i}(F)}(x,y)$ are covariants, replacing $F$ by a $\operatorname{GL}_{2}({\mathbb{R}})$ -translate if necessary, we may assume that $F(x,y)=a_{4}x^{4}+a_{2}x^{2}y^{2}\pm a_{4}y^{4}$ . In this special case, it is not hard to verify the claim using the explicit expressions for $\mathfrak{C}_{F,\omega_{i}(F)}(x,y)$ in [27, (4.6)]. ∎

Let $F$ be a real binary quartic form with $\Delta(F)\neq 0$ . Proposition 2.1 implies that for any real binary quadratic form $f$ with $\Delta(f)\neq 0$ , we have $F\in V_{{\mathbb{R}},f}$ if and only if

[TABLE]

Moreover, this root $\omega$ is unique, and we shall denote it by $\omega_{f}(F)$ . This was required in order to define the $L_{f}$ - and $K_{f}$ -invariants in (1.4).

2.2. Proof of Theorem 1.1

The key is the following lemma.

Lemma 2.2.

Let $F$ be an integral binary quartic form with $\Delta(F)\neq 0$ and let $\omega$ be a root of ${\mathcal{Q}}_{F}(x)$ . Then, the quadratic form $\mathfrak{C}_{F,\omega}(x,y)$ is proportional over ${\mathbb{C}}^{\times}$ to a form with integer coefficients if and only if $\omega\in{\mathbb{Z}}$ .

Proof.

If $\omega\in{\mathbb{Z}}$ , then we easily see from (2.1) that $\lambda\cdot\mathfrak{C}_{F,\omega}(x,y)$ has integer coefficients for some $\lambda\in{\mathbb{C}}^{\times}$ . Conversely, if $\lambda\cdot\mathfrak{C}_{F,\omega}(x,y)$ has integer coefficients for some $\lambda\in{\mathbb{C}}^{\times}$ , then consider the action of an element $\sigma\in\operatorname{Gal}(\overline{{\mathbb{Q}}}/{\mathbb{Q}})$ , where $\overline{{\mathbb{Q}}}$ is an algebraic closure of ${\mathbb{Q}}$ . It is clear from the definition of $\mathfrak{C}_{F,\omega}(x,y)$ that $\lambda\in\overline{{\mathbb{Q}}}$ . From (2.1), we have

[TABLE]

and this last binary quartic form has zero discriminant. This shows that $\omega-\sigma(\omega)=0$ for all $\sigma\in\operatorname{Gal}(\overline{{\mathbb{Q}}}/{\mathbb{Q}})$ . Thus, we have $\omega\in{\mathbb{Q}}$ , and so $\omega\in{\mathbb{Z}}$ since ${\mathcal{Q}}_{F}(x)$ is monic. ∎

The first claim in Theorem 1.1 now follows from Proposition 2.1, Lemma 2.2, and (2.3). Note that $\Delta(F)=27^{2}\Delta({\mathcal{Q}}_{F})$ , which means that ${\mathcal{Q}}_{F}(x)$ has three integer roots if and only if ${\mathcal{Q}}_{F}(x)$ is reducible and $\Delta(F)=\square$ . The second claim then follows from this fact and (2.2).

3. Basic properties of forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant

Throughout this section, let $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ be a real binary quadratic form with $\Delta(f)\neq 0$ . It is not hard to check, by a direct calculation, that

[TABLE]

if $\alpha\neq 0$ , and similarly that

[TABLE]

if $\beta,\beta^{2}+4\alpha\gamma\neq 0$ . Below, we shall give some basic properties of $V_{{\mathbb{R}},f}^{0}$ and $V_{{\mathbb{Z}},f}^{0}$ .

3.1. The two new invariants

Recall the definitions of the $L_{f}$ - and $K_{f}$ -invariants given in (1.4). First, we shall show that they are indeed invariants under the twisted action of $\operatorname{GL}_{2}({\mathbb{R}})$ in the following sense.

Proposition 3.1.

For all $F\in V_{{\mathbb{R}},f}^{0}$ and $T\in\operatorname{GL}_{2}({\mathbb{R}})$ , we have

[TABLE]

Proof.

Notice that ${\mathcal{Q}}_{F}(x)={\mathcal{Q}}_{F_{T}}(x)$ . For any root $\omega$ of ${\mathcal{Q}}_{F}(x)$ , because $\mathfrak{C}_{F,\omega}(x,y)$ is a covariant up to sign by (2.1), if $\mathfrak{C}_{F,\omega}(x,y)$ is proportional to $f(x,y)$ , then $\mathfrak{C}_{F_{T},\omega}(x,y)$ is proportional to $f_{T}(x,y)$ . It then follows from the definition that $L_{f_{T}}(F_{T})=L_{f}(F)$ . Since $I(F_{T})=I(F)$ , we also have $K_{f_{T}}(F_{T})=K_{f}(F)$ by the first equality in (1.5). ∎

We shall give explicit formulae for $L_{f}(-)$ and $K_{f}(-)$ in two special cases.

Proposition 3.2.

The following holds.

(a)

Assume that $\alpha\neq 0$ . Then, for all $F\in V_{{\mathbb{R}},f}^{0}$ as in (3.1), we have

[TABLE]

Moreover, we have

[TABLE]

where

[TABLE] 2. (b)

Assume that $\gamma=0$ . Then, for all $F\in V_{{\mathbb{R}},f}^{0}$ as in (3.2), we have

[TABLE]

Moreover, we have

[TABLE]

Proof.

This may be verified by explicit computation. ∎

We shall also need the following observation.

Proposition 3.3.

Assume that $f$ is integral. Then, for all $F\in V_{{\mathbb{Z}},f}^{0}$ , we have

[TABLE]

Moreover, when $f$ is primitive in addition, we have

[TABLE]

Proof.

We have $L_{f}(F)\in{\mathbb{Z}}$ by Lemma 2.2. Since $I(F)\in{\mathbb{Z}}$ , we deduce from the first equality in (1.5) that $K_{f}(F)\in{\mathbb{Z}}$ holds as well. Observe that

[TABLE]

both of which are integers. Since $\Delta(F)\in{\mathbb{Z}}$ , we deduce from (1.7) that at least one of the above expressions is divisible by $3$ . But again by (1.5), we have

[TABLE]

so in fact both expressions are divisible by $3$ . This proves the first claim.

Next, assume that $f$ is primitive in addition. In view of Proposition 3.1, by applying a $\operatorname{GL}_{2}({\mathbb{Z}})$ -action on $f$ if necessary, we may assume that $\alpha\neq 0$ and that $\alpha$ is coprime to $\Delta(f)$ . Using Proposition 3.2 (a), we then compute that

[TABLE]

This expression is an integer by the first claim, and hence must be divisible by $\Delta(f)$ , because $\alpha$ is taken to be coprime to $\Delta(f)$ . This proves the second claim. ∎

3.2. Determinants of the two lattices

In this subsection, assume that $f$ is integral and primitive. Let $\Lambda_{f,1}$ and $\Lambda_{f,2}$ denote the lattices defined in (1.10). Below, we shall compute their determinants in terms of the number $s_{f}$ as in Theorem 1.2.

Proposition 3.4.

We have $\det(\Lambda_{f,1})=s_{f}|\alpha|^{3}$ and $\det(\Lambda_{f,2})=s_{f}|\beta(\beta^{2}+4\alpha\gamma)|/8$ .

Proof.

Observe that the linear transformation defined by the matrix

[TABLE]

has determinant ${\mathcal{B}}$ , and it sends $\Lambda_{f,1}$ to $\Lambda_{f,2}$ . Thus, it suffices to prove the first claim. Recall from (3.1) that $\Lambda_{f,1}$ is the set of tuples $(A,B,C)\in{\mathbb{Z}}^{3}$ satisfying

[TABLE]

If $\beta\gamma=0$ , then it is easy to check that $\det(\Lambda_{f,1})=s_{f}|\alpha|^{3}$ . If $\beta\gamma\neq 0$ , then we shall use the fact that

[TABLE]

and so $\det(\Lambda_{f,1})=s_{f}|\alpha|^{3}$ indeed holds by Lemma 3.5 below. ∎

Lemma 3.5.

Let $p$ be a prime dividing $2\alpha$ and let $p^{k}\|\alpha$ . Then, we have

[TABLE]

Proof.

For brevity, write

[TABLE]

Then, the claim may be restated as

[TABLE]

By definition, the lattice $\Lambda_{f,1}^{(p)}$ is the set $(A,B,C)\in{\mathbb{Z}}_{p}^{3}$ of tuples satisfying

[TABLE]

where

[TABLE]

Observe that we have the relation

[TABLE]

For $\ell=0$ , we deduce from (3.3) that $\Lambda_{f,1}^{(p)}$ is defined solely by

[TABLE]

For $\ell\geq 1$ and $\ell\geq k+2\epsilon_{p}$ , it is easy to see that $\Lambda_{f,1}^{(p)}$ is in fact defined by

[TABLE]

For $\ell\geq 1$ and $\ell\leq k+\epsilon_{p}$ , we shall first show that $\Lambda_{f,1}^{(p)}$ is also defined by

[TABLE]

If (3.4) is satisfied, then from (3.3), it is easy to see that $(A,B,C)\in\Lambda_{f,1}^{(p)}$ . Conversely, if $(A,B,C)\in\Lambda_{f,1}^{(p)}$ , then the assumption $\ell\leq k+\epsilon_{p}$ implies that

[TABLE]

while reducing (3.3) mod $p^{2k+\ell+\epsilon_{p}}$ also yields

[TABLE]

From these three congruence equations, it follows that (3.4) is indeed satisfied. In all cases, we then see that $\det(\Lambda_{f,1}^{(p)})$ is as claimed. ∎

3.3. Forms with abelian Galois groups

In this subsection, assume that $f$ is integral. Consider an irreducible form $F\in V_{{\mathbb{Z}},f}^{0}$ . By Theorem 1.1, we have $\operatorname{Gal}(F)\simeq D_{4}$ , $C_{4}$ , or $V_{4}$ . To distinguish among these three possibilities, note that the cubic resolvent polynomial of $F$ , defined by

[TABLE]

when $F$ has the shape (1.1), is reducible since $\operatorname{Gal}(F)$ is small. Also, it has a unique root $r_{F}\in{\mathbb{Q}}$ precisely when $\Delta(F)\neq\square$ , in which case we define

[TABLE]

Then, we have the well-known criterion

[TABLE]

See [9] for example. We then deduce that:

Proposition 3.6.

Let $F\in V_{{\mathbb{Z}},f}^{0}$ be an irreducible form. Then, we have

[TABLE]

as well as

[TABLE]

Proof.

Observe that by (1.7), we have

[TABLE]

The first claim is then clear. Next, suppose that $\Delta(F)\neq\square$ . By Proposition 3.1, we may assume that $\alpha\neq 0$ . For $F$ in the shape as in (3.1), a direct computation yields

[TABLE]

Using Proposition 3.2 (a), we further compute that

[TABLE]

By (1.7) and the criterion above, it follows that $\theta_{1}(F),\theta_{2}(F)$ are squares if and only if $(L_{f}(F)^{2}+4K_{f}(F))(2L_{f}(F)^{2}-K_{f}(F))/\Delta(f)$ is a square, as desired. ∎

3.4. Reducible forms

In this subsection, assume that $f$ is integral. We shall study the reducible forms in $V_{{\mathbb{Z}},f}^{0}$ . Let us first make a definition and an observation.

Definition 3.7.

Let $F\in V_{{\mathbb{Z}},f}^{0}$ be a reducible form.

(1)

We say that $F$ is of type $1$ if $F=m\cdot pp_{M_{f}}$ for some $m\in{\mathbb{Q}}^{\times}$ and integral binary quadratic form $p$ . 2. (2)

We say that $F$ is of type $2$ if $F=pq$ for some integral binary quadratic forms $p$ and $q$ satisfying $p_{M_{f}}=-p$ and $q_{M_{f}}=-q$ .

Lemma 3.8.

For all reducible forms $F\in V_{{\mathbb{Z}},f}^{0}$ of type $1$ , we have

[TABLE]

Proof.

This may be verified by a direct computation. ∎

Below, we shall show that the two reducibility types in Definition 3.7 are in fact the only possibilities. We shall require two further lemmas.

Lemma 3.9.

Let $\ell(x,y)=\ell_{1}x+\ell_{0}y$ be a non-zero complex binary linear form, and suppose that $\ell_{M_{f}}=\lambda\cdot\ell$ for some $\lambda\in{\mathbb{C}}^{\times}$ . Then, we have $\lambda=\pm\sqrt{-1}$ , with

[TABLE]

in the case that $\alpha\neq 0$ .

Proof.

The hypothesis implies that

[TABLE]

Then, by computing the eigenvalues and eigenspaces of the $2\times 2$ matrix above, we see that the claim holds. ∎

Lemma 3.10.

Let $p(x,y)=p_{2}x^{2}+p_{1}xy+p_{0}y^{2}$ be a non-zero complex binary quadratic form, and suppose that $p_{M_{f}}=\lambda\cdot p$ for some $\lambda\in{\mathbb{C}}^{\times}$ . Then, we have $\lambda=\pm 1$ , with

[TABLE]

in the case that $\alpha\neq 0$ .

Proof.

The hypothesis implies that

[TABLE]

Then, by computing the eigenvalues and eigenspaces of the $3\times 3$ matrix above, it is not hard to check that the claim holds.∎

Proposition 3.11.

Any reducible form $F\in V_{{\mathbb{Z}},f}^{0}$ is either of type $1$ or of type $2$ .

Proof.

Write $F=g^{(1)}g^{(2)}g^{(3)}g^{(4)}$ , where the $g^{(k)}$ are complex binary linear forms, and are pairwise non-proportional because $\Delta(F)\neq 0$ . Since $F$ is reducible, by renumbering if necessary, we may assume that

[TABLE]

have integer coefficients and are irreducible. We have $M_{f}^{2}=\Delta(f)\cdot I_{2\times 2}$ and $F_{M_{f}}=F$ by definition. Hence, up to scaling, the matrix $M_{f}$ acts on the $g^{(k)}$ via a permutation $\sigma$ on four letters of order dividing two. This has two consequences.

By (1.8), without loss of generality, we may assume that $\alpha\neq 0$ . First, the form $F$ cannot have exactly one rational linear factor, for otherwise

[TABLE]

From Lemma 3.9, it would follow that $\Delta(f)$ is a square and that $g^{(k_{0})}$ is proportional to a form with integer coefficients, which is a contradiction. Second, when $F$ has four rational linear factors, by further renumbering if necessary, we may assume that

[TABLE]

Now, in all three of the possible cases for the factorization of $F$ , define

[TABLE]

which are integral binary quadratic forms by definition. We then deduce that

[TABLE]

for some $\lambda\in{\mathbb{Q}}^{\times}$ . In the former case, it is clear that $F$ is of type $1$ . In the latter case, we have $\lambda=-1$ by Lemma 3.10 and the fact that $\Delta(F)\neq 0$ , so $F$ is of type $2$ . ∎

4. Parametrizing forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant

Throughout this section, let $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ be a real binary quadratic form with $\Delta(f)\neq 0$ and $\alpha>0$ . We shall give an alternative parametrization of $V_{{\mathbb{R}},f}^{0}$ , different from (3.1) and (3.2), in terms of the regions

[TABLE]

corresponding to the $L_{f}$ - and $K_{f}$ -invariants, as well as a parameter $t\in{\mathbb{R}}$ arising from the orthogonal group of $f$ , defined by

[TABLE]

Note that by (1.7), for any $F\in V_{{\mathbb{R}},f}^{0}$ , we have

[TABLE]

First, we shall show that it suffices to consider $x^{2}+y^{2}$ and $x^{2}-y^{2}$ . It shall be helpful to recall (1.8) as well as the isomorphisms $\Theta_{1}$ and $\Theta_{2}$ defined in Subsection 1.1.

Lemma 4.1.

Define a matrix

[TABLE]

Then, we have a well-defined bijective linear map

[TABLE]

and we have $\det(\Psi_{f})=8\alpha^{3}|\Delta(f)|^{-3/2}$ .

Proof.

The first claim holds by (1.8) and the fact

[TABLE]

Identifying $V_{{\mathbb{R}},x^{2}\pm y^{2}}$ and $V_{{\mathbb{R}},f}$ with ${\mathbb{R}}^{3}$ via $\Theta_{1}$ , we see from (3.1) that

[TABLE]

from which the second claim follows. ∎

In the subsequent subsections, we shall prove the following propositions.

Proposition 4.2.

There exists an explicit bijection

[TABLE]

defined as in (4.4), such that

(a)

we have $L_{x^{2}+y^{2}}(\Phi(L,K,t))=L$ and $K_{x^{2}+y^{2}}(\Phi(L,K,t))=K$ , 2. (b)

the Jacobian matrix of $\Theta_{1}\circ\Phi$ has determinant $-1/18$ .

Proposition 4.3.

There exist explicit injections

[TABLE]

defined as in (4.6), with

[TABLE]

such that

(a)

we have $L_{x^{2}-y^{2}}(\Phi^{(i)}(L,K,t))=L$ and $K_{x^{2}-y^{2}}(\Phi^{(i)}(L,K,t))=K$ , 2. (b)

the Jacobian matrix of $\Theta_{1}\circ\Phi^{(i)}$ has determinant $-1/18$ ,

for all $i=1,2,3,4$ .

In view of (1.11), we shall give another parametrization of $V_{{\mathbb{R}},f}$ when $\gamma=0$ , which does not require reducing to the form $x^{2}-y^{2}$ via Lemma 4.1.

Proposition 4.4.

Suppose that $\gamma=0$ . Then, there exist explicit injections

[TABLE]

defined as in (4.9), with

[TABLE]

such that

(a)

we have $L_{f}(\Phi^{(i)}(L,K,t))=L$ and $K_{f}(\Phi^{(i)}(L,K,t))=K$ , 2. (b)

the Jacobian matrix of $\Theta_{2}\circ\Phi_{f}^{(i)}$ has determinant $-1/18$ ,

for both $i=1,2$ .

For $t\in{\mathbb{R}}$ , we shall use the notation

[TABLE]

which is an element of $O_{x^{2}+y^{2}}({\mathbb{R}})$ and $O_{x^{2}-y^{2}}({\mathbb{R}})$ , respectively.

4.1. Positive definite case

Define

[TABLE]

where

[TABLE]

The image of $\Phi$ lies in $V_{{\mathbb{R}},{x^{2}+y^{2}}}$ by (3.1) and (1.8). Using Propositions 3.1 and 3.2 (a), it is easy to check that Proposition 4.2 (a) holds.

Now, by (3.1), an arbitrary $F\in V_{{\mathbb{R}},x^{2}+y^{2}}^{0}$ has the shape

[TABLE]

Write $L=L_{x^{2}+y^{2}}(F)$ and $K=K_{x^{2}+y^{2}}(F)$ . Note that $(L,K)\in\Omega^{+}$ because $\Delta(F)>0$ by (1.7). For $t\in{\mathbb{R}}$ , a direct computation yields

[TABLE]

where

[TABLE]

It is not hard to show that there exists a unique $t_{0}\in(-\pi/4,\pi/4]$ such that $B(t_{0})=0$ and $2A(t_{0})-C(t_{0})>0$ . Put $(A,C)=(A(t_{0}),C(t_{0}))$ . Then, we have

[TABLE]

by Propositions 3.1 and 3.2 (a). We solve that $F_{T^{+}(t_{0})}=F_{(L,K)}$ , or equivalently

[TABLE]

Since $-t_{0}\in[-\pi/4,\pi/4)$ is uniquely determined by $F$ , this shows that $\Phi$ is a bijection.

Finally, the above calculation also yields

[TABLE]

where

[TABLE]

By a direct computation, we then see that Proposition 4.2 (b) holds.

4.2. Indefinite case

Define

[TABLE]

where

[TABLE]

for $i=1,2$ , and

[TABLE]

for $i=3,4$ . The images of $\Phi^{(1)},\Phi^{(2)},\Phi^{(3)},\Phi^{(4)}$ lie in $V_{{\mathbb{R}},{x^{2}-y^{2}}}$ by (3.1) and (1.8). Using Propositions 3.1 and 3.2 (a), it is easy to check that Proposition 4.3 (a) holds.

Now, by (3.1), an arbitrary $F\in V_{{\mathbb{R}},x^{2}-y^{2}}^{0}$ has the shape

[TABLE]

Write $L=L_{x^{2}-y^{2}}(F)$ and $K=K_{x^{2}-y^{2}}(F)$ . For $t\in{\mathbb{R}}$ , a direct computation yields

[TABLE]

where

[TABLE]

Note that $\frac{d}{dt}A(t)=\frac{1}{2}B(t)$ . It is not hard to check that:

•

If $\Delta(F)>0$ , then there is a unique $t_{0}\in{\mathbb{R}}$ such that $B(t_{0})=0$ .

•

If $\Delta(F)<0$ , then $B(t)\neq 0$ for all $t\in{\mathbb{R}}$ , and there is a unique $t_{0}\in{\mathbb{R}}$ such that $A(t_{0})=0$ .

Put $(A,B,C)=(A(t_{0}),B(t_{0}),C(t_{0}))$ . Then, we have

[TABLE]

by Propositions 3.1 and 3.2 (a). We solve that $F_{T^{-}(t_{0})}=F_{(L,K)}^{(i)}$ , or equivalently

[TABLE]

Since $t_{0}$ is uniquely determined by $F$ , this shows that $\Phi^{(1)},\Phi^{(2)},\Phi^{(3)},\Phi^{(4)}$ are all injections, and that the stated disjoint union holds.

Finally, the above calculation also yields

[TABLE]

where

[TABLE]

for $i=1,2$ , and

[TABLE]

for $i=3,4$ . By a direct computation, we then see that Proposition 4.3 (b) holds.

4.3. Reducible case

Suppose $\gamma=0$ . For $t\in{\mathbb{R}}$ , put

[TABLE]

which is an element of $O_{f}({\mathbb{R}})$ . Define

[TABLE]

where

[TABLE]

The images of $\Phi_{f}^{(1)},\Phi_{f}^{(2)}$ lie in $V_{{\mathbb{R}},f}$ by (3.2) and (1.8). Using Propositions 3.1 and 3.2 (b), it is easy to check that Proposition 4.4 (a) holds.

Now, by (3.2), an arbitrary $F\in V_{{\mathbb{R}},f}^{0}$ has the shape

[TABLE]

Write $L=L_{f}(F)$ and $K=K_{f}(F)$ . For $t\in{\mathbb{R}}$ , a direct computation yields

[TABLE]

where

[TABLE]

Since $\Delta(F)\neq 0$ , we have $(-1)^{i}a_{0}>0$ for a unique $i\in\{1,2\}$ , and there is a unique $t_{0}\in{\mathbb{R}}$ such that $C(t_{0})=(-1)^{i}\beta^{2}$ . Put $(A,B)=(A(t_{0}),B(t_{0}))$ . Then, we have

[TABLE]

by Propositions 3.1 and 3.2 (b). We solve that $F_{T(t_{0})}=F_{f,(L,K)}^{(i)}$ , or equivalently

[TABLE]

Since $t_{0}$ and $i$ are uniquely determined by $F$ , this shows that $\Phi_{f}^{(1)}$ and $\Phi_{f}^{(2)}$ are both injections, and that the stated disjoint union holds.

Finally, the above calculation also yields

[TABLE]

where

[TABLE]

By a direct computation, we then see that Proposition 4.4 (b) holds.

5. Definition of a bounded semi-algebraic set

Throughout this section, let $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ be an integral and primitive binary quadratic form with $\Delta(f)\neq 0$ and $\alpha>0$ , in the shape (1.11) whenever $f$ is reducible. As we have already explained in Subsection 1.1, the proof of Theorem 1.2 is reduced to counting points in the lattices in (1.10), which in turn amounts to certain volume computations, by the result below.

Proposition 5.1 (Davenport’s lemma).

Let ${\mathcal{R}}$ be a bounded semi-algebraic multi-set in ${\mathbb{R}}^{n}$ having maximum multiplicity $m$ and which is defined by at most $k$ polynomial inequalities, each having degree at most $\ell$ . Then, the number of integral lattice points (counted with multiplicity) contained in the region ${\mathcal{R}}$ is

[TABLE]

where $\operatorname{Vol}(\overline{{\mathcal{R}}})$ denotes the greatest $d$ -dimensional volume of any projection of ${\mathcal{R}}$ onto a coordinate subspace by equating $n-d$ coordinates to zero, with $1\leq d\leq n-1$ . The implied constant in the second summand depends only on $n,m,k,\ell$ .

Proof.

This is a result of Davenport [11], and the above formulation is due to Bhargava and Shankar in [4, Proposition 2.6]. ∎

For $X>0$ , define

[TABLE]

However, to prove Theorem 1.2, we cannot apply Proposition 5.1 directly to

[TABLE]

as in Subsection 1.1, to count the lattice points in $\Theta_{w(f)}(V_{{\mathbb{Z}},f})\subset\Lambda_{f,w(f)}$ because

(1)

the set $\Theta_{w(f)}(V_{{\mathbb{R}},f}^{0}(X))$ is unbounded when $f$ is indefinite, 2. (2)

distinct forms in $V_{{\mathbb{Z}},f}^{0}(X)$ might be $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent.

Recall (4.1) and define

[TABLE]

In the notation of Lemma 4.1 as well as Propositions 4.2, 4.3, and 4.4, we have

[TABLE]

respectively, if $f$ is positive definite, indefinite, and reducible. We shall overcome the two issues above by restricting the values for $t\in{\mathbb{R}}$ .

For brevity, in this section, write

[TABLE]

as in Theorem 1.2 and Lemma 4.1, respectively.

Definition 5.2.

If $f$ is positive definite, define

[TABLE]

If $f$ is reducible, define

[TABLE]

If $f$ is indefinite and irreducible, define

[TABLE]

where $t_{D_{f}}$ is defined as in Theorem 1.2 (c).

The goal of this section to prove the following preliminary results and estimates:

Proposition 5.3.

The set $\Theta_{w(f)}(S_{f}(X))$ is bounded, semi-algebraic, and definable by an absolutely bounded number of polynomial inequalities whose degrees are absolutely bounded.

Proposition 5.4.

The following statements hold.

(a)

A form in $V_{{\mathbb{Z}},f}^{0}(X)$ is $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to at least one form in ${\mathcal{S}}_{f}(X)$ . 2. (b)

A form in $V_{{\mathbb{Z}},f}^{0}(X)$ for which $\Delta(F)\neq\square$ is $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to exactly $r_{f}$ forms in ${\mathcal{S}}_{f}(X)$ , where $r_{f}$ is defined as in Theorem 1.2.

5.1. Alternative description

First, we shall give an alternative description of the set ${\mathcal{S}}_{f}(X)$ in terms of the coefficients of the forms in $V_{{\mathbb{R}},f}^{0}(X)$ .

Lemma 5.5.

If $f$ is positive definite, then ${\mathcal{S}}_{f}(X)=V_{{\mathbb{R}},f}^{0}(X)$ .

Proof.

This is clear from (5.1). ∎

Lemma 5.6.

If $f$ is reducible, then

[TABLE]

where $C_{F}$ denotes the $y^{4}$ -coefficient of $F$ .

Proof.

For $i=1,2$ and for any $F=\Phi_{f}^{(i)}(L,K,t)$ , we have $C_{F}=(-1)^{i}\beta^{2}e^{4t}$ by (4.11), and the claim is then clear from (5.1). ∎

Lemma 5.7.

If $f$ is an indefinite and irreducible, then

[TABLE]

where in the notation of Proposition 3.2 (a), we define

[TABLE]

and for $F$ in the image of $\Psi_{f}\circ\Phi^{(i)}$ , we define

[TABLE]

Proof.

For $i=1,2,3,4$ , consider $F=(\Psi_{f}\circ\Phi^{(i)})(L,K,t)$ . For $k=1,2$ , we have

[TABLE]

by a direct computation using (4.2), (4.7), and (4.8). We then see that

[TABLE]

from which the claim follows. ∎

5.2. Proof of Proposition 5.3

From (4.5), (4.7), (4.8), and (4.11), it is clear that the set ${\mathcal{S}}_{f}(X)$ is bounded. Thus, it remains to show that ${\mathcal{S}}_{f}(X)$ is a semi-algebraic set definable by an absolutely bounded number of polynomial inequalities whose degrees are absolutely bounded.

5.2.1. The case when $f$ is positive definite or reducible

The claim follows immediately from Lemmas 5.5 and 5.6 as well as Proposition 3.2.

5.2.2. The case when $f$ is indefinite and irreducible

The only problem is that $Z_{f}(F)$ is not a polynomial in the $x^{4}$ , $x^{3}y$ , and $x^{2}y^{2}$ -coefficients of $F$ . We shall resolve this issue in Lemma 5.8 below. The claim then follows from Lemma 5.7 and Proposition 3.2.

Lemma 5.8.

For $i=3,4$ , let $F\in(\Psi_{f}\circ\Phi^{(i)})(\Omega^{-}\times{\mathbb{R}})$ . Then, the condition

[TABLE]

is equivalent to an absolutely bounded number of polynomial inequalities in the variables $L_{f}(F),K_{f}(F),E_{f,1}(F),E_{f,2}(F)$ whose degrees are absolutely bounded.

Proof.

For brevity, define

[TABLE]

as well as write

[TABLE]

Note that $L^{2}+4K<0$ by (1.7) because $\Delta(F)<0$ . This implies that $Z<0$ and so the stated condition may be rewritten as

[TABLE]

By rearranging, we may further rewrite the above as

[TABLE]

From here, we shall consider the different possibilities for the signs of $E_{2}$ , $L$ , $Y_{1},Y_{2}$ . For example, when $E_{2}>0$ and $L\geq 0$ , the above is equivalent to $Y_{1}\leq 0$ and

[TABLE]

The other cases are analogous. We then see that the claim holds. ∎

5.3. Integral orthogonal groups

We shall require an explicit description of

[TABLE]

In the notation of Lemma 4.1, observe that

[TABLE]

Moreover, it is well-known that

[TABLE]

where $T^{+}(t)$ and $T^{-}(t)$ are defined as in (4.3), and

[TABLE]

We shall need the following lemma.

Lemma 5.9.

Suppose that $T\in O_{f}({\mathbb{Z}})\setminus\{\pm I_{2\times 2}\}$ has finite order. Then, the form $f$ is $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to a form of the shape

[TABLE]

for some integers $a,b$ , and $c$ .

Proof.

By [21, Chapter IX], for example, a finite cyclic subgroup of $\operatorname{GL}_{2}({\mathbb{Z}})$ not contained in $\{\pm I_{2\times 2}\}$ is conjugate to the subgroup generated by one of the following:

[TABLE]

We then deduce that there exists $P\in\operatorname{GL}_{2}({\mathbb{Z}})$ such that $Q=P^{-1}TP$ is equal to one of the following matrices up to sign:

[TABLE]

Since $f$ is primitive with $\alpha>0$ by assumption and $(f_{P})_{Q}=\pm f_{P}$ , we then check that $f_{P}$ must have one of the stated shapes. ∎

Proposition 5.10.

Suppose that $f$ is positive definite. Then, we have

[TABLE]

if $f$ is not $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to the forms below, and the group $O_{f}({\mathbb{Z}})$ is equal to

[TABLE]

Proof.

Elements in $O_{f}({\mathbb{Z}})$ have finite order by (5.2) and so the first claim follows from Lemma 5.9. Using (5.2), we compute that elements in $O_{f}({\mathbb{R}})$ are of the forms

[TABLE]

where $t\in{\mathbb{R}}$ and $(\phi_{t},\psi_{t})=(\cos t,\sin t)$ . With the help of the proof of Lemma 5.9, it is not hard to check that $O_{f}({\mathbb{Z}})$ is as claimed. ∎

Proposition 5.11.

Suppose that $f$ is reducible. Then, the group $O_{f}({\mathbb{Z}})$ is equal to

[TABLE]

Proof.

Using (5.2), we compute that elements in $O_{f}({\mathbb{R}})$ are of the forms

[TABLE]

where $t\in{\mathbb{R}}$ and $(\phi_{t},\psi_{t})\in\{(\cosh t,\sinh t),(\sinh t,\cosh t)\}$ . For the matrix on the left to have integer entries, necessarily

[TABLE]

Similarly, for the matrix on the right to have integer entries, necessarily

[TABLE]

We then deduce that

[TABLE]

Since $f$ has the shape (1.11) by assumption, we have

[TABLE]

and we see that the claim indeed holds. ∎

Proposition 5.12.

Suppose that $f$ is indefinite and irreducible. Define

[TABLE]

and $(u_{D_{f}},v_{D_{f}})\in{\mathbb{N}}^{2}$ is the least solution to $x^{2}-D_{f}y^{2}=\pm 4$ . Then, we have

[TABLE]

if $f$ is not $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalence to the forms below, and the group $O_{f}({\mathbb{Z}})$ is equal to

[TABLE]

Proof.

By (5.2), elements in $O_{f}({\mathbb{R}})$ of infinite order are of the shape

[TABLE]

where $t\in{\mathbb{R}}$ and $(\phi_{t},\psi_{t})\in\{(\cosh t,\sinh t),(\sinh t,\cosh t)\}$ . We then see that

[TABLE]

Hence, the first claim follows from Lemma 5.9 and the fact that $ax^{2}+bxy+ay^{2}$ is $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to the form

[TABLE]

Now, again by (5.2), elements in $O_{f}({\mathbb{R}})$ of finite order have the shape

[TABLE]

where $t\in{\mathbb{R}}$ and $(\phi_{t},\psi_{t})\in\{(\cosh t,\sinh t),(\sinh t,\cosh t)\}$ . Notice that the matrix on the left cannot lie in $\operatorname{GL}_{2}({\mathbb{Z}})$ because $D_{f}$ is not square when $f$ is irreducible. Using the description of $O_{x^{2}-y^{2}}({\mathbb{R}})$ , it is then not hard to check that $[O_{f}({\mathbb{Z}}):G_{f}({\mathbb{Z}})]\leq 2$ , from which the second claim follows. ∎

5.4. Proof of Theorem 1.4

Suppose that $f(x,y)=\alpha x^{2}+\beta xy-\alpha y^{2}$ and that $D_{f}$ is not a square. In the notation of Proposition 5.12, we have

[TABLE]

by definition. But Proposition 5.12 also implies that $\det(T_{D_{f}})=-1$ is equivalent to

[TABLE]

The theorem now follows from Lemma 5.9 and (5.4).

5.5. Proof of Proposition 5.4

We shall need the following lemma.

Lemma 5.13.

For all $F\in V_{{\mathbb{Z}},f}^{0}$ with $\Delta(F)\neq\square$ and $T\in\operatorname{GL}_{2}({\mathbb{Z}})\setminus\{\pm I_{2\times 2}\}$ , we have

(a)

$F_{T}\in V_{{\mathbb{Z}},f}^{0}$ * if and only if $T\in O_{f}({\mathbb{Z}})$ ,* 2. (b)

$F_{T}=F$ * if and only if $T=\pm D_{f}^{-1/2}M_{f}$ .*

Proof.

Note that $F_{T}\in V_{{\mathbb{Z}},f_{T}}^{0}$ by (1.8). By Theorem 1.1 (a), we then have $F_{T}\in V_{{\mathbb{Z}},f}^{0}$ if and only if $f_{T}=\pm f$ , whence part (a) holds. By Theorem 1.1 (a) and Proposition 2.1, we have $F_{T}=F$ if and only if $T$ is proportional to $M_{f}$ , from which part (b) follows since $\det(T)=\pm 1$ . ∎

5.5.1. The case when $f$ is positive definite or reducible

Let us first observe that:

Lemma 5.14.

We have $V_{{\mathbb{Z}},f}^{0}(X)\subset{\mathcal{S}}_{f}(X)$ .

Proof.

Let $F\in V_{{\mathbb{Z}},f}^{0}(X)$ be given. If $f$ is positive definite, then clearly $F\in{\mathcal{S}}_{f}(X)$ by Lemma 5.5. If $f$ is reducible, then recall Lemma 5.6, and we have $F\in{\mathcal{S}}_{f}(X)$ since

[TABLE]

by (4.10) and Proposition 3.2 (b), respectively. ∎

Lemma 5.14 implies that part (a) holds. Together with Lemma 5.13 (a, it further implies that for $F\in V_{{\mathbb{Z}},f}^{0}(X)$ with $\Delta(F)\neq\square$ , the number of forms in ${\mathcal{S}}_{f}(X)$ which are $\operatorname{GL}_{2}({\mathbb{Z}})$ -equivalent to $F$ is equal to

[TABLE]

By Lemma 5.13 (b), we in turn have

[TABLE]

which may be verified to be equal to $r_{f}$ using Propositions 5.10 and 5.11.

5.5.2. The case when $f$ is indefinite and irreducible

We shall use the notation from Lemma 4.1, Proposition 5.12, (4.3), and (5.3). Then, by definition, we have

[TABLE]

Now, by (5.1) and (4.6), a form in $V_{{\mathbb{Z}},f}^{0}(X)$ is of the shape

[TABLE]

Observe that $J_{1}$ and $J_{2}$ commute with $T^{-}(t)$ as well as fix the forms in $V_{{\mathbb{R}},x^{2}-y^{2}}$ . For any $n\in{\mathbb{Z}}$ , we then deduce that

[TABLE]

Let $n_{1}\in{\mathbb{Z}}$ be the unique integer such that $0\leq t+n_{1}t_{D_{f}}<t_{D_{f}}$ . The existence of $n_{1}$ then implies part (a).

Next, suppose that $\Delta(F)\neq\square$ , in which case

[TABLE]

by Lemma 5.13 (a). If $O_{f}({\mathbb{Z}})=G_{f}({\mathbb{Z}})$ , then part (b) holds by the uniqueness of $n_{1}$ . If $O_{f}({\mathbb{Z}})\neq G_{f}({\mathbb{Z}})$ , then recall from Proposition 5.12 that

[TABLE]

From (5.2), we see that

[TABLE]

Then, for any $n\in{\mathbb{Z}}$ , it is straightforward to verify that

[TABLE]

There is a unique $n_{2}\in{\mathbb{Z}}$ such that $0\leq-(t+n_{2}t_{D_{f}})+t_{0}<t_{D_{f}}$ . Observe that

[TABLE]

But $T_{D_{f}}^{n_{2}-n_{1}}M$ has finite order, and so it cannot proportional to $M_{f}$ by (5.5), which is a contradiction by Lemma 5.13 (b). Then, we conclude from Proposition 5.12 that part (b) indeed holds.

6. Error estimates and the main theorem

Throughout this section, let $f(x,y)=\alpha x^{2}+\beta xy+\gamma y^{2}$ be an integral and primitive binary quadratic form with $\Delta(f)\neq 0$ and $\alpha>0$ , in the shape (1.11) whenever $f$ is reducible. Let $D_{f},r_{f}$ and $s_{f}$ be as in Theorem 1.2.

In Subsections 6.1 and 6.2, respectively, we shall first prove:

Proposition 6.1.

For any $\epsilon>0$ , we have

[TABLE]

and

[TABLE]

Further, the number

[TABLE]

is equal to zero if $-\Delta(f)\neq\square$ . and is bounded by $O_{f}(X)$ otherwise.

Propositions 6.1, 3.6, and 5.4 then imply part d) of Theorem 1.2.

The reader should compare the last claim above with [28, Theorem 1.4].

Proposition 6.2.

We have

[TABLE]

Now, from Propositions 5.4, 6.1, and 6.2, we also easily see that

[TABLE]

Let ${\mathcal{L}}_{f,w(f)}$ be a linear transformation on ${\mathbb{R}}^{3}$ which takes $\Lambda_{f,w(f)}$ to ${\mathbb{Z}}^{3}$ , and define

[TABLE]

as before. Observe that then

[TABLE]

By Proposition 5.3, we may apply Proposition 5.1 to obtain

[TABLE]

where by Proposition 3.4, we know that

[TABLE]

Hence, it remains to compute the above volumes, which we shall do in Subsection 6.3.

6.1. Proof of Proposition 6.1

Recall the notation from Proposition 3.2. By definition and Proposition 3.3, we then have a well-defined map

[TABLE]

Using Proposition 3.2, it is easy to verify that $\iota$ is in fact injective. We shall also need the following result due to Heath-Brown [18].

Lemma 6.3.

Let $\xi(x_{1},x_{2},x_{3})$ be a ternary quadratic form such that its corresponding matrix $M_{\xi}$ has non-zero determinant. For $B_{1},B_{2},B_{3}>0$ , let $N_{\xi}(B_{1},B_{2},B_{3})$ denote the number of tuples $(x_{1},x_{2},x_{3})\in{\mathbb{Z}}^{3}$ such that

[TABLE]

Then, we have

[TABLE]

where $\det_{0}(M_{\xi})$ denotes the greatest common divisor of the $2\times 2$ minors of $M_{\xi}$ , and $d_{3}(|\det(M_{\xi})|)$ is the number of ways to write $|\det(M_{\xi})|$ as a product of three positive integers.

Proof.

See [18, Corollary 2].∎

In what follows, consider $F\in{\mathcal{S}}_{f}(X)\cap V_{{\mathbb{Z}},f}^{0}$ , and for brevity, write

[TABLE]

Since $\iota$ is injective, it is enough to estimate the number of choices for $(L,L_{1},L_{2})$ . To that end, let us put ${\mathcal{D}}_{f}=\Delta(f)$ . Recall from Propositions 3.2 and 3.3 that

[TABLE]

which is non-zero by (1.7). By the definition of our height, we also have

[TABLE]

The latter estimate holds by

[TABLE]

as well as the fact that $L_{1}$ and $L_{2}$ are linear in the coefficients of $F$ . Finally, we shall write $d(-)$ for the divisor function.

Proof of Proposition 6.1: first claim.

Suppose that $L^{2}+4K=\square$ . Then, we have

[TABLE]

If $f$ is reducible, then ${\mathcal{D}}_{f}=\square$ and so clearly there are

[TABLE]

choices for the pair $(L_{1},L_{2})$ . If $f$ is irreducible, then note that

[TABLE]

and applying Lemma 6.3 to the ternary quadratic form $\xi$ with matrix

[TABLE]

we deduce from (6.3) that there are

[TABLE]

choices for the pair $(L_{1},L_{2})$ . In both cases, we see that there are

[TABLE]

choices for $(L,L_{1},L_{2})$ in total, whence the claim. ∎

Proof of Proposition 6.1: second claim.

Suppose that $(L^{2}+4K)(2L^{2}-K)/{\mathcal{D}}_{f}=\square$ . By Proposition 3.3, we may write

[TABLE]

From the hypothesis, we then easily see that

[TABLE]

as well as that $m$ divides $L$ . In particular, a simple calculation yields

[TABLE]

Now, suppose also that $L\neq 0$ , in which case $m=O_{f}(X^{1/2})$ by (6.3). Note also that

[TABLE]

Applying Lemma 6.3 to the ternary quadratic form $\xi_{m}$ with matrix

[TABLE]

we then see from (6.3) that there are

[TABLE]

choices for $(x,u,v)$ when $m$ is fixed. It follows that we have

[TABLE]

choices for $(m,x,u,v)$ and hence for $(L,K)$ .

Next, regard $(L,K)$ as being fixed, and recall that

[TABLE]

We claim that there are $O_{f}(d(T))$ choices for $(L_{1},L_{2})$ . If $f$ is positive definite or if $f$ is reducible, then this is clear. If $f$ is indefinite and irreducible, then by Definition 5.2 as well as Propositions 3.1 and 4.3, we have

[TABLE]

Since ${\mathcal{D}}_{f}>0$ , we must have $L^{2}+4K>0$ by the hypothesis, and so in fact $i\in\{1,2\}$ . From the proof of Lemma 5.7, we know that

[TABLE]

which implies that

[TABLE]

Since $t=O_{f}(1)$ , we then deduce that indeed there are $O_{f}(d(T))$ choices for $(L_{1},L_{2})$ . Using the bound $d(T)=O_{\epsilon}(T^{\epsilon})=O_{f,\epsilon}(X^{\epsilon})$ , we conclude that there are

[TABLE]

choices for $(L,L_{1},L_{2})$ in total, whence the claim. ∎

Proof of Proposition 6.1: third claim.

Suppose that $L=0$ and that $F$ is in the shape as in (3.1). Using Proposition 3.2, we then deduce that

[TABLE]

Hence

[TABLE]

from which it follows that the above expression is a square if and only if $-{\mathcal{D}}_{f}$ is a square. This also follows immediately from the observation that the above product is equal to $-4K^{2}/{\mathcal{D}}_{f}$ in this case.

We now suppose that $-\Delta(f)=\square$ , so in particular $f$ is positive definite. $F$ is then determined by $(A,B)\in{\mathbb{Z}}^{2}$ , and that $|K|\leq X$ implies

[TABLE]

Hence there are $O_{f}(X)$ choices for $(A,B)$ . It follows that the claim holds. ∎

6.2. Proof of Proposition 6.2

By Lemma 3.8 and Proposition 6.1, we have

[TABLE]

whence it is enough to consider the reducible forms in ${\mathcal{S}}_{f}(X)\cap V_{{\mathbb{Z}},f}^{0}$ of type 2; recall Definition 3.7. By definition, such a form has the shape

[TABLE]

where $p_{2},p_{1},p_{0},q_{2},q_{1},q_{0}\in{\mathbb{Z}}$ , and we have

[TABLE]

by Lemma 3.10. We have the condition

[TABLE]

since the above numbers are all integers. Using Proposition 3.2 (a), we compute that

[TABLE]

Now, by the definition of our height, we clearly have

[TABLE]

Observe also that

[TABLE]

by (4.7), (4.8), (4.2), and the bound $0\leq t<t_{D_{f}}$ . We then deduce that

[TABLE]

where we define

[TABLE]

It is clear that this set is bounded and semi-algebraic. Hence, we may apply Proposition 5.1 to estimate the number of integral points it contains.

6.2.1. The case when $f$ is irreducible

Let us define

[TABLE]

Applying Proposition 5.1, we then obtain

[TABLE]

For any $(u_{2},u_{1},v_{2},v_{1})\in{\mathcal{R}}_{f}^{\prime\prime}(X)$ , from (6.5) and (6.6), we deduce that

[TABLE]

as well as that

[TABLE]

This, together with (6.7), implies that in fact

[TABLE]

We then compute that

[TABLE]

The claim now follows from (6.4) and (6.8).

6.2.2. The case when $f$ is reducible

Let us define

[TABLE]

Since $D_{f}=\square$ in this case, we see that

[TABLE]

Now, applying Proposition 5.1, we have

[TABLE]

For any $(z_{1},z_{2},z_{3},z_{4})\in{\mathcal{R}}_{f}^{\prime\prime}(X)$ , the conditions (6.5) and (6.6) imply that

[TABLE]

which is analogous to (6.9). We then compute that

[TABLE]

The claim now follows from (6.4) and (6.8).

6.3. Proof of Theorem 1.2

We have already proven part (d). To prove parts (a) through (c), it remains to compute the volumes in (6.2).

6.3.1. The case when $f$ is positive definite

We have

[TABLE]

by Lemma 4.1 and Proposition 4.2 (b), as well as

[TABLE]

Observe also that

[TABLE]

because $\Theta_{1}({\mathcal{S}}_{f}(X))$ lies in the cube centered at the origin of side length $O_{f}(X^{1/2})$ by (4.5) and (4.2). We then deduce part (a) from (6.1) and (6.2).

6.3.2. The case when $f$ is reducible

We have

[TABLE]

by Proposition 4.4, as well as

[TABLE]

We then deduce part (b) from Lemma 6.4 below as well as (6.1) and (6.2).

Lemma 6.4.

We have $\operatorname{Vol}(\overline{\Theta_{2}({\mathcal{S}}_{f}(X))})=O_{f}(X^{3/2})$ .

Proof.

By Definition 5.2, an element in $\Theta_{2}({\mathcal{S}}_{f}(X))$ takes the form

[TABLE]

Let us recall that

[TABLE]

Then, from (4.11), we see that $1$ -dimensional projections of $\Theta_{2}({\mathcal{S}}_{f}(X))$ have lengths of order $O_{f}(X)$ . As for the $2$ -dimensional projections, note that (5.1) and (6.10) yield

[TABLE]

as well as the estimates

[TABLE]

Hence, the projections of $\Theta_{2}({\mathcal{S}}_{f}(X))$ onto the $BC$ -plane and $AC$ -plane, respectively, have areas bounded by

[TABLE]

Similarly, from (5.1) and (6.10), we deduce that

[TABLE]

as well as the estimate

[TABLE]

Note that $|L|\leq X^{1/2}$ also implies that

[TABLE]

Hence, the projection of $\Theta_{2}({\mathcal{S}}_{f}(X))$ onto the $AB$ -plane has area bounded by

[TABLE]

It follows that all of the $2$ -dimensional projections of $\Theta_{2}({\mathcal{S}}_{f}(X))$ have areas of order $O_{f}(X^{3/2})$ , and this proves the lemma.∎

6.3.3. The case when $f$ is indefinite and irreducible

We have

[TABLE]

by Lemma 4.1 and Proposition 4.3, as well as

[TABLE]

Observe also that

[TABLE]

because $\Theta_{1}({\mathcal{S}}_{f}(X))$ lies in the cube centered at the origin of side length $O_{f}(X^{1/2})$ by (4.7), (4.8), (4.2), and the bound on $t$ . We then deduce part (c) from (6.1) and (6.2).

7. Acknowledgments

The first-named author was partially supported by the China Postdoctoral Science Foundation Special Financial Grant (grant number: 2017T100060). We would like to thank the referee for many useful suggestions which helped improve the exposition of the paper significantly.

Bibliography28

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Altug, A. Shankar, I. Varma, and K. Wilson, The number of quartic D 4 subscript 𝐷 4 D_{4} -fields ordered by conductor , ar Xiv:1704.01729 v 1 [math.NT].
2[2] M. Bhargava, Higher composition laws III. The parametrization of quartic rings , Ann. of Math. 159 (2004), no. 3, 1329-1360.
3[3] M. Bhargava, Higher composition laws IV: The parametrization of quintic rings , Ann. of Math. 167 (2008), no.1, 53-94.
4[4] M. Bhargava and A. Shankar, Binary quartic forms having bounded invariants, and the boundedness of the average rank of elliptic curves , Ann. of Math. 181 (2015), no. 1, 191-242.
5[5] M. Bhargava, A. Shankar, and J. Tsimerman, On the Davenport-Heilbronn theorems and second order terms , Invent. Math. 193 (2013), no. 2, 193-439 .
6[6] M . Bhargava and A. Shnidman, On the number of cubic orders of bounded discriminant having automorphism group C 3 subscript 𝐶 3 C_{3} , and related problems , Algebra and Number Theory 8 (2014), no. 1, 53-88.
7[7] B. J. Birch and J. R. Merriman, Finiteness theorems for binary forms with given discriminant , Proc. London Math. Soc. 24 (1972), no. 3, 385-394.
8[8] A. Brumer, The average rank of elliptic curves. I , Invent. Math. 109 (1992), no. 1, 445-472.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Binary quartic forms with bounded invariants

Abstract.

Contents

1. Introduction

1.1. Set-up and notation

Theorem 1.1**.**

1.2. Statement of the main theorem

Theorem 1.2**.**

Corollary 1.3**.**

Proof.

Theorem 1.4**.**

2. Characterization of forms with small Galois groups

2.1. Cremona covariants

Proposition 2.1**.**

Proof.

2.2. Proof of Theorem 1.1

Lemma 2.2**.**

Proof.

3. Basic properties of forms in VR,fV_{{\mathbb{R}},f}VR,f​ of non-zero discriminant

3.1. The two new invariants

Proposition 3.1**.**

Proof.

Proposition 3.2**.**

Proof.

Proposition 3.3**.**

Proof.

3.2. Determinants of the two lattices

Proposition 3.4**.**

Proof.

Lemma 3.5**.**

Proof.

3.3. Forms with abelian Galois groups

Proposition 3.6**.**

Proof.

3.4. Reducible forms

Definition 3.7**.**

Lemma 3.8**.**

Proof.

Lemma 3.9**.**

Proof.

Lemma 3.10**.**

Proof.

Proposition 3.11**.**

Proof.

4. Parametrizing forms in VR,fV_{{\mathbb{R}},f}VR,f​ of non-zero discriminant

Lemma 4.1**.**

Proof.

Proposition 4.2**.**

Proposition 4.3**.**

Proposition 4.4**.**

4.1. Positive definite case

4.2. Indefinite case

4.3. Reducible case

5. Definition of a bounded semi-algebraic set

Proposition 5.1** (Davenport’s lemma).**

Proof.

Definition 5.2**.**

Proposition 5.3**.**

Proposition 5.4**.**

5.1. Alternative description

Lemma 5.5**.**

Proof.

Lemma 5.6**.**

Proof.

Lemma 5.7**.**

Proof.

5.2. Proof of Proposition 5.3

5.2.1. The case when fff is positive definite or reducible

5.2.2. The case when fff is indefinite and irreducible

Lemma 5.8**.**

Proof.

5.3. Integral orthogonal groups

Lemma 5.9**.**

Proof.

Theorem 1.1.

Theorem 1.2.

Corollary 1.3.

Theorem 1.4.

Proposition 2.1.

Lemma 2.2.

3. Basic properties of forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant

Proposition 3.1.

Proposition 3.2.

Proposition 3.3.

Proposition 3.4.

Lemma 3.5.

Proposition 3.6.

Definition 3.7.

Lemma 3.8.

Lemma 3.9.

Lemma 3.10.

Proposition 3.11.

4. Parametrizing forms in $V_{{\mathbb{R}},f}$ of non-zero discriminant

Lemma 4.1.

Proposition 4.2.

Proposition 4.3.

Proposition 4.4.

Proposition 5.1 (Davenport’s lemma).

Definition 5.2.

Proposition 5.3.

Proposition 5.4.

Lemma 5.5.

Lemma 5.6.

Lemma 5.7.

5.2.1. The case when $f$ is positive definite or reducible

5.2.2. The case when $f$ is indefinite and irreducible

Lemma 5.8.

Lemma 5.9.

Proposition 5.10.

Proposition 5.11.

Proposition 5.12.

Lemma 5.13.

5.5.1. The case when $f$ is positive definite or reducible

Lemma 5.14.

5.5.2. The case when $f$ is indefinite and irreducible

Proposition 6.1.

Proposition 6.2.

Lemma 6.3.

6.2.1. The case when $f$ is irreducible

6.2.2. The case when $f$ is reducible

6.3.1. The case when $f$ is positive definite

6.3.2. The case when $f$ is reducible

Lemma 6.4.

6.3.3. The case when $f$ is indefinite and irreducible