Areas of triangles and SL_2 actions in finite rings

Alex McDonald

arXiv:1906.04674·math.CO·June 12, 2019

TL;DR

This paper extends the concept of triangle area to points over finite rings and demonstrates that large point sets in these rings produce many distinct area-based configurations, invariant under SL_2 actions.

Contribution

It introduces a generalized area formula over finite rings and proves that large subsets generate a positive proportion of all possible triangle configurations.

Findings

01

Large subsets in finite rings produce many distinct triangle types.

02

The area formula is invariant under SL_2(R) actions.

03

Results apply to both finite fields and modular rings.

Abstract

In Euclidean space, one can use the dot product to give a formula for the area of a triangle in terms of the coordinates of each vertex. Since this formula involves only addition, subtraction, and multiplication, it can be used as a definition of area in $R^{2}$, where $R$ is an arbitrary ring. The result is a quantity associated with triples of points which is still invariant under the action of $\text{SL}_{2}(R)$. One can then look at a configuration of points in $R^{2}$ in terms of the triangles determined by pairs of points and the origin, considering two such configurations to be of the same type if corresponding pairs of points determine the same areas. In this paper we consider the cases $R=\mathbb{F}_{q}$ and $R=\mathbb{Z}/p^{\ell}\mathbb{Z}$, and prove that sufficiently large subsets of $R^{2}$ must produce a positive proportion of all such types of configurations.

Equations

$$\Delta(E)=\{(x_{1}-y_{1})^{2}+\cdots+(x_{d}-y_{d})^{2}:x,y\in E\}.$$

$$\Pi(E)=\{x_{1}y_{1}+\cdots+x_{d}y_{d}:x,y\in E\},$$

$$s(x,y)=1-\frac{(x\cdot y)^{2}}{\|x\|\|y\|},$$

$$g=(y^{i}\ y^{j})(x^{i}\ x^{j})^{-1}.$$

$$\begin{pmatrix}x_{1}^{i}&x_{1}^{j}\\x_{2}^{i}&x_{2}^{j}\end{pmatrix}\begin{pmatrix}a\\b\end{pmatrix}=x^{n}$$

$$gx^{n}=g(ax^{i}+bx^{j})=agx^{i}+bgx^{j}=ay^{i}+by^{j}=y^{n}.$$

$$\psi(x_{1}^{1},x_{2}^{1},\dots,x_{1}^{k+1},x_{2}^{k+1})=(p^{\ell-1}x_{1}^{1},x_{2}^{1},\dots,p^{\ell-1}x_{1}^{k+1},x_{2}^{k+1}).$$

$$(p^{\ell-1}x_{1}^{i},x_{2}^{i})\cdot(p^{\ell-1}x_{1}^{j},x_{2}^{j})^{\perp}=p^{\ell-1}x^{i}\cdot x^{j\perp}=0.$$

$$|E|^{2(k+1)}\lesssim\mathcal{C}_{k+1}(E)\sum_{g\in\text{SL}_{2}(R)}f(g)^{k+1}.$$

$$|G|^{2}\leq|G/\sim|\cdot|\{(x,y)\in G\times G:x\sim y\}|.$$

$$|\{(x,y)\in G\times G:x\sim y\}|=\sum_{x,y\in G}\ \sum_{g:\,y=gx}1.$$

$$\sum_{x,y\in E^{k+1}}\ \sum_{g:\,y=gx}1$$

$$A=\frac{1}{|S|}\sum_{x\in S}F(x)$$

$$M=\sup_{x\in S}F(x)$$

$$\sum_{x\in S}F(x)^{2}=A^{2}|S|+R.$$

$$\sum_{x\in S}F(x)^{k+1}\leq c_{k}(M^{k-1}R+A^{k+1}|S|).$$

$$\sum_{x\in S}(F(x)-A)^{2}$$

$$\sum_{x\in S}F(x)^{k+1}=\sum_{x\in S}(F(x)-A)^{k}F(x)+\sum_{j=0}^{k-1}\binom{k}{j}(-1)^{k-j+1}A^{k-j}\sum_{x\in S}F(x)^{j+1}.$$

$$\sum_{x\in S}(F(x)-A)^{k}F(x)\leq M^{k-1}\sum_{x\in S}(F(x)-A)^{2}=M^{k-1}R.$$

$$\sum_{j=0}^{k-1}\binom{k}{j}(-1)^{k-j+1}A^{k-j}\sum_{x\in S}F(x)^{j+1}$$

$$\varphi(x,y)=\frac{|G|}{|X|}$$

$$\sum_{g\in G}h(gx_{0})=\frac{|G|}{|X|}\sum_{x\in X}h(x).$$

$$\sum_{x,y\in X}\varphi(x,y)=\sum_{g\in G}\ \sum_{\substack{x,y\in X\\gx=y}}1.$$

$$\sum_{g}f(g)^{2}=\frac{|E|^{4}}{q}+O(q^{2}|E|^{2}).$$

$$\sum_{g}f(g)^{2}=\sum_{x^{1},x^{2},y^{1},y^{2}}E(x^{1})E(x^{2})E(y^{1})E(y^{2})\sum_{g:\,gx=y}1.$$

$$\sum_{g}f(g)^{2}=O(|E|^{2}q^{2})+\sum_{t}\ \sum_{\substack{x^{1},x^{2},y^{1},y^{2}\\x^{1}\cdot x^{2\perp}=t\\y^{1}\cdot y^{2\perp}=t}}E(x^{1})E(x^{2})E(y^{1})E(y^{2})=O(|E|^{2}q^{2})+\sum_{t}\nu(t)^{2}.$$

Taxonomy

Topics: Finite Group Theory Research · Limits and Structures in Graph Theory · Analytic Number Theory Research

Full text

1 Introduction

There are several interesting combinatorial problems asking whether a sufficiently large subset of a vector space over a finite field must generate many different objects of some type. The best-known example is the Erdős-Falconer problem, which asks whether such a set must contain all possible distances, or at least a positive proportion of distances. More precisely, given $E\subset\mathbb{F}_{q}^{d}$ we define the distance set

$$\Delta(E)=\{(x_{1}-y_{1})^{2}+\cdots+(x_{d}-y_{d})^{2}:x,y\in E\}.$$

Obviously, $\Delta(E)\subset\mathbb{F}_{q}$. The Erdős-Falconer problem asks for an exponent $s$ such that $\Delta(E)=\mathbb{F}_{q}$, or more generally $|\Delta(E)|\gtrsim q$, whenever $|E|\gtrsim q^{s}$. (Throughout, the notation $X\lesssim Y$ means there is a constant $C$ such that $X\leq CY$, $X\approx Y$ means $X\lesssim Y$ and $Y\lesssim X$, and $O(X)$ denotes a quantity that is $\lesssim X$.) In [9], Iosevich and Rudnev proved that $\Delta(E)=\mathbb{F}_{q}$ if $|E|\gtrsim q^{\frac{d+1}{2}}$. In [8], Hart, Iosevich, Koh, and Rudnev proved that the exponent $\frac{d+1}{2}$ cannot be improved in odd dimensions, although it has been improved to $4/3$ in the case $d=2$ (first in [3] in the case $q\equiv 3\pmod{4}$ by Chapman, Erdoğan, Hart, Iosevich, and Koh, then in general in [2] by Bennett, Hart, Iosevich, Pakianathan, and Rudnev). Several interesting variants of the distance problem have been studied as well. A result of Pham, Phuong, Sang, Valculescu, and Vinh studies the problem when distances between pairs of points are replaced with distances between points and lines in $\mathbb{F}_{q}^{2}$; they prove that if sets $P$ and $L$ of points and lines, respectively, satisfy $|P||L|\gtrsim q^{8/3}$, then they determine a positive proportion of all distances [12]. Birklbauer, Iosevich, and Pham proved an analogous result about distances determined by points and hyperplanes in $\mathbb{F}_{q}^{d}$ [1].

We can replace distances with dot products and ask the analogous question. Let

$$\Pi(E)=\{x_{1}y_{1}+\cdots+x_{d}y_{d}:x,y\in E\},$$

and again ask for an exponent $s$ such that $|E|\gtrsim q^{s}$ implies that $\Pi(E)$ contains all of $\mathbb{F}_{q}$ (or at least a positive proportion of its elements). Hart and Iosevich prove in [6] that the exponent $s=\frac{d+1}{2}$ works for this question as well. The proof is quite similar to the proof of the same exponent in the Erdős-Falconer problem; in each case, the authors consider a function which counts, for each $t\in\mathbb{F}_{q}$, the number of representations of $t$ as, respectively, a distance and a dot product determined by the set $E$. These representation functions are then studied using techniques from Fourier analysis.
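To make the two sets concrete, the following Python sketch computes $\Delta(E)$ and $\Pi(E)$ by brute force. It is only an illustration and not part of the original argument: it assumes $q$ is prime, so that $\mathbb{F}_{q}$ can be modeled by integers modulo $q$, and the function names `distance_set` and `dot_product_set` are hypothetical.

```python
from itertools import product

def distance_set(E, q):
    """Delta(E) = {sum_i (x_i - y_i)^2 mod q : x, y in E}, for prime q."""
    return {sum((a - b) ** 2 for a, b in zip(x, y)) % q for x in E for y in E}

def dot_product_set(E, q):
    """Pi(E) = {sum_i x_i * y_i mod q : x, y in E}, for prime q."""
    return {sum(a * b for a, b in zip(x, y)) % q for x in E for y in E}

q = 7  # a small odd prime, chosen only for illustration
plane = list(product(range(q), repeat=2))  # all of F_q^2
line = [(t, 0) for t in range(q)]          # one line through the origin

# The whole plane determines every element of F_q in both senses.
print(len(distance_set(plane, q)), len(dot_product_set(plane, q)))
# A single line only determines the squares in F_q as distances.
print(sorted(distance_set(line, q)))
```

The line example shows why a size threshold is needed: a set of size $q$ can miss roughly half of all distances.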

Another interesting variant of this problem was studied in [10], where Lund, Pham, and Vinh defined the angle between two vectors in analogue with the usual geometric interpretation of the dot product. Namely, given vectors $x$ and $y$ , they consider the quantity

$$s(x,y)=1-\frac{(x\cdot y)^{2}}{\|x\|\|y\|},$$

where $\|x\|=x_{1}^{2}+\cdots+x_{d}^{2}$ is the finite field distance defined above. Note that since we cannot always take square roots in finite fields, the finite field distance corresponds to the square of the Euclidean distance; therefore, $s(x,y)$ above is the correct finite field analogue of $\sin^{2}\theta$, where $\theta$ is the angle between the vectors $x$ and $y$. This creates a variant of the dot product problem, since one can obtain different dot products from the same angle by varying length. The authors go on to prove that the exponent $\frac{d+2}{2}$ guarantees a positive proportion of angles.

It is of interest to generalize these types of results to point configurations. By a $(k+1)$-point configuration in $\mathbb{F}_{q}^{d}$, we simply mean an element of $(\mathbb{F}_{q}^{d})^{k+1}$. Throughout, we will use superscripts to denote different vectors in a given configuration, and subscripts to denote the coordinates of each vector. For example, when $d=2$ a $(k+1)$-point configuration $x$ is made up of vectors $x^{1},...,x^{k+1}$, each of which has coordinates $x_{1}^{i},x_{2}^{i}$. Given a set $E\subset\mathbb{F}_{q}^{d}$, we can consider $(k+1)$-point configurations in $E$ (i.e., elements of $E^{k+1}$) and ask whether $E$ must contain a positive proportion of all configurations, up to some notion of equivalence. For example, we may view $(k+1)$-point configurations as simplices, with geometric congruence as our notion of equivalence; two simplices are congruent if there is a translation and a rotation that maps one onto the other. Since a $1$-simplex is simply a pair of points, and two such simplices are congruent if and only if the distance is the same, congruence classes simply correspond to distances. Hence, the Erdős-Falconer distance problem may be viewed as the $k=1$ case of the simplex congruence problem. In [7], Hart and Iosevich prove that $E$ contains the vertices of a congruent copy of every non-degenerate simplex (non-degenerate here means the points are in general position) whenever $|E|\gtrsim q^{\frac{kd}{k+1}+\frac{k}{2}}$. However, in order for this result to be non-trivial the exponent must be $<d$, and that only happens when $\binom{k+1}{2}<d$. So, the result is limited to fairly small configurations. This result is improved in [2] by Bennett, Hart, Iosevich, Pakianathan, and Rudnev, who prove that for any $k\leq d$ a set $E\subset\mathbb{F}_{q}^{d}$ determines a positive proportion of all congruence classes of $(k+1)$-point configurations provided $|E|\gtrsim q^{d-\frac{d-1}{k+1}}$. This exponent is clearly non-trivial for all $k$. In [11], I extended this result to the case $k\geq d$.

In this paper, we consider a different notion of equivalence. We will consider the problem over both finite fields and rings of integers modulo powers of primes, so I will define the equivalence relation in an arbitrary ring.

Definition 1.

Let $R$ be a ring, and let $E\subset R^{2}$. We define an equivalence relation $\sim$ on $E^{k+1}$ by $(x^{1},...,x^{k+1})\sim(y^{1},...,y^{k+1})$ (or more briefly $x\sim y$) if and only if for each pair $i,j$ we have $x^{i}\cdot x^{j\perp}=y^{i}\cdot y^{j\perp}$. Define $\mathcal{C}_{k+1}(E)$ to be the set of equivalence classes of $E^{k+1}$ under this relation.
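As a quick illustration of Definition 1, the sketch below computes the tuple of pairwise values $x^{i}\cdot x^{j\perp}$ for a configuration over $\mathbb{Z}/n\mathbb{Z}$ and checks, on one example, that it is unchanged by a determinant-one matrix. It assumes the sign convention $x\cdot y^{\perp}=x_{1}y_{2}-x_{2}y_{1}$; the helper names `perp_dot`, `act`, and `area_type`, and the particular matrix and points, are illustrative choices.

```python
def perp_dot(x, y, n):
    """x . y^perp = x1*y2 - x2*y1 in Z/nZ (assumed sign convention)."""
    return (x[0] * y[1] - x[1] * y[0]) % n

def act(g, x, n):
    """Apply the 2x2 matrix g = ((a, b), (c, d)) to the column vector x, mod n."""
    (a, b), (c, d) = g
    return ((a * x[0] + b * x[1]) % n, (c * x[0] + d * x[1]) % n)

def area_type(config, n):
    """All pairwise values x^i . x^{j perp} with i < j; this determines the class."""
    m = len(config)
    return tuple(perp_dot(config[i], config[j], n)
                 for i in range(m) for j in range(i + 1, m))

n = 9                    # the ring Z/9Z, i.e. p = 3 and l = 2
g = ((2, 3), (1, 2))     # det = 2*2 - 3*1 = 1, so g is in SL_2(Z/9Z)
x = [(1, 0), (4, 5), (7, 2)]
y = [act(g, v, n) for v in x]

# Both configurations have the same area type, so x ~ y.
print(area_type(x, n), area_type(y, n))
```

Restricting to pairs $i<j$ loses nothing, since $x^{j}\cdot x^{i\perp}=-x^{i}\cdot x^{j\perp}$ and $x^{i}\cdot x^{i\perp}=0$.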

In the Euclidean setting, $\frac{1}{2}|x\cdot y^{\perp}|$ is the area of the triangle with vertices $0,x,y$ . So, we may view each pair of points in a $(k+1)$ -point configuration as determining a triangle with the origin, and we consider two such configurations to be equivalent if the triangles they determine all have the same areas. As we will prove in section $2$ , this equivalence relation is closely related to the action of $\text{SL}_{2}(R)$ on tuples of points; except for some degenerate cases, two configurations are equivalent if and only if there is a unique $g$ mapping one to the other. This allows us to analyze the problem in terms of this action; in section 2, we define a counting function $f(g)$ and reduce matters to estimating the sum $\sum_{g}f(g)^{k+1}$ . In section 3, we show how to turn an estimate for $\sum_{g}f(g)^{2}$ into an estimate for $\sum_{g}f(g)^{k+1}$ . Since we already understand the $k=1$ case (it is essentially the same as the dot product problem discussed above), this reduction allows us to obtain a non-trivial result. Our first theorem is as follows.

Theorem 1.

Let $q$ be a power of an odd prime, and let $E\subset\mathbb{F}_{q}^{2}$ satisfy $|E|\gtrsim q^{s}$ , where $s={2-\frac{1}{k+1}}$ . Then $\mathcal{C}_{k+1}(E)\gtrsim\mathcal{C}_{k+1}(\mathbb{F}_{q}^{2})$ .

In addition to proving this theorem, we will consider the case where the finite field $\mathbb{F}_{q}$ is replaced by the ring $\mathbb{Z}/p^{\ell}\mathbb{Z}$ . The structure of the proof is largely the same; the dot product problem over such rings is studied in [4], giving us the $k=1$ case, and the machinery which lifts that case to arbitrary $k$ works the same way. However, many details in the proofs are considerably more complicated. The theorem is as follows.

Theorem 2.

Let $p$ be an odd prime, let $\ell\geq 1$ , and let $E\subset(\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}$ satisfy $|E|\gtrsim\ell^{\frac{2}{k+1}}p^{\ell s}$ , where $s=2-\frac{1}{\ell(k+1)}$ . Then $\mathcal{C}_{k+1}(E)\gtrsim\mathcal{C}_{k+1}((\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}).$

We first note that, as we would expect, Theorem 2 coincides with Theorem 1 in the case $\ell=1$ . We also note that, for fixed $p$ and $k$ , the exponent in Theorem 2 is always less than 2, but it tends to $2$ as $\ell\to\infty$ . This does not happen in the finite field case, where the exponent depends on $k$ but not on the size of the field.
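The two remarks above are simple arithmetic on the exponents, which the following sketch tabulates with exact fractions. The function names are hypothetical, and the sketch only compares the exponents $s$; it ignores the factor $\ell^{\frac{2}{k+1}}$ in Theorem 2.

```python
from fractions import Fraction

def exponent_thm1(k):
    """Exponent s = 2 - 1/(k+1) from Theorem 1."""
    return 2 - Fraction(1, k + 1)

def exponent_thm2(l, k):
    """Exponent s = 2 - 1/(l(k+1)) from Theorem 2."""
    return 2 - Fraction(1, l * (k + 1))

for k in (1, 2, 5):
    # With l = 1 the two exponents agree.
    assert exponent_thm2(1, k) == exponent_thm1(k)
    # For fixed k the exponent increases toward 2 as l grows.
    print(k, [str(exponent_thm2(l, k)) for l in (1, 2, 4, 8)])
```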

Finally, we want to state the extent to which these results are sharp. There are examples which show that the exponent must tend to $2$ as $k\to\infty$ in the finite field case, and as either $k\to\infty$ or $\ell\to\infty$ in the $\mathbb{Z}/p^{\ell}\mathbb{Z}$ case.

Theorem 3 (Sharpness).

We have the following:

(i) For any $s<2-\frac{2}{k+1}$, there exists $E\subset\mathbb{F}_{q}^{2}$ such that $|E|\approx q^{s}$ and $\mathcal{C}_{k+1}(E)=o(\mathcal{C}_{k+1}(\mathbb{F}_{q}^{2}))$.

(ii) For any $s<2-\min\left(\frac{2}{k+1},\frac{1}{\ell}\right)$, there exists $E\subset(\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}$ such that $|E|\approx p^{\ell s}$ and $\mathcal{C}_{k+1}(E)=o(\mathcal{C}_{k+1}((\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}))$.

2 Characterization of the equivalence relation in terms of the $\text{SL}_{2}(R)$ action

Our main tool in reducing the problem of $(k+1)$-point configurations to the $k=1$ case is the fact that we can express the equivalence relation in terms of the action of the special linear group; with some exceptions, tuples $x$ and $y$ are equivalent if and only if there exists a unique $g\in\text{SL}_{2}(R)$ such that for each $i$, we have $y^{i}=gx^{i}$. In order to use this, we need to bound the number of exceptions to this rule. This is easy in the finite field case, and a little more tricky in the $\mathbb{Z}/p^{\ell}\mathbb{Z}$ case. The goal of this section is to describe and bound the number of exceptional configurations in each case. We begin with a definition.

Definition 2.

Let $R$ be a ring. A configuration $x=(x^{1},...,x^{k+1})\in(R^{2})^{k+1}$ is called good if there exist two indices $i,j$ such that $x^{i}\cdot x^{j\perp}$ is a unit. A configuration is bad if it is not good.

As we will see, the good configurations are precisely those for which equivalence is determined by the action of $\text{SL}_{2}(R)$ . To prove this, we will need the following theorems about determinants of matrices over rings, which can be found in [5], section 11.4.

Theorem 4.

Let $R$ be a ring, let $A_{1},...,A_{n}$ be the columns of an $n\times n$ matrix $A$ with entries in $R$ . Fix an index $i$ , and let $A^{\prime}$ be the matrix obtained from $A$ by replacing column $A_{i}$ by $c_{1}A_{1}+\cdots+c_{n}A_{n}$ , for some $c_{1},...,c_{n}\in R$ . Then $\det(A^{\prime})=c_{i}\det(A)$ .

Theorem 5.

Let $R$ be a ring, and let $A$ be an $n\times n$ matrix with entries in $R$ . The matrix $A$ is invertible if and only if $\det(A)$ is a unit in $R$ .

Theorem 6.

Let $R$ be a ring, and let $A$ and $B$ be $n\times n$ matrices with entries in $R$ . Then $\det(AB)=\det(A)\det(B)$ .

We are now ready to prove that equivalence of good configurations is given by the action of the special linear group.

Lemma 1.

Let $R$ be a ring, and let $x,y$ be good configurations such that $x^{i}\cdot x^{j\perp}=y^{i}\cdot y^{j\perp}$ for every pair of indices $i,j$ . Then there exists a unique $g\in\text{SL}_{2}(R)$ such that $y^{i}=gx^{i}$ for each $i$ .

Proof.

Because $x$ and $y$ are good, there exist indices $i$ and $j$ such that $x^{i}\cdot x^{j\perp}$ is a unit; equivalently, the determinant of the $2\times 2$ matrix with columns $x^{i}$ and $x^{j}$ is a unit. Denote this matrix by $(x^{i}\ x^{j})$. By Theorem 5, this matrix is invertible. Let

$$g=(y^{i}\ y^{j})(x^{i}\ x^{j})^{-1}.$$

Since $g(x^{i}\ x^{j})=(gx^{i}\ gx^{j})$, it follows that $y^{i}=gx^{i}$ and $y^{j}=gx^{j}$. Also note that $\det(x^{i}\ x^{j})=x^{i}\cdot x^{j\perp}=y^{i}\cdot y^{j\perp}=\det(y^{i}\ y^{j})$, so by Theorem 6 we have $\det(g)=1$. Let $n$ be any other index. We want to write $x^{n}=ax^{i}+bx^{j}$; this amounts to solving the matrix equation

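$$\begin{pmatrix}x_{1}^{i}&x_{1}^{j}\\x_{2}^{i}&x_{2}^{j}\end{pmatrix}\begin{pmatrix}a\\b\end{pmatrix}=x^{n}.$$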

Since we have already established the matrix $(x^{i}\ x^{j})$ is invertible, we can solve for $a$ and $b$ . Similarly, let $y^{n}=a^{\prime}y^{i}+b^{\prime}y^{j}$ . By Theorem 4, we have $\det(x^{i}\ x^{n})=b\det(x^{i}\ x^{j})$ and $\det(y^{i}\ y^{n})=b^{\prime}\det(y^{i}\ y^{j})$ . It follows that $b=b^{\prime}$ , and an analogous argument yields $a=a^{\prime}$ . Therefore,

$$gx^{n}=g(ax^{i}+bx^{j})=agx^{i}+bgx^{j}=ay^{i}+by^{j}=y^{n}.$$

So, we have established existence. To prove uniqueness, note that $g$ must satisfy $g(x^{i}\ x^{j})=(y^{i}\ y^{j})$, and since $(x^{i}\ x^{j})$ is invertible we can solve for $g$. ∎
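The construction in the proof is explicit enough to run. The sketch below, for $R=\mathbb{Z}/n\mathbb{Z}$, searches for a pair $i,j$ with $x^{i}\cdot x^{j\perp}$ a unit and returns $g=(y^{i}\ y^{j})(x^{i}\ x^{j})^{-1}$, inverting the $2\times 2$ matrix through its adjugate. The names `recover_g`, `mat_mul`, and `act` are illustrative, the sign convention $x\cdot y^{\perp}=x_{1}y_{2}-x_{2}y_{1}$ is assumed, and the three-argument `pow` with exponent $-1$ requires Python 3.8 or later.

```python
def perp_dot(x, y, n):
    """x . y^perp = x1*y2 - x2*y1 in Z/nZ (assumed sign convention)."""
    return (x[0] * y[1] - x[1] * y[0]) % n

def mat_mul(A, B, n):
    """Product of two 2x2 matrices (tuples of rows), reduced mod n."""
    return tuple(tuple(sum(A[r][t] * B[t][c] for t in range(2)) % n
                       for c in range(2)) for r in range(2))

def act(g, x, n):
    """Apply the 2x2 matrix g to the column vector x, mod n."""
    return tuple((g[r][0] * x[0] + g[r][1] * x[1]) % n for r in range(2))

def recover_g(x, y, n):
    """Return g = (y^i y^j)(x^i x^j)^{-1} mod n, or None if x is a bad configuration."""
    for i in range(len(x)):
        for j in range(len(x)):
            d = perp_dot(x[i], x[j], n)      # determinant of the matrix (x^i x^j)
            try:
                d_inv = pow(d, -1, n)        # raises ValueError when d is not a unit
            except ValueError:
                continue
            # Inverse of the matrix with columns x^i, x^j: adjugate times d^{-1}.
            X_inv = ((x[j][1] * d_inv % n, -x[j][0] * d_inv % n),
                     (-x[i][1] * d_inv % n, x[i][0] * d_inv % n))
            Y = ((y[i][0], y[j][0]), (y[i][1], y[j][1]))
            return mat_mul(Y, X_inv, n)
    return None

n = 9
g = ((2, 3), (1, 2))                 # det = 1 in Z/9Z
x = [(1, 0), (4, 5), (7, 2)]         # good: (1,0) . (4,5)^perp = 5 is a unit mod 9
y = [act(g, v, n) for v in x]
g_rec = recover_g(x, y, n)
print(g_rec)
```

In this example the recovered matrix agrees with the matrix used to build $y$, as the uniqueness part of the lemma predicts.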

Now that we know that good tuples allow us to use the machinery we need, we must prove that the bad tuples are negligible.

Lemma 2.

Let $R$ be a ring and let $E\subset R^{2}$ . We have the following:

(i) If $R=\mathbb{F}_{q}$, then $E^{k+1}$ contains $\lesssim q^{k}|E|$ bad tuples. In particular, if $|E|\gtrsim q^{1+\varepsilon}$ for any constant $\varepsilon>0$, the number of bad tuples in $E^{k+1}$ is $o(|E|^{k+1})$.

(ii) If $R=\mathbb{Z}/p^{\ell}\mathbb{Z}$, the number of bad tuples in $(R^{2})^{k+1}$ is $\lesssim p^{(2\ell-1)(k+1)+1}$. In particular, if $|E|\gtrsim p^{2\ell-1+\frac{1}{k+1}+\varepsilon}$ for any constant $\varepsilon>0$, then the number of bad tuples in $E^{k+1}$ is $o(|E|^{k+1})$.

Proof.

We first prove the first claim. Since the only non-unit of $\mathbb{F}_{q}$ is 0, a bad tuple must consist of $k+1$ points which all lie on a line through the origin. Therefore, we may choose $x^{1}$ to be anything in $E$ , after which the next $k$ points must be chosen from the $q$ points on the line through the origin and $x^{1}$ .

To prove the second claim, first observe that the number of tuples where at least one coordinate is a non-unit is $p^{2(\ell-1)(k+1)}$, which is less than the claimed bound. So, it suffices to bound the set of bad tuples where all coordinates are units. Let $B$ be this set. Define

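$$\psi(x_{1}^{1},x_{2}^{1},\dots,x_{1}^{k+1},x_{2}^{k+1})=(p^{\ell-1}x_{1}^{1},x_{2}^{1},\dots,p^{\ell-1}x_{1}^{k+1},x_{2}^{k+1}).$$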

If $x\in B$ , then $x^{i}\cdot x^{j\perp}$ is a non-unit, meaning it is divisible by $p$ , and

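$$(p^{\ell-1}x_{1}^{i},x_{2}^{i})\cdot(p^{\ell-1}x_{1}^{j},x_{2}^{j})^{\perp}=p^{\ell-1}x^{i}\cdot x^{j\perp}=0.$$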

Therefore, $\psi$ maps bad tuples $x$ to tuples $y$ with $y^{i}\cdot y^{j\perp}=0$ , or $y_{1}^{i}y_{2}^{j}-y_{1}^{j}y_{2}^{i}=0$ . Rearranging, using the fact that the second coordinate of each $y^{i}$ is a unit, we conclude that $\frac{y_{1}^{i}}{y_{2}^{i}}$ is a constant independent of $i$ which is divisible by $p^{\ell-1}$ . In other words, each $y^{i}$ is on a common line through the origin and a point $(n,1)$ where $p^{\ell-1}|n$ . There are $p$ such lines, and once we fix a line there are $p^{\ell(k+1)}$ choices of tuples $y$ . Therefore, $|\psi(B)|\leq p\cdot p^{\ell(k+1)}$ . Finally, we observe that the map $\psi$ is $p^{(\ell-1)(k+1)}$ -to-1. This gives us the claimed bound on $|B|$ . ∎
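For very small parameters the bad tuples can simply be enumerated. The sketch below counts them by brute force over $\mathbb{Z}/p^{\ell}\mathbb{Z}$ (with $\ell=1$ giving $\mathbb{F}_{p}$); it is a sanity check rather than a verification of the bounds, since $\lesssim$ hides constants. The function names are hypothetical, and the sign convention $x\cdot y^{\perp}=x_{1}y_{2}-x_{2}y_{1}$ is assumed.

```python
from itertools import product

def perp_dot(x, y, n):
    """x . y^perp = x1*y2 - x2*y1 in Z/nZ (assumed sign convention)."""
    return (x[0] * y[1] - x[1] * y[0]) % n

def is_unit(a, p):
    """Units of Z/p^l Z are exactly the residues not divisible by p."""
    return a % p != 0

def count_bad(E, k, p, l=1):
    """Number of (k+1)-tuples from E with no pair whose perp-dot is a unit mod p^l."""
    n = p ** l
    bad = 0
    for config in product(E, repeat=k + 1):
        if not any(is_unit(perp_dot(u, v, n), p) for u in config for v in config):
            bad += 1
    return bad

p = 3
plane_f3 = list(product(range(p), repeat=2))        # all of F_3^2
plane_z9 = list(product(range(p ** 2), repeat=2))   # all of (Z/9Z)^2

print(count_bad(plane_f3, 1, p), count_bad(plane_f3, 2, p))
print(count_bad(plane_z9, 1, p, l=2))
```

Over $\mathbb{F}_{3}$ the counts match the description in the proof: a bad tuple is one whose points all lie on a common line through the origin.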

Lemma 3.

Let $R$ be either $\mathbb{F}_{q}$ or $\mathbb{Z}/p^{\ell}\mathbb{Z}$ . Let $E\subset R^{2}$ , and let $G\subset E^{k+1}$ be the set of good tuples. Suppose $|E|\gtrsim q^{1+\varepsilon}$ if $R=\mathbb{F}_{q}$ and $|E|\gtrsim p^{2\ell-1+\frac{1}{k+1}+\varepsilon}$ if $R=\mathbb{Z}/p^{\ell}\mathbb{Z}$ . For $g\in\text{SL}_{2}(R)$ , define $f(g)=\sum_{x}E(x)E(gx)$ . Then

$$|E|^{2(k+1)}\lesssim\mathcal{C}_{k+1}(E)\sum_{g\in\text{SL}_{2}(R)}f(g)^{k+1}.$$

Proof.

By Cauchy-Schwarz, we have

$$|G|^{2}\leq|G/\sim|\cdot|\{(x,y)\in G\times G:x\sim y\}|.$$

By assumption and Lemma 2, $|E|^{k+1}\approx|G|$, and therefore the left hand side above is $\approx|E|^{2(k+1)}$. Since $G\subset E^{k+1}$, the right hand side above is $\leq\mathcal{C}_{k+1}(E)|\{(x,y)\in G\times G:x\sim y\}|$. It remains to prove $|\{(x,y)\in G\times G:x\sim y\}|\leq\sum_{g\in\text{SL}_{2}(R)}f(g)^{k+1}$. By Lemma 1,

$$|\{(x,y)\in G\times G:x\sim y\}|=\sum_{x,y\in G}\ \sum_{g:\,y=gx}1.$$

By extending the sum over $G$ to one over all of $E^{k+1}$ , we bound the above sum by

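$$\sum_{x,y\in E^{k+1}}\ \sum_{g:\,y=gx}1=\sum_{g}\ \sum_{x^{1},\dots,x^{k+1}}\ \prod_{i=1}^{k+1}E(x^{i})E(gx^{i})=\sum_{g}\Big(\sum_{x}E(x)E(gx)\Big)^{k+1}=\sum_{g}f(g)^{k+1}.$$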

∎

3 Lifting $L^{2}$ estimates to $L^{k+1}$ estimates

In both the case $R=\mathbb{F}_{q}$ and $R=\mathbb{Z}/p^{\ell}\mathbb{Z}$ , results are known for pairs of points, which is essentially the $k=1$ case. The finite field version was studied in [6], and the ring of integers modulo $p^{\ell}$ was studied in [4]. In section 2, we defined a function $f$ on $\text{SL}_{2}(R)$ and related the number of equivalence classes determined by a set to the sum $\sum_{g}f(g)^{k+1}$ . Since results are known for the $k=1$ case, we have information about the sum $\sum_{g}f(g)^{2}$ . We wish to turn that into a bound for $\sum_{g}f(g)^{k+1}$ . This is achieved with the following lemma.

Lemma 4.

Let $S$ be a finite set, and let $F:S\to\mathbb{R}_{\geq 0}$ . Let

$$A=\frac{1}{|S|}\sum_{x\in S}F(x)$$

denote the average value of $F$ , and

$$M=\sup_{x\in S}F(x)$$

denote the maximum. Finally, suppose

$$\sum_{x\in S}F(x)^{2}=A^{2}|S|+R.$$

Then there exist constants $c_{k}$ , depending only on $k$ , such that

$$\sum_{x\in S}F(x)^{k+1}\leq c_{k}(M^{k-1}R+A^{k+1}|S|).$$

Proof.

We proceed by induction. For the base case, let $c_{1}=1$ and observe that the claimed bound is the one we assumed for $\sum_{x}F(x)^{2}$ . Now, let $\{c_{k}\}$ be any sequence such that $k\binom{k}{j}c_{j}\leq c_{k}$ holds for all $j<k$ ; for example, $c_{k}=2^{k^{2}}$ works. Now, suppose the claimed bound holds for all $1\leq j<k$ , and also observe that the bound is trivial for $j=0$ . By direct computation, we have

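$$\sum_{x\in S}(F(x)-A)^{2}=\sum_{x\in S}F(x)^{2}-2A\sum_{x\in S}F(x)+A^{2}|S|=(A^{2}|S|+R)-2A^{2}|S|+A^{2}|S|=R.$$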

We also have

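$$\sum_{x\in S}F(x)^{k+1}=\sum_{x\in S}(F(x)-A)^{k}F(x)+\sum_{j=0}^{k-1}\binom{k}{j}(-1)^{k-j+1}A^{k-j}\sum_{x\in S}F(x)^{j+1}.$$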

To bound the first term, we simply use the trivial bound. Since $F(x)\leq M$ for all $x$ , $A\leq M$ , and $F(x),A\geq 0$ , we conclude $|F(x)-A|\leq M$ for each $x$ . Therefore,

$$\sum_{x\in S}(F(x)-A)^{k}F(x)\leq M^{k-1}\sum_{x\in S}(F(x)-A)^{2}=M^{k-1}R.$$

To bound the second term, we use the inductive hypothesis and the triangle inequality. We have

[TABLE]

Since $A\leq M$ , it follows that $A^{k-j}M^{j-1}R\leq M^{k-1}R$ for any $j<k$ , so the claimed bound holds. ∎
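The algebraic identities used in this proof can be checked numerically on any nonnegative function. The sketch below uses exact rational arithmetic on an arbitrary sample list standing in for the values of $F$; the sample values and the function names are illustrative, and `math.comb` requires Python 3.8 or later. It checks the variance identity $\sum_{x}(F(x)-A)^{2}=R$, the binomial decomposition of $\sum_{x}F(x)^{k+1}$, and the bound on the first term for a few values of $k\geq 2$.

```python
from fractions import Fraction
from math import comb

def lemma4_quantities(F):
    """Return (A, M, R): the average, the maximum, and R = sum F^2 - A^2 |S|."""
    S = len(F)
    A = Fraction(sum(F), S)
    M = max(F)
    R = sum(v * v for v in F) - A * A * S
    return A, M, R

def power_sum_decomposition(F, k):
    """Right-hand side of the expansion of sum_x F(x)^{k+1} used in the proof."""
    A, _, _ = lemma4_quantities(F)
    first = sum((v - A) ** k * v for v in F)
    second = sum(comb(k, j) * (-1) ** (k - j + 1) * A ** (k - j)
                 * sum(v ** (j + 1) for v in F)
                 for j in range(k))
    return first + second

F = [0, 1, 1, 2, 5, 3]   # arbitrary nonnegative sample values
A, M, R = lemma4_quantities(F)

# Variance identity: sum (F - A)^2 = R.
assert sum((v - A) ** 2 for v in F) == R
for k in (2, 3, 4):
    # Binomial decomposition of the (k+1)-st power sum.
    assert power_sum_decomposition(F, k) == sum(v ** (k + 1) for v in F)
    # Trivial bound on the first term.
    assert sum((v - A) ** k * v for v in F) <= M ** (k - 1) * R
print(A, M, R)
```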

4 Some lemmas about the action of $\text{SL}_{2}(R)$

Lemma 5.

Let $G$ be a finite group acting transitively on a finite set $X$ . Define $\varphi:X\times X\to\mathbb{N}$ by $\varphi(x,y)=|\{g\in G:gx=y\}|$ . We have

$$\varphi(x,y)=\frac{|G|}{|X|}$$

for every pair $x,y$ . If $h:X\to\mathbb{C}$ and $x_{0}\in X$ , then

$$\sum_{g\in G}h(gx_{0})=\frac{|G|}{|X|}\sum_{x\in X}h(x).$$

Proof.

The second statement follows from the first by a simple change of variables. To prove the first, we have

$$\sum_{x,y\in X}\varphi(x,y)=\sum_{g\in G}\ \sum_{\substack{x,y\in X\\gx=y}}1.$$

On the right, for any fixed $g$, one can choose any $x$ and there is a unique corresponding $y$, so the inner sum is $|X|$ and the right hand side is therefore $|G||X|$. On the other hand, $\varphi$ is constant. To prove this, let $x,y,z,w\in X$ and let $h_{1},h_{2}\in G$ be such that $h_{1}x=z$ and $h_{2}w=y$. This means for any $g$ with $gz=w$, we have $(h_{2}gh_{1})x=y$, so $\varphi(z,w)\leq\varphi(x,y)$. By symmetry, equality holds. If $c$ is the constant value of $\varphi(x,y)$, the left hand side above must be $c|X|^{2}$, and therefore $c=\frac{|G|}{|X|}$ as claimed. ∎
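Both statements of Lemma 5 can be checked by enumeration in a small case. The sketch below takes $G=\text{SL}_{2}(\mathbb{F}_{p})$ acting on the nonzero vectors of $\mathbb{F}_{p}^{2}$, an action which is transitive; the choice $p=5$, the test function `h`, and the base point `x0` are arbitrary illustrative choices.

```python
from itertools import product

def sl2(p):
    """All 2x2 matrices over F_p (p prime) with determinant 1."""
    return [((a, b), (c, d)) for a, b, c, d in product(range(p), repeat=4)
            if (a * d - b * c) % p == 1]

def act(g, x, p):
    """Apply the 2x2 matrix g to the column vector x, mod p."""
    (a, b), (c, d) = g
    return ((a * x[0] + b * x[1]) % p, (c * x[0] + d * x[1]) % p)

p = 5
G = sl2(p)
X = [v for v in product(range(p), repeat=2) if v != (0, 0)]  # nonzero vectors

# phi(x, y) = number of g in G with g x = y.
phi = {(x, y): 0 for x in X for y in X}
for g in G:
    for x in X:
        phi[(x, act(g, x, p))] += 1

def h(v):
    """An arbitrary test function on X."""
    return v[0] ** 2 + 3 * v[1]

x0 = (1, 0)
lhs = sum(h(act(g, x0, p)) for g in G)
rhs = (len(G) // len(X)) * sum(h(x) for x in X)
print(len(G), len(X), set(phi.values()), lhs == rhs)
```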

Lemma 6.

We have $|\text{SL}_{2}(\mathbb{F}_{q})|=q^{3}-q$ and $|\text{SL}_{2}(\mathbb{Z}/p^{\ell}\mathbb{Z})|=p^{3\ell}-p^{3\ell-2}$ .

Proof.

We are counting solutions to the equation $ad-bc=1$ where $a,b,c,d\in\mathbb{F}_{q}$ . We consider two cases. If $a$ is zero, then $d$ can be anything, and we must have $bc=1$ . This means $b$ can be anything non-zero, and $c$ is determined. So, there are $q^{2}-q$ solutions with $a=0$ . With $a\neq 0$ , $b$ and $c$ can be anything, and $d$ is determined, giving $q^{3}-q^{2}$ solutions in this case. So, there are $(q^{3}-q^{2})+(q^{2}-q)$ total solutions.

Next, we want to count solutions to $ad-bc=1$ with $a,b,c,d\in\mathbb{Z}/p^{\ell}\mathbb{Z}$ . The arguments are essentially the same as in the proof of the finite field case, but slightly more complicated because there are non-zero elements which are still not units. We again consider separately two cases according to whether $a$ is a unit or not. If $a$ is a unit, then $b,c$ can be anything and then $d$ is determined, so there are $(p^{\ell}-p^{\ell-1})p^{2\ell}$ such solutions. If $a$ is not a unit, then $b$ and $c$ must be units, as otherwise $1$ would be divisible by $p$ . So there are $p^{\ell-1}$ choices for $a$ , $p^{\ell}$ choices for $d$ , $p^{\ell}-p^{\ell-1}$ for $b$ , and $c$ is determined. Putting this together, we get the claimed number of solutions. ∎
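The two formulas of Lemma 6 are easy to confirm by direct enumeration for small moduli. The sketch below counts solutions of $ad-bc=1$ in $\mathbb{Z}/n\mathbb{Z}$ with $n=p^{\ell}$; the function name `sl2_order` and the chosen test moduli are illustrative.

```python
from itertools import product

def sl2_order(n):
    """Number of (a, b, c, d) in (Z/nZ)^4 with ad - bc = 1."""
    return sum(1 for a, b, c, d in product(range(n), repeat=4)
               if (a * d - b * c) % n == 1)

for p, l in [(3, 1), (5, 1), (3, 2)]:
    n = p ** l
    # Lemma 6: p^{3l} - p^{3l-2}, which equals p^3 - p when l = 1.
    assert sl2_order(n) == p ** (3 * l) - p ** (3 * l - 2)
    print(n, sl2_order(n))
```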

5 Proof of Theorem 1

We are now ready to prove Theorem 1.

Proof.

First observe that good tuples are equivalent to $\approx q^{3}$ distinct tuples, so there are $\approx q^{2k-1}$ equivalence classes of good tuples. Since the only non-unit in the finite field case is 0, the bad tuples are all in the same equivalence class. So, our goal is to prove $\mathcal{C}_{k+1}(E)\gtrsim q^{2k-1}$ . We first must prove the estimate

$$\sum_{g}f(g)^{2}=\frac{|E|^{4}}{q}+O(q^{2}|E|^{2}).$$

We expand the sum on the left hand side and change variables to obtain

$$\sum_{g}f(g)^{2}=\sum_{x^{1},x^{2},y^{1},y^{2}}E(x^{1})E(x^{2})E(y^{1})E(y^{2})\sum_{g:\,gx=y}1.$$

We first observe we may ignore the pairs $x,y$ which are on a line through the origin. This is because if $x^{2}=tx^{1}$ and $y^{2}=sy^{1}$ , there will exist $g$ with $gx=y$ if and only if $t=s$ , in which case there are $\approx q$ choices for $g$ . So, we have $|E|$ choices for $x^{1}$ and $y^{1}$ , $q$ choices for $t$ , and $\approx q$ choices for $g$ giving an error of $O(q^{2}|E|^{2})$ , as claimed. For all other pairs $x,y$ , the inner sum in $g$ is 1 if $x\sim y$ and 0 otherwise. Therefore, if $\nu(t)=|\{(x,y)\in E\times E:x\cdot y^{\perp}=t\}|$ , we have

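$$\sum_{g}f(g)^{2}=O(|E|^{2}q^{2})+\sum_{t}\ \sum_{\substack{x^{1},x^{2},y^{1},y^{2}\\x^{1}\cdot x^{2\perp}=t\\y^{1}\cdot y^{2\perp}=t}}E(x^{1})E(x^{2})E(y^{1})E(y^{2})=O(|E|^{2}q^{2})+\sum_{t}\nu(t)^{2}.$$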

The proof of Theorem 1.4 in [6] shows that $\nu(t)=\frac{|E|^{2}}{q}+O(|E|q^{1/2})$, so this gives

[TABLE]

which proves the equation above. We now apply Lemma 4 with $F=f$. Lemmas 5 and 6 imply

[TABLE]

and

[TABLE]

Putting this together gives

[TABLE]

and therefore

[TABLE]

with $R=O(q^{2}|E|^{2})$. Finally, we observe that $f$ has maximum $M\leq|E|$. Therefore, Lemma 4 gives

[TABLE]

Together with Lemma 3, this gives

[TABLE]

If the second term on the right is bigger, we get the result for free. If the first term is bigger, we get

[TABLE]

This will be $\gtrsim q^{2k-1}$ when $|E|\gtrsim q^{2-\frac{1}{k+1}}$ , as claimed.

∎

6 Size of $\mathcal{C}_{k+1}((\mathbb{Z}/p^{\ell}\mathbb{Z})^{2})$

Since $|\text{SL}_{2}(R)|\approx|R|^{3}$, we expect each tuple in $(R^{2})^{k+1}$ to be equivalent to $\approx|R|^{3}$ other tuples, and therefore we expect the number of equivalence classes to be $\approx|R|^{2k-1}$. In the finite field case, this was proved as the first step of the proof of Theorem 1, but the proof in the case $R=\mathbb{Z}/p^{\ell}\mathbb{Z}$ is more complicated, so we prove it here, separately from the proof of Theorem 2 in the next section.

Theorem 7.

We have $\mathcal{C}_{k+1}((\mathbb{Z}/p^{\ell}\mathbb{Z})^{2})\approx(p^{\ell})^{2k-1}$ . More precisely, the good $(k+1)$ -point configurations of $(\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}$ determine $\approx(p^{\ell})^{2k-1}$ classes, and the bad configurations determine $o((p^{\ell})^{2k-1})$ classes.

Proof.

We first establish that there are $\approx p^{\ell(2k-1)}$ classes of good tuples. This is easy; if $x$ is a good tuple, we have seen the map $g\mapsto gx$ is injective, so each class has size $\approx p^{3\ell}$ and there are $p^{2\ell(k+1)}$ tuples, meaning there are $p^{2\ell(k+1)-3\ell}$ classes.

It remains to bound the number of bad classes. We first establish the $k=1,2$ cases. When $k=1$, we want to prove there are $o(p^{\ell})$ equivalence classes. This is clear, because in the $k=1$ case we are looking at pairs $(x^{1},x^{2})$ whose class is determined by the scalar $x^{1}\cdot x^{2\perp}$. The classes therefore correspond to the underlying set of scalars in $\mathbb{Z}/p^{\ell}\mathbb{Z}$, and the bad classes correspond to non-units. In the $k=2$ case, we are looking at triples $(x^{1},x^{2},x^{3})$ whose class is determined by the three scalars $(x^{1}\cdot x^{2\perp},x^{2}\cdot x^{3\perp},x^{3}\cdot x^{1\perp})$. So, the space of equivalence classes can be identified with $(\mathbb{Z}/p^{\ell}\mathbb{Z})^{3}$, and the bad classes correspond to triples of non-units.

For $k\geq 3$ , we use the following theorem, which is really just a more specific version of Theorem 5, also found in [5], chapter 11.

Theorem (5’).

For any $2\times 2$ matrix $A$ , there exists a $2\times 2$ matrix $B$ with $AB=BA=(\det(A))I_{2}$ , where $I_{2}$ is the $2\times 2$ identity matrix.

We also make a more specific version of the definition of good and bad tuples. Namely, let $x$ be a $(k+1)$-point configuration in $(\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}$, and let $m\leq\ell$ be maximal with respect to the property that $p^{m}$ divides $x^{i}\cdot x^{j\perp}$ for every pair of indices $(i,j)$. We say that $x$ is $m$-bad. Observe that according to our previous definition, good tuples are $0$-bad and bad tuples are $m$-bad for some $m>0$. Also observe that $m$-badness is preserved by equivalence, so we may define $m$-bad equivalence classes analogously. An easy variant of the argument in Lemma 2 shows that the number of $m$-bad tuples is $\lesssim p^{(2\ell-m)(k+1)+m}$; note that this bound can be rewritten as $p^{\ell(2k-1)+3\ell-km}$. We claim that every $m$-bad equivalence class has at least $p^{3\ell-2m}$ elements. It follows from the claim that there are $\lesssim p^{\ell(2k-1)+(2-k)m}$ $m$-bad classes, and since we may assume $k\geq 3$ the theorem follows from here. To prove the claim, note that the equivalence class containing $x$ also contains $gx$ for any $g\in\text{SL}_{2}(\mathbb{Z}/p^{\ell}\mathbb{Z})$, so for a lower bound on the size of a class we need to determine the size of the image of the map $g\mapsto gx$. First note that we may assume without loss of generality that each coordinate of $x^{1}$ is a unit. This is because given $x$ we can shift any factor of $p$ from $x^{1}$ onto each other vector $x^{i}$ and obtain another representative of the same equivalence class. Next, observe that if $x$ is $m$-bad and $gx=hx$, then by Theorem 5’ we have $p^{m}g=p^{m}h$. It follows that $h=g+p^{\ell-m}A$ for some matrix $A$ with entries between $0$ and $p^{m}$. Using the fact that

$$\det(X+Y)=\det(X)+\det(Y)+\mathcal{B}(X,Y),$$

where $\mathcal{B}$ is bilinear, we conclude that if $h=g+p^{\ell-m}A$ and $\det(g)=\det(h)=1$ , we must have

$$p^{2(\ell-m)}\det(A)+p^{\ell-m}\mathcal{B}(g,A)=0.$$

Let $p^{m^{\prime}}$ be the maximal power of $p$ which divides all entries of $A$. Since the entries of $g$ cannot all be divisible by $p$, it follows that $p^{\ell-m+m^{\prime}}$ is the maximal power of $p$ which divides the second term above. Since $p^{2(\ell-m+m^{\prime})}$ divides the first term, it follows that both terms must be 0 for the equation to hold. In particular, we must have $\mathcal{B}(g,A)=0$. Since at least one entry of $g$ must be a unit, we can solve for one entry of $A$ in terms of the others. Now observe that in order to have $gx=hx$, we must have $p^{\ell-m}Ax=0$. In particular, $p^{\ell-m}Ax^{1}=0$. Since each coordinate of $x^{1}$ is a unit, we may solve for another entry of the matrix $A$. This means there are at most $p^{2m}$ choices for $A$, and hence the map $g\mapsto gx$ is at most $p^{2m}$-to-one. It follows that $m$-bad classes have at least $p^{3\ell-2m}$ elements, as claimed.
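The expansion of $\det(g+p^{\ell-m}A)$ in this argument rests on the identity $\det(X+Y)=\det(X)+\det(Y)+\mathcal{B}(X,Y)$ for $2\times 2$ matrices, where $\mathcal{B}(X,Y)=x_{11}y_{22}+x_{22}y_{11}-x_{12}y_{21}-x_{21}y_{12}$ is bilinear. The Python sketch below checks the identity on small integer matrices and then on one instance of $h=g+p^{\ell-m}A$. The explicit formula for $\mathcal{B}$ and the sample values of $p,\ell,m,g,A$ are assumptions made for illustration.

```python
from itertools import product

def det(m):
    """Determinant of a 2x2 integer matrix."""
    (a, b), (c, d) = m
    return a * d - b * c

def bilinear(x, y):
    """B(X, Y) = x11*y22 + x22*y11 - x12*y21 - x21*y12."""
    (a, b), (c, d) = x
    (e, f), (g, h) = y
    return a * h + d * e - b * g - c * f

def add(x, y):
    """Entrywise sum of two 2x2 matrices."""
    return tuple(tuple(x[i][j] + y[i][j] for j in range(2)) for i in range(2))

def scale(t, x):
    """Scalar multiple t*X of a 2x2 matrix."""
    return tuple(tuple(t * x[i][j] for j in range(2)) for i in range(2))

# All 2x2 matrices with entries in {0, 1, 2}.
small = [((a, b), (c, d)) for a, b, c, d in product(range(3), repeat=4)]
identity_holds = all(
    det(add(X, Y)) == det(X) + det(Y) + bilinear(X, Y)
    for X in small for Y in small
)

# One instance of h = g + p^(l-m) A (sample values are assumptions).
p, l, m = 3, 2, 1
g = ((1, 2), (0, 1))          # det(g) = 1
A = ((1, 0), (2, 1))
s = p ** (l - m)
h = add(g, scale(s, A))
difference = det(h) - det(g)
expanded = s * bilinear(g, A) + s * s * det(A)
print(identity_holds, difference == expanded)
```

Over the integers the two sides agree exactly, so they also agree after reduction modulo $p^{\ell}$.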

∎

7 Proof of Theorem 2

Proof.

In keeping with the rest of this paper, the proof of the $\mathbb{Z}/p^{\ell}\mathbb{Z}$ case is essentially the same as the finite field case, but more complicated casework is required to deal with non-units. By our work in the previous section, our goal is to show $\mathcal{C}_{k+1}(E)\gtrsim(p^{\ell})^{2k-1}$ . Following the line of reasoning in the proof of Theorem 1, we want to establish the estimate

[TABLE]

We have, after a change of variables,

[TABLE]

We first want to throw away terms where $x^{1},y^{1}$ have non-units in their first coordinates. Note that there are $\approx p^{4\ell-2}$ such pairs. For each, there are $|E|$ many choices for $x^{2}$. We claim that there are $\leq p^{\ell}$ choices of $g$ which map $x^{1}$ to $y^{1}$ under this constraint. It follows from this claim that those terms contribute $\lesssim p^{5\ell-2}|E|$ to $(*)$, which is less than the claimed error term. To prove the claim, observe that we are counting solutions to the system of equations

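$$\begin{aligned}ax_{1}^{1}+bx_{2}^{1}&=y_{1}^{1},\\ cx_{1}^{1}+dx_{2}^{1}&=y_{2}^{1},\\ ad-bc&=1\end{aligned}$$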

in $a,b,c,d$ . Since $x_{1}^{1}$ is a unit, we can solve the first two equations for $a$ and $c$ , respectively. Plugging these solutions into the third equation yields

$$y_{1}^{1}d-y_{2}^{1}b=x_{1}^{1}.$$

Since $y_{1}^{1}$ is a unit, for every $b$ there is a unique $d$ satisfying the equation. This proves the claim. Now, we want to remove all remaining terms from $(*)$ corresponding to $x^{1},x^{2}$ where $x^{1}\cdot x^{2\perp}$ is not a unit. To bound this contribution, we observe that for any such pair, we can write $x^{2}=tx^{1}+k$, where $0<t<p$ and $k$ is a vector where both entries are non-units. Therefore, there are $\leq|E|^{2}$ choices for $(x^{1},y^{1})$, there are $\leq p^{2\ell-1}$ choices for $x^{2}$, and there are $\leq p^{\ell}$ choices for $g$ as before. This gives the bound $|E|^{2}p^{3\ell-1}$, which is smaller than the claimed error term. This means, up to the error term, $(*)$ can be written as

[TABLE]

where $\nu(t)=|\{(x,y)\in E\times E:x\cdot y^{\perp}=t\}|$ . This function was studied in [4]; in that paper, it is proved that $\nu(t)=\frac{|E|^{2}}{q}+O(\ell|E|(p^{\ell})^{\frac{1}{2}(2-\frac{1}{\ell})})$ , leading to the claimed estimate for $\sum_{g}f(g)^{2}$ , using the same reasoning as in the proof of Theorem 1. Applying Lemma 4 and Lemma 3 with $A\approx\frac{|E|^{2}}{p^{2\ell}},|S|\approx p^{3\ell},M\leq|E|,R=O(\ell^{2}|E|^{2}(p^{\ell})^{3-\frac{1}{\ell}})$ gives

[TABLE]

If the second term on the right is bigger, we get the result for free. If the first term is bigger, we have

[TABLE]

If $|E|\gtrsim\ell^{\frac{2}{k+1}}p^{\ell s}$ , then this is $\gtrsim p^{\ell s(k+1)-3\ell+1}$ , which is $\gtrsim p^{\ell(2k-1)}$ when $s\geq 2-\frac{1}{\ell(k+1)}$ .
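The threshold on $s$ in the last step comes from comparing exponents of $p$. As a worked check, using only the quantities already named above:

```latex
\begin{align*}
\ell s(k+1) - 3\ell + 1 \;\geq\; \ell(2k-1)
  &\iff \ell s(k+1) \;\geq\; 2\ell k + 2\ell - 1 \\
  &\iff s \;\geq\; \frac{2\ell(k+1) - 1}{\ell(k+1)}
   \;=\; 2 - \frac{1}{\ell(k+1)}.
\end{align*}
```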

∎
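The counting claim used in the proof above, that at most $p^{\ell}$ matrices $g\in\text{SL}_{2}(\mathbb{Z}/p^{\ell}\mathbb{Z})$ map $x^{1}$ to $y^{1}$ when the first coordinates are units, can be sanity-checked by brute force in a small ring. The sketch below does this for $p=3$, $\ell=2$. The sample vectors and the function name are assumptions chosen for illustration.

```python
from itertools import product

def count_maps(x, y, n):
    """Number of g = [[a, b], [c, d]] with det(g) = 1 and g x = y, all mod n."""
    count = 0
    for a, b, c, d in product(range(n), repeat=4):
        if (a * d - b * c) % n != 1:
            continue  # g is not in SL_2(Z/nZ)
        if (a * x[0] + b * x[1]) % n == y[0] and (c * x[0] + d * x[1]) % n == y[1]:
            count += 1
    return count

p, l = 3, 2          # illustrative parameters (an assumption)
n = p ** l
x1 = (1, 3)          # first coordinate is a unit mod 9
y1 = (2, 5)          # first coordinate is a unit mod 9
n_maps = count_maps(x1, y1, n)
print(n_maps, n_maps <= n)
```

For each choice of $b$ the remaining entries are forced, which is why the count for this pair does not exceed $p^{\ell}=9$.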

8 Proof of sharpness

Proof.

We first consider the finite field case. Let $1\leq s<2-\frac{2}{k+1}$ , and let $E$ be a union of $q^{s-1}$ circles of distinct radii. Since each circle has size $\approx q$ , this is a set of size $\approx q^{s}$ . Observe that for any $x\in E^{k+1}$ and any $g$ in the orthogonal group $O_{2}(\mathbb{F}_{q})$ , we have $gx\in E^{k+1}$ . Therefore, every configuration of points in $E$ is equivalent to at least $|O_{2}(\mathbb{F}_{q})|\approx q$ other configurations. This means that

[TABLE]

where in the last step we use the assumed bound on $s$ .
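The two facts driving this example, that circles in $\mathbb{F}_{q}^{2}$ have size $\approx q$ and that a union of circles is preserved by orthogonal maps, can be checked directly for a small prime. The sketch below uses $q=7$ and tests only rotations, a subgroup of $O_{2}(\mathbb{F}_{q})$. The choice of $q$, the radii, and the helper names are assumptions for illustration.

```python
def circle(q, r):
    """Points (x, y) in F_q^2 with x^2 + y^2 = r, for a prime q."""
    return {
        (x, y)
        for x in range(q)
        for y in range(q)
        if (x * x + y * y - r) % q == 0
    }

q = 7  # illustrative prime (an assumption)

# Size of every circle of nonzero radius.
sizes = {r: len(circle(q, r)) for r in range(1, q)}

# A union of two circles of distinct radii.
E = circle(q, 1) | circle(q, 2)

# Each (a, b) with a^2 + b^2 = 1 encodes the rotation [[a, -b], [b, a]].
rotations = circle(q, 1)

def rotate(g, v):
    """Apply the rotation encoded by g = (a, b) to the point v = (x, y)."""
    (a, b), (x, y) = g, v
    return ((a * x - b * y) % q, (b * x + a * y) % q)

invariant = all(rotate(g, v) in E for g in rotations for v in E)
print(sizes, len(E), invariant)
```

Rotations preserve $x^{2}+y^{2}$ because $(ax-by)^{2}+(bx+ay)^{2}=(a^{2}+b^{2})(x^{2}+y^{2})$, so each circle in $E$ is mapped to itself.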

Now, consider the $\mathbb{Z}/p^{\ell}\mathbb{Z}$ case. Let $1\leq s<2-\min\left(\frac{2}{k+1},\frac{1}{\ell}\right)$ . We consider two different examples, according to which of $\frac{2}{k+1}$ or $\frac{1}{\ell}$ is smaller. In the first case, the example that works for finite fields also works here; circles still have size $\approx p^{\ell}$ , so nothing is changed. In the second, let

[TABLE]

Clearly $|E|=p^{2\ell-1}=(p^{\ell})^{2-\frac{1}{\ell}}$, but it is also easy to check that $x\cdot y^{\perp}$ is never a unit for any $x,y\in E$. Therefore, every configuration of points in $E$ is bad, and we have shown that the number of bad classes is $o(p^{\ell(2k-1)})$.
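To illustrate the two properties just stated, the sketch below takes a hypothetical set with first coordinate divisible by $p$, namely $\{x\in(\mathbb{Z}/p^{\ell}\mathbb{Z})^{2}:p\mid x_{1}\}$, and checks both by brute force for $p=3$, $\ell=2$. This particular set, the sign convention $y^{\perp}=(-y_{2},y_{1})$, and the parameters are assumptions for illustration and need not match the set used in the proof.

```python
p, l = 3, 2          # illustrative parameters (an assumption)
n = p ** l

def perp_dot(x, y):
    """x . y^perp mod n, with the assumed convention y^perp = (-y2, y1).

    The opposite sign convention changes the value only by a factor of -1,
    which does not affect whether it is a unit.
    """
    return (-x[0] * y[1] + x[1] * y[0]) % n

def is_unit(t):
    """t is a unit in Z/p^l Z exactly when p does not divide t."""
    return t % p != 0

# Hypothetical example set: first coordinate divisible by p.
E = [(x1, x2) for x1 in range(0, n, p) for x2 in range(n)]

size_matches = len(E) == p ** (2 * l - 1)
never_unit = not any(is_unit(perp_dot(x, y)) for x in E for y in E)
print(len(E), size_matches, never_unit)
```

Every term of $x\cdot y^{\perp}$ contains a first coordinate of $x$ or $y$, so the whole expression is divisible by $p$ for this choice of set.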

∎

Bibliography

[1] P. Birklbauer, A. Iosevich, T. Pham, Distances from points to planes, Acta Arith. 186 (2018), no. 3, 219–224.
[2] M. Bennett, D. Hart, A. Iosevich, J. Pakianathan, M. Rudnev, Group actions and geometric combinatorics in $\mathbb{F}_{q}^{d}$, Forum Math. 29 (2017), no. 1, 91–110.
[3] Jeremy Chapman, M. Burak Erdoğan, Derrick Hart, Alex Iosevich, Doowon Koh, Pinned distance sets, $k$-simplices, Wolff’s exponent in finite fields and sum product estimates, Math. Z. 271 (2012), no. 1-2, 63–93.
[4] David Covert, Alex Iosevich, and Jonathan Pakianathan, Geometric configurations in the ring of integers modulo $p^{\ell}$, Indiana Univ. Math. J. 61 (2012), no. 5, 1949–1969.
[5] David S. Dummit and Richard M. Foote, Abstract Algebra, third edition, John Wiley and Sons, Inc., 2004.
[6] D. Hart and A. Iosevich, Sums and products in finite fields: an integral geometric viewpoint, Radon transforms, geometry, and wavelets, 129–135, Contemp. Math., 464, Amer. Math. Soc., Providence, RI, 2008.
[7] D. Hart and A. Iosevich, Ubiquity of simplices in subsets of vector spaces over finite fields, Anal. Math. 34 (2008), no. 1, 29–38.
[8] Derrick Hart, Alex Iosevich, Doowon Koh, Misha Rudnev, Averages over hyperplanes, sum product theory in vector spaces over finite fields, and the Erdős-Falconer distance conjecture, Trans. Amer. Math. Soc. 363 (2011), 3255–3275.
