Areas of triangles and SL_2 actions in finite rings
Alex McDonald

TL;DR
This paper extends the concept of triangle area to points over finite rings and demonstrates that large point sets in these rings produce many distinct area-based configurations, invariant under SL_2 actions.
Contribution
It introduces a generalized area formula over finite rings and proves that large subsets generate a positive proportion of all possible triangle configurations.
Findings
Large subsets in finite rings produce many distinct triangle types.
The area formula is invariant under SL_2(R) actions.
Results apply to both finite fields and modular rings.
Abstract
In Euclidean space, one can use the dot product to give a formula for the area of a triangle in terms of the coordinates of each vertex. Since this formula involves only addition, subtraction, and multiplication, it can be used as a definition of area in , where is an arbitrary ring. The result is a quantity associated with triples of points which is still invariant under the action of . One can then look at a configuration of points in in terms of the triangles determined by pairs of points and the origin, considering two such configurations to be of the same type if corresponding pairs of points determine the same areas. In this paper we consider the cases and , and prove that sufficiently large subsets of must produce a positive proportion of all such types of configurations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinite Group Theory Research ยท Limits and Structures in Graph Theory ยท Analytic Number Theory Research
Areas of triangles and actions in finite rings
Alex McDonald
Abstract
In Euclidean space, one can use the dot product to give a formula for the area of a triangle in terms of the coordinates of each vertex. Since this formula involves only addition, subtraction, and multiplication, it can be used as a definition of area in , where is an arbitrary ring. The result is a quantity associated with triples of points which is still invariant under the action of . One can then look at a configuration of points in in terms of the triangles determined by pairs of points and the origin, considering two such configurations to be of the same type if corresponding pairs of points determine the same areas. In this paper we consider the cases and , and prove that sufficiently large subsets of must produce a positive proportion of all such types of configurations.
1 Introduction
There are several interesting combinatorial problems asking whether a sufficiently large subset of a vector space over a finite field must generate many different objects of some type. The most well known example is the Erdos-Falconer problem, which asks whether such a set must contain all possible distances, or at least a positive proportion of distances. More precisely, given we define the distance set
[TABLE]
Obviously, . The Erdos-Falconer problem asks for an exponent such that , or more generally , whenever (Throughout, the notation means there is a constant such that , means and , and denotes a quantity that is ). In [9], Iosevich and Rudnev proved that if In [8] it is proved by Hart, Iosevich, Koh, and Rudnev that the exponent cannot be improved in odd dimensions, althought it has been improved to in the case (first in [3] in the case by Chapman, Erdogan, Hart, Iosevich, and Koh, then in general in [2] by Bennett, Hart, Iosevich, Pakianathan, and Rudnev). Several interesting variants of the distance problem have been studied as well. A result of Pham, Phuong, Sang, Valculescu, and Vinh studies the problem when distances between pairs of points are replaced with distances between points and lines in ; they prove that if sets and of points and lines, respectively, satisfy , then they determine a positive proportion of all distances [12]. Birklbauer, Iosevich, and Pham proved an analogous result about distances determined by points and hyperplanes in [1].
We can replace distances with dot products and ask the analogous question. Let
[TABLE]
and again ask for an exponent such that implies contains all distances (or at least a positive proportion of distances). Hart and Iosevich prove in [6] that the exponent works for this question as well. The proof is quite similar to the proof of the same exponent in the Erdos-Falconer problem; in each case, the authors consider a function which counts, for each , the number of representations of as, respectively, a distance and a dot product determined by the set . These representation functions are then studied using techniques from Fourier analysis.
Another interesting variant of this problem was studied in [10], where Lund, Pham, and Vinh defined the angle between two vectors in analogue with the usual geometric interpretation of the dot product. Namely, given vectors and , they consider the quantity
[TABLE]
where is the finite field distance defined above. Note that since we cannot always take square roots in finite fields, the finite field distance corresponds to the square of the Euclidean distance; therefore, above is the correct finite field analogue of , where is the angle between the vectors and . This creates a variant of the dot product problem, since one can obtain different dot products from the same angle by varying length. The authors go on to prove that the exponent guarantees a positive proportion of angles.
It is of interest to generalize these types of results to point configurations. By a -point configuration in , we simply mean an element of . Throughout, we will use superscripts to denote different vectors in a given configuration, and subscripts to denote the coordinates of each vector. For example, a point configuration is made up of vectors , each of which has coordinates . Given a set , we can consider -point configurations in (i.e., elements of ) and ask whether must contain a positive proportion of all configurations, up to some notion of equivalence. For example, we may view -point configurations as simplices, and our notion of equivalence is geometric congruence; any two simplices are congruent if there is a translation and a rotation that maps one onto the other. Since a -simplex is simply a pair of points, and two such simplices are congruent if and only if the distance is the same, congruence classes simply correspond to distance. Hence, the Erdos-Falconer distance problem may be viewed as simply the case of the simplex congruence problem. In [7], Hart and Iosevich prove that contains the vertices of a congruent copy of every non-degenerate simplex (non-degenerate here means the points are in general position) whenever . However, in order for this result to be non-trivial the exponent must be , and that only happens when . So, the result is limited to fairly small configurations. This result is improved in [2] by Bennett, Hart, Iosevich, Pakianathan, and Rudnev, who prove that for any a set determines a positive proportion of all congruence classes of -point configurations provided . This exponent is clearly non-trivial for all . In [11], I extended this result to the case .
In this paper, we consider a different notion of equivalence. We will consider the problem over both finite fields and rings of integers modulo powers of primes, so I will define the equivalence relation in an arbitrary ring.
Definition 1**.**
Let be a ring, and let . We define an equivalence relation on by (or more breifly ) if and only if for each pair we have . Define to be the set of equivalence classes of under this relation.
In the Euclidean setting, is the area of the triangle with vertices . So, we may view each pair of points in a -point configuration as determining a triangle with the origin, and we consider two such configurations to be equivalent if the triangles they determine all have the same areas. As we will prove in section , this equivalence relation is closely related to the action of on tuples of points; except for some degenerate cases, two configurations are equivalent if and only if there is a unique mapping one to the other. This allows us to analyze the problem in terms of this action; in section 2, we define a counting function and reduce matters to estimating the sum . In section 3, we show how to turn an estimate for into an estimate for . Since we already understand the case (it is essentially the same as the dot product problem discussed above), this reduction allows us to obtain a non-trivial result. Our first theorem is as follows.
Theorem 1**.**
Let be a power of an odd prime, and let satisfy , where . Then .
In addition to proving this theorem, we will consider the case where the finite field is replaced by the ring . The structure of the proof is largely the same; the dot product problem over such rings is studied in [4], giving us the case, and the machinery which lifts that case to arbitrary works the same way. However, many details in the proofs are considerably more complicated. The theorem is as follows.
Theorem 2**.**
Let be an odd prime, let , and let satisfy , where . Then
We first note that, as we would expect, Theorem 2 coincides with Theorem 1 in the case . We also note that, for fixed and , the exponent in Theorem 2 is always less than 2, but it tends to as . This does not happen in the finite field case, where the exponent depends on but not on the size of the field.
Finally, we want to state the extent to which these results are sharp. There are examples which show that the exponent must tend to as in the finite field case, and as either or in the case.
Theorem 3** (Sharpness).**
We have the following:
- i
For any , there exists such that and . 2. ii
For any , there exists such that and .
2 Characterization of the equivalence relation in terms of the action
Our main tool in reducing the problem of -point configurations to the case is the fact that we can express the equivalence relation in terms of the action of the special linear group; with some exceptions, tuples and are equivalent if and only if there exists a unique such that for each , we have . In order to use this, we need to bound the number of exceptions to this rule. This is easy in the finite field case, and a little more tricky in the case. The goal of this section is to describe and and bound the number of exceptional configurations in each case. We begin with a definition.
Definition 2**.**
Let be a ring. A configuration is called good if there exist two indices such that is a unit. A configuration is bad if it is not good.
As we will see, the good configurations are precisely those for which equivalence is determined by the action of . To prove this, we will need the following theorems about determinants of matrices over rings, which can be found in [5], section 11.4.
Theorem 4**.**
Let be a ring, let be the columns of an matrix with entries in . Fix an index , and let be the matrix obtained from by replacing column by , for some . Then .
Theorem 5**.**
Let be a ring, and let be an matrix with entries in . The matrix is invertible if and only if is a unit in .
Theorem 6**.**
Let be a ring, and let and be matrices with entries in . Then .
We are now ready to prove that equivalence of good configurations is given by the action of the special linear group.
Lemma 1**.**
Let be a ring, and let be good configurations such that for every pair of indices . Then there exists a unique such that for each .
Proof.
Because and are good, there exist indices and such that is a unit; equivalently, the determinant of the matrix with columns and is a unit. Denote this matrix by . By theorem 5, this matrix is invertible. Let
[TABLE]
Since , it follows that and . Also note that by Theorem 6, we have . Let be any other index. We want to write ; this amounts to solving the matrix equation
[TABLE]
Since we have already established the matrix is invertible, we can solve for and . Similarly, let . By Theorem 4, we have and . It follows that , and an analogous argument yields . Therefore,
[TABLE]
So, we have established existance. To prove uniqueness, note that must satisfy , and since is invertible we can solve for . โ
Now that we know that good tuples allow us to use the machinery we need, we must prove that the bad tuples are negligible.
Lemma 2**.**
Let be a ring and let . We have the following:
- i
If , then contains bad tuples. In particular, if for any constant , the number of bad tuples in is . 2. ii
If , the number of bad tuples in is . In particular, if for any constant , then the number of bad tuples in is .
Proof.
We first prove the first claim. Since the only non-unit of is 0, a bad tuple must consist of points which all lie on a line through the origin. Therefore, we may choose to be anything in , after which the next points must be chosen from the points on the line through the origin and .
To prove the second claim, first observe that the number of tuples where at least one coordinate is a non-unit is , which is less then the claimed bound. So, it suffices to bound the set of bad tuples where all coordinates are units. Let be this set. Define
[TABLE]
If , then is a non-unit, meaning it is divisible by , and
[TABLE]
Therefore, maps bad tuples to tuples with , or . Rearranging, using the fact that the second coordinate of each is a unit, we conclude that is a constant independent of which is divisible by . In other words, each is on a common line through the origin and a point where . There are such lines, and once we fix a line there are choices of tuples . Therefore, . Finally, we observe that the map is -to-1. This gives us the claimed bound on . โ
Lemma 3**.**
Let be either or . Let , and let be the set of good tuples. Suppose if and if . For , define . Then
[TABLE]
Proof.
By Cauchy-Schwarz, we have
[TABLE]
By assumption and Lemma 2, , and therefore the left hand side above is . Since the right hand side above is . It remains to prove . By lemma 1,
[TABLE]
By extending the sum over to one over all of , we bound the above sum by
[TABLE]
โ
3 Lifting estimates to estimates
In both the case and , results are known for pairs of points, which is essentially the case. The finite field version was studied in [6], and the ring of integers modulo was studied in [4]. In section 2, we defined a function on and related the number of equivalence classes determined by a set to the sum . Since results are known for the case, we have information about the sum . We wish to turn that into a bound for . This is achieved with the following lemma.
Lemma 4**.**
Let be a finite set, and let . Let
[TABLE]
denote the average value of , and
[TABLE]
denote the maximum. Finally, suppose
[TABLE]
Then there exist constants , depending only on , such that
[TABLE]
Proof.
We proceed by induction. For the base case, let and observe that the claimed bound is the one we assumed for . Now, let be any sequence such that holds for all ; for example, works. Now, suppose the claimed bound holds for all , and also observe that the bound is trivial for . By direct computation, we have
[TABLE]
We also have
[TABLE]
To bound the first term, we simply use the trivial bound. Since for all , , and , we conclude for each . Therefore,
[TABLE]
To bound the second term, we use the inductive hypothesis and the triangle inequality. We have
[TABLE]
Since , it follows that for any , so the claimed bound holds. โ
4 Some lemmas about the action of
Lemma 5**.**
Let be a finite group acting transitively on a finite set . Define by . We have
[TABLE]
for every pair . If and , then
[TABLE]
Proof.
The second statement follows from the first by a simple change of variables. To prove the first, we have
[TABLE]
On the right, for any fixed , one can choose any and there is a unique corresponding , so the inner sum is and the right hand side is therefore . On the other hand, is constant. To prove this, let and let such that and . This means for any with , we have , so . By symmetry, equality holds. If is the constant value of , the left hand side above must be , and therefore as claimed. โ
Lemma 6**.**
We have and .
Proof.
We are counting solutions to the equation where . We consider two cases. If is zero, then can be anything, and we must have . This means can be anything non-zero, and is determined. So, there are solutions with . With , and can be anything, and is determined, giving solutions in this case. So, there are total solutions.
Next, we want to count solutions to with . The arguments are essentially the same as in the proof of the finite field case, but slightly more complicated because there are non-zero elements which are still not units. We again consider separately two cases according to whether is a unit or not. If is a unit, then can be anything and then is determined, so there are such solutions. If is not a unit, then and must be units, as otherwise would be divisible by . So there are choices for , choices for , for , and is determined. Putting this together, we get the claimed number of solutions. โ
5 Proof of Theorem 1
We are now ready to prove theorem 1.
Proof.
First observe that good tuples are equivalent to distinct tuples, so there are equivalence classes of good tuples. Since the only non-unit in the finite field case is 0, the bad tuples are all in the same equivalence class. So, our goal is to prove . We first must prove the estimate
[TABLE]
We expand the sum on the left hand side and change variables to obtain
[TABLE]
We first observe we may ignore the pairs which are on a line through the origin. This is because if and , there will exist with if and only if , in which case there are choices for . So, we have choices for and , choices for , and choices for giving an error of , as claimed. For all other pairs , the inner sum in is 1 if and 0 otherwise. Therefore, if , we have
[TABLE]
The proof of theorem 1.4 in [6] shows that , so this gives
[TABLE]
which proves the equation above. We now apply lemma 4 with . Lemmas 6 and 5 imply
[TABLE]
and
[TABLE]
Putting this together gives
[TABLE]
and therefore
[TABLE]
with . Finally, we observe that has maximum . Therefore, lemma 4 gives
[TABLE]
Together with lemma 3, this gives
[TABLE]
If the second term on the right is bigger, we get the result for free. If the first term is bigger, we get
[TABLE]
This will be when , as claimed.
โ
6 Size of
Since , we expect each tuple in to be equivalent to other tuples, and therefore we expect the number of congruence classes to be . In the finite field case, this was proved as the first step of the proof of Theorem 1, but the proof in the is more complicated so we will prove it here, separately from the proof of Theorem 2 in the next section.
Theorem 7**.**
We have . More precisely, the good -point configurations of determine classes, and the bad configurations determine classes.
Proof.
We first establish that there are classes of good tuples. This is easy; if is a good tuple, we have seen the map is injective, so each class has size and there are tuples, meaning there are classes.
It remains to bound the number of bad classes. We first establish the cases. When , we want to prove there are equivalence classes. This is clear, because in the case we are looking at pairs whose class is determined by the scalar . The classes therefore correspond to the underlying set of scalars in , and the bad classes correspond to non-units. In the case, we are looking at triples whose class is determined by the three scalars . So, the space of equivalence classes can be identified with , and the bad classes correspond to triples of non-units.
For , we use the following theorem, which is really just a more specific version of Theorem 5, also found in [5], chapter 11.
Theorem** (5โ).**
For any matrix , there exists a matrix with , where is the identity matrix.
We also make a more specific version of the definition of good and bad tuples. Namely, let be a point configuration in , and let be minimal with respect to the property that divides for every pair of indices . We say that is -bad. Observe that according to our previous definition, good tuples are [math]-bad and bad tuples are -bad for some . Also observe that -badness is preserved by equivalence, so we may define -bad equivalence classes analogously. An easy variant of the argument in Lemma 2 shows that the number of -bad tuples is ; note that this bound can be rewritten as . We claim that every -bad equivalence class has at least elements. It follows from the claim that there are -bad classes, and since we may assume the theorem follows from here. To prove the claim, note that the equivalence class containing also contains for any , so for a lower bound on the size of a class we need to determine the size of the image of the map . First note that we may assume without loss of generality that each coordinate of is a unit. This is because given we can shift any factor of from onto each other vector and obtain another representative of the same equivalence class. Next, observe that if is -bad and , then by Theorem 5โ we have . It follows that for some matrix with entries between [math] and . Using the fact that
[TABLE]
where is bilinear, we conclude that if and , we must have
[TABLE]
Let be the minimal power of which divides all entries of . Since the entries of cannot all be divisible by , it follows that is the maximal power of which divides the second term above. Since divides the first term, it follows that both terms must be 0 for the equation to hold. In particular, we must have . Since at least one entry of must be a unit, we can solve for one entry of in terms of the others. Now observe that in order to have , we must have . In particular, . Since each coordinate of is a unit, we may solve for another entry of the matrix . This means there are at most choices for , and hence the map is at most -to-one. It follows that -bad classes have at least elements, as claimed.
โ
7 Proof of Theorem 2
Proof.
In keeping with the rest of this paper, the proof of the case is essentially the same as the finite field case, but more complicated casework is required to deal with non-units. By our work in the previous section, our goal is to show . Following the line of reasoning in the proof of Theorem 1, we want to establish the estimate
[TABLE]
We have, after a change of variables,
[TABLE]
We first want to throw away terms where have non-units in their first coordinates. Note that there are such pairs. For each, there are many choices for . We claim that there are choices of which map to under this constraint. It follows from this claim that those terms contribute to (), which is less then the claimed error term. To prove the claim, observe that we are counting solutions to the system of equations
[TABLE]
in . Since is a unit, we can solve the first two equations for and , respectively. Plugging these solutions into the third equation yields
[TABLE]
Since is a unit, for every there is a unique satisfying the equation. This proves the claim. Now, we want to remove all remaining terms from () corresponding to where is not a unit. To bound this contribution, we observe that for any such pair, we can write , where and is a vector where both entries are non-units. Therefore, there are choices for , there are choices for , and there are choices for as before. This gives the bound , smaller than the claimed error term. This means, up to the error term, () can be written as
[TABLE]
where . This function was studied in [4]; in that paper, it is proved that , leading to the claimed estimate for , using the same reasoning as in the proof of Theorem 1. Applying Lemma 4 and Lemma 3 with gives
[TABLE]
If the second term on the right is bigger, we get the result for free. If the first term is bigger, we have
[TABLE]
If , then this is , which is when .
โ
8 Proof of sharpness
Proof.
We first consider the finite field case. Let , and let be a union of circles of distinct radii. Since each circle has size , this is a set of size . Observe that for any and any in the orthogonal group , we have . Therefore, every configuration of points in is equivalent to at least other configurations. This means that
[TABLE]
where in the last step we use the assumed bound on .
Now, consider the case. Let . We consider two different examples, according to which of or is smaller. In the first case, the example that works for finite fields also works here; circles still have size , so nothing is changed. In the second, let
[TABLE]
Clearly , but it is also easy to check that is never a unit for any . Therefore, every configuration of points in is bad, and we have shown that this is .
โ
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Birklbauer, A. Iosevich, T. Pham, Distances from points to planes , Acta Arith. 186 (2018), no. 3, 219โ224.
- 2[2] M. Bennett, D. Hart, A. Iosevich, J. Pakianathan, M. Rudnev, Group actions and geometric combinatorics in ๐ฝ q d superscript subscript ๐ฝ ๐ ๐ \mathbb{F}_{q}^{d} , Forum Math,. 29(1):91-110, 2017
- 3[3] Jeremy Chapman, M. Barak Erdogan, Derrick Hart, Alex Iosevich, Doowon Koh, Pinned distance cets, k ๐ k -simplices, Wolffโs exponent in finite fields and sum product estimates , Mathematische Zeitschrift, Math. Z. 271 (2012), no. 1-2, 63-93
- 4[4] David Covert, Alex Iosevich, and Jonathan Pakianathan, Geometric configurations in the ring of integers modulo p โ superscript ๐ โ p^{\ell} , Indiana Univ. Math. J. 61 (2012), no. 5, 1949โ1969.
- 5[5] David S. Dummitt and Richard M. Foote, Abstract Algebra, third edition , John Wiley and Sons, Inc., 2004
- 6[6] D. Hart and A. Iosevich, Sums and products in finite fields: an integral geometric viewpoint , Radon transforms, geometry, and wavelets, 129โ135, Contemp. Math., 464, Amer. Math. Soc., Providence, RI, 2008.
- 7[7] D. Hart and A. Iosevich, Ubiquity of simplices in subsets of vector spaces over finite fields , Anal. Math. 34 (2008), no. 1, 29-38
- 8[8] Derrick Hart, Alex Iosevich, Doowon Koh, Misha Rudnev, Averages over hyperplanes, sum product theory in vector spaces over finite fields, and the Erdos-Falconer distance conjecture , Transactions of the AMS, 363 (2011) 3255-3275
