This paper establishes a polynomial bound relating the partition rank to the analytic rank of multilinear forms over finite fields, confirming a conjecture and advancing understanding of their structural relationship.
Contribution
The paper proves a polynomial upper bound of the partition rank in terms of the analytic rank, improving previous exponential bounds and confirming a conjecture by Kazhdan and Ziegler.
Findings
01
Partition rank is polynomially bounded by analytic rank.
02
Confirmed a conjecture of Kazhdan and Ziegler.
03
Independent proof by Janzer corroborates results.
Abstract
Let G1,…,Gk be vector spaces over a finite field F=Fq with a non-trivial additive character χ. The analytic rank of a multilinear form α:G1×⋯×Gk→F is defined as arank(α)=−logqEx1∈G1,…,xk∈Gkχ(α(x1,…,xk)). The partition rank prank(α) of α is the smallest number of maps of partition rank 1 that add up to α, where a map is of partition rank 1 if it can be written as a product of two multilinear forms, depending on different coordinates. It is easy to see that arank(α)≤O(prank(α)) and it has been known that prank(α) can be bounded from above in terms of arank(α). In this paper, we improve the…
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
**Polynomial bound for partition rank in terms of analytic rank
**
Luka Milićević†
00footnotetext: † Mathematical Institute of the Serbian Academy of Sciences and Arts, Belgrade, Serbia.
Let G1,…,Gk be vector spaces over a finite field F=Fq with a non-trivial additive character χ. The analytic rank of a multilinear form α:G1×⋯×Gk→F is defined as \operatorname{arank}(\alpha)=-\log_{q}\mathop{\mathbb{E}}_{x_{1}\in G_{1},\dots,x_{k}\in G_{k}}\chi\big{(}\alpha(x_{1},\dots,x_{k})\big{)}. The partition rank prank(α) of α is the smallest number of maps of partition rank 1 that add up to α, where a map is of partition rank 1 if it can be written as a product of two multilinear forms, depending on different coordinates. It is easy to see that \operatorname{arank}(\alpha)\leq O\Big{(}\operatorname{prank}(\alpha)\Big{)} and it has been known that prank(α) can be bounded from above in terms of arank(α). In this paper, we improve the latter bound to polynomial, i.e. we show that there are quantities C,D depending on k only such that prank(α)≤C(arank(α)D+1). As a consequence, we prove a conjecture of Kazhdan and Ziegler.
aaaThe same result was obtained independently and simultaneously by Janzer.
**§1 Introduction
**
Throughout the paper G1,…,Gk are vector spaces over a finite field F and χ is a non-trivial additive character of F. In order to measure how much the distribution of values of a multilinear form α:G1×⋯×Gk→F deviates from the uniform distribution Gowers and Wolf [3] introduced the notion of analytic rank.
Definition 1** (Analytic rank).**
Analytic rank of α is defined as
[TABLE]
Note that this definition does not depend on the choice of χ and that for the case of two variables coincides with the usual algebraic rank of the corresponding matrix. Another way to generalize the notion of algebraic rank to multilinear forms is the partition rank, introduced by Naslund [12], which is defined as follows.
Definition 2** (Partition rank).**
Let α:G1×⋯×Gk→F be a multilinear form. We say that α is of partition rank 1 if there is a set ∅=I⊂[k] and multlinear forms β:∏i∈IGi→F and γ:∏i∈[k]∖IGi→F such that
[TABLE]
In general, partition rankprank(α) of α is the smallest integer r such that α is a sum of r multilinear forms of partition rank 1. We also set prank(0)=0.
It is easy to see that \operatorname{arank}(\alpha)\leq O\Big{(}\operatorname{prank}(\alpha)\Big{)} and Lovett [11] proved arank(α)≤prank(α). On the other hand, it turns out that we can bound partition rank in terms of analytic rank. This was first proved by Bhowmick and Lovett [1], and they showed that prank(α)≤f(arank(α),k,∣F∣), where f has an Ackerman-type dependence on its parameters. This bound was recently improved significantly by Janzer [7] to tower of exponentials of height depending on k. In this paper we prove
Theorem 3**.**
For given k, there are constants C,D such that prank(α)≤C(arank(α)D+1).
We refer to this result as the strong inverse theorem for multilinear forms of low analytic rank or simply as strong inverse theorem.111We emphasize the word strong to make distiction from a weak version of the inverse theorem which we also prove in this paper. The constants C,D can be taken to be C=2k2O(k2),D=22O(k2). Note also that, since every non-zero form α:G1×⋯×Gk→F takes at least \Big{(}\frac{|\mathbb{F}|-1}{|\mathbb{F}|}\Big{)}^{k}|G_{1}|\cdots|G_{k}| non-zero values, there is a constant a0, depending on k and F, such that for all non-zero forms α, arank(α)≥a0. Hence, we may rewrite the bound in Theorem 3 as
[TABLE]
but the new constant C′ would need to depend on F as well.
Theorem 3 was also proved by Janzer [8] independently and simultaneously, using a different argument.
Before the study of ranks of multilinear forms, an important topic of study has been the distribution of multivariate polynomials over finite fields. In this direction, we have the following result proved by Green and Tao [4] (with the restriction on the degree of the polynomial to be bounded by the size of the field), and Kaufman and Lovett [9] (without the restriction on the degree), and was applied by Bhowmick and Lovett [1] to coding theory and effective algebraic geometry.
Theorem 4** (Green and Tao [4]; Kaufman and Lovett [9]).**
Let F be a field of prime order. Suppose that f:Fn→F is a polynomial of degree d (d<∣F∣ in the result of Green and Tao). If \Big{|}\mathop{\mathbb{E}}_{x_{1},\dots,x_{n}\in\mathbb{F}}\chi\Big{(}f(x_{1},\dots,x_{n})\Big{)}\Big{|}\geq c, then there is r≤Op,d,c−1(1), polynomials f1,…,fr of degree d−1, and a map g:Fr→F such that
[TABLE]
As observed by Green and Tao in [4] and by Janzer in [7], it is easy to deduce this result from the strong inverse theorem, at least when d<charF. We include a very short sketch to confirm that we may take polynomial bounds in Theorem 4 as well. This proves a conjecture of Kazhdan and Zielger (Conjecture 3.5 in [10]).
Consider a symmetric multilinear form α:(Fn)d→F such that f(x)=α(x,x,…,x). Since d<charF, d! is invertible in F, and such a form exists. Notice that for all x,y1,…,yd−1∈Fn
[TABLE]
for some maps g,h1,…,hd−1. Observe also that
[TABLE]
Applying this d−1 times in total, we get
[TABLE]
Hence, by Cauchy-Schwarz inequality for the box norm (Lemma 15), we get
[TABLE]
Apply the strong inverse theorem to find r\leq C\Big{(}2^{2d-2}\log_{|\mathbb{F}|}(|\mathbb{F}|c^{-1})\Big{)}^{D}, sets ∅=Ii⊊[d] and multilinear forms βi:∏j∈IiGj→F,γi:∏j∈[d]∖IiGj→F for i∈[r] such that
[TABLE]
Then f(x)=α(x,…,x)=∑i∈[r]βi(x,…,x)γi(x,…,x), as desired.∎
The proof of the strong inverse theorem given here is entirely self-contained and in particular does not use other results of additive combinatorics such as Freiman’s theorem, and, as it is clear from the bounds we obtain here, we do not apply a regularity lemma. In fact, it is probable that the results of this paper may replace the use of regularity lemmas in similarly algebraic settings. In the next subsection, we list main results of this paper, discuss the proofs and the ideas in more detail.
**1.1. Main results and overview of the argument
**
Notation. In the rest of the paper, we use the following convention to save writing in situations where we have many indices appearing in predictable patterns. Instead of whole sequence x1,…,xm, we write x[m], and we write xI for I⊂[m] to be the subsequence with indices in I. This applies to products as well: G[k] stands for ∏i∈[k]Gi and GI=∏i∈IGi. For example, instead of writing α:∏i∈IGi→F and α(xi:i∈I), we write α:GI→F and α(xI).
Given F-vector spaces G1,…,Gk,H, a map α:G[k]→H is said to be multilinear if it is linear in each coordinate, that is, whenever x[i−1]∈G[i−1],x[i+1,k]∈G[i+1,k] and y,z∈Gi, then
[TABLE]
Similarly, α is multiaffine if it is affine in each coordinate, i.e. whenever x[i−1]∈G[i−1],x[i+1,k]∈G[i+1,k] and y,z,w∈Gi, then
[TABLE]
Also, we refer to the zero set of a multiaffine map α:G[k]→H, where H is a vector space over F, as variety, and the codimension of a variety is dimH. Another convention we adopt is that we write Ex, without specifying the set from which x is taken, when this causes no confusion. Frequently we shall consider ‘slices’ of sets S⊂G[k], by which we mean sets SxI={y[k]∖I∈G[k]∖I:(xI,y[k]∖I)∈S}, for I⊂[k],xI∈GI. Occasionally, we might have a single element z∈Gi instead of xI, and in this case we write Si:z for the resulting slice, since the direction i is not clear from the notation z, unlike in the case of xI. Finally, for each vector space Gi, fix a dot product ⋅. We need this for the characterization of linear forms on Gi – each linear form ϕ:Gi→F takes form ϕ(x)=x⋅u for an element u∈Gi.
Results and outline. Our first main result is the weak version of the (strong) inverse theorem.
Theorem 5** (Weak inverse theorem for maps of low analytic rank - Weak(k)).**
For given k, there are constants C=Ckweak,D=Dkweak>0 with the following property. Suppose that α:G[k]→F is a multilinear form such that \mathop{\mathbb{E}}_{x_{[k]}}\chi\Big{(}\alpha(x_{[k]})\Big{)}\geq c, for some c>0. Then, there is r≤Clog∣F∣D(∣F∣c−1) and there are multilinear maps βi:GIi→F, i∈[r], where ∅=Ii⊂[k−1] such that
[TABLE]
Note that there is a multilinear map A:G[k−1]→Gk such that for each x[k]∈G[k], α(x[k])=A(x[k−1])⋅xk. Then \Big{|}\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon A(x_{[k-1]})=0\Big{\}}\Big{|}=\Big{(}\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha(x_{[k]}))\Big{)}|G_{[k-1]}|. Thus, another way to phrase the weak inverse theorem is to say that every dense variety contains a low-codimensional variety. On the other hand, it is very easy to see that low-codimensional varieties are necessarily dense, see Lemma 11.
Next, we have the strong inverse theorem.
Theorem 6** (Strong inverse theorem for maps of low analytic rank - Strong(k)).**
For given k, there are constants C=Ckstrong,D=Dkstrong>0 with the following property. Suppose that α:G[k]→F is a multilinear form such that Ex[k]χ(α(x[k]))≥c, for some c>0. Then, there is r≤Clog∣F∣D(∣F∣c−1) and there are multilinear maps βi:GIi→F and γi:G[k]∖Ii→F, i∈[r], where ∅=Ii⊂[k−1] such that
[TABLE]
We remark that the bounds on the constants claimed in the introduction, namely Ckstrong=2k2O(k2) and Dkstrong=22O(k2), follow from inequalities (9) and (4) which appear later in the paper. In fact, bounds of the same form hold in all results stated in this subsection.
Let S⊂G[k] and let α:G[k]→H be a multiaffine map. A layer of α is any set of the form {x[k]∈G[k]:α(x[k])=λ}, for λ∈H. We say that layers of α *internally ϵ-approximate *S, if there are layers L1,…,Lm of α such that S⊃Li and \Big{|}S\setminus\Big{(}\cup_{i\in[m]}L_{i}\Big{)}\Big{|}\leq\epsilon|G_{[k]}|. Similarly, we say that layers of α *externally ϵ-approximate *S, if there are layers L1,…,Lm of α such that S⊂∪i∈[m]Li and \Big{|}\Big{(}\cup_{i\in[m]}L_{i}\Big{)}\setminus S\Big{|}\leq\epsilon|G_{[k]}|.
The next two results say that we may approximate internally and externally certain sets by low-codimensional varieties. In the first case, the sets we have in mind are dense varieties, and in the second case these are the sets of dense columns of a variety.
Theorem 7** (Simultaneous inner approximation of varieties - Inner(k)).**
For given k, there are constants C=Ckinner,D=Dkinner>0 with the following property. Let ϵ>0 and let B1,…,Br:G[k]→H be multiaffine maps. For each λ∈Fr, let Zλ={x[k]∈G[k]:∑i∈[r]λiBi(x[k])=0}. Then there is s\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}, a multiaffine map β:G[k]→Fs such that for each λ∈Fr , layers of β internally ϵ-approximate Zλ.
Theorem 8** (Structure of a set of dense columns of a variety - Columns(k)).**
For given k, there are constants C=Ckcolumns,D=Dkcolumns>0 with the following property. Let α:G[k]→Fr be a multiaffine map. Let S⊂Fr and ϵ>0. Define the set of ϵ-dense columns as
[TABLE]
Then, there is s\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}, a multiaffine map β:G[k−1]→Fs such that layers of βϵ-internally and ϵ-externally approximate X.
Finally, we prove a strong approximation result for the convolutions of the indicator function of a low-codimensional variety. We call it an almost L∞ approximation theorem which sounds slightly oxymoronical, but is appropriate since we actually prove that the convolution can be approximated by a finite exponential sum on very structured set (a union of layers of a low-codimensional variety) of density 1−o(1). For this theorem, we need one more piece of notation. For a map f:G[k]→C, we define its convolution in directioni, denoted by Cif:G[k]→C as
[TABLE]
We also misuse notation slightly and for the given set Z we also treat Z as the indicator function in the expression below.
Theorem 9** (Almost L∞ approximation theorem for convolutions of varieties of low codimension - Conv(k)).**
For given l∈[k], there are constants C=Ck,lconv,D=Dk,lconv>0 with the following property. Let α:G[k]→Fr be a multilinear map, Z={x[k]∈G[k]:α(x[k])=0} and let ϵ>0. Then there are s,t\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}, multiaffine forms βi:G[k]→F for i∈[s], multiaffine map γ:G[k]∖{l}→Ft, constants c1,…,cm∈C, multiaffine maps ρ1,…,ρm∈span{β[s]} and layers L1,…,Ln of γ such that
[TABLE]
for all x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{l}\Big{)}, ∣∪i∈[n]Li∣≤ϵ∣G[k]∖{l}∣ and ∑i∈[m]∣ci∣≤1.
The proof naturally splits into five parts, each showing one of the following implications.
[TABLE]
To complete this inductive scheme, we note that Strong(2) holds, and this is a simple consequence of linear algebra. Indeed, if α:G1×G2→F is a bilinear form such that Ex,yχ(α(x,y))≥c, writing α(x,y)=A(x)⋅y for a linear map A:G1→G2, we see that ∣{A=0}∣≥c∣G1∣. By rank-nullity theorem, A has rank r≤log∣F∣c−1, thus, there are v1,…,vr∈G2, and linear forms β1,…,βr:G1→F such that (∀x∈G1)A(x)=∑i∈[r]viβi(x). Thus α(x,y)=∑i∈[r]βi(x)(vi⋅y), as desired.
In some sense, all the results above can be seen as corollaries of Theorem 6 (or any other theorem listed here), but this would not be an entirely correct viewpoint since the proof has the structure outlined here. Still, the deduction of the strong inverse theorem from the weak one occupies the largest part of the proof. The crucial idea in this part of the proof is the following proposition. To state it, we need to introduce a notion of connectivity for subsets of G[k]. Namely, we consider G[k] as vertices of a graph G with edges between points that differ in exactly one coordinate. We say that a set S⊂G[k] is connected if G[S] is connected. The diameter of S is the largest distance between two vertices in the graph G[S].
Proposition 10** (One-sided regularity lemma).**
Let ρ:G[k]→F,γi:GIi→F, i∈[r] be multilinear maps. Let F={i∈[r]:Ii=[k]}. Suppose that
[TABLE]
for any choice of λ∈FF. Then, the set {x[k]∈G[k]:(∀i∈[r])γi(xIi)=0,ρ(x[k])=0} is connected and of diameter at most (2k+1)(2k−1).
Thus, if the form ρ is sufficiently quasirandom w.r.t. other forms γi, then the set {x[k]∈G[k]:(∀i∈[r])γi(xIi)=0,ρ(x[k])=0} is well-behaved. For our purposes, this means that we may easily remove ρ from the collection of the considered maps. On the other hand, if neither form is sufficiently quasirandom, then we may replace them by forms that depend on fewer coordinates using the weak inverse theorem.
Another idea that plays a very important role in the proof is the dependent random choice, which takes a particularly simple form in the algebraic setting and allows us to externally approximate dense varieties by low-codimensional varieties very efficiently (Lemma 12).
Acknowledgements. I would like to acknowledge the support of the Ministry of Education, Science and Technological Development of the Republic of Serbia, Grant ON174026. I would also like to thank the anonymous referee for a very careful reading.
**§2 Preliminaries
**
From now on, we adopt a non-standard convention and write log, without subscripts, to be a slightly modified version of the logarithm. Namely, for positive real x, we write logx=log∣F∣(∣F∣x)=(log∣F∣x)+1. This has the merit of being greater or equal to 1, when x≥1, which simplifies the calculations. If we write log∣F∣, we still have its usual meaning in mind.
As a warm-up, we show that low-codimensional varieties are necessarily dense. We use this very simple fact in the proofs that follow without explicitly referring to the next lemma.
Lemma 11**.**
Let B be a variety of codimension r in G[k]. Let x[k]∈B. Then there are at least ∣F∣−kr∣G[k]∣ points in B at distance222In the induced graph G[B], where G has the same meaning as in the introduction. at most k from x[k]. In particular, if B is non-empty, then ∣B∣≥∣F∣−kr∣G[k]∣.
Proof.
For i∈[k], we show that there is Yi⊂G[i] of size ∣Yi∣≥∣F∣−ir∣G[i]∣ such that for each y[i]∈Yi, the point (y[i],x[i+1,k]) belongs to B and is at distance at most i from x[k]. Let β:G[k]→Fr be the multiaffine map defining B, thus B={β=λ}, for some λ∈Fr. For i=1, we may take Y1={y1∈G1:β(y1,x[2,k])=λ}×{x[2,k]}. The projection of this set in G1 is a non-empty (since it contains x1) coset of codimension at most r, hence the claim follows. Suppose that the claim holds for some i≤k−1. Similarly as in the previous case, for each y[i]∈Yi look at Z(y[i])={zi+1∈Gi+1:β(y[i],zi+1,x[i+2,k])=λ}, which is again non-empty (it contains xi+1) coset of codimension at most r. Taking Yi+1=∪y[i]∈Yiy[i]×Z(y[i])×{x[i+2,k]} finishes the proof.∎
When A:G[k]→H is a map, we write {A=0}={x[k]∈G[k]:A(x[k])=0}, when there is no danger of confusion. Also, if A1,…,Ar:G[k]→H are maps, and λ∈Fr, we write λ⋅A for the map ∑i∈[r]λiAi, when there is no danger of confusion.
Let A:G[k]→H be a multiaffine map. Then, there is a multiaffine map ϕ:G[k]→Fs such that {A=0}⊂{ϕ=0} and ∣{ϕ=0}∖{A=0}∣≤∣F∣−s∣G[k]∣. If, additionally, A is linear in coordinate c, then so is ϕ.
Proof.
Take h1,…,hs uniformly and independently from H, and set ϕ(x[k])i=A(x[k])⋅hi, i∈[s]. If A is linear in coordinate c, then so is ϕ, as required. Notice that {A=0}⊂{ϕ=0} holds immediately. On the other hand, if x[k]∈G[k] satisfies A(x[k])=0, then
[TABLE]
Thus, E∣{ϕ=0}∖{A=0}∣≤∣F∣−s∣{A=0}∣, and the claim follows.∎
Let A1,…,Ar:G[k]→H be multiaffine maps. Let ϵ>0. Then, there is s≤r+log∣F∣ϵ−1, multiaffine maps ϕ1,…,ϕr:G[k]→Fs such that for each λ∈Fr we have {λ⋅A=0}⊂{λ⋅ϕ=0} and ∣{λ⋅ϕ=0}∖{λ⋅A=0}∣≤ϵ∣G[k]∣.
Proof.
Consider an auxiliary multiaffine map Φ:G[k]×Fr→H, defined by
[TABLE]
Apply Lemma 12 to find s≤log∣F∣(∣F∣rϵ−1)=r+log∣F∣ϵ−1 and a multiaffine map ψ:G[k]×Fr→Fs such that {ψ=0}⊃{Φ=0} and the difference set has density at most ∣F∣−rϵ in G[k]×Fr. Furthermore, since Φ is linear in the last (auxiliary) coordinate, so is ψ. If e1,…,er is the standard basis of Fr, let ϕi be defined by ϕi(x[k])=ψ(x[k],ei). Hence, for each λ∈Fr, we have that
[TABLE]
and
[TABLE]
as desired.∎
When A:G[k]→H is a multiaffine map, we may write A(x[k])=∑I⊂[k]AI(xI), for multilinear maps AI:GI→H (for I=∅, AI is a constant, but not necessarily zero). We call AI the multilinear parts of A. We make use of the following observation of Lovett [11].
Let α:G[k]→F be a multiaffine form, with multilinear parts αI. Then
[TABLE]
Sketch proof.
Write α(x[k])=α′(x[k−1])+A(x[k−1])⋅xk, for multiaffine maps α′ and A. Then
[TABLE]
Apply this observation k−1 more times to end up with α[k] only in the final bound.∎
Recall the definition of Gowers box norm (Definition B.1 in the Appendix B of [5])
[TABLE]
for a map f:G[k]→C, where Conj stands for the conjugation operator. This definition is a generalization of Gowers uniformity norms, which were introduced by Gowers in [2] in additive combinatorics and by Host and Kra in [6] in the context of ergodic theory. For a more detailed discussion of box norms, see [5]. Note that when ϕ:G[k]→F is a multilinear form, we have ∥χ∘ϕ∥□k2k=Ex[k]∈G[k]χ(ϕ(x[k])). The following is the Gowers-Cauchy-Schwarz inequality for the box norm.
Proposition 15**.**
Let fI:G[k]→C be a function for each I⊂[k]. Then
[TABLE]
This can be proved by induction on k, Cauchy-Schwarz and Hölder inequalities, for details, see [5]. Note that Proposition 15 also implies a bound resembling that in Lemma 14, but such a bound would be weaker than that in Lemma 14. In fact, in the rest of the paper we only need Lemma 14, but we include the definition of Gowers box norm and Proposition 15, since we apply it in the introduction in the sketch proof of Theorem 4.
**§3 Weak(k)⟹Strong(k)
**
Proof. We begin the deduction of the strong inverse theorem from the weak one by proving the ‘one-sided regularity lemma’.
Proposition 16**.**
Let ρ:G[k]→F,γi:GIi→F, i∈[r] be multilinear maps. Let F={i∈[r]:Ii=[k]}. Suppose that
[TABLE]
for any choice of λ∈FF. Then, the set {x[k]∈G[k]:(∀i∈[r])γi(xIi)=0,ρ(x[k])=0} is connected and of diameter at most (2k+1)(2k−1).
Proof.
Write r=r0+r1 and reorder maps so that k∈Ii if and only if i∈[r0+1,r]. Also, write S0={x[k−1]∈G[k−1]:(∀i∈[r0])γi(xIi)=0} and S1={x[k]∈G[k]:(∀i∈[r0+1,r])γi(xIi)=0,ρ(x[k])=0}. The set we are interested then becomes S=(S0×Gk)∩S1. We first prove that for almost all of pairs (x[k−1]),(y[k−1])∈G[k−1] we have some z∈Gk such that (x[k−1],z),(y[k−1],z)∈S1.
We may find multilinear maps Γi:GIi∖{k}→Gk,i∈[r0+1,r] and R:G[k−1]→Gk such that
[TABLE]
Observe that if x[k−1],y[k−1]∈G[k−1] are such that
[TABLE]
then we may certainly get a z∈Gk such that (x[k−1],z),(y[k−1],z)∈S1. For the sake of completeness, we include a short proof.
Observation 17**.**
Let G be a F-vector space with a dot product ⋅. Let v1,v2,u1,…,um∈G be elements such that v1,v2∈/span{u1,…,um}. Then we have z∈G such that v1⋅z,v2⋅z=0, but ui⋅z=0 for all i∈[m].
Suppose contrary, for any z with ui⋅z=0 for all i∈[m], we have v1⋅z=0 or v2⋅z=0. Suppose that we have z1 and z2 such that ui⋅z1=0 and ui⋅z2=0 for all i∈[m], but v1⋅z1=0,v2⋅z2=0. Then v2⋅z1=0,v1⋅z2=0 and hence (∀i∈[m])ui⋅(z1+z2)=0, v1⋅(z1+z2)=0,v2⋅(z1+z2)=0, which is a contradiction. Hence, w.l.o.g. we have that whenever ui⋅z=0 for all i∈[m], then v1⋅z=0. But then v1∈span{u1,…,um}, which is the final contradiction.∎
We count the number of pairs x[k−1],y[k−1]∈G[k−1] such that
[TABLE]
by counting for each linear combination λ,μ∈F[r0+1,r] how often
[TABLE]
happens (and analogously for R(y[k−1])). The density of such pairs is exactly
where F⊂[r] is the set of all i such that Ii=[k]. From this we deduce that for all but at most 2∣F∣2rη proportion of pairs x[k−1],y[k−1]∈G[k−1] we have some z∈Gk such that (x[k−1],z),(y[k−1],z)∈S1. Moreover, for any given pair, we have at least ∣F∣−2(r+1)∣Gk∣ such z, since for λ1,λ2∈F
[TABLE]
is an at most 2(r+1)-codimensional coset.
Write F′={i∈[r]:Ii=[k−1]}. Fix t∈Gk, and consider analytic rank of the map τt,λ:G[k−1]→F defined by
[TABLE]
for λ∈FF∪F′. If analytic ranks of τt,λ are small for all choices of λ∈FF∪F′, then the induction hypothesis applies, and Sk:t (recall that this is the slice notation Sk:t={z[k−1]∈G[k−1]:(z[k−1],t)∈S}) is connected and of diameter at most (2k−1)(2k−1−1). For any fixed λ∈FF∪F′, we have
[TABLE]
[TABLE]
hence, by averaging, we obtain a set T⊂Gk of density at least 1−21∣F∣−2r−2, such that
[TABLE]
Consequently, by induction hypothesis, for each z∈T, Sk:z is connected and of diameter at most (2k−1)(2k−1−1).
We are now ready to prove that S is connected and of bounded diameter. We do this in two steps, the first one being to show that this holds for a very large subset of S, and then the second is to extend this to whole S.
Let X⊂G[k−1] be the set of all x[k−1]∈G[k−1] such that for proportion at least 1−2∣F∣rη of y[k−1]∈G[k−1], we have at least ∣F∣−2(r+1)∣Gk∣ of z∈Gk such that (x[k−1],z),(y[k−1],z)∈S1. By the previous argument, we get |X|\geq\Big{(}1-2|\mathbb{F}|^{r}\sqrt{\eta}\Big{)}|G_{[k-1]}|. We claim that (X×Gk)∩S is connected and of diameter at most (2k−1)(2k−2)+3. Indeed, let x[k],y[k]∈(X×Gk)∩S. Since ∣S0∣≥∣F∣−(k−1)r∣G[k−1]∣>4∣F∣rη∣G[k−1]∣, by the way we defined X, we may find some u[k−1]∈S0 such that we have at least ∣F∣−2(r+1)∣Gk∣ of z∈Gk such that (x[k−1],z),(u[k−1],z)∈S1, and we also have at least ∣F∣−2(r+1)∣Gk∣ of z′∈Gk such that (y[k−1],z′),(u[k−1],z′)∈S1. In particular, recalling that |T|\geq\Big{(}1-\frac{1}{2}|\mathbb{F}|^{-2r-2}\Big{)}|G_{[k]}|, we have a choice of z,z′∈T with the above properties. But, Sk:z and Sk:z′ are connected and of diameter at most (2k−1)(2k−1−1), which completes the first step.
Finally, take any x[k]∈S. Since x[k] is at distance at most k to at least ∣F∣−k(r+1)∣G[k]∣ of points in S, and ∣S∖(X×Gk)∣≤∣G[k−1]∖X∣∣Gk∣≤2∣F∣rη∣G[k]∣, at least one such point lies in (X×Gk)∩S, and we are done, with the final diameter bound being (2k−1)(2k−2)+3+2k≤(2k+1)(2k−1).∎
Apply Theorem 5 to α to find m0≤ClogDc−1 and multilinear βi:GIi→F, i∈[m0] where ∅=Ii⊂[k−1] and
[TABLE]
Write α(x[k])=A(x[k−1])⋅xk for a multilinear map A:G[k−1]→Gk. Thus, we also have
[TABLE]
For a set Q, the power-set of Q is the collection of all subsets of Q (including ∅ and Q itself) and is denoted by PQ. By induction on the size of up-set333Collection of sets closed under taking supersets. F⊂P[k−1], we prove the following proposition.
Proposition 18**.**
Let F⊂P[k−1] be an up-set. Then there are constants CF,DF with the following property. We may find mF,nF≤CFlogDFc−1, a collection of multilinear maps ρiF:GJiF→F, where i∈[nF], ∅=JiF⊂[k−1], points yJiFF,i∈GJiF, another collection of multilinear maps βiF:GIiF→F, where i∈[mF], ∅=IiF∈P[k−1]∖F, such that the multilinear map AF:G[k−1]→Gk defined as
For F=∅, we take the given maps βi, and hence n∅=0,m∅=m0. Let F∪{S} be a given up-set, where S is a minimal set inside it, thus making F an up-set as well. Assume that the claim holds for F, and that we get multilinear maps βiF:GIiF→F, i∈[mF], and A′=AF, with the property above. If no i∈[mF] satisfies IiF=S, then the same collection works for F∪{S}. Thus, after reordering maps βiF if necessary, assume that IiF=S if and only if i∈[s]. Let {λ1,…,λd}⊂Fs be a maximal independent set such that for each j∈[d]
[TABLE]
Thus, if we extend λ1,…,λd by further μ1,…,μs−d to a basis of Fs, and setting
[TABLE]
for i∈[d], and
[TABLE]
for i∈[s−d], then we have the following properties:
We first deal with ρi. Let F={i∈[s+1,mF]:IiF⊂[k−1]∖S} and Z=\Big{\{}x_{[k-1]\setminus S}\in G_{[k-1]\setminus S}\colon(\forall i\in F)\beta_{i}(x_{I^{\mathcal{F}}_{i}})=0\Big{\}}. The property (ii) and Proposition 16 imply that for each z[k−1]∖S∈G[k−1]∖S, for each i∈[s−d], the set
[TABLE]
is connected.
Next, we pick points ySi∈GS for each i∈[s−d] and define
[TABLE]
We claim that we may choose the points yS1,…,ySs−d∈GS so that
[TABLE]
Indeed, we may certainly satisfy this condition since otherwise get that some ρi=0 whenever the maps ρj for j=i and γj are all zero, and we may discard ρi.
Next, we set
[TABLE]
and we observe the following.
Proposition 19**.**
For all x[k−1]∈W∩(GS×(Z∩V)) we have the equality
Fix z[k−1]∖S∈Z∩V. We show that for all xS∈Wz[k−1]∖S
[TABLE]
holds. We argue by induction on the maximal index i∈[s−d] such that ρi(xS)=0, and if there are no such i, we put i=0. Thus the base case is when all ρi(xS)=0. However, since (xS,z[k−1]∖S)∈W and z[k−1]∖S∈Z, by property (iii), it follows that A′(xS,z[k−1]∖S)=0. The right hand side is also zero, so the identity holds.
Assume now that the claim holds for values of i smaller than some i0≥1, and that we are given xS∈Wz[k−1]∖S such that ρi0(xS)=0, but ρi(xS)=0 for i>i0. Recall that the set
[TABLE]
is connected. Thus, since xS,ySi0∈R, there is a sequence of points xS=xS0,xS1,…,xSl=ySi0 inside this set, such that every two consecutive points differ in exactly one coordinate (we call such points neighbouring). We introduce the following piece of notation. When a and b are two points in G[k] differing only in coordinate i, we write a−b to be the point with coordinates (a−b)j=aj=bj for j=i and (a−b)i=ai−bi. This notation makes calculations much neater, since for a multilinear map ϕ on G[k], we have ϕ(a−b)=ϕ(a)−ϕ(b).
Let τi=ρi0(xSi) for i∈[0,l]. If xS0 and xS1 differ in coordinate c0, multiply each of points xS1,…,xSl at coordinate c0 by τ0τ1−1. The new points still have the property that the consecutive points are either neighbouring or identical, and that they all lie in the set R. Misusing the notation, we keep writing xSi for the modified points. If c1 is the coordinate where xS1 and xS2 differ, multiply all points among xS2,…,xSl by τ1τ2−1 at coordinate c1, and proceed. Hence, we end up with a sequence xS=xS0,xS1,…,xSl in R such that for each s∈S, xsl=σsysi0 for some σs∈F∖{0}, the consecutive points are either neighbouring or identical and (∀i∈[0,l])ρi0(xSi)=τ0. Hence ρj(xSi−xSi+1)=0 for j∈[i0,s−d]. Thus, we get
[TABLE]
as desired.∎
Hence, the multilinear map AF∪{S} defined by
[TABLE]
satisfies
[TABLE]
On the other hand, recalling the property (i), applying Theorem 5 to each γi, allows us to find
[TABLE]
and further multilinear maps βi,j′:GIi,j→F, j∈[mi], ∅=Ii,j⊂S∖{maxS} such that {xS∈GS:(∀j∈[mi])βi,j′(xIi,j)=0}⊂{xS∈GS:γi(xS)=0}. Thus
[TABLE]
as desired. When it comes to bounds, we may take
[TABLE]
and
[TABLE]
This finishes the proof.∎
Applying Proposition 18 with F=P[k−1] implies that
[TABLE]
for all x[k−1]∈G[k−1]. Taking dot product with xk completes the proof.∎
Hence, we may take C^{\bm{strong}}_{k}=\Big{(}3\overline{C}(7k)^{\overline{D}}\Big{)}^{(\overline{D}+1)^{2^{k}}}, and Dkstrong=D(D+1)2k, where C=Ckweak and D=Dkweak. Hence, if t is a quantity such that Ckweak≤2k2t and Dkweak≤22t, then
[TABLE]
∎
**§4 Strong(k)⟹Inner(k−1)
**
Proof. Let C=Ckstrong and D=Dkstrong. The theorem will follow from the following proposition. For a given F⊂P([k−1]), we say that a multiaffine map B:G[k−1]→H is F-supported if it can be written as B(x[k−1])=∑I∈FBI(xI), for some multilinear maps BI:GI→H.
Proposition 20**.**
Let F⊂P([k−1]) be a non-empty down-set.444A collection of set closed under taking subsets. Then, there are constants CF,DF with the following property. Let ϵ>0 and let B1,…,Br:G[k−1]→H be multiaffine maps. For λ∈Fr, let Zλ={x[k]∈G[k−1]:∑i∈[r]λiBi(x[k−1])=0}. Then, there are s,t\leq C_{\mathcal{F}}\Big{(}r\log\epsilon^{-1}\Big{)}^{D_{\mathcal{F}}}, a multiaffine map β:G[k−1]→Fs, and a collection of F-supported multiaffine maps Γ1,…,Γt:G[k−1]→H such that:
(i)
for each λ∈Fr, there are distinct layers L1,…,Lm of β and multiaffine maps A1,…,Am∈span{Γ[t]}, such that Zλ∩Li={x[k−1]∈G[k−1]:Ai(x[k−1])=0}∩Li,
We may take C_{\mathcal{F}}=\Big{(}2\overline{C}(2k)^{\overline{D}}\Big{)}^{(\overline{D}+1)^{2^{k}-|\mathcal{F}|}} and DF=(2D+2)2k−∣F∣.
Proof.
We prove the claim by down-wards induction on ∣F∣. The base case is F=P([k−1]), in which case we take s=1, β=0, t=r and Γi=Bi.
Assume that we have proved the claim for some F. Let C=CF and D=DF. Let β,Γ[t] be as above for the choice F for the down-set. Thus s,t≤ClogDϵ−1rD. Let I0 be a maximal set in F. The set I0 is non-empty, since we are done otherwise. Let F′=F∖{I0}. Write each Γi(x[k−1])=Γi′(x[k−1])+Ψi(xI0), for a F′-supported multiaffine map Γi′ and a multilinear map Ψi:GI0→H. Look at maximal independent set λ1,…,λd∈Ft such that for each i∈[d], \mathop{\mathbb{E}}_{x_{I_{0}},h}\chi\Big{(}\Big{(}\sum_{j\in[t]}\lambda^{i}_{j}\Psi_{j}(x_{I_{0}})\Big{)}\cdot h\Big{)}\geq\nu=\epsilon 2^{-k}|\mathbb{F}|^{-s}. For each i∈[d], we apply Theorem 6 to the multilinear map on GI0×H given by (x_{I_{0}},h)\mapsto\big{(}\sum_{j\in[t]}\lambda^{i}_{j}\Psi_{j}(x_{I_{0}})\big{)}\cdot h, to find ri≤ClogDν−1, multilinear maps γji:GJij→F and Λji:GI0∖Jij→H, for j∈[ri], where ∅=Jij⊂I0, such that
[TABLE]
Define a multiaffine map β′:G[k−1]→Fs×F{(i,j):i∈[d],j∈[ri]} by
[TABLE]
We now show that β′ and Γ[t]′,Λji, i∈[d],j∈[ri] have the desired properties for the down-set F′.
Take arbitrary λ∈Fr and consider Zλ. By induction hypothesis, there are distinct layers L1,…,Lm of β and multiaffine maps A1,…,Am∈span{Γ[t]}, such that Zλ∩Li={x[k−1]∈G[k−1]:Ai(x[k−1])=0}∩Li, and \Big{|}Z_{\lambda}\setminus\Big{(}\cup_{i\in[m]}(Z_{\lambda}\cap L_{i})\Big{)}\Big{|}\leq\frac{2^{k}-|\mathcal{F}|}{2^{k}}\epsilon|G_{[k-1]}|. Here m≤∣F∣s. For each i∈[m], write Ai(x[k−1])=Ai′(x[k−1])+Ci(xI0) so that Ai′∈span{Γ[t]′} and Ci∈span{Ψ[t]}. Thus, either ExI0∈GI0,h∈Hχ(Ci(xI0)⋅h)<ν, or Ci is a linear combination of λ1⋅Ψ,…,λd⋅Ψ.
where τ is the map defined by (xI0,h)↦Ci(xI0)⋅h. We choose to discard these layers. By doing so, we lose at most 2−kϵ of density in total.
In the latter case, we may find μ∈Fd such that Ci(xI0)=∑j1∈[d],j2∈[ri]μj1γj2j1(xJj1j2)Λj2j1(xI0∖Jj1j2). Partition Li into layers M1,…,Ml of β′. On each layer Mj, Ci becomes a linear combination of Λj2j1, and thus Ai becomes a linear combination of Γj1′,Λj2j1, finishing the proof.
For the bounds, observe the codimension of β′ is at most
[TABLE]
and the number of maps Γj1′,Λj2j1 is at most
[TABLE]
as desired.∎
The theorem follows by applying the proposition for F={∅}. Then each Zλ∩Li=Li is a layer of β. The constants may be taken to be
[TABLE]
where C=Ckstrong and D=Dkstrong. Hence, if t is a quantity such that Ckweak≤2k2t and Dkweak≤22t, using (4), then
[TABLE]
∎
**§5 Inner(k−1)⟹Columns(k)
**
Proof. We begin the proof by proving the following lemma.
Lemma 21**.**
Let G be a F-vector space. Let x1,…,xr∈G and let λ1,…,λr∈F. The following are equivalent.
(i)
There is y∈G such that xi⋅y=λi for each i∈[r].
(ii)
(∀μ∈Fr)∑i∈[r]μixi=0⟹μ⋅λ=0.
Proof.
(i)⟹(ii): Suppose that ∑i∈[r]μixi=0. Take dot product with y.
(ii)⟹(i): Take a maximal independent set {xi1,…,xis} among xi. Renaming xi if necessary, we may assume that this set is {x1,…,xs}. For i∈[s+1,r], we have some μi∈Fs such that xi=∑j∈[s]μjixj. By property (ii) we also have λi=∑j∈[s]μjiλj. Since x1,…,xs are independent, there is y such that xi⋅y=λi for each i∈[s]. Property (i) follows since λi for i∈[s+1,r] satisfy the identities above.∎
Write αi(x[k])=αi′(x[k−1])+xk⋅Ai(x[k−1]) for multiaffine maps αi′:G[k−1]→F and Ai:G[k−1]→Gk. For each coset Λ⊂Fr define V_{\Lambda}=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon(\forall\lambda\in\Lambda)(\exists y\in G_{k})\alpha(x_{[k-1]},y)=\lambda\Big{\}}. Applying the lemma above, we may rewrite this set as
[TABLE]
Notice that for each x[k−1]∈G[k−1], the set {α(x[k−1],y):y∈Gk} is a coset Λ in Fr, and that ∣{y∈Gk:α(x[k−1],y)∈S}∣=∣Λ∣∣Λ∩S∣∣Gk∣. Thus, X is the union of sets of the form V_{\Lambda}\setminus\Big{(}\cup_{\begin{subarray}{c}\Lambda\subsetneq M\subseteq\mathbb{F}^{r}\\
M\text{ coset}\end{subarray}}V_{M}\Big{)}, where Λ are cosets in Fr such that ∣Λ∩S∣≥ϵ∣Λ∣.
Next, for each M≤Fr, define
[TABLE]
Thus
[TABLE]
Let Λ=λ0+Λ0, for a subspace Λ0. Notice that
[TABLE]
Hence
[TABLE]
Apply Theorem 7 to A1,…,Ar to find s\leq C^{\bm{inner}}_{k-1}\Big{(}(4r^{3}+3r^{2})\log\epsilon^{-1}\Big{)}^{D^{\bm{inner}}_{k-1}}, a multiaffine map β:G[k−1]→Fs so that for each λ∈Fr the set {λ⋅A=0} can be internally approximated by layers of β up to error of at most ϵ∣F∣−(4r2+3r) in density. Notice that, if M=⟨μ1,…,μd⟩, where d≤r, then
[TABLE]
Recall that W_{\langle\mu^{i}\rangle}=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon\sum_{i\in[r]}\mu_{i}A_{i}(x_{[k-1]})=0\Big{\}}, so we may we internally approximate W⟨μi⟩ by a union Di of layers of β with error of density at most ϵ∣F∣−(4r2+3r). Thus WM⊃∩i∈[d]Di and
[TABLE]
Also, apply Proposition 13 to A1,…,Ar to find multiaffine maps γ1,…,γr:G[k−1]→Ft, where t≤(5r3+2r2)logϵ−1 such that each W⟨μ⟩ can be externally approximated by {μ⋅γ=0} with error of density at most ∣F∣−5r2−2rϵ. Thus,
[TABLE]
Finally, define a multiaffine map ϕ:G[k−1]→Fr×Fs×Ft by ϕ=(α′,β,γ). Then, each WM can be approximated both internally and externally using layers of ϕ up to error of density at most ∣F∣−4r2−2rϵ. Hence, from (6), every VΛ may be approximated both internally and externally using layers of ϕ up to error of density at most ∣F∣−2r2−2rϵ. Finally, each V_{\Lambda}\setminus\Big{(}\cup_{\begin{subarray}{c}\Lambda\subsetneq M\subseteq\mathbb{F}^{r}\\
M\text{ coset}\end{subarray}}V_{M}\Big{)} may be approximated both internally and externally using layers of ϕ up to error of density at most ∣F∣−r2−rϵ. Thus, the set
[TABLE]
may be approximated both internally and externally using layers of ϕ up to error of density at most ϵ, as desired.
When it comes to bounds, the codimension of the desired map is r+s+t, and we may take Ckcolumns=20Ck−1inner and Dkcolumns=3Dk−1inner. Hence, if t is a quantity such that Ckweak≤2k2t and Dkweak≤22t, using (5), then
[TABLE]
∎
**§6 Columns(k−1)∧Inner(k−1)⟹Conv(k)
**
Proof. We prove the claim by induction on k, followed by induction on l. Recall that we use Z in the expressions below also to denote the indicator function of the set Z. We include the artificial case l=0. In this case, we have Z(x_{[k]})=|\mathbb{F}|^{-r}\sum_{\lambda\in\mathbb{F}^{r}}\chi\Big{(}\lambda\cdot\alpha(x_{[k]})\Big{)}. Hence, in this case we may take Ck,0conv=1,Dk,0conv=1, ci=∣F∣−r, βi=αi.
Write C=Ck,l−1conv,D=Dk,l−1conv. By induction hypothesis there are s,t≤Clog∣F∣D(100ϵ−2)rD, multiaffine forms βi:G[k]→F for i∈[s], multiaffine map γ:G[k]∖{l−1}→Ft, constants c1,…,cm∈C, multiaffine maps ρ1,…,ρm∈span{β[s]} and layers L1,…,Ln of γ such that
[TABLE]
for all x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{l-1}\Big{)}, ∣∪i∈[n]Li∣≤100ϵ2∣G[k]∖{l−1}∣ and ∑i∈[m]∣ci∣≤1.
Let E=\Big{(}\cup_{i\in[n]}L_{i}\Big{)}\times G_{l-1}, which therefore has density at most 100ϵ2. Write ρi(x[k])=ρi′(x[k]∖{l})+Γi(x[k]∖{l})⋅xl for multiaffine maps ρi′:G[k]∖{l}→F and Γi:G[k]∖{l}→Gl. Write a≈νb if ∣a−b∣≤ν. Consider x[k]∈G[k] such that ∣Ex[k]∖{l}∣≤10ϵ∣G[k]∖l∣. For such x[k] we have
[TABLE]
[TABLE]
Write βi(x[k])=βi′(x[k]∖{l})+Ψi(x[k]∖{l})⋅xl for multiaffine maps βi′:G[k]∖{l}→F and Ψi:G[k]∖{l}→Gl. Hence Γi∈span{Ψ[s]}. Thus, for each i,j∈[m], there is μij∈Fs such that Γi−Γj=∑v∈[s]μvijΨv.
Apply Proposition 13 to maps Ψ[s] to find u≤(2s3+s)log(100ϵ−1) and multiaffine maps τ1,…,τs:G[k]∖{l}→Fu such that for each λ∈Fs, we have that {∑i∈[s]λiτi=0}⊃{∑i∈[s]λiΨi=0} and ∣{∑i∈[s]λiτi=0}∖{∑i∈[s]λiΨi=0}∣≤∣F∣−2s2100ϵ∣G[k]∖{l}∣.
Therefore,
[TABLE]
holds for all x[k]∈G[k] outside the set
[TABLE]
Since ∣E∣≤100ϵ2∣G[k]∣, we have
[TABLE]
Also, by the way we chose τ, recalling that m≤∣F∣s,
[TABLE]
Next, apply Theorem 7 to Ψ1,…,Ψs, to find a map ϕ:G[k]∖{l}→Fs′, where
[TABLE]
such that for each λ∈Fs, we may approximate {λ⋅Ψ=0} (and hence each {Γi−Γj=0}) internally using layers of ϕ up to error of at most ∣F∣−2s100ϵ in density.
Finally, we need to approximate \Big{\{}x_{[k]}\in E\colon|E_{x_{[k]\setminus\{l\}}}|\geq\frac{\epsilon}{10}|G_{l}|\Big{\}} externally by layers of a multiaffine map of bounded codimension up to error of density at most 100ϵ. We apply Theorem 8 to γ, recalling that E=\Big{(}\cup_{i\in[n]}L_{i}\Big{)}\times G_{l-1}, to find such a map δ:G[k]∖{l−1,l}→Fw, where
[TABLE]
Hence, we obtained the desired approximation outside layers L1′,…,Lq′ of the multiaffine map (τ,ϕ,δ) of codimension at most
[TABLE]
such that ∣∪i∈[q]Li′∣≤ϵ∣G[k]∣. The multiaffine forms \beta^{\prime}_{[s]},\Big{(}x_{[k]}\mapsto\Psi_{[s]}(x_{[k]\setminus\{l\}})\cdot x_{l}\Big{)},\tau_{[s],[u]} span the set of multiaffine forms used in the arguments of χ in the approximation sum, and their number is at most
[TABLE]
Hence, we may take constants
[TABLE]
and
[TABLE]
where C=Ck,l−1conv,D=Dk,l−1conv, which completes the proof. Hence, if t is a quantity such that Ckweak≤2k2t and Dkweak≤22t, using (7), then
[TABLE]
∎
**§7 Conv(k)⟹Weak(k+1)
**
Proof.
Let α:G[k+1]→F be a multilinear form such that \mathop{\mathbb{E}}_{x_{[k+1]}}\chi\Big{(}\alpha(x_{[k+1]})\Big{)}\geq c. Let α(x[k+1])=A(x[k])⋅xk+1 for a multilinear map A:G[k]→Gk+1. Then {A=0} has density at least c in G[k]. Apply Lemma 12 to find a multilinear β:G[k]→Fr, where r≤2klogc−1+(k+3)log2, such that {A=0}⊂{β=0} and ∣{β=0}∖{A=0}∣≤2−k−3c2k∣G[k]∣. Let Z={β=0}. Apply Theorem 9 to β, to find s,t\leq C^{\bm{conv}}_{k,k}\Big{(}2^{k}\log c^{-1}+k+3\Big{)}^{2\cdot D^{\bm{conv}}_{k,k}}, multiaffine forms τi:G[k]→F for i∈[s], multiaffine map γ:G[k−1]→Ft, constants c1,…,cm∈C, multiaffine maps ρ1,…,ρm∈span{τ[s]} and layers L1,…,Ln of γ such that
[TABLE]
for all x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{k}\Big{)}, ∣∪i∈[n]Li∣≤81c2k∣G[k−1]∣ and ∑i∈[m]∣ci∣≤1.
By applying Cauchy-Schwarz inequality several times, we see that
[TABLE]
By averaging, there is a non-empty layer D of (τ,γ) such that, for each x[k]∈D, Ck⋯C1Z(x[k])≥41c2k.
We now give a definition of an arrangement of points in G[k]. Firstly, we define ∅-arrangement of lengths l[k]∈G[k] to be the singleton sequence whose only element is l[k]. For i∈[k], an [i]-arrangement of lengths l[k]∈G[k] is a sequence of length 2i, being a concatenation (q1,q2) of two [i−1]-arrangements q1 and q2 (for i=1, [0] is taken to be ∅), where q1 has lengths (l[i−1],li+y,l[i+1,k]) and q2 has lengths (l[i−1],y,l[i+1,k]) for some y∈Gi. We note the following.
Lemma 22**.**
(i)
For a set S⊂G[k], and a given x[k]∈G[k], the number of [i]-arrangements of lengths x[k] whose points lie in S is exactly
[TABLE]
(ii)
For x[k],l[k]∈G[k], j≤2i, there are exactly ∣G1∣2i−1−1∣G2∣2i−2−1⋯∣Gi∣20−1[i]-arrangements of lengths l[k] that contain x[k] at jth position, when l[i+1,k]=x[i+1,k], and no such [i]-arrangements otherwise.
(iii)
If all 2i points of an [i]-arrangement of lenghts x[k] lie inside {A=0}, then A(x[k])=0.
As a few times before, we misuse the notation slightly by writing S for the indicator function of the set S.
Proof.
(i): We prove the claim by induction on i. For i=0, the only [0]-arrangement of lengths x[k] is precisely the singleton sequence (x[k]). Thus, the number of such arrangements equals S(x[k]).
Suppose the claim holds for some i and take any x[k]. By definition, each [i+1]-arrangement of lengths x[k] is concatenation of two [i]-arrangements, of lengths (x[i],xi+y,x[i+2,k]) and (x[i],y,x[i+2,k]), for some y∈Gi+1. Using this, and the inductive hypothesis, we see that the number of [i+1] arrangements of lengths x[k] is exactly
[TABLE]
(ii): For i=0, this is clear. Suppose the claim holds for [i]-arrangments. Assume that x[i+2,k]=l[i+2,k], otherwise there are clearly no desired arrangements. Note that from part (i) there are exactly ∣G1∣2i−1∣G2∣2i−2⋯∣Gi∣ of [i]-arrangements of any given lengths. If j≤2i, then we know that the number of [i]-arrangments of lengths (l[i],y,l[i+2,k]) that contain x[k] at jth position is ∣G1∣2i−1−1∣G2∣2i−2−1⋯∣Gi∣20−1, when y=xi+1, and zero otherwise. On the other hand, there are exactly ∣G1∣2i−1∣G2∣2i−2⋯∣Gi∣ of [i]-arrangements of lengths (l[i],xi+1−li+1,l[i+2,k]). The result now follows. The case j>2i can be treated similarly.
(iii): For i=0, this is clear. Suppose the claim holds for [i]-arrangments. Let q be an [i+1]-arrangment of lengths x[k] whose all points lie inside {A=0}. Then, q=(q1,q2) for an [i]-arrangement q1 of lengths (x[i],xi+y,x[i+2,k]) and an [i]-arrangement q2 of lengths (x[i],y,x[i+2,k]). By induction hypothesis, we have A(x[i],xi+y,x[i+2,k])=0 and A(x[i],y,x[i+2,k])=0. Since A is linear in (i+1)th coordinate, A(x[k])=0, as desired.∎
By part (i) of the lemma, for each x[k]∈D, there are at least 41c2k∣G1∣2k−1∣G2∣2k−2⋯∣Gk∣20[k]-arrangements of lengths x[k] with all points in {β=0}. Since ∣{β=0}∖{A=0}∣≤2−(k+3)c2k∣G[k]∣, and, by (ii), each point in G[k] belongs to at most 2k∣G1∣2k−1−1∣G2∣2k−2−1⋯∣Gk∣0[k]-arrangements of lengths x[k] (the factor of 2k comes from the number of positions a point may take in a [k]-arrangement), this implies that for each x[k]∈D, at least one [k]-arrangement of lengths x[k] has all its points inside {A=0}. Using part (iii), we obtain A(x[d])=0. Hence, D⊂{A=0}. To finish the proof we modify D to a variety inside {A=0}, but one which is defined by multilinear maps only.
Lemma 23**.**
Suppose that A:G[k]→H is a multilinear map and that D is a variety of codimension r in G[k] such that D⊂{A=0}. Then, there is R≤22kr, multilinear forms βi:GIi→F, ∅=Ii⊂[k] for i∈[R], such that {x[k]∈G[k]:(∀i∈[R])βi(xIi)=0}⊂{A=0}.
Proof.
By splitting the map that defines D into its multilinear parts, we see that there are mutlilinear forms δi:GIi→F, ∅=Ii⊂[k], scalars λi∈F, i≤r′≤2kr, such that
[TABLE]
By induction on d, we prove that, misusing the notation above, we may replace the maps by new ones so that λi=0, when Ii∩[d]=∅, at the expense of larger bound r′≤2k+dr.
For d=0, the given maps satisfy these properties. Assume now that the claim holds for some d≥0. Take an arbitrary point (v[k]) such that δi(vIi)=λi for all i∈[r′]. We claim that if x[k]∈G[k] satisfies δi(xIi)=λi for d+1∈/Ii, δi(xIi)=0 and δi(xIi∖{d+1},vd+1)=λi for d+1∈Ii, then A(x[k])=0. Indeed, let x[k] be such a point. It suffices to show that A(x[d],xd+1+vd+1,x[d+2,k])=A(x[d],vd+1,x[d+2,k])=0. Since δi(xIi)=λi for d+1∈/Ii, we just need to show that δi(xIi∖{d+1};xd+1+vd+1)=δi(xIi∖{d+1};vd+1)=λi for d+1∈Ii, which we already know to hold. Note also that v[k] satisfies all these equalities, so we get a non-empty variety. Thus, we may replace the given maps with at most 2r′ maps with desired properties.∎
This completes the proof. As far as the bounds are concerned, we may find a variety inside {A=0} with desired properties with codimension bounded by
[TABLE]
Thus, we may take Ck+1weak=22k+1Ck,kconv(2k+k+3)2Dk,kconv and Dk+1weak=2Dk,kconv. Hence, if t is a quantity such that Ckweak≤2k2t and Dkweak≤22t, using (8), then
[TABLE]
∎
Bibliography12
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] A. Bhowmick and S. Lovett, Bias vs structure of polynomials in large fields, and applications in effective algebraic geometry and coding theory , ar Xiv preprint (2015), ar Xiv:1506.02047 .
2[2] W.T. Gowers, A new proof of Szemerédi’s theorem , Geometric and Functional Analysis 11 (2001), no. 3, 465–588.
3[3] W.T. Gowers and J. Wolf, Linear forms and higher-degree uniformitty functions on 𝔽 p n subscript superscript 𝔽 𝑛 𝑝 \mathbb{F}^{n}_{p} , Geometric and Functional Analysis 21 (2011), no. 1, 36–69.
4[4] B. Green and T. Tao. The distribution of polynomials over finite fields, with applications to the Gowers norms , Contributions to Discrete Mathematics 4 (2009), no. 2, 1–36.
5[5] B. Green and T. Tao, Linear equations in primes , Annals of Mathematics 171 (2010), no. 3, 1753–1850.
6[6] B. Host and B. Kra, Nonconventional ergodic averages and nilmanifolds , Annals of Mathematics 161 (2005), no. 1, 397–488.
7[7] O. Janzer, Low analytic rank implies low partition rank for tensors , ar Xiv preprint (2018) ar Xiv:1809.10931 .
8[8] O. Janzer, Polynomial bound for the partition rank vs the analytic rank of tensors , ar Xiv preprint (2019) ar Xiv:1902.11207 .