Polynomial bound for the partition rank vs the analytic rank of tensors

Oliver Janzer

arXiv:1902.11207·math.CO·May 19, 2020

Polynomial bound for the partition rank vs the analytic rank of tensors

Oliver Janzer

PDF

TL;DR

This paper proves a polynomial bound relating the partition rank and the analytic rank of tensors over finite fields, improving previous tower and Ackermann-type bounds, with implications for biased polynomials.

Contribution

It establishes a polynomial bound for the partition rank in terms of the analytic rank, advancing from tower and Ackermann bounds, and independently confirms similar results by Milićević.

Findings

01

Partition rank is polynomially bounded by analytic rank for tensors.

02

Improves previous bounds from tower and Ackermann-type functions.

03

Shows biased polynomials have low rank with polynomial dependence.

Abstract

A tensor defined over a finite field $F$ has low analytic rank if the distribution of its values differs significantly from the uniform distribution. An order $d$ tensor has partition rank 1 if it can be written as a product of two tensors of order less than $d$ , and it has partition rank at most $k$ if it can be written as a sum of $k$ tensors of partition rank 1. In this paper, we prove that if the analytic rank of an order $d$ tensor is at most $r$ , then its partition rank is at most $f (r, d, ∣ F ∣)$ , where, for fixed $d$ and $F$ , $f$ is a polynomial in $r$ . This is an improvement of a recent result of the author, where he obtained a tower-type bound. Prior to our work, the best known bound was an Ackermann-type function in $r$ and $d$ , though it did not depend on $F$ . It follows from our results that a biased polynomial has low rank; there too we…

Figures2

Click any figure to enlarge with its caption.

Equations70

rank (P) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1

rank (P) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1

\|f\|_{U^{d}}=\big{|}\mathbb{E}_{x,y_{1},\dots,y_{d}\in G}\prod_{S\subset[d]}\mathcal{C}^{d-|S|}f(x+\sum_{i\in S}y_{i})\big{|}^{1/2^{d}},

\|f\|_{U^{d}}=\big{|}\mathbb{E}_{x,y_{1},\dots,y_{d}\in G}\prod_{S\subset[d]}\mathcal{C}^{d-|S|}f(x+\sum_{i\in S}y_{i})\big{|}^{1/2^{d}},

rank (P) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1

rank (P) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1

∣ E_{x \in F^{n}} ω^{P (x)} \overline{ω^{Q (x)}} ∣ \geq ∣ F ∣^{- (c \cdot 2^{d} \cdot l o g (1/ ϵ))^{c^{'} (d)} - 1}

∣ E_{x \in F^{n}} ω^{P (x)} \overline{ω^{Q (x)}} ∣ \geq ∣ F ∣^{- (c \cdot 2^{d} \cdot l o g (1/ ϵ))^{c^{'} (d)} - 1}

1=\mathbb{E}_{x\in\mathbb{F}^{n}}|\omega^{P(x)}|^{2}=\sum_{\chi\in\hat{G}}\overline{\hat{g}(\chi)}\bigg{(}\mathbb{E}_{x\in\mathbb{F}^{n}}\omega^{P(x)}\overline{\chi(Q_{1}(x),\dots,Q_{r}(x))}\bigg{)}.

1=\mathbb{E}_{x\in\mathbb{F}^{n}}|\omega^{P(x)}|^{2}=\sum_{\chi\in\hat{G}}\overline{\hat{g}(\chi)}\bigg{(}\mathbb{E}_{x\in\mathbb{F}^{n}}\omega^{P(x)}\overline{\chi(Q_{1}(x),\dots,Q_{r}(x))}\bigg{)}.

E_{v^{1} \in V_{1}, \dots, v^{d} \in V_{d}} [χ (T (v^{1}, \dots, v^{d}))]

E_{v^{1} \in V_{1}, \dots, v^{d} \in V_{d}} [χ (T (v^{1}, \dots, v^{d}))]

= P_{v^{1} \in V_{1}, \dots, v^{d - 1} \in V_{d - 1}} [T (v^{1}, \dots, v^{d - 1}, x) \equiv 0],

prank (T) \leq (c \cdot lo g ∣ F ∣)^{c^{'} (d)} \cdot r^{c^{'} (d)}

prank (T) \leq (c \cdot lo g ∣ F ∣)^{c^{'} (d)} \cdot r^{c^{'} (d)}

bias (T) = E_{y_{1}, \dots, y_{d} \in F^{n}} χ (T (y_{1}, \dots, y_{d})) = E_{y_{1}, \dots, y_{d} \in F^{n}} S \subset [d] \prod C^{d - ∣ S ∣} f (x + i \in S \sum y_{i})

bias (T) = E_{y_{1}, \dots, y_{d} \in F^{n}} χ (T (y_{1}, \dots, y_{d})) = E_{y_{1}, \dots, y_{d} \in F^{n}} S \subset [d] \prod C^{d - ∣ S ∣} f (x + i \in S \sum y_{i})

prank (T) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} .

prank (T) \leq (c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} .

(c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1.

(c \cdot 2^{d} \cdot lo g (1/ ϵ))^{c^{'} (d)} + 1.

prank (T)

prank (T)

= 2^{d - 1} ((lo g ∣ F ∣) \cdot c_{1} (d - 1) \cdot lo g (∣ F ∣^{r}))^{c_{2} (d - 1)}

= 2^{d - 1} ((lo g ∣ F ∣)^{2} \cdot c_{1} (d - 1) \cdot r)^{c_{2} (d - 1)}

\leq ((lo g ∣ F ∣)^{2} \cdot c_{1} (d) \cdot r)^{c_{2} (d - 1)}

r \in V_{{1, 2, 3}} + V_{{1}} \otimes F^{{2, 3}} + F^{n_{1}} \otimes V_{{2, 3}} + F^{n_{2}} \otimes K_{{1, 3}} (r) + F^{n_{3}} \otimes K_{{1, 2}} (r)

r \in V_{{1, 2, 3}} + V_{{1}} \otimes F^{{2, 3}} + F^{n_{1}} \otimes V_{{2, 3}} + F^{n_{2}} \otimes K_{{1, 3}} (r) + F^{n_{3}} \otimes K_{{1, 2}} (r)

r_{2} \in V_{{1}} \otimes F^{{2, 3}} + F^{n_{1}} \otimes V_{{2, 3}} + F^{n_{3}} \otimes K_{{1, 2}} (r), r_{3} \in V_{{1, 2, 3}}, r_{4} \in F^{n_{2}} \otimes K_{{1, 3}} (r) .

r_{2} \in V_{{1}} \otimes F^{{2, 3}} + F^{n_{1}} \otimes V_{{2, 3}} + F^{n_{3}} \otimes K_{{1, 2}} (r), r_{3} \in V_{{1, 2, 3}}, r_{4} \in F^{n_{2}} \otimes K_{{1, 3}} (r) .

r_{4} \in V_{{2}} \otimes F^{{1, 3}} + F^{n_{2}} \otimes V_{{1, 3}} + F^{n_{3}} \otimes L_{{1, 2}}^{'} (r)

r_{4} \in V_{{2}} \otimes F^{{1, 3}} + F^{n_{2}} \otimes V_{{1, 3}} + F^{n_{3}} \otimes L_{{1, 2}}^{'} (r)

r u \in W_{{1, 3}} (u) + F^{n_{1}} \otimes W_{{3}} (u) + W_{{1}} (u) \otimes F^{n_{3}}

r u \in W_{{1, 3}} (u) + F^{n_{1}} \otimes W_{{3}} (u) + W_{{1}} (u) \otimes F^{n_{3}}

r_{4} u = r u - r_{2} u - r_{3} u \in W_{{1, 3}} (u) + V_{{1, 2, 3}} u + s (u)

r_{4} u = r u - r_{2} u - r_{3} u \in W_{{1, 3}} (u) + V_{{1, 2, 3}} u + s (u)

r \in W_{[d]} + J ≺ I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ⪰ I \sum F^{J} \otimes H_{J^{c}} (r)

r \in W_{[d]} + J ≺ I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ⪰ I \sum F^{J} \otimes H_{J^{c}} (r)

r \in W_{[d]} + J ⪯ I \sum (U_{J} \otimes F^{J^{c}} + F^{J} \otimes U_{J^{c}}) + J ≻ I \sum F^{J} \otimes K_{J^{c}} (r)

r \in W_{[d]} + J ⪯ I \sum (U_{J} \otimes F^{J^{c}} + F^{J} \otimes U_{J^{c}}) + J ≻ I \sum F^{J} \otimes K_{J^{c}} (r)

r \in W_{[d]} + J ≺ I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ⪰ I \sum F^{J} \otimes H_{J^{c}} (r)

r \in W_{[d]} + J ≺ I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ⪰ I \sum F^{J} \otimes H_{J^{c}} (r)

W_{[d]} + J ⪯ I \sum (U_{J} \otimes F^{J^{c}} + F^{J} \otimes U_{J^{c}}) + J ≻ I \sum F^{J} \otimes K_{J^{c}} (r)

W_{[d]} + J ⪯ I \sum (U_{J} \otimes F^{J^{c}} + F^{J} \otimes U_{J^{c}}) + J ≻ I \sum F^{J} \otimes K_{J^{c}} (r)

g_{1}

g_{1}

\leq ((lo g ∣ F ∣)^{2} c_{1} (d - 1) 2^{3^{d + 4}} C (d lo g 1/ δ)^{4})^{c_{2} (d - 1)} \leq ((lo g ∣ F ∣)^{2} (c_{1} (d - 1))^{2} (lo g 1/ δ)^{4})^{c_{2} (d - 1)}

\leq G (d - 1, δ)^{4} \leq k^{2} .

r_{1} \in J \subset I, J \neq = I, J \neq = \emptyset \sum W_{J} \otimes F^{J^{c}},

r_{1} \in J \subset I, J \neq = I, J \neq = \emptyset \sum W_{J} \otimes F^{J^{c}},

r_{2} \in J ≺ I, J \neq \subset I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ≻ I \sum F^{J} \otimes H_{J^{c}} (r),

r_{2} \in J ≺ I, J \neq \subset I \sum (W_{J} \otimes F^{J^{c}} + F^{J} \otimes W_{J^{c}}) + J ≻ I \sum F^{J} \otimes H_{J^{c}} (r),

r_{3} \in W_{[d]} + J \subset I, J \neq = I, J \neq = \emptyset \sum F^{J} \otimes W_{J^{c}},

r_{3} \in W_{[d]} + J \subset I, J \neq = I, J \neq = \emptyset \sum F^{J} \otimes W_{J^{c}},

r_{4} \in F^{I} \otimes H_{I^{c}} (r) .

r_{4} \in F^{I} \otimes H_{I^{c}} (r) .

s \in Q^{'} \sum dim K (j + 1, s) \geq g_{3} ∣ Q^{'} ∣ + s \in Q^{'} \sum dim K (j, s) .

s \in Q^{'} \sum dim K (j + 1, s) \geq g_{3} ∣ Q^{'} ∣ + s \in Q^{'} \sum dim K (j, s) .

∣ Q^{'} ∣ g_{4} \geq s \in Q^{'} \sum dim K (m, s) \geq m g_{3} ∣ Q^{'} ∣,

∣ Q^{'} ∣ g_{4} \geq s \in Q^{'} \sum dim K (m, s) \geq m g_{3} ∣ Q^{'} ∣,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Polynomial bound for the Partition Rank vs the Analytic Rank of Tensors

Oliver Janzer

Abstract

A tensor defined over a finite field $\mathbb{F}$ has low analytic rank if the distribution of its values differs significantly from the uniform distribution. An order $d$ tensor has partition rank 1 if it can be written as a product of two tensors of order less than $d$ , and it has partition rank at most $k$ if it can be written as a sum of $k$ tensors of partition rank 1. In this paper, we prove that if the analytic rank of an order $d$ tensor is at most $r$ , then its partition rank is at most $f(r,d,|\mathbb{F}|)$ , where, for fixed $d$ and $\mathbb{F}$ , $f$ is a polynomial in $r$ . This is an improvement of a recent result of the author, where he obtained a tower-type bound. Prior to our work, the best known bound was an Ackermann-type function in $r$ and $d$ , though it did not depend on $\mathbb{F}$ . It follows from our results that a biased polynomial has low rank; there too we obtain a polynomial dependence improving the previously known Ackermann-type bound.

A similar polynomial bound for the partition rank was obtained independently and simultaneously by Milićević.

\dajAUTHORdetails

title = Polynomial bound for the Partition Rank vs the Analytic Rank of Tensors, author = Oliver Janzer, plaintextauthor = Oliver Janzer, keywords = partition rank, analytic rank, tensor, \dajEDITORdetailsyear=2020, number=7, received=23 October 2018, revised=6 March 2019, published=19 May 2020, doi=10.19086/da.12935,

[classification=text]

1 Introduction

1.1 Bias and rank of polynomials

For a finite field $\mathbb{F}$ and a polynomial $P:\mathbb{F}^{n}\rightarrow\mathbb{F}$ , we say that $P$ is unbiased if the distribution of the values $P(x)$ is close to the uniform distribution on $\mathbb{F}$ ; otherwise we say that $P$ is biased. It is an important direction of research in higher order Fourier analysis to understand the structure of biased polynomials.

Note that a generic degree $d$ polynomial should be unbiased. In fact, as we will see below, if a degree $d$ polynomial is biased, then it can be written as a function of not too many polynomials of degree at most $d-1$ . Let us now make this discussion more precise.

Definition 1.1.

Let $\mathbb{F}$ be a finite field and let $\chi$ be a nontrivial character of $\mathbb{F}$ . The bias of a function $f:\mathbb{F}^{n}\rightarrow\mathbb{F}$ with respect to $\chi$ is defined to be ${\rm bias}_{\chi}(f)=\mathbb{E}_{x\in\mathbb{F}^{n}}[\chi(f(x))]$ . (Here and elsewhere in the paper $\mathbb{E}_{x\in G}h(x)$ denotes $\frac{1}{|G|}\sum_{x\in G}h(x)$ .)

*Remark 1.2**.*

Most of the previous work is on the case $\mathbb{F}=\mathbb{F}_{p}$ with $p$ a prime, in which case the standard definition of bias is ${\rm bias}(f)=\mathbb{E}_{x\in\mathbb{F}^{n}}\omega^{f(x)}$ where $\omega=e^{\frac{2\pi i}{p}}$ .

Definition 1.3.

Let $P$ be a polynomial $\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree $d$ . The rank of $P$ (denoted $\mathop{\mathrm{rank}}(P)$ ) is defined to be the smallest integer $r$ such that there exist polynomials $Q_{1},\dots,Q_{r}:\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree at most $d-1$ and a function $f:\mathbb{F}^{r}\rightarrow\mathbb{F}$ such that $P=f(Q_{1},\dots,Q_{r})$ .

As discussed above, it is known that if a polynomial has large bias, then it has low rank. The first result in this direction was proved by Green and Tao [4] who showed that if $\mathbb{F}$ is a field of prime order and $P:\mathbb{F}^{n}\rightarrow\mathbb{F}$ is a polynomial of degree $d$ with $d<|\mathbb{F}|$ and ${\rm bias}(P)\geq\delta>0$ , then $\mathop{\mathrm{rank}}(P)\leq c(\mathbb{F},\delta,d)$ . Kaufman and Lovett [8] proved that the condition $d<|\mathbb{F}|$ can be omitted. In both results, $c$ has Ackermann-type dependence on its parameters. Finally, Bhowmick and Lovett [1] proved that if $d<\text{char}(\mathbb{F})$ and ${\rm bias}(P)\geq|\mathbb{F}|^{-s}$ , then $\mathop{\mathrm{rank}}(P)\leq c^{\prime}(d,s)$ . The novelty of this result is that $c^{\prime}$ does not depend on $\mathbb{F}$ . However, it still has Ackermann-type dependence on $d$ and $s$ .

One of our main results is the following theorem, which improves the result of Bhowmick and Lovett, unless $|\mathbb{F}|$ is very large.

Theorem 1.4.

Let $\mathbb{F}$ be a finite field and let $\chi$ be a nontrivial character of $\mathbb{F}$ . Let $P$ be a polynomial $\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree $d<\text{char}(\mathbb{F})$ . Suppose that ${\rm bias}_{\chi}(P)\geq\epsilon>0$ where $\epsilon\leq 1/|\mathbb{F}|$ . Then

[TABLE]

where $c$ is an absolute constant and $c^{\prime}(d)=4^{d^{d}}$ .

Recall that if $G$ is an Abelian group and $d$ is a positive integer, then the Gowers $U^{d}$ norm (which is only a seminorm for $d=1$ ) of $f:G\rightarrow\mathbb{C}$ is defined to be

[TABLE]

where $\mathcal{C}$ is the conjugation operator. It is a major area of research to understand the structure of functions $f$ whose $U^{d}$ norm is large. Our next theorem is a result in this direction.

Theorem 1.5.

Let $\mathbb{F}$ be a finite field and let $\chi$ be a nontrivial character of $\mathbb{F}$ . Let $P$ be a polynomial $\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree $d<\text{char}(\mathbb{F})$ . Let $f(x)=\chi(P(x))$ and assume that $\|f\|_{U^{d}}\geq\epsilon>0$ where $\epsilon\leq 1/|\mathbb{F}|$ . Then

[TABLE]

where $c$ is an absolute constant and $c^{\prime}(d)=4^{d^{d}}$ .

Our result implies a similar improvement to the bounds for the quantitative inverse theorem for Gowers norms for polynomial phase functions of degree $d$ .

Theorem 1.6.

Let $\mathbb{F}$ be a field of prime order and let $P$ be a polynomial $\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree $d<\text{char}(\mathbb{F})$ . Let $f(x)=\omega^{P(x)}$ where $\omega=e^{\frac{2\pi i}{|\mathbb{F}|}}$ and assume that $\|f\|_{U^{d}}\geq\epsilon>0$ where $\epsilon\leq 1/|\mathbb{F}|$ . Then there exists a polynomial $Q:\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree at most $d-1$ such that

[TABLE]

where $c$ is an absolute constant and $c^{\prime}(d)=4^{d^{d}}$ .

Theorems 1.4 and 1.6 easily follow from Theorem 1.5.

Proof of Theorem 1.4. Note that when $f(x)=\chi(P(x))$ , then $\|f\|^{2}_{U^{1}}=|\mathbb{E}_{x,y\in\mathbb{F}^{n}}\overline{f(x)}f(x+y)|=|\mathbb{E}_{x\in\mathbb{F}^{n}}f(x)|^{2}$ , so $\|f\|_{U^{1}}=|E_{x\in\mathbb{F}^{n}}f(x)|=|{\rm bias}_{\chi}(P)|$ . However, $\|f\|_{U^{k}}$ is increasing in $k$ (see eg. Claim 6.2.2 in [6]), therefore $\|f\|_{U^{d}}\geq|{\rm bias}_{\chi}(P)|\geq\epsilon$ . The result is now immediate from Theorem 1.5.

Proof of Theorem 1.6. By Theorem 1.5, there exists a set of $r\leq(c\cdot 2^{d}\cdot\log(1/\epsilon))^{c^{\prime}(d)}+1$ polynomials $Q_{1},\dots,Q_{r}$ such that $P(x)$ is a function of $Q_{1}(x),\dots,Q_{r}(x)$ .

Then $\omega^{P(x)}=g(Q_{1}(x),\dots,Q_{r}(x))$ for some function $g:\mathbb{F}^{r}\rightarrow\mathbb{C}$ . Let $G=\mathbb{F}^{r}$ . Note that $|g(y)|=1$ for all $y\in G$ , therefore $|\hat{g}(\chi)|\leq 1$ for every character $\chi\in\hat{G}$ . Now $\omega^{P(x)}=\sum_{\chi\in\hat{G}}\hat{g}(\chi)\,\chi((Q_{1}(x),\dots,Q_{r}(x))$ , so

[TABLE]

Thus, there exists some $\chi\in\hat{G}$ with $|\mathbb{E}_{x\in\mathbb{F}^{n}}\omega^{P(x)}\overline{\chi(Q_{1}(x),\dots,Q_{r}(x))}|\geq 1/|G|=1/|\mathbb{F}|^{r}$ . But $\chi$ is of the form $\chi(y_{1},\dots,y_{r})=\omega^{\sum_{i\leq r}\alpha_{i}y_{i}}$ for some $\alpha_{i}\in\mathbb{F}$ . Then $\chi(Q_{1}(x),\dots,Q_{r}(x))=\omega^{Q_{\alpha}(x)}$ , where $Q_{\alpha}$ is the degree $d-1$ polynomial $Q_{\alpha}(x)=\sum_{i\leq r}\alpha_{i}Q_{i}(x)$ . So $Q=Q_{\alpha}$ is a suitable choice.

1.2 Analytic rank and partition rank of tensors

Related to the bias and rank of polynomials are the notions of analytic rank and partition rank of tensors. Recall that if $\mathbb{F}$ is a field and $V_{1},\dots,V_{d}$ are finite dimensional vector spaces over $\mathbb{F}$ , then an order $d$ tensor is a multilinear map $T:V_{1}\times\dots\times V_{d}\rightarrow\mathbb{F}$ . (In this subsection, assume that $d\geq 2$ .) Each $V_{k}$ can be identified with $\mathbb{F}^{n_{k}}$ for some $n_{k}$ , and then there exist $t_{i_{1},\dots,i_{d}}\in\mathbb{F}$ for all $i_{1}\leq n_{1},\dots,i_{d}\leq n_{d}$ such that $T(v^{1},\dots,v^{d})=\sum_{i_{1}\leq n_{1},\dots,i_{d}\leq n_{d}}t_{i_{1},\dots,i_{d}}v^{1}_{i_{1}}\dots v^{d}_{i_{d}}$ for every $v^{1}\in\mathbb{F}^{n_{1}},\dots,v^{d}\in\mathbb{F}^{n_{d}}$ (where $v_{k}$ is the $k$ th coordinate of the vector $v$ ). Indeed, $t_{i_{1},\dots,i_{d}}$ is just $T(e^{i_{1}},\dots,e^{i_{d}})$ , where $e^{i}$ is the $i$ th standard basis vector.

The following notion was introduced by Gowers and Wolf [3].

Definition 1.7.

Let $\mathbb{F}$ be a finite field, let $V_{1},\dots,V_{d}$ be finite dimensional vector spaces over $\mathbb{F}$ and let $T:V_{1}\times\dots\times V_{d}\rightarrow\mathbb{F}$ be an order $d$ tensor. Then the analytic rank of $T$ is defined to be ${\rm arank}(T)=-\log_{|\mathbb{F}|}{\rm bias}(T)$ , where ${\rm bias}(T)=\mathbb{E}_{v^{1}\in V_{1},\dots,v^{d}\in V_{d}}[\chi(T(v^{1},\dots,v^{d}))]$ for any nontrivial character $\chi$ of $\mathbb{F}$ .

*Remark 1.8**.*

This is well-defined. Indeed, if $\chi$ is a nontrivial character of $\mathbb{F}$ , then

[TABLE]

where $T(v^{1},\dots,v^{d-1},x)$ is viewed as a function in $x$ . The second equality holds because

$\mathbb{E}_{v^{d}\in V_{d}}\,\chi(T(v^{1},\dots,v^{d}))=0$ unless $T(v^{1},\dots,v^{d-1},x)\equiv 0$ , in which case it is 1.

Thus, $\mathbb{E}_{v^{1}\in V_{1},\dots,v^{d}\in V_{d}}[\,\chi(T(v^{1},\dots,v^{d}))]$ does not depend on $\chi$ , and is always positive. Moreover, it is at most 1, therefore the analytic rank is always nonnegative.

A different notion of rank was defined by Naslund [13].

Definition 1.9.

Let $T:V_{1}\times\dots\times V_{d}\rightarrow\mathbb{F}$ be a (non-zero) order $d$ tensor. We say that $T$ has partition rank 1 if there is some $S\subset[d]$ with $S\neq\emptyset,S\neq[d]$ such that $T(v^{1},\dots,v^{d})=T_{1}(v^{i}:i\in S)T_{2}(v^{i}:i\not\in S)$ where $T_{1}:\prod_{i\in S}V_{i}\rightarrow\mathbb{F},T_{2}:\prod_{i\not\in S}V_{i}\rightarrow\mathbb{F}$ are tensors. In general, the partition rank of $T$ is the smallest $r$ such that $T$ can be written as the sum of $r$ tensors of partition rank 1. This number is denoted ${\rm prank}(T)$ .

Kazhdan and Ziegler [9] and Lovett [11] proved that ${\rm arank}(T)\leq{\rm prank}(T)$ . In the other direction, it follows from the work of Bhowmick and Lovett [1] that if an order $d$ tensor $T$ has ${\rm arank}(T)\leq r$ , then ${\rm prank}(T)\leq f(r,d)$ for some function $f$ . Note that $f$ does not depend on $|\mathbb{F}|$ or the dimension of the vector spaces $V_{k}$ . However, $f$ has an Ackermann-type dependence on $d$ and $r$ . For $d=3,4$ , better bounds were established by Haramaty and Shpilka [5]. They proved that for $d=3$ we have ${\rm prank}(T)=O(r^{4})$ , and that for $d=4$ we have ${\rm prank}(T)=\exp(O(r))$ .

The main result of our paper is a polynomial upper bound, which holds for general $d$ .

Theorem 1.10.

Let $T:V_{1}\times\dots\times V_{d}\rightarrow\mathbb{F}$ be an order $d$ tensor with ${\rm arank}(T)\leq r$ and assume that $r\geq 1$ . Then

[TABLE]

for some absolute constant $c$ , and $c^{\prime}(d)=4^{d^{d}}$ .

We remark that a very similar result was obtained independently and simultaneously by Milićević [12]. Moreover, in the special case $d=4$ , a similar bound was proved independently by Lampert [10].

It is not hard to see that Theorem 1.10 implies Theorem 1.5. Indeed, let $P$ be a polynomial $\mathbb{F}^{n}\rightarrow\mathbb{F}$ of degree $d<\text{char}(\mathbb{F})$ , let $f(x)=\chi(P(x))$ and assume that $\|f\|_{U^{d}}\geq\epsilon>0$ , where $\epsilon\leq 1/|\mathbb{F}|$ . Define $T:(\mathbb{F}^{n})^{d}\rightarrow\mathbb{F}$ by $T(y_{1},\dots,y_{d})=\sum_{S\subset[d]}(-1)^{d-|S|}P(\sum_{i\in S}y_{i})$ . By Lemma 2.4 from [3], $T$ is a tensor of order $d$ . Moreover, by the same lemma, we have $T(y_{1},\dots,y_{d})=\sum_{S\subset[d]}(-1)^{d-|S|}P(x+\sum_{i\in S}y_{i})$ for any $x\in\mathbb{F}^{n}$ . Thus,

[TABLE]

for any $x\in\mathbb{F}^{n}$ . By averaging over all $x\in\mathbb{F}^{n}$ , it follows that ${\rm bias}(T)=\|f\|^{2^{d}}_{U^{d}}\geq\epsilon^{2^{d}}$ . Thus, ${\rm arank}(T)\leq 2^{d}\log_{|\mathbb{F}|}(1/\epsilon)$ . Note that $2^{d}\log_{|\mathbb{F}|}(1/\epsilon)\geq 1$ . Therefore, by Theorem 1.10 with $r=2^{d}\log_{|\mathbb{F}|}(1/\epsilon)$ , we get

[TABLE]

Note that $T(y_{1},\dots,y_{d})=D_{y_{1}}\dots D_{y_{d}}P(x)$ where $D_{y}g(x)=g(x+y)-g(x)$ . Thus, by Taylor’s approximation theorem, since $d<\text{char}(\mathbb{F})$ , we get $P(x)=\frac{1}{d!}D_{x}\dots D_{x}P(0)+W(x)=\frac{1}{d!}T(x,\dots,x)+W(x)$ for some polynomial $W$ of degree at most $d-1$ .

By equation (1), $T$ can be written as a sum of at most $(c\cdot 2^{d}\cdot\log(1/\epsilon))^{c^{\prime}(d)}$ tensors of partition rank 1. Hence, $\frac{1}{d!}T(x,\dots,x)$ can be written as a sum of at most $(c\cdot 2^{d}\cdot\log(1/\epsilon))^{c^{\prime}(d)}$ expressions of the form $Q(x)R(x)$ where $Q,R$ are polynomials of degree at most $d-1$ each. Thus, $P-W$ has rank at most $(c\cdot 2^{d}\cdot\log(1/\epsilon))^{c^{\prime}(d)}$ , and therefore $P$ has rank at most

[TABLE]

We remark that the proof of the main result of this paper follows the strategy introduced by the author in [7], but the argument is improved locally at a few places.

2 The proof of Theorem 1.10

2.1 Notation and preliminaries

In the rest of the paper, we identify $V_{i}$ with $\mathbb{F}^{n_{i}}$ . Thus, the set of all tensors $V_{1}\times\dots\times V_{d}\rightarrow\mathbb{F}$ is the tensor product $\mathbb{F}^{n_{1}}\otimes\dots\otimes\mathbb{F}^{n_{d}}$ , which will be denoted by $\mathcal{G}$ throughout this section. Also, $\mathcal{B}$ will always stand for the multiset $\{u_{1}\otimes\dots\otimes u_{d}:u_{i}\in\mathbb{F}^{n_{i}}\text{ for all }i\}$ . The elements of $\mathcal{B}$ will be called pure tensors. Note that $\mathcal{G}=\mathbb{F}^{n_{1}}\otimes\dots\otimes\mathbb{F}^{n_{d}}$ can be viewed as the set of $d$ -dimensional $(n_{1},\dots,n_{d})$ -arrays over $\mathbb{F}$ which in turn can be viewed as $\mathbb{F}^{n_{1}n_{2}\dots n_{d}}$ , equipped with the entry-wise dot product.

For $I\subset[d]$ , we write $\mathbb{F}^{I}$ for $\bigotimes_{i\in I}\mathbb{F}^{n_{i}}$ so that we naturally have $\mathcal{G}=\mathbb{F}^{I}\otimes\mathbb{F}^{I^{c}}$ , where $I^{c}$ always denotes $[d]\setminus I$ .

If $r\in\mathbb{F}^{[d]}=\mathcal{G}$ and $s\in\mathbb{F}^{[k]}$ (for some $k\leq d$ ), then we define $rs$ to be the tensor in $\mathbb{F}^{[k+1,d]}$ with coordinates $(rs)_{i_{k+1},\dots,i_{d}}=\sum_{i_{1}\leq n_{1},\dots,i_{k}\leq n_{k}}r_{i_{1},\dots,i_{d}}s_{i_{1},\dots,i_{k}}$ . If $k=d$ , then $rs$ is the same as the entry-wise dot product $r.s$ . Also, note that viewing $r$ as a $d$ -multilinear map $R:\mathbb{F}^{n_{1}}\times\dots\times\mathbb{F}^{n_{d}}\rightarrow\mathbb{F}$ , we have $R(v^{1},\dots,v^{d})=\sum_{i_{1}\leq n_{i},\dots,i_{d}\leq n_{d}}r_{i_{1},\dots,i_{d}}v^{1}_{i_{1}}\dots v^{d}_{i_{d}}=r(v^{1}\otimes\dots\otimes v^{d})$ .

Finally, we use a non-standard notation and write $kB$ to mean the set of elements of $\mathcal{G}$ which can be written as a sum of at most $k$ elements of $B$ , where $B$ is some fixed (multi)subset of $\mathcal{G}$ , and similarly, we write $kB-lB$ for the set of elements that can be obtained by adding at most $k$ members and subtracting at most $l$ members of $B$ .

We will use the next result several times in our proofs. It is a version of Bogolyubov’s lemma, due to Sanders.

Lemma 2.1 (Sanders [14]).

There is an absolute constant $C$ with the following property. Let $A$ be a subset of $\mathbb{F}^{n}$ with $|A|\geq\delta|\mathbb{F}^{n}|$ . Then $2A-2A$ contains a subspace of $\mathbb{F}^{n}$ of codimension at most $C(\log(1/\delta))^{4}$ .

Throughout the paper, $C$ stands for the constant appearing in the previous lemma. Clearly we may assume that $C\geq 1$ . Logarithms are base 2.

2.2 The main lemma and some consequences

Theorem 1.10 will follow easily from the next lemma, which is the main technical result of this paper. See [2] for an application of a qualitative version of this lemma.

Lemma 2.2.

Let $d\geq 1$ be an integer and let $\delta\leq 1/2$ . Let $f_{1}(d)=2^{3^{d+3}}$ , $f_{2}(d)=2^{-3^{d+3}}$ and $G(d,\delta,\mathbb{F})=((\log|\mathbb{F}|)c_{1}(d)(\log 1/\delta))^{c_{2}(d)}$ where $c_{1}(d)=C\cdot 2^{3^{d+6}}$ and $c_{2}(d)=4^{d^{d}}$ . If $\mathcal{B}^{\prime}\subset\mathcal{B}$ is a multiset such that $|\mathcal{B}^{\prime}|\geq\delta|\mathcal{B}|$ , then there exists a multiset $Q$ whose elements are pure tensors chosen from $f_{1}(d)\mathcal{B}^{\prime}-f_{1}(d)\mathcal{B}^{\prime}$ (but with arbitrary multiplicity) with the following property. The set of arrays $r\in\mathcal{G}$ with $r.q=0$ for at least $(1-f_{2}(d))|Q|$ choices $q\in Q$ is contained in $\sum_{I\subset[d],I\neq\emptyset}V_{I}\otimes\mathbb{F}^{I^{c}}$ for subspaces $V_{I}\subset\mathbb{F}^{I}$ of dimension at most $G(d,\delta,\mathbb{F})$ .

Throughout the paper, the functions $G,c_{1},c_{2}$ will refer to the functions introduced in the previous lemma. In fact, as $\mathbb{F}$ is fixed, we will write $G(d,\delta)$ to mean $G(d,\delta,\mathbb{F})$ .

In this subsection we deduce Theorem 1.10 from Lemma 2.2.

The notion introduced in the next definition is closely related to the partition rank, but will be somewhat more convenient to work with.

Definition 2.3.

Let $k$ be a positive integer. We say that $r\in\mathcal{G}$ is $k$ -degenerate if for every $I\subset[d],I\neq\emptyset,I\neq[d]$ , there exists a subspace $H_{I}\subset\mathbb{F}^{I}$ of dimension at most $k$ such that $r\in\sum_{I\subset[d-1],I\neq\emptyset}H_{I}\otimes H_{I^{c}}$ .

If $r\in H_{I}\otimes\mathbb{F}^{I^{c}}$ with $\dim(H_{I})\leq k$ , then $r\in H_{I}\otimes H_{I^{c}}$ for some $H_{I^{c}}\subset\mathbb{F}^{I^{c}}$ of dimension at most $k$ . (This follows by writing $r$ as $\sum_{j\leq m}s_{j}\otimes t_{j}$ with $\{s_{j}\}$ a basis for $H_{I}$ and letting $H_{I^{c}}$ be the span of all the $t_{j}$ .) Thus, $r$ is $k$ -degenerate if and only if $r\in\sum_{I\subset[d-1],I\neq\emptyset}H_{I}\otimes\mathbb{F}^{I^{c}}$ for some $H_{I}\subset\mathbb{F}^{I}$ of dimension at most $k$ , or equivalently, if and only if $r\in\sum_{I\subset[d-1],I\neq\emptyset}\mathbb{F}^{I}\otimes H_{I^{c}}$ for some $H_{I^{c}}\subset\mathbb{F}^{I^{c}}$ of dimension at most $k$ . Moreover, note that if $r$ is $k$ -degenerate, then ${\rm prank}(r)\leq 2^{d-1}k$ . This is because if $I\neq\emptyset,I\subset[d-1]$ and $w\in H_{I}\otimes H_{I^{c}}$ for subspaces $H_{I}\subset\mathbb{F}^{I}$ and $H_{I^{c}}\subset\mathbb{F}^{I^{c}}$ of dimension at most $k$ , then $w=\sum_{i\leq k}s_{i}\otimes t_{i}$ for some $s_{i}\in H_{I}$ , $t_{i}\in H_{I^{c}}$ . But clearly, $s_{i}\otimes t_{i}$ has partition rank 1.

Lemma 2.4.

Let $\delta\leq 1/2$ and $d\geq 2$ . Suppose that Lemma 2.2 has been proved for $d^{\prime}=d-1$ . Let $r\in\mathcal{G}$ be such that $r(v_{1}\otimes\dots\otimes v_{d-1})=0\in\mathbb{F}^{n_{d}}$ for at least $\delta|\mathbb{F}|^{n_{1}\dots n_{d-1}}$ choices $v_{1}\in\mathbb{F}^{n_{1}},\dots,v_{d-1}\in\mathbb{F}^{n_{d-1}}$ . Then $r$ is $f$ -degenerate for $f=G(d-1,\delta)$ .

Proof.

Write $r=\sum_{i}s_{i}\otimes t_{i}$ where $s_{i}\in\mathbb{F}^{[d-1]}$ and $\{t_{i}\}_{i}$ is a basis for $\mathbb{F}^{n_{d}}$ . Let $\mathcal{D}$ be the multiset $\{u_{1}\otimes\dots\otimes u_{d-1}:u_{1}\in\mathbb{F}^{n_{1}},\dots,u_{d-1}\in\mathbb{F}^{n_{d-1}}\}$ and let $\mathcal{D}^{\prime}=\{w\in\mathcal{D}:rw=0\}$ . Since $|\mathcal{D}^{\prime}|\geq\delta|\mathcal{D}|$ , by Lemma 2.2 there is a multiset $Q$ with elements from $2^{3^{d+2}}\mathcal{D}^{\prime}-2^{3^{d+2}}\mathcal{D}^{\prime}$ such that the set of arrays $r^{\prime}\in\mathbb{F}^{[d-1]}$ with $r^{\prime}.q=0$ for all choices $q\in Q$ is contained in some $\sum_{I\subset[d-1],I\neq\emptyset}V_{I}\otimes\mathbb{F}^{[d-1]\setminus I}$ , where $\dim(V_{I})\leq G(d-1,\delta)$ . Note that for every $i$ we have $s_{i}.w=0$ for all $w\in\mathcal{D}^{\prime}$ and so also $s_{i}.q=0$ for all $q\in Q$ . Thus, $r\in\sum_{I\subset[d-1],I\neq\emptyset}V_{I}\otimes\mathbb{F}^{I^{c}}$ . ∎

Now we are in a position to prove Theorem 1.10 conditional on Lemma 2.2.

Proof of Theorem 1.10.

Let $T:\mathbb{F}^{n_{1}}\times\dots\times\mathbb{F}^{n_{d}}\rightarrow\mathbb{F}$ be an order $d$ tensor with ${\rm arank}(T)\leq r$ . By Remark 1.8, we have $\mathbb{P}_{v_{1}\in\mathbb{F}^{n_{1}},\dots,v_{d-1}\in\mathbb{F}^{n_{d-1}}}[T(v_{1},\dots,v_{d-1},x)\equiv 0]\geq|\mathbb{F}|^{-r}$ . Writing $t$ for the element in $\mathcal{G}$ corresponding to $T$ , we get that $t(v_{1}\otimes\dots\otimes v_{d-1}\otimes x)\equiv 0$ as a function of $x$ for at least $\delta|\mathbb{F}|^{n_{1}\dots n_{d}}$ choices $v_{1}\in\mathbb{F}^{n_{1}},\dots,v_{d-1}\in\mathbb{F}^{n_{d-1}}$ , where $\delta=|\mathbb{F}|^{-r}$ . But $t(v_{1}\otimes\dots\otimes v_{d-1}\otimes x)=\big{(}t(v_{1}\otimes\dots\otimes v_{d-1})\big{)}.x$ , so we have $t(v_{1}\otimes\dots\otimes v_{d-1})=0$ for all these choices of $v_{i}$ . The condition $r\geq 1$ implies $\delta\leq 1/2$ , therefore by Lemma 2.4, $t$ is $f$ -degenerate for $f=G(d-1,\delta)$ . Hence,

[TABLE]

But there exists some absolute constant $c$ such that $c_{1}(d)^{c_{2}(d-1)}\leq c^{c_{2}(d)}$ holds for all $d$ . Moreover, $2c_{2}(d-1)\leq c_{2}(d)$ . Thus, ${\rm prank}(T)\leq(c\cdot\log|\mathbb{F}|)^{c_{2}(d)}\cdot r^{c_{2}(d)}=(c\cdot\log|\mathbb{F}|)^{c^{\prime}(d)}\cdot r^{c^{\prime}(d)}$ . ∎

2.3 The overview of the proof of Lemma 2.2

The proof of the lemma goes by induction on $d$ . In what follows, we shall prove results conditional on the assumption that Lemma 2.2 has been verified for all $d^{\prime}<d$ . Eventually, we will use these results to prove the induction step.

In this subsection, we give a detailed sketch of the proof in the $d=3$ case. At the end of the subsection, we also briefly sketch the $d>3$ case.

2.3.1 The high-level outline in the case $d=3$

We assume that Lemma 2.2 has been proven for $d\leq 2$ and use this assumption to show that it holds for $d=3$ . We will take $Q=Q_{\{1,2,3\}}\cup Q_{\{1\}}\cup Q_{\{2\}}\cup Q_{\{3\}}$ with elements chosen from $2^{3^{d+3}}\mathcal{B}^{\prime}-2^{3^{d+3}}\mathcal{B}^{\prime}$ such that the $Q_{I}$ have roughly equal size. This implies that if for some $r\in\mathcal{G}$ we have $r.q=0$ for almost all $q\in Q$ , then $r.q=0$ holds for almost all $q\in Q_{I}$ for every $I=\{1\},\{2\},\{3\},\{1,2,3\}$ . We define $Q_{\{1,2,3\}}$ first, in a way that if $r.q=0$ for almost all $q\in Q_{\{1,2,3\}}$ , then $r=x+y$ where $x\in V_{\{1,2,3\}}$ for a vector space $V_{\{1,2,3\}}$ which is independent of $r$ and have small dimension, and $y$ has small partition rank. This already implies that any array $r\in\mathcal{G}$ with $r.q=0$ for almost all $q\in Q$ is contained in $V_{\{1,2,3\}}+\mathbb{F}^{n_{1}}\otimes H_{\{2,3\}}(r)+\mathbb{F}^{n_{2}}\otimes H_{\{1,3\}}(r)+\mathbb{F}^{n_{3}}\otimes H_{\{1,2\}}(r)$ for some subspaces $H_{I}(r)\subset\mathbb{F}^{I}$ depending on $r$ and of small dimension. We then find $Q_{\{1\}}$ such that if $r\in V_{\{1,2,3\}}+\mathbb{F}^{n_{1}}\otimes H_{\{2,3\}}(r)+\mathbb{F}^{n_{2}}\otimes H_{\{1,3\}}(r)+\mathbb{F}^{n_{3}}\otimes H_{\{1,2\}}(r)$ has $r.q=0$ for almost all $q\in Q_{\{1\}}$ , then $r\in V_{\{1,2,3\}}+V_{\{1\}}\otimes\mathbb{F}^{\{2,3\}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}+\mathbb{F}^{n_{2}}\otimes K_{\{1,3\}}(r)+\mathbb{F}^{n_{3}}\otimes K_{\{1,2\}}(r)$ , where $V_{\{1\}}\subset\mathbb{F}^{n_{1}}$ and $V_{\{2,3\}}\subset\mathbb{F}^{\{2,3\}}$ are subspaces independent of $r$ and have small dimension, and $K_{I}(r)\subset\mathbb{F}^{I}$ are subspaces of small dimension (although quite a bit larger than $\dim(H_{I}(r))$ ). Then we find $Q_{\{2\}}$ such that if $r\in V_{\{1,2,3\}}+V_{\{1\}}\otimes\mathbb{F}^{\{2,3\}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}+\mathbb{F}^{n_{2}}\otimes K_{\{1,3\}}(r)+\mathbb{F}^{n_{3}}\otimes K_{\{1,2\}}(r)$ has $r.q=0$ for almost all $q\in Q_{\{2\}}$ , then $r\in V_{\{1,2,3\}}+V_{\{1\}}\otimes\mathbb{F}^{\{2,3\}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}+V_{\{2\}}\otimes\mathbb{F}^{\{1,3\}}+\mathbb{F}^{n_{2}}\otimes V_{\{1,3\}}+\mathbb{F}^{n_{3}}\otimes L_{\{1,2\}}(r)$ , where $V_{\{2\}}\subset\mathbb{F}^{n_{2}}$ and $V_{\{1,3\}}\subset\mathbb{F}^{\{1,3\}}$ are subspaces independent of $r$ and have small dimension, and $L_{\{1,2\}}(r)\subset\mathbb{F}^{\{1,2\}}$ is a subspace of small dimension. Finally, we find $Q_{\{3\}}$ such that if $r\in V_{\{1,2,3\}}+V_{\{1\}}\otimes\mathbb{F}^{\{2,3\}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}+V_{\{2\}}\otimes\mathbb{F}^{\{1,3\}}+\mathbb{F}^{n_{2}}\otimes V_{\{1,3\}}+\mathbb{F}^{n_{3}}\otimes L_{\{1,2\}}(r)$ has $r.q=0$ for almost all $q\in Q_{\{3\}}$ , then $r\in V_{\{1,2,3\}}+V_{\{1\}}\otimes\mathbb{F}^{\{2,3\}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}+V_{\{2\}}\otimes\mathbb{F}^{\{1,3\}}+\mathbb{F}^{n_{2}}\otimes V_{\{1,3\}}+V_{\{3\}}\otimes\mathbb{F}^{\{1,2\}}+\mathbb{F}^{n_{3}}\otimes V_{\{1,2\}}$ , where $V_{\{3\}}\subset\mathbb{F}^{n_{3}}$ and $V_{\{1,2\}}\subset\mathbb{F}^{\{1,2\}}$ are subspaces independent of $r$ and have small dimension.

How will we find $Q_{\{1,2,3\}},Q_{\{1\}},Q_{\{2\}}$ and $Q_{\{3\}}$ ? In this outline we will only explain how to find $Q_{\{2\}}$ (but finding $Q_{\{1\}}$ and $Q_{\{3\}}$ is very similar). We take $Q_{\{2\}}=\bigcup_{u\in U}u\otimes Q_{u}$ where $U\subset\mathbb{F}^{n_{2}}$ is a subspace of low codimension, and for each $u\in U$ , $Q_{u}\subset\mathbb{F}^{\{1,3\}}$ is a multiset consisting of pure tensors such that if for some $x\in\mathbb{F}^{\{1,3\}}$ we have $x.t=0$ for almost all $t\in Q_{u}$ , then $x\in W_{\{1,3\}}(u)+\mathbb{F}^{n_{1}}\otimes W_{\{3\}}(u)+W_{\{1\}}(u)\otimes\mathbb{F}^{n_{3}}$ for some subspaces $W_{I}(u)\subset\mathbb{F}^{I}$ not depending on $x$ and of small dimension. Let us call a $Q_{u}$ with this property forcing. We will also make sure that all the $Q_{u}$ have roughly the same size.

2.3.2 Why does this $Q_{\{2\}}$ work?

In what follows, we will sketch why this choice is suitable. We remark that in the general case this is done in Lemma 2.15. Let $R$ consist of those

[TABLE]

such that $r.q=0$ for almost all $q\in Q_{\{2\}}$ . Let $r\in R$ . Write $r=r_{2}+r_{3}+r_{4}$ where

[TABLE]

It is enough to prove that

[TABLE]

for some small subspaces $V_{\{2\}}\subset\mathbb{F}^{n_{2}}$ , $V_{\{1,3\}}\subset\mathbb{F}^{\{1,3\}}$ and $L^{\prime}_{\{1,2\}}(r)\subset\mathbb{F}^{\{1,2\}}$ (in fact, we will be able to take $V_{\{2\}}=U^{\perp}$ ).

First note that $r_{2}u$ has small (partition) rank for every $u\in U$ . Indeed, $r_{2}u\in V_{\{1\}}\otimes\mathbb{F}^{n_{3}}+\mathbb{F}^{n_{1}}\otimes V_{\{2,3\}}u+\mathbb{F}^{n_{3}}\otimes K_{\{1,2\}}(r)u$ , where, for a vector space $L$ of tensors, $Lu$ denotes the space $\{su:s\in L\}$ .

Moreover, since the $Q_{u}$ all have roughly the same size, for almost every $u\in U$ we have that $r.(u\otimes t)=0$ holds for almost every $t\in Q_{u}$ . But $r.(u\otimes t)=(ru).t$ , therefore as $Q_{u}$ is forcing, it follows that for any such $u$

[TABLE]

for some subspaces $W_{I}(u)\subset\mathbb{F}^{I}$ not depending on $r$ and of small dimension. Since any element of $\mathbb{F}^{n_{1}}\otimes W_{\{3\}}(u)+W_{\{1\}}(u)\otimes\mathbb{F}^{n_{3}}$ has small partition rank, it follows that for almost every $u\in U$ ,

[TABLE]

where $s(u)$ is a tensor of small partition rank.

Define a sequence $0=Z(0)\subset Z(1)\subset\dots\subset Z(m)\subset\mathbb{F}^{\{1,3\}}$ of subspaces recursively as follows. Given $Z(j)$ , if there is some $r\in R$ such that $r_{4}u$ is far from $Z(j)$ for many $u\in U$ , then set $Z(j+1)=Z(j)+K_{1,3}(r)$ . What we mean by $r_{4}u$ being far from $Z(j)$ is that there is no $z\in Z(j)$ such that $r_{4}u-z$ has small partition rank. For suitably chosen parameters, one can show that this procedure cannot go on for too long, ie. that for some not too large $m$ we have that for every $r\in R$ , for almost all $u\in U$ there is some $z\in Z(m)$ with $r_{4}u-z$ having small partition rank.

Now let $r\in R$ . Let $X(r)$ be the set consisting of those $x\in K_{\{1,3\}}(r)$ which are close to $Z(m)$ . Then $r_{4}u\in X(r)$ for almost every $u\in U$ . Let $t_{1},\dots,t_{\alpha}$ be a maximal linearly independent subset of $X(r)$ and extend it to a basis $t_{1},\dots,t_{\alpha},t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ for $K_{\{1,3\}}(r)$ . Now if a linear combination of $t_{1},\dots,t_{\alpha},t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ is in $X(r)$ , then the coefficients of $t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ are all zero. Write $r_{4}=\sum_{i\leq\alpha}s_{i}\otimes t_{i}+\sum_{j\leq\beta}s^{\prime}_{j}\otimes t^{\prime}_{j}$ for some $s_{i},s^{\prime}_{j}\in\mathbb{F}^{n_{2}}$ . Since $r_{4}u\in X(r)$ for almost all $u\in U$ , we have, for all $j$ , that $s^{\prime}_{j}.u=0$ for almost all $u\in U$ . Since these hold for more than half of $u\in U$ , we obtain $s^{\prime}_{j}\in U^{\perp}$ for every $j$ , therefore $\sum_{j\leq\beta}s^{\prime}_{j}\otimes t^{\prime}_{j}\in U^{\perp}\otimes\mathbb{F}^{\{1,3\}}$ .

Since $t_{i}\in X(r)$ for every $i$ , we may choose $z_{i}\in Z(m)$ such that $t_{i}=z_{i}+y_{i}$ where $y_{i}\in\mathbb{F}^{\{1,3\}}$ has small partition rank. Now $\sum_{i\leq\alpha}s_{i}\otimes t_{i}\in\mathbb{F}^{n_{2}}\otimes Z(m)+\sum_{i\leq\alpha}s_{i}\otimes y_{i}$ . Moreover, as $\alpha$ is small and each $y_{i}$ has small partition rank, we have $\sum_{i\leq\alpha}s_{i}\otimes y_{i}\in L^{\prime}_{\{1,2\}}(r)\otimes\mathbb{F}^{n_{3}}$ for some small $L^{\prime}_{\{1,2\}}(r)\subset\mathbb{F}^{\{1,2\}}$ . So we have proved (2) with $V_{\{2\}}=U^{\perp}$ and $V_{\{1,3\}}=Z(m)$ .

2.3.3 Why can we find such a $Q_{\{2\}}$ inside $2^{3^{d+3}}\mathcal{B}^{\prime}-2^{3^{d+3}}\mathcal{B}^{\prime}$ ?

Now we describe why there must exist $Q_{\{2\}}$ with elements chosen from $2^{3^{3+3}}\mathcal{B}^{\prime}-2^{3^{3+3}}\mathcal{B}^{\prime}$ and having the required properties. We remark that in the general case this is done in Lemma 2.14. We want to find a subspace $U\subset\mathbb{F}^{n_{2}}$ of low codimension, and forcing multisets $Q_{u}\subset\mathbb{F}^{\{1,3\}}$ ( $u\in U$ ) consisting of pure tensors such that for every $u\in U$ , $u\otimes Q_{u}\subset 2^{3^{3+3}}\mathcal{B}^{\prime}-2^{3^{3+3}}\mathcal{B}^{\prime}$ . Let $\mathcal{D}$ be the multiset $\{v\otimes w:v\in\mathbb{F}^{n_{1}},w\in\mathbb{F}^{n_{3}}\}$ . Notice that if some set $R$ is dense in $\mathcal{D}$ , then by the induction hypothesis we can find a forcing set in $2^{3^{2+3}}R-2^{3^{2+3}}R$ consisting of pure tensors. Therefore it is enough to find a low codimensional subspace $U$ and dense sets $R_{u}\subset\mathcal{D}$ (for every $u\in U$ ) such that $u\otimes R_{u}\subset 32\mathcal{B}^{\prime}-32\mathcal{B}^{\prime}$ . As $\mathcal{B}^{\prime}$ is dense in $\mathcal{B}$ , we have a dense subset $S\subset\mathbb{F}^{n_{2}}$ and dense subsets $T_{s}\subset\mathcal{D}$ ( $s\in S$ ) such that $s\otimes T_{s}\subset\mathcal{B}^{\prime}$ for every $s\in S$ . By Bogolyubov’s lemma (Lemma 2.1), there is a low codimensional subspace $U$ contained in $2S-2S$ . To establish the existence of a dense $R_{u}\subset\mathcal{D}$ with $u\otimes R_{u}\subset 32\mathcal{B}^{\prime}-32\mathcal{B}^{\prime}$ for every $u\in U$ , it is enough to prove the following lemma.

Lemma 2.5.

Let $T_{1},T_{2},T_{3},T_{4}$ be dense subsets of $\mathcal{D}$ . Then $\mathcal{D}\cap\bigcap_{i\leq 4}(8T_{i}-8T_{i})$ is dense in $\mathcal{D}$ .

Indeed, once we have this lemma, it follows that for any $s_{1},s_{2},s_{3},s_{4}\in S$ , the set $\mathcal{D}\cap\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})$ is dense in $\mathcal{D}$ . But if $u\in U$ , then we can write $u=s_{1}+s_{2}-s_{3}-s_{4}$ for some $s_{i}\in S$ , and then $u\otimes\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})\subset s_{1}\otimes\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})+s_{2}\otimes\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})-s_{3}\otimes\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})-s_{4}\otimes\bigcap_{i\leq 4}(8T_{s_{i}}-8T_{s_{i}})\subset 32\mathcal{B}^{\prime}-32\mathcal{B}^{\prime}$ .

Lemma 2.5 follows easily from the next two lemmas.

Lemma 2.6.

Let $A$ be a dense subset of $\mathcal{D}$ . Then there exist a dense subspace $V\subset\mathbb{F}^{n_{1}}$ and for each $v\in V$ a dense subspace $W_{v}\subset\mathbb{F}^{n_{3}}$ such that $v\otimes W_{v}\subset 8A-8A$ for every $v\in V$ .

Proof.

There exist a dense subset $B\subset\mathbb{F}^{n_{1}}$ and dense subsets $C_{b}\subset\mathbb{F}^{n_{3}}$ for each $b\in B$ such that $b\otimes C_{b}\subset A$ . By Bogolyubov’s lemma, $2B-2B$ contains a dense subspace $V\subset\mathbb{F}^{n_{1}}$ , and for every $b\in B$ , $2C_{b}-2C_{b}$ contains a dense subspace $L_{b}\subset\mathbb{F}^{n_{3}}$ . For any $v\in V$ , choose $b_{1},b_{2},b_{3},b_{4}\in B$ with $v=b_{1}+b_{2}-b_{3}-b_{4}$ and set $W_{v}=\bigcap_{i\leq 4}L_{b_{i}}$ . Note that $b_{i}\otimes w\in 2A-2A$ for every $i\leq 4$ and $w\in W_{v}$ , therefore $v\otimes w\in 8A-8A$ . ∎

Lemma 2.7.

Suppose that we have dense subspaces $V,V^{\prime}\subset\mathbb{F}^{n_{1}}$ , for each $v\in V$ a dense subspace $W_{v}\subset\mathbb{F}^{n_{3}}$ , and for each $v^{\prime}\in V^{\prime}$ a dense subspace $W^{\prime}_{v^{\prime}}\subset\mathbb{F}^{n_{3}}$ . Then $(\bigcup_{v\in V}v\otimes W_{v})\cap(\bigcup_{v^{\prime}\in V^{\prime}}v^{\prime}\otimes W^{\prime}_{v^{\prime}})=\bigcup_{v\in V\cap V^{\prime}}v\otimes(W_{v}\cap W^{\prime}_{v})$ . In particular, this intersection is a dense subset of $\mathcal{D}$ .

Proof.

The identity is trivial. Since the subspaces $V\cap V^{\prime}$ and $W_{v}\cap W^{\prime}_{v}$ are dense, the second assertion follows. ∎

2.3.4 How can this be extended to $d>3$ ?

Now we briefly sketch what the main difficulties are in the $d>3$ case and how we can address them. The underlying strategy is similar: we take an ordering $\prec$ of the set of non-empty subsets $I\subset[d-1]$ , and for each such $I$ we choose $Q_{I}$ such that any array

[TABLE]

with $r.q=0$ for almost all $q\in Q_{I}$ has

[TABLE]

where $U_{J},U_{J^{c}},K_{J^{c}}(r)$ can have dimension slightly larger than those of $W_{J},W_{J^{c}}$ and $H_{J^{c}}$ , but they are still low dimensional. In the $d=3$ case, we have made use of a decomposition $r=r_{2}+r_{3}+r_{4}$ where $r_{4}\in\mathbb{F}^{I}\otimes H_{I^{c}}(r)$ , $r_{2}u$ has small partition rank and $r_{3}u$ is in a small subspace independent of $r$ for every $u\in\mathbb{F}^{I}$ . In general, such a decomposition need not exist. For example, when $d=4$ and $I=\{1,2\}$ , then an array in $W_{\{1\}}\otimes\mathbb{F}^{\{2,3,4\}}$ (or in $\mathbb{F}^{n_{1}}\otimes H_{\{2,3,4\}}(r)$ if we were to take $\{1,2\}\prec\{1\}$ ), when multiplied by some pure tensor $u\in\mathbb{F}^{\{1,2\}}$ , yields a tensor which need not have small partition rank and need not lie a small space independent of $r$ . However, by restricting the possible choices for $u$ , we can make sure that the product is always zero. So we will take a decomposition $r=r_{1}+r_{2}+r_{3}+r_{4}$ such that $r_{4}\in\mathbb{F}^{I}\otimes H_{I^{c}}(r)$ ; for every pure tensor $u\in\mathbb{F}^{I}$ , $r_{2}u$ has small partition rank and $r_{3}u$ lies in a small space depending only on $u$ ; and crucially, for every $q\in Q_{I}$ , $r_{1}.q=0$ . To achieve this, we need to insist that $J\prec I$ whenever $J\subsetneq I$ and that $Q_{I}$ is orthogonal to certain subspaces. To see this, note that in the above example where $d=4$ and $I=\{1,2\}$ we need that $\{1\}\prec\{1,2\}$ and $Q_{\{1,2\}}$ is orthogonal to $W_{\{1\}}\otimes\mathbb{F}^{\{2,3,4\}}$ . (If we had $\{1,2\}\prec\{1\}$ , then in (4) we would have a term $\mathbb{F}^{n_{1}}\otimes H_{\{2,3,4\}}(r)$ rather than $W_{\{1\}}\otimes\mathbb{F}^{\{2,3,4\}}$ , which we could not control.)

We also need to generalise Lemma 2.5 to the case $d>3$ . Instead of using $\bigcup_{v\in V}v\otimes W_{v}$ as in Lemma 2.6, we need to define an object in $\mathcal{B}$ such that

an instance of the object can be found in $k\mathcal{B}^{\prime}-k\mathcal{B}^{\prime}$ for some small $k$ whenever $\mathcal{B}^{\prime}$ is dense in $\mathcal{B}$ (generalising Lemma 2.6) 2. 2.

the intersection of few instances of this object is a dense subset of $\mathcal{B}$ (generalising Lemma 2.7)

In the next subsection we describe this object and show that it has the required properties.

2.4 Construction of some auxiliary sets

Definition 2.8.

Suppose that we have a collection of vector spaces as follows. The first one is $U\subset\mathbb{F}^{n_{1}}$ , of codimension at most $l$ . Then, for every $u_{1}\in U$ , there is some $U_{u_{1}}\subset\mathbb{F}^{n_{2}}$ . In general, for every $2\leq k\leq d$ and every $u_{1}\in U,u_{2}\in U_{u_{1}},\dots,u_{k-1}\in U_{u_{1},\dots,u_{k-2}}$ , there is a subspace $U_{u_{1},\dots,u_{k-1}}\subset\mathbb{F}^{n_{k}}$ . Assume, in addition, that the codimension of $U_{u_{1},\dots,u_{k-1}}$ in $\mathbb{F}^{n_{k}}$ is at most $l$ for every $u_{1}\in U,\dots,u_{k-1}\in U_{u_{1},\dots,u_{k-2}}$ . Then the multiset $Q=\{u_{1}\otimes\dots\otimes u_{d}:u_{1}\in U,\dots,u_{d}\in U_{u_{1},\dots,u_{d-1}}\}$ is called an $l$ -system.

The next lemma is the generalisation of Lemma 2.7 from the previous subsection.

Lemma 2.9.

Let $Q$ be an $l$ -system and let $Q^{\prime}$ be an $l^{\prime}$ -system. Then $Q\cap Q^{\prime}$ contains an $(l+l^{\prime})$ -system.

Proof.

Let $Q$ have spaces as in Definition 2.8 and let $Q^{\prime}$ have spaces $U^{\prime}_{u^{\prime}_{1},\dots,u^{\prime}_{k-1}}$ . We define an $(l+l^{\prime})$ -system $P$ contained in $Q\cap Q^{\prime}$ as follows. Let $V=U\cap U^{\prime}$ . Suppose we have defined $V_{v_{1},\dots,v_{j-1}}$ for all $j\leq k$ . Let $v_{1}\in V,v_{2}\in V_{v_{1}},\dots,v_{k-1}\in V_{v_{1},\dots,v_{k-2}}$ . We let $V_{v_{1}\dots,v_{k-1}}=U_{v_{1}\dots,v_{k-1}}\cap U^{\prime}_{v_{1}\dots,v_{k-1}}$ . This is well-defined and has codimension at most $l+l^{\prime}$ in $\mathbb{F}^{n_{k}}$ . Let $P$ be the $(l+l^{\prime})$ -system with spaces $V_{v_{1},\dots,v_{k-1}}$ . ∎

The next lemma is the generalisation of Lemma 2.6 from the previous subsection.

Lemma 2.10.

Let $\mathcal{B}^{\prime}\subset\mathcal{B}$ be a multiset such that $|\mathcal{B}^{\prime}|\geq\delta|\mathcal{B}|$ . Then there exists an $f_{1}$ -system whose elements are chosen from $f_{2}\mathcal{B}^{\prime}-f_{2}\mathcal{B}^{\prime}$ with $f_{1}=C\cdot 4^{d}(\log(2^{d}/\delta))^{4}$ and $f_{2}=4^{d}$ .

Proof.

The proof is by induction on $d$ . The case $d=1$ is a direct consequence of Lemma 2.1. Suppose that the lemma has been proved for all $d^{\prime}<d$ and let $\mathcal{B}^{\prime}\subset\mathcal{B}$ be a multiset such that $|\mathcal{B}^{\prime}|\geq\delta|\mathcal{B}|$ . Let $\mathcal{D}$ be the multiset $\{v_{2}\otimes\dots\otimes v_{d}:v_{2}\in\mathbb{F}^{n_{2}},\dots,v_{d}\in\mathbb{F}^{n_{d}}\}$ . For each $u\in\mathbb{F}^{n_{1}}$ , let $\mathcal{B}_{u}^{\prime}=\{s\in\mathcal{D}:u\otimes s\in\mathcal{B}^{\prime}\}$ and let $T=\{u\in\mathbb{F}^{n_{1}}:|\mathcal{B}_{u}^{\prime}|\geq\frac{\delta}{2}|\mathcal{D}|\}$ . By averaging, we have that $|T|\geq\frac{\delta}{2}|\mathbb{F}^{n_{1}}|$ . Now by the induction hypothesis, for every $t\in T$ , there exists a $g_{1}$ -system in $\mathbb{F}^{n_{2}}\otimes\dots\otimes\mathbb{F}^{n_{d}}$ (whose definition is analogous to the definition of a system in $\mathbb{F}^{n_{1}}\otimes\dots\otimes\mathbb{F}^{n_{d}}$ ), called $P_{t}$ , contained in $g_{2}\mathcal{B}^{\prime}_{t}-g_{2}\mathcal{B}^{\prime}_{t}$ where $g_{1}=C\cdot 4^{d-1}(\log(2^{d}/\delta))^{4}$ and $g_{2}=4^{d-1}$ . By Lemma 2.1, $2T-2T$ contains a subspace $U\subset\mathbb{F}^{n_{1}}$ of codimension at most $C(\log(2/\delta))^{4}$ . For each $u\in U$ , write $u=t_{1}+t_{2}-t_{3}-t_{4}$ arbitrarily with $t_{i}\in T$ , and let $Q_{u}=P_{t_{1}}\cap P_{t_{2}}\cap P_{t_{3}}\cap P_{t_{4}}$ , which is a $g_{3}$ -system with $g_{3}=4g_{1}=C\cdot 4^{d}(\log(2^{d}/\delta))^{4}$ , by Lemma 2.9. Thus, $Q=\bigcup_{u\in U}(u\otimes Q_{u})$ is indeed an $f_{1}$ -system. Moreover, for any $u\in U,s\in Q_{u}$ , we have $u\otimes s=t_{1}\otimes s+t_{2}\otimes s-t_{3}\otimes s-t_{4}\otimes s$ for some $t_{i}\in T$ and $s\in\bigcap_{i\leq 4}P_{t_{i}}$ . Then $t_{i}\otimes s\in g_{2}\mathcal{B}^{\prime}-g_{2}\mathcal{B}^{\prime}$ , therefore $u\otimes s\in 4g_{2}\mathcal{B}^{\prime}-4g_{2}\mathcal{B}^{\prime}$ , so the elements of $Q$ are indeed chosen from $f_{2}\mathcal{B}^{\prime}-f_{2}\mathcal{B}^{\prime}$ . ∎

The next lemma describes a property of systems which was not needed for us in the $d=3$ case, but is crucial in the general case. It is required for finding a suitable decomposition $r=r_{1}+r_{2}+r_{3}+r_{4}$ described at the end of the previous subsection. Indeed, we need a set $Q_{I}$ which is orthogonal to certain spaces of the form $W_{J}\otimes\mathbb{F}^{J^{c}}$ (ie. is contained in $W_{J}^{\perp}\otimes\mathbb{F}^{J^{c}}$ ) to make sure that $r_{1}.q=0$ for every $q\in Q_{I}$ . We will use the following lemma to guarantee the existence of such a set $Q_{I}$ .

Lemma 2.11.

Let $Q$ be a $k$ -system and for every non-empty $I\subset[d]$ , let $L_{I}\subset\mathbb{F}^{I}$ be a subspace of codimension at most $l$ . Let $T=\bigcap_{I}(L_{I}\otimes\mathbb{F}^{I^{c}})$ . Then $Q\cap T$ contains an $f$ -system for $f=k+2^{d}l$ .

Proof.

Let the spaces of $Q$ be $U_{u_{1},\dots,u_{j-1}}$ . It suffices to prove that for every $1\leq j\leq d$ , and every $u_{1}\in U,\dots,u_{j-1}\in U_{u_{1},\dots,u_{j-2}}$ , the codimension of $(u_{1}\otimes\dots\otimes u_{j-1}\otimes U_{u_{1},\dots,u_{j-1}})\cap\bigcap_{I\subset[j],j\in I}(L_{I}\otimes\mathbb{F}^{[j]\setminus I})$ in $u_{1}\otimes\dots\otimes u_{j-1}\otimes U_{u_{1},\dots,u_{j-1}}$ is at most $2^{d}l$ . Thus, it suffices to prove that for every $I\subset[j]$ with $j\in I$ , the codimension of $(u_{1}\otimes\dots\otimes u_{j-1}\otimes U_{u_{1},\dots,u_{j-1}})\cap(L_{I}\otimes\mathbb{F}^{[j]\setminus I})$ in $u_{1}\otimes\dots\otimes u_{j-1}\otimes U_{u_{1},\dots,u_{j-1}}$ is at most $l$ . But this is equivalent to the statement that $\big{(}(\bigotimes_{i\in I\setminus\{j\}}u_{i})\otimes U_{u_{1},\dots,u_{j-1}}\big{)}\cap L_{I}$ has codimension at most $l$ in $(\bigotimes_{i\in I\setminus\{j\}}u_{i})\otimes U_{u_{1},\dots,u_{j-1}}$ , which clearly holds. ∎

2.5 The proof of Lemma 2.2

We now turn to the proof of Lemma 2.2. As described in the outline, the first step is to find a $Q_{[d]}$ such that if $r.q=0$ for almost all $q\in Q_{[d]}$ , then $r=x+y$ where $x\in V_{[d]}$ for a small space $V_{[d]}$ independent of $r$ , and $y$ has low partition rank.

Lemma 2.12.

Let $d\geq 2$ and suppose that Lemma 2.2 has been proved for $d^{\prime}=d-1$ . Let $\mathcal{B}^{\prime}\subset\mathcal{B}$ be such that $|\mathcal{B}^{\prime}|\geq\delta|\mathcal{B}|$ for some $\delta>0$ . Then there exist some $Q\subset 2\mathcal{B}^{\prime}-2\mathcal{B}^{\prime}$ consisting of pure tensors and a subspace $V_{[d]}\subset\mathbb{F}^{[d]}$ of dimension at most $4C(\log(2/\delta))^{4}$ with the following property. Any array $r$ with $r.q=0$ for at least $\frac{7}{8}|Q|$ choices $q\in Q$ can be written as $r=x+y$ where $x\in V_{[d]}$ and $y$ is $f$ -degenerate for $f=G(d-1,\frac{\delta}{4|\mathbb{F}|^{4C(\log 2/\delta)^{4}}})$ .

Proof.

Let $\mathcal{D}$ be the multiset $\{u_{1}\otimes\dots\otimes u_{d-1}:u_{1}\in\mathbb{F}^{n_{1}},\dots,u_{d-1}\in\mathbb{F}^{n_{d-1}}\}$ and let $\mathcal{D}^{\prime}=\{t\in\mathcal{D}:t\otimes u\in\mathcal{B}^{\prime}\text{ for at least }\frac{\delta}{2}|\mathbb{F}|^{n_{d}}\text{ choices }u\in\mathbb{F}^{n_{d}}\}$ . Clearly, we have $|\mathcal{D}^{\prime}|\geq\frac{\delta}{2}|\mathcal{D}|$ . Moreover, by Lemma 2.1, for every $t\in\mathcal{D}^{\prime}$ , there exists a subspace $U_{t}\subset\mathbb{F}^{n_{d}}$ of codimension at most $C(\log(2/\delta))^{4}$ such that $t\otimes U_{t}\subset 2\mathcal{B}^{\prime}-2\mathcal{B}^{\prime}$ . After passing to suitable subspaces, we may assume that all $U_{t}$ have the same codimension $k\leq C(\log(2/\delta))^{4}$ . Now let $Q=\cup_{t\in\mathcal{D}^{\prime}}(t\otimes U_{t})$ .

Write $R$ for the set of arrays $r$ with $r.q=0$ for at least $\frac{7}{8}|Q|$ choices $q\in Q$ .

We now define a sequence of subspaces $0=V(0)\subset V(1)\subset\dots\subset V(m)\subset\mathbb{F}^{[d]}$ recursively as follows.

Given $V(j)$ , if for every $r\in R$ there are at least $\frac{|\mathcal{D}^{\prime}|}{2}$ choices $t\in\mathcal{D}^{\prime}$ with $rt\in V(j)t$ , then we set $m=j$ and terminate. (Here and below, for a subspace $L\subset\mathcal{G}$ and an array $s\in\mathbb{F}^{I}$ , we write $Ls$ for the subspace $\{rs:r\in L\}\subset\mathbb{F}^{I^{c}}$ .)

Else, we choose some $r\in R$ such that there are at most $\frac{|\mathcal{D}^{\prime}|}{2}$ choices $t\in\mathcal{D}^{\prime}$ with $rt\in V(j)t$ . We set $V(j+1)=V(j)+\text{span}(r)$ . Note that $r.(t\otimes s)=(rt).s$ for every $s\in U_{t}$ . If $rt\not\in U_{t}^{\perp}$ , then $(rt).s=0$ holds for only a proportion $1/|\mathbb{F}|\leq 1/2$ of all $s\in U_{t}$ . Thus, as $r\in R$ , we have $rt\in U_{t}^{\perp}$ for at least $\frac{3}{4}|\mathcal{D}^{\prime}|$ choices $t\in\mathcal{D}^{\prime}$ . Moreover, since $rt\in V(j)t$ holds for at most $\frac{|\mathcal{D}^{\prime}|}{2}$ choices $t\in\mathcal{D}^{\prime}$ , it follows that for at least $\frac{|\mathcal{D}^{\prime}|}{4}$ choices $t\in\mathcal{D}^{\prime}$ we have $rt\in U_{t}^{\perp}\setminus V(j)t$ . Thus, we have $\dim(U_{t}^{\perp}\cap V(j+1)t)>\dim(U_{t}^{\perp}\cap V(j)t)$ for at least $\frac{|\mathcal{D}^{\prime}|}{4}$ choices $t\in\mathcal{D}^{\prime}$ .

However, for any $j$ we have $\sum_{t\in\mathcal{D}^{\prime}}\dim(U_{t}^{\perp}\cap V(j)t)\leq\sum_{t\in\mathcal{D}^{\prime}}\dim U_{t}^{\perp}\leq C|\mathcal{D}^{\prime}|(\log(2/\delta))^{4}$ . Thus, we get $m\leq 4C(\log(2/\delta))^{4}$ . Set $V_{[d]}=V(m)$ . Then $\dim V_{[d]}\leq 4C(\log(2/\delta))^{4}$ , as claimed.

Now let $r\in R$ be arbitrary. By definition, there are at least $|\mathcal{D}^{\prime}|/2$ choices $t\in\mathcal{D}^{\prime}$ with $rt\in V_{[d]}t$ . Then there is some $v\in V_{[d]}$ such that $rt=vt$ for at least $\frac{|\mathcal{D}^{\prime}|}{2|V_{[d]}|}$ choices $t\in\mathcal{D}^{\prime}$ , and hence also for at least $\frac{\delta|\mathcal{D}|}{4|V_{[d]}|}$ choices $t\in\mathcal{D}$ . Note that $\frac{\delta}{4|V_{[d]}|}\geq\frac{\delta}{4|\mathbb{F}|^{4C(\log 2/\delta)^{4}}}$ , therefore by Lemma 2.4, $r-v$ is $f$ -degenerate. ∎

Definition 2.13.

Let $k$ be a positive integer and let $0\leq\alpha\leq 1$ . Let $Q$ be a multiset with elements chosen from $\mathcal{G}$ (with arbitrary multiplicity). We say that $Q$ is $(k,\alpha)$ -forcing if the set of all arrays $r\in\mathcal{G}$ with $r.q=0$ for at least $\alpha|Q|$ choices $q\in Q$ is contained in a set of the from $\sum_{I\subset[d],I\neq\emptyset}V_{I}\otimes\mathbb{F}^{I^{c}}$ for some $V_{I}\subset\mathbb{F}^{I}$ of dimension at most $k$ .

We now turn to the main part of the proof of Lemma 2.2. For each non-empty $I\subset[d-1]$ we will construct $Q_{I}$ as defined in the next result, and (roughly) we will take $Q=Q_{[d]}\cup\bigcup_{I\subset[d-1],I\neq\emptyset}Q_{I}$ , where $Q_{[d]}$ is provided by Lemma 2.12. The properties that $Q_{I}$ has are generalisations of the properties that $Q_{\{2\}}$ had in Subsection 2.3. Accordingly, the next lemma is the generalisation of the discussion in Subsubsection 2.3.3.

Lemma 2.14.

Let $d\geq 2$ and suppose that Lemma 2.2 has been proved for every $d^{\prime}<d$ . Let $\mathcal{B}^{\prime}\subset\mathcal{B}$ have $|\mathcal{B}^{\prime}|\geq\delta|\mathcal{B}|$ for some $0<\delta\leq 1/2$ . Let $k\geq G(d-1,\delta)$ be arbitrary, let $I\subset[d-1],I\neq\emptyset$ , and let $W_{J}\subset\mathbb{F}^{J}$ be subspaces of dimension at most $k$ for every $J\subset I,J\neq I,J\neq\emptyset$ . Then there exist a multiset $Q^{\prime}$ , and a multiset $Q_{s}$ for each $s\in Q^{\prime}$ with the following properties.

(1)

The elements of $Q^{\prime}$ are pure tensors chosen from $\bigcap_{J\subset I,J\neq I,J\neq\emptyset}(W_{J}^{\perp}\otimes\mathbb{F}^{I\setminus J})\subset\mathbb{F}^{I}$ 2. (2)

$Q^{\prime}$ * is $(f_{1},1-f_{2})$ -forcing with $f_{1}=G(|I|,|\mathbb{F}|^{-2^{d+1}dk})$ , $f_{2}=2^{-3^{d+2}}$ * 3. (3)

For each $s\in Q^{\prime}$ , the elements of $Q_{s}$ are pure tensors chosen from $\mathbb{F}^{I^{c}}$ 4. (4)

For each $s\in Q^{\prime}$ , $Q_{s}$ is $(f_{3},1-f_{4})$ -forcing with $f_{3}=G(d-|I|,|\mathbb{F}|^{-2^{3^{d+4}}C(\log(2^{d-1}/\delta))^{4}})$ , $f_{4}=2^{-3^{d+2}}$ 5. (5)

$\max_{s\in Q^{\prime}}|Q_{s}|\leq 2\min_{s\in Q^{\prime}}|Q_{s}|$ ** 6. (6)

The elements of the multiset $Q_{I}:=\{s\otimes t:s\in Q^{\prime},t\in Q_{s}\}=\bigcup_{s\in Q^{\prime}}(s\otimes Q_{s})$ are chosen from $f_{5}\mathcal{B}^{\prime}-f_{5}\mathcal{B}^{\prime}$ with $f_{5}=2^{3^{d+3}}$ .

Proof.

By symmetry, we may assume that $I=[a]$ for some $1\leq a\leq d-1$ . Let $\mathcal{C}$ be the multiset $\{u_{1}\otimes\dots\otimes u_{a}:u_{i}\in\mathbb{F}^{n_{i}}\}$ and let $\mathcal{D}$ be the multiset $\{u_{a+1}\otimes\dots\otimes u_{d}:u_{i}\in\mathbb{F}^{n_{i}}\}$ . For each $s\in\mathcal{C}$ , let $\mathcal{D}_{s}=\{t\in\mathcal{D}:s\otimes t\in\mathcal{B}^{\prime}\}$ . Also, let $\mathcal{C}^{\prime}=\{s\in\mathcal{C}:|\mathcal{D}_{s}|\geq\frac{\delta}{2}|\mathcal{D}|\}$ . Clearly, $|\mathcal{C}^{\prime}|\geq\frac{\delta}{2}|\mathcal{C}|$ . By Lemma 2.10, there exists a $g_{1}$ -system $R$ (with respect to $\mathbb{F}^{I}$ ) with elements chosen from $g_{2}\mathcal{C}^{\prime}-g_{2}\mathcal{C}^{\prime}$ with $g_{1}=C\cdot 4^{d}(\log(2^{d-1}/\delta))^{4}$ and $g_{2}=4^{d}$ . By Lemma 2.11, $R\cap\bigcap_{J\subset I,J\neq I,J\neq\emptyset}(W_{J}^{\perp}\otimes\mathbb{F}^{I\setminus J})$ contains a $g_{3}$ -system $T^{\prime}$ for $g_{3}=C\cdot 4^{d}(\log(2^{d-1}/\delta))^{4}+2^{d}k$ . Now $|T^{\prime}|\geq|\mathbb{F}|^{-dg_{3}}|\mathcal{C}|$ . By Lemma 2.2 (applied to $a$ in place of $d$ ), it follows that there exists a multiset $Q^{\prime}$ whose elements are pure tensors chosen from $g_{4}T^{\prime}-g_{4}T^{\prime}$ and which is $(g_{5},1-g_{6})$ -forcing for $g_{4}=2^{3^{a+3}}\leq 2^{3^{d+2}}$ , $g_{5}=G(a,|\mathbb{F}|^{-dg_{3}})$ and $g_{6}=2^{-3^{a+3}}\geq 2^{-3^{d+2}}$ . Note that since $\delta\leq 1/2$ , we have $C\cdot 4^{d}(\log(2^{d-1}/\delta))^{4}=C\cdot 4^{d}(d-1+\log(1/\delta))^{4}\leq C\cdot 4^{d}(d\log(1/\delta))^{4}$ . But this is at most as $G(d-1,\delta)\leq k$ , so $g_{3}\leq 2\cdot 2^{d}k$ , therefore $Q^{\prime}$ satisfies (1) and (2) in the statement of this lemma.

By Lemma 2.10, for each $s\in\mathcal{C}^{\prime}$ there exists a $g_{7}$ -system $R_{s}$ (with respect to $\mathbb{F}^{I^{c}}$ ) contained in $g_{8}\mathcal{D}_{s}-g_{8}\mathcal{D}_{s}$ , where $g_{7}=C\cdot 4^{d}(\log(2^{d-1}/\delta))^{4}$ and $g_{8}=4^{d}$ . For every $s\in Q^{\prime}$ , choose $s_{1},\dots,s_{l+l^{\prime}}\in\mathcal{C}^{\prime}$ with $l,l^{\prime}\leq 2^{3^{d+3}}$ such that $s=s_{1}+\dots+s_{l}-s_{l+1}-\dots-s_{l+l^{\prime}}$ (this is possible, since the elements of $Q^{\prime}$ are chosen from $2g_{2}g_{4}\mathcal{C}^{\prime}-2g_{2}g_{4}\mathcal{C}^{\prime}$ and $2g_{2}g_{4}\leq 2^{3^{d+3}}$ ), and let $P_{s}=\bigcap_{i\leq l+l^{\prime}}R_{s}$ . By Lemma 2.9, $P_{s}$ contains a $g_{9}$ -system with $g_{9}=2\cdot 2^{3^{d+3}}\cdot C\cdot 4^{d}(\log(2^{d-1}/\delta))^{4}$ , therefore $|P_{s}|\geq g_{10}|\mathcal{D}|$ for $g_{10}=|\mathbb{F}|^{-dg_{9}}\geq|\mathbb{F}|^{-2^{3^{d+4}}C(\log(2^{d-1}/\delta))^{4}}$ . By Lemma 2.2 (applied to $d-a$ in place of $d$ ), for every $s\in Q^{\prime}$ there exists a multiset $Q_{s}$ consisting of pure tensors with elements chosen from $g_{11}P_{s}-g_{11}P_{s}$ which is $(g_{12},1-g_{13})$ -forcing for $g_{11}=2^{3^{d-a+3}}\leq 2^{3^{d+2}}$ , $g_{12}=G(d-a,|\mathbb{F}|^{-dg_{9}})\leq G(d-a,|\mathbb{F}|^{-2^{3^{d+4}}C(\log(2^{d-1}/\delta))^{4}})$ and $g_{13}=2^{-3^{d-a+3}}\geq 2^{-3^{d+2}}$ . Notice that if we repeat every element of $Q_{s}$ the same number of times, then the multiset obtained is still $(g_{12},1-g_{13})$ -forcing, so we may assume that $\max_{s\in Q^{\prime}}|Q_{s}|\leq 2\min_{s\in Q^{\prime}}|Q_{s}|$ . Thus, the $Q_{s}$ satisfy (3), (4) and (5).

Define $Q_{I}=\{s\otimes t:s\in Q^{\prime},t\in Q_{s}\}=\bigcup_{s\in Q^{\prime}}(s\otimes Q_{s})$ . Note that as $R_{s}\subset g_{8}\mathcal{D}_{s}-g_{8}\mathcal{D}_{s}$ for all $s\in\mathcal{C}^{\prime}$ , we have $s\otimes R_{s}\subset g_{8}\mathcal{B}^{\prime}-g_{8}\mathcal{B}^{\prime}$ for all $s\in\mathcal{C}^{\prime}$ . But the elements of $Q^{\prime}$ are chosen from $2g_{2}g_{4}\mathcal{C}^{\prime}-2g_{2}g_{4}\mathcal{C}^{\prime}$ , so $s\otimes P_{s}\subset 4g_{2}g_{4}g_{8}\mathcal{B}^{\prime}-4g_{2}g_{4}g_{8}\mathcal{B}^{\prime}$ for all $s\in Q^{\prime}$ . Finally, the elements of $Q_{s}$ are chosen from $g_{11}P_{s}-g_{11}P_{s}$ , so the elements of $s\otimes Q_{s}$ are chosen from $8g_{2}g_{4}g_{8}g_{11}\mathcal{B}^{\prime}-8g_{2}g_{4}g_{8}g_{11}\mathcal{B}^{\prime}$ for every $s\in Q^{\prime}$ . Since $8g_{2}g_{4}g_{8}g_{11}\leq 8\cdot(4^{d})^{2}\cdot(2^{3^{d+2}})^{2}=2^{3+4d+2\cdot 3^{d+2}}\leq 2^{3^{d+3}}$ , property (6) is satisfied. ∎

The next lemma is the last ingredient of the proof. It is a generalisation of the discussion in Subsubsection 2.3.2. Given a tensor $r\in V_{[d]}+\sum_{I\subset[d-1],I\neq\emptyset}\mathbb{F}^{I}\otimes H_{I^{c}}(r)$ , we turn the terms $\mathbb{F}^{I}\otimes H_{I^{c}}(r)$ one by one into terms $V_{I}\otimes\mathbb{F}^{I^{c}}+\mathbb{F}^{I}\otimes V_{I^{c}}$ where $V_{J}$ are small and do not depend on $r$ . (Note that this is not quite the same as our approach to the case $d=3$ .) As briefly explained in Subsubsection 2.3.4, the order in which the various $I$ are considered is important: we define $\prec$ to be any total order on the set of non-empty subsets of $[d-1]$ such that if $J\subsetneq I$ then $J\prec I$ . It is worth noting that unlike in the $d=3$ case, the subspaces $V_{J},V_{J^{c}}$ with $J\prec I$ are allowed to change when $V_{I}$ and $V_{I^{c}}$ get defined (although in fact the $V_{J^{c}}$ will not change, and the $V_{J}$ change only for $J\subsetneq I$ ). All we require is that they do not become much larger.

Lemma 2.15.

Let $d\geq 2$ , $0<\delta\leq 1/2$ and $k\geq G(d-1,\delta)^{2}$ . Let $I\subset[d-1],I\neq\emptyset$ and let $W_{J}\subset\mathbb{F}^{J},W_{J^{c}}\subset\mathbb{F}^{J^{c}}$ be subspaces of dimension at most $k$ for every $J\prec I$ . Moreover, let $W_{[d]}\subset\mathbb{F}^{[d]}$ have dimension at most $k$ . Suppose that $Q^{\prime},Q_{s}$ (and $Q_{I}$ ) have the six properties described in Lemma 2.14. Then any array

[TABLE]

with $\dim(H_{J^{c}}(r))\leq k$ and the property that $r.q=0$ for at least $(1-\frac{1}{4}(2^{-3^{d+2}})^{2})|Q_{I}|$ choices $q\in Q_{I}$ is contained in

[TABLE]

for some $U_{J}\subset\mathbb{F}^{J},U_{J^{c}}\subset\mathbb{F}^{J^{c}}$ not depending on $r$ and some $K_{J^{c}}(r)\subset\mathbb{F}^{J^{c}}$ possibly depending on $r$ , all of dimension at most $k^{2c_{2}(|I|)}$ .

Proof.

By (4) in Lemma 2.14, for every $s\in Q^{\prime}$ there exist subspaces $V_{J}(s)\subset\mathbb{F}^{J}$ for every $J\subset I^{c},J\neq\emptyset$ , with dimension at most $g_{1}=G(d-1,|\mathbb{F}|^{-2^{3^{d+4}}C(\log 2^{d-1}/\delta)^{4}})$ such that the set of arrays $t\in\mathbb{F}^{I^{c}}$ with $t.q=0$ for at least $(1-g_{2})|Q_{s}|$ choices $q\in Q_{s}$ is contained in $\sum_{J\subset I^{c},J\neq\emptyset}V_{J}(s)\otimes\mathbb{F}^{I^{c}\setminus J}$ , where $g_{2}=2^{-3^{d+2}}$ . Note, for future reference, that

[TABLE]

Let $R$ consist of the set of arrays with $r\in W_{[d]}+\sum_{J\prec I}(W_{J}\otimes\mathbb{F}^{J^{c}}+\mathbb{F}^{J}\otimes W_{J^{c}})+\sum_{J\succeq I}\mathbb{F}^{J}\otimes H_{J^{c}}(r)$ with $\dim(H_{J^{c}}(r))\leq k$ and the property that $r.q=0$ for at least $(1-\frac{1}{4}(2^{-3^{d+2}})^{2})|Q_{I}|$ choices $q\in Q_{I}$ .

Let $r\in R$ . Then by averaging and using (5) from Lemma 2.14, for at least $(1-g_{3})|Q^{\prime}|$ choices $s\in Q^{\prime}$ we have $r.(s\otimes t)=0$ for at least $(1-g_{2})|Q_{s}|$ choices $t\in Q_{s}$ , where $g_{3}=\frac{1}{2}2^{-3^{d+2}}$ . Thus, (noting that $r.(s\otimes t)=(rs).t$ ), $rs\in\sum_{J\subset I^{c},J\neq\emptyset}V_{J}(s)\otimes\mathbb{F}^{I^{c}\setminus J}$ holds for at least $(1-g_{3})|Q^{\prime}|$ choices $s\in Q^{\prime}$ . Let $Q^{\prime}(r)$ be the submultiset of $Q^{\prime}$ consisting of those $s\in Q^{\prime}$ for which $rs\in\sum_{J\subset I^{c},J\neq\emptyset}V_{J}(s)\otimes\mathbb{F}^{I^{c}\setminus J}$ . Then we have $|Q^{\prime}(r)|\geq(1-g_{3})|Q^{\prime}|$ .

Note that we can write $r=r_{1}+r_{2}+r_{3}+r_{4}$ where

[TABLE]

By (1) in Lemma 2.14, the elements of $Q^{\prime}$ belong to $\bigcap_{J\subset I,J\neq I,J\neq\emptyset}(W_{J}^{\perp}\otimes\mathbb{F}^{I\setminus J})$ , so we have $r_{1}s=0$ for every $s\in Q^{\prime}$ .

Note that for every pure tensor $s\in\mathbb{F}^{I}$ , $r_{2}s$ is $2^{d}k$ -degenerate. Indeed, for any $J\subset[d-1]$ with $J\not\subset I$ there are some $s_{1}\in\mathbb{F}^{I\cap J},s_{2}\in\mathbb{F}^{I\cap J^{c}}$ with $s=s_{1}\otimes s_{2}$ . Then $(W_{J}\otimes\mathbb{F}^{J^{c}})s\subset(W_{J}s_{1})\otimes\mathbb{F}^{I^{c}\setminus J}$ . Since $\dim(W_{J}s_{1})\leq k$ , $J\not\subset I$ and $d\in I^{c}\setminus J$ , any tensor in $(W_{J}s_{1})\otimes\mathbb{F}^{I^{c}\setminus J}$ is $k$ -degenerate. Similarly, any tensor in $(\mathbb{F}^{J}\otimes W_{J^{c}})s$ or $(\mathbb{F}^{J}\otimes H_{J^{c}}(r))s$ is also $k$ -degenerate, so $r_{2}s$ is indeed $2^{d}k$ -degenerate. Since $Q^{\prime}$ consists of pure tensors, this holds for every $s\in Q^{\prime}$ .

Also, $r_{3}s\in\sum_{J\subset I,J\neq I}((\mathbb{F}^{J}\otimes W_{J^{c}})s)$ . It follows that for every $s\in Q^{\prime}(r)$ , there exists some $t(s)\in V_{I^{c}}(s)+\sum_{J\subset I,J\neq I}((\mathbb{F}^{J}\otimes W_{J^{c}})s)$ such that $r_{4}s-t(s)$ is $g_{4}$ -degenerate for $g_{4}=g_{1}+2^{d}k$ (we have used that $\dim(V_{J}(s))\leq g_{1}$ ). To ease the notation, write $T(s)$ for the space $V_{I^{c}}(s)+\sum_{J\subset I,J\neq I}((\mathbb{F}^{J}\otimes W_{J^{c}})s)$ . We claim that the dimension of $T(s)$ is at most $g_{4}=g_{1}+2^{d}k$ . Indeed, $\dim(V_{I^{c}})\leq g_{1}$ , so it suffices to prove that $\dim((\mathbb{F}^{J}\otimes W_{J^{c}})s)\leq k$ for every $J\subset I,J\neq I$ . Since $s\in Q^{\prime}$ , $s$ is a pure tensor, so for any such $J$ we have $s=s_{1}\otimes s_{2}$ for some $s_{1}\in\mathbb{F}^{J},s_{2}\in\mathbb{F}^{I\setminus J}$ . But then $(\mathbb{F}^{J}\otimes W_{J^{c}})s\subset W_{J^{c}}s_{2}$ , which has dimension at most $\dim(W_{J^{c}})\leq k$ .

Let us define a sequence of subspaces $0=Z(0)\subset Z(1)\subset\dots\subset Z(m)\subset\mathbb{F}^{I^{c}}$ recursively as follows. Given $Z(j)$ , if for all $r\in R$ we have that for all but at most $2g_{3}|Q^{\prime}|$ choices $s\in Q^{\prime}$ there is some $z\in Z(j)$ such that $r_{4}s-z$ is $(g_{4}+1)g_{4}$ -degenerate, then set $m=j$ and terminate.

Else, choose some $r\in R$ such that for at least $2g_{3}|Q^{\prime}|$ choices $s\in Q^{\prime}$ there is no $z\in Z(j)$ such that $r_{4}s-z$ is $(g_{4}+1)g_{4}$ -degenerate, and set $Z(j+1)=Z(j)+H_{I^{c}}(r)$ . Recall that for every $s\in Q^{\prime}(r)$ , and in particular, for at least $(1-g_{3})|Q^{\prime}|$ choices $s\in Q^{\prime}$ , there exists some $t(s)\in T(s)$ such that $r_{4}s-t(s)$ is $g_{4}$ -degenerate. So for at least $g_{3}|Q^{\prime}|$ choices $s\in Q^{\prime}$ there is some $t(s)\in T(s)$ such that $r_{4}s-t(s)$ is $g_{4}$ -degenerate, but there is no $z\in Z(j)$ such that $r_{4}s-z$ is $(g_{4}+1)g_{4}$ -degenerate. In this case there is no $z\in Z(j)$ such that $z-t(s)$ is $g_{4}^{2}$ -degenerate. On the other hand, since $r_{4}s\in H_{I^{c}}(r)\subset Z(j+1)$ , there is some $z\in Z(j+1)$ such that $z-t(s)$ is $g_{4}$ -degenerate. For any $i$ , let $K(i,s)$ be the subspace of $T(s)$ spanned by those $t\in T(s)$ for which there is some $z\in Z(i)$ with $z-t$ being $g_{4}$ -degenerate. Since the dimension of $T(s)$ is at most $g_{4}$ , we have $t(s)\not\in K(j,s)$ , else there would exist some $z\in Z(j)$ such that $z-t(s)$ is $g_{4}^{2}$ -degenerate. On the other hand, $t(s)\in K(j+1,s)$ . Thus, $\dim K(j+1,s)>\dim K(j,s)$ . This holds for at least $g_{3}|Q^{\prime}|$ choices $s\in Q^{\prime}$ , so

[TABLE]

Since $K(m,s)\subset T(s)$ , we have $\dim K(m,s)\leq g_{4}$ . Thus,

[TABLE]

so $m\leq\frac{g_{4}}{g_{3}}$ and $\dim Z(m)\leq\frac{kg_{4}}{g_{3}}$ . Write $Z=Z(m)$ .

Now let $r\in R$ . Let $X(r)$ be the set consisting of those $x\in H_{I^{c}}(r)$ for which there is some $z\in Z$ with $x-z$ being $(g_{4}+1)g_{4}$ -degenerate. Then $r_{4}s\in X(r)$ apart from at most $2g_{3}|Q^{\prime}|$ choices $s\in Q^{\prime}$ . Let $t_{1},\dots,t_{\alpha}$ be a maximal linearly independent subset of $X(r)$ and extend it to a basis $t_{1},\dots,t_{\alpha},t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ for $H_{I^{c}}(r)$ . Now if a linear combination of $t_{1},\dots,t_{\alpha},t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ is in $X(r)$ , then the coefficients of $t^{\prime}_{1},\dots,t^{\prime}_{\beta}$ are all zero. Write $r_{4}=\sum_{i\leq\alpha}s_{i}\otimes t_{i}+\sum_{j\leq\beta}s^{\prime}_{j}\otimes t^{\prime}_{j}$ for some $s_{i},s^{\prime}_{j}\in\mathbb{F}^{I}$ . Since $r_{4}q\in X(r)$ for at least $(1-2g_{3})|Q^{\prime}|=(1-2^{-3^{d+2}})|Q^{\prime}|$ choices $q\in Q^{\prime}$ , we have, for all $j$ , that $s^{\prime}_{j}.q=0$ for at least $(1-2^{-3^{d+2}})|Q^{\prime}|$ choices $q\in Q^{\prime}$ . Thus, by (2) in Lemma 2.14 there exist subspaces $L_{J}\subset\mathbb{F}^{J}$ ( $J\subset I,J\neq\emptyset$ ) not depending on $r$ , and of dimension at most $G(|I|,|\mathbb{F}|^{-2^{d+1}dk})$ such that $s^{\prime}_{j}\in\sum_{J\subset I,J\neq\emptyset}L_{J}\otimes\mathbb{F}^{I\setminus J}$ for all $j$ . Thus, $r_{4}\in\sum_{i\leq\alpha}s_{i}\otimes t_{i}+\sum_{J\subset I,J\neq\emptyset}L_{J}\otimes\mathbb{F}^{J^{c}}$ . Moreover, for every $i\leq\alpha$ , we have $t_{i}\in X(r)$ , so there exist $z_{i}\in Z$ such that $t_{i}-z_{i}$ is $(g_{4}+1)g_{4}$ -degenerate. It follows that $r_{4}\in\mathbb{F}^{I}\otimes Z+\sum_{J\supset I,J\neq I,J\subset[d-1]}\mathbb{F}^{J}\otimes K^{\prime}_{J^{c}}(r)+\sum_{J\subset I,J\neq\emptyset}L_{J}\otimes\mathbb{F}^{J^{c}}$ for some $K^{\prime}_{J^{c}}(r)\subset\mathbb{F}^{J^{c}}$ of dimension at most $\alpha\cdot(g_{4}+1)g_{4}\leq k\cdot(g_{4}+1)g_{4}$ .

We claim that $\dim(Z),\dim(K^{\prime}_{J^{c}})$ and $\dim(L_{J})$ are all bounded by $k^{2c_{2}(|I|)}-k$ .

Firstly, note that $g_{4}=g_{1}+2^{d}k\leq k^{2}+2^{d}k\leq 2k^{2}$ .

Now $\dim(K^{\prime}_{J^{c}})\leq k(g_{4}+1)g_{4}\leq k^{6}\leq k^{2c_{2}(|I|)}-k$ . Also, $\dim(Z)\leq\frac{kg_{4}}{g_{3}}\leq k^{4}\leq k^{2c_{2}(|I|)}-k$ . Finally,

[TABLE]

This completes the proof of the claim and the lemma. ∎

Proof of Lemma 2.2.

As stated earlier, the proof goes by induction on $d$ . For $d=1$ , by Lemma 2.1 there is a subspace $U\subset\mathbb{F}^{n_{1}}$ of codimension at most $C(\log 1/\delta)^{4}$ contained in $2\mathcal{B}^{\prime}-2\mathcal{B}^{\prime}$ . Choose $Q=U$ . Now if $r.q=0$ for at least $(1-2^{-3^{4}})|Q|$ choices $q\in Q$ then the same holds for all $q\in Q$ , therefore $r\in U^{\perp}$ , but $\dim(U^{\perp})\leq C(\log 1/\delta)^{4}$ , so the case $d=1$ is proved.

Now let us assume that $d\geq 2$ . Extend the total order $\prec$ defined above such that it now contains $\emptyset$ which has $\emptyset\prec I$ for every non-empty $I\subset[d-1]$ . Say $\emptyset=I_{0}\prec I_{1}\prec I_{2}\prec\dots\prec I_{2^{d-1}-1}$ where $\{I_{0},\dots,I_{2^{d-1}-1}\}=P([d-1])$ .

Claim. For every $0\leq i\leq 2^{d-1}-1$ there exists a multiset $Q_{I_{i}}$ of pure tensors with elements chosen from $2^{3^{d+3}}\mathcal{B}^{\prime}-2^{3^{d+3}}\mathcal{B}^{\prime}$ , and subspaces $W_{I_{j}}(i)\subset\mathbb{F}^{I_{j}}$ , $W_{(I_{j})^{c}}(i)\subset\mathbb{F}^{(I_{j})^{c}}$ for every $j\leq i$ (for $j=0$ , we only require $W_{[d]}(i)$ and not $W_{\emptyset}(i)$ ) with the following properties. The dimension of each of these spaces is at most $g_{1}(i)=G(d-1,\delta)^{\alpha(i)}$ , where $\alpha(i)=4\cdot\Pi_{1\leq j\leq i}\hskip 2.84526pt2c_{2}(|I_{j}|)$ . Moreover, if $r\in\mathcal{G}$ has $r.q=0$ for at least $(1-\frac{1}{4}(2^{-3^{d+2}})^{2})|Q_{I_{j}}|$ choices $q\in Q_{I_{j}}$ for all $j\leq i$ , then $r\in W_{[d]}(i)+\sum_{1\leq j\leq i}(W_{I_{j}}(i)\otimes\mathbb{F}^{(I_{j})^{c}}+\mathbb{F}^{I_{j}}\otimes W_{(I_{j})^{c}}(i))+\sum_{j>i}\mathbb{F}^{I_{j}}\otimes H_{(I_{j})^{c}}(i,r)$ holds for some $H_{(I_{j})^{c}}(i,r)$ possibly depending on $r$ and of dimension at most $g_{1}(i)$ .

Proof of Claim. This is proved by induction on $i$ . For $i=0$ , by Lemma 2.12, there exist $Q_{\emptyset}\subset 2\mathcal{B}^{\prime}-2\mathcal{B}^{\prime}$ consisting of pure tensors and $V_{[d]}\subset\mathbb{F}^{[d]}$ of dimension at most $4C(\log(2/\delta))^{4}\leq 4C(2\log(1/\delta))^{4}\leq G(d-1,\delta)^{4}$ such that if $r.q=0$ for at least $\frac{7}{8}|Q_{\emptyset}|$ choices $q\in Q_{\emptyset}$ , then $r$ can be written as $r=x+y$ where $x\in V_{[d]}$ and $y$ is $g_{2}$ -degenerate for $g_{2}=G(d-1,\frac{\delta}{4|\mathbb{F}|^{4C(\log 2/\delta)^{4}}})$ . Since

[TABLE]

we can take $W_{[d]}(0)=V_{[d]}$ .

Once we have found suitable sets $W_{I_{j}}(i-1)$ and $W_{(I_{j})^{c}}(i-1)$ for all $j\leq i-1$ , we can apply Lemmas 2.14 and 2.15 with $I=I_{i}$ and $k=g_{1}(i-1)$ to find a suitable $Q_{I_{i}}$ , $W_{I_{j}}(i)$ and $W_{(I_{j})^{c}}(i)$ for all $j\leq i$ , and the claim is proved, since $g_{1}(i)=g_{1}(i-1)^{2c_{2}(|I_{i}|)}$ .

Now, after taking several copies of each $Q_{I}$ , we may assume that additionally $\max_{I}|Q_{I}|\leq 2\min_{I}|Q_{I}|$ . Let $Q=\bigcup_{I\subset[d-1]}Q_{I}$ and suppose that $r.q=0$ for at least $(1-2^{-3^{d+3}})|Q|$ choices $q\in Q$ . Since $2^{-3^{d+3}}\leq\frac{1}{2\cdot 2^{d-1}}\cdot\frac{1}{4}(2^{-3^{d+2}})^{2}$ , it follows that for every $I\subset[d-1]$ we have $r.q=0$ for at least $(1-\frac{1}{4}(2^{-3^{d+2}})^{2})|Q_{I}|$ choices $q\in Q_{I}$ . By the Claim with $i=2^{d-1}-1$ , we get that $r\in\sum_{I\subset[d],I\neq\emptyset}V_{I}\otimes\mathbb{F}^{I^{c}}$ for some $V_{I}\subset\mathbb{F}^{I}$ not depending on $r$ , and of dimension at most $g_{1}(2^{d-1}-1)=G(d-1,\delta)^{\alpha(2^{d-1}-1)}$ . Note that

[TABLE]

But

[TABLE]

Thus, $\alpha(2^{d-1}-1)\leq 4^{d^{d}}$ . This completes the proof of the lemma. ∎

Acknowledgments

I would like to thank Timothy Gowers for helpful discussions. I am also grateful to him and the anonymous referee for their valuable comments on a previous version of this paper.

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Bhowmick and S. Lovett. Bias vs structure of polynomials in large fields, and applications in effective algebraic geometry and coding theory. 2015, ar Xiv:1506.02047.
2[2] W. T. Gowers and O. Janzer. Subsets of Cayley graphs that induce many edges. Theory of Computing , 15(20):1–29, 2019.
3[3] W. T. Gowers and J. Wolf. Linear forms and higher-degree uniformity for functions on 𝔽 p n superscript subscript 𝔽 𝑝 𝑛 \mathbb{F}_{p}^{n} . Geometric and Functional Analysis , 21(1):36–69, 2011.
4[4] B. Green and T. Tao. The distribution of polynomials over finite fields, with applications to the Gowers norms. Contributions to Discrete Mathematics , 4(2), 2009.
5[5] E. Haramaty, and A. Shpilka. On the structure of cubic and quartic polynomials. Proceedings of the forty-second ACM symposium on Theory of computing , pp. 331-340. ACM, 2010.
6[6] H. Hatami, P. Hatami and S. Lovett. Higher-order Fourier Analysis and Applications.
7[7] O. Janzer. Low analytic rank implies low partition rank for tensors. 2018, ar Xiv:1809.10931.
8[8] T. Kaufman and S. Lovett. Worst case to average case reductions for polynomials. 49th Annual IEEE symposium on Foundations of Computer Science , 2008.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Polynomial bound for the Partition Rank vs the Analytic Rank of Tensors

Abstract

1 Introduction

1.1 Bias and rank of polynomials

Definition 1.1**.**

Remark 1.2*.*

Definition 1.3**.**

Theorem 1.4**.**

Theorem 1.5**.**

Theorem 1.6**.**

1.2 Analytic rank and partition rank of tensors

Definition 1.7**.**

Remark 1.8*.*

Definition 1.9**.**

Theorem 1.10**.**

2 The proof of Theorem 1.10

2.1 Notation and preliminaries

Lemma 2.1** (Sanders [14]).**

2.2 The main lemma and some consequences

Lemma 2.2**.**

Definition 2.3**.**

Lemma 2.4**.**

Proof.

Proof of Theorem 1.10.

2.3 The overview of the proof of Lemma 2.2

2.3.1 The high-level outline in the case d=3d=3d=3

2.3.2 Why does this Q{2}Q_{\{2\}}Q{2}​ work?

2.3.3 Why can we find such a Q{2}Q_{\{2\}}Q{2}​ inside 23d+3B′−23d+3B′2^{3^{d+3}}\mathcal{B}^{\prime}-2^{3^{d+3}}\mathcal{B}^{\prime}23d+3B′−23d+3B′?

Lemma 2.5**.**

Lemma 2.6**.**

Proof.

Lemma 2.7**.**

Proof.

2.3.4 How can this be extended to d>3d>3d>3?

2.4 Construction of some auxiliary sets

Definition 2.8**.**

Lemma 2.9**.**

Proof.

Lemma 2.10**.**

Proof.

Lemma 2.11**.**

Proof.

2.5 The proof of Lemma 2.2

Lemma 2.12**.**

Proof.

Definition 2.13**.**

Lemma 2.14**.**

Proof.

Lemma 2.15**.**

Proof.

Proof of Lemma 2.2.

Acknowledgments

Definition 1.1.

*Remark 1.2**.*

Definition 1.3.

Theorem 1.4.

Theorem 1.5.

Theorem 1.6.

Definition 1.7.

*Remark 1.8**.*

Definition 1.9.

Theorem 1.10.

Lemma 2.1 (Sanders [14]).

Lemma 2.2.

Definition 2.3.

Lemma 2.4.

2.3.1 The high-level outline in the case $d=3$

2.3.2 Why does this $Q_{\{2\}}$ work?

2.3.3 Why can we find such a $Q_{\{2\}}$ inside $2^{3^{d+3}}\mathcal{B}^{\prime}-2^{3^{d+3}}\mathcal{B}^{\prime}$ ?

Lemma 2.5.

Lemma 2.6.

Lemma 2.7.

2.3.4 How can this be extended to $d>3$ ?

Definition 2.8.

Lemma 2.9.

Lemma 2.10.

Lemma 2.11.

Lemma 2.12.

Definition 2.13.

Lemma 2.14.

Lemma 2.15.