Polynomial bound for partition rank in terms of analytic rank

Luka Mili\'cevi\'c

arXiv:1902.09830·math.CO·April 25, 2019

TL;DR

This paper establishes a polynomial bound relating the partition rank to the analytic rank of multilinear forms over finite fields, confirming a conjecture and advancing understanding of their structural relationship.

Contribution

The paper proves a polynomial upper bound on the partition rank in terms of the analytic rank, improving previous bounds of tower-of-exponentials type and confirming a conjecture of Kazhdan and Ziegler.

Findings

1. Partition rank is polynomially bounded by analytic rank.
2. Confirmed a conjecture of Kazhdan and Ziegler.
3. Independent proof by Janzer corroborates results.

Abstract

Let $G_{1},\dots,G_{k}$ be vector spaces over a finite field $\mathbb{F}=\mathbb{F}_{q}$ with a non-trivial additive character $\chi$. The analytic rank of a multilinear form $\alpha\colon G_{1}\times\dots\times G_{k}\to\mathbb{F}$ is defined as $\operatorname{arank}(\alpha)=-\log_{q}\mathop{\mathbb{E}}_{x_{1}\in G_{1},\dots,x_{k}\in G_{k}}\chi(\alpha(x_{1},\dots,x_{k}))$. The partition rank $\operatorname{prank}(\alpha)$ of $\alpha$ is the smallest number of maps of partition rank 1 that add up to $\alpha$, where a map is of partition rank 1 if it can be written as a product of two multilinear forms, depending on different coordinates. It is easy to see that $\operatorname{arank}(\alpha)\leq O(\operatorname{prank}(\alpha))$ and it has been known that $\operatorname{prank}(\alpha)$ can be bounded from above in terms of $\operatorname{arank}(\alpha)$. In this paper, we improve the…

Equations

\operatorname{arank}(\alpha)=-\log_{|\mathbb{F}|}\mathop{\mathbb{E}}_{x_{1}\in G_{1},\dots,x_{k}\in G_{k}}\chi\Big{(}\alpha(x_{1},\dots,x_{k})\Big{)}.

\Big{(}\forall x_{1}\in G_{1},\dots,x_{k}\in G_{k}\Big{)}\alpha(x_{1},\dots,x_{k})=\beta(x_{i}\colon i\in I)\gamma(x_{i}\colon i\in[k]\setminus I).

\operatorname{prank}(\alpha)\leq C^{\prime}\operatorname{arank}(\alpha)^{D},

\Big{(}\forall x_{1},\dots,x_{n}\in\mathbb{F}\Big{)}f(x_{1},\dots,x_{n})=g\Big{(}f_{1}(x_{1},\dots,x_{n}),\dots,f_{r}(x_{1},\dots,x_{n})\Big{)}.

\begin{split}\sum_{S\subset[d-1]}(-1)^{d-1-|S|}f\Big{(}x+\sum_{i\in S}y^{i}\Big{)}=(d-1)!\alpha(x,y^{1},\dots,y^{d-1})+&g(y^{1},\dots,y^{d-1})\\ +&\sum_{i\in[d-1]}h_{i}(y^{1},\dots,y^{i-1},x,y^{i+1},\dots,y^{d-1}),\end{split}

\begin{split}\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,\dots,x)\Big{)}\Big{|}^{2}=\Big{|}\mathop{\mathbb{E}}_{x,y\in\mathbb{F}^{n}}&\chi\Big{(}\alpha(x+y,\dots,x+y)\Big{)}\Big{|}^{2}\leq\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\Big{|}\mathop{\mathbb{E}}_{y\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)\Big{)}\Big{|}^{2}\\ =&\mathop{\mathbb{E}}_{x,y,z\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)-\alpha(x+z,\dots,x+z)\Big{)}\\ =&\mathop{\mathbb{E}}_{x,y\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)-\alpha(x,\dots,x)\Big{)}.\end{split}

\begin{split}c^{2^{d-1}}=&\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi(f(x))\Big{|}^{2^{d-1}}=\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,\dots,x)\Big{)}\Big{|}^{2^{d-1}}\leq\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}\sum_{S\subset[d-1]}(-1)^{d-1-|S|}f\big{(}x+\sum_{i\in S}y^{i}\big{)}\Big{)}\\ =&\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}(d-1)!\hskip 2.0pt\alpha(x,y^{1},\dots,y^{d-1})+g(y^{1},\dots,y^{d-1})+\sum_{i\in[d-1]}h_{i}(y^{1},\dots,y^{i-1},x,y^{i+1},\dots,y^{d-1})\Big{)}.\end{split}

\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,y^{1},\dots,y^{d-1})\Big{)}\geq c^{2^{2d-2}}.

\alpha(u^{1},\dots,u^{d})=\sum_{i\in[r]}\beta_{i}(u^{j}\colon j\in I_{i})\gamma_{i}(u^{j}\colon j\in[d]\setminus I_{i}).

\alpha(x_{[i-1]},y+z,x_{[i+1,k]})=\alpha(x_{[i-1]},y,x_{[i+1,k]})+\alpha(x_{[i-1]},z,x_{[i+1,k]}).

\alpha(x_{[i-1]},y+z-w,x_{[i+1,k]})=\alpha(x_{[i-1]},y,x_{[i+1,k]})+\alpha(x_{[i-1]},z,x_{[i+1,k]})-\alpha(x_{[i-1]},w,x_{[i+1,k]}).

\Big{\{}x_{[k]}\in G_{[k]}\colon(\forall i\in[r])\beta_{i}(x_{I_{i}})=0\Big{\}}\subset\Big{\{}x_{[k]}\in G_{[k]}\colon\alpha(x_{[k]})=0\Big{\}}.

\Big{(}\forall x_{[k]}\in G_{[k]}\Big{)}\hskip 6.0pt\alpha(x_{[k]})=\sum_{i\in[r]}\beta_{i}(x_{I_{i}})\gamma_{i}(x_{[k]\setminus I_{i}}).

X=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon\big{|}\{y\in G_{k}\colon\alpha(x_{[k-1]},y)\in S\}\big{|}\geq\epsilon|G_{k}|\Big{\}}.

\mathbf{C}_{i}f(x_{[k]})=\mathop{\mathbb{E}}_{y_{i}\in G_{i}}f(x_{[i-1]},y_{i}+x_{i},x_{[i+1,k]})\overline{f(x_{[i-1]},y_{i},x_{[i+1,k]})}.

\bigg{|}\mathbf{C}_{l}\mathbf{C}_{l-1}\cdots\mathbf{C}_{1}Z(x_{[k]})-\sum_{i\in[m]}c_{i}\chi\Big{(}\rho_{i}(x_{[k]})\Big{)}\bigg{|}\leq\epsilon

\begin{split}\bm{Weak}(k)\implies\bm{Strong}(k)\implies\bm{Inner}(k-1)&\implies\bm{Columns}(k)\\ \Big{(}\bm{Inner}(k-1)\land\bm{Columns}(k-1)\Big{)}&\implies\bm{Conv}(k)\implies\bm{Weak}(k+1).\end{split}

\mathop{\mathbb{E}}_{x_{1},\dots,x_{k}}\chi\Big{(}\rho(x_{[k]})-\sum\limits_{i\in F}\lambda_{i}\gamma_{i}(x_{[k]})\Big{)}\leq\eta=2^{-2k}|\mathbb{F}|^{-(k+1)(3r+2)},

\mathbb{P}(\phi(x_{[k]})=0)=\mathbb{P}\Big{(}(\forall i\in[s])A(x_{[k]})\cdot h_{i}=0\Big{)}=|\mathbb{F}|^{-s}.

\Phi(x_{[k]},\lambda)=\sum_{i\in[r]}\lambda_{i}A_{i}(x_{[k]}).

\begin{split}\Big{\{}x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}\phi_{i}(x_{[k]})=0\Big{\}}&=\Big{\{}x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}\psi(x_{[k]},e_{i})=0\Big{\}}=\{x_{[k]}\in G_{[k]}\colon\psi(x_{[k]},\lambda)=0\}\\ &\supset\{x_{[k]}\in G_{[k]}\colon\Phi(x_{[k]},\lambda)=0\}=\Big{\{}x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}A_{i}(x_{[k]})=0\Big{\}}\end{split}

\begin{split}&\Big{|}\Big{\{}x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}\phi_{i}(x_{[k]})=0\Big{\}}\setminus\Big{\{}x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}A_{i}(x_{[k]})=0\Big{\}}\Big{|}\\ \leq&\Big{|}\Big{\{}(x_{[k]},\mu)\in G_{[k]}\times\mathbb{F}^{r}\colon\psi(x_{[k]},\mu)=0\Big{\}}\setminus\Big{\{}(x_{[k]},\mu)\in G_{[k]}\times\mathbb{F}^{r}\colon\Phi(x_{[k]},\mu)=0\Big{\}}\Big{|}\\ \leq&|\mathbb{F}|^{-r}\epsilon|G_{[k]}\times\mathbb{F}^{r}|=\epsilon|G_{[k]}|,\end{split}

\Big{|}\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha(x_{[k]}))\Big{|}\leq\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha_{[k]}(x_{[k]}))\in\mathbb{R}_{\geq 0}.

\begin{split}\Big{|}\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha(x_{[k]}))\Big{|}=&\Big{|}\mathop{\mathbb{E}}_{x_{[k-1]}}\chi(\alpha^{\prime}(x_{[k-1]}))\mathop{\mathbb{E}}_{x_{k}}\chi\Big{(}A(x_{[k-1]})\cdot x_{k}\Big{)}\Big{|}=\Big{|}\mathop{\mathbb{E}}_{x_{[k-1]}}\chi(\alpha^{\prime}(x_{[k-1]}))\bm{1}(A(x_{[k-1]})=0)\Big{|}\\ \leq&\mathop{\mathbb{E}}_{x_{[k-1]}}\bm{1}(A(x_{[k-1]})=0)=\mathop{\mathbb{E}}_{x_{[k]}}\chi\Big{(}\sum_{k\in I\subset[k]}\alpha_{I}(x_{I})\Big{)}.\end{split}

\|f\|_{\square^{k}}^{2^{k}}=\mathop{\mathbb{E}}_{x_{[k]},y_{[k]}\in G_{[k]}}\prod_{I\subset[k]}\operatorname{Conj}^{|I|}f(x_{I},y_{[k]\setminus I})

\Big{|}\mathop{\mathbb{E}}_{x_{[k]},y_{[k]}\in G_{[k]}}\prod_{I\subset[k]}\operatorname{Conj}^{|I|}f_{I}(x_{I},y_{[k]\setminus I})\Big{|}\leq\prod_{I\subset[k]}\|f_{I}\|_{\square^{k}}.

\mathop{\mathbb{E}}_{x_{1},\dots,x_{k}}\chi\Big{(}\rho(x_{[k]})-\sum\limits_{i\in F}\lambda_{i}\gamma_{i}(x_{[k]})\Big{)}\leq\eta=2^{-2k}|\mathbb{F}|^{-(k+1)(3r+2)},

\gamma_{i}(x_{I_{i}})=\Gamma_{i}(x_{I_{i}\setminus\{k\}})\cdot x_{k}\quad\text{and}\quad\rho(x_{[k]})=R(x_{[k-1]})\cdot x_{k}.

R(x_{[k-1]}),R(y_{[k-1]})\notin\operatorname{span}\Big{\{}\Gamma_{i}(x_{I_{i}\setminus\{k\}}),\Gamma_{i}(y_{I_{i}\setminus\{k\}})\colon i\in[r_{0}+1,r]\Big{\}},

R(x_{[k-1]})\in\operatorname{span}\Big{\{}\Gamma_{i}(x_{I_{i}\setminus\{k\}}),\Gamma_{i}(y_{I_{i}\setminus\{k\}})\colon i\in[r_{0}+1,r]\Big{\}},

Full text

**Polynomial bound for partition rank in terms of analytic rank**

Luka Milićević†

† Mathematical Institute of the Serbian Academy of Sciences and Arts, Belgrade, Serbia.

† Email: [email protected]

Abstract

Let $G_{1},\dots,G_{k}$ be vector spaces over a finite field $\mathbb{F}=\mathbb{F}_{q}$ with a non-trivial additive character $\chi$ . The analytic rank of a multilinear form $\alpha\colon G_{1}\times\dots\times G_{k}\to\mathbb{F}$ is defined as $\operatorname{arank}(\alpha)=-\log_{q}\mathop{\mathbb{E}}_{x_{1}\in G_{1},\dots,x_{k}\in G_{k}}\chi\big{(}\alpha(x_{1},\dots,x_{k})\big{)}$ . The partition rank $\operatorname{prank}(\alpha)$ of $\alpha$ is the smallest number of maps of partition rank 1 that add up to $\alpha$ , where a map is of partition rank 1 if it can be written as a product of two multilinear forms, depending on different coordinates. It is easy to see that $\operatorname{arank}(\alpha)\leq O\Big{(}\operatorname{prank}(\alpha)\Big{)}$ and it has been known that $\operatorname{prank}(\alpha)$ can be bounded from above in terms of $\operatorname{arank}(\alpha)$ . In this paper, we improve the latter bound to polynomial, i.e. we show that there are quantities $C,D$ depending on $k$ only such that $\operatorname{prank}(\alpha)\leq C(\operatorname{arank}(\alpha)^{D}+1)$ . As a consequence, we prove a conjecture of Kazhdan and Ziegler.

The same result was obtained independently and simultaneously by Janzer.

**§1 Introduction**

Throughout the paper, $G_{1},\dots,G_{k}$ are vector spaces over a finite field $\mathbb{F}$ and $\chi$ is a non-trivial additive character of $\mathbb{F}$. In order to measure how much the distribution of values of a multilinear form $\alpha\colon G_{1}\times\dots\times G_{k}\to\mathbb{F}$ deviates from the uniform distribution, Gowers and Wolf [3] introduced the notion of analytic rank.

Definition 1 (Analytic rank).

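The analytic rank of $\alpha$ is defined as

$$\operatorname{arank}(\alpha)=-\log_{|\mathbb{F}|}\mathop{\mathbb{E}}_{x_{1}\in G_{1},\dots,x_{k}\in G_{k}}\chi\Big{(}\alpha(x_{1},\dots,x_{k})\Big{)}.$$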
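For two variables the analytic rank is known to equal the algebraic rank of the matrix of the form. The following is a minimal brute-force sketch of that fact, assuming a prime field $\mathbb{F}_{p}$, the character $\chi(t)=e^{2\pi it/p}$ and a form $\alpha(x,y)=x^{T}My$; the function names `analytic_rank_bilinear` and `rank_mod_p` are hypothetical and not taken from the paper.

```python
import cmath
import itertools
import math


def analytic_rank_bilinear(M, p):
    """Brute-force arank of alpha(x, y) = x^T M y over F_p (p prime, assumed)."""
    n, m = len(M), len(M[0])
    total = 0
    for x in itertools.product(range(p), repeat=n):
        for y in itertools.product(range(p), repeat=m):
            value = sum(x[i] * M[i][j] * y[j] for i in range(n) for j in range(m)) % p
            # Assumed character: chi(t) = exp(2 pi i t / p).
            total += cmath.exp(2j * cmath.pi * value / p)
    bias = total / p ** (n + m)
    return -math.log(bias.real, p)


def rank_mod_p(M, p):
    """Rank of M over F_p by Gaussian elimination."""
    rows = [[v % p for v in row] for row in M]
    rank = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], -1, p)  # modular inverse (Python 3.8+)
        rows[rank] = [(v * inv) % p for v in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(a - factor * b) % p for a, b in zip(rows[r], rows[rank])]
        rank += 1
    return rank


# Illustrative matrix: the second row is twice the first modulo 3.
EXAMPLE = [[1, 2, 0], [2, 1, 0], [0, 0, 0]]
```

Comparing `analytic_rank_bilinear(EXAMPLE, 3)` with `rank_mod_p(EXAMPLE, 3)` gives agreement up to floating-point error. The enumeration is exponential in the dimension, so this is only meant for very small examples.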

Note that this definition does not depend on the choice of $\chi$ and that, in the case of two variables, it coincides with the usual algebraic rank of the corresponding matrix. Another way to generalize the notion of algebraic rank to multilinear forms is the partition rank, introduced by Naslund [12], which is defined as follows.

Definition 2 (Partition rank).

Let $\alpha\colon G_{1}\times\dots\times G_{k}\to\mathbb{F}$ be a multilinear form. We say that $\alpha$ is of partition rank 1 if there is a set $\emptyset\not=I\subsetneq[k]$ and multilinear forms $\beta\colon\prod_{i\in I}G_{i}\to\mathbb{F}$ and $\gamma\colon\prod_{i\in[k]\setminus I}G_{i}\to\mathbb{F}$ such that

$$\Big{(}\forall x_{1}\in G_{1},\dots,x_{k}\in G_{k}\Big{)}\alpha(x_{1},\dots,x_{k})=\beta(x_{i}\colon i\in I)\gamma(x_{i}\colon i\in[k]\setminus I).$$

In general, the partition rank $\operatorname{prank}(\alpha)$ of $\alpha$ is the smallest integer $r$ such that $\alpha$ is a sum of $r$ multilinear forms of partition rank 1. We also set $\operatorname{prank}(0)=0$.
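As a small sanity check of Definitions 1 and 2, the sketch below computes the analytic rank of one trilinear form of partition rank 1 by exhaustive enumeration. It assumes the field $\mathbb{F}_{2}$ with $\chi(t)=(-1)^{t}$ and the illustrative form $\alpha(x,y,z)=x_{1}\,(y\cdot z)$ on $(\mathbb{F}_{2}^{3})^{3}$; the names `analytic_rank_f2` and `alpha` are hypothetical.

```python
import itertools
import math


def analytic_rank_f2(form, dims):
    """arank of a multilinear form over F_2, using chi(t) = (-1)^t (assumed)."""
    spaces = [list(itertools.product((0, 1), repeat=d)) for d in dims]
    total = 0
    count = 0
    for point in itertools.product(*spaces):
        total += 1 if form(*point) % 2 == 0 else -1
        count += 1
    return -math.log2(total / count)


def alpha(x, y, z):
    """Partition rank 1: a linear form in x times a bilinear form in (y, z)."""
    beta = x[0]
    gamma = sum(a * b for a, b in zip(y, z))
    return beta * gamma


N = 3
ARANK = analytic_rank_f2(alpha, (N, N, N))
```

Here the bias works out to $\tfrac12+\tfrac12\cdot 2^{-3}=\tfrac{9}{16}$, so the analytic rank is $\log_{2}(16/9)<1$, which is at most the partition rank of this form.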

It is easy to see that $\operatorname{arank}(\alpha)\leq O\Big{(}\operatorname{prank}(\alpha)\Big{)}$, and Lovett [11] proved $\operatorname{arank}(\alpha)\leq\operatorname{prank}(\alpha)$. On the other hand, it turns out that we can bound partition rank in terms of analytic rank. This was first proved by Bhowmick and Lovett [1], who showed that $\operatorname{prank}(\alpha)\leq f(\operatorname{arank}(\alpha),k,|\mathbb{F}|)$, where $f$ has an Ackermann-type dependence on its parameters. This bound was recently improved significantly by Janzer [7] to a tower of exponentials of height depending on $k$. In this paper we prove the following.

Theorem 3.

For given $k$ , there are constants $C,D$ such that $\operatorname{prank}(\alpha)\leq C(\operatorname{arank}(\alpha)^{D}+1).$

We refer to this result as the strong inverse theorem for multilinear forms of low analytic rank or simply as the strong inverse theorem. (We emphasize the word strong to distinguish it from a weak version of the inverse theorem, which we also prove in this paper.) The constants $C,D$ can be taken to be $C=2^{k^{2^{O(k^{2})}}},D=2^{2^{O(k^{2})}}$. Note also that, since every non-zero form $\alpha\colon G_{1}\times\dots\times G_{k}\to\mathbb{F}$ takes at least $\Big{(}\frac{|\mathbb{F}|-1}{|\mathbb{F}|}\Big{)}^{k}|G_{1}|\cdots|G_{k}|$ non-zero values, there is a constant $a_{0}$, depending on $k$ and $\mathbb{F}$, such that for all non-zero forms $\alpha$, $\operatorname{arank}(\alpha)\geq a_{0}$. Hence, we may rewrite the bound in Theorem 3 as

$$\operatorname{prank}(\alpha)\leq C^{\prime}\operatorname{arank}(\alpha)^{D},$$

but the new constant $C^{\prime}$ would need to depend on $\mathbb{F}$ as well.

Theorem 3 was also proved by Janzer [8] independently and simultaneously, using a different argument.

Before the study of ranks of multilinear forms, an important topic of study was the distribution of multivariate polynomials over finite fields. In this direction, we have the following result, which was proved by Green and Tao [4] (with the restriction that the degree of the polynomial is bounded by the size of the field) and by Kaufman and Lovett [9] (without the restriction on the degree), and which was applied by Bhowmick and Lovett [1] to coding theory and effective algebraic geometry.

Theorem 4 (Green and Tao [4]; Kaufman and Lovett [9]).

Let $\mathbb{F}$ be a field of prime order. Suppose that $f\colon\mathbb{F}^{n}\to\mathbb{F}$ is a polynomial of degree $d$ ($d<|\mathbb{F}|$ in the result of Green and Tao). If $\Big{|}\mathop{\mathbb{E}}_{x_{1},\dots,x_{n}\in\mathbb{F}}\chi\Big{(}f(x_{1},\dots,x_{n})\Big{)}\Big{|}\geq c$, then there is $r\leq O_{p,d,c^{-1}}(1)$, polynomials $f_{1},\dots,f_{r}$ of degree $d-1$, and a map $g\colon\mathbb{F}^{r}\to\mathbb{F}$ such that

$$\Big{(}\forall x_{1},\dots,x_{n}\in\mathbb{F}\Big{)}f(x_{1},\dots,x_{n})=g\Big{(}f_{1}(x_{1},\dots,x_{n}),\dots,f_{r}(x_{1},\dots,x_{n})\Big{)}.$$

As observed by Green and Tao in [4] and by Janzer in [7], it is easy to deduce this result from the strong inverse theorem, at least when $d<\operatorname{char}\mathbb{F}$. We include a very short sketch to confirm that we may take polynomial bounds in Theorem 4 as well. This proves a conjecture of Kazhdan and Ziegler (Conjecture 3.5 in [10]).

Sketch proof of Theorem 4.

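Consider a symmetric multilinear form $\alpha\colon(\mathbb{F}^{n})^{d}\to\mathbb{F}$ such that $f(x)=\alpha(x,x,\dots,x)$. Since $d<\operatorname{char}\mathbb{F}$, $d!$ is invertible in $\mathbb{F}$, and such a form exists. Notice that for all $x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}$

$$\begin{split}\sum_{S\subset[d-1]}(-1)^{d-1-|S|}f\Big{(}x+\sum_{i\in S}y^{i}\Big{)}=(d-1)!\alpha(x,y^{1},\dots,y^{d-1})+&g(y^{1},\dots,y^{d-1})\\ +&\sum_{i\in[d-1]}h_{i}(y^{1},\dots,y^{i-1},x,y^{i+1},\dots,y^{d-1}),\end{split}$$

for some maps $g,h_{1},\dots,h_{d-1}$. Observe also that

$$\begin{split}\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,\dots,x)\Big{)}\Big{|}^{2}=\Big{|}\mathop{\mathbb{E}}_{x,y\in\mathbb{F}^{n}}&\chi\Big{(}\alpha(x+y,\dots,x+y)\Big{)}\Big{|}^{2}\leq\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\Big{|}\mathop{\mathbb{E}}_{y\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)\Big{)}\Big{|}^{2}\\ =&\mathop{\mathbb{E}}_{x,y,z\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)-\alpha(x+z,\dots,x+z)\Big{)}\\ =&\mathop{\mathbb{E}}_{x,y\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x+y,\dots,x+y)-\alpha(x,\dots,x)\Big{)}.\end{split}$$

Applying this $d-1$ times in total, we get

$$\begin{split}c^{2^{d-1}}=&\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi(f(x))\Big{|}^{2^{d-1}}=\Big{|}\mathop{\mathbb{E}}_{x\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,\dots,x)\Big{)}\Big{|}^{2^{d-1}}\leq\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}\sum_{S\subset[d-1]}(-1)^{d-1-|S|}f\big{(}x+\sum_{i\in S}y^{i}\big{)}\Big{)}\\ =&\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}(d-1)!\hskip 2.0pt\alpha(x,y^{1},\dots,y^{d-1})+g(y^{1},\dots,y^{d-1})+\sum_{i\in[d-1]}h_{i}(y^{1},\dots,y^{i-1},x,y^{i+1},\dots,y^{d-1})\Big{)}.\end{split}$$

Hence, by the Cauchy-Schwarz inequality for the box norm (Lemma 15), we get

$$\mathop{\mathbb{E}}_{x,y^{1},\dots,y^{d-1}\in\mathbb{F}^{n}}\chi\Big{(}\alpha(x,y^{1},\dots,y^{d-1})\Big{)}\geq c^{2^{2d-2}}.$$

Apply the strong inverse theorem to find $r\leq C\Big{(}2^{2d-2}\log_{|\mathbb{F}|}(|\mathbb{F}|c^{-1})\Big{)}^{D}$, sets $\emptyset\not=I_{i}\subsetneq[d]$ and multilinear forms $\beta_{i}\colon\prod_{j\in I_{i}}G_{j}\to\mathbb{F},\gamma_{i}\colon\prod_{j\in[d]\setminus I_{i}}G_{j}\to\mathbb{F}$ for $i\in[r]$ such that

$$\alpha(u^{1},\dots,u^{d})=\sum_{i\in[r]}\beta_{i}(u^{j}\colon j\in I_{i})\gamma_{i}(u^{j}\colon j\in[d]\setminus I_{i}).$$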

Then $f(x)=\alpha(x,\dots,x)=\sum_{i\in[r]}\beta_{i}(x,\dots,x)\gamma_{i}(x,\dots,x)$ , as desired.∎

The proof of the strong inverse theorem given here is entirely self-contained and in particular does not use other results of additive combinatorics such as Freiman’s theorem, and, as is clear from the bounds we obtain here, we do not apply a regularity lemma. In fact, it is probable that the results of this paper may replace the use of regularity lemmas in similarly algebraic settings. In the next subsection, we list the main results of this paper and discuss the proofs and the ideas in more detail.

**1.1. Main results and overview of the argument**

Notation. In the rest of the paper, we use the following convention to save writing in situations where we have many indices appearing in predictable patterns. Instead of the whole sequence $x_{1},\dots,x_{m}$, we write $x_{[m]}$, and we write $x_{I}$ for $I\subset[m]$ to denote the subsequence with indices in $I$. This applies to products as well: $G_{[k]}$ stands for $\prod_{i\in[k]}G_{i}$ and $G_{I}=\prod_{i\in I}G_{i}$. For example, instead of writing $\alpha\colon\prod_{i\in I}G_{i}\to\mathbb{F}$ and $\alpha(x_{i}\colon i\in I)$, we write $\alpha\colon G_{I}\to\mathbb{F}$ and $\alpha(x_{I})$.

Given $\mathbb{F}$-vector spaces $G_{1},\dots,G_{k},H$, a map $\alpha\colon G_{[k]}\to H$ is said to be multilinear if it is linear in each coordinate, that is, whenever $x_{[i-1]}\in G_{[i-1]},x_{[i+1,k]}\in G_{[i+1,k]}$ and $y,z\in G_{i}$, then

$$\alpha(x_{[i-1]},y+z,x_{[i+1,k]})=\alpha(x_{[i-1]},y,x_{[i+1,k]})+\alpha(x_{[i-1]},z,x_{[i+1,k]}).$$

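Similarly, $\alpha$ is multiaffine if it is affine in each coordinate, i.e. whenever $x_{[i-1]}\in G_{[i-1]},x_{[i+1,k]}\in G_{[i+1,k]}$ and $y,z,w\in G_{i}$, then

$$\alpha(x_{[i-1]},y+z-w,x_{[i+1,k]})=\alpha(x_{[i-1]},y,x_{[i+1,k]})+\alpha(x_{[i-1]},z,x_{[i+1,k]})-\alpha(x_{[i-1]},w,x_{[i+1,k]}).$$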

Also, we refer to the zero set of a multiaffine map $\alpha\colon G_{[k]}\to H$, where $H$ is a vector space over $\mathbb{F}$, as a variety, and the codimension of a variety is $\dim H$. Another convention we adopt is that we write $\mathop{\mathbb{E}}_{x}$, without specifying the set from which $x$ is taken, when this causes no confusion. Frequently we shall consider ‘slices’ of sets $S\subset G_{[k]}$, by which we mean sets $S_{x_{I}}=\{y_{[k]\setminus I}\in G_{[k]\setminus I}\colon(x_{I},y_{[k]\setminus I})\in S\}$, for $I\subset[k],x_{I}\in G_{I}$. Occasionally, we might have a single element $z\in G_{i}$ instead of $x_{I}$, and in this case we write $S_{i\colon z}$ for the resulting slice, since the direction $i$ is not clear from the notation $z$, unlike in the case of $x_{I}$. Finally, for each vector space $G_{i}$, fix a dot product $\cdot$. We need this for the characterization of linear forms on $G_{i}$: each linear form $\phi\colon G_{i}\to\mathbb{F}$ takes the form $\phi(x)=x\cdot u$ for an element $u\in G_{i}$.

Results and outline. Our first main result is the weak version of the (strong) inverse theorem.

Theorem 5 (Weak inverse theorem for maps of low analytic rank - Weak( $k$ )).

For given $k$, there are constants $C=C^{\bm{weak}}_{k},D=D^{\bm{weak}}_{k}>0$ with the following property. Suppose that $\alpha\colon G_{[k]}\to\mathbb{F}$ is a multilinear form such that $\mathop{\mathbb{E}}_{x_{[k]}}\chi\Big{(}\alpha(x_{[k]})\Big{)}\geq c$, for some $c>0$. Then there is $r\leq C\log_{|\mathbb{F}|}^{D}(|\mathbb{F}|c^{-1})$ and there are multilinear maps $\beta_{i}\colon G_{I_{i}}\to\mathbb{F}$, $i\in[r]$, where $\emptyset\not=I_{i}\subset[k-1]$, such that

$$\Big{\{}x_{[k]}\in G_{[k]}\colon(\forall i\in[r])\beta_{i}(x_{I_{i}})=0\Big{\}}\subset\Big{\{}x_{[k]}\in G_{[k]}\colon\alpha(x_{[k]})=0\Big{\}}.$$

Note that there is a multilinear map $A\colon G_{[k-1]}\to G_{k}$ such that for each $x_{[k]}\in G_{[k]}$ , $\alpha(x_{[k]})=A(x_{[k-1]})\cdot x_{k}$ . Then $\Big{|}\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon A(x_{[k-1]})=0\Big{\}}\Big{|}=\Big{(}\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha(x_{[k]}))\Big{)}|G_{[k-1]}|$ . Thus, another way to phrase the weak inverse theorem is to say that every dense variety contains a low-codimensional variety. On the other hand, it is very easy to see that low-codimensional varieties are necessarily dense, see Lemma 11.
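The counting identity above can be checked numerically on a toy example. The sketch below assumes $k=3$, the field $\mathbb{F}_{2}$ with $\chi(t)=(-1)^{t}$, and a trilinear form given by a 0/1 coefficient tensor `T` (an illustrative encoding, not the paper's notation); the function name `zero_count_and_character_sum` is hypothetical. Working over $\mathbb{F}_{2}$ keeps every quantity an exact integer.

```python
import itertools


def cube(n):
    """All vectors of F_2^n as tuples."""
    return itertools.product((0, 1), repeat=n)


def zero_count_and_character_sum(T):
    """For alpha(x, y, z) = sum_{i,j,l} T[i][j][l] x_i y_j z_l over F_2,
    return (#{(x, y): A(x, y) = 0}, sum over all points of (-1)^alpha),
    where A(x, y) is the vector with alpha(x, y, z) = A(x, y) . z."""
    n1, n2, n3 = len(T), len(T[0]), len(T[0][0])

    def A(x, y):
        return tuple(
            sum(T[i][j][l] * x[i] * y[j] for i in range(n1) for j in range(n2)) % 2
            for l in range(n3)
        )

    zeros = sum(1 for x in cube(n1) for y in cube(n2) if not any(A(x, y)))
    char_sum = sum(
        (-1) ** (sum(a * c for a, c in zip(A(x, y), z)) % 2)
        for x in cube(n1)
        for y in cube(n2)
        for z in cube(n3)
    )
    return zeros, char_sum


# Arbitrary illustrative tensor with dimensions 2 x 2 x 3.
T_EXAMPLE = [[[(i * j + l) % 2 for l in range(3)] for j in range(2)] for i in range(2)]
ZEROS, CHAR_SUM = zero_count_and_character_sum(T_EXAMPLE)
```

Dividing `CHAR_SUM` by $|G_{[3]}|=2^{7}$ gives the bias, and multiplying the bias by $|G_{[2]}|=2^{4}$ recovers `ZEROS`; equivalently, `ZEROS * 2**3 == CHAR_SUM`.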

Next, we have the strong inverse theorem.

Theorem 6 (Strong inverse theorem for maps of low analytic rank - Strong( $k$ )).

For given $k$, there are constants $C=C^{\bm{strong}}_{k},D=D^{\bm{strong}}_{k}>0$ with the following property. Suppose that $\alpha\colon G_{[k]}\to\mathbb{F}$ is a multilinear form such that $\mathop{\mathbb{E}}_{x_{[k]}}\chi(\alpha(x_{[k]}))\geq c$, for some $c>0$. Then there is $r\leq C\log_{|\mathbb{F}|}^{D}(|\mathbb{F}|c^{-1})$ and there are multilinear maps $\beta_{i}\colon G_{I_{i}}\to\mathbb{F}$ and $\gamma_{i}\colon G_{[k]\setminus I_{i}}\to\mathbb{F}$, $i\in[r]$, where $\emptyset\not=I_{i}\subset[k-1]$, such that

$$\Big{(}\forall x_{[k]}\in G_{[k]}\Big{)}\hskip 6.0pt\alpha(x_{[k]})=\sum_{i\in[r]}\beta_{i}(x_{I_{i}})\gamma_{i}(x_{[k]\setminus I_{i}}).$$

We remark that the bounds on the constants claimed in the introduction, namely $C^{\bm{strong}}_{k}=2^{k^{2^{O(k^{2})}}}$ and $D^{\bm{strong}}_{k}=2^{2^{O(k^{2})}}$ , follow from inequalities (9) and (4) which appear later in the paper. In fact, bounds of the same form hold in all results stated in this subsection.

Let $S\subset G_{[k]}$ and let $\alpha\colon G_{[k]}\to H$ be a multiaffine map. A layer of $\alpha$ is any set of the form $\{x_{[k]}\in G_{[k]}\colon\alpha(x_{[k]})=\lambda\}$, for $\lambda\in H$. We say that layers of $\alpha$ *internally $\epsilon$-approximate* $S$ if there are layers $L_{1},\dots,L_{m}$ of $\alpha$ such that $S\supset L_{i}$ for each $i\in[m]$ and $\Big{|}S\setminus\Big{(}\cup_{i\in[m]}L_{i}\Big{)}\Big{|}\leq\epsilon|G_{[k]}|$. Similarly, we say that layers of $\alpha$ *externally $\epsilon$-approximate* $S$ if there are layers $L_{1},\dots,L_{m}$ of $\alpha$ such that $S\subset\cup_{i\in[m]}L_{i}$ and $\Big{|}\Big{(}\cup_{i\in[m]}L_{i}\Big{)}\setminus S\Big{|}\leq\epsilon|G_{[k]}|$.

The next two results say that we may approximate internally and externally certain sets by low-codimensional varieties. In the first case, the sets we have in mind are dense varieties, and in the second case these are the sets of dense columns of a variety.

Theorem 7 (Simultaneous inner approximation of varieties - Inner( $k$ )).

For given $k$ , there are constants $C=C^{\bm{inner}}_{k},D=D^{\bm{inner}}_{k}>0$ with the following property. Let $\epsilon>0$ and let $B_{1},\dots,B_{r}\colon G_{[k]}\to H$ be multiaffine maps. For each $\lambda\in\mathbb{F}^{r}$ , let $Z_{\lambda}=\{x_{[k]}\in G_{[k]}\colon\sum_{i\in[r]}\lambda_{i}B_{i}(x_{[k]})=0\}$ . Then there is $s\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}$ , a multiaffine map $\beta\colon G_{[k]}\to\mathbb{F}^{s}$ such that for each $\lambda\in\mathbb{F}^{r}$ , layers of $\beta$ internally $\epsilon$ -approximate $Z_{\lambda}$ .

Theorem 8 (Structure of a set of dense columns of a variety - Columns( $k$ )).

For given $k$, there are constants $C=C^{\bm{columns}}_{k},D=D^{\bm{columns}}_{k}>0$ with the following property. Let $\alpha\colon G_{[k]}\to\mathbb{F}^{r}$ be a multiaffine map. Let $S\subset\mathbb{F}^{r}$ and $\epsilon>0$. Define the set of $\epsilon$-dense columns as

$$X=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon\big{|}\{y\in G_{k}\colon\alpha(x_{[k-1]},y)\in S\}\big{|}\geq\epsilon|G_{k}|\Big{\}}.$$

Then, there is $s\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}$ , a multiaffine map $\beta\colon G_{[k-1]}\to\mathbb{F}^{s}$ such that layers of $\beta$ $\epsilon$ -internally and $\epsilon$ -externally approximate $X$ .

Finally, we prove a strong approximation result for the convolutions of the indicator function of a low-codimensional variety. We call it an almost $L^{\infty}$ approximation theorem, which sounds slightly oxymoronic, but is appropriate since we actually prove that the convolution can be approximated by a finite exponential sum on a very structured set (a union of layers of a low-codimensional variety) of density $1-o(1)$. For this theorem, we need one more piece of notation. For a map $f\colon G_{[k]}\to\mathbb{C}$, we define its convolution in direction $i$, denoted by $\mathbf{C}_{i}f\colon G_{[k]}\to\mathbb{C}$, as

$$\mathbf{C}_{i}f(x_{[k]})=\mathop{\mathbb{E}}_{y_{i}\in G_{i}}f(x_{[i-1]},y_{i}+x_{i},x_{[i+1,k]})\overline{f(x_{[i-1]},y_{i},x_{[i+1,k]})}.$$

We also abuse notation slightly: for a given set $Z$, we write $Z$ for its indicator function as well in the expression below.
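To make the convolution operator concrete, here is a minimal sketch for real-valued functions (such as indicator functions, for which the complex conjugate can be dropped). It assumes a prime field, vectors stored as tuples, and zero-based direction indices, so direction `1` in the code is the paper's direction $2$; the names `convolve`, `indicator_Z` and `C2_Z` are hypothetical.

```python
from fractions import Fraction
import itertools


def convolve(f, i, dims, p=2):
    """Return C_i f for a real-valued f on G_0 x ... x G_{k-1}, G_j = F_p^{dims[j]}.

    Assumption: f is real-valued, so conjugation is omitted. The index i is
    zero-based, unlike the one-based directions in the text.
    """
    G_i = list(itertools.product(range(p), repeat=dims[i]))

    def conv(*xs):
        total = Fraction(0)
        for y in G_i:
            shifted = tuple((a + b) % p for a, b in zip(y, xs[i]))
            total += f(*xs[:i], shifted, *xs[i + 1:]) * f(*xs[:i], y, *xs[i + 1:])
        return total / len(G_i)

    return conv


def indicator_Z(x, y):
    """Indicator of Z = {(x, y) : x . y = 0} in F_2^2 x F_2^2."""
    return 1 if (x[0] * y[0] + x[1] * y[1]) % 2 == 0 else 0


C2_Z = convolve(indicator_Z, 1, (2, 2))
```

In this bilinear example each slice $Z_{x}$ is a subspace of $G_{2}$, so the convolution at $(x,y)$ equals the density of $Z_{x}$ when $y\in Z_{x}$ and $0$ otherwise. For instance, `C2_Z((1, 0), (0, 1))` is `Fraction(1, 2)` and `C2_Z((1, 0), (1, 0))` is `0`.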

Theorem 9 (Almost $L^{\infty}$ approximation theorem for convolutions of varieties of low codimension - Conv( $k$ )).

For given $l\in[k]$, there are constants $C=C^{\bm{conv}}_{k,l},D=D^{\bm{conv}}_{k,l}>0$ with the following property. Let $\alpha\colon G_{[k]}\to\mathbb{F}^{r}$ be a multilinear map, $Z=\{x_{[k]}\in G_{[k]}\colon\alpha(x_{[k]})=0\}$ and let $\epsilon>0$. Then there are $s,t\leq C\Big{(}r\log_{|\mathbb{F}|}(|\mathbb{F}|\epsilon^{-1})\Big{)}^{D}$, multiaffine forms $\beta_{i}\colon G_{[k]}\to\mathbb{F}$ for $i\in[s]$, a multiaffine map $\gamma\colon G_{[k]\setminus\{l\}}\to\mathbb{F}^{t}$, constants $c_{1},\dots,c_{m}\in\mathbb{C}$, multiaffine maps $\rho_{1},\dots,\rho_{m}\in\operatorname{span}\{\beta_{[s]}\}$ and layers $L_{1},\dots,L_{n}$ of $\gamma$ such that

$$\bigg{|}\mathbf{C}_{l}\mathbf{C}_{l-1}\cdots\mathbf{C}_{1}Z(x_{[k]})-\sum_{i\in[m]}c_{i}\chi\Big{(}\rho_{i}(x_{[k]})\Big{)}\bigg{|}\leq\epsilon$$

for all $x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{l}\Big{)}$ , $|\cup_{i\in[n]}L_{i}|\leq\epsilon|G_{[k]\setminus\{l\}}|$ and $\sum_{i\in[m]}|c_{i}|\leq 1$ .

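The proof naturally splits into five parts, each showing one of the following implications.

$$\begin{split}\bm{Weak}(k)\implies\bm{Strong}(k)\implies\bm{Inner}(k-1)&\implies\bm{Columns}(k)\\ \Big{(}\bm{Inner}(k-1)\land\bm{Columns}(k-1)\Big{)}&\implies\bm{Conv}(k)\implies\bm{Weak}(k+1).\end{split}$$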

To complete this inductive scheme, we note that $\bm{Strong}(2)$ holds, and this is a simple consequence of linear algebra. Indeed, if $\alpha\colon G_{1}\times G_{2}\to\mathbb{F}$ is a bilinear form such that $\mathop{\mathbb{E}}_{x,y}\chi(\alpha(x,y))\geq c$, writing $\alpha(x,y)=A(x)\cdot y$ for a linear map $A\colon G_{1}\to G_{2}$, we see that $|\{A=0\}|\geq c|G_{1}|$. By the rank-nullity theorem, $A$ has rank $r\leq\log_{|\mathbb{F}|}c^{-1}$; thus there are $v_{1},\dots,v_{r}\in G_{2}$ and linear forms $\beta_{1},\dots,\beta_{r}\colon G_{1}\to\mathbb{F}$ such that $(\forall x\in G_{1})A(x)=\sum_{i\in[r]}v_{i}\beta_{i}(x)$. Thus $\alpha(x,y)=\sum_{i\in[r]}\beta_{i}(x)(v_{i}\cdot y)$, as desired.
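The linear-algebra step in the base case can be sketched as a rank factorization. The code below assumes a prime field $\mathbb{F}_{p}$ and a bilinear form given by a matrix $M$ with $\alpha(x,y)=x^{T}My$; it uses the standard factorization of $M$ into its pivot columns times the non-zero rows of its reduced row echelon form, which is one possible way to obtain the $\beta_{i}$ and $v_{i}$ and is not claimed to be the paper's construction. The names `rank_factorization` and `reconstruct` are hypothetical.

```python
def rank_factorization(M, p):
    """Return (betas, vs) with M[a][j] = sum_i betas[i][a] * vs[i][j] mod p.

    Then x^T M y = sum_i (betas[i] . x) * (vs[i] . y) over F_p (p prime, assumed).
    """
    rows = [[v % p for v in row] for row in M]
    pivots = []
    rank = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], -1, p)  # modular inverse (Python 3.8+)
        rows[rank] = [(v * inv) % p for v in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(a - factor * b) % p for a, b in zip(rows[r], rows[rank])]
        pivots.append(col)
        rank += 1
    # Pivot columns of the original matrix give the linear forms beta_i.
    betas = [[M[a][c] % p for a in range(len(M))] for c in pivots]
    # Non-zero rows of the reduced row echelon form give the vectors v_i.
    vs = rows[:rank]
    return betas, vs


def reconstruct(betas, vs, p):
    """Rebuild the matrix sum_i betas[i]^T vs[i] modulo p."""
    n, m = len(betas[0]), len(vs[0])
    return [
        [sum(b[a] * v[j] for b, v in zip(betas, vs)) % p for j in range(m)]
        for a in range(n)
    ]


# Illustrative rank-2 matrix over F_5: the third row is the sum of the first two.
M_EXAMPLE = [[1, 1, 0], [0, 1, 1], [1, 2, 1]]
BETAS, VS = rank_factorization(M_EXAMPLE, 5)
```

For `M_EXAMPLE` the factorization has two terms and `reconstruct(BETAS, VS, 5)` returns the original matrix, so the form is a sum of two products of linear forms.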

In some sense, all the results above can be seen as corollaries of Theorem 6 (or any other theorem listed here), but this would not be an entirely correct viewpoint since the proof has the structure outlined here. Still, the deduction of the strong inverse theorem from the weak one occupies the largest part of the proof. The crucial idea in this part of the proof is the following proposition. To state it, we need to introduce a notion of connectivity for subsets of $G_{[k]}$ . Namely, we consider $G_{[k]}$ as vertices of a graph $\mathcal{G}$ with edges between points that differ in exactly one coordinate. We say that a set $S\subset G_{[k]}$ is connected if $\mathcal{G}[S]$ is connected. The diameter of $S$ is the largest distance between two vertices in the graph $\mathcal{G}[S]$ .
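The notions of connectivity and diameter just introduced can be computed directly for small sets. The sketch below assumes points are stored as tuples of coordinates, each coordinate itself being a tuple over $\mathbb{F}_{2}$, and uses a plain breadth-first search; the names `adjacent`, `diameter` and `example_set` are hypothetical.

```python
from collections import deque
import itertools


def adjacent(u, v):
    """Two points of G_[k] are joined when they differ in exactly one coordinate."""
    return sum(a != b for a, b in zip(u, v)) == 1


def diameter(S):
    """Diameter of the induced graph on S, or None if S is empty or disconnected."""
    S = list(S)
    if not S:
        return None
    best = 0
    for src in S:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in S:
                if v not in dist and adjacent(u, v):
                    dist[v] = dist[u] + 1
                    queue.append(v)
        if len(dist) < len(S):
            return None  # some point is unreachable from src
        best = max(best, max(dist.values()))
    return best


def example_set():
    """Points (x, y) of F_2^2 x F_2^2 with x . y != 0 (illustrative choice)."""
    V = list(itertools.product((0, 1), repeat=2))
    return [(x, y) for x in V for y in V if (x[0] * y[0] + x[1] * y[1]) % 2 == 1]
```

In this toy example the six points form a single cycle in the induced graph, so the set is connected and `diameter(example_set())` is `3`.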

Proposition 10 (One-sided regularity lemma).

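Let $\rho\colon G_{[k]}\to\mathbb{F},\gamma_{i}\colon G_{I_{i}}\to\mathbb{F}$, $i\in[r]$, be multilinear maps. Let $F=\{i\in[r]\colon I_{i}=[k]\}$. Suppose that

$$\mathop{\mathbb{E}}_{x_{1},\dots,x_{k}}\chi\Big{(}\rho(x_{[k]})-\sum\limits_{i\in F}\lambda_{i}\gamma_{i}(x_{[k]})\Big{)}\leq\eta=2^{-2k}|\mathbb{F}|^{-(k+1)(3r+2)},$$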

for any choice of $\lambda\in\mathbb{F}^{F}$ . Then, the set $\{x_{[k]}\in G_{[k]}\colon(\forall i\in[r])\gamma_{i}(x_{I_{i}})=0,\rho(x_{[k]})\not=0\}$ is connected and of diameter at most $(2k+1)(2^{k}-1)$ .

Thus, if the form $\rho$ is sufficiently quasirandom w.r.t. other forms $\gamma_{i}$ , then the set $\{x_{[k]}\in G_{[k]}\colon(\forall i\in[r])\gamma_{i}(x_{I_{i}})=0,\rho(x_{[k]})\not=0\}$ is well-behaved. For our purposes, this means that we may easily remove $\rho$ from the collection of the considered maps. On the other hand, if neither form is sufficiently quasirandom, then we may replace them by forms that depend on fewer coordinates using the weak inverse theorem.

Another idea that plays a very important role in the proof is the dependent random choice, which takes a particularly simple form in the algebraic setting and allows us to externally approximate dense varieties by low-codimensional varieties very efficiently (Lemma 12).

Acknowledgements. I would like to acknowledge the support of the Ministry of Education, Science and Technological Development of the Republic of Serbia, Grant ON174026. I would also like to thank the anonymous referee for a very careful reading.

**§2 Preliminaries**

From now on, we adopt a non-standard convention and write $\log$ , without subscripts, to be a slightly modified version of the logarithm. Namely, for positive real $x$ , we write $\log x=\log_{|\mathbb{F}|}(|\mathbb{F}|x)=(\log_{|\mathbb{F}|}x)+1$ . This has the merit of being greater than or equal to 1 when $x\geq 1$ , which simplifies the calculations. If we write $\log_{|\mathbb{F}|}$ , we still have its usual meaning in mind.
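For orientation, a few identities follow directly from this definition. They are spelled out here as a convenience and are not taken from the text.

```latex
\begin{align*}
\log 1 &= \log_{|\mathbb{F}|}|\mathbb{F}| = 1,\\
\log(xy) &= \log_{|\mathbb{F}|}x+\log_{|\mathbb{F}|}y+1 = \log x+\log y-1,\\
\log\big(|\mathbb{F}|^{m}\big) &= m+1 \quad\text{for real } m.
\end{align*}
% For x, y >= 1 we have (log x - 1)(log y - 1) >= 0, which rearranges to
\[
\log(xy)=\log x+\log y-1\leq\log x\cdot\log y .
\]
```

The last inequality is one way to see why the shifted logarithm is convenient: for arguments at least $1$, sums of logarithms can be absorbed into products.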

As a warm-up, we show that low-codimensional varieties are necessarily dense. We use this very simple fact in the proofs that follow without explicitly referring to the next lemma.

Lemma 11.

Let $B$ be a variety of codimension $r$ in $G_{[k]}$ . Let $x_{[k]}\in B$ . Then there are at least $|\mathbb{F}|^{-kr}|G_{[k]}|$ points in $B$ at distance at most $k$ from $x_{[k]}$ , where distance is measured in the induced graph $\mathcal{G}[B]$ and $\mathcal{G}$ has the same meaning as in the introduction. In particular, if $B$ is non-empty, then $|B|\geq|\mathbb{F}|^{-kr}|G_{[k]}|$ .

Proof.

For $i\in[k]$ , we show that there is $Y_{i}\subset G_{[i]}$ of size $|Y_{i}|\geq|\mathbb{F}|^{-ir}|G_{[i]}|$ such that for each $y_{[i]}\in Y_{i}$ , the point $(y_{[i]},x_{[i+1,k]})$ belongs to $B$ and is at distance at most $i$ from $x_{[k]}$ . Let $\beta\colon G_{[k]}\to\mathbb{F}^{r}$ be the multiaffine map defining $B$ , thus $B=\{\beta=\lambda\}$ , for some $\lambda\in\mathbb{F}^{r}$ . For $i=1$ , we may take $Y_{1}=\{y_{1}\in G_{1}\colon\beta(y_{1},x_{[2,k]})=\lambda\}\times\{x_{[2,k]}\}$ . The projection of this set to $G_{1}$ is a non-empty (since it contains $x_{1}$ ) coset of codimension at most $r$ , hence the claim follows. Suppose that the claim holds for some $i\leq k-1$ . As in the previous case, for each $y_{[i]}\in Y_{i}$ look at $Z(y_{[i]})=\{z_{i+1}\in G_{i+1}\colon\beta(y_{[i]},z_{i+1},x_{[i+2,k]})=\lambda\}$ , which is again a non-empty (it contains $x_{i+1}$ ) coset of codimension at most $r$ . Taking $Y_{i+1}=\cup_{y_{[i]}\in Y_{i}}y_{[i]}\times Z(y_{[i]})\times\{x_{[i+2,k]}\}$ finishes the proof.∎

When $A\colon G_{[k]}\to H$ is a map, we write $\{A=0\}=\{x_{[k]}\in G_{[k]}\colon A(x_{[k]})=0\}$ , when there is no danger of confusion. Also, if $A_{1},\dots,A_{r}\colon G_{[k]}\to H$ are maps, and $\lambda\in\mathbb{F}^{r}$ , we write $\lambda\cdot A$ for the map $\sum_{i\in[r]}\lambda_{i}A_{i}$ , when there is no danger of confusion.

Lemma 12 (Approximating dense varieties externally).

Let $A\colon G_{[k]}\to H$ be a multiaffine map. Then, there is a multiaffine map $\phi\colon G_{[k]}\to\mathbb{F}^{s}$ such that $\{A=0\}\subset\{\phi=0\}$ and $|\{\phi=0\}\setminus\{A=0\}|\leq|\mathbb{F}|^{-s}|G_{[k]}|$ . If, additionally, $A$ is linear in coordinate $c$ , then so is $\phi$ .

Proof.

Take $h_{1},\dots,h_{s}$ uniformly and independently from $H$ , and set $\phi(x_{[k]})_{i}=A(x_{[k]})\cdot h_{i}$ , $i\in[s]$ . If $A$ is linear in coordinate $c$ , then so is $\phi$ , as required. Notice that $\{A=0\}\subset\{\phi=0\}$ holds immediately. On the other hand, if $x_{[k]}\in G_{[k]}$ satisfies $A(x_{[k]})\not=0$ , then

[TABLE]

Thus, $\mathop{\mathbb{E}}|\{\phi=0\}\setminus\{A=0\}|\leq|\mathbb{F}|^{-s}|\{A\not=0\}|$ , and the claim follows.∎
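The displayed probability in the proof above did not survive in this copy of the text. A plausible reconstruction of the computation, assuming that the dot product on $H$ is non-degenerate (so that $h\mapsto v\cdot h$ is a non-zero linear form on $H$ whenever $v\neq 0$), is the following.

```latex
% Fix x_{[k]} with A(x_{[k]}) != 0; h_1, ..., h_s are independent and uniform in H.
\begin{align*}
\mathbb{P}\big(\phi(x_{[k]})=0\big)
  &=\prod_{i\in[s]}\mathbb{P}_{h_{i}\in H}\big(A(x_{[k]})\cdot h_{i}=0\big)
   =|\mathbb{F}|^{-s},\\
\mathop{\mathbb{E}}\big|\{\phi=0\}\setminus\{A=0\}\big|
  &=\sum_{x_{[k]}\colon A(x_{[k]})\neq 0}\mathbb{P}\big(\phi(x_{[k]})=0\big)
   =|\mathbb{F}|^{-s}\,|\{A\neq 0\}|.
\end{align*}
```

Since the expectation is at most $|\mathbb{F}|^{-s}|G_{[k]}|$, at least one choice of $h_{1},\dots,h_{s}$ attains the bound claimed in the lemma.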

Lemma 13 (Approximating dense varieties simultaneously).

Let $A_{1},\dots,A_{r}\colon G_{[k]}\to H$ be multiaffine maps. Let $\epsilon>0$ . Then, there is $s\leq r+\log_{|\mathbb{F}|}\epsilon^{-1}$ , multiaffine maps $\phi_{1},\dots,\phi_{r}\colon G_{[k]}\to\mathbb{F}^{s}$ such that for each $\lambda\in\mathbb{F}^{r}$ we have $\{\lambda\cdot A=0\}\subset\{\lambda\cdot\phi=0\}$ and $|\{\lambda\cdot\phi=0\}\setminus\{\lambda\cdot A=0\}|\leq\epsilon|G_{[k]}|$ .

Proof.

Consider an auxiliary multiaffine map $\Phi\colon G_{[k]}\times\mathbb{F}^{r}\to H$ , defined by

[TABLE]

Apply Lemma 12 to find $s\leq\log_{|\mathbb{F}|}(|\mathbb{F}|^{r}\epsilon^{-1})=r+\log_{|\mathbb{F}|}\epsilon^{-1}$ and a multiaffine map $\psi\colon G_{[k]}\times\mathbb{F}^{r}\to\mathbb{F}^{s}$ such that $\{\psi=0\}\supset\{\Phi=0\}$ and the difference set has density at most $|\mathbb{F}|^{-r}\epsilon$ in $G_{[k]}\times\mathbb{F}^{r}$ . Furthermore, since $\Phi$ is linear in the last (auxiliary) coordinate, so is $\psi$ . If $e_{1},\dots,e_{r}$ is the standard basis of $\mathbb{F}^{r}$ , let $\phi_{i}$ be defined by $\phi_{i}(x_{[k]})=\psi(x_{[k]},e_{i})$ . Hence, for each $\lambda\in\mathbb{F}^{r}$ , we have that

[TABLE]

and

[TABLE]

as desired.∎
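The three displayed formulas in this proof are missing from this copy of the text. The natural reading, which is an assumption here rather than a quotation, is that $\Phi$ packages all linear combinations of the $A_{i}$ into a single map. Under that assumption the argument can be sketched as follows.

```latex
\begin{align*}
\Phi(x_{[k]},\lambda)&=\sum_{i\in[r]}\lambda_{i}A_{i}(x_{[k]})=(\lambda\cdot A)(x_{[k]}),\\
% psi is linear in the auxiliary coordinate, so it is determined by the values at e_1, ..., e_r:
\psi(x_{[k]},\lambda)&=\sum_{i\in[r]}\lambda_{i}\,\psi(x_{[k]},e_{i})=(\lambda\cdot\phi)(x_{[k]}),\\
% the slice of the difference set at a fixed lambda is no larger than the whole difference set:
\big|\{\lambda\cdot\phi=0\}\setminus\{\lambda\cdot A=0\}\big|
  &\leq\big|\{\psi=0\}\setminus\{\Phi=0\}\big|
   \leq|\mathbb{F}|^{-r}\epsilon\cdot|G_{[k]}|\,|\mathbb{F}|^{r}
   =\epsilon|G_{[k]}|.
\end{align*}
```

The inclusion $\{\lambda\cdot A=0\}\subset\{\lambda\cdot\phi=0\}$ is the slice at $\lambda$ of the inclusion $\{\Phi=0\}\subset\{\psi=0\}$.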

When $A\colon G_{[k]}\to H$ is a multiaffine map, we may write $A(x_{[k]})=\sum_{I\subset[k]}A_{I}(x_{I})$ , for multilinear maps $A_{I}\colon G_{I}\to H$ (for $I=\emptyset$ , $A_{I}$ is a constant, but not necessarily zero). We call $A_{I}$ the multilinear parts of $A$ . We make use of the following observation of Lovett [11].

Lemma 14 (Lovett [11]).

Let $\alpha\colon G_{[k]}\to\mathbb{F}$ be a multiaffine form, with multilinear parts $\alpha_{I}$ . Then

[TABLE]

Sketch proof.

Write $\alpha(x_{[k]})=\alpha^{\prime}(x_{[k-1]})+A(x_{[k-1]})\cdot x_{k}$ , for multiaffine maps $\alpha^{\prime}$ and $A$ . Then

[TABLE]

Apply this observation $k-1$ more times to end up with $\alpha_{[k]}$ only in the final bound.∎
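The displayed step of the sketch is missing from this copy of the text. One way to carry it out, assuming that $\chi$ is non-trivial and that the dot product on $G_{k}$ is non-degenerate, is to average over $x_{k}$ first.

```latex
\begin{align*}
\Big|\mathop{\mathbb{E}}_{x_{[k]}}\chi\big(\alpha(x_{[k]})\big)\Big|
  &=\Big|\mathop{\mathbb{E}}_{x_{[k-1]}}\chi\big(\alpha^{\prime}(x_{[k-1]})\big)\,
      \mathop{\mathbb{E}}_{x_{k}}\chi\big(A(x_{[k-1]})\cdot x_{k}\big)\Big|\\
  &=\Big|\mathop{\mathbb{E}}_{x_{[k-1]}}\chi\big(\alpha^{\prime}(x_{[k-1]})\big)\,
      \mathbf{1}\big[A(x_{[k-1]})=0\big]\Big|\\
  &\leq\mathop{\mathbb{E}}_{x_{[k-1]}}\mathbf{1}\big[A(x_{[k-1]})=0\big]
   =\mathop{\mathbb{E}}_{x_{[k]}}\chi\big(A(x_{[k-1]})\cdot x_{k}\big).
\end{align*}
```

The effect is that the part of $\alpha$ not depending on $x_{k}$ is discarded, leaving a form that is linear in $x_{k}$; repeating this in each remaining coordinate strips the lower-order parts one coordinate at a time.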

Recall the definition of the Gowers box norm (Definition B.1 in Appendix B of [5])

[TABLE]

for a map $f\colon G_{[k]}\to\mathbb{C}$ , where $\operatorname{Conj}$ stands for the conjugation operator. This definition is a generalization of Gowers uniformity norms, which were introduced by Gowers in [2] in additive combinatorics and by Host and Kra in [6] in the context of ergodic theory. For a more detailed discussion of box norms, see [5]. Note that when $\phi\colon G_{[k]}\to\mathbb{F}$ is a multilinear form, we have $\|\chi\circ\phi\|_{\square^{k}}^{2^{k}}=\mathop{\mathbb{E}}_{x_{[k]}\in G_{[k]}}\chi(\phi(x_{[k]}))$ . The following is the Gowers-Cauchy-Schwarz inequality for the box norm.

Proposition 15.

Let $f_{I}\colon G_{[k]}\to\mathbb{C}$ be a function for each $I\subset[k]$ . Then

[TABLE]

This can be proved by induction on $k$ using the Cauchy-Schwarz and Hölder inequalities; for details, see [5]. Note that Proposition 15 also implies a bound resembling that in Lemma 14, but such a bound would be weaker than that in Lemma 14. In fact, in the rest of the paper we only need Lemma 14, but we include the definition of the Gowers box norm and Proposition 15, since we apply it in the introduction in the sketch proof of Theorem 4.

**§3 $\bm{Weak}(k)\implies\bm{Strong}(k)$**

Proof. We begin the deduction of the strong inverse theorem from the weak one by proving the ‘one-sided regularity lemma’.

Proposition 16.

Let $\rho\colon G_{[k]}\to\mathbb{F},\gamma_{i}\colon G_{I_{i}}\to\mathbb{F}$ , $i\in[r]$ be multilinear maps. Let $F=\{i\in[r]\colon I_{i}=[k]\}$ . Suppose that

[TABLE]

for any choice of $\lambda\in\mathbb{F}^{F}$ . Then, the set $\{x_{[k]}\in G_{[k]}\colon(\forall i\in[r])\gamma_{i}(x_{I_{i}})=0,\rho(x_{[k]})\not=0\}$ is connected and of diameter at most $(2k+1)(2^{k}-1)$ .

Proof.

Write $r=r_{0}+r_{1}$ and reorder maps so that $k\in I_{i}$ if and only if $i\in[r_{0}+1,r]$ . Also, write $S_{0}=\{x_{[k-1]}\in G_{[k-1]}\colon(\forall i\in[r_{0}])\gamma_{i}(x_{I_{i}})=0\}$ and $S_{1}=\{x_{[k]}\in G_{[k]}\colon(\forall i\in[r_{0}+1,r])\gamma_{i}(x_{I_{i}})=0,\rho(x_{[k]})\not=0\}$ . The set we are interested in then becomes $S=(S_{0}\times G_{k})\cap S_{1}.$ We first prove that for almost all pairs $x_{[k-1]},$ $y_{[k-1]}\in G_{[k-1]}$ we have some $z\in G_{k}$ such that $(x_{[k-1]},z),(y_{[k-1]},z)\in S_{1}$ .

We may find multilinear maps $\Gamma_{i}\colon G_{I_{i}\setminus\{k\}}\to G_{k},$ $i\in[r_{0}+1,r]$ and $R\colon G_{[k-1]}\to G_{k}$ such that

[TABLE]

Observe that if $x_{[k-1]},y_{[k-1]}\in G_{[k-1]}$ are such that

[TABLE]

then we may certainly get a $z\in G_{k}$ such that $(x_{[k-1]},z),(y_{[k-1]},z)\in S_{1}$ . For the sake of completeness, we include a short proof.

Observation 17.

Let $G$ be an $\mathbb{F}$ -vector space with a dot product $\cdot$ . Let $v_{1},v_{2},u_{1},\dots,u_{m}\in G$ be elements such that $v_{1},v_{2}\notin\operatorname{span}\{u_{1},\dots,u_{m}\}$ . Then we have $z\in G$ such that $v_{1}\cdot z,v_{2}\cdot z\not=0$ , but $u_{i}\cdot z=0$ for all $i\in[m]$ .

Proof of Observation 17.

Suppose the contrary: for any $z$ with $u_{i}\cdot z=0$ for all $i\in[m]$ , we have $v_{1}\cdot z=0$ or $v_{2}\cdot z=0$ . Suppose that we have $z_{1}$ and $z_{2}$ such that $u_{i}\cdot z_{1}=0$ and $u_{i}\cdot z_{2}=0$ for all $i\in[m]$ , but $v_{1}\cdot z_{1}\not=0,v_{2}\cdot z_{2}\not=0$ . Then $v_{2}\cdot z_{1}=0,v_{1}\cdot z_{2}=0$ and hence $(\forall i\in[m])u_{i}\cdot(z_{1}+z_{2})=0$ , $v_{1}\cdot(z_{1}+z_{2})\not=0,v_{2}\cdot(z_{1}+z_{2})\not=0$ , which is a contradiction. Hence, w.l.o.g. we have that whenever $u_{i}\cdot z=0$ for all $i\in[m]$ , then $v_{1}\cdot z=0$ . But then $v_{1}\in\operatorname{span}\{u_{1},\dots,u_{m}\}$ , which is the final contradiction.∎

We count the number of pairs $x_{[k-1]},y_{[k-1]}\in G_{[k-1]}$ such that

[TABLE]

by counting for each linear combination $\lambda,\mu\in\mathbb{F}^{[r_{0}+1,r]}$ how often

[TABLE]

happens (and analogously for $R(y_{[k-1]})$ ). The density of such pairs is exactly

[TABLE]

Using Lemma 14, we may bound (1) by

[TABLE]

where $F\subset[r]$ is the set of all $i$ such that $I_{i}=[k]$ . From this we deduce that for all but at most $2|\mathbb{F}|^{2r}\eta$ proportion of pairs $x_{[k-1]},$ $y_{[k-1]}\in G_{[k-1]}$ we have some $z\in G_{k}$ such that $(x_{[k-1]},z),$ $(y_{[k-1]},z)\in S_{1}$ . Moreover, for any given pair, we have at least $|\mathbb{F}|^{-2(r+1)}|G_{k}|$ such $z$ , since for $\lambda_{1},\lambda_{2}\in\mathbb{F}$

[TABLE]

is an at most $2(r+1)$ -codimensional coset.

Write $F^{\prime}=\{i\in[r]\colon I_{i}=[k-1]\}$ . Fix $t\in G_{k}$ , and consider the analytic rank of the map $\tau_{t,\lambda}\colon G_{[k-1]}\to\mathbb{F}$ defined by

[TABLE]

for $\lambda\in\mathbb{F}^{F\cup F^{\prime}}$ . If analytic ranks of $\tau_{t,\lambda}$ are small for all choices of $\lambda\in\mathbb{F}^{F\cup F^{\prime}}$ , then the induction hypothesis applies, and $S_{k\colon t}$ (recall that this is the slice notation $S_{k\colon t}=\{z_{[k-1]}\in G_{[k-1]}\colon(z_{[k-1]},t)\in S\}$ ) is connected and of diameter at most $(2k-1)(2^{k-1}-1)$ . For any fixed $\lambda\in\mathbb{F}^{F\cup F^{\prime}}$ , we have

[TABLE]

hence, by averaging, we obtain a set $T\subset G_{k}$ of density at least $1-\frac{1}{2}|\mathbb{F}|^{-2r-2}$ , such that

[TABLE]

Consequently, by induction hypothesis, for each $z\in T$ , $S_{k\colon z}$ is connected and of diameter at most $(2k-1)(2^{k-1}-1)$ .

We are now ready to prove that $S$ is connected and of bounded diameter. We do this in two steps: the first is to show that this holds for a very large subset of $S$ , and the second is to extend this to the whole of $S$ .

Let $X\subset G_{[k-1]}$ be the set of all $x_{[k-1]}\in G_{[k-1]}$ such that for proportion at least $1-2|\mathbb{F}|^{r}\sqrt{\eta}$ of $y_{[k-1]}\in G_{[k-1]}$ , we have at least $|\mathbb{F}|^{-2(r+1)}|G_{k}|$ of $z\in G_{k}$ such that $(x_{[k-1]},z),(y_{[k-1]},z)\in S_{1}$ . By the previous argument, we get $|X|\geq\Big{(}1-2|\mathbb{F}|^{r}\sqrt{\eta}\Big{)}|G_{[k-1]}|$ . We claim that $(X\times G_{k})\cap S$ is connected and of diameter at most $(2k-1)(2^{k}-2)+3$ . Indeed, let $x_{[k]},y_{[k]}\in(X\times G_{k})\cap S$ . Since $|S_{0}|\geq|\mathbb{F}|^{-(k-1)r}|G_{[k-1]}|>4|\mathbb{F}|^{r}\sqrt{\eta}|G_{[k-1]}|$ , by the way we defined $X$ , we may find some $u_{[k-1]}\in S_{0}$ such that we have at least $|\mathbb{F}|^{-2(r+1)}|G_{k}|$ of $z\in G_{k}$ such that $(x_{[k-1]},z),$ $(u_{[k-1]},z)\in S_{1}$ , and we also have at least $|\mathbb{F}|^{-2(r+1)}|G_{k}|$ of $z^{\prime}\in G_{k}$ such that $(y_{[k-1]},z^{\prime}),$ $(u_{[k-1]},z^{\prime})\in S_{1}$ . In particular, recalling that $T\subset G_{k}$ satisfies $|T|\geq\Big{(}1-\frac{1}{2}|\mathbb{F}|^{-2r-2}\Big{)}|G_{k}|$ , we have a choice of $z,z^{\prime}\in T$ with the above properties. But, $S_{k\colon z}$ and $S_{k\colon z^{\prime}}$ are connected and of diameter at most $(2k-1)(2^{k-1}-1)$ , which completes the first step.

Finally, take any $x_{[k]}\in S$ . Since $x_{[k]}$ is at distance at most $k$ to at least $|\mathbb{F}|^{-k(r+1)}|G_{[k]}|$ of points in $S$ , and $|S\setminus(X\times G_{k})|\leq|G_{[k-1]}\setminus X||G_{k}|\leq 2|\mathbb{F}|^{r}\sqrt{\eta}|G_{[k]}|$ , at least one such point lies in $(X\times G_{k})\cap S$ , and we are done, with the final diameter bound being $(2k-1)(2^{k}-2)+3+2k\leq(2k+1)(2^{k}-1)$ .∎
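The two diameter bounds in this proof can be checked by direct arithmetic. In the sketch below, $d=(2k-1)(2^{k-1}-1)$ denotes the slice diameter supplied by the induction hypothesis; the symbol $d$ and the path description in the comment are our reading of the first step rather than a quotation.

```latex
% Path: x -> (x_{[k-1]},z) -> (u_{[k-1]},z) -> (u_{[k-1]},z') -> (y_{[k-1]},z') -> y,
% i.e. three single moves in coordinate k and two walks inside the slices at z and z'.
\begin{align*}
2d+3&=(2k-1)(2^{k}-2)+3,\\
(2k+1)(2^{k}-1)-\big((2k-1)(2^{k}-2)+3+2k\big)&=2^{k+1}-6\;\geq\;0\quad\text{for }k\geq 2.
\end{align*}
```

The additional $2k$ accounts for moving each of the two arbitrary points of $S$ into $(X\times G_{k})\cap S$ within $k$ steps.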

Let $\overline{C}=C^{\bm{weak}}_{k}$ and $\overline{D}=D^{\bm{weak}}_{k}$ .

Proof of Theorem 6.

Apply Theorem 5 to $\alpha$ to find $m_{0}\leq\overline{C}\log^{\overline{D}}c^{-1}$ and multilinear $\beta_{i}\colon G_{I_{i}}\to\mathbb{F}$ , $i\in[m_{0}]$ where $\emptyset\not=I_{i}\subset[k-1]$ and

[TABLE]

Write $\alpha(x_{[k]})=A(x_{[k-1]})\cdot x_{k}$ for a multilinear map $A\colon G_{[k-1]}\to G_{k}$ . Thus, we also have

[TABLE]

For a set $Q$ , the power-set of $Q$ is the collection of all subsets of $Q$ (including $\emptyset$ and $Q$ itself) and is denoted by $\mathcal{P}Q$ . By induction on the size of an up-set $\mathcal{F}\subset\mathcal{P}[k-1]$ (that is, a collection of sets closed under taking supersets), we prove the following proposition.

Proposition 18.

Let $\mathcal{F}\subset\mathcal{P}[k-1]$ be an up-set. Then there are constants $C_{\mathcal{F}},D_{\mathcal{F}}$ with the following property. We may find $m_{\mathcal{F}},n_{\mathcal{F}}\leq C_{\mathcal{F}}\log^{D_{\mathcal{F}}}c^{-1}$ , a collection of multilinear maps $\rho^{\mathcal{F}}_{i}\colon G_{J^{\mathcal{F}}_{i}}\to\mathbb{F}$ , where $i\in[n_{\mathcal{F}}]$ , $\emptyset\not=J^{\mathcal{F}}_{i}\subset[k-1]$ , points $y^{\mathcal{F},i}_{J^{\mathcal{F}}_{i}}\in G_{J^{\mathcal{F}}_{i}}$ , another collection of multilinear maps $\beta^{\mathcal{F}}_{i}\colon G_{I^{\mathcal{F}}_{i}}\to\mathbb{F}$ , where $i\in[m_{\mathcal{F}}]$ , $\emptyset\not=I^{\mathcal{F}}_{i}\in\mathcal{P}[k-1]\setminus\mathcal{F}$ , such that the multilinear map $A^{\mathcal{F}}\colon G_{[k-1]}\to G_{k}$ defined as

[TABLE]

satisfies

[TABLE]

We may take

[TABLE]

Proof of Proposition 18.

For $\mathcal{F}=\emptyset$ , we take the given maps $\beta_{i}$ , and hence $n_{\emptyset}=0,m_{\emptyset}=m_{0}$ . Let $\mathcal{F}\cup\{S\}$ be a given up-set, where $S$ is a minimal set inside it, thus making $\mathcal{F}$ an up-set as well. Assume that the claim holds for $\mathcal{F}$ , and that we get multilinear maps $\beta^{\mathcal{F}}_{i}\colon G_{I^{\mathcal{F}}_{i}}\to\mathbb{F}$ , $i\in[m_{\mathcal{F}}]$ , and $A^{\prime}=A^{\mathcal{F}}$ , with the property above. If no $i\in[m_{\mathcal{F}}]$ satisfies $I^{\mathcal{F}}_{i}=S$ , then the same collection works for $\mathcal{F}\cup\{S\}$ . Thus, after reordering maps $\beta^{\mathcal{F}}_{i}$ if necessary, assume that $I^{\mathcal{F}}_{i}=S$ if and only if $i\in[s]$ . Let $\{\lambda^{1},\dots,\lambda^{d}\}\subset\mathbb{F}^{s}$ be a maximal independent set such that for each $j\in[d]$

[TABLE]

Thus, if we extend $\lambda^{1},\dots,\lambda^{d}$ by further $\mu^{1},\dots,\mu^{s-d}$ to a basis of $\mathbb{F}^{s}$ , and set

[TABLE]

for $i\in[d]$ , and

[TABLE]

for $i\in[s-d]$ , then we have the following properties:

(i)

$(\forall i\in[d])$ , $\mathop{\mathbb{E}}_{x_{S}}\chi(\gamma_{i}(x_{S}))\geq\eta$ ,

(ii)

$(\forall\nu\in\mathbb{F}^{d}$ , $\tau\in\mathbb{F}^{s-d}\setminus\{0\})$ , $\mathop{\mathbb{E}}_{x_{S}}\chi\Big{(}\sum_{i\in[d]}\nu_{i}\gamma_{i}(x_{S})+\sum_{i\in[s-d]}\tau_{i}\rho_{i}(x_{S})\Big{)}\leq\eta$ ,

(iii)

$\bigg{\{}x_{[k-1]}\in G_{[k-1]}\colon(\forall i\in[d])\gamma_{i}(x_{S})=0,(\forall i\in[s-d])\rho_{i}(x_{S})=0,(\forall i\in[s+1,m_{\mathcal{F}}])\beta^{\mathcal{F}}_{i}(x_{I^{\mathcal{F}}_{i}})=0\bigg{\}}$

$\phantom{a}\hskip 28.45274pt\subset\{x_{[k-1]}\in G_{[k-1]}\colon A^{\prime}(x_{[k-1]})=0\}.$

We first deal with $\rho_{i}$ . Let $F=\{i\in[s+1,m_{\mathcal{F}}]\colon I^{\mathcal{F}}_{i}\subset[k-1]\setminus S\}$ and $Z=\Big{\{}x_{[k-1]\setminus S}\in G_{[k-1]\setminus S}\colon(\forall i\in F)\beta^{\mathcal{F}}_{i}(x_{I^{\mathcal{F}}_{i}})=0\Big{\}}$ . Property (ii) and Proposition 16 imply that for each $z_{[k-1]\setminus S}\in G_{[k-1]\setminus S}$ , for each $i\in[s-d]$ , the set

[TABLE]

is connected.

Next, we pick points $y^{i}_{S}\in G_{S}$ for each $i\in[s-d]$ and define

[TABLE]

We claim that we may choose the points $y_{S}^{1},\dots,y_{S}^{s-d}\in G_{S}$ so that

[TABLE]

Indeed, we may certainly satisfy this condition, since otherwise we get that some $\rho_{i}=0$ whenever the maps $\rho_{j}$ for $j\not=i$ and $\gamma_{j}$ are all zero, and we may discard $\rho_{i}$ .

Next, we set

[TABLE]

and we observe the following.

Proposition 19.

For all $x_{[k-1]}\in W\cap(G_{S}\times(Z\cap V))$ we have the equality

[TABLE]

Proof of Proposition 19.

Fix $z_{[k-1]\setminus S}\in Z\cap V$ . We show that for all $x_{S}\in W_{z_{[k-1]\setminus S}}$

[TABLE]

holds. We argue by induction on the maximal index $i\in[s-d]$ such that $\rho_{i}(x_{S})\not=0$ , and if there are no such $i$ , we put $i=0$ . Thus the base case is when all $\rho_{i}(x_{S})=0$ . However, since $(x_{S},z_{[k-1]\setminus S})\in W$ and $z_{[k-1]\setminus S}\in Z$ , by property (iii), it follows that $A^{\prime}(x_{S},z_{[k-1]\setminus S})=0$ . The right hand side is also zero, so the identity holds.

Assume now that the claim holds for values of $i$ smaller than some $i_{0}\geq 1$ , and that we are given $x_{S}\in W_{z_{[k-1]\setminus S}}$ such that $\rho_{i_{0}}(x_{S})\not=0$ , but $\rho_{i}(x_{S})=0$ for $i>i_{0}$ . Recall that the set

[TABLE]

is connected. Thus, since $x_{S},y^{i_{0}}_{S}\in R$ , there is a sequence of points $x_{S}=x^{0}_{S},x^{1}_{S},\dots,x^{l}_{S}=y^{i_{0}}_{S}$ inside this set, such that every two consecutive points differ in exactly one coordinate (we call such points neighbouring). We introduce the following piece of notation. When $a$ and $b$ are two points in $G_{[k]}$ differing only in coordinate $i$ , we write $a-b$ to be the point with coordinates $(a-b)_{j}=a_{j}=b_{j}$ for $j\not=i$ and $(a-b)_{i}=a_{i}-b_{i}$ . This notation makes calculations much neater, since for a multilinear map $\phi$ on $G_{[k]}$ , we have $\phi(a-b)=\phi(a)-\phi(b)$ .

Let $\tau_{i}=\rho_{i_{0}}(x^{i}_{S})$ for $i\in[0,l]$ . If $x^{0}_{S}$ and $x^{1}_{S}$ differ in coordinate $c_{0}$ , multiply each of points $x^{1}_{S},\dots,x^{l}_{S}$ at coordinate $c_{0}$ by $\tau_{0}\tau_{1}^{-1}$ . The new points still have the property that the consecutive points are either neighbouring or identical, and that they all lie in the set $R$ . Misusing the notation, we keep writing $x^{i}_{S}$ for the modified points. If $c_{1}$ is the coordinate where $x^{1}_{S}$ and $x^{2}_{S}$ differ, multiply all points among $x^{2}_{S},\dots,x^{l}_{S}$ by $\tau_{1}\tau_{2}^{-1}$ at coordinate $c_{1}$ , and proceed. Hence, we end up with a sequence $x_{S}=x^{0}_{S},x^{1}_{S},\dots,x^{l}_{S}$ in $R$ such that for each $s\in S$ , $x^{l}_{s}=\sigma_{s}y^{i_{0}}_{s}$ for some $\sigma_{s}\in\mathbb{F}\setminus\{0\}$ , the consecutive points are either neighbouring or identical and $(\forall i\in[0,l])\rho_{i_{0}}(x^{i}_{S})=\tau_{0}$ . Hence $\rho_{j}(x^{i}_{S}-x^{i+1}_{S})=0$ for $j\in[i_{0},s-d]$ . Thus, we get

[TABLE]

as desired.∎

Hence, the multilinear map $A^{\mathcal{F}\cup\{S\}}$ defined by

[TABLE]

satisfies

[TABLE]

On the other hand, recalling the property (i), applying Theorem 5 to each $\gamma_{i}$ , allows us to find

[TABLE]

and further multilinear maps $\beta^{\prime}_{i,j}\colon G_{I_{i,j}}\to\mathbb{F}$ , $j\in[m_{i}]$ , $\emptyset\not=I_{i,j}\subset S\setminus\{\max S\}$ such that $\{x_{S}\in G_{S}\colon(\forall j\in[m_{i}])\beta^{\prime}_{i,j}(x_{I_{i,j}})=0\}\subset\{x_{S}\in G_{S}\colon\gamma_{i}(x_{S})=0\}$ . Thus

[TABLE]

as desired. When it comes to bounds, we may take

[TABLE]

and

[TABLE]

This finishes the proof.∎

Applying Proposition 18 with $\mathcal{F}=\mathcal{P}[k-1]$ implies that

[TABLE]

for all $x_{[k-1]}\in G_{[k-1]}$ . Taking dot product with $x_{k}$ completes the proof.∎

Hence, we may take $C^{\bm{strong}}_{k}=\Big{(}3\overline{C}(7k)^{\overline{D}}\Big{)}^{(\overline{D}+1)^{2^{k}}}$ , and $D^{\bm{strong}}_{k}=\overline{D}(\overline{D}+1)^{2^{k}}$ , where $\overline{C}=C^{\bm{weak}}_{k}$ and $\overline{D}=D^{\bm{weak}}_{k}$ . Hence, if $t$ is a quantity such that $C^{\bm{weak}}_{k}\leq 2^{k^{2^{t}}}$ and $D^{\bm{weak}}_{k}\leq 2^{2^{t}}$ , then

[TABLE]

∎

**§4 $\bm{Strong}(k)\implies\bm{Inner}(k-1)$**

Proof. Let $\overline{C}=C^{\bm{strong}}_{k}$ and $\overline{D}=D^{\bm{strong}}_{k}$ . The theorem will follow from the following proposition. For a given $\mathcal{F}\subset\mathcal{P}([k-1])$ , we say that a multiaffine map $B\colon G_{[k-1]}\to H$ is $\mathcal{F}$ -supported if it can be written as $B(x_{[k-1]})=\sum_{I\in\mathcal{F}}B_{I}(x_{I})$ , for some multilinear maps $B_{I}\colon G_{I}\to H$ .

Proposition 20.

Let $\mathcal{F}\subset\mathcal{P}([k-1])$ be a non-empty down-set (that is, a collection of sets closed under taking subsets). Then, there are constants $C_{\mathcal{F}},D_{\mathcal{F}}$ with the following property. Let $\epsilon>0$ and let $B_{1},\dots,B_{r}\colon G_{[k-1]}\to H$ be multiaffine maps. For $\lambda\in\mathbb{F}^{r}$ , let $Z_{\lambda}=\{x_{[k-1]}\in G_{[k-1]}\colon\sum_{i\in[r]}\lambda_{i}B_{i}(x_{[k-1]})=0\}$ . Then, there are $s,t\leq C_{\mathcal{F}}\Big{(}r\log\epsilon^{-1}\Big{)}^{D_{\mathcal{F}}}$ , a multiaffine map $\beta\colon G_{[k-1]}\to\mathbb{F}^{s}$ , and a collection of $\mathcal{F}$ -supported multiaffine maps $\Gamma_{1},\dots,\Gamma_{t}\colon G_{[k-1]}\to H$ such that:

(i)

for each $\lambda\in\mathbb{F}^{r}$ , there are distinct layers $L_{1},\dots,L_{m}$ of $\beta$ and multiaffine maps $A_{1},\dots,A_{m}\in\operatorname{span}\{\Gamma_{[t]}\}$ , such that $Z_{\lambda}\cap L_{i}=\{x_{[k-1]}\in G_{[k-1]}\colon A_{i}(x_{[k-1]})=0\}\cap L_{i}$ ,

(ii)

$\Big{|}Z_{\lambda}\setminus\Big{(}\cup_{i\in[m]}(Z_{\lambda}\cap L_{i})\Big{)}\Big{|}\leq\frac{2^{k}-|\mathcal{F}|}{2^{k}}\epsilon|G_{[k-1]}|$ .

We may take $C_{\mathcal{F}}=\Big{(}2\overline{C}(2k)^{\overline{D}}\Big{)}^{(\overline{D}+1)^{2^{k}-|\mathcal{F}|}}$ and $D_{\mathcal{F}}=(2\overline{D}+2)^{2^{k}-|\mathcal{F}|}$ .

Proof.

We prove the claim by downwards induction on $|\mathcal{F}|$ . The base case is $\mathcal{F}=\mathcal{P}([k-1])$ , in which case we take $s=1$ , $\beta=0$ , $t=r$ and $\Gamma_{i}=B_{i}$ .

Assume that we have proved the claim for some $\mathcal{F}$ . Let $C=C_{\mathcal{F}}$ and $D=D_{\mathcal{F}}$ . Let $\beta,\Gamma_{[t]}$ be as above for the choice of $\mathcal{F}$ as the down-set. Thus $s,t\leq C\log^{D}\epsilon^{-1}r^{D}$ . Let $I_{0}$ be a maximal set in $\mathcal{F}$ . The set $I_{0}$ is non-empty, since we are done otherwise. Let $\mathcal{F}^{\prime}=\mathcal{F}\setminus\{I_{0}\}$ . Write each $\Gamma_{i}(x_{[k-1]})=\Gamma^{\prime}_{i}(x_{[k-1]})+\Psi_{i}(x_{I_{0}})$ , for an $\mathcal{F}^{\prime}$ -supported multiaffine map $\Gamma^{\prime}_{i}$ and a multilinear map $\Psi_{i}\colon G_{I_{0}}\to H$ . Look at a maximal independent set $\lambda^{1},\dots,\lambda^{d}\in\mathbb{F}^{t}$ such that for each $i\in[d]$ , $\mathop{\mathbb{E}}_{x_{I_{0}},h}\chi\Big{(}\Big{(}\sum_{j\in[t]}\lambda^{i}_{j}\Psi_{j}(x_{I_{0}})\Big{)}\cdot h\Big{)}\geq\nu=\epsilon 2^{-k}|\mathbb{F}|^{-s}$ . For each $i\in[d]$ , we apply Theorem 6 to the multilinear map on $G_{I_{0}}\times H$ given by $(x_{I_{0}},h)\mapsto\big{(}\sum_{j\in[t]}\lambda^{i}_{j}\Psi_{j}(x_{I_{0}})\big{)}\cdot h$ , to find $r_{i}\leq\overline{C}\log^{\overline{D}}\nu^{-1}$ , multilinear maps $\gamma^{i}_{j}\colon G_{J_{ij}}\to\mathbb{F}$ and $\Lambda^{i}_{j}\colon G_{I_{0}\setminus J_{ij}}\to H$ , for $j\in[r_{i}]$ , where $\emptyset\not=J_{ij}\subset I_{0}$ , such that

[TABLE]

Define a multiaffine map $\beta^{\prime}\colon G_{[k-1]}\to\mathbb{F}^{s}\times\mathbb{F}^{\{(i,j)\colon i\in[d],j\in[r_{i}]\}}$ by

[TABLE]

We now show that $\beta^{\prime}$ and $\Gamma^{\prime}_{[t]},\Lambda^{i}_{j}$ , $i\in[d],j\in[r_{i}]$ have the desired properties for the down-set $\mathcal{F}^{\prime}$ .

Take arbitrary $\lambda\in\mathbb{F}^{r}$ and consider $Z_{\lambda}$ . By induction hypothesis, there are distinct layers $L_{1},\dots,L_{m}$ of $\beta$ and multiaffine maps $A_{1},\dots,A_{m}\in\operatorname{span}\{\Gamma_{[t]}\}$ , such that $Z_{\lambda}\cap L_{i}=\{x_{[k-1]}\in G_{[k-1]}\colon A_{i}(x_{[k-1]})=0\}\cap L_{i}$ , and $\Big{|}Z_{\lambda}\setminus\Big{(}\cup_{i\in[m]}(Z_{\lambda}\cap L_{i})\Big{)}\Big{|}\leq\frac{2^{k}-|\mathcal{F}|}{2^{k}}\epsilon|G_{[k-1]}|$ . Here $m\leq|\mathbb{F}|^{s}$ . For each $i\in[m]$ , write $A_{i}(x_{[k-1]})=A^{\prime}_{i}(x_{[k-1]})+C_{i}(x_{I_{0}})$ so that $A^{\prime}_{i}\in\operatorname{span}\{\Gamma^{\prime}_{[t]}\}$ and $C_{i}\in\operatorname{span}\{\Psi_{[t]}\}$ . Thus, either $\mathop{\mathbb{E}}_{x_{I_{0}}\in G_{I_{0}},h\in H}\chi(C_{i}(x_{I_{0}})\cdot h)<\nu$ , or $C_{i}$ is a linear combination of $\lambda^{1}\cdot\Psi,\dots,\lambda^{d}\cdot\Psi$ .

In the former case, by Lemma 14, this means that

[TABLE]

where $\tau$ is the map defined by $(x_{I_{0}},h)\mapsto C_{i}(x_{I_{0}})\cdot h$ . We choose to discard these layers. By doing so, we lose at most $2^{-k}\epsilon$ of density in total.

In the latter case, we may find $\mu\in\mathbb{F}^{d}$ such that $C_{i}(x_{I_{0}})=\sum_{j_{1}\in[d],j_{2}\in[r_{j_{1}}]}\mu_{j_{1}}\gamma_{j_{2}}^{j_{1}}(x_{J_{j_{1}j_{2}}})\Lambda^{j_{1}}_{j_{2}}(x_{I_{0}\setminus J_{j_{1}j_{2}}})$ . Partition $L_{i}$ into layers $M_{1},\dots,M_{l}$ of $\beta^{\prime}$ . On each layer $M_{j}$ , $C_{i}$ becomes a linear combination of $\Lambda^{j_{1}}_{j_{2}}$ , and thus $A_{i}$ becomes a linear combination of $\Gamma^{\prime}_{j_{1}},\Lambda^{j_{1}}_{j_{2}}$ , finishing the proof.

For the bounds, observe the codimension of $\beta^{\prime}$ is at most

[TABLE]

and the number of maps $\Gamma^{\prime}_{j_{1}},\Lambda^{j_{1}}_{j_{2}}$ is at most

[TABLE]

as desired.∎

The theorem follows by applying the proposition for $\mathcal{F}=\{\emptyset\}$ . Then each $Z_{\lambda}\cap L_{i}=L_{i}$ is a layer of $\beta$ . The constants may be taken to be

[TABLE]

where $\overline{C}=C^{\bm{strong}}_{k}$ and $\overline{D}=D^{\bm{strong}}_{k}$ . Hence, if $t$ is a quantity such that $C^{\bm{weak}}_{k}\leq 2^{k^{2^{t}}}$ and $D^{\bm{weak}}_{k}\leq 2^{2^{t}}$ , using (4), then

[TABLE]

∎

**§5 $\bm{Inner}(k-1)\implies\bm{Columns}(k)$**

Proof. We begin the proof by proving the following lemma.

Lemma 21.

Let $G$ be an $\mathbb{F}$ -vector space. Let $x_{1},\dots,x_{r}\in G$ and let $\lambda_{1},\dots,\lambda_{r}\in\mathbb{F}$ . The following are equivalent.

(i)

There is $y\in G$ such that $x_{i}\cdot y=\lambda_{i}$ for each $i\in[r]$ .

(ii)

$(\forall\mu\in\mathbb{F}^{r})\sum_{i\in[r]}\mu_{i}x_{i}=0\implies\mu\cdot\lambda=0$ .

Proof.

(i) $\implies$ (ii): Suppose that $\sum_{i\in[r]}\mu_{i}x_{i}=0$ . Take dot product with $y$ .

(ii) $\implies$ (i): Take a maximal independent set $\{x_{i_{1}},\dots,x_{i_{s}}\}$ among $x_{i}$ . Renaming $x_{i}$ if necessary, we may assume that this set is $\{x_{1},\dots,x_{s}\}$ . For $i\in[s+1,r]$ , we have some $\mu^{i}\in\mathbb{F}^{s}$ such that $x_{i}=\sum_{j\in[s]}\mu^{i}_{j}x_{j}$ . By property (ii) we also have $\lambda_{i}=\sum_{j\in[s]}\mu^{i}_{j}\lambda_{j}$ . Since $x_{1},\dots,x_{s}$ are independent, there is $y$ such that $x_{i}\cdot y=\lambda_{i}$ for each $i\in[s]$ . Property (i) follows since $\lambda_{i}$ for $i\in[s+1,r]$ satisfy the identities above.∎
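The two computations that the proof leaves implicit can be written out as follows. This only expands the steps "take dot product with $y$" and "satisfy the identities above", using the notation of the proof.

```latex
\begin{align*}
\text{(i)}\Rightarrow\text{(ii)}:\quad
0&=\Big(\sum_{i\in[r]}\mu_{i}x_{i}\Big)\cdot y
  =\sum_{i\in[r]}\mu_{i}(x_{i}\cdot y)
  =\sum_{i\in[r]}\mu_{i}\lambda_{i}=\mu\cdot\lambda,\\
% for the dependent vectors x_i with i in [s+1, r]:
\text{(ii)}\Rightarrow\text{(i)}:\quad
x_{i}\cdot y&=\sum_{j\in[s]}\mu^{i}_{j}(x_{j}\cdot y)
  =\sum_{j\in[s]}\mu^{i}_{j}\lambda_{j}=\lambda_{i}.
\end{align*}
```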

Write $\alpha_{i}(x_{[k]})=\alpha^{\prime}_{i}(x_{[k-1]})+x_{k}\cdot A_{i}(x_{[k-1]})$ for multiaffine maps $\alpha^{\prime}_{i}\colon G_{[k-1]}\to\mathbb{F}$ and $A_{i}\colon G_{[k-1]}\to G_{k}$ . For each coset $\Lambda\subset\mathbb{F}^{r}$ define $V_{\Lambda}=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon(\forall\lambda\in\Lambda)(\exists y\in G_{k})\alpha(x_{[k-1]},y)=\lambda\Big{\}}$ . Applying the lemma above, we may rewrite this set as

[TABLE]

Notice that for each $x_{[k-1]}\in G_{[k-1]}$ , the set $\{\alpha(x_{[k-1]},y)\colon y\in G_{k}\}$ is a coset $\Lambda$ in $\mathbb{F}^{r}$ , and that $|\{y\in G_{k}\colon\alpha(x_{[k-1]},y)\in S\}|=\frac{|\Lambda\cap S|}{|\Lambda|}|G_{k}|$ . Thus, $X$ is the union of sets of the form $V_{\Lambda}\setminus\Big{(}\cup_{\begin{subarray}{c}\Lambda\subsetneq M\subseteq\mathbb{F}^{r}\\ M\text{ coset}\end{subarray}}V_{M}\Big{)}$ , where $\Lambda$ are cosets in $\mathbb{F}^{r}$ such that $|\Lambda\cap S|\geq\epsilon|\Lambda|$ .
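The counting identity above comes from the fact that, for fixed $x_{[k-1]}$, the map $y\mapsto\alpha(x_{[k-1]},y)$ is affine from $G_{k}$ to $\mathbb{F}^{r}$, so all of its non-empty fibres have the same size. A short sketch, where $K$ is a name introduced here for the kernel of the linear part $y\mapsto\big(y\cdot A_{i}(x_{[k-1]})\big)_{i\in[r]}$:

```latex
\begin{align*}
\big|\{y\in G_{k}\colon\alpha(x_{[k-1]},y)=\lambda\}\big|&=|K|=\frac{|G_{k}|}{|\Lambda|}
  \quad\text{for each }\lambda\in\Lambda,\\
\big|\{y\in G_{k}\colon\alpha(x_{[k-1]},y)\in S\}\big|
  &=\sum_{\lambda\in\Lambda\cap S}\frac{|G_{k}|}{|\Lambda|}
   =\frac{|\Lambda\cap S|}{|\Lambda|}\,|G_{k}|.
\end{align*}
```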

Next, for each $M\leq\mathbb{F}^{r}$ , define

[TABLE]

Thus

[TABLE]

Let $\Lambda=\lambda^{0}+\Lambda^{0}$ , for a subspace $\Lambda^{0}$ . Notice that

[TABLE]

Hence

[TABLE]

Apply Theorem 7 to $A_{1},\dots,A_{r}$ to find $s\leq C^{\bm{inner}}_{k-1}\Big{(}(4r^{3}+3r^{2})\log\epsilon^{-1}\Big{)}^{D^{\bm{inner}}_{k-1}}$ , a multiaffine map $\beta\colon G_{[k-1]}\to\mathbb{F}^{s}$ so that for each $\lambda\in\mathbb{F}^{r}$ the set $\{\lambda\cdot A=0\}$ can be internally approximated by layers of $\beta$ up to error of at most $\epsilon|\mathbb{F}|^{-(4r^{2}+3r)}$ in density. Notice that, if $M=\langle\mu^{1},\dots,\mu^{d}\rangle$ , where $d\leq r$ , then

[TABLE]

Recall that $W_{\langle\mu^{i}\rangle}=\Big{\{}x_{[k-1]}\in G_{[k-1]}\colon\sum_{j\in[r]}\mu^{i}_{j}A_{j}(x_{[k-1]})=0\Big{\}}$ , so we may internally approximate $W_{\langle\mu^{i}\rangle}$ by a union $D_{i}$ of layers of $\beta$ with error of density at most $\epsilon|\mathbb{F}|^{-(4r^{2}+3r)}$ . Thus $W_{M}\supset\cap_{i\in[d]}D_{i}$ and

[TABLE]

Also, apply Lemma 13 to $A_{1},\dots,A_{r}$ to find multiaffine maps $\gamma_{1},\dots,\gamma_{r}\colon G_{[k-1]}\to\mathbb{F}^{t}$ , where $t\leq(5r^{3}+2r^{2})\log\epsilon^{-1}$ such that each $W_{\langle\mu\rangle}$ can be externally approximated by $\{\mu\cdot\gamma=0\}$ with error of density at most $|\mathbb{F}|^{-5r^{2}-2r}\epsilon$ . Thus,

[TABLE]

Finally, define a multiaffine map $\phi\colon G_{[k-1]}\to\mathbb{F}^{r}\times\mathbb{F}^{s}\times\mathbb{F}^{t}$ by $\phi=(\alpha^{\prime},\beta,\gamma)$ . Then, each $W_{M}$ can be approximated both internally and externally using layers of $\phi$ up to error of density at most $|\mathbb{F}|^{-4r^{2}-2r}\epsilon$ . Hence, from (6), every $V_{\Lambda}$ may be approximated both internally and externally using layers of $\phi$ up to error of density at most $|\mathbb{F}|^{-2r^{2}-2r}\epsilon$ . Finally, each $V_{\Lambda}\setminus\Big{(}\cup_{\begin{subarray}{c}\Lambda\subsetneq M\subseteq\mathbb{F}^{r}\\ M\text{ coset}\end{subarray}}V_{M}\Big{)}$ may be approximated both internally and externally using layers of $\phi$ up to error of density at most $|\mathbb{F}|^{-r^{2}-r}\epsilon$ . Thus, the set

[TABLE]

may be approximated both internally and externally using layers of $\phi$ up to error of density at most $\epsilon$ , as desired.

When it comes to bounds, the codimension of the desired map is $r+s+t$ , and we may take $C^{\bm{columns}}_{k}=20C^{\bm{inner}}_{k-1}$ and $D^{\bm{columns}}_{k}=3D^{\bm{inner}}_{k-1}$ . Hence, if $t$ is a quantity such that $C^{\bm{weak}}_{k}\leq 2^{k^{2^{t}}}$ and $D^{\bm{weak}}_{k}\leq 2^{2^{t}}$ , using (5), then

[TABLE]

∎

**§6 $\bm{Columns}(k-1)\land\bm{Inner}(k-1)\implies\bm{Conv}(k)$**

Proof. We prove the claim by induction on $k$ , followed by induction on $l$ . Recall that we use $Z$ in the expressions below also to denote the indicator function of the set $Z$ . We include the artificial case $l=0$ . In this case, we have $Z(x_{[k]})=|\mathbb{F}|^{-r}\sum_{\lambda\in\mathbb{F}^{r}}\chi\Big{(}\lambda\cdot\alpha(x_{[k]})\Big{)}$ . Hence, in this case we may take $C^{\bm{conv}}_{k,0}=1,D^{\bm{conv}}_{k,0}=1$ , $c_{i}=|\mathbb{F}|^{-r}$ , $\beta_{i}=\alpha_{i}$ .
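The formula for $Z$ in the case $l=0$ is an instance of orthogonality of characters. The sketch below assumes that $\chi$ is a non-trivial additive character of $\mathbb{F}$ and writes $v=\alpha(x_{[k]})\in\mathbb{F}^{r}$; the indexing of the coefficients by $\lambda$ (as $c_{\lambda}$) is notation introduced here for illustration.

```latex
\begin{align*}
|\mathbb{F}|^{-r}\sum_{\lambda\in\mathbb{F}^{r}}\chi(\lambda\cdot v)
  &=\prod_{i\in[r]}\Big(|\mathbb{F}|^{-1}\sum_{\lambda_{i}\in\mathbb{F}}\chi(\lambda_{i}v_{i})\Big)
   =\prod_{i\in[r]}\mathbf{1}[v_{i}=0]
   =\mathbf{1}[v=0],\\
% one coefficient c_lambda = |F|^{-r} for each lambda in F^r:
\sum_{\lambda\in\mathbb{F}^{r}}|c_{\lambda}|&=|\mathbb{F}|^{r}\cdot|\mathbb{F}|^{-r}=1.
\end{align*}
```

This is consistent with the normalisation $\sum_{i\in[m]}|c_{i}|\leq 1$ that is carried through the induction on $l$.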

Write $C=C^{\bm{conv}}_{k,l-1},D=D^{\bm{conv}}_{k,l-1}$ . By the induction hypothesis there are $s,t\leq C\log^{D}_{|\mathbb{F}|}(100\epsilon^{-2})r^{D}$ , multiaffine forms $\beta_{i}\colon G_{[k]}\to\mathbb{F}$ for $i\in[s]$ , a multiaffine map $\gamma\colon G_{[k]\setminus\{l-1\}}\to\mathbb{F}^{t}$ , constants $c_{1},\dots,c_{m}\in\mathbb{C}$ , multiaffine maps $\rho_{1},\dots,\rho_{m}\in\operatorname{span}\{\beta_{[s]}\}$ and layers $L_{1},\dots,L_{n}$ of $\gamma$ such that

[TABLE]

for all $x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{l-1}\Big{)}$ , $|\cup_{i\in[n]}L_{i}|\leq\frac{\epsilon^{2}}{100}|G_{[k]\setminus\{l-1\}}|$ and $\sum_{i\in[m]}|c_{i}|\leq 1$ .

Let $E=\Big{(}\cup_{i\in[n]}L_{i}\Big{)}\times G_{l-1}$ , which therefore has density at most $\frac{\epsilon^{2}}{100}$ . Write $\rho_{i}(x_{[k]})=\rho^{\prime}_{i}(x_{[k]\setminus\{l\}})+\Gamma_{i}(x_{[k]\setminus\{l\}})\cdot x_{l}$ for multiaffine maps $\rho^{\prime}_{i}\colon G_{[k]\setminus\{l\}}\to\mathbb{F}$ and $\Gamma_{i}\colon G_{[k]\setminus\{l\}}\to G_{l}$ . Write $a\overset{\nu}{\approx}b$ if $|a-b|\leq\nu$ . Consider $x_{[k]}\in G_{[k]}$ such that the column $E_{x_{[k]\setminus\{l\}}}\subset G_{l}$ satisfies $|E_{x_{[k]\setminus\{l\}}}|\leq\frac{\epsilon}{10}|G_{l}|$ . For such $x_{[k]}$ we have

[TABLE]

Write $\beta_{i}(x_{[k]})=\beta^{\prime}_{i}(x_{[k]\setminus\{l\}})+\Psi_{i}(x_{[k]\setminus\{l\}})\cdot x_{l}$ for multiaffine maps $\beta^{\prime}_{i}\colon G_{[k]\setminus\{l\}}\to\mathbb{F}$ and $\Psi_{i}\colon G_{[k]\setminus\{l\}}\to G_{l}$ . Hence $\Gamma_{i}\in\operatorname{span}\{\Psi_{[s]}\}.$ Thus, for each $i,j\in[m]$ , there is $\mu^{ij}\in\mathbb{F}^{s}$ such that $\Gamma_{i}-\Gamma_{j}=\sum_{v\in[s]}\mu^{ij}_{v}\Psi_{v}$ .
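The passage from $\rho_{i}\in\operatorname{span}\{\beta_{[s]}\}$ to $\Gamma_{i}\in\operatorname{span}\{\Psi_{[s]}\}$ can be spelled out as follows. The coefficient vectors $\nu^{i}\in\mathbb{F}^{s}$ are notation assumed here for the expansion of $\rho_{i}$ ; they do not appear in the original text.

```latex
\begin{align*}
\rho_{i}(x_{[k]})
&=\sum_{v\in[s]}\nu^{i}_{v}\,\beta_{v}(x_{[k]})\\
&=\sum_{v\in[s]}\nu^{i}_{v}\,\beta^{\prime}_{v}(x_{[k]\setminus\{l\}})
 +\Big(\sum_{v\in[s]}\nu^{i}_{v}\,\Psi_{v}(x_{[k]\setminus\{l\}})\Big)\cdot x_{l}.
\end{align*}
% Comparing the parts that are linear in x_l with
% \rho_i = \rho'_i + \Gamma_i \cdot x_l gives
\[
\Gamma_{i}=\sum_{v\in[s]}\nu^{i}_{v}\,\Psi_{v},
\qquad
\Gamma_{i}-\Gamma_{j}=\sum_{v\in[s]}\big(\nu^{i}_{v}-\nu^{j}_{v}\big)\Psi_{v},
\qquad
\mu^{ij}=\nu^{i}-\nu^{j}.
\]
```

Under this reading, each set $\{\Gamma_{i}-\Gamma_{j}=0\}$ is of the form $\{\lambda\cdot\Psi=0\}$ with $\lambda=\mu^{ij}$ , which is the shape handled by the simultaneous approximation results applied next.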

Apply Lemma 13 to the maps $\Psi_{[s]}$ to find $u\leq(2s^{3}+s)\log(100\epsilon^{-1})$ and multiaffine maps $\tau_{1},\dots,\tau_{s}\colon G_{[k]\setminus\{l\}}\to\mathbb{F}^{u}$ such that for each $\lambda\in\mathbb{F}^{s}$ , we have that $\{\sum_{i\in[s]}\lambda_{i}\tau_{i}=0\}\supset\{\sum_{i\in[s]}\lambda_{i}\Psi_{i}=0\}$ and $|\{\sum_{i\in[s]}\lambda_{i}\tau_{i}=0\}\setminus\{\sum_{i\in[s]}\lambda_{i}\Psi_{i}=0\}|\leq|\mathbb{F}|^{-2s^{2}}\frac{\epsilon}{100}|G_{[k]\setminus\{l\}}|$ .

Therefore,

[TABLE]

holds for all $x_{[k]}\in G_{[k]}$ outside the set

[TABLE]

Since $|E|\leq\frac{\epsilon^{2}}{100}|G_{[k]}|$ , we have

[TABLE]

Also, by the way we chose $\tau$ , recalling that $m\leq|\mathbb{F}|^{s}$ ,

[TABLE]

Next, apply Theorem 7 to $\Psi_{1},\dots,\Psi_{s}$ , to find a map $\phi\colon G_{[k]\setminus\{l\}}\to\mathbb{F}^{s^{\prime}}$ , where

[TABLE]

such that for each $\lambda\in\mathbb{F}^{s}$ , we may approximate $\{\lambda\cdot\Psi=0\}$ (and hence each $\{\Gamma_{i}-\Gamma_{j}=0\}$ ) internally using layers of $\phi$ up to error of at most $|\mathbb{F}|^{-2s}\frac{\epsilon}{100}$ in density.

Finally, we need to approximate $\Big{\{}x_{[k]}\in E\colon|E_{x_{[k]\setminus\{l\}}}|\geq\frac{\epsilon}{10}|G_{l}|\Big{\}}$ externally by layers of a multiaffine map of bounded codimension up to error of density at most $\frac{\epsilon}{100}$ . We apply Theorem 8 to $\gamma$ , recalling that $E=\Big{(}\cup_{i\in[n]}L_{i}\Big{)}\times G_{l-1}$ , to find such a map $\delta\colon G_{[k]\setminus\{l-1,l\}}\to\mathbb{F}^{w}$ , where

[TABLE]

Hence, we obtained the desired approximation outside layers $L^{\prime}_{1},\dots,L^{\prime}_{q}$ of the multiaffine map $(\tau,\phi,\delta)$ of codimension at most

[TABLE]

such that $|\cup_{i\in[q]}L^{\prime}_{i}|\leq\epsilon|G_{[k]}|$ . The multiaffine forms $\beta^{\prime}_{[s]},\Big{(}x_{[k]}\mapsto\Psi_{[s]}(x_{[k]\setminus\{l\}})\cdot x_{l}\Big{)},\tau_{[s],[u]}$ span the set of multiaffine forms used in the arguments of $\chi$ in the approximation sum, and their number is at most

[TABLE]

Hence, we may take constants

[TABLE]

and

[TABLE]

where $C=C^{\bm{conv}}_{k,l-1},D=D^{\bm{conv}}_{k,l-1}$ , which completes the proof. Hence, if $t$ is a quantity such that $C^{\bm{weak}}_{k}\leq 2^{k^{2^{t}}}$ and $D^{\bm{weak}}_{k}\leq 2^{2^{t}}$ , using (7), then

[TABLE]

∎

**§7 $\bm{Conv}(k)\implies\bm{Weak}(k+1)$**

Proof.

Let $\alpha\colon G_{[k+1]}\to\mathbb{F}$ be a multilinear form such that $\mathop{\mathbb{E}}_{x_{[k+1]}}\chi\Big{(}\alpha(x_{[k+1]})\Big{)}\geq c$ . Let $\alpha(x_{[k+1]})=A(x_{[k]})\cdot x_{k+1}$ for a multilinear map $A\colon G_{[k]}\to G_{k+1}$ . Then $\{A=0\}$ has density at least $c$ in $G_{[k]}$ . Apply Lemma 12 to find a multilinear map $\beta\colon G_{[k]}\to\mathbb{F}^{r}$ , where $r\leq 2^{k}\log c^{-1}+(k+3)\log 2$ , such that $\{A=0\}\subset\{\beta=0\}$ and $|\{\beta=0\}\setminus\{A=0\}|\leq 2^{-k-3}c^{2^{k}}|G_{[k]}|$ . Let $Z=\{\beta=0\}$ . Apply Theorem 9 to $\beta$ to find $s,t\leq C^{\bm{conv}}_{k,k}\Big{(}2^{k}\log c^{-1}+k+3\Big{)}^{2\cdot D^{\bm{conv}}_{k,k}}$ , multiaffine forms $\tau_{i}\colon G_{[k]}\to\mathbb{F}$ for $i\in[s]$ , a multiaffine map $\gamma\colon G_{[k-1]}\to\mathbb{F}^{t}$ , constants $c_{1},\dots,c_{m}\in\mathbb{C}$ , multiaffine maps $\rho_{1},\dots,\rho_{m}\in\operatorname{span}\{\tau_{[s]}\}$ and layers $L_{1},\dots,L_{n}$ of $\gamma$ such that

[TABLE]

for all $x_{[k]}\in G_{[k]}\setminus\Big{(}(\cup_{i\in[n]}L_{i})\times G_{k}\Big{)}$ , $|\cup_{i\in[n]}L_{i}|\leq\frac{1}{8}c^{2^{k}}|G_{[k-1]}|$ and $\sum_{i\in[m]}|c_{i}|\leq 1$ .
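The density claim for $\{A=0\}$ made at the start of this proof is a one-line computation. The following sketch assumes, as the notation $A(x_{[k]})\cdot x_{k+1}$ suggests, that $G_{k+1}$ carries the standard dot product.

```latex
\begin{align*}
\mathop{\mathbb{E}}_{x_{[k+1]}}\chi\Big(A(x_{[k]})\cdot x_{k+1}\Big)
&=\mathop{\mathbb{E}}_{x_{[k]}}\Big[\mathop{\mathbb{E}}_{x_{k+1}\in G_{k+1}}
  \chi\Big(A(x_{[k]})\cdot x_{k+1}\Big)\Big]\\
&=\mathop{\mathbb{E}}_{x_{[k]}}\mathbf{1}\big[A(x_{[k]})=0\big]
 =\frac{|\{A=0\}|}{|G_{[k]}|}.
\end{align*}
% The inner average equals 1 when A(x_{[k]}) = 0. Otherwise
% x_{k+1} \mapsto A(x_{[k]}) \cdot x_{k+1} is a non-zero linear functional,
% hence equidistributed over F, and the average of \chi is 0.
```

So the hypothesis $\mathop{\mathbb{E}}\chi(\alpha)\geq c$ is the same statement as $|\{A=0\}|\geq c|G_{[k]}|$ .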

By applying the Cauchy-Schwarz inequality several times, we see that

[TABLE]

By averaging, there is a non-empty layer $D$ of $(\tau,\gamma)$ such that, for each $x_{[k]}\in D$ , $\mathbf{C}_{k}\cdots\mathbf{C}_{1}Z(x_{[k]})\geq\frac{1}{4}c^{2^{k}}$ .

We now define an arrangement of points in $G_{[k]}$ . Firstly, we define the $\emptyset$ -arrangement of lengths $l_{[k]}\in G_{[k]}$ to be the singleton sequence whose only element is $l_{[k]}$ . For $i\in[k]$ , an $[i]$ -arrangement of lengths $l_{[k]}\in G_{[k]}$ is a sequence of length $2^{i}$ , being a concatenation $(q_{1},q_{2})$ of two $[i-1]$ -arrangements $q_{1}$ and $q_{2}$ (for $i=1$ , $[0]$ is taken to be $\emptyset$ ), where $q_{1}$ has lengths $(l_{[i-1]},l_{i}+y,l_{[i+1,k]})$ and $q_{2}$ has lengths $(l_{[i-1]},y,l_{[i+1,k]})$ for some $y\in G_{i}$ . We note the following.

Lemma 22.

(i)

For a set $S\subset G_{[k]}$ , and a given $x_{[k]}\in G_{[k]}$ , the number of $[i]$ -arrangements of lengths $x_{[k]}$ whose points lie in $S$ is exactly

[TABLE]

(ii)

For $x_{[k]},l_{[k]}\in G_{[k]}$ , $j\leq 2^{i}$ , there are exactly $|G_{1}|^{2^{i-1}-1}|G_{2}|^{2^{i-2}-1}\cdots|G_{i}|^{2^{0}-1}$ $[i]$ -arrangements of lengths $l_{[k]}$ that contain $x_{[k]}$ at $j$ th position, when $l_{[i+1,k]}=x_{[i+1,k]}$ , and no such $[i]$ -arrangements otherwise.

(iii)

If all $2^{i}$ points of an $[i]$ -arrangement of lengths $x_{[k]}$ lie inside $\{A=0\}$ , then $A(x_{[k]})=0$ .

As a few times before, we misuse the notation slightly by writing $S$ for the indicator function of the set $S$ .

Proof.

(i): We prove the claim by induction on $i$ . For $i=0$ , the only $[0]$ -arrangement of lengths $x_{[k]}$ is precisely the singleton sequence $(x_{[k]})$ . Thus, the number of such arrangements equals $S(x_{[k]})$ .

Suppose the claim holds for some $i$ and take any $x_{[k]}$ . By definition, each $[i+1]$ -arrangement of lengths $x_{[k]}$ is a concatenation of two $[i]$ -arrangements, of lengths $(x_{[i]},x_{i+1}+y,x_{[i+2,k]})$ and $(x_{[i]},y,x_{[i+2,k]})$ , for some $y\in G_{i+1}$ . Using this, and the inductive hypothesis, we see that the number of $[i+1]$ -arrangements of lengths $x_{[k]}$ whose points lie in $S$ is exactly

[TABLE]

(ii): For $i=0$ , this is clear. Suppose the claim holds for $[i]$ -arrangements. Assume that $x_{[i+2,k]}=l_{[i+2,k]}$ , otherwise there are clearly no desired arrangements. Note that from part (i) there are exactly $|G_{1}|^{2^{i-1}}|G_{2}|^{2^{i-2}}\cdots|G_{i}|$ $[i]$ -arrangements of any given lengths. If $j\leq 2^{i}$ , then we know that the number of $[i]$ -arrangements of lengths $(l_{[i]},y,l_{[i+2,k]})$ that contain $x_{[k]}$ at the $j$ th position is $|G_{1}|^{2^{i-1}-1}|G_{2}|^{2^{i-2}-1}\cdots|G_{i}|^{2^{0}-1}$ when $y=x_{i+1}$ , and zero otherwise. On the other hand, there are exactly $|G_{1}|^{2^{i-1}}|G_{2}|^{2^{i-2}}\cdots|G_{i}|$ $[i]$ -arrangements of lengths $(l_{[i]},x_{i+1}-l_{i+1},l_{[i+2,k]})$ . The result now follows. The case $j>2^{i}$ can be treated similarly.

(iii): For $i=0$ , this is clear. Suppose the claim holds for $[i]$ -arrangements. Let $q$ be an $[i+1]$ -arrangement of lengths $x_{[k]}$ all of whose points lie inside $\{A=0\}$ . Then, $q=(q_{1},q_{2})$ for an $[i]$ -arrangement $q_{1}$ of lengths $(x_{[i]},x_{i+1}+y,x_{[i+2,k]})$ and an $[i]$ -arrangement $q_{2}$ of lengths $(x_{[i]},y,x_{[i+2,k]})$ . By the induction hypothesis, we have $A(x_{[i]},x_{i+1}+y,x_{[i+2,k]})=0$ and $A(x_{[i]},y,x_{[i+2,k]})=0$ . Since $A$ is linear in the $(i+1)$ th coordinate, $A(x_{[k]})=0$ , as desired.∎
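For concreteness, part (iii) can be unpacked in the case $k=2$ , $i=2$ . The variables $y,y^{\prime}\in G_{1}$ and $z\in G_{2}$ below are names chosen here for the free parameters in the definition of an arrangement; this is an illustrative sketch rather than part of the original argument.

```latex
% A [2]-arrangement of lengths (l_1, l_2) is a concatenation of two
% [1]-arrangements, of lengths (l_1, l_2 + z) and (l_1, z); each of those
% is a pair of points.
\[
q=\big((l_{1}+y,\,l_{2}+z),\ (y,\,l_{2}+z),\ (l_{1}+y^{\prime},\,z),\ (y^{\prime},\,z)\big).
\]
% Bilinearity of A recovers A(l_1, l_2) from the four points of q:
\begin{align*}
A(l_{1},l_{2})
&=A(l_{1},l_{2}+z)-A(l_{1},z)\\
&=\big[A(l_{1}+y,l_{2}+z)-A(y,l_{2}+z)\big]
 -\big[A(l_{1}+y^{\prime},z)-A(y^{\prime},z)\big].
\end{align*}
```

Hence, if all four points of $q$ lie in $\{A=0\}$ , then $A(l_{1},l_{2})=0$ . The three free parameters $y,y^{\prime},z$ also account for the total count $|G_{1}|^{2}|G_{2}|$ of $[2]$ -arrangements of fixed lengths, which matches $|G_{1}|^{2^{i-1}}|G_{2}|^{2^{i-2}}$ for $i=2$ .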

By part (i) of the lemma, for each $x_{[k]}\in D$ , there are at least $\frac{1}{4}c^{2^{k}}|G_{1}|^{2^{k-1}}|G_{2}|^{2^{k-2}}\cdots|G_{k}|^{2^{0}}$ $[k]$ -arrangements of lengths $x_{[k]}$ with all points in $\{\beta=0\}$ . Since $|\{\beta=0\}\setminus\{A=0\}|\leq 2^{-(k+3)}c^{2^{k}}|G_{[k]}|$ , and, by (ii), each point in $G_{[k]}$ belongs to at most $2^{k}|G_{1}|^{2^{k-1}-1}|G_{2}|^{2^{k-2}-1}\cdots|G_{k}|^{0}$ $[k]$ -arrangements of lengths $x_{[k]}$ (the factor of $2^{k}$ comes from the number of positions a point may take in a $[k]$ -arrangement), this implies that for each $x_{[k]}\in D$ , at least one $[k]$ -arrangement of lengths $x_{[k]}$ has all its points inside $\{A=0\}$ . Using part (iii), we obtain $A(x_{[k]})=0$ . Hence, $D\subset\{A=0\}$ . To finish the proof we modify $D$ to a variety inside $\{A=0\}$ , but one which is defined by multilinear maps only.
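The counting step above can be made explicit. The abbreviations $N$ and $B$ are introduced here for readability and are not notation from the original.

```latex
% N: total number of [k]-arrangements of given lengths (Lemma 22 (i), S = G_{[k]}).
% B: the points of {\beta = 0} that are not in {A = 0}.
\[
N=\prod_{j\in[k]}|G_{j}|^{2^{k-j}},\qquad
B=\{\beta=0\}\setminus\{A=0\},\qquad
|B|\leq 2^{-(k+3)}c^{2^{k}}|G_{[k]}|.
\]
% By (ii), a fixed point lies in at most 2^k N / |G_{[k]}| arrangements
% of lengths x_{[k]}, so the arrangements meeting B number at most
\[
|B|\cdot 2^{k}\cdot\frac{N}{|G_{[k]}|}
\leq 2^{-(k+3)}c^{2^{k}}\cdot 2^{k}\cdot N
=\tfrac{1}{8}c^{2^{k}}N
<\tfrac{1}{4}c^{2^{k}}N.
\]
```

Since at least $\frac{1}{4}c^{2^{k}}N$ arrangements have all points in $\{\beta=0\}$ , at least $\frac{1}{8}c^{2^{k}}N>0$ of them avoid $B$ and therefore lie entirely inside $\{A=0\}$ .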

Lemma 23.

Suppose that $A\colon G_{[k]}\to H$ is a multilinear map and that $D$ is a variety of codimension $r$ in $G_{[k]}$ such that $D\subset\{A=0\}$ . Then, there is $R\leq 2^{2k}r$ , multilinear forms $\beta_{i}\colon G_{I_{i}}\to\mathbb{F}$ , $\emptyset\not=I_{i}\subset[k]$ for $i\in[R]$ , such that $\{x_{[k]}\in G_{[k]}\colon(\forall i\in[R])\beta_{i}(x_{I_{i}})=0\}\subset\{A=0\}$ .

Proof.

By splitting the map that defines $D$ into its multilinear parts, we see that there are multilinear forms $\delta_{i}\colon G_{I_{i}}\to\mathbb{F}$ , $\emptyset\not=I_{i}\subset[k]$ , scalars $\lambda_{i}\in\mathbb{F}$ , $i\leq r^{\prime}\leq 2^{k}r$ , such that

[TABLE]

By induction on $d$ , we prove that, misusing the notation above, we may replace the maps by new ones so that $\lambda_{i}=0$ whenever $I_{i}\cap[d]\not=\emptyset$ , at the expense of a larger bound $r^{\prime}\leq 2^{k+d}r$ .

For $d=0$ , the given maps satisfy these properties. Assume now that the claim holds for some $d\geq 0$ . Take an arbitrary point $(v_{[k]})$ such that $\delta_{i}(v_{I_{i}})=\lambda_{i}$ for all $i\in[r^{\prime}]$ . We claim that if $x_{[k]}\in G_{[k]}$ satisfies $\delta_{i}(x_{I_{i}})=\lambda_{i}$ for $d+1\notin I_{i}$ , $\delta_{i}(x_{I_{i}})=0$ and $\delta_{i}(x_{I_{i}\setminus\{d+1\}},v_{d+1})=\lambda_{i}$ for $d+1\in I_{i}$ , then $A(x_{[k]})=0$ . Indeed, let $x_{[k]}$ be such a point. It suffices to show that $A(x_{[d]},x_{d+1}+v_{d+1},x_{[d+2,k]})=A(x_{[d]},v_{d+1},x_{[d+2,k]})=0$ . Since $\delta_{i}(x_{I_{i}})=\lambda_{i}$ for $d+1\notin I_{i}$ , we just need to show that $\delta_{i}(x_{I_{i}\setminus\{d+1\}};x_{d+1}+v_{d+1})=\delta_{i}(x_{I_{i}\setminus\{d+1\}};v_{d+1})=\lambda_{i}$ for $d+1\in I_{i}$ , which we already know to hold. Note also that $v_{[k]}$ satisfies all these equalities, so we get a non-empty variety. Thus, we may replace the given maps with at most $2r^{\prime}$ maps with desired properties.∎
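The two identities behind the inductive step, and the resulting bound on the number of forms, can be written out as follows. This is a sketch of the reasoning already present in the proof, using only the multilinearity of $\delta_{i}$ and $A$ and the hypothesis $D\subset\{A=0\}$ .

```latex
% For d+1 \in I_i, linearity of \delta_i in coordinate d+1 gives
\[
\delta_{i}(x_{I_{i}\setminus\{d+1\}};x_{d+1}+v_{d+1})
=\underbrace{\delta_{i}(x_{I_{i}})}_{=0}
+\underbrace{\delta_{i}(x_{I_{i}\setminus\{d+1\}};v_{d+1})}_{=\lambda_{i}}
=\lambda_{i},
\]
% so both auxiliary points lie in D, hence in {A = 0}, and
\[
A(x_{[k]})
=A(x_{[d]},x_{d+1}+v_{d+1},x_{[d+2,k]})-A(x_{[d]},v_{d+1},x_{[d+2,k]})
=0-0=0.
\]
% Each step at most doubles the number of forms:
\[
r^{\prime}\leq 2^{k}r\ \longrightarrow\ 2^{k+1}r\ \longrightarrow\ \cdots\ \longrightarrow\ 2^{k+k}r=2^{2k}r.
\]
```

After the step $d=k$ every $I_{i}$ meets $[k]$ , since $I_{i}\not=\emptyset$ , so all $\lambda_{i}$ vanish and the variety is cut out by at most $R\leq 2^{2k}r$ multilinear forms, as claimed in the statement.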

This completes the proof. As far as the bounds are concerned, we may find a variety inside $\{A=0\}$ with desired properties with codimension bounded by

[TABLE]

Thus, we may take $C^{\bm{weak}}_{k+1}=2^{2k+1}C^{\bm{conv}}_{k,k}(2^{k}+k+3)^{2D^{\bm{conv}}_{k,k}}$ and $D^{\bm{weak}}_{k+1}=2D^{\bm{conv}}_{k,k}$ . Hence, if $t$ is a quantity such that $C^{\bm{weak}}_{k}\leq 2^{k^{2^{t}}}$ and $D^{\bm{weak}}_{k}\leq 2^{2^{t}}$ , using (8), then

[TABLE]

∎

Bibliography

[1] A. Bhowmick and S. Lovett, Bias vs structure of polynomials in large fields, and applications in effective algebraic geometry and coding theory, arXiv preprint (2015), arXiv:1506.02047.
[2] W.T. Gowers, A new proof of Szemerédi's theorem, Geometric and Functional Analysis 11 (2001), no. 3, 465–588.
[3] W.T. Gowers and J. Wolf, Linear forms and higher-degree uniformity for functions on $\mathbb{F}^{n}_{p}$ , Geometric and Functional Analysis 21 (2011), no. 1, 36–69.
[4] B. Green and T. Tao, The distribution of polynomials over finite fields, with applications to the Gowers norms, Contributions to Discrete Mathematics 4 (2009), no. 2, 1–36.
[5] B. Green and T. Tao, Linear equations in primes, Annals of Mathematics 171 (2010), no. 3, 1753–1850.
[6] B. Host and B. Kra, Nonconventional ergodic averages and nilmanifolds, Annals of Mathematics 161 (2005), no. 1, 397–488.
[7] O. Janzer, Low analytic rank implies low partition rank for tensors, arXiv preprint (2018), arXiv:1809.10931.
[8] O. Janzer, Polynomial bound for the partition rank vs the analytic rank of tensors, arXiv preprint (2019), arXiv:1902.11207.
