The asymptotic induced matching number of hypergraphs: balanced binary strings

Srinivasan Arunachalam; Péter Vrana; Jeroen Zuiddam

arXiv:1905.03148·math.CO·May 9, 2019


TL;DR

This paper calculates the asymptotic induced matching number of specific hypergraphs related to balanced binary strings, using advanced algebraic methods, with implications for tensor theory, quantum information, and computational complexity.

Contribution

It introduces a new lower bound for the asymptotic induced matching number of certain hypergraphs using the higher-order Coppersmith-Winograd method.

Findings

01

Determines the asymptotic induced matching number for hypergraphs of balanced binary strings.

02

Establishes the asymptotic subrank of tensors supported by these hypergraphs.

03

Provides an optimal protocol for entanglement distillation in quantum information theory.


Equations

$$\Phi\subseteq V_{1}\times\cdots\times V_{k}.$$

$$\operatorname{Q}(\Phi)\coloneqq\max\{\lvert\Psi\rvert\colon\Psi\subseteq\Phi,\ \Psi=\Phi\cap(\Psi_{1}\times\cdots\times\Psi_{k}),\ \forall a\neq b\in\Psi\ \forall i\in[k]\ a_{i}\neq b_{i}\}.$$

$$\Phi\boxtimes\Psi\subseteq(V_{1}\times W_{1})\times\cdots\times(V_{k}\times W_{k}),$$

$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\coloneqq\lim_{n\to\infty}\operatorname{Q}(\Phi^{\boxtimes n})^{1/n}.$$

$$\Phi_{\lambda}\coloneqq\{s\in[n]^{k}\colon\operatorname{type}(s)=\lambda\}$$

$$(\underbrace{1,\ldots,1}_{\lambda_{1}},\underbrace{2,\ldots,2}_{\lambda_{2}},\ldots,\underbrace{n,\ldots,n}_{\lambda_{n}}).$$

$$\Phi_{(1,1)}=\{(2,1),(1,2)\}\subseteq[2]\times[2]$$

$$\Phi_{(2,2)}$$

$$\alpha_{1}(a_{1})+\cdots+\alpha_{k}(a_{k})=0$$

$$\{\alpha(x)-\alpha(y)\colon(x,y)\in R\},$$

$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\geq\max_{P\in\mathscr{P}}\Bigl(H(P)-(k-2)\max_{R\in\mathscr{R}}\frac{\max_{Q\in\mathscr{Q}_{R,(P_{1},\ldots,P_{k})}}H(Q)-H(P)}{r(R)}\Bigr)$$

$$\sum_{i=1}^{k}a_{i}=\sum_{j=1}^{n}j\lambda_{j}$$

$$\bigl\lvert\bigl\{(x,y)\in\mathbb{F}_{2}^{k}\times\mathbb{F}_{2}^{k}\colon\lvert x\rvert=\lvert y\rvert=\tfrac{k}{2},\,x-y\in V\bigr\}\bigr\rvert\leq\binom{k-1}{k/2}^{\frac{\dim_{\mathbb{F}_{2}}(V)}{k-2}+1}$$

$$\langle n\rangle\coloneqq\sum_{i=1}^{n}e_{i}\otimes\cdots\otimes e_{i}\in(\mathbb{F}^{n})^{\otimes k}.$$

$$\operatorname{Q}(a)\coloneqq\max\{n\in\mathbb{N}\colon\langle n\rangle\leq a\}.$$

$$a\boxtimes b\coloneqq\sum_{i,j}a_{i}b_{j}\,(e_{i_{1}}\otimes e_{j_{1}})\otimes\cdots\otimes(e_{i_{k}}\otimes e_{j_{k}})\in(\mathbb{F}^{n_{1}}\otimes\mathbb{F}^{m_{1}})\otimes\cdots\otimes(\mathbb{F}^{n_{k}}\otimes\mathbb{F}^{m_{k}}).$$

$$\operatorname{supp}(a)\coloneqq\{i\in[n_{1}]\times\cdots\times[n_{k}]\colon a_{i}\neq 0\}.$$

$$\operatorname{\underaccent{\wtilde}{Q}}(\operatorname{supp}(a))\leq\operatorname{\underaccent{\wtilde}{Q}}(a).$$

$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\leq\min_{\textnormal{field }\mathbb{F}}\ \min_{\substack{a\in\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}\colon\\ \operatorname{supp}(a)=\Phi}}\operatorname{\underaccent{\wtilde}{Q}}(a).$$

$$\phi(a\boxtimes b)\leq\phi(a)\phi(b)$$

$$\phi(\langle n\rangle)=n$$

$$a\leq b\Rightarrow\phi(a)\leq\phi(b).$$

$$\zeta^{\theta}\colon\{\textnormal{$k$-tensors over $\mathbb{F}$}\}\to\mathbb{R}_{\geq 0}$$

$$\operatorname{\underaccent{\wtilde}{Q}}(a)\leq\min_{\theta}\zeta^{\theta}(a).$$

$$\operatorname{\underaccent{\wtilde}{Q}}(a)\leq\liminf_{n\to\infty}\operatorname{slicerank}(a^{\boxtimes n})^{1/n}.$$

$$\limsup_{n\to\infty}\operatorname{slicerank}(a^{\boxtimes n})^{1/n}\leq\min_{\theta}\zeta^{\theta}(a).$$

$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\geq\max_{P\in\mathscr{P}}\min_{i\in[3]}H(P_{i})$$

$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\leq\max_{P\in\mathscr{P}}\min_{i}H(P_{i}),$$

$$T_{\lambda}\coloneqq\sum_{s\in\Phi_{\lambda}}e_{s_{1}}\otimes\cdots\otimes e_{s_{k}}\in(\mathbb{F}^{n})^{\otimes k}.$$

$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{\lambda})\leq\operatorname{\underaccent{\wtilde}{Q}}(T_{\lambda})\leq 2^{H(\lambda/k)}.$$

$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k-1,1)})=\operatorname{\underaccent{\wtilde}{Q}}(T_{(k-1,1)})=2^{H((1-1/k,1/k))}$$


Full text

The asymptotic induced matching number

of hypergraphs: balanced binary strings

Srinivasan Arunachalam

Center for Theoretical Physics, Massachusetts Institute of Technology, 77 Massachusetts Ave, 6-304, Cambridge, MA 02139, USA

Péter Vrana

Department of Geometry, Budapest University of Technology and Economics, Egry József u. 1., 1111 Budapest, Hungary

MTA-BME Lendület Quantum Information Theory Research Group

Jeroen Zuiddam

Institute for Advanced Study, 1 Einstein Drive, Princeton, NJ 08540, USA

Abstract.

We compute the asymptotic induced matching number of the $k$ -partite $k$ -uniform hypergraphs whose edges are the $k$ -bit strings of Hamming weight $k/2$ , for any large enough even number $k$ . Our lower bound relies on the higher-order extension of the well-known Coppersmith–Winograd method from algebraic complexity theory, which was proven by Christandl, Vrana and Zuiddam. Our result is motivated by the study of the power of this method as well as of the power of the Strassen support functionals (which provide upper bounds on the asymptotic induced matching number), and the connections to questions in tensor theory, quantum information theory and theoretical computer science.

Phrased in the language of tensors, as a direct consequence of our result, we determine the asymptotic subrank of any tensor with support given by the aforementioned hypergraphs. In the context of quantum information theory, our result amounts to an asymptotically optimal $k$ -party stochastic local operations and classical communication (slocc) protocol for the problem of distilling GHZ-type entanglement from a subfamily of Dicke-type entanglement.

Keywords. $k$ -partite $k$ -uniform hypergraphs, asymptotic induced matchings, higher-order Coppersmith–Winograd method

1. Introduction

1.1. Problem

We study in this paper an asymptotic parameter of $k$ -partite $k$ -uniform hypergraphs: the asymptotic induced matching number. For $k\in\mathbb{N}$ , a $k$ -partite $k$ -uniform hypergraph, or $k$ -graph for short, is a tuple of finite sets $V_{1},\ldots,V_{k}$ together with a subset $\Phi$ of their cartesian product:

$$\Phi\subseteq V_{1}\times\cdots\times V_{k}.$$

Whenever possible we will leave the vertex sets $V_{i}$ implicit and refer to the $k$-graph by its edge set $\Phi$. For any $k\in\mathbb{N}$ we use the notation $[k]\coloneqq\{1,2,\ldots,k\}$. Let $\Phi$ be a $k$-graph. We say a subset $\Psi$ of $\Phi$ is induced if $\Psi=\Phi\cap(\Psi_{1}\times\cdots\times\Psi_{k})$ where for each $i\in[k]$ we define the marginal set $\Psi_{i}\coloneqq\{a_{i}\colon a\in\Psi\}$. We call $\Psi$ a matching if any two distinct elements $a,b\in\Psi$ are distinct in all $k$ coordinates, that is, $\forall i\in[k]\colon a_{i}\neq b_{i}$. The subrank (the term originates from an analogous parameter in the theory of tensors, see Section 1.4.1) or induced matching number $\operatorname{Q}(\Phi)$ is defined as the size of the largest subset $\Psi$ of $\Phi$ that is an induced matching, that is,

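$$\operatorname{Q}(\Phi)\coloneqq\max\{\lvert\Psi\rvert\colon\Psi\subseteq\Phi,\ \Psi=\Phi\cap(\Psi_{1}\times\cdots\times\Psi_{k}),\ \forall a\neq b\in\Psi\ \forall i\in[k]\ a_{i}\neq b_{i}\}.$$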

For example, consider the 3-graph $\Phi=\{(1,1,1),(2,2,2),(3,3,3)\}\subseteq[3]\times[3]\times[3]$ . Here $\Phi$ is itself an induced matching, and so $\operatorname{Q}(\Phi)=3$ . Next, let $\Phi=\{(1,1,1),(2,2,2),(3,3,3),(1,2,3)\}$ . Now the subset $\{(1,1,1),(2,2,2)\}\subseteq\Phi$ is an induced matching and there is no larger induced matching in $\Phi$ , and so $\operatorname{Q}(\Phi)=2$ .
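The two examples above can be checked by exhaustive search. The following is a minimal sketch, not an efficient algorithm: it enumerates all subsets of the edge set, so it is only usable for a handful of edges. The function names are illustrative and do not come from the paper.

```python
from itertools import combinations

def is_induced_matching(psi, phi):
    """Check whether the subset psi of the k-graph phi is an induced matching."""
    psi = list(psi)
    if not psi:
        return True
    k = len(psi[0])
    # Matching: any two distinct edges differ in every coordinate.
    for a, b in combinations(psi, 2):
        if any(a[i] == b[i] for i in range(k)):
            return False
    # Induced: phi restricted to the marginal sets of psi contains only psi.
    marginals = [{a[i] for a in psi} for i in range(k)]
    restricted = {e for e in phi if all(e[i] in marginals[i] for i in range(k))}
    return restricted == set(psi)

def induced_matching_number(phi):
    """Brute-force Q(phi): size of the largest induced matching (exponential time)."""
    phi = set(phi)
    for size in range(len(phi), 0, -1):
        for psi in combinations(sorted(phi), size):
            if is_induced_matching(psi, phi):
                return size
    return 0

phi_diag = {(1, 1, 1), (2, 2, 2), (3, 3, 3)}
phi_extra = phi_diag | {(1, 2, 3)}
print(induced_matching_number(phi_diag))   # 3
print(induced_matching_number(phi_extra))  # 2
```

In the second example the full diagonal fails to be induced because the extra edge $(1,2,3)$ lies in the product of its marginal sets, which is exactly what the `restricted == set(psi)` test detects.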

We define the Kronecker product of two $k$ -graphs $\Phi\subseteq V_{1}\times\cdots\times V_{k}$ and $\Psi\subseteq W_{1}\times\cdots\times W_{k}$ as the $k$ -graph

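$$\Phi\boxtimes\Psi\coloneqq\{((a_{1},b_{1}),\ldots,(a_{k},b_{k}))\colon a\in\Phi,\,b\in\Psi\}\subseteq(V_{1}\times W_{1})\times\cdots\times(V_{k}\times W_{k}),$$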

and we naturally define the power $\Phi^{\boxtimes n}=\Phi\boxtimes\cdots\boxtimes\Phi$ . The asymptotic subrank or the asymptotic induced matching number of the $k$ -graph $\Phi$ is defined as

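$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\coloneqq\lim_{n\to\infty}\operatorname{Q}(\Phi^{\boxtimes n})^{1/n}.$$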

This limit exists and equals the supremum $\sup_{n\in\mathbb{N}}\operatorname{Q}(\Phi^{\boxtimes n})^{1/n}$ by Fekete’s lemma (see, e.g., [PS98, No. 98]).
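To see the Kronecker product and the supermultiplicativity behind Fekete's lemma in action, here is a small sketch. It assumes the product pairs up coordinates, so that coordinate $i$ of an edge of $\Phi\boxtimes\Psi$ is the pair $(a_{i},b_{i})$. The three-edge 3-graph used below is chosen only for illustration, and the brute-force search is exponential in the number of edges.

```python
from itertools import combinations, product

def kronecker(phi, psi):
    """Kronecker product of two k-graphs: coordinate i of an edge is the pair (a_i, b_i)."""
    return {tuple(zip(a, b)) for a, b in product(phi, psi)}

def induced_matching_number(phi):
    """Brute-force Q(phi) for a nonempty k-graph; exponential in the number of edges."""
    edges = sorted(phi)
    k = len(edges[0])
    for size in range(len(edges), 0, -1):
        for psi in combinations(edges, size):
            marginals = [{a[i] for a in psi} for i in range(k)]
            # Matching: every coordinate takes `size` distinct values on psi.
            if any(len(m) != size for m in marginals):
                continue
            # Induced: no further edge of phi lies in the product of the marginals.
            inside = [e for e in edges if all(e[i] in marginals[i] for i in range(k))]
            if len(inside) == size:
                return size
    return 0

phi = {(2, 1, 1), (1, 2, 1), (1, 1, 2)}   # illustrative 3-graph
phi2 = kronecker(phi, phi)                # 9 edges over vertex sets [2] x [2]
q1 = induced_matching_number(phi)
q2 = induced_matching_number(phi2)
print(q1, q2, q2 ** 0.5)                  # Q(phi), Q(phi^2), and the rate Q(phi^2)^(1/2)
```

Here any two edges of `phi` agree in some coordinate, so `q1` is 1, while the square already contains an induced matching of size 2, for instance the pair formed from `((2,1,1),(2,1,1))` and `((1,2,1),(1,1,2))`. The quantity `q2 ** 0.5` is therefore a strictly better lower bound on the asymptotic value than `q1`.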

We study the following basic question:

Problem 1.1.

Given $\Phi$ what is the value of $\operatorname{\underaccent{\wtilde}{Q}}(\Phi)$ ?

A priori, for $\Phi\subseteq V_{1}\times\cdots\times V_{k}$ we have the upper bound $\operatorname{Q}(\Phi)\leq\min_{i}\lvert V_{i}\rvert$, and it therefore holds that $\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\leq\min_{i}\lvert V_{i}\rvert$, since $\lvert V_{i}^{\times n}\rvert=\lvert V_{i}\rvert^{n}$.

Problem 1.1 has been studied for several families of $k$-graphs, in several different contexts: the cap set problem [EG17, Tao16, KSS16, Nor16, Peb16], approaches to fast matrix multiplication [Str91, BCC+17a, BCC+17b, Saw17], arithmetic removal lemmas [LS18, FLS18], property testing [FK14, HX17], quantum information theory [VC15, VC17], and the general study of asymptotic properties of tensors [TS16, CVZ18a, CVZ18c]. We finally mention the related result of Ruzsa and Szemerédi which says that the largest subset $E\subseteq\binom{n}{2}$ such that $(E\times E\times E)\cap\{(\{a,b\},\{b,c\},\{c,a\})\colon a,b,c\in[n]\}$ is a matching, has size $n^{2-o(1)}\leq\lvert E\rvert\leq o(n^{2})$ when $n$ goes to infinity [RS78], see also [AS06, Equation 2].

1.2. Result

We solve Problem 1.1 for a family of $k$-graphs that are structured but nontrivial. For $k\geq n$ let $\lambda=(\lambda_{1},\ldots,\lambda_{n})\vdash k$ be an integer partition of $k$ with $n$ nonzero parts, that is, $\lambda_{1}\geq\lambda_{2}\geq\cdots\geq\lambda_{n}>0$ and $\sum_{i=1}^{n}\lambda_{i}=k$. We define the $k$-graph

$$\Phi_{\lambda}\coloneqq\{s\in[n]^{k}\colon\operatorname{type}(s)=\lambda\}$$

where the expression $\operatorname{type}(s)=\lambda$ means that $s$ is a permutation of the $k$ -tuple

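$$(\underbrace{1,\ldots,1}_{\lambda_{1}},\underbrace{2,\ldots,2}_{\lambda_{2}},\ldots,\underbrace{n,\ldots,n}_{\lambda_{n}}).$$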

For example, the partition $\lambda=(1,1)\vdash 2$ corresponds to the 2-graph

$$\Phi_{(1,1)}=\{(2,1),(1,2)\}\subseteq[2]\times[2]$$

and the partition $\lambda=(2,2)\vdash 4$ corresponds to the 4-graph

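$$\Phi_{(2,2)}=\{(2,2,1,1),(2,1,2,1),(2,1,1,2),(1,2,2,1),(1,2,1,2),(1,1,2,2)\}\subseteq[2]\times[2]\times[2]\times[2].$$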
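The edge set of $\Phi_{\lambda}$ can be enumerated directly from the definition, as all distinct permutations of the tuple in which $j$ occurs $\lambda_{j}$ times. A minimal sketch follows; the helper name `type_graph` is illustrative and not from the paper.

```python
from itertools import permutations

def type_graph(lam):
    """Edge set of Phi_lambda: all distinct permutations of (1^{lam_1}, 2^{lam_2}, ...)."""
    base = tuple(j for j, mult in enumerate(lam, start=1) for _ in range(mult))
    return set(permutations(base))

print(sorted(type_graph((1, 1))))   # [(1, 2), (2, 1)]
print(len(type_graph((2, 2))))      # 6
```

For $\lambda=(k/2,k/2)$ the number of edges is the central binomial coefficient $\binom{k}{k/2}$, so this enumeration is only practical for small $k$.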

It was shown in [CVZ18a] that $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k-1,1)})=2^{H((1-1/k,1/k))}$ for every $k\in\mathbb{N}_{\geq 3}$ where $H$ is the Shannon entropy in base 2. As a natural continuation of that work we study $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})$ for even $k\in\mathbb{N}$ . Since $\Phi_{(k/2,k/2)}\subseteq[2]^{\times k}$ we have $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})\leq 2$ . Clearly, the 2-graph $\Phi_{(1,1)}$ is itself a matching, and so $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(1,1)})=2$ . It was shown in [CVZ18a] that also $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(2,2)})=2$ . Our new result is the following extension:

Theorem 1.2.

Let $k\in\mathbb{N}_{\geq 2}$ be even and large enough. Then $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})=2$ .

In other words, we prove that for every large enough even $k\in\mathbb{N}_{\geq 2}$ there is an induced matching $\Psi\subseteq\Phi_{(k/2,k/2)}^{\boxtimes n}$ of size $\mathinner{\lvert\Psi\rvert}=2^{n-o(n)}$ when $n$ goes to infinity.

Moreover, we numerically verified that $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})=2$ also holds for all even $k\leq 2000$ . We conjecture that $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})=2$ for all even $k$ . More generally, we conjecture (cf. [VC15] and [CVZ18a, Question 1.3.3]) that $\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{\lambda})$ equals the Shannon entropy of the probability distribution obtained by normalising the partition $\lambda$ . We will discuss further motivation and background in Section 1.4.
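The conjectured value $2^{H(\lambda/k)}$ is easy to tabulate. The sketch below only evaluates the Shannon entropy of the normalised partition; it does not verify the conjecture, and the function name is illustrative.

```python
from math import log2

def entropy_of_partition(lam):
    """Shannon entropy (base 2) of the distribution (lam_1/k, ..., lam_n/k)."""
    k = sum(lam)
    return -sum((part / k) * log2(part / k) for part in lam if part > 0)

for lam in [(1, 1), (2, 2), (5, 5), (3, 1), (2, 1)]:
    h = entropy_of_partition(lam)
    print(lam, round(h, 4), round(2 ** h, 4))
```

Balanced partitions $(k/2,k/2)$ give entropy 1 and hence the value 2, matching Theorem 1.2, while unbalanced partitions such as $(k-1,1)$ give a value strictly below 2.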

1.3. Methods

We prove Theorem 1.2 by applying the higher-order Coppersmith–Winograd (CW) method from [CVZ18a] to the $k$ -graph $\Phi_{(k/2,k/2)}$ . This method is an extension of the work of Coppersmith and Winograd [CW87] and Strassen [Str91] from the case $k=3$ to the case $k\geq 4$ . It provides a construction of large induced matchings in $k$ -graphs via the probabilistic method, and we prove Theorem 1.2 by analysing the size of these induced matchings.

Theorem 1.3 (Higher-order CW method [CVZ18a]).

Let $\Phi\subseteq V_{1}\times\cdots\times V_{k}$ be a nonempty $k$ -graph for which there exist injective maps $\alpha_{i}\mathrel{\mathop{\mathchar 58\relax}}V_{i}\to\mathbb{Z}$ such that for all $a\in\Phi$ the equality

$$\alpha_{1}(a_{1})+\cdots+\alpha_{k}(a_{k})=0$$

holds. For any $R\subseteq\Phi\times\Phi$ let $r(R)$ be the rank over $\mathbb{Q}$ of the $\mathinner{\lvert R\rvert}\times k$ matrix with rows

$$\{\alpha(x)-\alpha(y)\colon(x,y)\in R\},$$

where $\alpha(x)\coloneqq(\alpha_{1}(x_{1}),\ldots,\alpha_{k}(x_{k}))\in\mathbb{Z}^{k}$ . Then

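$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\geq\max_{P\in\mathscr{P}}\Bigl(H(P)-(k-2)\max_{R\in\mathscr{R}}\frac{\max_{Q\in\mathscr{Q}_{R,(P_{1},\ldots,P_{k})}}H(Q)-H(P)}{r(R)}\Bigr) \tag{1}$$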

where the parameters $P$ , $R$ and $Q$ are taken over the following domains:

• $\mathscr{P}$ is the set of probability distributions on $\Phi$

• $\mathscr{R}$ is the set of subsets of $\Phi\times\Phi$ that are not a subset of $\{(x,x)\colon x\in\Phi\}$ and moreover satisfy $\exists i\in[k]\,\forall(x,y)\in R\colon x_{i}=y_{i}$

• $\mathscr{Q}_{R,(P_{1},\ldots,P_{k})}$ is the set of probability distributions on $R\subseteq\Phi\times\Phi$ with marginal distributions equal to $P_{1},\ldots,P_{k},P_{1},\ldots,P_{k}$ respectively.

Here for $P\in\mathscr{P}$ we denote by $P_{1},\ldots,P_{k}$ the marginal probability distributions of $P$ on the components $V_{1},\ldots,V_{k}$ respectively, and $H$ denotes Shannon entropy.

Let $\lambda\vdash k$ be any integer partition of $k$ with $n$ nonzero parts. We can apply Theorem 1.3 to the $k$ -graph $\Phi=\Phi_{\lambda}$ as follows. For every $a\in\Phi_{\lambda}$ the equality

$$\sum_{i=1}^{k}a_{i}=\sum_{j=1}^{n}j\lambda_{j} \tag{2}$$

holds, since the element $j$ occurs $\lambda_{j}$ times in $a$ . Let $\alpha_{1},\ldots,\alpha_{k-1}$ be identity maps $\mathbb{Z}\to\mathbb{Z}$ and let $\alpha_{k}\mathrel{\mathop{\mathchar 58\relax}}\mathbb{Z}\to\mathbb{Z}\mathrel{\mathop{\mathchar 58\relax}}x\mapsto x-\sum_{j=1}^{\smash{n}}j\lambda_{j}$ . Then, because of (2), $\forall a\in\Phi_{\lambda}\colon\alpha_{1}(a_{1})+\cdots+\alpha_{k}(a_{k})=0$ . (Note that with this choice of maps $\alpha_{1},\ldots,\alpha_{k}$ we have that $\alpha(x)-\alpha(y)$ equals $x-y$ for every $(x,y)\in R$ .) Therefore Theorem 1.3 can be applied to obtain a lower bound on $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{\lambda})$ for any partition $\lambda$ . The difficulty now lies in evaluating the right-hand side of (1).
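The choice of maps described above can be sanity-checked on a small partition. The sketch below uses $\lambda=(2,1,1)$, an arbitrary illustrative choice, and checks both the zero-sum condition and the remark that $\alpha(x)-\alpha(y)=x-y$. The helper names are not from the paper.

```python
from itertools import permutations

def type_graph(lam):
    """Edge set of Phi_lambda: all distinct permutations of (1^{lam_1}, 2^{lam_2}, ...)."""
    base = tuple(j for j, mult in enumerate(lam, start=1) for _ in range(mult))
    return set(permutations(base))

def alpha(a, lam):
    """Apply alpha_1..alpha_{k-1} = identity and alpha_k(x) = x - sum_j j*lam_j."""
    shift = sum(j * mult for j, mult in enumerate(lam, start=1))
    return a[:-1] + (a[-1] - shift,)

lam = (2, 1, 1)            # illustrative partition of k = 4
phi = type_graph(lam)

# Zero-sum condition required by the higher-order CW method.
zero_sum = all(sum(alpha(a, lam)) == 0 for a in phi)

# The shift in the last coordinate cancels in differences.
same_differences = all(
    tuple(p - q for p, q in zip(alpha(x, lam), alpha(y, lam)))
    == tuple(p - q for p, q in zip(x, y))
    for x in phi
    for y in phi
)
print(zero_sum, same_differences)   # True True
```

Each map is either the identity or a translation of the integers, so injectivity holds automatically.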

Let us return to the case $\lambda=(k/2,k/2)$ . To prove Theorem 1.2 via Theorem 1.3 we will show for every large enough even $k\in\mathbb{N}$ and $\Phi=\Phi_{(k/2,k/2)}$ that the right-hand side of (1) is at least 2, using the aforementioned choice of injective maps $\alpha_{1},\ldots,\alpha_{k}$ . In Section 2 we prove that this follows from the following statement, which may be of interest on its own.

Theorem 1.4.

For any large enough even $k\in\mathbb{N}_{\geq 4}$ and subspace $V\subseteq\{x\in\mathbb{F}_{2}^{k}\mathrel{\mathop{\mathchar 58\relax}}x_{k}=0\}\subseteq\mathbb{F}_{2}^{k}$ the inequality

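$$\bigl\lvert\bigl\{(x,y)\in\mathbb{F}_{2}^{k}\times\mathbb{F}_{2}^{k}\colon\lvert x\rvert=\lvert y\rvert=\tfrac{k}{2},\,x-y\in V\bigr\}\bigr\rvert\leq\binom{k-1}{k/2}^{\frac{\dim_{\mathbb{F}_{2}}(V)}{k-2}+1} \tag{3}$$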

holds. Here $\mathinner{\lvert x\rvert}$ denotes the Hamming weight of $x\in\mathbb{F}_{2}^{k}$ .

In Section 3 we prove Theorem 1.4 for low-dimensional $V$ by carefully splitting the left-hand side of (3) into two parts and upper bounding these parts. In Section 4 we prove Theorem 1.4 for high-dimensional $V$ using Fourier analysis, Krawtchouk polynomials and the Kahn–Kalai–Linial (KKL) inequality [KKL88]. We thus prove Theorem 1.4 and hence Theorem 1.2. While in our current proof the tools for the low- and high-dimensional cases are used complementarily, it may be possible that the full Theorem 1.2 can be proven by cleverly using only the low-dimensional tools or only the high-dimensional tools.

1.4. Motivation and background

Our original motivation to study the asymptotic induced matching number of $k$ -graphs comes from a connection to the study of asymptotic properties of tensors. In fact, the interplay in this connection goes both directions. The purpose of this section is to discuss the asymptotic study of tensors and the connection with the asymptotic induced matching number. Reading this section is not required to understand the rest of the paper.

1.4.1. Asymptotic rank and asymptotic subrank of tensors

The asymptotic study of tensors is a field of its own that started with the work of Strassen [Str87, Str88, Str91] in the context of fast matrix multiplication. We begin by introducing two fundamental asymptotic tensor parameters: asymptotic rank and asymptotic subrank.

Let $\mathbb{F}$ be a field. Let $a\in\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}$ and $b\in\mathbb{F}^{m_{1}}\otimes\cdots\otimes\mathbb{F}^{m_{k}}$ be $k$-tensors. We write $a\leq b$ if there are linear maps $A_{i}\colon\mathbb{F}^{m_{i}}\to\mathbb{F}^{n_{i}}$ for $i\in[k]$ such that $a=(A_{1}\otimes\cdots\otimes A_{k})(b)$. For $n\in\mathbb{N}$ let $\{e_{j}\colon j\in[n]\}$ be the standard basis of $\mathbb{F}^{n}$. For $n\in\mathbb{N}$ define the $k$-tensor

$$\langle n\rangle\coloneqq\sum_{i=1}^{n}e_{i}\otimes\cdots\otimes e_{i}\in(\mathbb{F}^{n})^{\otimes k}.$$

The rank of the $k$ -tensor $a$ is defined as $\operatorname{R}(a)\coloneqq\min\{n\in\mathbb{N}\mathrel{\mathop{\mathchar 58\relax}}a\leq\langle n\rangle\}$ . The subrank of the $k$ -tensor $a$ is defined as

$$\operatorname{Q}(a)\coloneqq\max\{n\in\mathbb{N}\colon\langle n\rangle\leq a\}. \tag{4}$$

One can think of tensor rank as a measure of the complexity of a tensor, namely the “cost” of the tensor in terms of the diagonal tensors $\langle n\rangle$ . It has been studied in several contexts, see, e.g., [BCS97, Lan12]. In this language, the subrank is the “value” of the tensor in terms of $\langle n\rangle$ and as such is the natural companion to tensor rank. It has its own applications, which we will elaborate on after having discussed the asymptotic viewpoint.

Writing $a$ and $b$ in the standard basis as $a=\sum_{i}a_{i}\,e_{i_{1}}\otimes\cdots\otimes e_{i_{k}}$ , $b=\sum_{j}b_{j}\,e_{j_{1}}\otimes\cdots\otimes e_{j_{k}}$ , the tensor Kronecker product $a\boxtimes b$ is the $k$ -tensor defined by

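$$a\boxtimes b\coloneqq\sum_{i,j}a_{i}b_{j}\,(e_{i_{1}}\otimes e_{j_{1}})\otimes\cdots\otimes(e_{i_{k}}\otimes e_{j_{k}})\in(\mathbb{F}^{n_{1}}\otimes\mathbb{F}^{m_{1}})\otimes\cdots\otimes(\mathbb{F}^{n_{k}}\otimes\mathbb{F}^{m_{k}}).$$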

In other words, the $k$ -tensor $a\boxtimes b$ is the image of the $2k$ -tensor $a\otimes b$ under the natural regrouping map $\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}\otimes\mathbb{F}^{m_{1}}\otimes\cdots\otimes\mathbb{F}^{m_{k}}\to(\mathbb{F}^{n_{1}}\otimes\mathbb{F}^{m_{1}})\otimes\cdots\otimes(\mathbb{F}^{n_{k}}\otimes\mathbb{F}^{m_{k}})$ . The asymptotic rank of $a$ is defined as $\operatorname{\underaccent{\wtilde}{R}}(a)\coloneqq\lim_{n\to\infty}\operatorname{R}(a^{\boxtimes n})^{1/n}$ and the asymptotic subrank of $a$ is defined as $\operatorname{\underaccent{\wtilde}{Q}}(a)\coloneqq\lim_{n\to\infty}\operatorname{Q}(a^{\boxtimes n})^{1/n}$ . These limits exist and equal the infimum $\inf_{n}\operatorname{R}(a^{\boxtimes n})^{1/n}$ and the supremum $\sup_{n}\operatorname{Q}(a^{\boxtimes n})^{1/n}$ , respectively. This follows from Fekete’s lemma and the fact that $\operatorname{R}(a\boxtimes b)\leq\operatorname{R}(a)\operatorname{R}(b)$ and $\operatorname{Q}(a\boxtimes b)\geq\operatorname{Q}(a)\operatorname{Q}(b)$ .

Tensor rank is known to be hard to compute [Hås90] (the natural tensor rank decision problem is NP-hard). Not much is known about the complexity of computing subrank, asymptotic subrank and asymptotic rank. It is a long-standing open problem in algebraic complexity theory to compute the asymptotic rank of the matrix multiplication tensor. The asymptotic rank of the matrix multiplication tensor corresponds directly to the asymptotic algebraic complexity of matrix multiplication. The asymptotic subrank of 3-tensors also plays a central role in the context of matrix multiplication, for example in recent work on barriers for upper bound methods on the asymptotic rank of the matrix multiplication tensor [CVZ18b, Alm18]. As another example, in combinatorics, the resolution of the cap set problem [EG17, Tao16] can be phrased in terms of the asymptotic subrank of a well-chosen 3-tensor, cf. [CVZ18a], via the general connection to the asymptotic induced matching number that we will review now.

The subrank of $k$ -tensors as defined in (4) and the subrank of $k$ -graphs as defined in Section 1.1 are related as follows. For any $k$ -tensor $a=\sum_{i}a_{i}\,e_{i_{1}}\otimes\cdots\otimes e_{i_{k}}\in\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}$ we define the $k$ -graph $\operatorname{supp}(a)$ as the support of $a$ in the standard basis:

$$\operatorname{supp}(a)\coloneqq\{i\in[n_{1}]\times\cdots\times[n_{k}]\colon a_{i}\neq 0\}.$$

It is readily verified that the subrank of the $k$ -graph $\operatorname{supp}(a)$ is at most the subrank of the $k$ -tensor $a$ , that is, $\operatorname{Q}(\operatorname{supp}(a))\leq\operatorname{Q}(a)$ . The reader may also verify directly that $\operatorname{supp}(a\boxtimes b)=\operatorname{supp}(a)\boxtimes\operatorname{supp}(b)$ . Therefore, the asymptotic subrank of the support of $a$ is at most the asymptotic subrank of the $k$ -tensor $a$ , that is,

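$$\operatorname{\underaccent{\wtilde}{Q}}(\operatorname{supp}(a))\leq\operatorname{\underaccent{\wtilde}{Q}}(a). \tag{5}$$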
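The identity $\operatorname{supp}(a\boxtimes b)=\operatorname{supp}(a)\boxtimes\operatorname{supp}(b)$ can be checked on a toy example. The sketch below stores a $k$-tensor as a dictionary from index tuples to coefficients, which is a representation chosen here for illustration, and uses integer coefficients as a stand-in for a field without zero divisors. The tensors `a` and `b` are arbitrary examples.

```python
from itertools import product

def tensor_kron(a, b):
    """Kronecker product of k-tensors stored as {index tuple: coefficient}."""
    return {
        tuple(zip(i, j)): ai * bj
        for (i, ai), (j, bj) in product(a.items(), b.items())
    }

def supp(a):
    """Support of a tensor: the k-graph of index tuples with nonzero coefficient."""
    return {i for i, coeff in a.items() if coeff != 0}

def graph_kron(phi, psi):
    """Kronecker product of k-graphs, pairing up coordinates."""
    return {tuple(zip(x, y)) for x, y in product(phi, psi)}

# Arbitrary example tensors; the explicit zero entry is outside the support.
a = {(1, 1, 2): 1, (1, 2, 1): -2, (2, 1, 1): 3, (2, 2, 2): 0}
b = {(1, 1, 1): 1, (2, 2, 2): 5}

print(supp(tensor_kron(a, b)) == graph_kron(supp(a), supp(b)))   # True
```

The check works because a product of coefficients is nonzero exactly when both factors are nonzero, which is the only property of the field that the identity uses.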

We can read (5) in two ways. On the one hand, given any $k$ -tensor $a$ we may find lower bounds on $\operatorname{\underaccent{\wtilde}{Q}}(a)$ by finding lower bounds on $\operatorname{\underaccent{\wtilde}{Q}}(\operatorname{supp}(a))$ . On the other hand, given any $k$ -graph $\Phi\subseteq[n_{1}]\times\cdots\times[n_{k}]$ the asymptotic subrank $\operatorname{\underaccent{\wtilde}{Q}}(\Phi)$ is upper bounded by $\operatorname{\underaccent{\wtilde}{Q}}(a)$ for any tensor $a\in\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}$ (over any field $\mathbb{F}$ ) with support equal to $\Phi$ , that is,

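$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\leq\min_{\textnormal{field }\mathbb{F}}\ \min_{\substack{a\in\mathbb{F}^{n_{1}}\otimes\cdots\otimes\mathbb{F}^{n_{k}}\colon\\ \operatorname{supp}(a)=\Phi}}\operatorname{\underaccent{\wtilde}{Q}}(a). \tag{6}$$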

We do not know whether the inequality in (6) can be strict. We will discuss these two directions in the following two sections.

1.4.2. Upper bounds on asymptotic subrank of $k$ -tensors

Let us focus on the task of finding upper bounds on the asymptotic subrank of $k$-tensors. One natural strategy is to construct maps $\phi\colon\{\textnormal{$k$-tensors over $\mathbb{F}$}\}\to\mathbb{R}_{\geq 0}$ that are sub-multiplicative under the tensor Kronecker product $\boxtimes$, normalised on $\langle n\rangle$ to $n$, and monotone under $\leq$, that is, for any $k$-tensors $a$ and $b$ and for any $n\in\mathbb{N}$:

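$$\phi(a\boxtimes b)\leq\phi(a)\phi(b), \tag{7}$$

$$\phi(\langle n\rangle)=n, \tag{8}$$

$$a\leq b\Rightarrow\phi(a)\leq\phi(b). \tag{9}$$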

The reader verifies directly that for any such map $\phi$ the inequality $\operatorname{\underaccent{\wtilde}{Q}}(a)\leq\phi(a)$ holds.

Strassen in [Str91], motivated by the study of the algebraic complexity of matrix multiplication, introduced an infinite family of maps

$$\zeta^{\theta}\colon\{\textnormal{$k$-tensors over $\mathbb{F}$}\}\to\mathbb{R}_{\geq 0}$$

parametrised by probability vectors $\theta\in\mathbb{R}_{\geq 0}^{k}$ , $\sum_{i=1}^{k}\theta_{i}=1$ . The maps $\zeta^{\theta}$ are called the upper support functionals. We will not define them here. Strassen proved that each map $\zeta^{\theta}$ satisfies conditions (7), (8) and (9). Thus

$$\operatorname{\underaccent{\wtilde}{Q}}(a)\leq\min_{\theta}\zeta^{\theta}(a). \tag{10}$$

Tao, motivated by the study of the cap set problem, proved in [Tao16] that subrank is upper bounded by a parameter called slice rank, that is, $\operatorname{Q}(a)\leq\operatorname{slicerank}(a)$ . We do not define slice rank here. While slice rank is easily seen to be normalised on $\langle n\rangle$ and monotone under $\leq$ , slice rank is not sub-multiplicative (see, e.g., [CVZ18c]). However, it still holds that

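$$\operatorname{\underaccent{\wtilde}{Q}}(a)\leq\liminf_{n\to\infty}\operatorname{slicerank}(a^{\boxtimes n})^{1/n}.$$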

It turns out [TS16, CVZ18c] that

$$\limsup_{n\to\infty}\operatorname{slicerank}(a^{\boxtimes n})^{1/n}\leq\min_{\theta}\zeta^{\theta}(a).$$

No examples are known for which this inequality is strict. It is known that for so-called oblique tensors it holds that $\limsup_{n\to\infty}\operatorname{slicerank}(a^{\boxtimes n})^{1/n}=\min_{\theta}\zeta^{\theta}(a)$ [CVZ18c].

1.4.3. Lower bounds on asymptotic subrank of $k$ -graphs

We now consider the task of finding lower bounds on the asymptotic subrank of $k$ -graphs. For $k=3$ the CW method introduced by Coppersmith and Winograd [CW87] and extended by Strassen [Str91] gives the following. Let $\Phi\subseteq V_{1}\times V_{2}\times V_{3}$ be a 3-graph for which there exist injective maps $\alpha_{i}\mathrel{\mathop{\mathchar 58\relax}}V_{i}\to\mathbb{Z}$ such that $\forall a\in\Phi\colon\alpha_{1}(a_{1})+\alpha_{2}(a_{2})+\alpha_{3}(a_{3})=0$ . Then

$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\geq\max_{P\in\mathscr{P}}\min_{i\in[3]}H(P_{i})$$

where $\mathscr{P}$ is the set of probability distributions on $\Phi$ . The inequality

$$\log_{2}\operatorname{\underaccent{\wtilde}{Q}}(\Phi)\leq\max_{P\in\mathscr{P}}\min_{i}H(P_{i})$$

follows from using (5) and using the support functionals as upper bound on the asymptotic subrank of tensors. Thus, the CW method is optimal whenever it can be applied.
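For $k=3$ the bound $\max_{P}\min_{i}H(P_{i})$ can be evaluated for any fixed distribution $P$, which gives a lower bound on the maximum. The sketch below does this for the uniform distribution on an illustrative 3-graph (binary strings of length 3 and weight 1, chosen here and not taken from this section). For that graph the injective maps $\alpha_{1}=\alpha_{2}=\mathrm{id}$ and $\alpha_{3}(x)=x-1$ satisfy the zero-sum condition, so the CW method applies.

```python
from collections import Counter
from math import log2

def marginal_entropies(phi, weights=None):
    """Shannon entropies H(P_1), ..., H(P_k) of the marginals of P on phi.

    If no weights are given, P is taken to be the uniform distribution.
    """
    edges = sorted(phi)
    if weights is None:
        weights = {e: 1 / len(edges) for e in edges}
    k = len(edges[0])
    entropies = []
    for i in range(k):
        marginal = Counter()
        for e in edges:
            marginal[e[i]] += weights[e]
        entropies.append(-sum(p * log2(p) for p in marginal.values() if p > 0))
    return entropies

phi = {(0, 0, 1), (0, 1, 0), (1, 0, 0)}   # illustrative 3-graph
bound = min(marginal_entropies(phi))      # min_i H(P_i) for uniform P
print(round(bound, 4), round(2 ** bound, 4))
```

Each marginal of the uniform distribution is $(2/3,1/3)$, so the printed exponent is the binary entropy of $1/3$. Optimising over non-uniform $P$ would require a numerical maximisation that this sketch does not attempt.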

Theorem 1.3 extends the CW method from $k=3$ to higher-order tensors, that is, $k\geq 4$ . Contrary to the situation for $k=3$ , the lower bound produced by Theorem 1.3 is not known to be tight.

1.4.4. Type tensors

As an investigation of the power of the higher-order CW method (Theorem 1.3) and of the power of the support functionals (Section 1.4.2) we study the asymptotic subrank of the following family of tensors and their support. While we do not have any immediate “application” for these tensors, we feel that they provide enough structure to make progress while still showing interesting behaviour.

Let $\lambda\vdash k$ be an integer partition of $k$ with $n$ nonzero parts. Recall the definition of the $k$ -graph $\Phi_{\lambda}$ from Section 1.1. We define the tensor $T_{\lambda}$ as the $k$ -tensor with support $\Phi_{\lambda}$ and all nonzero coefficients equal to 1, that is,

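$$T_{\lambda}\coloneqq\sum_{s\in\Phi_{\lambda}}e_{s_{1}}\otimes\cdots\otimes e_{s_{k}}\in(\mathbb{F}^{n})^{\otimes k}.$$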

In general, it follows from (5) and evaluating the right-hand side of (10) for $a=T_{\lambda}$ and the uniform $\theta=(1/k,\ldots,1/k)$ that

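$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{\lambda})\leq\operatorname{\underaccent{\wtilde}{Q}}(T_{\lambda})\leq 2^{H(\lambda/k)}.$$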

It was shown in [CVZ18a] that

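$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k-1,1)})=\operatorname{\underaccent{\wtilde}{Q}}(T_{(k-1,1)})=2^{H((1-1/k,1/k))}$$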

for every $k\in\mathbb{N}_{\geq 3}$ using Theorem 1.3. (The same result was essentially obtained in [HX17].) In [CVZ18a] it was moreover shown that

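$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(2,2)})=\operatorname{\underaccent{\wtilde}{Q}}(T_{(2,2)})=2$$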

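using Theorem 1.3. As mentioned before, our main result (Theorem 1.2) is that for any large enough even $k\in\mathbb{N}_{\geq 2}$ we have

$$\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{(k/2,k/2)})=\operatorname{\underaccent{\wtilde}{Q}}(T_{(k/2,k/2)})=2. \tag{12}$$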

We conjecture that (12) holds for all even $k\in\mathbb{N}$ . We numerically verified this up to $k\leq 2000$ . More generally we conjecture that $\operatorname{\underaccent{\wtilde}{Q}}(\Phi_{\lambda})=\operatorname{\underaccent{\wtilde}{Q}}(T_{\lambda})=2^{H(\lambda/k)}$ holds for all partitions $\lambda\vdash k$ , where $H$ denotes the Shannon entropy and $\lambda/k$ denotes the probability vector $(\lambda_{1}/k,\ldots,\lambda_{n}/k)$ .

In quantum information theory, the tensors $T_{(m,n)}$ , when normalized, correspond to so-called Dicke states (see [Dic54, SGDM03, VC15], and, e.g., [BE19]). Namely, in quantum information language, Dicke states are $(m+n)$ -partite pure quantum states given by

[TABLE]

where the sum is over all permutations $\pi$ of the $k=m+n$ parties. Roughly speaking, our result, Theorem 1.2, amounts to an asymptotically optimal $k$ -party stochastic local operations and classical communication (slocc) protocol for the problem of distilling GHZ-type entanglement from a subfamily of the Dicke states. More precisely, letting $\mathrm{GHZ}=\tfrac{1}{\sqrt{2}}(\ket{0}^{\otimes k}+\ket{1}^{\otimes k})$ be the $k$ -party GHZ state, Theorem 1.2 says that for $k$ large enough the maximal rate $\beta$ such that $n$ copies of $D_{(k/2,k/2)}$ can be transformed via slocc to $\beta n-o(n)$ copies of $\mathrm{GHZ}$ equals 1 when $n$ goes to infinity, that is,

[TABLE]

and this rate is optimal.

2. Reduction to counting

We now begin working towards the proof of Theorem 1.2. The goal of this section is to reduce Theorem 1.2 to Theorem 1.4 by applying Theorem 1.3.

Lemma 2.1.

Theorem 1.4 implies Theorem 1.2.

Proof.

We will use the higher-order CW method (Theorem 1.3) to show that Theorem 1.4 implies Theorem 1.2. Let $\Phi=\Phi_{(k/2,k/2)}=\{x\in\{0,1\}^{k}\colon\lvert x\rvert=k/2\}$. Let $\alpha_{1},\ldots,\alpha_{k-1}$ be the identity map $\mathbb{Z}\to\mathbb{Z}$ and let $\alpha_{k}\colon\mathbb{Z}\to\mathbb{Z}\colon x\mapsto x-k/2$. With this definition of $\alpha$, the condition $\sum_{i}\alpha_{i}(a_{i})=0$ from Theorem 1.3 is satisfied for all $a\in\Phi$, since every $a\in\Phi$ has exactly $k/2$ entries equal to 1. As in the statement of Theorem 1.3, for $R\in\mathscr{R}$ let $r(R)$ be the dimension of the $\mathbb{Q}$-vector space

[TABLE]

Let $P$ be the uniform distribution on $\Phi$ . Then Theorem 1.3 gives

[TABLE]

For any $Q\in\mathscr{Q}_{R,(P_{1},\ldots,P_{k})}$ we have that $H(Q)$ is at most the Shannon entropy of the uniform distribution on $R$ . We thus obtain

[TABLE]

It remains to upper bound the maximisation over $R\in\mathscr{R}$ in (13). We define the set

[TABLE]

For $R\in\mathscr{R}$ let $r_{2}(R)$ be the dimension of the $\mathbb{F}_{2}$ -vector space

[TABLE]

By assumption Theorem 1.4 is true. This means

[TABLE]

that is

[TABLE]

For any $R\in\mathscr{R}$ there is a subset $R^{\prime}\subseteq\Phi^{\prime\times 2}$ with $\mathinner{\lvert R\rvert}\leq 2\mathinner{\lvert R^{\prime}\rvert}$ and $r_{2}(R)=r_{2}(R^{\prime})$ . Namely, one constructs $R^{\prime}$ as follows. Without loss of generality $\forall(x,y)\in R\colon x_{1}=y_{1}$ . For every $(x,y)\in R$ , if $x_{1}=y_{1}=1$ , then add $((x_{2},\ldots,x_{k}),(y_{2},\ldots,y_{k}))$ to $R^{\prime}$ , and if $x_{1}=y_{1}=0$ , then add the negated tuple $((1,\ldots,1)-(x_{2},\ldots,x_{k}),(1,\ldots,1)-(y_{2},\ldots,y_{k}))$ to $R^{\prime}$ . Therefore, (14) implies

[TABLE]

that is

[TABLE]

that is

[TABLE]

Combining (15) with (13) and using $r_{2}(R)\leq r(R)$ gives

[TABLE]

This proves the lemma. ∎

3. Case: low dimension

To prove Theorem 1.2 it remains to prove Theorem 1.4. Our proof of Theorem 1.4 is divided into two cases. In this section we prove the low-dimensional case.

Theorem 3.1.

For any even $k\in\mathbb{N}_{\geq 4}$ and subspace $V\subseteq\{x\in\mathbb{F}_{2}^{k} : x_{k}=0\}\subseteq\mathbb{F}_{2}^{k}$ such that $\dim_{\mathbb{F}_{2}}(V)\leq 11k/12$, the inequality

[TABLE]

holds.

We set up some notation. Let $k\in 2\mathbb{N}$ and $\Phi=\{x\in\mathbb{F}_{2}^{k}\mid |x|=k/2\}$. We will think of $\mathbb{F}_{2}^{k-1}$ as the subspace of $\mathbb{F}_{2}^{k}$ where the last component is $0$. We want to prove: for any $V\leq\mathbb{F}_{2}^{k-1}\leq\mathbb{F}_{2}^{k}$ the inequality

[TABLE]

holds for all $r\leq\frac{11k}{12}$ , where $R=\{(x,y)\in\Phi^{2}\mid x-y\in V,\,x_{k}=y_{k}=0\}$ and $r=\dim_{\mathbb{F}_{2}}V$ . The proof is divided into three claims. The first claim is trivial:

Claim 3.2.

Inequality (16) holds when $r=0$ .

Proof.

One verifies directly that (16) becomes an equality when $r=0$ . ∎

We prepare to deal with $r\geq 2$ . Without loss of generality, we may assume that every vector in $V$ has even weight. To upper bound $\mathinner{\lvert R\rvert}$ we introduce the function

$$f(k,m)=\binom{m}{m/2}\binom{k-m-1}{(k-m)/2},$$

which counts the number of pairs $(x,y)\in\Phi^{2}$ such that $x-y$ is an arbitrary but fixed vector with Hamming weight $m$ . This function has the following properties.

Proposition 3.3.

(1) For any even $0<m<k$ we have $f(k,m)=f(k,k-m)$.

(2) $f(k,m)$ strictly decreases in $m$ for even $0\leq m\leq k/2$.

(3) $f(k,0)=\binom{k-1}{k/2-1}=\binom{k-1}{k/2}$.

(4) $f(k,0)\geq f(k,k-2)=f(k,2)\geq f(k,k-4)=f(k,4)\geq\cdots$.

Proof.

Claim (3) is verified directly. For (1) we verify that

[TABLE]

For (2) we verify that

[TABLE]

which is $>1$ when $(m+2)(k-m-1)>(m+1)(k-m)$ , that is, when $k/2-2\geq m$ . Claim (4) follows from (1) and (2). ∎
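Proposition 3.3 can be sanity-checked by brute force for small $k$. The sketch below fixes one difference vector $v$ of weight $m$ with $v_{k}=0$, counts the strings $x$ of weight $k/2$ with $x_{k}=0$ for which $x+v$ again has weight $k/2$, and compares the count with the closed form $\binom{m}{m/2}\binom{k-m-1}{(k-m)/2}$. That closed form is an assumption of this sketch, inferred from the counting description and from property (3); the function names are illustrative.

```python
from itertools import combinations
from math import comb

def f_bruteforce(k, m):
    """Count x with |x| = k/2, x_k = 0, such that x + v also has weight k/2,
    where v is one fixed vector of weight m with v_k = 0."""
    v = set(range(m))                       # ones of v on the first m coordinates
    count = 0
    for ones in combinations(range(k - 1), k // 2):  # x_k = 0
        y = set(ones) ^ v                   # symmetric difference = addition over F_2
        if len(y) == k // 2:
            count += 1
    return count

def f_closed(k, m):
    """Assumed closed form for f(k, m), even m with 0 <= m <= k - 2."""
    return comb(m, m // 2) * comb(k - m - 1, (k - m) // 2)

def check_proposition(k):
    """Check the closed form and properties (1)-(3) for one even k >= 4."""
    assert all(f_bruteforce(k, m) == f_closed(k, m) for m in range(0, k - 1, 2))
    # (1) symmetry f(k, m) = f(k, k - m) for even 0 < m < k
    assert all(f_closed(k, m) == f_closed(k, k - m) for m in range(2, k, 2))
    # (2) strict decrease over even 0 <= m <= k/2
    half = [f_closed(k, m) for m in range(0, k // 2 + 1, 2)]
    assert all(a > b for a, b in zip(half, half[1:]))
    # (3) value at m = 0
    assert f_closed(k, 0) == comb(k - 1, k // 2 - 1) == comb(k - 1, k // 2)
    return True
```

Property (4) then follows from (1) and (2) exactly as in the proof, so it is not tested separately.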

Using the definition of $f(k,m)$ , we can write $|R|$ in (16) as follows: suppose $V$ has $a_{m}$ vectors of weight $m$ , then

[TABLE]
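The decomposition of $|R|$ by the weights of the vectors in $V$ can be illustrated on small instances. The sketch below represents vectors of $\mathbb{F}_{2}^{k-1}$ as bitmasks, computes the weight distribution $(a_{m})_{m}$ of a subspace given by a basis, and compares $\sum_{m}a_{m}f(k,m)$ with a direct count of the pairs $(x,y)$. The closed form used for $f$ is an assumption of this sketch, and vectors of odd weight are given contribution $0$ because two strings of equal weight always differ in an even number of positions.

```python
from itertools import combinations
from math import comb

def span(basis):
    """All F_2-linear combinations of the given bitmask vectors."""
    vecs = {0}
    for b in basis:
        vecs |= {v ^ b for v in vecs}
    return vecs

def f(k, m):
    """Assumed closed form for f(k, m); odd weights contribute nothing."""
    if m % 2:
        return 0
    return comb(m, m // 2) * comb(k - m - 1, (k - m) // 2)

def size_R_by_weights(k, basis):
    """sum_m a_m f(k, m), where a_m = number of weight-m vectors in V."""
    a = {}
    for v in span(basis):
        w = bin(v).count("1")
        a[w] = a.get(w, 0) + 1
    return sum(a_m * f(k, m) for m, a_m in a.items())

def size_R_bruteforce(k, basis):
    """Directly count pairs (x, y), |x| = |y| = k/2, x_k = y_k = 0, x - y in V."""
    V = span(basis)
    layer = [sum(1 << i for i in ones)
             for ones in combinations(range(k - 1), k // 2)]
    return sum(1 for x in layer for y in layer if (x ^ y) in V)
```

For example, with `k = 6` and the basis `[0b00011, 0b00110]`, both functions can be compared directly; the brute-force count is quadratic in $\binom{k-1}{k/2}$ and is meant for small `k` only.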

To get an upper bound on $|R|$ , we fix some even $s\in\{2,\ldots,k/2\}$ and in the terms with $f(k,m)>f(k,s)$ we replace $a_{m}$ by $\binom{k-1}{m}$ , while in the remaining terms we replace $f(k,m)$ by $f(k,s)$ . This gives, using Proposition 3.3 (4),

[TABLE]

Now our goal is to understand for which values of $k,r,s$ the inequality

[TABLE]

holds. In particular, if for every $k$ and $r\leq 11k/12$ , there exists such an $s$ , then (16) and hence Theorem 3.1 holds.

First we replace (20) by a stronger but simpler inequality. Divide both sides of (20) by $\binom{k-1}{k/2-1}$ and bound the right-hand side from below as follows

[TABLE]

Thus (20) is implied by

[TABLE]

Claim 3.4.

Inequality (16) holds for every $k\geq 27$ and $r\in\{2,\ldots,\frac{k}{2\log k}\}$.

Proof.

Let $s=2$ . The left-hand side of (22) equals

[TABLE]

Since $2^{-r}\leq\frac{1}{4}$ , we see that (22) is implied by

[TABLE]

This is equivalent to

[TABLE]

We use that, for $k$ large enough, $\frac{1}{1/4+k/(2(k-1))}\geq 13/10$, $2(k-2)\geq\frac{5}{3}k$, and

[TABLE]

to see that the right-hand side of (25) is at least $k/(2\log k)$ . ∎
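The two explicit inequalities quoted in this proof have thresholds that can be pinned down with exact rational arithmetic. The sketch below (helper names are illustrative) suggests that the first one holds exactly from $k=27$ on, which agrees with the hypothesis $k\geq 27$ of Claim 3.4, and the second one from $k=12$ on. The third inequality used in the proof is not covered here.

```python
from fractions import Fraction

def first_bound_holds(k):
    """1 / (1/4 + k / (2(k-1))) >= 13/10, checked exactly."""
    return 1 / (Fraction(1, 4) + Fraction(k, 2 * (k - 1))) >= Fraction(13, 10)

def second_bound_holds(k):
    """2(k-2) >= (5/3) k, checked exactly."""
    return 2 * (k - 2) >= Fraction(5 * k, 3)

def smallest_k(predicate, start=2, stop=10_000):
    """Smallest k in [start, stop) from which the predicate holds throughout."""
    last_fail = max((k for k in range(start, stop) if not predicate(k)),
                    default=start - 1)
    return last_fail + 1
```

Here `smallest_k` only scans a finite range, so it gives evidence for the threshold rather than a proof.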

We now further simplify the left-hand side of (22) via

[TABLE]

and

[TABLE]

We have the upper bound $\binom{s}{s/2}\leq 2^{s}\sqrt{\frac{2}{\pi s}}$ . In the product of $s/2$ terms, each term is at least $1$ and the largest term is the last one. Since $s\leq k/2$ , we can use $k-s-1\geq k/2-1$ to get

[TABLE]

for all $k\geq 4$. Plugging (26) and (27) into (22), we see that (20) is implied by

[TABLE]

that is, (20) is implied by

[TABLE]

To further upper bound the left-hand side of (30) we use the following lemma, which we will prove later.

Lemma 3.5.

For any even $k$ and $2\leq s\leq k/2$ the following inequality holds:

[TABLE]

Remark 3.6.

Numerics suggest that the optimal constant in the above inequality is $\sqrt{2/\pi}$ instead of $4/\sqrt{\pi}$ .

Assuming that $r$ satisfies

[TABLE]

we have

[TABLE]

where the first inequality used Lemma 3.5, the second inequality used (32), and the third inequality used $\frac{k}{k-s}\leq 2$ (which holds, since $s\leq k/2$ ). Thus, assuming (32), we have that (30) is implied by

[TABLE]

In other words, if there is an $s\geq 24\geq\frac{72}{\pi}=22.9183...$ such that

[TABLE]

then (30) holds. We further upper bound the left-hand side of (35) by

[TABLE]

Hence (35) is implied by

[TABLE]

Claim 3.7.

Inequality (16) holds for $k$ large enough and every $r\in\{\frac{k}{2\log k},\ldots,11k/12\}$ .

Proof.

Use the bound of (37) with $s=2\lfloor k^{\beta}/2\rfloor$ to get that inequality (16) holds for $\beta\in(0,1)$ , $k\geq\max\{24^{1/\beta},2^{1/(1-\beta)}\}$ , and

[TABLE]

Fix $\beta=1-\frac{2\log\log k}{\log k}$ . For this choice of $\beta$ , we have $k\geq 24^{1/\beta}$ for every $k\geq 3500$ and clearly $k\geq 2^{1/(1-\beta)}$ for every $k\geq 3$ , thereby satisfying the requirements for (38). Now observe that

[TABLE]

where the first inequality uses the fact that $h(x)\leq 2x\log\frac{1}{x}$ for every $x\in(0,1/2]$, and the second inequality holds for every $k\geq 13\cdot 10^{12}$. Next, for $k$ large enough

[TABLE]

For very large $k$ , observe that

[TABLE]

Putting together (41) and (39) along with (38), we prove the claim. ∎
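Two of the numerical facts used in the proof of Claim 3.7 can be checked directly: the requirements on $\beta=1-\frac{2\log\log k}{\log k}$ and the entropy bound $h(x)\leq 2x\log\frac{1}{x}$ on $(0,1/2]$. The sketch below assumes that $\log$ denotes the base-2 logarithm and that $h$ is the binary entropy function; both are assumptions of this sketch, and the function names are illustrative. The entropy check runs on a finite grid with a small floating-point tolerance, so it is evidence rather than a proof.

```python
import math

def beta(k):
    """beta = 1 - 2 log log k / log k, with base-2 logarithms (assumed)."""
    lg = math.log2(k)
    return 1 - 2 * math.log2(lg) / lg

def requirements_hold(k):
    """k >= max(24^(1/beta), 2^(1/(1-beta))) with beta in (0, 1)."""
    b = beta(k)
    return 0 < b < 1 and k >= 24 ** (1 / b) and k >= 2 ** (1 / (1 - b))

def h(x):
    """Binary entropy in bits, for 0 < x < 1."""
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def entropy_bound_holds(x, tol=1e-12):
    """h(x) <= 2 x log2(1/x), up to a floating-point tolerance."""
    return h(x) <= 2 * x * math.log2(1 / x) + tol
```

Under these assumptions the requirement fails at `k = 3000` and holds at `k = 3500`, in line with the threshold stated in the proof.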

Proof of Lemma 3.5.

We will make use of the following variant of Stirling’s formula (due to Robbins [Rob55]), valid for all positive integers $n$ :

[TABLE]
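Robbins' refinement of Stirling's formula is commonly stated as $\sqrt{2\pi n}\,(n/e)^{n}e^{1/(12n+1)}<n!<\sqrt{2\pi n}\,(n/e)^{n}e^{1/(12n)}$. The sketch below checks this two-sided bound on a logarithmic scale for small $n$; the function name is illustrative.

```python
import math

def robbins_log_bounds(n):
    """Lower and upper bounds on ln(n!) from Robbins' version of Stirling."""
    base = 0.5 * math.log(2 * math.pi * n) + n * (math.log(n) - 1)
    return base + 1 / (12 * n + 1), base + 1 / (12 * n)

def robbins_holds(n):
    """Compare the bounds with ln(n!) computed from the exact factorial."""
    lower, upper = robbins_log_bounds(n)
    return lower < math.log(math.factorial(n)) < upper
```

Working with logarithms avoids overflow; the gap between the two bounds shrinks like $1/n^{2}$, so double precision is adequate only for moderate $n$.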

First we bound the ratio of the individual terms (assuming $m\neq 0$ ) as

[TABLE]

since the third factor is $1$ and the argument of the exponential is negative if $2\leq m\leq\frac{k}{2}$ .

Now let us turn to the ratio of the sums. Let $0<c_{1}<2c_{1}<c_{2}<\frac{1}{2}$ be fixed constants. Assume first that $2\leq s\leq c_{2}k$ . The denominator can be bounded from below by its last term, while the numerator can be bounded from above as

[TABLE]

where in the first inequality we have used

[TABLE]

for $n+1\leq s/2$ . Combining with (43) we arrive at the estimate

[TABLE]

Now we turn to the case when $c_{2}k\leq s\leq k/2$ . Split the sum in the numerator into two at $m\approx c_{1}k$ . For $m\leq\lfloor c_{1}k\rfloor$ use the simple bound $\binom{k/2}{m/2}^{2}\leq\binom{k}{m}$ , while for $m\geq\lfloor c_{1}k\rfloor+1\geq c_{1}k$ use (43) to get

[TABLE]

Introducing

[TABLE]

the estimate

[TABLE]

follows. The ratio

[TABLE]

is monotonically decreasing in $n$ , therefore, by induction

[TABLE]

whenever $a\leq b$ . Apply this with $a=2\lfloor c_{1}k/2\rfloor$ , $b=s$ and $t=2\lfloor c_{1}k/2\rfloor-m$ to get

[TABLE]

that is,

[TABLE]

We now look for a constant $C$ that satisfies

[TABLE]

when $c_{2}k\leq s\leq k/2$ . Equivalently, we need

[TABLE]

Using $\sqrt{\frac{s}{k}\left(1-\frac{s}{k}\right)}\leq\frac{1}{2}$ and that $2^{k(h(c_{1})-h(c_{2}))}k$ has a global maximum at $k=\frac{1}{\ln 2}\frac{1}{h(c_{2})-h(c_{1})}$ , an upper bound on the left-hand side is

[TABLE]

In particular, with $c_{1}=0.09711\ldots$ and $c_{2}=0.39252\ldots$ we get $C=2.25503\ldots<\frac{4}{\sqrt{\pi}}$ . ∎

4. Case: high dimension

Finally, in this section we consider the remaining high-dimensional case.

Theorem 4.1.

For any large enough even $k\in\mathbb{N}_{\geq 4}$ and subspace $V\subseteq\{x\in\mathbb{F}_{2}^{k} : x_{k}=0\}\subseteq\mathbb{F}_{2}^{k}$ such that $\dim_{\mathbb{F}_{2}}(V)\geq 11(k-1)/12$, the inequality

[TABLE]

holds. Here $|x|$ denotes the Hamming weight of $x\in\mathbb{F}_{2}^{k}$.

4.1. Preliminaries

Our proof of Theorem 4.1 uses Fourier analysis on the Boolean cube $\mathbb{F}_{2}^{n}=\{0,1\}^{n}$ , the Krawchouk polynomials, a consequence of the KKL inequality and some elementary bounds for expressions involving binomial coefficients.

4.1.1. Fourier transform

For $z\in\{0,1\}^{n}$ define the function $\chi_{z}:\{0,1\}^{n}\to\mathbb{R}$ by $\chi_{z}(x)=(-1)^{z\cdot x}$ with $z\cdot x=\sum_{i}z_{i}x_{i}$. These so-called characters form an orthonormal basis of the space of functions $\{0,1\}^{n}\to\mathbb{R}$ with respect to the inner product $\langle f,g\rangle=\frac{1}{2^{n}}\sum_{x}f(x)g(x)$. For a function $f:\{0,1\}^{n}\to\mathbb{R}$ define $\widehat{f}:\{0,1\}^{n}\to\mathbb{R}$ by $\widehat{f}(z)=\langle f,\chi_{z}\rangle=\frac{1}{2^{n}}\sum_{x}f(x)\chi_{z}(x)$. The function $\widehat{f}$ is the Fourier transform of $f$. One verifies that for any functions $f,g:\{0,1\}^{n}\to\mathbb{R}$ we have the identity

$$\sum_{x,y}f(x+y)\,g(x)\,g(y)=2^{2n}\sum_{z}\widehat{f}(z)\,\widehat{g}(z)^{2} \tag{57}$$

with sums over $x,y\in\{0,1\}^{n}$ and $z\in\{0,1\}^{n}$ .
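The identity can be tested exactly on small cubes. The sketch below assumes it has the form $\sum_{x,y}f(x+y)g(x)g(y)=2^{2n}\sum_{z}\widehat{f}(z)\widehat{g}(z)^{2}$, which follows from expanding $f$ in the character basis; this form and the helper names are assumptions of the sketch. It also checks orthonormality of the characters.

```python
from fractions import Fraction
from itertools import product

def cube(n):
    """All points of {0,1}^n as tuples."""
    return list(product((0, 1), repeat=n))

def character(z, x):
    """chi_z(x) = (-1)^(z . x)."""
    return -1 if sum(a * b for a, b in zip(z, x)) % 2 else 1

def fourier(f, n):
    """f_hat(z) = 2^-n * sum_x f(x) chi_z(x), as exact fractions."""
    pts = cube(n)
    return {z: Fraction(sum(f[x] * character(z, x) for x in pts), 2 ** n)
            for z in pts}

def identity_sides(f, g, n):
    """Left- and right-hand side of the assumed identity for dict-valued f, g."""
    pts = cube(n)
    lhs = sum(f[tuple(a ^ b for a, b in zip(x, y))] * g[x] * g[y]
              for x in pts for y in pts)
    f_hat, g_hat = fourier(f, n), fourier(g, n)
    rhs = 4 ** n * sum(f_hat[z] * g_hat[z] ** 2 for z in pts)
    return lhs, rhs

def characters_orthonormal(n):
    """<chi_z, chi_w> = 1 if z == w else 0, for the normalised inner product."""
    pts = cube(n)
    return all(
        Fraction(sum(character(z, x) * character(w, x) for x in pts), 2 ** n)
        == (1 if z == w else 0)
        for z in pts for w in pts)
```

Functions are passed as dictionaries keyed by the points of the cube, and exact fractions avoid any rounding in the comparison.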

4.1.2. Krawchouk polynomials

For $0\leq k\leq n$ define the function

$$K^{n}_{k}:\{0,1\}^{n}\to\mathbb{R}$$

as the sum of the characters $\chi_{z}$ with $z\in\{0,1\}^{n}$ and $|z|=k$, that is

$$K^{n}_{k}(x)=\sum_{z\in\{0,1\}^{n}:\,|z|=k}\chi_{z}(x).$$

The function $K_{k}^{n}(x)$ depends only on the Hamming weight $|x|$ and can thus be interpreted as a function on integers $0\leq t\leq n$ . This function may be written as $K^{n}_{k}(t)=\sum_{j=0}^{k}(-1)^{j}\binom{t}{j}\binom{n-t}{k-j}$ and this defines a real polynomial of degree $k$ , called the $k$ th Krawchouk polynomial. We will use the following expression for the “middle” Krawchouk polynomial for odd $n$ .

Lemma 4.2 (Proposition 4.4 in [Fei16]).

Let $n$ be odd and $t\in\{0,\ldots,n\}$ . Then

[TABLE]
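A closed form for the middle Krawchouk polynomial can be tested against the defining alternating sum. The sketch below checks the candidate expression $K^{n}_{(n-1)/2}(t)=(-1)^{\lfloor t/2\rfloor}\binom{(n-1)/2}{\lfloor t/2\rfloor}\binom{n}{(n-1)/2}\big/\binom{n}{t}$ for odd $n$. The candidate is an assumption of this sketch and may differ in presentation from the statement in [Fei16]; the function names are illustrative.

```python
from fractions import Fraction
from math import comb

def krawchouk(n, k, t):
    """K^n_k(t) = sum_j (-1)^j C(t, j) C(n - t, k - j)."""
    return sum((-1) ** j * comb(t, j) * comb(n - t, k - j) for j in range(k + 1))

def middle_candidate(n, t):
    """Candidate closed form for K^n_{(n-1)/2}(t), n odd (assumed, see lead-in)."""
    half = (n - 1) // 2
    sign = -1 if (t // 2) % 2 else 1
    return Fraction(sign * comb(half, t // 2) * comb(n, half), comb(n, t))

def candidate_matches(n):
    """Compare the candidate with the defining sum for all t in {0, ..., n}."""
    half = (n - 1) // 2
    return all(krawchouk(n, half, t) == middle_candidate(n, t)
               for t in range(n + 1))
```

The comparison uses exact fractions, so agreement on a range of odd `n` is exact rather than approximate.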

We will encounter the Krawchouk polynomials in the following way. For any $0\leq k\leq n$ define the function $w^{n}_{k}:\{0,1\}^{n}\to\{0,1\}$ by $w^{n}_{k}(z)=[|z|=k]$. Then

$$\widehat{w}^{n}_{k}(z)=\frac{1}{2^{n}}K^{n}_{k}(|z|). \tag{58}$$
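The two descriptions of the Krawchouk polynomials, as a sum of characters and as an explicit alternating sum of binomial coefficients, can be compared by brute force, together with the relation $\widehat{w}^{n}_{k}(z)=\frac{1}{2^{n}}K^{n}_{k}(|z|)$. The following is a minimal sketch with illustrative function names.

```python
from fractions import Fraction
from itertools import product
from math import comb

def chi(z, x):
    """Character chi_z(x) = (-1)^(z . x)."""
    return -1 if sum(a * b for a, b in zip(z, x)) % 2 else 1

def K_from_characters(n, k, x):
    """Sum of chi_z(x) over all z of Hamming weight k."""
    return sum(chi(z, x) for z in product((0, 1), repeat=n) if sum(z) == k)

def K_polynomial(n, k, t):
    """K^n_k(t) = sum_j (-1)^j C(t, j) C(n - t, k - j)."""
    return sum((-1) ** j * comb(t, j) * comb(n - t, k - j) for j in range(k + 1))

def w_hat(n, k, z):
    """Fourier transform at z of the indicator of Hamming weight k."""
    total = sum(chi(z, x) for x in product((0, 1), repeat=n) if sum(x) == k)
    return Fraction(total, 2 ** n)
```

Because $z\cdot x$ is symmetric in $z$ and $x$, the sums in `K_from_characters` and `w_hat` coincide up to the factor $2^{-n}$, which is exactly the content of the relation.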

4.1.3. KKL inequality

Let $A\subseteq\{0,1\}^{n}$. The characteristic function $f:\{0,1\}^{n}\to\{0,1\}$ of $A$ is defined by $f(x)=[x\in A]$. Now suppose $A$ is a linear subspace. Let $A^{\perp}\coloneqq\{y\in\{0,1\}^{n} : y\cdot x=0\text{ for all }x\in A\}$ be the orthogonal complement of $A$. The Fourier transform of $f$ is given by

$$\widehat{f}(z)=\frac{|A|}{2^{n}}\,[z\in A^{\perp}]. \tag{59}$$

Indeed, $\widehat{f}(z)=\tfrac{1}{2^{n}}\sum_{x\in A}(-1)^{x\cdot z}$. If $z\in A^{\perp}$, then every term equals $1$ and this is $\tfrac{1}{2^{n}}|A|$. If $z\not\in A^{\perp}$, pick $x_{0}\in A$ with $x_{0}\cdot z=1$; since $x\mapsto x+x_{0}$ permutes $A$, we get $\sum_{x\in A}(-1)^{x\cdot z}=\sum_{x\in A}(-1)^{(x+x_{0})\cdot z}=(-1)\sum_{x\in A}(-1)^{x\cdot z}$, so the sum equals zero.
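The formula for the Fourier transform of the characteristic function of a subspace can be confirmed on a small example. The sketch below builds a subspace from a basis, computes its orthogonal complement by enumeration, and evaluates the transform exactly; the function names are illustrative.

```python
from fractions import Fraction
from itertools import product

def dot(x, y):
    """Inner product over F_2."""
    return sum(a * b for a, b in zip(x, y)) % 2

def span(basis, n):
    """All F_2-linear combinations of the basis vectors (tuples of length n)."""
    vecs = {(0,) * n}
    for b in basis:
        vecs |= {tuple(a ^ c for a, c in zip(v, b)) for v in vecs}
    return vecs

def orthogonal_complement(A, n):
    """All y with y . x = 0 for every x in A."""
    return {y for y in product((0, 1), repeat=n)
            if all(dot(y, x) == 0 for x in A)}

def indicator_fourier(A, n):
    """f_hat(z) = 2^-n * sum_{x in A} (-1)^(x . z) for the indicator f of A."""
    return {z: Fraction(sum((-1) ** dot(x, z) for x in A), 2 ** n)
            for z in product((0, 1), repeat=n)}
```

For a subspace `A`, every value of `indicator_fourier(A, n)` should be either `|A| / 2^n` (on the orthogonal complement) or `0` (elsewhere).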

The following lemma is a consequence of the KKL inequality [KKL88] and can be found in [Mon11].

Lemma 4.3 (KKL inequality).

Let $A\subseteq\{0,1\}^{n}$ be a non-empty subset. Let $f$ be the characteristic function of $A$ . Define $c=n-\log|A|$ . For any integer $1\leq t\leq\ln(2)c$ we have

[TABLE]

with sums over $z\in\{0,1\}^{n}$ .

For any subset $A\subseteq\{0,1\}^{n}$ and integer $0\leq t\leq n$ we denote by $A_{t}$ the set of vectors in $A$ with Hamming weight $t$ .

Corollary 4.4.

Let $V\subseteq\{0,1\}^{n}$ be a subspace and define $c=n-\dim(V)$ . For any integer $1\leq t\leq\ln(2)c$ we have the following upper bound on the number of vectors in $V^{\perp}$ with Hamming weight $t$ and $n-t$ respectively:

[TABLE]

Proof.

Let $f$ be the indicator function of $V$ . Then, using (59) and Lemma 4.3 we get

[TABLE]

and the same for $\bigl|(V^{\perp})_{n-t}\bigr|$. ∎

Example 4.5.

As mentioned in [Mon11] the following example shows that Corollary 4.4 is almost tight. Let $V\subseteq\{0,1\}^{n}$ be the $d$ -dimensional subspace consisting of all bit strings that begin with $n-d$ zeros. Then $V^{\perp}$ is the space of bit strings that end with $d$ zeros. Let $c=n-\dim(V)=n-d$ . Then we can directly compute the lower bound

[TABLE]

while Corollary 4.4 gives for $1\leq t\leq\ln(2)c$ that

[TABLE]
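For the subspace in this example, the number of vectors of Hamming weight $t$ in $V^{\perp}$ is $\binom{c}{t}$, since $V^{\perp}$ consists of the strings supported on the first $c=n-d$ coordinates. The sketch below confirms this by enumeration for small parameters; the function name is illustrative, and the comparison with the bound of Corollary 4.4 is not reproduced here.

```python
from itertools import product
from math import comb

def dual_weight_counts(n, d):
    """Weight distribution of V^perp, where V is the set of strings in
    {0,1}^n that begin with n - d zeros."""
    pts = list(product((0, 1), repeat=n))
    V = [x for x in pts if not any(x[: n - d])]
    perp = [y for y in pts
            if all(sum(a * b for a, b in zip(x, y)) % 2 == 0 for x in V)]
    counts = [0] * (n + 1)
    for y in perp:
        counts[sum(y)] += 1
    return counts
```

The entry at index `t` of the returned list is $|(V^{\perp})_{t}|$.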

4.1.4. Bounds involving binomial coefficients

Lemma 4.6.

Let $n$ be even. If $0\leq m\leq n/3$ , then

[TABLE]

If $1\leq m\leq(n+1)/3$ , then

[TABLE]

Proof.

We expand the binomial coefficients as fractions of factorials:

[TABLE]

where in the last inequality we upper bounded each of the first $m$ terms by $1/2$ and each of the last $m+1$ terms by $(2m+1)/(n-m+1)$ using the assumption $m\leq n/3$ . We do the same for the other inequality:

[TABLE]

where in the last inequality we upper bounded each of the first $m$ terms by $1/2$ and each of the last $m$ terms by $(2m)/(n-m+1)$ using the assumption $1\leq m\leq(n+1)/3$ . ∎

4.2. Proof of Theorem 4.1

Proof of Theorem 4.1.

Let $n\geq 59$ be odd. Let $V\subseteq\{0,1\}^{n}$ be a subspace of dimension at least $11n/12$ . We will prove that

[TABLE]

This proves the theorem. To see this, in the theorem statement, set $k=n+1$, ignore the $(n+1)$th coordinate, and note that the size of $\bigl\{(x,y)\in(\{0,1\}^{n})^{\times 2} : |x|=|y|=\tfrac{n-1}{2},\ x+y\in V\bigr\}$ equals the size of $\bigl\{(x,y)\in(\{0,1\}^{n})^{\times 2} : |x|=|y|=\tfrac{n+1}{2},\ x+y\in V\bigr\}$ via the bijection that flips the bits of $x$ and $y$.
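The bit-flip bijection works because complementing both $x$ and $y$ leaves $x+y$ unchanged while mapping weight $\tfrac{n-1}{2}$ to weight $\tfrac{n+1}{2}$. The sketch below checks the resulting equality of counts for small odd $n$, with vectors stored as bitmasks; the function names and the example subspaces are illustrative.

```python
from itertools import combinations

def span(basis):
    """All F_2-linear combinations of the given bitmask vectors."""
    vecs = {0}
    for b in basis:
        vecs |= {v ^ b for v in vecs}
    return vecs

def count_pairs(n, weight, V):
    """Number of pairs (x, y) with |x| = |y| = weight and x + y in V."""
    layer = [sum(1 << i for i in ones)
             for ones in combinations(range(n), weight)]
    return sum(1 for x in layer for y in layer if (x ^ y) in V)

def flip(x, n):
    """Complement all n bits of x."""
    return x ^ ((1 << n) - 1)
```

For odd `n`, `count_pairs(n, (n - 1) // 2, V)` and `count_pairs(n, (n + 1) // 2, V)` should agree for every subspace `V`.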

Let $f:\{0,1\}^{n}\to\{0,1\}$ be the characteristic function of $V$, that is, $f(x)=[x\in V]$. Recall that we defined the function $w_{k}^{n}:\{0,1\}^{n}\to\{0,1\}$ by $w^{n}_{k}(x)=[|x|=k]$. Using (57) the left-hand side of (60) can be rewritten as

[TABLE]

with sums over $x,y\in\{0,1\}^{n}$ and $z\in\{0,1\}^{n}$ . Since $\widehat{w}^{n}_{k}(z)=\frac{1}{2^{n}}K^{n}_{k}(|z|)$ (see (58)) and $\widehat{f}(z)=\frac{1}{2^{n}}|V|\cdot[z\in V^{\perp}]$ (see (59)) we have

[TABLE]

Recall that $(V^{\perp})_{t}$ denotes the subset of $V^{\perp}$ consisting of vectors with Hamming weight $t$ . We rewrite the right-hand side of (61) as a sum over the Hamming weight $t=|z|\in\{0,\ldots,n\}$ .

[TABLE]

By Lemma 4.2 we have

[TABLE]

which we use to rewrite (62) as

[TABLE]

We assumed that $\dim(V)\geq 11n/12$ . Since the statement of the theorem is directly verified to be true when $\dim(V)=n-1$ we may in addition assume that $\dim(V)<n-1$ . We define $c=n-\dim(V)$ . Then $2\leq c\leq n/12$ . Let

[TABLE]

In Lemma 4.7 and Lemma 4.8 below we will prove the inequalities

[TABLE]

These inequalities show that (63) is upper bounded as follows:

[TABLE]

which proves the theorem. ∎

Lemma 4.7.

Let $n$ be odd. For $2\leq c\leq n/12$ such that $\dim(V)=n-c$ we have

[TABLE]

with

[TABLE]

Proof.

We first upper bound the sum over $t\in[1,\lfloor\ln(2)c\rfloor]\cup[n-\lfloor\ln(2)c\rfloor,n-1]$ and afterwards the sum over the remaining $t$ ’s. We use $\binom{(n-1)/2}{\lfloor t/2\rfloor}=\binom{(n-1)/2}{\lfloor(n-t)/2\rfloor}$ and then apply Corollary 4.4 to get

[TABLE]

We upper bound the sum over even $t$ and the sum over odd $t$ separately. For the even part we use $\lfloor\ln(2)c\rfloor\leq c$ , then use Lemma 4.6 and replace $t$ by $2t$ to get

[TABLE]

We upper bound the sum as follows, using $t\leq c/2$ and $c\leq n/12$ :

[TABLE]

For the odd part we shift $t$ by 1 and use $\lfloor\ln(2)c\rfloor\leq c-1$ , then use Lemma 4.6 to get

[TABLE]

Next we use $t\leq\ln(2)c+1$ and $4\leq 2e$ and we replace $t$ by $2t$ to get

[TABLE]

which again we upper bound with (68). We conclude that (66) is upper bounded by $16c^{2}/n^{2}$ .

To upper bound the sum over the remaining $t$ ’s we use the inequalities

[TABLE]

to get

[TABLE]

This finishes the proof. ∎

Lemma 4.8.

For $n\geq 59$ odd and $2\leq c\leq n/12$ we have

[TABLE]

with

[TABLE]

Proof.

For odd $n$ we have $2^{n}/\sqrt{n}\geq\binom{n}{(n-1)/2}$ and thus

[TABLE]
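The elementary bound $2^{n}/\sqrt{n}\geq\binom{n}{(n-1)/2}$ for odd $n$ can be checked in exact integer arithmetic by squaring both sides. The following is a minimal sketch with an illustrative function name; it checks a finite range only.

```python
from math import comb

def central_bound_holds(n):
    """For odd n: 2^n / sqrt(n) >= C(n, (n-1)/2), i.e. 4^n >= n * C(n, (n-1)/2)^2."""
    return 4 ** n >= n * comb(n, (n - 1) // 2) ** 2
```

Squaring removes the square root, so the comparison involves only integers and is free of rounding error.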

It is thus sufficient to show that for $n\geq 59$ and $2\leq c\leq n/12$ we have $2+f(n,c)\leq 2(\sqrt{n}/2)^{\frac{c-1}{n-1}}$ . One verifies that $2+f(n,2)\leq 2(\sqrt{n}/2)^{\frac{2-1}{n-1}}$ holds for every $n\geq 53$ . We will show that for every $n\geq 59$ the function $f_{n}(c)=2(\sqrt{n}/2)^{\frac{c-1}{n-1}}-(2+f(n,c))$ is increasing in $c$ for $2\leq c\leq n/12$ . We see that the derivative $\frac{\mathrm{d}}{\mathrm{d}c}f_{n}(c)$ equals

[TABLE]

with

[TABLE]

Using $c\leq n/12$ one can verify that $\ln(e^{2}\ln(2)c/n)\leq 0$ so that $g_{n}(c)\leq 0$ . Moreover, using $c\leq n/12$ , $n\geq 59$ and $(\sqrt{n}/2)^{\frac{c-1}{n-1}}\geq 1$ one can verify that

[TABLE]

We conclude that $\frac{\mathrm{d}}{\mathrm{d}c}f_{n}(c)\geq 0$ which proves the lemma. ∎

Acknowledgements

SA is funded by the MIT–IBM Watson AI Lab under the project Machine Learning in Hilbert space. This work was initiated when SA was a part of QuSoft, CWI and was supported by ERC Consolidator Grant QPROGRESS. JZ thanks Florian Speelman, Pjotr Buys and Avi Wigderson for helpful discussions. This work was initiated when JZ was a part of QuSoft, CWI. This material is based upon work supported by the National Science Foundation under Grant No. DMS-1638352 (JZ). This research was supported by the National Research, Development and Innovation Fund of Hungary within the Quantum Technology National Excellence Program (Project Nr. 2017-1.2.1-NKP-2017-00001) and via the research grants K124152, KH 129601 (PV).

Bibliography

[Alm18] Josh Alman. Limits on the Universal Method for Matrix Multiplication. arXiv, 2018. arXiv:1812.08731.
[AS06] Noga Alon and Asaf Shapira. On an extremal hypergraph problem of Brown, Erdős and Sós. Combinatorica, 26(6):627–645, 2006.
[BCC+17a] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, Eric Naslund, William F. Sawin, and Chris Umans. On cap sets and the group-theoretic approach to matrix multiplication. Discrete Anal., 2017. arXiv:1605.06702.
[BCC+17b] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, and Chris Umans. Which groups are amenable to proving exponent two for matrix multiplication? arXiv, 2017. arXiv:1712.02302.
[BCS97] Peter Bürgisser, Michael Clausen, and M. Amin Shokrollahi. Algebraic complexity theory, volume 315 of Grundlehren Math. Wiss. Springer-Verlag, Berlin, 1997.
[BE19] Andreas Bärtschi and Stephan Eidenbenz. Deterministic Preparation of Dicke States. arXiv, 2019. arXiv:1904.07358.
[CVZ18a] Matthias Christandl, Péter Vrana, and Jeroen Zuiddam. Asymptotic tensor rank of graph tensors: beyond matrix multiplication. Comput. Complexity, 2018. arXiv:1609.07476.
[CVZ18b] Matthias Christandl, Péter Vrana, and Jeroen Zuiddam. Barriers for fast matrix multiplication from irreversibility. arXiv, 2018. arXiv:1812.06952.
