The asymptotic induced matching number of hypergraphs: balanced binary strings
Srinivasan Arunachalam, P\'eter Vrana, Jeroen Zuiddam

TL;DR
This paper calculates the asymptotic induced matching number of specific hypergraphs related to balanced binary strings, using advanced algebraic methods, with implications for tensor theory, quantum information, and computational complexity.
Contribution
It introduces a new lower bound for the asymptotic induced matching number of certain hypergraphs using the higher-order Coppersmith-Winograd method.
Findings
Determines the asymptotic induced matching number for hypergraphs of balanced binary strings.
Establishes the asymptotic subrank of tensors supported by these hypergraphs.
Provides an optimal protocol for entanglement distillation in quantum information theory.
Abstract
We compute the asymptotic induced matching number of the -partite -uniform hypergraphs whose edges are the -bit strings of Hamming weight , for any large enough even number . Our lower bound relies on the higher-order extension of the well-known Coppersmith-Winograd method from algebraic complexity theory, which was proven by Christandl, Vrana and Zuiddam. Our result is motivated by the study of the power of this method as well as of the power of the Strassen support functionals (which provide upper bounds on the asymptotic induced matching number), and the connections to questions in tensor theory, quantum information theory and theoretical computer science. Phrased in the language of tensors, as a direct consequence of our result, we determine the asymptotic subrank of any tensor with support given by the aforementioned hypergraphs. In the context of quantum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The asymptotic induced matching number
of hypergraphs: balanced binary strings
Srinivasan Arunachalam
Center for Theoretical Physics, Massachusetts Institute of Technology, 77 Massachusetts Ave, 6-304, Cambridge, MA 02139, USA
,
Péter Vrana
Department of Geometry, Budapest University of Technology and Economics, Egry József u. 1., 1111 Budapest, Hungary
MTA-BME Lendület Quantum Information Theory Research Group
and
Jeroen Zuiddam
Institute for Advanced Study, 1 Einstein Drive, Princeton, NJ 08540, USA
Abstract.
We compute the asymptotic induced matching number of the -partite -uniform hypergraphs whose edges are the -bit strings of Hamming weight , for any large enough even number . Our lower bound relies on the higher-order extension of the well-known Coppersmith–Winograd method from algebraic complexity theory, which was proven by Christandl, Vrana and Zuiddam. Our result is motivated by the study of the power of this method as well as of the power of the Strassen support functionals (which provide upper bounds on the asymptotic induced matching number), and the connections to questions in tensor theory, quantum information theory and theoretical computer science.
Phrased in the language of tensors, as a direct consequence of our result, we determine the asymptotic subrank of any tensor with support given by the aforementioned hypergraphs. In the context of quantum information theory, our result amounts to an asymptotically optimal -party stochastic local operations and classical communication (slocc) protocol for the problem of distilling GHZ-type entanglement from a subfamily of Dicke-type entanglement.
Keywords. -partite -uniform hypergraphs, asymptotic induced matchings, higher-order Coppersmith–Winograd method
1. Introduction
1.1. Problem
We study in this paper an asymptotic parameter of -partite -uniform hypergraphs: the asymptotic induced matching number. For , a -partite -uniform hypergraph, or -graph for short, is a tuple of finite sets together with a subset of their cartesian product:
[TABLE]
Whenever possible we will leave the vertex sets implicit and refer to the -graph by its edge set . For any we use the notation . Let be a -graph. We say a subset of is induced if where for each we define the marginal set . We call a matching if any two distinct elements are distinct in all coordinates, that is, . The subrank111The term subrank originates from an analogous parameter in the theory of tensors, see Section 1.4.1. or induced matching number is defined as the size of the largest subset of that is an induced matching, that is,
[TABLE]
For example, consider the 3-graph . Here is itself an induced matching, and so . Next, let . Now the subset is an induced matching and there is no larger induced matching in , and so .
We define the Kronecker product of two -graphs and as the -graph
[TABLE]
and we naturally define the power . The asymptotic subrank or the asymptotic induced matching number of the -graph is defined as
[TABLE]
This limit exists and equals the supremum by Fekete’s lemma (see, e.g., [PS98, No. 98]).
We study the following basic question:
Problem 1.1**.**
Given what is the value of ?
A priori, for we have the upper bound and therefore holds that , since .
1.1 has been studied for several families of -graphs, in several different contexts: the cap set problem [EG17, Tao16, KSS16, Nor16, Peb16], approaches to fast matrix multiplication [Str91, BCC*+*17a, BCC*+*17b, Saw17], arithmetic removal lemmas [LS18, FLS18], property testing [FK14, HX17], quantum information theory [VC15, VC17], and the general study of asymptotic properties of tensors [TS16, CVZ18a, CVZ18c]. We finally mention the related result of Ruzsa and Szemerédi which says that the largest subset such that is a matching, has size when goes to infinity [RS78], see also [AS06, Equation 2].
1.2. Result
We solve 1.1 for a family of -graphs that are structured but nontrivial. For let be an integer partition of with nonzero parts, that is, and . We define the -graph
[TABLE]
where the expression means that is a permutation of the -tuple
[TABLE]
For example, the partition corresponds to the 2-graph
[TABLE]
and the partition corresponds to the 4-graph
[TABLE]
It was shown in [CVZ18a] that for every where is the Shannon entropy in base 2. As a natural continuation of that work we study for even . Since we have . Clearly, the 2-graph is itself a matching, and so . It was shown in [CVZ18a] that also . Our new result is the following extension:
Theorem 1.2**.**
Let be even and large enough. Then .
In other words, we prove that for every large enough even there is an induced matching of size when goes to infinity.
Moreover, we numerically verified that also holds for all even . We conjecture that for all even . More generally, we conjecture (cf. [VC15] and [CVZ18a, Question 1.3.3]) that equals the Shannon entropy of the probability distribution obtained by normalising the partition . We will discuss further motivation and background in Section 1.4.
1.3. Methods
We prove Theorem 1.2 by applying the higher-order Coppersmith–Winograd (CW) method from [CVZ18a] to the -graph . This method is an extension of the work of Coppersmith and Winograd [CW87] and Strassen [Str91] from the case to the case . It provides a construction of large induced matchings in -graphs via the probabilistic method, and we prove Theorem 1.2 by analysing the size of these induced matchings.
Theorem 1.3** (Higher-order CW method [CVZ18a]).**
Let be a nonempty -graph for which there exist injective maps such that for all the equality
[TABLE]
holds. For any let be the rank over of the matrix with rows
[TABLE]
where . Then
[TABLE]
where the parameters , and are taken over the following domains:
- •
* is the set of probability distributions on *
- •
* is the set of subsets of that are not a subset of and moreover satisfy *
- •
* is the set of probability distributions on with marginal distributions equal to respectively.*
Here for we denote by the marginal probability distributions of on the components respectively, and denotes Shannon entropy.
Let be any integer partition of with nonzero parts. We can apply Theorem 1.3 to the -graph as follows. For every the equality
[TABLE]
holds, since the element occurs times in . Let be identity maps and let . Then, because of (2), . (Note that with this choice of maps we have that equals for every .) Therefore Theorem 1.3 can be applied to obtain a lower bound on for any partition . The difficulty now lies in evaluating the right-hand side of (1).
Let us return to the case . To prove Theorem 1.2 via Theorem 1.3 we will show for every large enough even and that the right-hand side of (1) is at least 2, using the aforementioned choice of injective maps . In Section 2 we prove that this follows from the following statement, which may be of interest on its own.
Theorem 1.4**.**
For any large enough even and subspace the inequality
[TABLE]
holds. Here denotes the Hamming weight of .
In Section 3 we prove Theorem 1.4 for low-dimensional by carefully splitting the left-hand side of (3) into two parts and upper bounding these parts. In Section 4 we prove Theorem 1.4 for high-dimensional using Fourier analysis, Krawchouk polynomials and the Kahn–Kalai–Linial (KKL) inequality [KKL88]. We thus prove Theorem 1.4 and hence Theorem 1.2. While in our current proof the tools for the low- and high-dimensional cases are used complementarily, it may be possible that the full Theorem 1.2 can be proven by cleverly using only the low-dimensional tools or only the high-dimensional tools.
1.4. Motivation and background
Our original motivation to study the asymptotic induced matching number of -graphs comes from a connection to the study of asymptotic properties of tensors. In fact, the interplay in this connection goes both directions. The purpose of this section is to discuss the asymptotic study of tensors and the connection with the asymptotic induced matching number. Reading this section is not required to understand the rest of the paper.
1.4.1. Asymptotic rank and asymptotic subrank of tensors
The asymptotic study of tensors is a field of its own that started with the work of Strassen [Str87, Str88, Str91] in the context of fast matrix multiplication. We begin by introducing two fundamental asymptotic tensor parameters: asymptotic rank and asymptotic subrank.
Let be a field. Let and be -tensors. We write if there are linear maps for such that . For let be the standard basis of . For define the -tensor
[TABLE]
The rank of the -tensor is defined as . The subrank of the -tensor is defined as
[TABLE]
One can think of tensor rank as a measure of the complexity of a tensor, namely the “cost” of the tensor in terms of the diagonal tensors . It has been studied in several contexts, see, e.g., [BCS97, Lan12]. In this language, the subrank is the “value” of the tensor in terms of and as such is the natural companion to tensor rank. It has its own applications, which we will elaborate on after having discussed the asymptotic viewpoint.
Writing and in the standard basis as , , the tensor Kronecker product is the -tensor defined by
[TABLE]
In other words, the -tensor is the image of the -tensor under the natural regrouping map . The asymptotic rank of is defined as and the asymptotic subrank of is defined as . These limits exist and equal the infimum and the supremum , respectively. This follows from Fekete’s lemma and the fact that and .
Tensor rank is known to be hard to compute [Hås90] (the natural tensor rank decision problem is NP-hard). Not much is known about the complexity of computing subrank, asymptotic subrank and asymptotic rank. It is a long-standing open problem in algebraic complexity theory to compute the asymptotic rank of the matrix multiplication tensor. The asymptotic rank of the matrix multiplication tensor corresponds directly to the asymptotic algebraic complexity of matrix multiplication. The asymptotic subrank of 3-tensors also plays a central role in the context of matrix multiplication, for example in recent work on barriers for upper bound methods on the asymptotic rank of the matrix multiplication tensor [CVZ18b, Alm18]. As another example, in combinatorics, the resolution of the cap set problem [EG17, Tao16] can be phrased in terms of the asymptotic subrank of a well-chosen 3-tensor, cf. [CVZ18a], via the general connection to the asymptotic induced matching number that we will review now.
The subrank of -tensors as defined in (4) and the subrank of -graphs as defined in Section 1.1 are related as follows. For any -tensor we define the -graph as the support of in the standard basis:
[TABLE]
It is readily verified that the subrank of the -graph is at most the subrank of the -tensor , that is, . The reader may also verify directly that . Therefore, the asymptotic subrank of the support of is at most the asymptotic subrank of the -tensor , that is,
[TABLE]
We can read (5) in two ways. On the one hand, given any -tensor we may find lower bounds on by finding lower bounds on . On the other hand, given any -graph the asymptotic subrank is upper bounded by for any tensor (over any field ) with support equal to , that is,
[TABLE]
We do not know whether the inequality in (6) can be strict. We will discuss these two directions in the following two sections.
1.4.2. Upper bounds on asymptotic subrank of -tensors
Let us focus on the task of finding upper bounds on the asymptotic subrank of -tensors. One natural strategy is to construct maps {\phi\mathrel{\mathop{\mathchar 58\relax}}\{\textnormal{k\mathbb{F}}\}\to\mathbb{R}_{\geq 0}} that are sub-multiplicative under the tensor Kronecker product , normalised on to , and monotone under , that is, for any -tensors and and for any :
[TABLE]
The reader verifies directly that for any such map the inequality holds.
Strassen in [Str91], motivated by the study of the algebraic complexity of matrix multiplication, introduced an infinite family of maps
[TABLE]
parametrised by probability vectors , . The maps are called the upper support functionals. We will not define them here. Strassen proved that each map satisfies conditions (7), (8) and (9). Thus
[TABLE]
Tao, motivated by the study of the cap set problem, proved in [Tao16] that subrank is upper bounded by a parameter called slice rank, that is, . We do not define slice rank here. While slice rank is easily seen to be normalised on and monotone under , slice rank is not sub-multiplicative (see, e.g., [CVZ18c]). However, it still holds that
[TABLE]
It turns out [TS16, CVZ18c] that
[TABLE]
No examples are known for which this inequality is strict. It is known that for so-called oblique tensors holds [CVZ18c].
1.4.3. Lower bounds on asymptotic subrank of -graphs
We now consider the task of finding lower bounds on the asymptotic subrank of -graphs. For the CW method introduced by Coppersmith and Winograd [CW87] and extended by Strassen [Str91] gives the following. Let be a 3-graph for which there exist injective maps such that . Then
[TABLE]
where is the set of probability distributions on . The inequality
[TABLE]
follows from using (5) and using the support functionals as upper bound on the asymptotic subrank of tensors. Thus, the CW method is optimal whenever it can be applied.
Theorem 1.3 extends the CW method from to higher-order tensors, that is, . Contrary to the situation for , the lower bound produced by Theorem 1.3 is not known to be tight.
1.4.4. Type tensors
As an investigation of the power of the higher-order CW method (Theorem 1.3) and of the power of the support functionals (Section 1.4.2) we study the asymptotic subrank of the following family of tensors and their support. While we do not have any immediate “application” for these tensors, we feel that they provide enough structure to make progress while still showing interesting behaviour.
Let be an integer partition of with nonzero parts. Recall the definition of the -graph from Section 1.1. We define the tensor as the -tensor with support and all nonzero coefficients equal to 1, that is,
[TABLE]
In general, it follows from (5) and evaluating the right-hand side of (10) for and the uniform that
[TABLE]
It was shown in [CVZ18a] that
[TABLE]
for every using Theorem 1.3. (The same result was essentially obtained in [HX17].) In [CVZ18a] it was moreover shown that
[TABLE]
using Theorem 1.3. As mentioned before, our main result (Theorem 1.2) is that for any large enough even holds
[TABLE]
We conjecture that (12) holds for all even . We numerically verified this up to . More generally we conjecture that holds for all partitions , where denotes the Shannon entropy and denotes the probability vector .
In quantum information theory, the tensors , when normalized, correspond to so-called Dicke states (see [Dic54, SGDM03, VC15], and, e.g., [BE19]). Namely, in quantum information language, Dicke states are -partite pure quantum states given by
[TABLE]
where the sum is over all permutations of the parties. Roughly speaking, our result, Theorem 1.2, amounts to an asymptotically optimal -party stochastic local operations and classical communication (slocc) protocol for the problem of distilling GHZ-type entanglement from a subfamily of the Dicke states. More precisely, letting be the -party GHZ state, Theorem 1.2 says that for large enough the maximal rate such that copies of can be transformed via slocc to copies of equals 1 when goes to infinity, that is,
[TABLE]
and this rate is optimal.
2. Reduction to counting
We now begin working towards the proof of Theorem 1.2. The goal of this section is to reduce Theorem 1.2 to Theorem 1.4 by applying Theorem 1.3.
Lemma 2.1**.**
Theorem 1.4* implies Theorem 1.2.*
Proof.
We will use the higher-order CW method Theorem 1.3 to show that Theorem 1.4 implies Theorem 1.2. Let . Let be the identity map and let . With this definition of we have for all satisfied the condition from Theorem 1.3. As in the statement of Theorem 1.3, for let be the dimension of the -vector space
[TABLE]
Let be the uniform distribution on . Then Theorem 1.3 gives
[TABLE]
For any we have that is at most the Shannon entropy of the uniform distribution on . We thus obtain
[TABLE]
It remains to upper bound the maximisation over in (13). We define the set
[TABLE]
For let be the dimension of the -vector space
[TABLE]
By assumption Theorem 1.4 is true. This means
[TABLE]
that is
[TABLE]
For any there is a subset with and . Namely, one constructs as follows. Without loss of generality . For every , if , then add to , and if , then add the negated tuple to . Therefore, (14) implies
[TABLE]
that is
[TABLE]
that is
[TABLE]
Combining (15) with (13) and using gives
[TABLE]
This proves the lemma. ∎
3. Case: low dimension
To prove Theorem 1.2 it remains to prove Theorem 1.4. Our proof of Theorem 1.4 is divided into two cases. In this section we prove the low-dimensional case.
Theorem 3.1**.**
For any even and subspace such that , the inequality
[TABLE]
holds.
We set up some notation. Let and . We will think of as the subspace where the last component is [math]. We want to prove: for any the inequality
[TABLE]
holds for all , where and . The proof is divided into three claims. The first claim is trivial:
Claim 3.2**.**
Inequality (16) holds when .
Proof.
One verifies directly that (16) becomes an equality when . ∎
We prepare to deal with . Without loss of generality, we may assume that every vector in has even weight. To upper bound we introduce the function
[TABLE]
which counts the number of pairs such that is an arbitrary but fixed vector with Hamming weight . This function has the following properties.
Proposition 3.3**.**
- (1)
For any even holds . 2. (2)
* strictly decreases in for even .* 3. (3)
. 4. (4)
**
Proof.
Claim (3) one verifies directly. For (1) we verify that
[TABLE]
For (2) we verify that
[TABLE]
which is when , that is, when . Claim (4) follows from (1) and (2). ∎
Using the definition of , we can write in (16) as follows: suppose has vectors of weight , then
[TABLE]
To get an upper bound on , we fix some even and in the terms with we replace by , while in the remaining terms we replace by . This gives, using Proposition 3.3 (4),
[TABLE]
Now our goal is to understand for which values of the inequality
[TABLE]
holds. In particular, if for every and , there exists such an , then (16) and hence Theorem 3.1 holds.
First we replace (20) by a stronger but simpler inequality. Divide both sides of (20) by and bound the right-hand side from below as follows
[TABLE]
Thus (20) is implied by
[TABLE]
Claim 3.4**.**
Inequality (16) holds for every , and
Proof.
Let . The left-hand side of (22) equals
[TABLE]
Since , we see that (22) is implied by
[TABLE]
This is equivalent to
[TABLE]
We use that for large enough holds , , and
[TABLE]
to see that the right-hand side of (25) is at least . ∎
We now further simplify the left-hand side of (22) via
[TABLE]
and
[TABLE]
We have the upper bound . In the product of terms, each term is at least and the largest term is the last one. Since , we can use to get
[TABLE]
for all . Plugging in (26),(27) into (22), we see that (20) is implied by
[TABLE]
that is, (20) is implied by
[TABLE]
To further upper bound the left-hand side of (30) we use the following lemma, which we will prove later.
Lemma 3.5**.**
For any even and the following inequality holds:
[TABLE]
Remark 3.6**.**
Numerics suggest that the optimal constant in the above inequality is instead of .
Assuming that satisfies
[TABLE]
we have
[TABLE]
where the first inequality used Lemma 3.5, the second inequality used (32), and the third inequality used (which holds, since ). Thus, assuming (32), we have that (30) is implied by
[TABLE]
In other words, if there is an such that
[TABLE]
then (30) holds. We further upper bound the left-hand side of (35) by
[TABLE]
Hence (35) is implied by
[TABLE]
Claim 3.7**.**
Inequality (16) holds for large enough and every .
Proof.
Use the bound of (37) with to get that inequality (16) holds for , , and
[TABLE]
Fix . For this choice of , we have for every and clearly for every , thereby satisfying the requirements for (38). Now observe that
[TABLE]
where the first inequality uses the fact that for every holds , and the second inequality holds for every . Next, for large enough
[TABLE]
For very large , observe that
[TABLE]
Putting together (41) and (39) along with (38), we prove the claim. ∎
Proof of Lemma 3.5.
We will make use of the following variant of Stirling’s formula (due to Robbins [Rob55]), valid for all positive integers :
[TABLE]
First we bound the ratio of the individual terms (assuming ) as
[TABLE]
since the third factor is and the argument of the exponential is negative if .
Now let us turn to the ratio of the sums. Let be fixed constants. Assume first that . The denominator can be bounded from below by its last term, while the numerator can be bounded from above as
[TABLE]
where in the first inequality we have used
[TABLE]
for . Combining with (43) we arrive at the estimate
[TABLE]
Now we turn to the case when . Split the sum in the numerator into two at . For use the simple bound , while for use (43) to get
[TABLE]
Introducing
[TABLE]
The estimate
[TABLE]
follows. The ratio
[TABLE]
is monotonically decreasing in , therefore, by induction
[TABLE]
whenever . Apply this with , and to get
[TABLE]
that is,
[TABLE]
We now look for a constant that satisfies
[TABLE]
when . Equivalently, we need
[TABLE]
Using and that has a global maximum at , an upper bound on the left-hand side is
[TABLE]
In particular, with and we get . ∎
4. Case: high dimension
Finally, in this section we consider the remaining high-dimensional case.
Theorem 4.1**.**
For any large enough even and subspace such that , the inequality
[TABLE]
holds. Here denotes the Hamming weight of .
4.1. Preliminaries
Our proof of Theorem 4.1 uses Fourier analysis on the Boolean cube , the Krawchouk polynomials, a consequence of the KKL inequality and some elementary bounds for expressions involving binomial coefficients.
4.1.1. Fourier transform
For define the function by with . These so-called characters form an orthonormal basis for the space of functions for the inner product . For a function define by . The function is the Fourier transform of . One verifies that for any functions we have the identity
[TABLE]
with sums over and .
4.1.2. Krawchouk polynomials
For define the function
[TABLE]
as the sum of the characters with and , that is
[TABLE]
The function depends only on the Hamming weight and can thus be interpreted as a function on integers . This function may be written as and this defines a real polynomial of degree , called the th Krawchouk polynomial. We will use the following expression for the “middle” Krawchouk polynomial for odd .
Lemma 4.2** (Proposition 4.4 in [Fei16]).**
Let be odd and . Then
[TABLE]
We will encounter the Krawchouk polynomials in the following way. For any define the function by . Then
[TABLE]
4.1.3. KKL inequality
Let . The characteristic function of is defined by . Now suppose is a linear subspace. Let A^{\perp}\coloneqq\{y\in\{0,1\}^{n}\mathrel{\mathop{\mathchar 58\relax}}\textnormal{y\cdot x=0x\in A}\} be the orthogonal complement of . The Fourier transform of is given by
[TABLE]
Indeed, and, if , then this sum equals ; if , say , then so the sum equals zero.
The following lemma is a consequence of the KKL inequality [KKL88] and can be found in [Mon11].
Lemma 4.3** (KKL inequality).**
Let be a non-empty subset. Let be the characteristic function of . Define . For any integer we have
[TABLE]
with sums over .
For any subset and integer we denote by the set of vectors in with Hamming weight .
Corollary 4.4**.**
Let be a subspace and define . For any integer we have the following upper bound on the number of vectors in with Hamming weight and respectively:
[TABLE]
Proof.
Let be the indicator function of . Then, using (59) and Lemma 4.3 we get
[TABLE]
and the same for \mathinner{\!\bigl{\lvert}(V^{\perp})_{n-t}\bigr{\rvert}}. ∎
Example 4.5**.**
As mentioned in [Mon11] the following example shows that Corollary 4.4 is almost tight. Let be the -dimensional subspace consisting of all bit strings that begin with zeros. Then is the space of bit strings that end with zeros. Let . Then we can directly compute the lower bound
[TABLE]
while Corollary 4.4 gives for that
[TABLE]
4.1.4. Bounds involving binomial coefficients
Lemma 4.6**.**
Let be even. If , then
[TABLE]
If , then
[TABLE]
Proof.
We expand the binomial coefficients as fractions of factorials:
[TABLE]
where in the last inequality we upper bounded each of the first terms by and each of the last terms by using the assumption . We do the same for the other inequality:
[TABLE]
where in the last inequality we upper bounded each of the first terms by and each of the last terms by using the assumption . ∎
4.2. Proof of Theorem 4.1
Proof of Theorem 4.1.
Let be odd. Let be a subspace of dimension at least . We will prove that
[TABLE]
This proves the theorem. To see this, in the theorem statement, set , ignore the th coordinate, and note that the size of \bigl{\{}(x,y)\in(\{0,1\}^{n})^{\times 2}\mathrel{\mathop{\mathchar 58\relax}}|x|=|y|=\tfrac{n-1}{2},x+y\in V\bigr{\}} equals the size of \bigl{\{}(x,y)\in(\{0,1\}^{n})^{\times 2}\mathrel{\mathop{\mathchar 58\relax}}|x|=|y|=\tfrac{n+1}{2},x+y\in V\bigr{\}} via the bijection that flips the bits of and .
Let be the characteristic function of , that is, . Recall that we defined the function by . Using (57) the left-hand side of (60) can be rewritten as
[TABLE]
with sums over and . Since (see (58)) and (see (59)) we have
[TABLE]
Recall that denotes the subset of consisting of vectors with Hamming weight . We rewrite the right-hand side of (61) as a sum over the Hamming weight .
[TABLE]
By Lemma 4.2 we have
[TABLE]
which we use to rewrite (62) as
[TABLE]
We assumed that . Since the statement of the theorem is directly verified to be true when we may in addition assume that . We define . Then . Let
[TABLE]
In Lemma 4.7 and Lemma 4.8 below we will prove the inequalities
[TABLE]
These inequalities show that (63) is upper bounded as follows:
[TABLE]
which proves the theorem. ∎
Lemma 4.7**.**
Let be odd. For such that we have
[TABLE]
with
[TABLE]
Proof.
We first upper bound the sum over and afterwards the sum over the remaining ’s. We use and then apply Corollary 4.4 to get
[TABLE]
We upper bound the sum over even and the sum over odd separately. For the even part we use , then use Lemma 4.6 and replace by to get
[TABLE]
We upper bound the sum as follows, using and :
[TABLE]
For the odd part we shift by 1 and use , then use Lemma 4.6 to get
[TABLE]
Next we use and and we replace by to get
[TABLE]
which again we upper bound with (68). We conclude that (66) is upper bounded by .
To upper bound the sum over the remaining ’s we use the inequalities
[TABLE]
to get
[TABLE]
This finishes the proof. ∎
Lemma 4.8**.**
For odd and we have
[TABLE]
with
[TABLE]
Proof.
For odd we have and thus
[TABLE]
It is thus sufficient to show that for and we have . One verifies that holds for every . We will show that for every the function is increasing in for . We see that the derivative equals
[TABLE]
with
[TABLE]
Using one can verify that so that . Moreover, using , and one can verify that
[TABLE]
We conclude that which proves the lemma. ∎
Acknowledgements
SA is funded by the MIT–IBMWatson AI Lab under the project Machine Learning in Hilbert space. This work was initiated when SA was a part of QuSoft, CWI and was supported by ERC Consolidator Grant QPROGRESS. JZ thanks Florian Speelman, Pjotr Buys and Avi Wigderson for helpful discussions. This work was initiated when JZ was a part of QuSoft, CWI. This material is based upon work supported by the National Science Foundation under Grant No. DMS-1638352 (JZ). This research was supported by the National Research, Development and Innovation Fund of Hungary within the Quantum Technology National Excellence Program (Project Nr. 2017-1.2.1-NKP-2017-00001) and via the research grants K124152, KH 129601 (PV).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Alm 18] Josh Alman. Limits on the Universal Method for Matrix Multiplication . ar Xiv , 2018. ar Xiv:1812.08731 .
- 2[AS 06] Noga Alon and Asaf Shapira. On an extremal hypergraph problem of Brown, Erdős and Sós. Combinatorica , 26(6):627–645, 2006.
- 3[BCC + 17a] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, Eric Naslund, William F. Sawin, and Chris Umans. On cap sets and the group-theoretic approach to matrix multiplication . Discrete Anal. , 2017. ar Xiv:1605.06702 . · doi ↗
- 4[BCC + 17b] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A Grochow, and Chris Umans. Which groups are amenable to proving exponent two for matrix multiplication? ar Xiv , 2017. ar Xiv:1712.02302 .
- 5[BCS 97] Peter Bürgisser, Michael Clausen, and M. Amin Shokrollahi. Algebraic complexity theory , volume 315 of Grundlehren Math. Wiss. Springer-Verlag, Berlin, 1997. · doi ↗
- 6[BE 19] Andreas Bärtschi and Stephan Eidenbenz. Deterministic Preparation of Dicke States . ar Xiv , 2019. ar Xiv:1904.07358 .
- 7[CVZ 18a] Matthias Christandl, Péter Vrana, and Jeroen Zuiddam. Asymptotic tensor rank of graph tensors: beyond matrix multiplication . Comput. Complexity , 2018. ar Xiv:1609.07476 . · doi ↗
- 8[CVZ 18b] Matthias Christandl, Péter Vrana, and Jeroen Zuiddam. Barriers for fast matrix multiplication from irreversibility . ar Xiv , 2018. ar Xiv:1812.06952 .
