Direct Sum Testing: The General Case

Irit Dinur; Konstantin Golubev

arXiv:1904.12747·cs.CC·October 11, 2019

Direct Sum Testing: The General Case

Irit Dinur, Konstantin Golubev

PDF

TL;DR

This paper introduces a 4-query test to efficiently distinguish direct sum functions from those far from such functions, extending linearity testing to higher dimensions and tensor products.

Contribution

It presents a novel 4-query test for direct sums, generalizing the BLR linearity test and agreement tests to higher-dimensional tensor product functions.

Findings

01

The test distinguishes direct sums from far functions with high probability.

02

The approach extends linearity testing to tensor product structures.

03

An alternative, simpler test with up to (d+2) queries is also proposed.

Abstract

A function $f : [n_{1}] \times \dots \times [n_{d}] \to F_{2}$ is a direct sum if it is of the form $f (a_{1}, \dots, a_{d}) = f_{1} (a_{1}) \oplus \dots \oplus f_{d} (a_{d}),$ for some $d$ functions $f_{i} : [n_{i}] \to F_{2}$ for all $i = 1, \dots, d$ , and where $n_{1}, \dots, n_{d} \in N$ . We present a $4$ -query test which distinguishes between direct sums and functions that are far from them. The test relies on the BLR linearity test (Blum, Luby, Rubinfeld, 1993) and on an agreement test which slightly generalizes the direct product test (Dinur, Steurer, 2014). In multiplicative $\pm 1$ notation, our result reads as follows. A $d$ -dimensional tensor with $\pm 1$ entries is called a tensor product if it is a tensor product of $d$ vectors with $\pm 1$ entries, or equivalently, if it is of rank $1$ . The presented tests can be read as tests for distinguishing between tensor products and tensors that…

Equations114

D i r ec tS u m_{[\overline{n}; d]} = {f_{1} \oplus \dots \oplus f_{d} ∣ f_{i} : [n_{i}] \to F_{2}, i = 1, \dots, d} .

D i r ec tS u m_{[\overline{n}; d]} = {f_{1} \oplus \dots \oplus f_{d} ∣ f_{i} : [n_{i}] \to F_{2}, i = 1, \dots, d} .

f (a_{1}, \dots, a_{d}) = g (a_{1}) \oplus g (a_{2}) \oplus \dots \oplus g (a_{d}) .

f (a_{1}, \dots, a_{d}) = g (a_{1}) \oplus g (a_{2}) \oplus \dots \oplus g (a_{d}) .

f (a) \oplus f (a_{S} b) \oplus f (a_{T} b) \oplus f (a_{U} b) = 0.

f (a) \oplus f (a_{S} b) \oplus f (a_{T} b) \oplus f (a_{U} b) = 0.

dist (f, D i r ec tS u m_{[\overline{n}; d]}) \leq c \cdot a, b, S, T Pr [f (a) \oplus f (a_{S} b) \oplus f (a_{T} b) \oplus f (a_{S △ T} b) \neq = 0]

dist (f, D i r ec tS u m_{[\overline{n}; d]}) \leq c \cdot a, b, S, T Pr [f (a) \oplus f (a_{S} b) \oplus f (a_{T} b) \oplus f (a_{S △ T} b) \neq = 0]

h = h_{1} \otimes \dots \otimes h_{d} .

h = h_{1} \otimes \dots \otimes h_{d} .

T e n sor P r o d u c t_{[\overline{n}; d]} = {h_{1} \otimes \dots \otimes h_{d} ∣ h_{i} : [n_{i}] \to {- 1, 1},, i = 1, \dots, d}

T e n sor P r o d u c t_{[\overline{n}; d]} = {h_{1} \otimes \dots \otimes h_{d} ∣ h_{i} : [n_{i}] \to {- 1, 1},, i = 1, \dots, d}

dist (h, T e n sor P r o d u c t_{[\overline{n}; d]}) \leq c \cdot a, b, S, T Pr [h (a) \cdot h (a_{S} b) \cdot h (a_{T} b) \cdot h (a_{S △ T} b) \neq = 1] .

dist (h, T e n sor P r o d u c t_{[\overline{n}; d]}) \leq c \cdot a, b, S, T Pr [h (a) \cdot h (a_{S} b) \cdot h (a_{T} b) \cdot h (a_{S △ T} b) \neq = 1] .

ρ_{a, b} (x)_{i} = ⎩ ⎨ ⎧ a_{i} = b_{i}, b_{i}, a_{i}, i \neq \in Δ (a, b); i \in Δ (a, b) and x_{i} = 1; i \in Δ (a, b) and x_{i} = 0;

ρ_{a, b} (x)_{i} = ⎩ ⎨ ⎧ a_{i} = b_{i}, b_{i}, a_{i}, i \neq \in Δ (a, b); i \in Δ (a, b) and x_{i} = 1; i \in Δ (a, b) and x_{i} = 0;

g (x) = c \oplus i \in S ⨁ x_{i} .

g (x) = c \oplus i \in S ⨁ x_{i} .

g (0) \oplus g (x) \oplus g (y) \oplus g (x \oplus y) = 0.

g (0) \oplus g (x) \oplus g (y) \oplus g (x \oplus y) = 0.

x Pr [g (x) = (h_{1} (x), h_{2} (x), \dots, h_{k} (x))] \geq 1 - O (ε) .

x Pr [g (x) = (h_{1} (x), h_{2} (x), \dots, h_{k} (x))] \geq 1 - O (ε) .

x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr (χ_{S} (x) = 0) > \frac{2}{3},

x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr (χ_{S} (x) = 0) > \frac{2}{3},

x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr (χ_{S} (x) = 0) = x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr ((- 1)^{χ_{S} (x)} = 1) .

x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr (χ_{S} (x) = 0) = x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr ((- 1)^{χ_{S} (x)} = 1) .

\frac{1}{3} < 2 x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr ((- 1)^{χ_{S} (x)} = 1) - 1 = E_{x \sim μ_{\nicefrac 23} (F_{2}^{D})} (- 1)^{χ_{S} (x)} =

\frac{1}{3} < 2 x \sim μ_{\nicefrac 23} (F_{2}^{D}) Pr ((- 1)^{χ_{S} (x)} = 1) - 1 = E_{x \sim μ_{\nicefrac 23} (F_{2}^{D})} (- 1)^{χ_{S} (x)} =

i \in [D] \prod E_{x_{i} \sim μ_{\nicefrac 23} (F_{2})} (- 1)^{x_{i}} = (- \frac{1}{3})^{∣ S ∣} = (\frac{1}{3})^{∣ S ∣},

i \in [D] \prod E_{x_{i} \sim μ_{\nicefrac 23} (F_{2})} (- 1)^{x_{i}} = (- \frac{1}{3})^{∣ S ∣} = (\frac{1}{3})^{∣ S ∣},

a, b \sim [\overline{n}; d] x, y \sim C_{a, b} Pr (f_{a, b} (0) \oplus f_{a, b} (x) \oplus f_{a, b} (y) \oplus f_{a, b} (x \oplus y) = 0) > 1 - ε,

a, b \sim [\overline{n}; d] x, y \sim C_{a, b} Pr (f_{a, b} (0) \oplus f_{a, b} (x) \oplus f_{a, b} (y) \oplus f_{a, b} (x \oplus y) = 0) > 1 - ε,

b \sim [\overline{n}; d] x, y \sim C_{a, b} Pr (f_{a, b} (0) \oplus f_{a, b} (x) \oplus f_{a, b} (y) \oplus f_{a, b} (x \oplus y) = 0) > 1 - ε .

b \sim [\overline{n}; d] x, y \sim C_{a, b} Pr (f_{a, b} (0) \oplus f_{a, b} (x) \oplus f_{a, b} (y) \oplus f_{a, b} (x \oplus y) = 0) > 1 - ε .

x, y \sim C_{b} Pr (f_{b} (0) \oplus f_{b} (x) \oplus f_{b} (y) \oplus f_{b} (x \oplus y) = 0) = 1 - ε_{b} .

x, y \sim C_{b} Pr (f_{b} (0) \oplus f_{b} (x) \oplus f_{b} (y) \oplus f_{b} (x \oplus y) = 0) = 1 - ε_{b} .

x \sim C_{b} Pr (f_{b} (x) = χ_{S (b)} (x)) = 1 - ε_{b} .

x \sim C_{b} Pr (f_{b} (x) = χ_{S (b)} (x)) = 1 - ε_{b} .

b_{i}^{'} = {b_{i}, chosen uniformly at random from [n] ∖ {b_{i}}, w.p. \nicefrac 34; w.p. \nicefrac 14 .

b_{i}^{'} = {b_{i}, chosen uniformly at random from [n] ∖ {b_{i}}, w.p. \nicefrac 34; w.p. \nicefrac 14 .

{b_{i} = b_{i}^{'} chosen uniformly from [n], b_{i} \neq = b_{i}^{'} both chosen uniformly from [n] w.p. \nicefrac 34; w.p. \nicefrac 14 .

{b_{i} = b_{i}^{'} chosen uniformly from [n], b_{i} \neq = b_{i}^{'} both chosen uniformly from [n] w.p. \nicefrac 34; w.p. \nicefrac 14 .

x_{i} = ⎩ ⎨ ⎧ 0, 0, b_{i} = b_{i}^{'} w.p. \nicefrac 13; w.p. \nicefrac 23 . if i \in Δ (b, b^{'}); if i \neq \in Δ (b, b^{'}) .

x_{i} = ⎩ ⎨ ⎧ 0, 0, b_{i} = b_{i}^{'} w.p. \nicefrac 13; w.p. \nicefrac 23 . if i \in Δ (b, b^{'}); if i \neq \in Δ (b, b^{'}) .

ε_{b, b^{'}} = x \sim D_{b, b^{'}} Pr (f (x) \neq = χ_{F (b) (x)}) .

ε_{b, b^{'}} = x \sim D_{b, b^{'}} Pr (f (x) \neq = χ_{F (b) (x)}) .

ε_{b} = x \sim C_{b} Pr (f (x) \neq = χ_{F (b) (x)}) = E_{b^{'} \sim D (b)} ε_{b, b^{'}} .

ε_{b} = x \sim C_{b} Pr (f (x) \neq = χ_{F (b) (x)}) = E_{b^{'} \sim D (b)} ε_{b, b^{'}} .

x_{i} = {0, b_{i} w.p. \nicefrac 12; w.p. \nicefrac 12,

x_{i} = {0, b_{i} w.p. \nicefrac 12; w.p. \nicefrac 12,

b \sim [\overline{n}; d] b^{'} \sim D (b) Pr (ε_{b, b^{'}} + ε_{b^{'}, b} > \frac{1}{3}) < 6 ε

b \sim [\overline{n}; d] b^{'} \sim D (b) Pr (ε_{b, b^{'}} + ε_{b^{'}, b} > \frac{1}{3}) < 6 ε

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} ε_{b, b^{'}} = E_{b \sim [\overline{n}; d]} ε_{b} = ε .

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} ε_{b, b^{'}} = E_{b \sim [\overline{n}; d]} ε_{b} = ε .

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} ε_{b^{'}, b} = E_{b^{'} \sim D (b)} E_{b \sim [\overline{n}; d]} ε_{b^{'}, b} = ε .

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} ε_{b^{'}, b} = E_{b^{'} \sim D (b)} E_{b \sim [\overline{n}; d]} ε_{b^{'}, b} = ε .

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} (ε_{b, b^{'}} + ε_{b^{'}, b}) = 2 ε,

E_{b \sim [\overline{n}; d]} E_{b^{'} \sim D (b)} (ε_{b, b^{'}} + ε_{b^{'}, b}) = 2 ε,

x \sim D_{b, b^{'}} Pr (χ_{F (b)} (x) = χ_{F (b^{'})} (x)) > 1 - (ε_{b, b^{'}} + ε_{b^{'}, b}) .

x \sim D_{b, b^{'}} Pr (χ_{F (b)} (x) = χ_{F (b^{'})} (x)) > 1 - (ε_{b, b^{'}} + ε_{b^{'}, b}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\DeclareCaptionType

ctest[Test]

Direct Sum Testing:

The General Case

Irit Dinur

The Weizmann Institute of Science, Israel

Konstantin Golubev

ETH Zurich, Switzerland

Abstract

A function $f:[n_{1}]\times\dots\times[n_{d}]\to\mathbb{F}_{2}$ is a direct sum if it is of the form $f\left(a_{1},\dots,a_{d}\right)=f_{1}(a_{1})\oplus\dots\oplus f_{d}(a_{d}),$ for some $d$ functions $f_{i}:[n_{i}]\to\mathbb{F}_{2}$ for all $i=1,\dots,d$ , and where $n_{1},\dots,n_{d}\in\mathbb{N}$ . We present a $4$ -query test which distinguishes between direct sums and functions that are far from them. The test relies on the BLR linearity test (Blum, Luby, Rubinfeld, 1993) and on an agreement test which slightly generalizes the direct product test (Dinur, Steurer, 2014).

In multiplicative $\pm 1$ notation, our result reads as follows. A $d$ -dimensional tensor with $\pm 1$ entries is called a tensor product if it is a tensor product of $d$ vectors with $\pm 1$ entries, or equivalently, if it is of rank $1$ . The presented tests can be read as tests for distinguishing between tensor products and tensors that are far from being tensor products.

We also present a different test, which queries the function at most $(d+2)$ times, but is easier to analyze.

1 Introduction

Let us first fix some notations and definitions. By $[n]$ we mean the set $\{0,1,2,\dots,n\}$ . For $d$ positive integers $n_{1},\dots,n_{d}$ , we denote $[\overline{n};d]=[n_{1}]\times\dots\times[n_{d}]$ . For two functions $F,G:X\to Y$ , we denote by ${\rm dist}(F,G)$ the relative Hamming distance between them, namely ${\rm dist}(F,G)=\Pr_{x\in X}[F(x)\neq G(x)]$ . We say that $F:X\to Y$ is $\varepsilon$ -close to have some Property, if there exists a function $G:X\to Y$ such that $g$ has the Property and ${\rm dist}(F,G)\leq\varepsilon$ .

Given $d$ functions $f_{i}:[n_{i}]\to\mathbb{F}_{2},\,i=1,\dots,d$ , where $n_{1},\dots,n_{d}\in\mathbb{N}$ , their direct sum is the function $f:[\overline{n};d]\to\mathbb{F}_{2}$ given by $f\left(a_{1},\dots,a_{d}\right)=f_{1}(a_{1})\oplus f_{2}(a_{2})\oplus\ldots\oplus f_{d}(a_{d})$ , where $\oplus$ stands for addition is in the field $\mathbb{F}_{2}$ . We denote $f=f_{1}\oplus\cdots\oplus f_{d}$ . We study the testability question: given a function $f:[\overline{n};d]\to\mathbb{F}_{2}$ test if it is a direct sum, namely if it belongs to the set

[TABLE]

Direct sum is a natural construction that is often used in complexity for hardness amplification [Y82, IJK06, IJKW08, STV01, T03]. It is related to the direct product construction: a function $f:[\overline{n};d]\to\mathbb{F}_{2}^{d}$ is the direct product of $f_{1},\ldots,f_{d}$ as above if $f\left(a_{1},\dots,a_{d}\right)=(f_{1}(a_{1}),\ldots,f_{d}(a_{d}))$ for all $(a_{1},\ldots,a_{d})\in[\overline{n};d]$ . The testability of direct products has received attention [GS97, DR06, DG08, IKW12, DS14] as abstraction of certain PCP tests. It was not surprising to find [DDG*+*17] that there is a connection between testing direct products to testing direct sum. However, somewhat unsatisfyingly this connection was confined to testing a certain type of symmetric direct sum. A symmetric direct sum is a function $f:[n]^{d}\to\mathbb{F}_{2}$ that is a direct product with all components equal; namely such that there is a single $g:[n]\to\mathbb{F}_{2}$ such that

[TABLE]

In [DDG*+*17], a 3-query test was presented for testing if a given $f$ is a symmetric direct sum, and the analysis carried out relying on the direct product test. It was left as an open question to devise and analyze a test for the property of being a (not necessarily symmetric) direct sum.

We design and analyze a four-query test which we call the “square in a cube” test, and show that it is a strong absolute local test for being a direct sum. That is, the number of queries is an absolute constant (namely, $4$ ), and the distance from a function to the subspace of direct sums is bounded by some absolute constant (independent of $n$ and $d$ ) times the probability of the failure of the test on this function. We also describe a simpler $(d+1)$ -query test, whose easy analysis we defer to section 3.

In order to define the test, we need to introduce the following notation. Given two strings $a,b\in[\overline{n};d]$ and a set $S\subseteq[d]$ , denote by $a_{S}b$ the string in $[\overline{n};d]$ whose $i$ -th coordinate equals $a_{i}$ if $i\in S$ and $b_{i}$ otherwise.

We prove the following theorem for Test 1.

Theorem 1.1 (Main).

There exists an absolute constant $c>0$ s.t. for all $d\in\mathbb{N}$ and $n_{1},\dots,n_{d}\in\mathbb{N}$ , given $f:[\overline{n};d]\to\mathbb{F}_{2}$ ,

[TABLE]

where $a,b$ are chosen independently and uniformly from the domain of $f$ , and $S,T$ are random subsets of $[d]$ .

Our proof, similarly to [DDG*+*17], relies on a combination of the BLR linearity testing theorem [BLR93] and a direct product test, similar to the one analyzed in [DS14]. These two components were also used in the proof of [DDG*+*17] for the symmetric case, but here we use the components differently. The trick is to find the right combination. We first observe that once we fix $a,b$ , the test is confined to a set of at most $2^{d}$ points in the domain, and can be viewed as performing a BLR (affinity rather than linearity) test on this piece of the domain. From the BLR theorem, we deduce an affine linear function on this piece. The next step is to combine the different affine linear functions, one from each piece, into one global direct sum, and this is done by reducing to direct product.

Testing if a tensor has rank $1$ .

An equivalent way to formulate our question is as a test for whether a $d$ -dimensional tensor with $\pm 1$ entries has rank $1$ . Indeed moving to multiplicative notation and writing $h_{i}=(-1)^{f_{i}}$ and $h=(-1)^{f}$ , we are asking whether there are $h_{1},\ldots,h_{d}$ such that

[TABLE]

Denoting

[TABLE]

we have

Corollary 1.2.

There exists an absolute constant $c>0$ s.t. for all $d\in\mathbb{N}$ and $n_{1},\dots,n_{d}\in\mathbb{N}$ , for every $h:[\overline{n};d]\to\{-1,1\}$ ,

[TABLE]

Structure of the Paper.

In Sections 2 and 3 we present two different approaches for testing whether a $d$ -dimensional binary tensor is a tensor product. In Section 5 we discuss possible directions for future research. In Section 4, we explain how to derive the specific direct product test that we need from the agreement testing theorem of [DD19]. This is used in the course of the proof in Section 2. The numbering is section-wise. Finally, in Section 5 we discuss possible directions for future research.

2 Square in a Cube Test

In this section we present the Square in a Cube Test. Then we introduce the required background: the BLR test for a function being Affine in Subsection 2.1, the direct product test in Subsection 2.2. Finally, in Subsection 2.3 we prove the main result on the test.

We start by introducing some notation.

Given two vectors $a=(a_{1},\dots,a_{d}),\,b=(b_{1},\dots,b_{d})\in[\overline{n};d]$ , define

•

$\Delta(a,b)=\{i:a_{i}\neq b_{i}\}\subseteq[d]$ ;

•

the induced subcube $C_{a,b}$ is the binary cube $\mathbb{F}_{2}^{\Delta(a,b)}$ ;

•

the projection map $\rho_{a,b}:C_{a,b}\to[\overline{n};d]$ defined for $x\in C_{a,b}$ as

[TABLE]

The following test is the same as Test 1 in Introduction.

Theorem 2.1.

Suppose a function $f:[\overline{n};d]^{d}\to\mathbb{F}_{2}$ passes Test 2 with probability $1-\varepsilon$ for some $\varepsilon>0$ , then $f$ is $O(\varepsilon)$ -close to a tensor product.

2.1 The BLR affinity test

The Blum-Luby-Rubinfeld linearity test was introduced in [BLR93], where its remarkable properties were proven. Later a simpler proof via Fourier analysis was presented, e.g. see [BCH*+*95]. Below we give a variation of this test for affine functions, see [O’D14, Chapter 1].

Definition 2.2.

A function $g:\mathbb{F}_{2}^{d}\to\mathbb{F}_{2}$ is called affine, if there exists a set $S\subseteq[d]$ and a constant $c\in\mathbb{F}_{2}$ such that for every vector $x\in\mathbb{F}_{2}^{d}$

[TABLE]

Note that (see [O’D14, Exercise 1.26]) a function $g$ is affine iff for any two vectors $x,y\in\mathbb{F}_{2}^{d}$ it satisfies

[TABLE]

The BLR test implies that if a function $g:\mathbb{F}_{2}^{d}\to\mathbb{F}_{2}$ satisfies (1) with high probability, then it is close to an affine function.

Theorem 2.3 ([BLR93]).

Suppose $g:\mathbb{F}_{2}^{d}\to\mathbb{F}_{2}$ passes the affinity test with probability $1-\varepsilon$ for some $\varepsilon>0$ . Then $g$ is $\varepsilon$ -close to being affine.

2.2 Generalized Direct Product Test

Definition 2.4.

For $k,M,N_{1},\ldots,N_{k}\in\mathbb{N}$ , and $k$ functions $g_{1},\dots,g_{k}:[N_{i}]\to[M]$ , their direct product is the function $g:\prod_{i}[N_{i}]\to[M]^{k}$ denoted $g=g_{1}\times\dots\times g_{k}$ and defined as $g\left((x_{1},\dots,x_{k})\right)=(g_{1}(x_{1}),\dots,g_{k}(x_{k}))$ . A function $g:\prod_{i}[N_{i}]\to[M]^{k}$ , is called a direct product if there exist $k$ functions $g_{1},\dots,g_{k}:[N_{i}]\to[M]$ such that $g=g_{1}\times\dots\times g_{k}$ for all $(x_{1},\dots,x_{k})\in\prod_{i}[N_{i}]$ .

Dinur and Steurer [DS14] presented a $2$ -query test, very similar to Test 4 below, that, with constant probability, distinguishes between direct products and functions that are far from direct product.

The proof in [DS14] works for the special case of $N_{1}=\cdots=N_{k}$ and can easily be modified to work for the more general situation. Nevertheless, for completeness, we will rely on a newer and more general agreement theorem of [DD19] that directly implies what we need.

Theorem 2.5 (Generalized direct product testing theorem).

Let $k,M,N_{1},\ldots,N_{k}\in\mathbb{N}$ be positive integers, and let $\varepsilon>0$ . Let $g:\prod_{i}[N_{i}]\to[M]^{k}$ be a function that passes Test 4 with parameter $\alpha=0.75$ with probability at least $1-\varepsilon$ . Then there exist functions $h_{i}:[N_{i}]\to[M]$ such that

[TABLE]

We will show in Section 4 how to derive the above theorem from the agreement theorem of [DD19].

2.3 Proof of Theorem 2.1

For a positive integer $D$ , we denote by $\mu_{\nicefrac{{2}}{{3}}}(\mathbb{F}_{2}^{D})$ the distribution on $\mathbb{F}_{2}^{D}$ , where each coordinate, independently, is equal to [math] with probability $1/3$ and to $1$ with probability $2/3$ .

We use the following proposition in the course of the proof.

Proposition 2.6.

Let $S\subseteq[D]$ be a set and $\chi_{S}:\mathbb{F}_{2}^{D}\to\mathbb{F}_{2}$ be the corresponding linear function, i.e., $\chi_{S}(x)=\bigoplus_{i\in S}x_{i}$ . Suppose

[TABLE]

then $S=\emptyset$ .

Proof.

Consider $(-1)^{\chi_{S}}$ . Then

[TABLE]

Also the following holds

[TABLE]

and the statement follows. ∎

Proof.

(of Theorem 2.1.) Assume Test 2 rejects a function $f:[\overline{n};d]\to\mathbb{F}_{2}$ with probability less than $\varepsilon$ , i.e.,

[TABLE]

where all distributions are uniform, and $f_{a,b}$ is a shorthand for $f\circ\rho_{a,b}$ . Then there exists $a\in[\overline{n};d]$ such that

[TABLE]

Note that the operations re-indexing the domain $[\overline{n};d]$ 111By this we mean selecting permutations $\pi_{i}$ on $[n_{i}]$ for $i=1,\dots,d$ , and setting $f^{\pi_{1},\dots,\pi_{d}}\left(x_{1},\dots,x_{d}\right)=f\left(\pi_{1}(x_{1}),\dots,\pi_{d}(x_{d})\right)$ , as well as flipping a function, i.e., adding the constant one function to it element-wise, preserve the distance between functions. Hence, w.l.o.g. we can assume for convenience that $a=(0,\dots,0)$ and that $f(a)=0$ .

We write $C_{b}$ for $C_{a,b}$ and $f_{b}$ for $f_{a,b}$ . Then for every $b\in[\overline{n};d]$ ,

[TABLE]

The BLR theorem (Theorem 2.3) implies that for each $b\in[\overline{n};d]$ there exists a subset $S(b)\subseteq\Delta(a,b)$ , such that

[TABLE]

*Remark 2.7**.*

By the BLR theorem, there should be the “greater or equal to” sign instead of the equality. We assume equality for convenience.

Let $F:[\overline{n};d]\to\mathbb{F}_{2}^{d}$ be a function defined as follows. For each $b\in[\overline{n};d]$ , the set $S(b)\subseteq\Delta(a,b)$ can be viewed as a subset of $[d]$ , since $\Delta(a,b)\subseteq[d]$ . Then $F(b)$ is defined as the element of $\mathbb{F}_{2}^{d}$ corresponding to the set $S(b)$ .

We now show that $F$ passes Test 4 with high probability and hence is close to a direct product.

Let $b\in[\overline{n};d]$ be chosen uniformly at random, and let $b^{\prime}\in[\overline{n};d]$ be chosen with respect to the following distribution $D(b)$ . For each $i\in[d]$ ,

[TABLE]

Note that the distribution on pairs $(b,b^{\prime})$ , where $b$ is chosen uniformly from $[\overline{n};d]$ and $b^{\prime}$ w.r.t. $D(b)$ , is equivalent to the following: for each $i\in[d]$ ,

[TABLE]

In particular, it is symmetric in the sense that choosing $b^{\prime}\sim[\overline{n};d]$ uniformly at random first, and then $b\sim D(b^{\prime})$ , leads to the same distribution on pairs $(b,b^{\prime})$ as the one described above.

For such a pair $(b,b^{\prime})$ define distribution $\mathcal{D}_{b,b^{\prime}}$ on $[\overline{n};d]$ as follows. For a vector $x\sim\mathcal{D}_{b,b^{\prime}}$ ,

[TABLE]

Note that the distribution $\mathcal{D}_{b,b^{\prime}}$ is supported on a binary cube of dimension $d-|\Delta(b,b^{\prime})|$ inside $[\overline{n};d]$ . Denote

[TABLE]

We claim that the following holds

[TABLE]

To see (3) note that since $b$ is chosen uniformly, $b^{\prime}$ is chosen w.r.t. $D(b)$ , and $x\sim\mathcal{D}_{b,b^{\prime}}$ , the resulting distribution for $x$ is

[TABLE]

which is exactly the uniform distribution on $C_{b}$ .

We now show that

[TABLE]

First note that it follows from the definitions that

[TABLE]

And by the symmetry of the distribution on pairs $(b,b^{\prime})$ ,

[TABLE]

Combined together, the previous two equations imply that

[TABLE]

and by the Markov inequality, Inequality 4 follows. By the definition of $\varepsilon_{b,b^{\prime}}$ ,

[TABLE]

which is equivalent to

[TABLE]

Proposition 2.6 implies that if $1-\left(\varepsilon_{b,b^{\prime}}+\varepsilon_{b^{\prime},b}\right)>\frac{2}{3}$ , then

[TABLE]

By Theorem 2.5, the function $F:[\overline{n};d]\to\mathbb{F}_{2}^{d}$ is close to a direct product, i.e., there exist $d$ functions $F_{1},\dots,F_{d}:[n]\to\mathbb{F}_{2}$ such that

[TABLE]

Therefore,

[TABLE]

∎

3 The Shapka Test

In this section we present a different test for whether a tensor is a tensor product. It queries the tensor at $(d+2)$ places at most, but the proof is simpler than for the previous test.

In [KL14], Kaufman and Lubotzky showed an interesting connection between the theory of high-dimensional expanders and property testing. Namely, they showed that $\mathbb{F}_{2}$ -coboundary expansion of a $2$ -dimensional complete simplicial complex implies testability of whether a symmetric $\mathbb{F}_{2}$ -matrix is a tensor square of a vector. The following test is inspired by their work and in a way generalizes it. However, since the description below does not employ neither terminology nor machinery of high-dimensional expanders, we refer to [KL14] for the connection between this theory and property testing.

Given two strings $a,b\in[\overline{n};d]$ , for $i\in[d]$ denote by $a_{b}^{i}\in[\overline{n};d]$ the vector which coincides with $a$ in every coordinate except for the $i$ -th one, where it coincides with $b$ , i.e.,

[TABLE]

For a string $a\in[\overline{n};d]$ , and a number $x\in[n_{i}]$ , we write $a_{x}^{i}$ for the string which is equal to $a$ in every coordinate except for the $i$ -th one, where it is equal to $x$ , i.e.,

[TABLE]

*Remark 3.1**.*

Shapka is the Russian word for a winter hat (derived from Old French chape for a cap). The name the Shapka test comes from the fact that the set $Q_{a,b}$ consists of the two top layers of the induced binary cube $C_{a,b}$ (and also the bottom layer if $d$ is even).

Theorem 3.2.

Suppose a function $f:[\overline{n};d]\to\mathbb{F}_{2}$ passes Test 8 with probability $1-\varepsilon$ for some $\varepsilon>0$ , then $f$ is $\varepsilon$ -close to a tensor product.

Proof.

Let $\delta$ be the relative Hamming distance from $f$ to the subspace of direct sums, i.e., for every direct sum $g:[\overline{n};d]\to\mathbb{F}_{2}$ it holds that

[TABLE]

For a vector $a\in[\overline{n};d]$ , let us define the local view of $f$ from $a$ , that is $d$ functions $f_{1}^{a},\dots,f_{d}^{a}$ , where $f_{i}^{d}:[n_{i}]\to\mathbb{F}_{2},\,i=1,\dots,d$ , that are defined as follows. For $1\leq i\leq d-1$ , and $x\in[n_{i}]$ ,

[TABLE]

For $i=d$ , the definition of $f^{a}_{d}:[n_{d}]\to\mathbb{F}_{2}$ depends on the parity of $d$ and goes as follows

[TABLE]

Given a collection of $d$ functions, $g_{i}:[n_{i}]\to\mathbb{F}_{2},\,i=1,\dots,d$ , recall that their direct sum is the function $g_{1}\oplus\dots\oplus g_{d}$ such that for a vector $x\in[\overline{n};d]$ the following holds

[TABLE]

The following holds for any $[\overline{n};d]$ ,

[TABLE]

As $f^{a}_{1}\oplus\dots\oplus f^{a}_{d}$ is a direct sum, it is at least $\delta$ -far from $f$ , and hence for any $a\in[\overline{n};d]$ ,

[TABLE]

Assume now that $f$ fails Test 8 with probability $\varepsilon$ , i.e.,

[TABLE]

Combining this equality with (5) and (6), we get the following

[TABLE]

which completes the proof. ∎

4 Generalized direct product test

In this section we prove Theorem 2.5, restated directly below, by relying on known agreement test results.

**Theorem 2.5 (restated) **Let $k,M,N_{1},\ldots,N_{k}\in\mathbb{N}$ be positive integers, and let $\varepsilon>0$ . Let $g:\prod_{i}[N_{i}]\to[M]^{k}$ be a function that passes Test 4 with parameter $\alpha=0.75$ with probability at least $1-\varepsilon$ . Then there exist functions $h_{i}:[N_{i}]\to[M]$ such that

[TABLE]

This theorem was proven “in spirit” in [DS14] although formally that proof is written only for the case of $N_{1}=N_{2}=\cdots=N_{k}$ . Instead of reworking the details we will rely on a newer work that generalizes the [DS14] paper to a broader context of agreement testing.

First, let us move from the distribution of Test 4 to a related distribution. It turns out that if $g$ passes one of these two-query tests with good probability then we can draw conclusions regarding its success in related tests.

Claim 4.1.

Suppose $g$ passes Test 4 with $\alpha=0.75$ with probability $1-\varepsilon$ then it passes Test 6 with parameter $k/10<t<k/4$ probability $1-O(\varepsilon)$ .

We prove this claim later in Section 4.1. Theorem 2.5 will follow by invoking a theorem from [DD19] about agreement testing. In agreement testing the input is a collection of local functions each defined on its own small domain. The agreement test checks that whenever the small domains overlap the functions agree with each other. An agreement theorem deduces a single global function (on a domain that contains all the smaller ones) from the given local pairwise agreements. To see who are the small domains in our context let us construct the following set system.

•

Vertices: Let $V_{1},\ldots,V_{k}$ be $k$ disjoint sets of vertices, $|V_{i}|=N_{i}$ and we identify $V_{i}$ with $[N_{i}]$ .

•

Subsets: We have a subset for every choice of one element from each $V_{i}$ ,

[TABLE]

There is a straightforward bijection between ${\cal S}$ and the domain of $g$ , namely $\prod_{i}[N_{i}]$ .

•

Local functions: For a set $S=\left\{v_{1},\ldots,v_{k}\right\}\in{\cal S}$ we have a local function $f_{S}:S\to[M]$ defined by

[TABLE]

where $\bar{v}_{i}\in[N_{i}]$ is associated with $v_{i}$ in the identification of $V_{i}$ and $[N_{i}]$ .

A direct product function $g:\prod_{i}[N_{i}]\to[M]^{k}$ can thus be represented as a collection $\left\{f_{S}\right\}$ of local functions. The direct product test, Test 6, can be rephrased as Test 7 below. Given $g:\prod_{i}[N_{i}]\to[M]^{k}$ we view it as a family of local functions $\left\{f_{S}\right\}$ and would like to invoke the following agreement test theorem,

Theorem 4.2 ([DD19, Theorem 4.4]).

Suppose ${\cal S}$ is a collection of subsets that are top faces of a $\lambda$ -one-sided $k$ -partite $\frac{1}{k^{3}}$ -high dimensional expander. Then given $\left\{f_{S}\right\}$ for which Test 7 succeeds with probability $1-\varepsilon$ , and assuming $t<k/4$ , there exists a function $h:V_{1}\sqcup\cdots\sqcup V_{k}\to[M]$ such that

[TABLE]

We will show in Section 4.2 that we are justified to apply this theorem because our collection of subsets, also known as the “complete multi-partite complex”, is a $\lambda$ -one-sided-HDX for any $\lambda\geq 0$ . Assuming this is the case, we can now take $h_{i}=h|_{V_{i}}$ and get the desired conclusion of Theorem 2.5,

[TABLE]

4.1 Moving between different variants of agreement tests

Claim 4.1 follows immediately from the following lemma, (one needs to apply the first item 3 times to get from $\alpha=0.75$ to $\alpha^{2}$ then $\alpha^{4}$ and then $\alpha^{8}<0.25$ and then item 2 once).

Lemma 4.3.

Let $g:\prod_{i}[N_{i}]\to[M]^{k}$ be a function that passes Test 4 with parameter $\alpha$ with probability at least $1-\varepsilon$ . Then,

•

$g$ * passes Test 4 with parameter $\alpha^{2}$ with probability at least $1-2\varepsilon$ .*

•

There exists a number $t$ , $\alpha k-\sqrt{k}\leq t\leq\alpha k+\sqrt{k}$ , such that $g$ passes Test 6 with parameter $t$ with probability at least $1-O(\varepsilon)$

Proof.

We first prove the first item. Choosing two queries $x,y$ according to the test distribution in Test 4 and then another pair $x,y^{\prime}$ conditioned on the first query being $x$ , we get a pair $y,y^{\prime}$ whose distribution is exactly as if the were chosen from Test 4 with parameter $\alpha^{2}$ . Suppose $A$ was the set of indices in which $y_{i}$ was chosen to equal $X_{i}$ , and suppose $A^{\prime}$ was that set for the pair $x,y^{\prime}$ . Setting $B=A\cap A^{\prime}$ it remains to notice that the event that $g(y)|_{B}\neq g(y^{\prime})|_{B}$ is contained in at least one of the events $g(y)|_{A}\neq g(x)|_{A}$ or $g(y^{\prime})|_{A^{\prime}}\neq g(x)|_{A^{\prime}}$ , so its probability is at most $2\varepsilon$ .

For the second item, observe that with probability $p>0.1$ the size of the set $A$ defined by the test is some $t$ such that $\alpha k-\sqrt{k}\leq t\leq\alpha k+\sqrt{k}$ (this follows from Hoefding’s tail bound). There must be some $t$ in this range for which the failure probability of the test is at most $2\varepsilon/p$ . Otherwise, even if the test succeeds with probability $1$ when $t$ is outside this range, we would still not be able to reach a sucess probability of $1-\varepsilon$ since

[TABLE]

∎

4.2 The complete multi-partite complex

The collection of subsets defined in the beginning of this section gives rise to the so-called complete multi-partite simplicial complex, by downwards closing that set system.

We wish to show that it satisfies the requirements of Theorem 4.2. For this we briefly recall the relevant definitions. For a more comprehensive introduction to this topic we refer the reader to [DD19] and the references therein.

•

Simplicial Complex: A simplicial complex is a hypergraph that is closed downward with respect to containment. It is $(d-1)$ -dimensional if the largest hyperedge has size $d$ . We refer to $X(\ell)$ as the hyperedges (also called faces) of size $\ell+1$ . $X(0)$ are the vertices. It is $d$ -partite if the vertices are partitioned into $d$ parts, and each hyperedge in $X(d-1)$ has one vertex from each part.

•

Link: Given a $i$ -face $\sigma$ , the link of $\sigma$ is the collection of faces that are disjoint from $\sigma$ and whose union belongs to $X$ ,

[TABLE]

This is a simplicial complex whose dimension is $dim(X)-|\sigma|-1$ .

•

Distribution: Given any probability distribution on the top faces $X(d-1)$ , it propagates to a distribution on the edges by selecting a top face and then a pair of vertices in it uniformly. This gives a weighted graph that is called the $1$ -skeleton of the complex.

•

HDX: A $(d-1)$ -dimensional simplicial complex is a $\lambda$ -one-sided HDX if for every face $\sigma\in X(t)$ , $t\leq d-3$ , the $1$ -skeleton of the link $X_{\sigma}$ is a $\lambda$ -one-sided expander graph, meaning that the random walk Markov chain on this weighted graph has all non-trivial normalized eigenvalues at most $\lambda$ .

The complete $d$ -partite complex has parameters $n_{1},\ldots,n_{d}$ and has a vertex set $V_{i}$ of size $n_{i}$ . It is defined by the following distribution over $d$ -hyperedges: For each $i$ choose $x_{i}\in V_{i}$ uniformly. This gives a probability distribution on faces $\left\{x_{1},\ldots,x_{d}\right\}$ in $X(d-1)$ . The $1$ -skeleton of this complex is a graph whose vertices are $V_{1}\sqcup\cdots\sqcup V_{d}$ and whose weighted edges are obtained by selecting a random hyperedge in $X(d)$ and then a random pair of vertices inside it. The link of a face in this complex is itself a complete partite complex, with fewer parts. To show that this complex is a $\lambda$ -one-sided HDX it remains to prove the following lemma,

Lemma 4.4.

Let $G$ be the $1$ -skeleton of a complete $d$ -partite complex with parameters $n_{1},\ldots,n_{d}$ . Then the normalized adjacency matrix of $G$ has one eigenvalue of $1$ , eigenvalue of [math] with multiplicity $\sum_{i}n_{i}-d$ , and the remaining $(d-1)$ eigenvalues have value $-1/(d-1)$ .

In particular, except for one eigenvalue of $1$ , all of $G$ ’s remaining eigenvalues are non-positive.

Proof.

Let, as before, $V_{i}$ denote the part of vertices of size $n_{i}$ . The distribution on edges induced by the uniform distribution on the maximal faces is as follows. For an edge $(v_{i},v_{j})$ , where $v_{i}\in V_{i},\,v_{j}\in V_{j}$ and $i\neq j$ , its probability is equal to

[TABLE]

Hence the transition probability of moving from the vertex $v_{i}$ to the vertex $v_{j}$ is equal to

[TABLE]

The transition matrix is of the following form

[TABLE]

where $J_{n_{i}\times n_{j}}$ stands for the all-one matrix of size $n_{i}\times n_{j}$ . In order to show that $A$ has a single positive eigenvalue, we use the approach developed in [EH80]. First, note that the multiplicity of [math] is $n-d$ , where $n=\sum_{i=1}^{d}n_{i}$ , because the matrix $A$ is of rank $n-d$ . Next, note that if $f$ is an eigenfunction with eigenvalue $\lambda\neq 0$ , then

it is constant on $V_{i}$ for each $i=1,\dots,d$ ; 2. 2.

and

[TABLE]

where $\alpha_{i}$ is the value of $f$ on $V_{i}$ .

For $v\in V_{i}$ ,

[TABLE]

The expression on r.h.s. is the same for every $v\in V_{i}$ , and $\lambda\neq 0$ , which completes the proof of (1). To show (2), it is enough to substitute $f(u)=\alpha_{j}$ for $u\in V_{j}$ in the equality above.

It follows from the above that the non-zero eigenvalues of $A$ are exactly the eigenvalues of the matrix

[TABLE]

which has eigenvalue $1$ with multiplicity $1$ , and $-\frac{1}{d-1}$ with multiplicity $(d-1)$ . ∎

5 Further Directions

Below we present possible directions for future research.

Can the original function $f:[\overline{n};d]\to\mathbb{F}_{2}$ be reconstructed by a voting scheme using the Shapka Test 8? 2. 2.

It is plausible that the Square in the Cube test 2 can be analyzed by the Fourier transform approach similarly to the analysis of the BLR test. 3. 3.

Another test in the spirit of the paper is the following.

We conjecture that this test is also good, i.e., if a function passes the test with high probability then it is close to a tensor product.

Acknowledgements

The authors would like to thank Oded Goldreich for pointing out a gap in the proof in a previous version of this manuscript.

The first author is supported by ERC-CoG grant number 772839. A substantial part of the work was done while the second author held a joint postdoctoral position at The Weizmann Institute and Bar-Ilan University funded by the ERC grant number 336283. Currently, the second author is supported by the SNF grant number 200020_169106. The second author would also like to thank the Swiss Mathematical Society for travel funding related to this paper.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[BCH + 95] Mihir Bellare, Don Coppersmith, Johan Håstad, Marcos A. Kiwi, and Madhu Sudan. Linearity testing in characteristic two. In 36th Annual Symposium on Foundations of Computer Science, Milwaukee, Wisconsin, USA, 23-25 October 1995 , pages 432–441, 1995.
2[BLR 93] Manuel Blum, Michael Luby, and Ronitt Rubinfeld. Self-testing/correcting with applications to numerical problems. Journal of computer and system sciences , 47(3):549–595, 1993.
3[DDG + 17] Roee David, Irit Dinur, Elazar Goldenberg, Guy Kindler, and Igor Shinkar. Direct sum testing. SIAM J. Comput. , 46(4):1336–1369, 2017.
4[DG 08] Irit Dinur and Elazar Goldenberg. Locally testing direct products in the low error range. In Proc. 49th IEEE Symp. on Foundations of Computer Science , 2008.
5[DR 06] Irit Dinur and Omer Reingold. Assignment testers: Towards combinatorial proofs of the PCP theorem. SIAM Journal on Computing , 36(4):975–1024, 2006. Special issue on Randomness and Computation.
6[DD 19] Yotam Dikstein and Irit Dinur. Agreement testing theorems on layered set systems. In 60th Annual IEEE Symposium on Foundations of Computer Science (FOCS) , 2019.
7[DS 14] Irit Dinur and David Steurer. Direct product testing. In 2014 IEEE 29th Conference on Computational Complexity (CCC) , pages 188–196, 2014.
8[EH 80] Friedrich Esser and Frank Harary. On the spectrum of a complete multipartite graph. In European Journal of Combinatorics , 1(3), 211–218, 1980.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Direct Sum Testing:

Abstract

1 Introduction

Theorem 1.1** (Main).**

Testing if a tensor has rank 111.

Corollary 1.2**.**

Structure of the Paper.

2 Square in a Cube Test

Theorem 2.1**.**

2.1 The BLR affinity test

Definition 2.2**.**

Theorem 2.3** ([BLR93]).**

2.2 Generalized Direct Product Test

Definition 2.4**.**

Theorem 2.5** (Generalized direct product testing theorem).**

2.3 Proof of Theorem 2.1

Proposition 2.6**.**

Proof.

Proof.

Remark 2.7*.*

3 The Shapka Test

Remark 3.1*.*

Theorem 3.2**.**

Proof.

4 Generalized direct product test

Claim 4.1**.**

Theorem 4.2** ([DD19, Theorem 4.4]).**

4.1 Moving between different variants of agreement tests

Lemma 4.3**.**

Proof.

4.2 The complete multi-partite complex

Lemma 4.4**.**

Proof.

5 Further Directions

Acknowledgements

Theorem 1.1 (Main).

Testing if a tensor has rank $1$ .

Corollary 1.2.

Theorem 2.1.

Definition 2.2.

Theorem 2.3 ([BLR93]).

Definition 2.4.

Theorem 2.5 (Generalized direct product testing theorem).

Proposition 2.6.

*Remark 2.7**.*

*Remark 3.1**.*

Theorem 3.2.

Claim 4.1.

Theorem 4.2 ([DD19, Theorem 4.4]).

Lemma 4.3.

Lemma 4.4.