Subspace arrangements, graph rigidity and derandomization through   submodular optimization

Orit E. Raz; Avi Wigderson

arXiv:1901.09423·cs.CC·January 29, 2019

Subspace arrangements, graph rigidity and derandomization through submodular optimization

Orit E. Raz, Avi Wigderson

PDF

Open Access

TL;DR

This paper introduces a deterministic polynomial-time algorithm for symbolic matrix rank computation, linking matroid theory, graph rigidity, and derandomization, with potential applications in polynomial identity testing and higher-dimensional rigidity problems.

Contribution

It provides a novel deterministic algorithm for a class of symbolic matrix rank problems, bridging matroid flats, graph rigidity, and submodular optimization, advancing derandomization techniques.

Findings

01

Deterministic polynomial-time algorithm for symbolic matrix rank.

02

Connection between graph rigidity and symbolic rank problems.

03

Potential for improved understanding of higher-dimensional graph rigidity.

Abstract

This paper presents a deterministic, strongly polynomial time algorithm for computing the matrix rank for a class of symbolic matrices (whose entries are polynomials over a field). This class was introduced, in a different language, by Lov\'asz [Lov] in his study of flats in matroids, and proved a duality theorem putting this problem in $N P \cap co N P$ . As such, our result is another demonstration where ``good characterization'' in the sense of Edmonds leads to an efficient algorithm. In a different paper Lov\'asz [Lov79] proved that all such symbolic rank problems have efficient probabilistic algorithms, namely are in $B P P$ . As such, our algorithm may be interpreted as a derandomization result, in the long sequence special cases of the PIT (Polynomial Identity Testing) problem. Finally, Lov\'asz and Yemini [LoYe] showed how the same problem generalizes the graph rigidity problem in two…

Equations280

rank M_{G, 2} = dim span {h (x) \cap f_{u, v} ∣ {u, v} \in E} .

rank M_{G, 2} = dim span {h (x) \cap f_{u, v} ∣ {u, v} \in E} .

(y_{1}, \dots, y_{n}, - x_{1}, \dots, - x_{n}, 0, \dots, 0)

(y_{1}, \dots, y_{n}, - x_{1}, \dots, - x_{n}, 0, \dots, 0)

(z_{1}, \dots, z_{n}, 0, \dots, 0, - x_{1}, \dots, - x_{n}) .

(z_{1}, \dots, z_{n}, 0, \dots, 0, - x_{1}, \dots, - x_{n}) .

rank M_{G, 3} = dim span {\tilde{h} (x) \cap f_{u, v} ∣ {u, v} \in E} .

rank M_{G, 3} = dim span {\tilde{h} (x) \cap f_{u, v} ∣ {u, v} \in E} .

rank (span X) = G \subseteq F min {rank (span ⋃ G) + ∣ F ∖ G ∣}

rank (span X) = G \subseteq F min {rank (span ⋃ G) + ∣ F ∖ G ∣}

q_{e}

q_{e}

q_{e} \cdot r_{v} = 0

q_{e} \cdot r_{v} = 0

q_{e} \cdot (m (u) - m (v)) = 0, for every e = {u, v} \in E .

q_{e} \cdot (m (u) - m (v)) = 0, for every e = {u, v} \in E .

rank (E) = Π = {F_{0}, \dots, F_{k}} min {∣ F_{0} ∣ + i = 1 \sum k ((2 d + 1) (V (F_{i}) - (2 d + 1) - R (F_{i}))},

rank (E) = Π = {F_{0}, \dots, F_{k}} min {∣ F_{0} ∣ + i = 1 \sum k ((2 d + 1) (V (F_{i}) - (2 d + 1) - R (F_{i}))},

d (F) := d (span F) .

d (F) := d (span F) .

Π \cap G := {P \cap G ∣ P \in Π, P \cap G \neq = \emptyset} .

Π \cap G := {P \cap G ∣ P \in Π, P \cap G \neq = \emptyset} .

ρ_{c} (F, Π) := P \in Π \sum (d (P) - c) .

ρ_{c} (F, Π) := P \in Π \sum (d (P) - c) .

ρ_{c} (F) := Π min ρ_{c} (F, Π),

ρ_{c} (F) := Π min ρ_{c} (F, Π),

P \in Π^{'} \sum (d (P) - c) \geq d (Q) - c .

P \in Π^{'} \sum (d (P) - c) \geq d (Q) - c .

Π^{'} = (P_{1}^{'}, \dots, P_{t}^{'}),

Π^{'} = (P_{1}^{'}, \dots, P_{t}^{'}),

V_{i}^{'} := span (j = 1 ⋃ i P_{j}^{'})

V_{i}^{'} := span (j = 1 ⋃ i P_{j}^{'})

d (Q) = i = 1 \sum t r_{i}^{'}

d (Q) = i = 1 \sum t r_{i}^{'}

s_{i}^{'} = d ((span P_{i}^{'}) \cap V_{i - 1}^{'}) .

s_{i}^{'} = d ((span P_{i}^{'}) \cap V_{i - 1}^{'}) .

i = 1 \sum t (r_{i}^{'} + s_{i}^{'}) - t c \geq i = 1 \sum t r_{i}^{'} - c

i = 1 \sum t (r_{i}^{'} + s_{i}^{'}) - t c \geq i = 1 \sum t r_{i}^{'} - c

i = 1 \sum t s_{i}^{'} \geq c (t - 1) .

i = 1 \sum t s_{i}^{'} \geq c (t - 1) .

V_{i} := span (j = 1 ⋃ i P_{j})

V_{i} := span (j = 1 ⋃ i P_{j})

d (i = 1 ⋃ t P_{i}) = i = 1 \sum t r_{i}

d (i = 1 ⋃ t P_{i}) = i = 1 \sum t r_{i}

s_{i} = d ((span P_{i}) \cap V_{i - 1}) .

s_{i} = d ((span P_{i}) \cap V_{i - 1}) .

i = 1 \sum t (d (P_{i}) - c) \geq d (i = 1 ⋃ t P_{i}) - c .

i = 1 \sum t (d (P_{i}) - c) \geq d (i = 1 ⋃ t P_{i}) - c .

i = 1 \sum t (r_{i} + s_{i}) - t c \geq i = 1 \sum t r_{i} - c

i = 1 \sum t (r_{i} + s_{i}) - t c \geq i = 1 \sum t r_{i} - c

i = 1 \sum t s_{i} \geq c (t - 1) .

i = 1 \sum t s_{i} \geq c (t - 1) .

d ((span P_{i}^{'}) \cap V_{i - 1}^{'}) \leq d ((span P_{i}) \cap V_{i - 1}) .

d ((span P_{i}^{'}) \cap V_{i - 1}^{'}) \leq d ((span P_{i}) \cap V_{i - 1}) .

ρ_{c} (G) = ρ_{c} (G ∖ {f_{1}}) and Π^{*} (G ∖ {f_{1}}) = Π^{*} (G) \cap (G ∖ {f_{1}})

ρ_{c} (G) = ρ_{c} (G ∖ {f_{1}}) and Π^{*} (G ∖ {f_{1}}) = Π^{*} (G) \cap (G ∖ {f_{1}})

Π^{*} (P^{'}) = Π^{*} (G) \cap P^{'} .

Π^{*} (P^{'}) = Π^{*} (G) \cap P^{'} .

\hat{F} := {f_{P} ∣ P \in Π^{*} (F)} .

\hat{F} := {f_{P} ∣ P \in Π^{*} (F)} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Theory Research · Complexity and Algorithms in Graphs · Interconnection Networks and Systems

Full text

Subspace arrangements, graph rigidity and derandomization through submodular optimization††thanks:

The first author was partially supported from NSF grant DMS-1128155. The second author was partially supported from NSF grant CCF-1412958

Orit E. Raz Department of Mathematics, University of British Columbia, Vancouver, Canada. [email protected]

Avi Wigderson School of Mathematics, Institute for Advanced Study, Princeton NJ 08540, U.S.A. [email protected]

Abstract

This paper presents a deterministic, strongly polynomial time algorithm for computing the matrix rank for a class of symbolic matrices (whose entries are polynomials over a field). This class was introduced, in a different language, by Lovász [19] in his study of flats in matroids, and proved a duality theorem putting this problem in $NP\cap coNP$ . As such, our result is another demonstration where “good characterization” in the sense of Edmonds leads to an efficient algorithm. In a different paper Lovász [16] proved that all such symbolic rank problems have efficient probabilistic algorithms, namely are in $BPP$ . As such, our algorithm may be interpreted as a derandomization result, in the long sequence special cases of the PIT (Polynomial Identity Testing) problem. Finally, Lovász and Yemini [20] showed how the same problem generalizes the graph rigidity problem in two dimensions. As such, our algorithm may be seen as a generalization of the well-known deterministic algorithm for the latter problem.

There are two somewhat unusual technical features in this paper. The first is the translation of Lovász’ flats problem into a symbolic rank one. The second is the use of submodular optimization for derandomization. We hope that the tools developed for both will be useful for related problems, in particular for better understanding of graph rigidity in higher dimensions.

*Dedicated with admiration to László Lovász,

on the occasion of his 70th birthday.*

1 Introduction

In this paper we provide a new deterministic, strongly polynomial time algorithm which can be viewed in two ways. The first is as solving a derandomization problem, providing a deterministic algorithm to a new special case of the PIT (Polynomial Identity Testing) problem. The second is as computing the dimension of the span a collection of subspaces in high dimensional space. Motivating and connecting the two is the problem of testing graph rigidity, to which an efficient deterministic algorithm is known only in the plane, and is open for higher dimensions. Accordingly, we will divide the introduction to explain these three problems.

1.1 Polynomial Identity Testing (PIT)

Let ${\mathbb{K}}$ be a field. Let ${\bf x}=(x_{1},\dots x_{d})$ be a $d$ -tuple of independent variables. The PIT problem is to determine, given a multivariate polynomial $p\in{\mathbb{K}}[{\bf x}]$ , if $p\equiv 0$ (as a polynomial). Of course, the description of $p$ as an input to this problem is central to its complexity, and many variants of this problem were considered. The most common formulation is when $p$ is given by an arithmetic formula or circuit111When the input is a circuit, the degree of $p$ is always assumed to be polynomial in the circuit’s size, and in all cases considered in this paper this will be evident..

The original version of this question was posed by Edmonds [5]. In his formulation, $p$ is the determinant of a matrix whose entries are linear forms in ${\bf x}$ (we will refer such a matrix as a symbolic matrix). Lovász [16] proved that this problem is in $BPP$ namely has a fast probabilistic algorithm (for fields ${\mathbb{K}}$ larger than the degree of $p$ ): indeed, the algorithm simply picks random elements from ${\mathbb{K}}$ and evaluates $p$ (note that evaluating $p$ is efficient in all three formulations above, and indeed in all formulations considered). This left open the problem of finding an efficient deterministic algorithm, namely derandomizing Lovász’s algorithm for PIT.

Open Problem 1.1.

Is PIT $\in P$ ?

The importance of this seemingly specific open problem was revealed in an important result of Kabanets and Impagliazzo [13]. They showed that if the answer is positive (as everyone expects), this will imply non-trivial lower bounds on either arithmetic or Boolean circuits, well beyond current techniques.

The progress towards resolving this open problem has been by providing deterministic polynomial time algorithms for a large variety of special cases of it, with the idea of building up techniques. By far, in most of these results the special cases are defined by restricting the input polynomial to lie in some complexity class. In these cases, progress in derandomization followed closely progress on lower bounds for the appropriate class (as is the case in the Boolean setting as well). There are literally dozens of such papers: many are mentioned and explained in the surveys [22, 24] and e.g. the recent paper [1].

In parallel, with motivation from algebra, geometry and other areas, a different collection of special cases of PIT was studied, of a structural nature. Here one works with Edmond’s formulation, and develops an understanding (and often a polynomial time algorithm) for cases where the symbolic matrix has restricted structure. This includes for example the works [3, 4, 6, 9, 11, 21].

This paper contributes to the second line of research, providing new families of symbolic matrices for which PIT can be solved in deterministic polynomial time. To explain this structure we introduce some notation. We will work in a slightly more general setting, in two ways, as the results generalize to both. First, we will allow our symbolic matrices to have polynomial entries. In such cases, these polynomials will have simple formulas describing them. Second, we will be interested in computing the rank of the input symbolic matrix, not just whether its determinant vanishes. While seemingly a more general problem, this turns out to be equivalent to PIT (see e.g. [8, Appendix A]222The proof in [8] is given for non-commutative rank, but the exact same proof works verbatim for our usual notion of rank over ${\mathbb{K}}({\bf x})$ .).

Let $R$ be a family of polynomial maps $R=\{r:{\mathbb{K}}^{d}\rightarrow{\mathbb{K}}^{n}\}$ . In all cases we assume the degree of all polynomials in all maps is at most $n$ , and the number of variables $d$ is at most polynomial in $n$ , so we will think of $n$ as the input size to the problem.

A family of maps $R$ prescribes a family of symbolic matrices, so that each row is an image of the $d$ -vector of variables ${\bf x}$ under some map in $R$ . More formally, define PIT( $R$ ) to be the set of all symbolic matrices $M$ (with $n$ columns, and ${\rm poly}(n)$ rows) in which every row of the matrix is of the form $r({\bf x})$ , for some map $r\in R$ . We will be interested in families $R$ for which the ranks of matrices in PIT( $R$ ) can be computed in polynomial time333We identify the set of matrices and the computational problem of determining their ranks..

We first demonstrate the convenience of this notation. Call $R$ complete, if a deterministic polynomial-time algorithm for PIT( $R$ ) implies a deterministic polynomial-time algorithm for PIT. Very simple maps are complete! It follows from Valiant’s [28] hardness of the determinant for the class444The arithmetic analog of the Boolean class $P$ . VP that

Theorem 1.2 ([28]).

The class $R_{\rm affine}$ of affine linear maps is complete.

Indeed, Valiant’s original proof (see more detail here [15]) implies a stronger theorem. Even restricting the support of each row to have at most a single variable in some coordinate, is general enough to be complete.

Theorem 1.3.

The class $R_{\rm sparse}$ of affine linear maps, such that each map is non-constant in at most a single variable from $\{x_{1},\dots x_{d}\}$ , is complete.

We now turn to define the polynomial maps we will be interested in, and for which we will be able to provide efficient deterministic algorithms. Some motivation for interest in these maps will be given in the next two subsections.

Consider the following class $R_{2}$ . Here $d=n$ . Every $p\in R_{2}$ is of the form ${\bf x}\mapsto(A-A^{T}){\bf x}$ , where $A$ is a rank-1 matrix. While this family may look very special, we note that the problem of graph rigidity in ${\mathbb{R}}^{2}$ (for which a polynomial time algorithm is known but far from trivial) is a very special case of PIT( $R_{2}$ ).555Moreover, the same family of rank-2, skew symmetric matrices is featured in a very different PIT problem: determining the maximum rank of a subspace generated by given such matrices. A deterministic polynomial time solution for this problem is given by Lovasz’ celebrated matroid parity algorithm [17] (see also [18], Theorem 11.1.2).

Theorem 1.4.

PIT( $R_{2}$ ) can be solved in deterministic polynomial time, over a field ${\mathbb{K}}$ with sufficiently large characteristic (more precisely, when ${\rm char}({\mathbb{K}})$ is larger than the number of rows of the input matrix or ${\rm char}({\mathbb{K}})=0$ ).

This construction can be generalized as follows. Here we will generate PIT instances whose entries are polynomials, rather than linear functions of the variables. For a $k$ -dimensional tensor $A$ of size $n$ , denote by $\hat{A}$ its “anti-symmetric” version, namely where for every entry $(i_{1},\dots,i_{k})$ we have $\hat{A}(i_{1},\dots,i_{k})=\sum_{\sigma\in S_{k}}\text{sgn}(\sigma)A(i_{\sigma(1)},\dots,i_{\sigma(k)})$ . Note that for $k=2$ we have $\hat{A}=A-A^{T}$ .

We now extend $R_{2}$ , in which a matrix (namely a 2-dimensional tensor) acts on one vector of variables, to $R_{k}$ , in which a $k$ -dimensional tensor acts on $k-1$ vectors of variables. Let $R_{k}$ denote the following class of (degree $k-1$ ) maps. Let ${\bf x}^{1},{\bf x}^{2},\dots,{\bf x}^{k-1}$ be $n$ -vectors of independent variables, so altogether ${\bf x}=({\bf x}^{1},{\bf x}^{2},\dots,{\bf x}^{k-1})$ is a vector of $(k-1)n$ variables. A $k$ -tensor of size $n$ in each dimension acts on ${\bf x}$ simply with the $i$ ’th dimension acting on ${\bf x}^{i}$ for $i\in[k-1]$ . The output of this action is a vector (along dimension $k$ ) of length $n$ of polynomials of degree $k-1$ , each linear in ${\bf x}^{i}$ for all $i$ . Define $R_{k}$ to be all maps defined by $\hat{A}$ for any rank-1 tensor $A$ . Note that with this notation $R_{2}$ is precisely the class defined above.

Generalizing the above theorem we prove:

Theorem 1.5.

For every $k<n$ , PIT( $R_{k}$ ) can be solved in deterministic polynomial time, over a field ${\mathbb{K}}$ with sufficiently large characteristic (more precisely, when ${\rm char}({\mathbb{K}})$ is larger than the number of rows of the input matrix or ${\rm char}({\mathbb{K}})=0$ ).

1.2 Graph Rigidity

The problem of graph rigidity arises from several motivations, originally, mechanical engineering (see [14]). Rigidity theory is a fast-growing area, and we refer the interested reader to [25] for more background and recent approaches. Graph rigidiy has several versions, we describe perhaps the most common one, generic rigidity. It is supposed to capture the structural rigidity of a “bars and joints” framework described by a graph. We will not be formal here as precise definitions can be found e.g. in [2]. Here the relevant field for the geometric/physical interpretation is the Real numbers ${\mathbb{R}}$ , and we use it in this subsection as in other papers on this problem (although the algebraic formulation is meaningful for every field ${\mathbb{K}}$ ).

Let $G(V,E)$ be an undirected graph on $n$ vertices and $m$ edges. An embedding of $G$ in ${\mathbb{R}}^{t}$ is a map $\phi:V\rightarrow{\mathbb{R}}^{t}$ . An embedding of $G$ is called rigid if there is no perturbation of the vertex positions which preserves all edge lengths, other than the rigid motions of ${\mathbb{R}}^{t}$ . The graph $G$ is called rigid if every generic embedding of $G$ is rigid (equivalently, if there exists an embedding of $G$ which is rigid, see [2]). The main question is to determine if a given graph $G$ is rigid (and more generally, compute the dimension of the non-rigid motions of a generic embedding, in case $G$ is not rigid).

An extremely convenient formulation of the problem (as a PIT) is the following. Let $x_{v,j}$ be a set of variables indexed by $v\in V$ and $j\in[t]$ . The intuition is that $(x_{v,1},\dots,x_{v,t})$ are the coordinates of a generic embedding of the vertex $v$ in ${\mathbb{R}}^{t}$ . Given $G$ , construct a symbolic matrix $M_{G,t}$ of dimensions $m\times nt$ , which may be viewed as a concatenation of $t$ matrices, one for each dimension $j\in[t]$ . Every row corresponds to an edge $\{u,v\}\in E$ , and for each $j$ , the column $u,j$ contains the entry $x_{u,j}-x_{v,j}$ , whereas the column $v,j$ contains the the negation $x_{v,j}-x_{u,j}$ .

It is not hard to prove that the rank (as usual, over ${\mathbb{R}}(x)$ ) of $M_{G,t}$ determines if $G$ is rigid, and indeed the dimension of non-rigid motions (see [2] for the details). It is easy to see that for every graph $G$ , the matrix $M_{G,2}$ is in the class $PIT(R_{2})$ above. Indeed, let $e_{1},\ldots,e_{2n}$ denote the standard basis vectors in ${\mathbb{R}}^{2n}$ . For some $u<v\in[n]$ , put $a=e_{u}-e_{v}$ and $b=e_{n+u}-e_{n+v}$ . Consider the matrix $A=A_{u,v}:=a^{t}b$ . Then $(A-A^{t}){\bf x}$ , where ${\bf x}=(x_{21},\ldots,x_{2n},x_{11},\ldots,x_{1n})$ is the $\{u,v\}$ row of $M_{G,2}$ . Thus Theorem 1.4 yields as a corollary a polynomial time algorithm to determine whether a given graph $G$ is rigid in ${\mathbb{R}}^{2}$ . Such algorithms for rigidity in ${\mathbb{R}}^{2}$ are known (see [10, Section 2.2] and references therein). Note that the matrices $M_{G,t}$ make sense over any field ${\mathbb{K}}$ , instead of ${\mathbb{R}}$ , and Theorem 1.4 in fact provides a deterministic polynomial time algorithm to compute the rank of these matrices over any field ${\mathbb{K}}$ with large enough characteristic.

The symbolic matrix representation above shows that for every $t$ , the problem of testing graph rigidity in ${\mathbb{R}}^{t}$ is in $BPP$ , and it is a decades-old problem to whether it is also in $P$ , even for the case $t=3$ .

Lovász and Yemini [20] have developed an alternative approach for studying graph rigidity in the plane, which obtains a somewhat finer characterization of rigidity than Laman’s. What is even more interesting is their method. They show that the matrices $M_{G,2}$ can actually be obtained in the following way. First, with every edge $\{u,v\}$ associate a certain $2$ -dimensional subspace $f_{u,v}\subset{\mathbb{R}}^{2n}$ . The intersection of this subspace $f_{u,v}$ with a generic hyperplane through the origin (of which the normal can be viewed essentially as the $2n$ -vector of variables $x_{v,j}$ ) yields the $\{u,v\}$ row of $M_{G,2}$ . In more detail, identify the vertices of $G$ with the set $V=[n]$ , and let $e_{1},\ldots,e_{2n}$ denote the standard basis in ${\mathbb{R}}^{2n}$ . Define $f_{u,v}$ to be the subspace of ${\mathbb{R}}^{2n}$ spanned by the pair of vectors $e_{u}-e_{v}$ and $e_{n+u}-e_{n+v}$ (note that the definition of $f_{u,v}$ is symmetric in $u,v$ ). Let $h({\bf x})$ denote the subspace of ${\mathbb{R}}^{2n}$ orthogonal to the vector ${\bf x}=(y_{1},\ldots,y_{n},-x_{1},\ldots,-x_{n})$ . It is not hard to verify (see [20] for the details) that $h({\bf x})\cap f_{u,v}$ is spanned by the $\{u,v\}$ row of $M_{G,2}$ . Thus, for a generic ${\bf x}$ , we have

[TABLE]

Thus, the question of computing the rank of $M_{G,2}$ becomes the question of computing the dimension of the span of the resulting intersections (which here are simply lines) with a generic hyperplane. To analyze this, Lovász and Yemini use a theory developed by Lovász [19] which studies a similar problem for an arbitrary family of subspaces. The relevant part of Lovász’s theory is introduced in the next subsection.

The idea of [20] can be applied also to rigidity in higher dimensions. For simplicity of the presentation, let us consider only the case $t=3$ . In this case we associate with each edge $\{u,v\}\in E$ a 3-dimensional subspace $g_{u,v}$ of ${\mathbb{R}}^{3n}$ . Namely, the subspace spanned by the vectors $e_{u}-e_{v}$ , $e_{n+u}-e_{n+v}$ , $e_{2n+u}-e_{2n+v}$ , where here $e_{1},\ldots,e_{3n}$ stand for the standard basis of ${\mathbb{R}}^{3n}$ . Let ${\bf x}=(x_{1},\ldots,x_{n},y_{1},\ldots,y_{n},z_{1},\ldots,z_{n})$ and define $\tilde{h}({\bf x})$ to be the (codim 2) subspace of ${\mathbb{R}}^{3n}$ orthogonal to the pair of vectors

[TABLE]

It is not hard to verify that $\tilde{h}({\bf x})\cap f_{u,v}$ is one dimensional and spanned by the $\{u,v\}$ row of $M_{G,3}$ . Thus, for a generic choice of ${\bf x}$ , we have

[TABLE]

A crucial difference from the case $t=2$ is that here a generic choice of ${\bf x}$ does not yield a generic codim 2 subspace $\tilde{h}({\bf x})$ of ${\mathbb{R}}^{3n}$ . From the perspective of this method and of our paper, this is “the reason” why rigidity in higher dimensions is more challenging.

1.3 Subspaces and generic hyperplanes

Let $F$ be a collection of subspaces in ${\mathbb{K}}^{d}$ . Let $h$ be a generic hyperplane in ${\mathbb{K}}^{d}$ , which without loss of generality can be taken to be all vectors perpendicular to ${\bf x}=(x_{1},\dots x_{d})$ . For each subspace $f\in F$ , let $f^{\prime}=f\cap h$ . Now consider the space spanned by the subspaces in $F^{\prime}:=\{f^{\prime}\mid f\in F\}$ (note that the flats in $F^{\prime}$ are functions of ${\bf x}$ ). The question is, what is the dimension of ${\rm span}(F^{\prime})$ ?

One of the major results of Lovász’ paper [19] is a formula, called $\rho(F)$ (which we redefine in Section 2), that determines this dimension for every family of subspaces, and for ${\bf x}$ satisfying a certain “general position” condition (see Definition 5.1). To show that a generic ${\bf x}$ satisfies Lovász’s general position condition over any field (with large enough characteristic) is one main result of our paper (see Section 7). Note that this fact is mentioned (over the field ${\mathbb{R}}$ ) in [19] with no proof. This fact is again mentioned666In Tanigawa [26] an alternative general position condition is suggested, to supposedly correct a mistake in Lovász’s paper. However we find the counter example in [26, footnote on p. 1416] false. We provide a full and detailed proof of Lovász’s formula in Section 5. and applied, again with no proof, in Tanigawa [26]. We see our paper as contributing to the completeness of these results.

When the subspaces $F$ are derived from a graph in the manner described above to generate the rigidity matrix, Lovász and Yemini [20] write the explicit special case of the formula $\rho(F)$ , which yields an elegant characterization. For the general case of an arbitrary family of subspaces $F$ , the formula is given as the minimum, over all possible partitions of the family, of a certain easily computable function. As the number of partitions is exponential, there is no obvious efficient way of computing $\rho$ . We have recently learned that the problem of computing $\rho$ is a special case of minimizing, over all partitions of a set $S$ , the Dilworth truncation of a given submodular function $f$ defined over $S$ ; a strongly polynomial algorithm for this problem is given in Frank and Tardos [7, Chapters II.1 and IV.3]. In our paper we introduce an alternative777Our algorithm seems different than the one in [7], as it does not use duality. strongly polynomial algorithm for computing $\rho$ , by reducing the original problem to a minimization problem of a certain submodular function. In fact, we prove our result to a more general quantity $\rho_{c}(F)$ , introduced in Section 2. (Note that $\rho(F)=\rho_{1}(F)$ is the quantity from [19].)

Theorem 1.6.

There is a deterministic, strongly polynomial time algorithm to compute $\rho_{c}$ for every real number $c$ .

Closing this circle, we will also prove that the problem of computing $\rho_{1}$ is equivalent to PIT( $R_{2}$ ). This will yield Theorem 1.4 as a corollary to Theorem 1.6.

1.4 Related works and applications

We see our result as a step towards better understanding of the algorithmic aspects of the notions and formulas introduced in Lovázs [19] and their applications.

Let us mention one related concept studied in Lovász [19] and discuss follow-up work by Tanigawa [26], which is related to Theorem 5.2 proved in this paper. It would be interesting to find efficient algorithms for the natural computational problem at hand. The reader may skip this subsection at first reading.

Let $F$ be a finite family of subspace in ${\mathbb{K}}^{d}$ (where ${\mathbb{K}}$ is a field of characteristic [math]). Let $X=\{x_{f}\mid f\in F\}$ be a collection of points in ${\mathbb{K}}^{d}$ such that $x_{f}\in f$ for each $f\in F$ . The set $X$ is said to be in general position with respect to $F$ if, for every $f\in F$ fixed, the following holds: Any subspace spanned by members of $F$ and points of $X\setminus\{x_{f}\}$ containing $x_{f}$ must contain the whole flat $f$ . Lovász shows that there exists a choice of a set $X$ in general position with respect to any given family $F$ . He then proves the following formula:

Theorem 1.7 (Lovász [19]).

Let $F$ be a finite family of subspace in ${\mathbb{K}}^{d}$ , and let $X=\{x_{f}\mid f\in F\}$ be in general position with respect to $F$ . Then

[TABLE]

An interesting application of Theorem 1.7 to the body-rod-bar rigidity problem is obtained by Tanigawa [26]. A body-rod-bar framework in ${\mathbb{R}}^{d}$ is defined as a structure consisting of $d$ -dimensional subspaces (bodies) and $(d-2)$ -dimensional flats (rods) mutually linked by one-dimensional lines (bars). (The term “rod” is appropriate for $d=3$ .) More formally, a $d$ -dimensional body-rod-bar-framework is a triple $(G,q,r)$ , where $G=(V=B\cup R,E)$ is a graph, $r:R\to{\rm Gr}(d-1,{\mathbb{R}}^{d+1})\subset\mathbb{P}(\bigwedge^{d-1}({\mathbb{R}}^{d+1}))$ is the rod-configuration mapping a vertex $v\in R$ to a $(d-1)$ -dimensional subspace $r_{v}$ of ${\mathbb{R}}^{d+1}$ , and $q:E\to{\rm Gr}(2,{\mathbb{R}}^{d+1})\subset\mathbb{P}(\bigwedge^{2}({\mathbb{R}}^{d+1}))$ is the bar-configuration mapping an edge $e\in E$ to a 2-dimensional subspace $q_{e}$ in ${\mathbb{R}}^{d+1}$ , such that

[TABLE]

equivalently,

[TABLE]

where here the dot product should be interpreted appropriately (see [26] for the details). Assume also that $r(u)\neq r(v)$ for every $u\neq v\in R$ .

An infinitesimal motion of $(G,q,r)$ is a mapping $m:B\cup R\to\bigwedge^{d-1}({\mathbb{R}}^{d+1})$ such that

[TABLE]

An infinitesimal motion $m$ is called trivial if either $m(u)=m(v)$ for all $u,v\in V$ , or if, for some fixed $v_{0}\in V$ we have $m(v_{0})=r_{v_{0}}$ and $m(v)=0$ for every $v\in V\setminus\{v_{0}\}$ . Finally, a framework $(G,q,r)$ is called infinitesimally rigid if every infinitesimal motion is trivial.

The body-rod-bar problem gives rise to a matroid ${\rm BR}(G,q,r)$ defined on the edge set $E$ whose rank is the maximum size of independent linear equations in (1) (for unknown m). From the definition, $(G,q,r)$ is infinitesimally rigid if and only if the rank of ${\rm BR}(G,q,r)$ is $\tbinom{d+1}{2}|V|-(\tbinom{d+1}{2}+|R|)$ .

Theorem 1.8 (Tanigawa [26, Corollary 4.13]).

Let $G=(B\cup R,E)$ and suppose $d\geq 3$ . Then, for almost all bar-configurations $q$ and almost all rod-configurations $r$ we have

[TABLE]

where the minimum is taken over all partitions $\Pi$ of $E$ .

Tanigawa’s proof is a nice combination of Theorem 1.7 with the other result of Lovász mentioned in the introduction, cited below as Theorem 5.2. Briefly, the first (simpler) step in the proof is to reduce the problem to the form of Theorem 1.7. That is, a family of flats $F$ is introduced, and the question becomes to find the rank of a generic set of points $X=\{x_{f}\mid f\in F\}$ . The family $F$ resulted from the reduction can be described as follow: Each edge $e=\{u,v\}$ of $G$ is associated with some fixed subspace $f_{e}$ in $\left(\mathbb{P}(\bigwedge^{2}({\mathbb{R}}^{d+1}))\right)^{|V|}$ . Then $F=\{f_{e}\cap h(u)\cap h(v)\mid e=\{u,v\}\in E\}$ , where $h_{r}(u),h_{r}(v)$ are subspaces depending on the choice of rod configuration $r$ . Since $r$ is taken generically, this imposes some genericity on the subspaces $h_{r}(v)$ , but they are not exactly generic. The proof is then complete by proving a relaxed version of Theorem 5.2, and adding the subspaces $h_{r}(v)$ one after the other.

For more recent applications of [19, 20] see Tanigawa [26, 27].

1.5 Organization of this paper

In Section 2 we introduce the function $\rho_{c}(F)$ , which is the main object of this study. The rest of the paper has two separate parts. The first, in Sections 3 and 4, describes the algorithm to compute $\rho_{c}$ . In Section 3, we present and prove properties of the function $\rho_{c}$ . Using these properties we describe, in Section 4, a deterministic strongly polynomial time algorithm that computes $\rho_{c}$ over every field via submodular optimization. Note that, as there is an alternative algorithm [7] in the literature to efficiently compute functions like $\rho_{c}$ , this part can be skipped.

The second part, in Sections 5, 6, and 7, describes the genericity proof of $\rho$ . In Section 5, we state (and reprove) the result of Lovász [19] above, relating $\rho_{1}$ to the intersection of $F$ with a hyperplane in “general position”. A similar relation is obtained for $\rho_{c}$ , for an integer $c>0$ (see Theorem 5.5). In Section 6, we develop an explicit representation of a basis of the family $F^{\prime}$ resulting from this intersection, which give rise to the symbolic matrices PIT( $R_{2}$ ) (and PIT( $R_{k}$ )). Using this, we prove in Section 7 that most hyperplanes (and more generally, subspaces) satisfy the “general position” definition of Lovász, thus expressing the rank of a these symbolic matrices as appropriate $\rho(F)$ . Using the algorithm above we can now compute these ranks deterministically and efficiently. This last section is the only one in which the size of the field ${\mathbb{K}}$ is important.

2 Subspaces, partitions, and the function $\rho_{c}$

We introduce the main objects of this study: Families of subspaces, their partitions, and the optimization problem we solve in this paper. We consider linear subspaces $f$ of ${\mathbb{K}}^{d}$ . Let $d(f)$ denote the dimension of a subspace $f$ . For a family $F$ of subspaces, we write ${\rm span}F:={\rm span}\bigcup_{f\in F}f$ and

[TABLE]

A partition of $F$ is a set $\Pi=\{P_{1},\ldots,P_{t}\}$ of nonempty, pairwise disjoint subfamilies of $F$ , such that $F=\bigcup_{i=1}^{t}P_{i}$ . For a partition $\Pi$ of $F$ and a family of subspaces $G$ , we define the restriction of $\Pi$ to $G$ by

[TABLE]

If $G\subset F$ , then $\Pi\cap G$ forms a partition of $G$ .

Lovász [19] defined the following key function $\rho$ of a family of subspaces, whose meaning will be revealed in Section 5. We actually generalize his definition to a family of functions $\rho_{c}$ , for every $c>0$ (his $\rho$ is our $\rho_{1}$ for $c=1$ ). Computing $\rho_{c}(F)$ in deterministic polynomial time given $F$ , in Section 4, will be the key to our derandomization results.

Fix a constant $c>0$ . Let $F$ be a finite family of subspaces in ${\mathbb{K}}^{d}$ . For a partition $\Pi$ of $F$ , we define

[TABLE]

where the minimum is taken over all partitions $\Pi$ of $F$ .

Definition 2.1.

We say that $\Pi$ is a minimal partition of $F$ , with respect to the constant $c>0$ , if $\Pi$ attains $\rho_{c}(F)$ and has the smallest possible number of parts.

Remark. In Corollary 3.2 we prove that, fixing $c>0$ , a minimal partition $\Pi$ of a family $F$ with respect to $c$ is unique.

Notation.

We will use small letters $f,g,h$ to denote subspaces in ${\mathbb{K}}^{d}$ , capital letters $F,G,P,Q$ to denote families of subspaces, and $\Pi$ to denote partitions of a certain family $F$ of subspaces. Note that the elements of a partition $\Pi$ are themselves families of subspaces.

3 Properties of minimal partitions

In this and the next section we develop our algorithm in a fully self-contained manner. As mentioned in the introduction, the reader may skip these sections and apply the algorithm of [7] as a black box. In this section, we introduce some properties of minimal partitions, to be used in our algorithm. We find these properties interesting in their own right, but some may be known, indeed in more generality, for submosular functions.

3.1 Main technical lemma

We start with the following main technical lemma of this section.

Lemma 3.1.

Let $F,G$ be families of subspaces in ${\mathbb{K}}^{d}$ with minimal partitions $\Pi_{F},\Pi_{G}$ , respectively. Assume that $Q\in\Pi_{G}$ and $Q\subset F$ . Then $Q$ is contained in one of the parts of $\Pi_{F}$ .

For the proof, the idea is to show that if, when considering a minimal partition for $F$ , it “pays off” to put the elements of $Q$ together, then it still “pays off” (or at least, harmless) to put these elements together, when this time considering a minimal partition for $G$ .

Proof.

Consider the restriction $\Pi^{\prime}:=\Pi_{F}\cap Q$ of $\Pi_{F}$ to $Q$ (as defined in (2)). By assumption, $Q\subset F$ , and thus $\Pi^{\prime}$ forms a partition of $Q$ .

Our assumption that $Q\in\Pi_{G}$ , and recalling that $\Pi_{G}$ forms a minimal partition of $G$ , implies that

[TABLE]

Fixing some arbitrary order on the elements of $\Pi^{\prime}$ , we write

[TABLE]

where $P_{i}^{\prime}:=P_{i}\cap Q$ is non-empty and $P_{1},\ldots,P_{t}\in\Pi_{F}$ are distinct. Set $V_{0}^{\prime}:=\{0\}$ . For each $1\leq i\leq t$ , define

[TABLE]

and put $r_{i}^{\prime}:=d(V_{i}^{\prime})-d(V^{\prime}_{i-1})$ and $s_{i}^{\prime}:=d(P_{i}^{\prime})-r_{i}^{\prime}$ . Note that

[TABLE]

and that

[TABLE]

With this notation, (4) can be rewritten as

[TABLE]

which implies

[TABLE]

Next, we define

[TABLE]

and put $r_{i}:=d(V_{i})-d(V_{i-1})$ and $s_{i}:=d(P_{i})-r_{i}$ . Similar to above, we have

[TABLE]

and

[TABLE]

We claim that

[TABLE]

Indeed, the inequality (8) holds if and only if

[TABLE]

which holds if and only if

[TABLE]

To prove the last inequality, notice that $V_{i}^{\prime}\subset V_{i}$ and ${\rm span}P_{i}^{\prime}\subset{\rm span}P_{i}$ , for every $i$ . Thus

[TABLE]

Hence, by (5) and (7), we get $s_{i}^{\prime}\leq s_{i}$ . This fact combined with the inequality (6) implies (9) and hence also (8). Since $\Pi_{F}$ is assumed to be minimal for $F$ , we conclude that $t=1$ and $Q\subset P_{1}$ . This completes the proof. ∎

3.2 Uniqueness of minimal partitions

We prove uniqueness of minimal partitions.

Corollary 3.2 (Uniqueness).

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ and let $\Pi_{1},\Pi_{2}$ be minimal partitions of $F$ . Then $\Pi_{1}=\Pi_{2}$ .

Proof.

Let $\sim_{1},\sim_{2}$ denote the equivalence relations on $F$ induced by the partitions $\Pi_{1},\Pi_{2}$ , respectively. Let $f,g\in F$ and assume that $f\sim_{1}g$ . That is $f,g\in Q$ , for some $Q\in\Pi_{1}$ . Applying Lemma 3.1 (with $F$ , $G:=F$ , and $Q$ ), we get that $Q$ is contained in one of the parts in $\Pi_{2}$ . Thus $f\sim_{2}g$ . By symmetry, we conclude that $f\sim_{1}g$ if and only if $f\sim_{2}g$ . Thus $\Pi_{1}=\Pi_{2}$ , as claimed. ∎

Definition 3.3.

Fix $c>0$ . Define $\Pi^{*}(F)$ to be the minimal partition of a family of subspaces $F$ (with respect to $c$ ).

3.3 Monotonicity properties

We prove the following “monotonicity” property of minimal partitions.

Corollary 3.4 (Monotonicity).

Let $F,G$ be families of subspaces in ${\mathbb{K}}^{d}$ and assume that $G\subset F$ . Then $\Pi^{*}(G)$ is a refinement of $\Pi^{*}(F)\cap G$ .

Proof.

Apply Lemma 3.1 to the families $F$ and $G$ . ∎

The following is another type of monotonicity property.

Lemma 3.5.

Let $F=\{f_{1},\ldots,f_{n}\}$ be a family of $n$ subspaces in ${\mathbb{K}}^{d}$ . Let $f_{i}\subset f_{i}^{\prime}$ , for every $i=1,\ldots,n$ , and consider $F^{\prime}:=\{f_{1}^{\prime},\ldots,f_{n}^{\prime}\}.$ For a partition $\Pi$ of $F$ , let $\Pi^{\prime}$ denote the partition of $F^{\prime}$ induced by $\Pi$ , replacing each $f_{i}$ by the corresponding $f_{i}^{\prime}$ . Then $(\Pi^{*}(F))^{\prime}$ is a refinement of $\Pi^{*}(F^{\prime})$ .

Proof.

Let $P\in\Pi^{*}(F)$ and assume without loss of generality that $P=\{f_{1},\ldots,f_{m}\}$ , for some $m\leq n$ . It is easy to see, applying Lemma 3.1, that $\Pi^{*}(P)=\{P\}$ .

Put $P^{\prime}:=\{f_{1}^{\prime},\ldots,f_{m}^{\prime}\}$ . We claim that $\Pi^{*}(P^{\prime})=\{P^{\prime}\}$ . First note that it suffices to prove the claim for the special case where $f_{1}\subset f_{1}^{\prime}$ and $f_{i}=f_{i}^{\prime}$ , for $i=2,\ldots,m$ , and then apply the same argument repeatedly to each $i$ . To prove the calim for the special case, consider the family $Q=\{f_{1},f_{1}^{\prime}\}$ . It is easy to see, by definition, that $\Pi^{*}(Q)=\{Q\}$ . By Lemma 3.1, $Q$ is contained in a part of $\Pi^{*}(G)$ , for every family of subspaces $G$ that contains $Q$ . Moreover, since $f_{1}\cup f_{1}^{\prime}\subset f_{1}^{\prime}$ , we have

[TABLE]

for every such $G$ (this follows directly from the definition of $\rho_{c}$ and of $\Pi^{*}$ ).

Define $G:=\{f_{1},f_{1}^{\prime},f_{2},\ldots,f_{m}\}$ . By what has just been argued, we have

[TABLE]

Since $P,Q\subset G$ , and applying Lemma 3.1, we get that each of $P$ and $Q$ is contained in a part of $\Pi^{*}(G)$ . But $P\cap Q\neq\emptyset$ , thus the set $P\cup Q$ must be contained in a part of $\Pi^{*}(G)$ . Noting that $P\cup Q=G$ , this implies that $\Pi^{*}(G)=\{G\}$ . Combined with (10), this proves $\Pi^{*}(P^{\prime})=P^{\prime}$ , as claimed.

Applying Lemma 3.1 to the families $F^{\prime}$ , $P^{\prime}$ , and with $P^{\prime}\in\Pi^{*}(P^{\prime})$ , we conclude that $P^{\prime}$ is contained in one of the parts of $\Pi^{*}(F^{\prime})$ . Since this is true for every $P\in\Pi^{*}(F)$ , the lemma follows. ∎

3.4 The family $\hat{F}$

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ . We show that, in some sense, $F$ can be replaced by a simpler family $\hat{F}$ defined next. With each $P\in\Pi^{*}(F)$ associate the subspace $f_{P}:={\rm span}P$ . Then define the family

[TABLE]

Note that for $P\neq P^{\prime}$ we have $f_{P}\neq f_{P^{\prime}}$ ; otherwise, taking $P\cup P^{\prime}$ yields a partition of $F$ with strictly less parts and with smaller or equal value of $\rho_{c}$ , contradicting the minimality of $\Pi^{*}(F)$ .

The family $F$ can be replaced by $\hat{F}$ in the sense of Lemma 3.6, and $\hat{F}$ is simpler in the sense of Lemma 3.7.

Lemma 3.6.

Let $F,G$ be families of subspaces in ${\mathbb{K}}^{d}$ . Then

[TABLE]

By the sign $\simeq$ we mean that the identity holds after identifying the partiton $\Pi^{*}(\hat{F}\cup G)$ of $\hat{F}\cup G$ with the partition of $F\cup G$ naturally induced by it. Concretely, the lemma asserts that

[TABLE]

Proof.

In the proof we often abuse notation and regard a partition of $\hat{F}\cup G$ as a one of $F\cup G$ , as explained after the statement of the lemma. Let $\Pi^{*}$ be the partition of $F\cup G$ induced by $\Pi^{*}(\hat{F}\cup G)$ , given by

[TABLE]

We have $|\Pi^{*}|=|\Pi^{*}(\hat{F}\cup G)|$ and

[TABLE]

Thus

[TABLE]

To prove the inverse inequality, apply Lemma 3.1 to the families $F$ and $F\cup G$ . It follows that, for every $P\in\Pi^{*}(F)$ , there exists $Q\in\Pi^{*}(F\cup G)$ such that $P\subset Q$ . This means that $\Pi^{*}(F\cup G)$ induces a well-defined partition $\hat{\Pi}^{*}$ of $\hat{F}\cup G$ with $|\Pi^{*}(F\cup G)|=|\hat{\Pi}^{*}|$ and

[TABLE]

Concretely, $\hat{\Pi}^{*}$ is given by

[TABLE]

where

[TABLE]

We have

[TABLE]

This proves that $\rho_{c}(F\cup G)=\rho_{c}(\hat{F}\cup G)$ .

Next, we claim that $|\Pi^{*}(F\cup G)|=|\Pi^{*}(\hat{F}\cup G)|$ . Indeed, by our argument above, the partition $\hat{\Pi}^{*}$ of $\hat{F}\cup G$ satisfies

[TABLE]

Since $\Pi^{*}(\hat{F}\cup G)$ is taken to be the smallest that attains $\rho_{c}(\hat{F}\cup G)$ , we get

[TABLE]

Similarly, by our argument above, the partition $\Pi^{*}$ of $F\cup G$ satisfies

[TABLE]

Thus,

[TABLE]

This proves the claim.

By the uniqueness of minimal partition (see Corollary 3.2), we conclude that

[TABLE]

This completes the proof of the lemma. ∎

Lemma 3.7.

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ . Then

[TABLE]

Proof.

Apply Lemma 3.6 with $G=\emptyset$ . ∎

We introduce one more simple property that we need.

Lemma 3.8.

$\widehat{F\cup G}=\widehat{\widehat{F}\cup G}.$

Proof.

By Lemma 3.6, $\Pi^{*}(F\cup G)=\Pi^{*}(\hat{F}\cup G)$ . The assertion then easily follows. ∎

4 An algorithm for computing $\rho_{c}(F)$

In this section we prove Theorem 1.6. That is, we introduce an algorithm to compute $\rho_{c}(F)$ , for any number $c$ and a given family $F$ of $n$ subspaces in ${\mathbb{K}}^{d}$ , with polynomial running time in $n$ (and in $d$ ). While we designed our algorithm for the class of functions $\rho_{c}$ , it clearly works for a wider class of submodular functions. As it is different than the one in [7], we feel it would be interesting to explore its generality. Note that the problem is trivial for $c\leq 0$ , which is why we consider only $c>0$ .

As mentioned in the introduction, the problem of computing $\rho_{c}$ turns out to be an instance of a more general problem to which a strongly polynomial time algorithm is already known [7]. In more detail, the Dilworth truncation of a set function $b^{\prime}:2^{S}\to{\mathbb{R}}\cup\{\infty\}$ is defined as the function

[TABLE]

where the minimum is taken over all partitions $\Pi$ of $X$ .

Theorem 4.1 (Frank and Tardos [7, IV.3]).

Let $b^{\prime}:2^{S}\to{\mathbb{R}}\cup\{\infty\}$ be a submodular set function. Suppose that a minimizing oracle for $b^{\prime}$ is available. Then $b(S)$ can be computed in a strongly polynomial time. The algorithm also constructs a partition $\Pi$ of $S$ for which $b(S)=\sum_{P\in\Pi}b^{\prime}(P)$ .

Remark. In [7], a more general result is proved.

4.1 High-level description of the algorithm for $\rho_{c}$

The input to the algorithm is a number $c$ and a family of subspaces $F=\{f_{1},\ldots,f_{n}\}$ in ${\mathbb{K}}^{d}$ Write $F_{i}:=\{f_{1},\ldots,f_{i}\}$ . The high-level scheme of the algorithm is the following:

$\hat{F}_{1}\leftarrow\{f_{1}\}$ . 2. 2.

For $i\leftarrow$ $2$ to $n$

2.1.

$\Pi\leftarrow$ Compute $\Pi^{*}(\hat{F}_{i-1}\cup\{f_{i}\})$ 2. 2.2.

$\hat{F}_{i}\leftarrow\{{\rm span}(P)\mid P\in\Pi\}$ 3. 3.

Return $\sum_{\hat{f}\in\hat{F}_{n}}(d(\hat{f})-c)$

The heart of the algorithm is of course the missing description of Step 2.1, which computes, in the $i$ th iteration, the minimal partition of the family $\widehat{F}_{i-1}\cup\{f_{i}\}$ with respect to $\rho$ .

Lemma 4.2.

The computation in Step 2.1 can be done in strongly-polynomial time.

Recall that the minimal partition of $\hat{F}_{i-1}$ is the partition into singletons, by Lemma 3.7. So in this step we compute the effect on this partition of inserting one new subspace. We explain how to do so efficiently and prove Lemma 4.2 in Section 4.3 below. To describe and analyze step 2.1, we first need to recall submodular functions and optimization, which we do in Section 4.2. The proof of the lemma is then given in Section 4.3.

We are now ready to prove Theorem 1.6, assuming that Lemma 4.2 is true.

Proof of Theorem 1.6.

Correctness of the algorithm. By Lemma 3.8, we have

[TABLE]

Thus the computation of $\hat{F}_{i}$ in Step 2.2 is correct. In view of Lemmas 3.6 and 3.7, the algorithm’s output is $\rho_{c}(F)$ , as needed.

Running time of the algorithm. We represent a $k$ -dimensional subspace $f$ in ${\mathbb{K}}^{d}$ by a $k\times d$ matrix whose rows form a basis for $f$ . The dimension $d(f)$ of a subspace $f$ is just the number of rows in the matrix representing the subspace, and hence can be computed in a constant time. Let $P$ be a family of subspaces in ${\mathbb{K}}^{d}$ . To compute ${\rm span}(P)$ , we take the union of the rows of the matrices in $P$ (representing subspaces) and apply Gauss elimination (using row operations only). If $P$ has $n$ subspaces, we will need to apply Gauss elimination to a matrix of dimensions at most $(nd)\times d$ . The nonzero rows in the matrix received by this process will form a basis for ${\rm span}(P)$ .

Now let $F$ be a family of $n$ subspaces in ${\mathbb{K}}^{d}$ . Cleary, each line in the above description of the algorithm, when applied to $F$ , is called at most $n$ times. In each step, excluding Step 2.1, we are required to compute at most $n$ times one of the operations just described (finding dimension or span) or simple operations such as addition. In view of Lemma 4.2, the proof is complete. ∎

4.2 A submodular set function

Recall that a function $s$ defined on the collection of subsets of a finite set $A$ is called submodular if

[TABLE]

for all $X,Y\subset A$ .

The following is proved by Schrijver in [23].

Theorem 4.3 (Schrijver [23]).

There exists a strongly polynomial-time algorithm minimizing a submodular function $s$ , where $s$ is given by an oracle. The number of oracle calls is bounded by a polynomial in the size of the underlying set. The algorithm also finds a minimizer $X^{*}$ of $s$ .

In this section we consider a set function defined as follows. Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ and let $g\subset{\mathbb{K}}^{d}$ be a subspace not in $F$ . Fix $c>0$ . Define $r_{F,g,c}:2^{F}\to{\mathbb{K}}$ by

[TABLE]

where $\overline{X}:=F\setminus X$ . We then put

[TABLE]

and we let $X_{F,g,c}^{*}$ denote a subset $X\subset F$ that attains $r_{F,g,c}^{*}$ .

We show that $r_{F,g,c}$ is submodular.

Lemma 4.4.

Let $F$ and $g$ and $c$ be as above. Then $r_{F,g,c}$ is submodular.

Proof.

To simplify the notation, and as $F,g,c$ are fixed, we write for short $r=r_{F,g,c}$ . Let $X,Y\subset F$ . We need to show

[TABLE]

Put $f_{X}:={\rm span}(X\cup\{g\})$ . By definition, we have

[TABLE]

By basic linear algebra, we have the identity

[TABLE]

Thus the last equality, after some rearranging, is

[TABLE]

Noting that ${\rm span}(f_{X}\cup f_{Y})={\rm span}(f_{X\cup Y})$ and that ${\rm span}(f_{X}\cap f_{Y})\supset{\rm span}(f_{X\cap Y})$ , we get

[TABLE]

This proves the lemma. ∎

4.3 Inserting one subspace

We are now ready to describe in detail Step 2.1 which computes $\widehat{F}_{i}$ given $\widehat{F}_{i-1}$ and $f_{i}$ . More precisely, we describe a subroutine that receives as an input a family $F$ with $F=\widehat{F}$ and a subspace $g$ , and outputs $\Pi^{*}(F\cup\{g\})$ .

We will need the following observation.

Lemma 4.5.

Let $G=F\cup\{g\}$ be a family of subspaces in ${\mathbb{K}}^{d}$ . Let $Q_{g}\in\Pi^{*}(G)$ be the part that contains the subspace $g$ . Then

[TABLE]

Proof.

For every $Q\in\Pi^{*}(G)\setminus\{Q_{g}\}$ , we have $Q\subset F$ . By Lemma 3.1, there exists $P\in\Pi^{*}(F)$ such that $Q\subset P$ . Clearly, we also have $P\subset G$ . Applying Lemma 3.1 once again, we get that also $P\subset Q$ . Thus, $P=Q$ which means that $Q\in\Pi^{*}(F)$ . ∎

Corollary 4.6.

Let $F$ be a family of $n$ subspaces in ${\mathbb{K}}^{d}$ with $\hat{F}=F$ and let $g$ be another subspace in ${\mathbb{K}}^{d}$ . Then $\rho_{c}(F\cup\{g\})=r_{F,g,c}^{*}$ and

[TABLE]

where $X_{F,g,c}^{*}$ and $r_{F,g,c}^{*}$ are as defined in Section 4.2.

Proof.

This follows from the definitions of $\rho_{c}$ and $r_{F,g,c}^{*}$ , combined with Lemma 4.5. ∎

Proof of Lemma 4.2..

Combinig Corollary 4.6 with Theorem 4.3, we get that the computation in Step 2.1 can be done in strongly-polynomial time. ∎

5 Intersecting subspaces with a hyperplane

In this section we state (and reprove) a result of Lovász [19], which explains the source of the function $\rho$ (more precisely, taking $\rho_{c}$ with $c=1$ ) as the dimension of the intersections of a family of subspaces with a hyperplane in “general position”. This connection has been used by Lovász to study certain questions about matroids in [19], and by Lovász and Yemini in [20] to study rigid structures in ${\mathbb{R}}^{2}$ . We extend Lovász’ treatment to arbitrary fields ${\mathbb{K}}$ .

In Theorem 5.5 below, we further extend Lovász’s result, in a straightforward manner, to apply to the intersection of a family of subspaces with an arbitrary subspace (of any co-dimension) in “general position”, instead of only a (co-dimension 1) hyperplane.

Lovász [19] uses a very specific notion of genericity, which he calls general position defined below, and shows that $\rho$ correctly computes the dimension of the intersection when the hyperplane is in general position with respect to the given family of subspaces. In Theorem 7.1 we will prove that indeed “general position” is a generic property, namely holds for almost all hyperplanes. This will complete the connection with the PIT problem solved in this paper.

A hyperplane in ${\mathbb{K}}^{d}$ is a subspace (subspace of ${\mathbb{K}}^{d}$ ) of codimension 1. Let $F$ be a family of (nonzero) subspaces in ${\mathbb{K}}^{d}$ and let $h\subset{\mathbb{K}}^{d}$ be a hyperplane in ${\mathbb{K}}^{d}$ . We denote by $F\cap h$ the family $\{f\cap h\mid f\in F\}$ . Following Lovász, we have the following definition:

Definition 5.1 (General Position).

We say that $h$ is in general position with respect to $F$ if, for every $A,B,C\subset F$ , with $A$ nonempty, we have:

(i) If ${\rm span}(A)\subset h$ , then ${\rm span}(A)=\{0\}$ .

(ii) If888Note that here one can take any of $A,B,C$ to be the empty set, and we interpret ${\rm span}(\emptyset)=\{0\}$ .

[TABLE]

then

[TABLE]

Remark. In Section 6, we prove (in Theorem 7.1) that being in general position with respect to a given family $F$ is a generic property; this fact is mentioned in [19] without a proof.

Theorem 5.2 (Lovász [19, Theorem 2.3]).

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ . Let $h$ be a hyperplane in ${\mathbb{K}}^{d}$ in general position with respect to $F$ . Then

[TABLE]

For completeness, we introduce a slightly more detailed proof, based on the line of argument from [19].

Proof of Theorem 5.2.

Fix $F$ and $h$ as in the statement. Let $F^{\prime}:=F\cap h$ . We need to show that $\rho_{1}(F)=d(F^{\prime})$ .

We first prove that $d(F^{\prime})\leq\rho_{1}(F)$ . That is, equivalently, we show that $d(F^{\prime})\leq\rho_{1}(F,\Pi),$ for every partition $\Pi$ of the family $F$ . Let $\Pi$ be a partition of $F$ . For $P\in\Pi$ , let $P^{\prime}:=P\cap h$ . Then

[TABLE]

and hence

[TABLE]

Note also that, for every $P\in\Pi$ , we have ${\rm span}(P^{\prime})\subset{\rm span}(P)\cap h$ and hence

[TABLE]

where here we used property (i) of the general position assumption on $h$ , namely, we used the fact that ${\rm span}(P)$ is not contained in $h$ . We conclude that

[TABLE]

for every partition $\Pi$ of $F$ . This implies $d(F^{\prime})\leq\rho_{1}(F)$ .

To prove the reverse inequality, we show that, for a certain partition $\Pi^{*}$ of $F$ , the inequality (12) is in fact tight. We will construct $\Pi^{*}$ explicitly subsequently refining a given partition. We describe the first step, which is indeed the general step (the proof will allow us to proceed recursively).

Define an equivalence relation on $F$ as follows: For $f_{1},f_{2}\in F$ , $f_{1}\sim f_{2}$ if and only if

[TABLE]

Let $\{P_{1},\ldots,P_{m}\}$ be the partition (equivalence classes) of $F$ induced by the relation $\sim$ .

The main idea is to prove that after intersection with $h$ , the spans of the parts $P^{\prime}_{i}$ become a direct sum decomposition of ${\rm span}(F^{\prime})$ . As we will see below, $\Pi^{*}$ will be achieved by refining the partition $\{P_{1},\ldots,P_{m}\}$ inductively.

Lemma 5.3.

We have

[TABLE]

Before we prove Lemma 5.3, we establish some preliminary claims. Let $g_{1},\ldots,g_{m}$ be the (distinct) subspaces $g_{i}:={\rm span}(F^{\prime}\cup\{f\})$ for some $f\in P_{i}$ (note that by construction $g_{i}$ is independent of the specific element $f\in P_{i}$ that we take).

We observe that, for every $1\leq i\leq m$ ,

[TABLE]

Indeed, by property (i) of general position, $f$ is not contained in $h$ and $\dim(f\cap h)=\dim(f)-1$ , for every $f\in F$ . Hence, for every $f\in F$ , one can choose a basis for $f$ with all elements of the basis in $h$ except for exactly one element $b_{f}$ which is not in $h$ . Thus, fixing any $f\in P_{i}$ , we have

[TABLE]

Thus, $d(g_{i})=d(F^{\prime})+1$ , as needed.

Next, we observe that, for $i\neq j$ , we have

[TABLE]

Indeed, by construction $g_{i}\neq g_{j}$ , and in particular $g_{i}\cap g_{j}\subsetneq g_{i}$ . Combining this with (14), we get $d(g_{i}\cap g_{j})\leq d(g_{i})-1=d(F^{\prime})$ . By the definition of $g_{i},g_{j}$ , we also have ${\rm span}(F^{\prime})\subset g_{i}\cap g_{j}$ . Hence $g_{i}\cap g_{j}={\rm span}(F^{\prime})$ and (15) follows.

Proof of Lemma 5.3.

Here property (ii) of the general position definition will be crucial for the induction step. If $m=1$ then (13) clearly holds. For $m\geq 2$ , it suffices to show that, for every $2\leq k\leq m$ and every distinct indices $1\leq i_{1},\dots,i_{k}\leq m$ , one has

[TABLE]

We prove (16) by induction on $k$ . For $k=2$ , we need to show that ${\rm span}(P_{i_{1}}^{\prime})\cap{\rm span}(P_{i_{2}}^{\prime})=\{0\}$ , for every distinct $1\leq i_{1},i_{2}\leq m$ . By the definition of the subspaces $g_{i_{1}},g_{i_{2}}$ and applying (15), we have

[TABLE]

Since $h$ is in general position, using property (ii), this implies that ${\rm span}(P_{i_{1}})\cap{\rm span}(P_{i_{2}})=\{0\}$ . This proves the induction base case $k=2$ .

Assume next that (16) holds for some $2\leq k\leq m-1$ fixed and for every distinct indices $1\leq i_{1},\ldots,i_{k}\leq m$ . Let $1\leq i_{1},\ldots,i_{k+1}\leq m$ be some distinct indices. To establish the induction step we need to prove

[TABLE]

Observe that in order to prove (17) it suffices to show that

[TABLE]

Indeed, assume that (18) holds. Then

[TABLE]

where the first line uses the trivial fact that ${\rm span}(P_{i_{k+1}}^{\prime})\subset{\rm span}(P_{i_{2}}^{\prime}\cup\cdots\cup P_{i_{k+1}}^{\prime})$ and the second line is due to (18). By the induction hypothesis, we have

[TABLE]

Thus, assuming that (18) is true, (17) follows.

Finally, we now prove (18). Note that, by the definition of the subspaces $g_{i}$ and using (15), we have

[TABLE]

Hence, our assumption that $h$ is in general position with respect to $F$ implies that in fact

[TABLE]

This clearly implies (18). Thus we have established the inductive step and this completes the proof of Lemma 5.3. ∎

Recall that our goal is to show that (12) is tight for some partition $\Pi^{*}$ of $F$ . In view of Lemma 5.3, for the partition $\{P_{1},\ldots,P_{m}\}$ defined above, one has

[TABLE]

That is, we expressed the quantity $d(F^{\prime})$ as the sum of the quantities $d(P_{i}^{\prime})$ for certain subfamilies $P_{1},\ldots,P_{m}$ of $F$ . This allows to prove the existence of $\Pi^{*}$ using induction on the size of $F$ .

If $|F|=1$ , the unique partition on $F$ clearly attains (12). For $|F|\geq 1$ , let $\{P_{1},\ldots,P_{m}\}$ be the partition of $F$ given by Lemma 5.3, satisfying (19). If $m=1$ , the identity (19), combined with (14), gives

[TABLE]

This means that (12) is tight, and thus $\Pi^{*}=\{P_{1}\}$ . If $m>1$ , then each subfamily $P_{i}$ has fewer elements than $F$ . Applying the induction hypothesis, there exist subpartitions $\Pi_{i}^{*}=\{P_{i1},\ldots,P_{im_{i}}\}$ of $P_{i}$ , for each $1\leq i\leq m$ , satisfying

[TABLE]

Combined with (19), we get

[TABLE]

So $\Pi^{*}:=\bigcup_{i=1}^{m}\Pi_{i}^{*}$ forms a partition of $F$ that attains (12). This completes the proof of the theorem. ∎

Remark 5.4.

Note that in the inductive proof of Lemma 5.3, it was sufficient to consider not all $k$ -subsets of the $P_{i}$ in the given partition, but rather simply on intervals $P_{2},P_{3},\dots,P_{k}$ . The same induction on $k$ works without change. Thus even after refinement, in the proof of this theorem we never need to apply the “general position” condition more than $|F|$ times. This will help us later bound the show that $\rho_{1}(F)$ correctly computes $\dim(F\cap h)$ for most (or generic) hyperplanes $h$ even when ${\mathbb{K}}$ is finite and not too large.

We now generalize the theorem above to intersecting a family of subspaces with an arbitrary subspace. For this we need to extend the definition of “general position”.

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ . Let $\{{\bf x}_{1},\ldots,{\bf x}_{k}\}$ be a set of vectors, and define that the subspaces $h_{i}=\{{\bf x}_{1},\ldots,{\bf x}_{i}\}^{\perp}$ . Note that $h_{i}$ is of codimension $i$ in ${\mathbb{K}}^{d}$ , and that $h^{\prime}_{i}:=h_{i}\cap h_{i-1}$ is a hyperplane in $h_{i-1}$ , for $i=1,\ldots,k$ . We say that the subspace $h=h_{k}$ is in general position with respect to $F$ if for all $i\in[k]$ we have that the hyperplane $h^{\prime}_{i}$ is in general position with respect to the family $F_{i}=F\cap h_{i-1}$ .

Theorem 5.5.

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ . Let $h$ be a subspace in ${\mathbb{K}}^{d}$ of codimension $k$ in general position with respect to $F$ . Then

[TABLE]

Proof.

We prove by induction on the codimension $k$ . The case $k=1$ is Theorem 5.2.

Let ${\bf x}_{1},\ldots,{\bf x}_{k}\in{\mathbb{K}}^{d}$ be vectors such that $h=\{{\bf x}_{1},\ldots,{\bf x}_{k}\}^{\perp}$ is in general position with respect to $h$ . We know that $h^{\prime}_{k}$ is in general position with respect to the family $F_{k}:=F\cap h_{k-1}$ . By Theorem 5.2 again, we have

[TABLE]

where the minimum ranges over all partitions $\Pi_{k}$ of $F_{k}$ . Note that $\Pi_{k}$ induces a partition $\Pi$ on $F$ , in the obvious way. Moreover, for every $P^{\prime}\in\Pi_{k}$ there exists $P\subset F$ such that $P^{\prime}=P\cap h_{k-1}$ . By induction, we get

[TABLE]

Thus,

[TABLE]

where the first minimum (the outer one) in this exprssion is taken over all partitions $\Pi$ of $F$ , and, fixing $\Pi$ and given $P\in\Pi$ , the inner minimum is taken over all partitions $\Pi_{P}$ of the family $P$ .

Note that, for any partition $\Pi$ of $F$ , the partitions $\{\Pi_{P}\mid P\in\Pi\}$ induce a new partition $\Pi^{\prime}$ which is a refinement of $\Pi$ . Namely, $\Pi^{\prime}:=\bigcup_{P\in\Pi}\Pi_{P}$ . Note that taking $\Pi_{P}=\{P\}$ for each $P\in\Pi$ , we get

[TABLE]

We now prove the inverse inequality. Fix a partition $\Pi$ of $F$ , and, for $P\in\Pi$ , let $\Pi_{P}^{*}$ be a partition of $P$ that attains the minimum in

[TABLE]

That is, the partitions $\Pi_{P}^{*}$ satisfy

[TABLE]

Let $(\Pi^{\prime})^{*}$ be the partition of $F$ induced by $\bigcup\{\Pi_{P}^{*}\mid P\in\Pi\}$ . Observe that

[TABLE]

Combining the inequalities (20) and (21), we get $d(F\cap h)=\rho_{k}(F)$ . This completes the induction step, and therefore proves the theorem. ∎

6 Rank of symbolic matrices

In this section we show that the quantity $\rho_{c}(F)$ can be interpreted as the generic rank, defined as the rank over ${\mathbb{K}}({\bf x})$ , of a certain symbolic matrix associated with $F$ . More concretely, for ${\bf x}\in{\mathbb{K}}^{d}$ let

[TABLE]

We prove that $\rho_{c}(F)$ equals to the generic rank of a symbolic matrix whose entries are linear combinations of the coordinates of ${\bf x}$ .

Our main result for the section is the following (note that this is Theorem 1.4 in the introduction).

Theorem 6.1.

Let $u_{1},\ldots,u_{n},v_{1},\ldots,v_{n}\in{\mathbb{K}}^{d}$ be row vectors. Consider the symbolic matrix $A({\bf x})$ , with unknowns ${\bf x}=(x_{1},\ldots,x_{d})$ , whose $i$ th row is

[TABLE]

Then the (generic) rank of $A({\bf x})$ can be computed in polynomial time.

To prove the theorem we use the property established in Theorem 5.2, interpreting the quantity $\rho_{1}(F)$ as the dimension of the space spanned by

[TABLE]

for any hyperplane $h$ in general position with respect to $F$ (see Definition 5.1). Taking $h=h({\bf x})$ we prove, in Lemma 6.2, that the intersection $f\cap h({\bf x})$ is the span of vectors with entries that are linear combinations of the coordinates of ${\bf x}$ . We then prove, in Theorem 7.1, that, given a family $F$ , $h({\bf x})$ is in general position with respect to $F$ , for every generic ${\bf x}$ (namely, for almost every ${\bf x}\in{\mathbb{K}}^{d}$ ). Finally, we use the algorithm for computing $\rho_{1}$ from Section 4.

Lemma 6.2.

Let $f$ be an $m$ -dimensional subspace in ${\mathbb{K}}^{d}$ and let $v_{1},\ldots,v_{m}$ be a basis of $f$ . Let ${\bf x}\in{\mathbb{K}}^{d}$ and assume that $f\not\subseteq h({\bf x})$ . Then $h({\bf x})\cap f$ is spanned by vectors of the form

[TABLE]

*with $i\neq j$ .

Moreover, if (wlog) ${\bf x}\cdot v_{1}\neq 0$ , then the set $\{w_{12},\ldots,w_{1m}\}$ forms a basis of $f\cap h_{{\bf x}}$ .*

Proof.

We first observe that $w_{ij}\in f\cap h({\bf x})$ . Indeed, by definition, each $w_{ij}$ is a linear combination of basis vectors for $f$ , and thus $w_{ij}\in f$ . We also have

[TABLE]

Thus $w_{ij}\in f\cap h({\bf x})$ .

We now show that $w_{ij}$ also span $f\cap h({\bf x})$ . Indeed, we prove the stronger “moreover” statement.

Let $w\in f\cap h({\bf x})$ . Since $w\in f$ we may write $w=\sum_{i=1}^{m}a_{i}v_{i}$ . Since $w\in h({\bf x})$ , we have $w\cdot{\bf x}=0$ or

[TABLE]

If $v_{i}\cdot{\bf x}=0$ for every $i$ , then $f\subseteq h({\bf x})$ , contradicting our assumption. We may therefore assume, without loss of generality, that $v_{1}\cdot{\bf x}\neq 0$ . In this case (22) can be rewritten as

[TABLE]

We conclude that

[TABLE]

This completes the proof of the lemma. ∎

We observe an interesting consequence of Lemma 6.2, asserting that computing $\rho_{1}(F)$ for a family $F$ can be reduced to computing $\rho_{1}(G)$ , for a certain family $G$ consisting only of planes (two-dimensional subspaces).

Corollary 6.3.

Let $F=\{f_{1},\ldots,f_{n}\}$ be a family of subspaces in ${\mathbb{K}}^{d}$ and let $\{v_{i1},\ldots,v_{im_{i}}\}$ be a basis of $f_{i}$ , for $i=1,\ldots,n$ . Consider the family of two-dimensional subspaces

[TABLE]

where $g_{ijk}={\rm span}\{v_{ij},v_{ik}\}.$ Then $\rho_{1}(F)=\rho_{1}(G)$ .

Proof.

It follows easily from Theorem 7.1 that $h({\bf x})$ is in general position with respect to both families $F$ and $G$ , for every generic ${\bf x}\in{\mathbb{K}}^{d}$ . Fixing such ${\bf x}\in{\mathbb{K}}^{d}$ and applying Lemma 6.2, we see that ${\rm span}(F\cap h({\bf x}))={\rm span}(G\cap h({\bf x}))$ . By Theorem 5.2 this means that $\rho_{1}(F)=\rho_{1}(G)$ , as needed. ∎

The following lemma is a natural extension of Lemma 6.2 to a similar description of the intersection of a given subspace with a generic one, where the latter is not necessarily of co-dimension 1. If the co-dimension is $k$ , the basis elements of the intersection will be homogeneous polynomials of degree $k$ in the entries of the generic vectors. This connection, together with our algorithm for computing $\rho_{k}$ , will prove Theorem 1.5 from the introduction.

Lemma 6.4.

Let $k<m\leq d$ be integers. Let $f$ be an $m$ -dimensional subspace in ${\mathbb{K}}^{d}$ and let $v_{1},\ldots,v_{m}$ be a basis of $f$ . Let ${\bf x}_{1},\ldots,{\bf x}_{k}$ be vectors in ${\mathbb{K}}^{d}$ and define the subspace

[TABLE]

Assume that $\dim(f\cap h)=m-k$ (this extends the assumption $f\not\subseteq h({\bf x})$ of the lemma above). Let $X$ be the $k\times d$ matrix with ${\bf x}_{i}$ as its $i$ th row. Let $V$ denote the $d\times m$ matrix with $v_{j}$ as its $j$ th column. Put $M:=XV$ . So $M$ is a $k\times m$ matrix with $(i,j)$ entry being ${\bf x}_{i}\cdot v_{j}$ . For every $I\subset[m]$ of cardinality $k$ , let $M_{I}$ denote the $k\times k$ matrix received by restricting to the columns of $M$ with indices in $I$ . Then $f\cap h$ is the span of vectors of the form

[TABLE]

*where $S=\{s_{1}<\ldots<s_{k+1}\}\subset[m]$ is of cardinality $k+1$ and $I_{j}:=S\setminus\{s_{j}\}$ .

Moreover, if (wlog, given our assumption above), assuming that the last $k$ columns of M are linearly independent, $f\cap h$ is spanned by the $m-k$ vectors $w_{S}$ with $S$ containing the last $k$ columns.

Proof.

We first show that $w_{S}\in f\cap h$ , for every $S\subset[m]$ of cardinality $k+1$ . For $S$ fixed, we need to verify that $w_{S}$ is orthogonal to each of ${\bf x}_{1},\ldots,{\bf x}_{k}$ . For every $1\leq i\leq k$ we have

[TABLE]

Observe that the right-hand side is exactly the determinant of the matrix received by duplicating the $i$ th row of $M$ . Since the latter matrix is evidently singular, we conclude that $w_{S}\cdot{\bf x}_{i}=0$ , for every $i=1,\ldots,k$ . Thus $w_{S}\in h$ . Clearly, we also have $w_{S}\in f$ . Thus $w_{S}\in f\cap h$ , as needed.

We now turn to prove that the vectors $w_{S}$ generate $f\cap h$ . Indeed we prove the stronger “moreover” statement that already the $m-k$ vectors $w_{S}$ with $S$ of size $k+1$ that contain the last $k$ columns span $f\cap h$ . Recall that the last $k$ columns of $M$ are independent.

It will be convenient to add one more piece of (slightly informal) notation. Let $M^{\prime}$ be the matrix extending $M$ with one more (say, 0’th) row, that contains in the $j$ th coordinate the vector $v_{j}$ . Note that, up to a sign, the determinant of any $k+1$ minor of $M^{\prime}$ on columns $S$ is precisely $w_{S}$ .

Note also that column operations on $M^{\prime}$ , and replacing $w_{S}$ by the $k+1$ minors of the resulting matrix, do not change the span of the vectors $w_{S}$ . Moreover, note that column operations on the last $k$ columns of $M^{\prime}$ do not change the vectors $w_{S}$ , restricting to sets $S\subset I$ of size $k+1$ that contain the indices of the last $k$ columns. We may therefore assume, by performing such column operations, that the last $k$ columns of $M$ form the $k\times k$ identity matrix.

We will prove the lemma by induction on $k$ . We already know that this statement holds for $k=1$ (and any $m$ ) by Lemma 6.2. Assume it holds for $k-1$ (and $m-1$ , this is all we need), and we will infer the statement for $k$ . Consider the subspace $h^{\prime}$ orthogonal to the vectors ${\bf x}_{1},\dots,{\bf x}_{k-1}$ , and the subspace $f^{\prime}$ spanned by the vectors $v_{1},\dots,v_{m-1}$ , and form the associated $(k-1)\times(m-1)$ matrix, say $N$ . Add to the matrix $N$ the $0^{\prime}th$ row to create $N^{\prime}$ . By induction, we know that the $k$ -minors containing the last $k-1$ columns of $N^{\prime}$ are vectors which span the $f^{\prime}\cap h^{\prime}$ . For $i\in[m-k]$ , let $w_{i}^{\prime}$ denote the basis vector that corresponds to the columns $\{i,m-k+1,\ldots,m-1\}$ . Note that

[TABLE]

Now add to $N^{\prime}$ a last column for $v_{m}$ and a last row for $x_{k}$ to form $M^{\prime}$ . Fix $i\in[m-k]$ , and write $w_{i}:=w_{S_{i}}$ , where $S_{i}=\{i,m-k+1,\ldots,m\}$ . Due to the last $k$ columns of $M$ being the identity matrix, we have

[TABLE]

Moreover, one can check that in fact

[TABLE]

That is, $w_{i}=({\bf x}_{k}\cdot w_{i}^{\prime})v_{m}-({\bf x}_{k}\cdot v_{m})w_{i}^{\prime}$ . Applying Lemma 6.2, we get that the vectors $w_{i}$ , for $i\in[m-k]$ , form a basis for $f\cap h$ , as needed. ∎

7 Generic vs. General Position

This section completes the cycle of connections, proving that most (namely, generic) hyperplanes, and indeed most subspaces, are in general position (in the Lovász sense of Section 5) with respect to any given family of subspaces. The proof will make use the explicit description we established in the previous section for a basis to the intersection of a family of subspaces and a hyperplane. Thus, computing the ranks of the symbolic matrices in Theorems 1.4 and 1.5 are equivalent to computing the functions $\rho_{1}$ and $\rho_{k}$ respectively, which we can do efficiently by the algorithm of Section 4.

Theorem 7.1.

Let $F$ be a family of subspaces in ${\mathbb{K}}^{d}$ , and assume that either ${\rm char}({\mathbb{K}})>|F|$ or ${\rm char}({\mathbb{K}})=0$ . Then the hyperplane $h({\bf x})$ is in general position (see Definition 5.1) with respect to $F$ for almost every ${\bf x}\in{\mathbb{K}}^{d}$ . More precisely, over finite fields all but $|F|/|{\mathbb{K}}|$ - fraction of hyperplanes are not in general position, and for infinite fields they have measure zero.

The proof of this theorem turns out to be more intricate than we imagined. We will give below a linear-algebraic proof that is valid for all fields ${\mathbb{K}}$ . In the appendix we give an alternative, geometric proof which is valid for the field ${\mathbb{R}}$ of Real numbers.

Proof.

Fix subsets $A,B,C\subset F$ . Our goal is to show that for

[TABLE]

either $S\not\subseteq h({\bf x})$ generically, or $S\subset A\cap h({\bf x})$ generically. Indeed, we will prove that one of these alternative holds for every ${\bf x}$ , except for those ${\bf x}$ that vanish on a certain nontrivial linear equation. Thus, if ${\mathbb{K}}$ is finite, the fraction of such exceptional values of ${\bf x}$ is $1/|{\mathbb{K}}|$ . Since the number of choices of $A,B,C$ is finite, we see that if ${\mathbb{K}}$ is large enough this probability remains negligible. Being a bit more careful, (see Remark 5.4 at the end of the proof of Theorem 5.2), there are at most $|F|$ applications of the “general position” definition, and so the fraction of “bad” ${\bf x}$ is at most $|F|/|{\mathbb{K}}|$ as stated.

It is easy to see that replacing $B$ by ${\rm span}B$ and $C$ by ${\rm span}C$ does not affect the subspace $S$ . We may therefore assume that each of the families $B,C$ contains a single subspace of ${\mathbb{K}}^{d}$ .

Suppose that $B\cap C\neq\{0\}$ , that is, that there exists $v\in B\cap C$ , with $v\neq 0$ . Clearly, we have $v\in S$ and the linear form $v\cdot{\bf x}$ not identically zero. Thus, for almost every ${\bf x}$ , $S$ is not contained in $h({\bf x})$ and there is nothing to prove in this case. We may therefore assume that $B\cap C=\{0\}$ . In this case, after a change of basis of ${\mathbb{K}}^{d}$ , we may assume that $B={\rm span}\{e_{1},\ldots,e_{k}\}$ and $C=\{e_{k+1},\ldots,e_{k+m}\}$ , where $1\leq k<k+m\leq d$ and $e_{1},\ldots,e_{d}$ stand for the standard basis vectors in ${\mathbb{K}}^{d}$ .

From now on we will regard ${\bf x}$ as a vector of variables, and work in the field of fractions ${\mathbb{K}}({\bf x})$ . In particular this makes all subspaces under consideration, $A,B,C$ , $A\cap h({\bf x})$ and of course $S=S({\bf x})$ now subspaces of ${\mathbb{K}}({\bf x})^{d}$ (by taking the span of their bases in ${\mathbb{K}}({\bf x})^{d}$ ).

With this, our task becomes proving the following about these subspaces:

Claim 7.2.

Either $S\not\subseteq h({\bf x})$ , or $S\subset A\cap h({\bf x})$ .

We will break this task to two. Clearly, it will suffice to prove the claim for any spanning set $S^{\prime}$ replacing $S$ . So first we will prove that we can take $S^{\prime}$ to be the affine functions (of ${\bf x}$ ) in $S$ , and then we will prove the claim for $S^{\prime}$ .

Lemma 7.3.

$S$ * is spanned by its elements which are affine functions of ${\bf x}$ .*

Proof of Lemma 7.3.

Recall that we showed, in Lemma 6.2, that ${\rm span}_{\mathbb{K}}(A\cap h)$ has a basis consisting of elements of the form $(u^{t}v-v^{t}u){\bf x}$ , for some $u,v\in{\mathbb{K}}^{d}$ . Write $\{{\bf a}_{1}({\bf x}),\ldots,{\bf a}_{n}({\bf x})\}$ for a basis of ${\rm span}_{\mathbb{K}}(A\cap h)$ of this form.

Having bases for $B,C$ and $A\cap h({\bf x})$ we can express all elements of $S$ as linear combinations of these bases. Thus, elements in $S$ are described by solutions $\alpha,\alpha^{\prime}\in{\mathbb{K}}^{n}$ , $\beta\in{\mathbb{K}}^{k}$ , $\gamma\in{\mathbb{K}}^{m}$ to the following system of linear equations.

[TABLE]

where $\alpha_{i}\in{\mathbb{K}}$ (resp., $\alpha_{i}^{\prime},\beta_{i},\gamma_{i}\in{\mathbb{K}}$ ) is the $i$ th entry of $\alpha$ (resp., $\alpha^{\prime},\beta,\gamma$ ).

By basic theory of linear algebra, there exists a set of solutions, each of the form

[TABLE]

where $\alpha_{i}({\bf x}),\alpha^{\prime}_{i}({\bf x}),\beta_{i}({\bf x}),\gamma_{i}({\bf x})$ are rational functions in the entries of ${\bf x}$ , that together span the subspace $S$ . Moreover, these rational functions are of degree at most $|F|$ .

We will now strive to find a simpler spanning set $S^{\prime}$ for $S$ , and then use it to prove Claim 7.2.

The first simplification is realizing (via common denominators) that without loss of generality we can assume that all $\alpha_{i}({\bf x}),\alpha^{\prime}_{i}({\bf x}),\beta_{i}({\bf x}),\gamma_{i}({\bf x})$ are in fact polynomials in the entries of ${\bf x}$ . These elements of $S$ span the rest, after dividing by some fixed polynomial.

The next simplification (separating out homogeneous terms) shows that without loss of generality we can take all the polynomials in each of $\alpha,\alpha^{\prime},\beta,\gamma$ to be homogeneous of the same degree, which we may respectively call ${\rm deg}(\alpha),{\rm deg}(\alpha^{\prime}),{\rm deg}(\beta),{\rm deg}(\gamma)$ . These homogeneous solutions certainly span $S$ , and now we refine their structure further.

Indeed, inspecting the system of equations we know more: since each entry of ${\bf a}_{i}({\bf x})$ , for every $i$ is of degree one, we know that for some fixed integer $r\geq 0$ , they must satisfy ${\rm deg}(\alpha)={\rm deg}(\alpha^{\prime})=r$ and ${\rm deg}(\beta)={\rm deg}(\gamma)=r+1$ . We use this to stratify solutions $w$ by degree, and say that the associated $w$ has degree $r$ . Let $S_{r}$ be all solutions of degree $r$ (note that each $S_{r}$ is a subspace over ${\mathbb{K}}$ , though we will not use this fact). We call solutions $w$ of degree 0 linear. Our main simplification will come from showing that linear elements $S_{0}$ span $S$ , which in this notation is a restatement of the lemma we are proving.

Claim 7.4.

${\rm span}S_{0}=S$ **

We will prove this claim by induction on $r$ , using our stratifications $S_{r}$ of members of $S$ . It is clearly true for $r=0$ . So assume $S_{0}$ spans $S_{r}$ , and we need to prove that $S_{0}$ spans $S_{r+1}$ . By induction, it suffices to prove that $S_{r}$ spans $S_{r+1}$ . The plan for this will be as follows. We will assume we have some $w\in S_{r+1}$ . We will take all partial derivatives of its constituent polynomials with respect to each variable $x_{t}$ , $t\in[d]$ . From each of these we will generate an element $w_{t}\in S_{r}$ , as the degree decreased by 1. Finally, we will show that $w$ is a linear combination, indeed a very simple one, of the form : $(r+1)w=\sum_{t=1}^{d}x_{t}w_{t}$ . We now elaborate.

Fix $t\in[d]$ . Let us take a derivative with respect to the variable $x_{t}$ of ${\bf x}$ , of both sides of the identity (24). We get

[TABLE]

To define $w_{t}$ we first define $\alpha(t),\alpha^{\prime}(t),\beta(t),\gamma(t)$ by appropriately collecting homogeneous terms, and making sure that $\alpha(t),\alpha^{\prime}(t)\in A\cap h$ are of degree $r$ , and that $\beta(t)\in B$ and $\gamma(t)\in C$ are of degree $r+1$ :

•

$\alpha(t)_{i}=\frac{\partial\alpha_{i}({\bf x})}{\partial x_{t}}$

•

$\alpha^{\prime}(t)_{i}=\frac{\partial\alpha^{\prime}_{i}({\bf x})}{\partial x_{t}}$ ,

•

For $i\in[k]$ , $\beta(t)_{i}({\bf x})$ is

[TABLE]

•

For $i\in[m]$ , $\gamma(t)_{i}({\bf x})$ is

[TABLE]

here we used $[v]_{j}$ to denote the $j$ th entry of a vector $v$ . Now we can formally define $w_{t}\in S_{r}$ as follows. We first observe that

[TABLE]

Indeed, note that (24), restricted to the $j$ th component of the equation, implies that for every, $k+m<j\leq n$ , we have

[TABLE]

From this it is straightforward to verify that the identity (25) indeed holds. Thus, letting

[TABLE]

for each $t$ , the identity (25) implies that $w_{t}$ is in $S$ . Moreover, by our definition, $w_{t}$ is of degree $r-1$ .

It remains to prove that $w$ is spanned by the vectors $w_{t}$ . For this, one basic fact we will need is that if $p({\bf x})$ is any homogeneous polynomial of degree $m$ , it satisfies

[TABLE]

The second fact we will need follows from identity (24), when restricted to the $j$ th component of the equation. For every $j\in[k]$ ,

[TABLE]

Combining these two properties, we get

•

$\sum_{t}x_{t}\alpha(t)=r\alpha$

•

$\sum_{t}x_{t}\beta(t)=r\beta$

and this implies that

[TABLE]

Note that $r\neq 0$ ; indeed, for ${\mathbb{K}}$ with non-zero characteristic, we have $r<{\rm char}({\mathbb{K}})$ . Thus the vectors $w_{t}$ span $w$ . This completes the induction step, and hence the proof of Lemma 7.3. ∎

To complete the proof of the theorem we now prove

Lemma 7.5.

Either $S_{0}$ is not contained in $h({\bf x})$ , or it is contained in $A\cap h({\bf x})$ .

As the elements in $S_{0}$ are affine functions of ${\bf x}$ , a violation of the first possibility will imply that ${\bf x}$ satisfy a linear equation, so the fraction of such vectors is at most $1/|{\mathbb{K}}|$ as requested.

Proof of Lemma 7.5.

We first introduce some notation. Let $v({\bf x})$ be a vector in ${\mathbb{K}}({\bf x})^{d}$ , such that each entry of $v({\bf x})$ is some linear combination of $x_{1},\ldots,x_{d}$ , the coordinates of ${\bf x}$ . Then $v({\bf x})$ can be represented by a matrix $M\in{\rm Mat}_{d\times d}({\mathbb{K}})$ , with constant entries, such that $M{\bf x}=v({\bf x})$ . Note that if $M$ is skew-symmetric, this means that $(M{\bf x})\cdot{\bf x}=(M^{t}{\bf x})\cdot{\bf x}=-(M{\bf x})\cdot{\bf x}$ or $2(M{\bf x})\cdot{\bf x}=0$ , which means that $(M{\bf x})\cdot{\bf x}=0$ , unless the characteristic of the field is $2$ . Conversely, if $M{\bf x}\cdot{\bf x}=0$ for every ${\bf x}\in{\mathbb{K}}^{d}$ and so $M{\bf x}\cdot{\bf x}$ is the zero polynomial (in $d$ variables), which implies that $M$ is skew-symmetric.

Consider $k$ such matrices $M_{1},\ldots,M_{k}$ , representing vectors $v_{1}({\bf x}),\ldots,v_{k}({\bf x})$ , respectively. Then a linear combination $\sum_{i=1}^{k}\alpha_{i}M_{i}$ is a matrix that corresponds to a vector which is a linear combination of $v_{1}({\bf x}),\ldots,v_{k}({\bf x})$ , namely, $v(x)=\sum_{i}\alpha_{i}v_{i}({\bf x})$ . Thus $v({\bf x})$ lies in the span of the vectors $v_{i}({\bf x})$ .

Assume first that $k+m=d$ . We regard a $(k+m)\times(k+m)$ matrix $M$ as a block matrix with ${\rm TL}(M)$ (resp., ${\rm TR}(M)$ , ${\rm BL}(M)$ , ${\rm BR}(M)$ ) denoting the top-left (resp., top-right, bottom-left, bottom-right) blocks. More precisely, ${\rm TL}(M)$ (resp., ${\rm TR}(M)$ , ${\rm BL}(M)$ , ${\rm BR}(M)$ ) stands for the submatrix induced by taking the first $k$ (resp., first $k$ , last $m$ , last $m$ ) rows and first $k$ (resp., last $m$ , first $k$ , last $m$ ) columns of $M$ .

With some abuse of notation, we write $M\in Y$ , for a subspace $Y$ of ${\mathbb{K}}({\bf x})^{d}$ , if $M{\bf x}\in Y$ . Recall that $M$ is in $h$ if and only if $M$ is skew-symmetric. In particular, $TR(M)=-BL(M)^{t}$ , for every $M\in{\rm span}(A\cap h)$ . Assume that for some $M\in{\rm span}(A\cap h)$ , we have ${\rm TR}(M)\neq 0$ (and thus also $BL(M)\neq 0$ ). We claim that in this case there exists a matrix $\widetilde{M}\in S\setminus h$ . To see this it is sufficient to show that there exist matrices $b\in B$ and $c\in C$ such that $M+b=c$ which is not skew-symmetric (and therefore not in $h$ ). Indeed, let $b$ be defined by $TL(b)=-TL(M)$ , $TR(b)=-TR(M)$ , and $BL(b)=BR(b)=0$ . We define the matrix $c$ by $TL(c)=TR(c)=0$ , $BL(c)=BL(M)$ , $BR(c)=BR(M)$ . Clearly, $b\in B$ , $c\in C$ and $M+b=c$ . If $c$ is skew-symmetric, then we must have $BL(c)=BL(M)=0$ , contradicting our assumption on $M$ . Thus $c=M+b$ is in $A\cap h$ but not in $S$ . We conclude that in this case the general position requirement holds generically.

Assume next that for every $M\in{\rm span}(A\cap h)$ , we have ${\rm TR}(M)={\rm BL}(M)=0$ . Recall that ${\rm span}(A\cap h)$ is spanned by matrices of the form $v^{t}u-u^{t}v$ for some $u,v\in{\mathbb{K}}^{d}$ . Assume that $TR(v^{t}u-u^{t}v)=BL(v^{t}u-u^{t}v)=0$ for such a matrix. We claim that in this case at least one of $TL(v^{t}u-u^{t}v)$ or $BR(v^{t}u-u^{t}v)$ is the zero matrix. Indeed, put $M=v^{t}u-u^{t}v$ , and assume that $TL(M)\neq 0$ . The for some $1\leq i_{0}\neq j_{0}\leq k$ we have $u_{i_{0}}v_{j_{0}}\neq u_{j_{0}}v_{i_{0}}$ . In particular, not both $u_{i_{0}}v_{j_{0}}$ and $u_{j_{0}}v_{i_{0}}$ are zero. Assume, without loss of generality, that $u_{i_{0}}v_{j_{0}}\neq 0$ . That is, $u_{i_{0}},v_{j_{0}}\neq 0$ . Suppose that $u_{\ell}=0$ for every $\ell>k$ . In this case it is clear that $BR(M)=0$ and the claim is proved. Therefore, we may assume that for some $\ell>k$ we have $u_{\ell}\neq 0$ . Since we $BL(M)=0$ , we have in particular $u_{\ell}v_{j}=u_{j}v_{\ell}$ , for every $j=1,\ldots,k$ . In particular, $u_{\ell}v_{j_{0}}=u_{j_{0}}v_{\ell}$ . Note that since $v_{j_{0}}\neq 0$ and $u_{\ell}\neq 0$ , we must have that also $v_{\ell},u_{j_{0}}\neq 0$ . Thus, we get $\frac{v_{i_{0}}}{u_{i_{0}}}=\frac{v_{\ell}}{u_{\ell}}$ and $\frac{v_{j_{0}}}{u_{j_{0}}}=\frac{v_{\ell}}{u_{\ell}}$ . Combining these equalities, we get that $u_{i_{0}}v_{j_{0}}=u_{j_{0}}v_{i_{0}}$ , contradicting our assumption. This proves the claim.

This implies that ${\rm span}(A\cap h)$ is a direct sum $U\oplus V$ of matrices with entries supported only on $TL(M)$ for $M\in U$ and matrices supported by $BR(M)$ for $M\in V$ .

Now let $w\in S$ . By the definition of $S$ , $w$ can be written as $w=a+b=a^{\prime}+c$ for some $a,a^{\prime}\in{\rm span}(A\cap h)$ , $b\in B$ , $c\in C$ . Write $a=a_{U}+a_{V}$ , where $a_{U}\in U$ and $a_{V}\in V$ . Similarly, write $a^{\prime}=a^{\prime}_{U}+a^{\prime}_{V}$ . Then $a_{U}+a_{V}+b=a^{\prime}_{U}+a^{\prime}_{V}+c$ , or $a_{U}-a^{\prime}_{U}+b=a^{\prime}_{V}-a_{V}+c$ . But then, we must have $b=a^{\prime}_{U}-a_{U}$ and $c=a_{V}-a^{\prime}_{V}$ , which in particular implies that $b,c\in{\rm span}(A\cap h)$ .

Since $a_{U}-a^{\prime}_{U}\in U$ and $a^{\prime}_{V}-a_{V}\in V$ , this implies that, without loss of generality, we may assume $a\in U$ and $a^{\prime}\in V$ . Thus also $w=a+b=a^{\prime}+c\in{\rm span}(A\cap h)$ . We conclude that $w\in{\rm span}(A\cap h)$ for every $w\in S$ . Thus the general position requirement holds in this case.

We now prove the remaining case where $k+m<d$ , by reducing it to the case $k+m=d$ just discussed. Write $k+m=d-z$ , for some $z>0$ . Repeat the above argument ignoring the last $z$ rows and last $z$ columns of every matrix used along the proof. Note that for $a\in A\cap h$ , $a$ is skew-symmetric, and adding a matrix $b\in B$ or $c\in C$ will result with a matrix which is either in $h$ or not in $h$ , independent of the last $z$ rows and columns of $a$ . Indeed, for $b\in B$ and $c\in C$ these rows and columns are zero, and therefore they cannot affect the skew-symmetry of $a+b$ or $a^{\prime}+c$ . ∎

This completes the proof of Theorem 7.1. ∎

Having established the connection between genericity and general position, we can now complete the proof of Theorem 6.1.

Proof of Theorem 6.1..

Consider the family of subspaces $F=\{f_{1},\ldots,f_{n}\}$ , where $f_{i}:={\rm span}\{u_{i},v_{i}\}$ , for each $i=1,\ldots,n$ . Let ${\bf x}=(x_{1},\ldots,x_{d})$ and consider $h:=({\rm span}\{{\bf x}\})^{\perp}$ . In view of Lemma 6.2, we have

[TABLE]

On the other hand, by Theorem 5.2, we have $d(\{f\cap h\mid f\in F\})=\rho_{1}(F)$ . Thus there exists a deterministic strongly-polynomial time algorithm to compute ${\rm rank}A({\bf x})$ . ∎

We note that in the exact same way, our ability to efficiently compute $\rho_{k}$ for every integer $k$ by Theorem 1.6, and the characterization above, completes the proof of Theorem 1.5 from the introduction.

Acknowledgements We would like to thank Ze’ev Dvir for many illuminating discussions. We thank Amir Shpilka and Roy Meshulam for useful comments on an earlier version of the paper. We also thank Jan Vondrak for telling us about Dilworth truncation.

Appendix: Proof of Theorem 7.1 over ${\mathbb{R}}$

Here we provide an alternative proof of Theorem 7.1 which works over the field of Real numbers. One advantage of working over ${\mathbb{R}}$ is that we have the notions of a manifold and of the dimension of a manifold available. In the proof below, we use the fact that the set of linear subspaces of ${\mathbb{R}}^{d}$ can be viewed as a manifold. Then, to show that a certain set has measure zero, it is sufficient to show that this set has lower dimension. This allows us to obtain a more straightforward proof for the case ${\mathbb{K}}={\mathbb{R}}$ .

Proof over ${\mathbb{R}}$ :.

We first prove that property (i) in Definition 5.1 is a generic propery. Fix $A\subset F$ and put $g={\rm span}(A)$ . For ${\bf x}\in{\mathbb{S}}^{d-1}$ with $g\subset h({\bf x})$ , we have ${\bf x}\in\mathbb{S}^{d-1}\cap g^{\perp}$ . If $d(g)\geq 1$ , this means that ${\bf x}$ lies in a lower-dimensional sphere, which is a measure-zero subset of $\mathbb{S}^{d-1}$ . Since $F$ is finite (and so the number of different sub-families $A$ is finite), we conclude that for every ${\bf x}\in{\mathbb{S}}^{d-1}$ , excluding a finite union of certain lower-dimensional sub-spheres of ${\mathbb{S}}^{d-1}$ , $h({\bf x})$ satisfies property (i) in Definition 5.1.

We now prove that property (ii) in Definition 5.1 is a generic property. Fix some subfamilies $A,B,C\subset F$ . We first handle certain degenerate cases. Note that if

[TABLE]

for some ${\bf x}\in{\mathbb{S}}^{d-1}$ , then $h({\bf x})$ clearly satisfies property (ii). Using Lemma 6.2, condition (26) defines an algebraic subvariety of ${\mathbb{S}}^{d-1}$ . In particular, (26) either holds for every ${\bf x}\in{\mathbb{S}}^{d-1}$ or holds only for ${\bf x}$ taken from a subset of ${\mathbb{S}}^{d-1}$ of measure zero. In the former case this means that, with respect to the subfamilies $A,B,C$ , property (ii) in Definition 5.1 holds for $h({\bf x})$ for every ${\bf x}\in{\mathbb{S}}^{d-1}$ and there is nothing to prove. Therefore we can assume that we are in the complementary case. Namely, we assume that for almost every ${\bf x}\in{\mathbb{S}}^{d-1}$ we have

[TABLE]

Our next step is to identify the set of subspaces $g$ of the form $g={\rm span}(A\cap h({\bf x}))$ , for some ${\bf x}\in{\mathbb{S}}^{d-1}$ , and determine its dimension as a subset of the Grassmannian.

We need the following observation. Let

[TABLE]

We claim that $d(A\cap h({\bf x}))=r$ , for almost every ${\bf x}\in{\mathbb{S}}^{d-1}$ . Indeed, by Lemma 6.2, one can write a basis for ${\rm span}(A\cap h({\bf x}))$ with entries that are linear combinations in the coordinates of ${\bf x}$ . In particular, $d(A\cap h({\bf x}))$ can be expressed as the rank of a certain symbolic matrix, with entries depending linearly in the coordinates of ${\bf x}$ . This implies that $d(A\cap h({\bf x}))=r$ for every ${\bf x}\in{\mathbb{S}}^{d-1}$ , excluding some subset of ${\mathbb{S}}^{d-1}$ of measure zero, which proves our claim. (Here we used the fact that the maximal rank of a given symbolic matrix is the same as the generic rank of the matrix.)

Let $S_{0}$ denote the subset of ${\bf x}\in{\mathbb{S}}^{d-1}$ such that either $d(A\cap h({\bf x}))<r$ or (26) holds for $h({\bf x})$ . As argued above $S_{0}\subset{\mathbb{S}}^{d-1}$ has measure zero. Let ${\rm Gr}(r,d)$ denote the Grassmannian of $r$ -dimensional subspaces of ${\mathbb{R}}^{d}$ , regarded as an affine variety. We define a map $\phi:{\mathbb{S}}^{d-1}\setminus S_{0}\to{\rm Gr}(r,d)$ by

[TABLE]

We claim that the image of $\phi$ is $r$ -dimensional. Indeed, let $g\in{\rm Im}(\phi)$ and let ${\bf x}\in\phi^{-1}(g)$ . By definition of the domain of $\phi$ , we have ${\bf x}\not\in S_{0}$ and thus $d(g)=r$ . This means $g$ has maximal dimension. Observe that this guarantees that, for every ${\bf x}\in g^{\perp}$ , we have ${\rm span}(A\cap h({\bf x}))=g$ . (Indeed, ${\bf x}\in g^{\perp}$ certainly implies that $g\subset{\rm span}(A\cap h({\bf x}))$ and since $d(A\cap h({\bf x}))\leq r=d(g)$ , we have equality.) That is, $\phi^{-1}(g)=({\mathbb{S}}^{d-1}\setminus S_{0})\cap g^{\perp}$ and, in paticular,

[TABLE]

(dimension here is as a manifold). We conclude that

[TABLE]

as claimed.

Next, define

[TABLE]

Our goal is to show that $S_{1}^{\prime}$ has measure zero, as a subset of the sphere. For this, it suffices to show that $S_{1}:=S_{1}^{\prime}\setminus S_{0}$ has measure zero (since $S_{0}$ is of measure zero). Consider the restriction of $\phi$ to $S_{1}$ . Let $g\in{\rm Im}(\phi|_{S_{1}})$ and let ${\bf x}\in\phi|_{S_{1}}^{-1}(g)$ . Set

[TABLE]

Since ${\bf x}\not\in S_{0}$ , we have (27) which means

[TABLE]

Since we assume also that ${\bf x}\in S_{1}^{\prime}$ , we have ${\bf x}\in(g^{\prime})^{\perp}$ . So

[TABLE]

Clearly we also have ${\rm Im}(\phi|_{S_{1}})\subset{\rm Im}(\phi)$ , and thus, using (28),

[TABLE]

Combining (29) and (30), we get that

[TABLE]

This completes the proof of the lemma.∎

Bibliography28

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Agrawal, C Saha, R. Saptharishi, and N. Saxena, Jacobian hits circuits: Hitting sets, lower bounds for depth-d occur-k formulas and depth-3 transcendence degree-k circuits, SIAM J. Comput. 45.4 (2016), 1533–1562.
2[2] L. Asimow and B. Roth, The rigidity of graphs, Trans. Amer. Math. Soc. 245 (1978), 279–289.
3[3] P. M. Brooksbank, and E. M. Luks, Testing isomorphism of modules, J. Algebra 320.11 (2008), 4020–4029.
4[4] A. Chistov, G. Ivanyos, and M. Karpinski, Polynomial time algorithms for modules over finite dimensional algebras, Proceedings of the 1997 ACM International Symposium on Symbolic and Algebraic Computation (ISSAC) (1997), 68–74.
5[5] J. Edmonds, Systems of distinct representatives and linear algebra, J. Res. Natl. Bur. Stand. 71 (1967), 241–245.
6[6] S. Fenner, R. Gurjar, and T. Thierauf, Bipartite perfect matching is in quasi-NC. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing (STOC) (2016), 754–763.
7[7] A. Frank and É. Tardos, Generalized polymatroids and submodular flows, Mathematicl Programming 42 (1988), 489–563.
8[8] A. Garg, L. Gurvits, R. Oliveira, and A. Wigderson, Operator scaling: theory and applications, in ar Xiv:1511.03730 v 3 .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Subspace arrangements, graph rigidity and derandomization through submodular optimization††thanks:

Abstract

1 Introduction

1.1 Polynomial Identity Testing (PIT)

Open Problem 1.1**.**

Theorem 1.2** ([28]).**

Theorem 1.3**.**

Theorem 1.4**.**

Theorem 1.5**.**

1.2 Graph Rigidity

1.3 Subspaces and generic hyperplanes

Theorem 1.6**.**

1.4 Related works and applications

Theorem 1.7** **(Lovász [19]).

Theorem 1.8** **(Tanigawa [26, Corollary 4.13]).

1.5 Organization of this paper

2 Subspaces, partitions, and the function ρc\rho_{c}ρc​

Definition 2.1**.**

Notation.

3 Properties of minimal partitions

3.1 Main technical lemma

Lemma 3.1**.**

Proof.

3.2 Uniqueness of minimal partitions

Corollary 3.2** **(Uniqueness).

Proof.

Definition 3.3**.**

3.3 Monotonicity properties

Corollary 3.4** **(Monotonicity).

Proof.

Lemma 3.5**.**

Proof.

3.4 The family F^\hat{F}F^

Lemma 3.6**.**

Proof.

Lemma 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

4 An algorithm for computing ρc(F)\rho_{c}(F)ρc​(F)

Theorem 4.1** **(Frank and Tardos [7, IV.3]).

4.1 High-level description of the algorithm for ρc\rho_{c}ρc​

Lemma 4.2**.**

Proof of Theorem 1.6.

4.2 A submodular set function

Theorem 4.3** **(Schrijver [23]).

Lemma 4.4**.**

Proof.

4.3 Inserting one subspace

Lemma 4.5**.**

Proof.

Corollary 4.6**.**

Proof.

Proof of Lemma 4.2..

5 Intersecting subspaces with a hyperplane

Definition 5.1** (General Position).**

Theorem 5.2** **(Lovász [19, Theorem 2.3]).

Proof of Theorem 5.2.

Lemma 5.3**.**

Proof of Lemma 5.3.

Remark 5.4**.**

Theorem 5.5**.**

Proof.

6 Rank of symbolic matrices

Theorem 6.1**.**

Lemma 6.2**.**

Proof.

Corollary 6.3**.**

Proof.

Lemma 6.4**.**

Proof.

7 Generic vs. General Position

Theorem 7.1**.**

Open Problem 1.1.

Theorem 1.2 ([28]).

Theorem 1.3.

Theorem 1.4.

Theorem 1.5.

Theorem 1.6.

Theorem 1.7 (Lovász [19]).

Theorem 1.8 (Tanigawa [26, Corollary 4.13]).

2 Subspaces, partitions, and the function $\rho_{c}$

Definition 2.1.

Lemma 3.1.

Corollary 3.2 (Uniqueness).

Definition 3.3.

Corollary 3.4 (Monotonicity).

Lemma 3.5.

3.4 The family $\hat{F}$

Lemma 3.6.

Lemma 3.7.

Lemma 3.8.

4 An algorithm for computing $\rho_{c}(F)$

Theorem 4.1 (Frank and Tardos [7, IV.3]).

4.1 High-level description of the algorithm for $\rho_{c}$

Lemma 4.2.

Theorem 4.3 (Schrijver [23]).

Lemma 4.4.

Lemma 4.5.

Corollary 4.6.

Definition 5.1 (General Position).

Theorem 5.2 (Lovász [19, Theorem 2.3]).

Lemma 5.3.

Remark 5.4.

Theorem 5.5.

Theorem 6.1.

Lemma 6.2.

Corollary 6.3.

Lemma 6.4.

Theorem 7.1.

Claim 7.2.

Lemma 7.3.

Claim 7.4.

Lemma 7.5.

Appendix: Proof of Theorem 7.1 over ${\mathbb{R}}$

Proof over ${\mathbb{R}}$ :.