Subspace arrangements, graph rigidity and derandomization through submodular optimization
Orit E. Raz, Avi Wigderson

TL;DR
This paper introduces a deterministic polynomial-time algorithm for symbolic matrix rank computation, linking matroid theory, graph rigidity, and derandomization, with potential applications in polynomial identity testing and higher-dimensional rigidity problems.
Contribution
It provides a novel deterministic algorithm for a class of symbolic matrix rank problems, bridging matroid flats, graph rigidity, and submodular optimization, advancing derandomization techniques.
Findings
Deterministic polynomial-time algorithm for symbolic matrix rank.
Connection between graph rigidity and symbolic rank problems.
Potential for improved understanding of higher-dimensional graph rigidity.
Abstract
This paper presents a deterministic, strongly polynomial time algorithm for computing the matrix rank for a class of symbolic matrices (whose entries are polynomials over a field). This class was introduced, in a different language, by Lov\'asz [Lov] in his study of flats in matroids, and proved a duality theorem putting this problem in . As such, our result is another demonstration where ``good characterization'' in the sense of Edmonds leads to an efficient algorithm. In a different paper Lov\'asz [Lov79] proved that all such symbolic rank problems have efficient probabilistic algorithms, namely are in . As such, our algorithm may be interpreted as a derandomization result, in the long sequence special cases of the PIT (Polynomial Identity Testing) problem. Finally, Lov\'asz and Yemini [LoYe] showed how the same problem generalizes the graph rigidity problem in two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Theory Research · Complexity and Algorithms in Graphs · Interconnection Networks and Systems
Subspace arrangements, graph rigidity and derandomization through submodular optimization††thanks:
The first author was partially supported from NSF grant DMS-1128155. The second author was partially supported from NSF grant CCF-1412958
Orit E. Raz Department of Mathematics, University of British Columbia, Vancouver, Canada. [email protected]
Avi Wigderson School of Mathematics, Institute for Advanced Study, Princeton NJ 08540, U.S.A. [email protected]
Abstract
This paper presents a deterministic, strongly polynomial time algorithm for computing the matrix rank for a class of symbolic matrices (whose entries are polynomials over a field). This class was introduced, in a different language, by Lovász [19] in his study of flats in matroids, and proved a duality theorem putting this problem in . As such, our result is another demonstration where “good characterization” in the sense of Edmonds leads to an efficient algorithm. In a different paper Lovász [16] proved that all such symbolic rank problems have efficient probabilistic algorithms, namely are in . As such, our algorithm may be interpreted as a derandomization result, in the long sequence special cases of the PIT (Polynomial Identity Testing) problem. Finally, Lovász and Yemini [20] showed how the same problem generalizes the graph rigidity problem in two dimensions. As such, our algorithm may be seen as a generalization of the well-known deterministic algorithm for the latter problem.
There are two somewhat unusual technical features in this paper. The first is the translation of Lovász’ flats problem into a symbolic rank one. The second is the use of submodular optimization for derandomization. We hope that the tools developed for both will be useful for related problems, in particular for better understanding of graph rigidity in higher dimensions.
*Dedicated with admiration to László Lovász,
on the occasion of his 70th birthday.*
1 Introduction
In this paper we provide a new deterministic, strongly polynomial time algorithm which can be viewed in two ways. The first is as solving a derandomization problem, providing a deterministic algorithm to a new special case of the PIT (Polynomial Identity Testing) problem. The second is as computing the dimension of the span a collection of subspaces in high dimensional space. Motivating and connecting the two is the problem of testing graph rigidity, to which an efficient deterministic algorithm is known only in the plane, and is open for higher dimensions. Accordingly, we will divide the introduction to explain these three problems.
1.1 Polynomial Identity Testing (PIT)
Let be a field. Let be a -tuple of independent variables. The PIT problem is to determine, given a multivariate polynomial , if (as a polynomial). Of course, the description of as an input to this problem is central to its complexity, and many variants of this problem were considered. The most common formulation is when is given by an arithmetic formula or circuit111When the input is a circuit, the degree of is always assumed to be polynomial in the circuit’s size, and in all cases considered in this paper this will be evident..
The original version of this question was posed by Edmonds [5]. In his formulation, is the determinant of a matrix whose entries are linear forms in (we will refer such a matrix as a symbolic matrix). Lovász [16] proved that this problem is in namely has a fast probabilistic algorithm (for fields larger than the degree of ): indeed, the algorithm simply picks random elements from and evaluates (note that evaluating is efficient in all three formulations above, and indeed in all formulations considered). This left open the problem of finding an efficient deterministic algorithm, namely derandomizing Lovász’s algorithm for PIT.
Open Problem 1.1**.**
Is PIT ?
The importance of this seemingly specific open problem was revealed in an important result of Kabanets and Impagliazzo [13]. They showed that if the answer is positive (as everyone expects), this will imply non-trivial lower bounds on either arithmetic or Boolean circuits, well beyond current techniques.
The progress towards resolving this open problem has been by providing deterministic polynomial time algorithms for a large variety of special cases of it, with the idea of building up techniques. By far, in most of these results the special cases are defined by restricting the input polynomial to lie in some complexity class. In these cases, progress in derandomization followed closely progress on lower bounds for the appropriate class (as is the case in the Boolean setting as well). There are literally dozens of such papers: many are mentioned and explained in the surveys [22, 24] and e.g. the recent paper [1].
In parallel, with motivation from algebra, geometry and other areas, a different collection of special cases of PIT was studied, of a structural nature. Here one works with Edmond’s formulation, and develops an understanding (and often a polynomial time algorithm) for cases where the symbolic matrix has restricted structure. This includes for example the works [3, 4, 6, 9, 11, 21].
This paper contributes to the second line of research, providing new families of symbolic matrices for which PIT can be solved in deterministic polynomial time. To explain this structure we introduce some notation. We will work in a slightly more general setting, in two ways, as the results generalize to both. First, we will allow our symbolic matrices to have polynomial entries. In such cases, these polynomials will have simple formulas describing them. Second, we will be interested in computing the rank of the input symbolic matrix, not just whether its determinant vanishes. While seemingly a more general problem, this turns out to be equivalent to PIT (see e.g. [8, Appendix A]222The proof in [8] is given for non-commutative rank, but the exact same proof works verbatim for our usual notion of rank over .).
Let be a family of polynomial maps . In all cases we assume the degree of all polynomials in all maps is at most , and the number of variables is at most polynomial in , so we will think of as the input size to the problem.
A family of maps prescribes a family of symbolic matrices, so that each row is an image of the -vector of variables under some map in . More formally, define PIT() to be the set of all symbolic matrices (with columns, and rows) in which every row of the matrix is of the form , for some map . We will be interested in families for which the ranks of matrices in PIT() can be computed in polynomial time333We identify the set of matrices and the computational problem of determining their ranks..
We first demonstrate the convenience of this notation. Call complete, if a deterministic polynomial-time algorithm for PIT() implies a deterministic polynomial-time algorithm for PIT. Very simple maps are complete! It follows from Valiant’s [28] hardness of the determinant for the class444The arithmetic analog of the Boolean class . VP that
Theorem 1.2** ([28]).**
The class of affine linear maps is complete.
Indeed, Valiant’s original proof (see more detail here [15]) implies a stronger theorem. Even restricting the support of each row to have at most a single variable in some coordinate, is general enough to be complete.
Theorem 1.3**.**
The class of affine linear maps, such that each map is non-constant in at most a single variable from , is complete.
We now turn to define the polynomial maps we will be interested in, and for which we will be able to provide efficient deterministic algorithms. Some motivation for interest in these maps will be given in the next two subsections.
Consider the following class . Here . Every is of the form , where is a rank-1 matrix. While this family may look very special, we note that the problem of graph rigidity in (for which a polynomial time algorithm is known but far from trivial) is a very special case of PIT().555Moreover, the same family of rank-2, skew symmetric matrices is featured in a very different PIT problem: determining the maximum rank of a subspace generated by given such matrices. A deterministic polynomial time solution for this problem is given by Lovasz’ celebrated matroid parity algorithm [17] (see also [18], Theorem 11.1.2).
Theorem 1.4**.**
PIT() can be solved in deterministic polynomial time, over a field with sufficiently large characteristic (more precisely, when is larger than the number of rows of the input matrix or ).
This construction can be generalized as follows. Here we will generate PIT instances whose entries are polynomials, rather than linear functions of the variables. For a -dimensional tensor of size , denote by its “anti-symmetric” version, namely where for every entry we have . Note that for we have .
We now extend , in which a matrix (namely a 2-dimensional tensor) acts on one vector of variables, to , in which a -dimensional tensor acts on vectors of variables. Let denote the following class of (degree ) maps. Let be -vectors of independent variables, so altogether is a vector of variables. A -tensor of size in each dimension acts on simply with the ’th dimension acting on for . The output of this action is a vector (along dimension ) of length of polynomials of degree , each linear in for all . Define to be all maps defined by for any rank-1 tensor . Note that with this notation is precisely the class defined above.
Generalizing the above theorem we prove:
Theorem 1.5**.**
For every , PIT() can be solved in deterministic polynomial time, over a field with sufficiently large characteristic (more precisely, when is larger than the number of rows of the input matrix or ).
1.2 Graph Rigidity
The problem of graph rigidity arises from several motivations, originally, mechanical engineering (see [14]). Rigidity theory is a fast-growing area, and we refer the interested reader to [25] for more background and recent approaches. Graph rigidiy has several versions, we describe perhaps the most common one, generic rigidity. It is supposed to capture the structural rigidity of a “bars and joints” framework described by a graph. We will not be formal here as precise definitions can be found e.g. in [2]. Here the relevant field for the geometric/physical interpretation is the Real numbers , and we use it in this subsection as in other papers on this problem (although the algebraic formulation is meaningful for every field ).
Let be an undirected graph on vertices and edges. An embedding of in is a map . An embedding of is called rigid if there is no perturbation of the vertex positions which preserves all edge lengths, other than the rigid motions of . The graph is called rigid if every generic embedding of is rigid (equivalently, if there exists an embedding of which is rigid, see [2]). The main question is to determine if a given graph is rigid (and more generally, compute the dimension of the non-rigid motions of a generic embedding, in case is not rigid).
An extremely convenient formulation of the problem (as a PIT) is the following. Let be a set of variables indexed by and . The intuition is that are the coordinates of a generic embedding of the vertex in . Given , construct a symbolic matrix of dimensions , which may be viewed as a concatenation of matrices, one for each dimension . Every row corresponds to an edge , and for each , the column contains the entry , whereas the column contains the the negation .
It is not hard to prove that the rank (as usual, over ) of determines if is rigid, and indeed the dimension of non-rigid motions (see [2] for the details). It is easy to see that for every graph , the matrix is in the class above. Indeed, let denote the standard basis vectors in . For some , put and . Consider the matrix . Then , where is the row of . Thus Theorem 1.4 yields as a corollary a polynomial time algorithm to determine whether a given graph is rigid in . Such algorithms for rigidity in are known (see [10, Section 2.2] and references therein). Note that the matrices make sense over any field , instead of , and Theorem 1.4 in fact provides a deterministic polynomial time algorithm to compute the rank of these matrices over any field with large enough characteristic.
The symbolic matrix representation above shows that for every , the problem of testing graph rigidity in is in , and it is a decades-old problem to whether it is also in , even for the case .
Lovász and Yemini [20] have developed an alternative approach for studying graph rigidity in the plane, which obtains a somewhat finer characterization of rigidity than Laman’s. What is even more interesting is their method. They show that the matrices can actually be obtained in the following way. First, with every edge associate a certain -dimensional subspace . The intersection of this subspace with a generic hyperplane through the origin (of which the normal can be viewed essentially as the -vector of variables ) yields the row of . In more detail, identify the vertices of with the set , and let denote the standard basis in . Define to be the subspace of spanned by the pair of vectors and (note that the definition of is symmetric in ). Let denote the subspace of orthogonal to the vector . It is not hard to verify (see [20] for the details) that is spanned by the row of . Thus, for a generic , we have
[TABLE]
Thus, the question of computing the rank of becomes the question of computing the dimension of the span of the resulting intersections (which here are simply lines) with a generic hyperplane. To analyze this, Lovász and Yemini use a theory developed by Lovász [19] which studies a similar problem for an arbitrary family of subspaces. The relevant part of Lovász’s theory is introduced in the next subsection.
The idea of [20] can be applied also to rigidity in higher dimensions. For simplicity of the presentation, let us consider only the case . In this case we associate with each edge a 3-dimensional subspace of . Namely, the subspace spanned by the vectors , , , where here stand for the standard basis of . Let and define to be the (codim 2) subspace of orthogonal to the pair of vectors
[TABLE]
[TABLE]
It is not hard to verify that is one dimensional and spanned by the row of . Thus, for a generic choice of , we have
[TABLE]
A crucial difference from the case is that here a generic choice of does not yield a generic codim 2 subspace of . From the perspective of this method and of our paper, this is “the reason” why rigidity in higher dimensions is more challenging.
1.3 Subspaces and generic hyperplanes
Let be a collection of subspaces in . Let be a generic hyperplane in , which without loss of generality can be taken to be all vectors perpendicular to . For each subspace , let . Now consider the space spanned by the subspaces in (note that the flats in are functions of ). The question is, what is the dimension of ?
One of the major results of Lovász’ paper [19] is a formula, called (which we redefine in Section 2), that determines this dimension for every family of subspaces, and for satisfying a certain “general position” condition (see Definition 5.1). To show that a generic satisfies Lovász’s general position condition over any field (with large enough characteristic) is one main result of our paper (see Section 7). Note that this fact is mentioned (over the field ) in [19] with no proof. This fact is again mentioned666In Tanigawa [26] an alternative general position condition is suggested, to supposedly correct a mistake in Lovász’s paper. However we find the counter example in [26, footnote on p. 1416] false. We provide a full and detailed proof of Lovász’s formula in Section 5. and applied, again with no proof, in Tanigawa [26]. We see our paper as contributing to the completeness of these results.
When the subspaces are derived from a graph in the manner described above to generate the rigidity matrix, Lovász and Yemini [20] write the explicit special case of the formula , which yields an elegant characterization. For the general case of an arbitrary family of subspaces , the formula is given as the minimum, over all possible partitions of the family, of a certain easily computable function. As the number of partitions is exponential, there is no obvious efficient way of computing . We have recently learned that the problem of computing is a special case of minimizing, over all partitions of a set , the Dilworth truncation of a given submodular function defined over ; a strongly polynomial algorithm for this problem is given in Frank and Tardos [7, Chapters II.1 and IV.3]. In our paper we introduce an alternative777Our algorithm seems different than the one in [7], as it does not use duality. strongly polynomial algorithm for computing , by reducing the original problem to a minimization problem of a certain submodular function. In fact, we prove our result to a more general quantity , introduced in Section 2. (Note that is the quantity from [19].)
Theorem 1.6**.**
There is a deterministic, strongly polynomial time algorithm to compute for every real number .
Closing this circle, we will also prove that the problem of computing is equivalent to PIT(). This will yield Theorem 1.4 as a corollary to Theorem 1.6.
1.4 Related works and applications
We see our result as a step towards better understanding of the algorithmic aspects of the notions and formulas introduced in Lovázs [19] and their applications.
Let us mention one related concept studied in Lovász [19] and discuss follow-up work by Tanigawa [26], which is related to Theorem 5.2 proved in this paper. It would be interesting to find efficient algorithms for the natural computational problem at hand. The reader may skip this subsection at first reading.
Let be a finite family of subspace in (where is a field of characteristic [math]). Let be a collection of points in such that for each . The set is said to be in general position with respect to if, for every fixed, the following holds: Any subspace spanned by members of and points of containing must contain the whole flat . Lovász shows that there exists a choice of a set in general position with respect to any given family . He then proves the following formula:
Theorem 1.7** **(Lovász [19]).
Let be a finite family of subspace in , and let be in general position with respect to . Then
[TABLE]
An interesting application of Theorem 1.7 to the body-rod-bar rigidity problem is obtained by Tanigawa [26]. A body-rod-bar framework in is defined as a structure consisting of -dimensional subspaces (bodies) and -dimensional flats (rods) mutually linked by one-dimensional lines (bars). (The term “rod” is appropriate for .) More formally, a -dimensional body-rod-bar-framework is a triple , where is a graph, is the rod-configuration mapping a vertex to a -dimensional subspace of , and is the bar-configuration mapping an edge to a 2-dimensional subspace in , such that
[TABLE]
equivalently,
[TABLE]
where here the dot product should be interpreted appropriately (see [26] for the details). Assume also that for every .
An infinitesimal motion of is a mapping such that
[TABLE]
An infinitesimal motion is called trivial if either for all , or if, for some fixed we have and for every . Finally, a framework is called infinitesimally rigid if every infinitesimal motion is trivial.
The body-rod-bar problem gives rise to a matroid defined on the edge set whose rank is the maximum size of independent linear equations in (1) (for unknown m). From the definition, is infinitesimally rigid if and only if the rank of is .
Theorem 1.8** **(Tanigawa [26, Corollary 4.13]).
Let and suppose . Then, for almost all bar-configurations and almost all rod-configurations we have
[TABLE]
where the minimum is taken over all partitions of .
Tanigawa’s proof is a nice combination of Theorem 1.7 with the other result of Lovász mentioned in the introduction, cited below as Theorem 5.2. Briefly, the first (simpler) step in the proof is to reduce the problem to the form of Theorem 1.7. That is, a family of flats is introduced, and the question becomes to find the rank of a generic set of points . The family resulted from the reduction can be described as follow: Each edge of is associated with some fixed subspace in . Then , where are subspaces depending on the choice of rod configuration . Since is taken generically, this imposes some genericity on the subspaces , but they are not exactly generic. The proof is then complete by proving a relaxed version of Theorem 5.2, and adding the subspaces one after the other.
For more recent applications of [19, 20] see Tanigawa [26, 27].
1.5 Organization of this paper
In Section 2 we introduce the function , which is the main object of this study. The rest of the paper has two separate parts. The first, in Sections 3 and 4, describes the algorithm to compute . In Section 3, we present and prove properties of the function . Using these properties we describe, in Section 4, a deterministic strongly polynomial time algorithm that computes over every field via submodular optimization. Note that, as there is an alternative algorithm [7] in the literature to efficiently compute functions like , this part can be skipped.
The second part, in Sections 5, 6, and 7, describes the genericity proof of . In Section 5, we state (and reprove) the result of Lovász [19] above, relating to the intersection of with a hyperplane in “general position”. A similar relation is obtained for , for an integer (see Theorem 5.5). In Section 6, we develop an explicit representation of a basis of the family resulting from this intersection, which give rise to the symbolic matrices PIT() (and PIT()). Using this, we prove in Section 7 that most hyperplanes (and more generally, subspaces) satisfy the “general position” definition of Lovász, thus expressing the rank of a these symbolic matrices as appropriate . Using the algorithm above we can now compute these ranks deterministically and efficiently. This last section is the only one in which the size of the field is important.
2 Subspaces, partitions, and the function
We introduce the main objects of this study: Families of subspaces, their partitions, and the optimization problem we solve in this paper. We consider linear subspaces of . Let denote the dimension of a subspace . For a family of subspaces, we write and
[TABLE]
A partition of is a set of nonempty, pairwise disjoint subfamilies of , such that . For a partition of and a family of subspaces , we define the restriction of to by
[TABLE]
If , then forms a partition of .
Lovász [19] defined the following key function of a family of subspaces, whose meaning will be revealed in Section 5. We actually generalize his definition to a family of functions , for every (his is our for ). Computing in deterministic polynomial time given , in Section 4, will be the key to our derandomization results.
Fix a constant . Let be a finite family of subspaces in . For a partition of , we define
[TABLE]
[TABLE]
where the minimum is taken over all partitions of .
Definition 2.1**.**
We say that is a minimal partition of , with respect to the constant , if attains and has the smallest possible number of parts.
Remark. In Corollary 3.2 we prove that, fixing , a minimal partition of a family with respect to is unique.
Notation.
We will use small letters to denote subspaces in , capital letters to denote families of subspaces, and to denote partitions of a certain family of subspaces. Note that the elements of a partition are themselves families of subspaces.
3 Properties of minimal partitions
In this and the next section we develop our algorithm in a fully self-contained manner. As mentioned in the introduction, the reader may skip these sections and apply the algorithm of [7] as a black box. In this section, we introduce some properties of minimal partitions, to be used in our algorithm. We find these properties interesting in their own right, but some may be known, indeed in more generality, for submosular functions.
3.1 Main technical lemma
We start with the following main technical lemma of this section.
Lemma 3.1**.**
Let be families of subspaces in with minimal partitions , respectively. Assume that and . Then is contained in one of the parts of .
For the proof, the idea is to show that if, when considering a minimal partition for , it “pays off” to put the elements of together, then it still “pays off” (or at least, harmless) to put these elements together, when this time considering a minimal partition for .
Proof.
Consider the restriction of to (as defined in (2)). By assumption, , and thus forms a partition of .
Our assumption that , and recalling that forms a minimal partition of , implies that
[TABLE]
Fixing some arbitrary order on the elements of , we write
[TABLE]
where is non-empty and are distinct. Set . For each , define
[TABLE]
and put and . Note that
[TABLE]
and that
[TABLE]
With this notation, (4) can be rewritten as
[TABLE]
which implies
[TABLE]
Next, we define
[TABLE]
and put and . Similar to above, we have
[TABLE]
and
[TABLE]
We claim that
[TABLE]
Indeed, the inequality (8) holds if and only if
[TABLE]
which holds if and only if
[TABLE]
To prove the last inequality, notice that and , for every . Thus
[TABLE]
Hence, by (5) and (7), we get . This fact combined with the inequality (6) implies (9) and hence also (8). Since is assumed to be minimal for , we conclude that and . This completes the proof. ∎
3.2 Uniqueness of minimal partitions
We prove uniqueness of minimal partitions.
Corollary 3.2** **(Uniqueness).
Let be a family of subspaces in and let be minimal partitions of . Then .
Proof.
Let denote the equivalence relations on induced by the partitions , respectively. Let and assume that . That is , for some . Applying Lemma 3.1 (with , , and ), we get that is contained in one of the parts in . Thus . By symmetry, we conclude that if and only if . Thus , as claimed. ∎
Definition 3.3**.**
Fix . Define to be the minimal partition of a family of subspaces (with respect to ).
3.3 Monotonicity properties
We prove the following “monotonicity” property of minimal partitions.
Corollary 3.4** **(Monotonicity).
Let be families of subspaces in and assume that . Then is a refinement of .
Proof.
Apply Lemma 3.1 to the families and . ∎
The following is another type of monotonicity property.
Lemma 3.5**.**
Let be a family of subspaces in . Let , for every , and consider For a partition of , let denote the partition of induced by , replacing each by the corresponding . Then is a refinement of .
Proof.
Let and assume without loss of generality that , for some . It is easy to see, applying Lemma 3.1, that .
Put . We claim that . First note that it suffices to prove the claim for the special case where and , for , and then apply the same argument repeatedly to each . To prove the calim for the special case, consider the family . It is easy to see, by definition, that . By Lemma 3.1, is contained in a part of , for every family of subspaces that contains . Moreover, since , we have
[TABLE]
for every such (this follows directly from the definition of and of ).
Define . By what has just been argued, we have
[TABLE]
Since , and applying Lemma 3.1, we get that each of and is contained in a part of . But , thus the set must be contained in a part of . Noting that , this implies that . Combined with (10), this proves , as claimed.
Applying Lemma 3.1 to the families , , and with , we conclude that is contained in one of the parts of . Since this is true for every , the lemma follows. ∎
3.4 The family
Let be a family of subspaces in . We show that, in some sense, can be replaced by a simpler family defined next. With each associate the subspace . Then define the family
[TABLE]
Note that for we have ; otherwise, taking yields a partition of with strictly less parts and with smaller or equal value of , contradicting the minimality of .
The family can be replaced by in the sense of Lemma 3.6, and is simpler in the sense of Lemma 3.7.
Lemma 3.6**.**
Let be families of subspaces in . Then
[TABLE]
By the sign we mean that the identity holds after identifying the partiton of with the partition of naturally induced by it. Concretely, the lemma asserts that
[TABLE]
Proof.
In the proof we often abuse notation and regard a partition of as a one of , as explained after the statement of the lemma. Let be the partition of induced by , given by
[TABLE]
We have and
[TABLE]
Thus
[TABLE]
To prove the inverse inequality, apply Lemma 3.1 to the families and . It follows that, for every , there exists such that . This means that induces a well-defined partition of with and
[TABLE]
Concretely, is given by
[TABLE]
where
[TABLE]
We have
[TABLE]
This proves that .
Next, we claim that . Indeed, by our argument above, the partition of satisfies
[TABLE]
Since is taken to be the smallest that attains , we get
[TABLE]
Similarly, by our argument above, the partition of satisfies
[TABLE]
Thus,
[TABLE]
This proves the claim.
By the uniqueness of minimal partition (see Corollary 3.2), we conclude that
[TABLE]
This completes the proof of the lemma. ∎
Lemma 3.7**.**
Let be a family of subspaces in . Then
[TABLE]
Proof.
Apply Lemma 3.6 with . ∎
We introduce one more simple property that we need.
Lemma 3.8**.**
Proof.
By Lemma 3.6, . The assertion then easily follows. ∎
4 An algorithm for computing
In this section we prove Theorem 1.6. That is, we introduce an algorithm to compute , for any number and a given family of subspaces in , with polynomial running time in (and in ). While we designed our algorithm for the class of functions , it clearly works for a wider class of submodular functions. As it is different than the one in [7], we feel it would be interesting to explore its generality. Note that the problem is trivial for , which is why we consider only .
As mentioned in the introduction, the problem of computing turns out to be an instance of a more general problem to which a strongly polynomial time algorithm is already known [7]. In more detail, the Dilworth truncation of a set function is defined as the function
[TABLE]
where the minimum is taken over all partitions of .
Theorem 4.1** **(Frank and Tardos [7, IV.3]).
Let be a submodular set function. Suppose that a minimizing oracle for is available. Then can be computed in a strongly polynomial time. The algorithm also constructs a partition of for which .
Remark. In [7], a more general result is proved.
4.1 High-level description of the algorithm for
The input to the algorithm is a number and a family of subspaces in Write . The high-level scheme of the algorithm is the following:
. 2. 2.
For to
- 2.1.
Compute 2. 2.2.
3. 3.
Return
The heart of the algorithm is of course the missing description of Step 2.1, which computes, in the th iteration, the minimal partition of the family with respect to .
Lemma 4.2**.**
The computation in Step 2.1 can be done in strongly-polynomial time.
Recall that the minimal partition of is the partition into singletons, by Lemma 3.7. So in this step we compute the effect on this partition of inserting one new subspace. We explain how to do so efficiently and prove Lemma 4.2 in Section 4.3 below. To describe and analyze step 2.1, we first need to recall submodular functions and optimization, which we do in Section 4.2. The proof of the lemma is then given in Section 4.3.
We are now ready to prove Theorem 1.6, assuming that Lemma 4.2 is true.
Proof of Theorem 1.6.
Correctness of the algorithm. By Lemma 3.8, we have
[TABLE]
Thus the computation of in Step 2.2 is correct. In view of Lemmas 3.6 and 3.7, the algorithm’s output is , as needed.
Running time of the algorithm. We represent a -dimensional subspace in by a matrix whose rows form a basis for . The dimension of a subspace is just the number of rows in the matrix representing the subspace, and hence can be computed in a constant time. Let be a family of subspaces in . To compute , we take the union of the rows of the matrices in (representing subspaces) and apply Gauss elimination (using row operations only). If has subspaces, we will need to apply Gauss elimination to a matrix of dimensions at most . The nonzero rows in the matrix received by this process will form a basis for .
Now let be a family of subspaces in . Cleary, each line in the above description of the algorithm, when applied to , is called at most times. In each step, excluding Step 2.1, we are required to compute at most times one of the operations just described (finding dimension or span) or simple operations such as addition. In view of Lemma 4.2, the proof is complete. ∎
4.2 A submodular set function
Recall that a function defined on the collection of subsets of a finite set is called submodular if
[TABLE]
for all .
The following is proved by Schrijver in [23].
Theorem 4.3** **(Schrijver [23]).
There exists a strongly polynomial-time algorithm minimizing a submodular function , where is given by an oracle. The number of oracle calls is bounded by a polynomial in the size of the underlying set. The algorithm also finds a minimizer of .
In this section we consider a set function defined as follows. Let be a family of subspaces in and let be a subspace not in . Fix . Define by
[TABLE]
where . We then put
[TABLE]
and we let denote a subset that attains .
We show that is submodular.
Lemma 4.4**.**
Let and and be as above. Then is submodular.
Proof.
To simplify the notation, and as are fixed, we write for short . Let . We need to show
[TABLE]
Put . By definition, we have
[TABLE]
By basic linear algebra, we have the identity
[TABLE]
Thus the last equality, after some rearranging, is
[TABLE]
Noting that and that , we get
[TABLE]
This proves the lemma. ∎
4.3 Inserting one subspace
We are now ready to describe in detail Step 2.1 which computes given and . More precisely, we describe a subroutine that receives as an input a family with and a subspace , and outputs .
We will need the following observation.
Lemma 4.5**.**
Let be a family of subspaces in . Let be the part that contains the subspace . Then
[TABLE]
Proof.
For every , we have . By Lemma 3.1, there exists such that . Clearly, we also have . Applying Lemma 3.1 once again, we get that also . Thus, which means that . ∎
Corollary 4.6**.**
Let be a family of subspaces in with and let be another subspace in . Then and
[TABLE]
where and are as defined in Section 4.2.
Proof.
This follows from the definitions of and , combined with Lemma 4.5. ∎
Proof of Lemma 4.2..
Combinig Corollary 4.6 with Theorem 4.3, we get that the computation in Step 2.1 can be done in strongly-polynomial time. ∎
5 Intersecting subspaces with a hyperplane
In this section we state (and reprove) a result of Lovász [19], which explains the source of the function (more precisely, taking with ) as the dimension of the intersections of a family of subspaces with a hyperplane in “general position”. This connection has been used by Lovász to study certain questions about matroids in [19], and by Lovász and Yemini in [20] to study rigid structures in . We extend Lovász’ treatment to arbitrary fields .
In Theorem 5.5 below, we further extend Lovász’s result, in a straightforward manner, to apply to the intersection of a family of subspaces with an arbitrary subspace (of any co-dimension) in “general position”, instead of only a (co-dimension 1) hyperplane.
Lovász [19] uses a very specific notion of genericity, which he calls general position defined below, and shows that correctly computes the dimension of the intersection when the hyperplane is in general position with respect to the given family of subspaces. In Theorem 7.1 we will prove that indeed “general position” is a generic property, namely holds for almost all hyperplanes. This will complete the connection with the PIT problem solved in this paper.
A hyperplane in is a subspace (subspace of ) of codimension 1. Let be a family of (nonzero) subspaces in and let be a hyperplane in . We denote by the family . Following Lovász, we have the following definition:
Definition 5.1** (General Position).**
We say that is in general position with respect to if, for every , with nonempty, we have:
(i) If , then .
(ii) If888Note that here one can take any of to be the empty set, and we interpret .
[TABLE]
then
[TABLE]
Remark. In Section 6, we prove (in Theorem 7.1) that being in general position with respect to a given family is a generic property; this fact is mentioned in [19] without a proof.
Theorem 5.2** **(Lovász [19, Theorem 2.3]).
Let be a family of subspaces in . Let be a hyperplane in in general position with respect to . Then
[TABLE]
For completeness, we introduce a slightly more detailed proof, based on the line of argument from [19].
Proof of Theorem 5.2.
Fix and as in the statement. Let . We need to show that .
We first prove that . That is, equivalently, we show that for every partition of the family . Let be a partition of . For , let . Then
[TABLE]
and hence
[TABLE]
Note also that, for every , we have and hence
[TABLE]
where here we used property (i) of the general position assumption on , namely, we used the fact that is not contained in . We conclude that
[TABLE]
for every partition of . This implies .
To prove the reverse inequality, we show that, for a certain partition of , the inequality (12) is in fact tight. We will construct explicitly subsequently refining a given partition. We describe the first step, which is indeed the general step (the proof will allow us to proceed recursively).
Define an equivalence relation on as follows: For , if and only if
[TABLE]
Let be the partition (equivalence classes) of induced by the relation .
The main idea is to prove that after intersection with , the spans of the parts become a direct sum decomposition of . As we will see below, will be achieved by refining the partition inductively.
Lemma 5.3**.**
We have
[TABLE]
Before we prove Lemma 5.3, we establish some preliminary claims. Let be the (distinct) subspaces for some (note that by construction is independent of the specific element that we take).
We observe that, for every ,
[TABLE]
Indeed, by property (i) of general position, is not contained in and , for every . Hence, for every , one can choose a basis for with all elements of the basis in except for exactly one element which is not in . Thus, fixing any , we have
[TABLE]
Thus, , as needed.
Next, we observe that, for , we have
[TABLE]
Indeed, by construction , and in particular . Combining this with (14), we get . By the definition of , we also have . Hence and (15) follows.
Proof of Lemma 5.3.
Here property (ii) of the general position definition will be crucial for the induction step. If then (13) clearly holds. For , it suffices to show that, for every and every distinct indices , one has
[TABLE]
We prove (16) by induction on . For , we need to show that , for every distinct . By the definition of the subspaces and applying (15), we have
[TABLE]
Since is in general position, using property (ii), this implies that . This proves the induction base case .
Assume next that (16) holds for some fixed and for every distinct indices . Let be some distinct indices. To establish the induction step we need to prove
[TABLE]
Observe that in order to prove (17) it suffices to show that
[TABLE]
Indeed, assume that (18) holds. Then
[TABLE]
where the first line uses the trivial fact that and the second line is due to (18). By the induction hypothesis, we have
[TABLE]
Thus, assuming that (18) is true, (17) follows.
Finally, we now prove (18). Note that, by the definition of the subspaces and using (15), we have
[TABLE]
Hence, our assumption that is in general position with respect to implies that in fact
[TABLE]
This clearly implies (18). Thus we have established the inductive step and this completes the proof of Lemma 5.3. ∎
Recall that our goal is to show that (12) is tight for some partition of . In view of Lemma 5.3, for the partition defined above, one has
[TABLE]
That is, we expressed the quantity as the sum of the quantities for certain subfamilies of . This allows to prove the existence of using induction on the size of .
If , the unique partition on clearly attains (12). For , let be the partition of given by Lemma 5.3, satisfying (19). If , the identity (19), combined with (14), gives
[TABLE]
This means that (12) is tight, and thus . If , then each subfamily has fewer elements than . Applying the induction hypothesis, there exist subpartitions of , for each , satisfying
[TABLE]
Combined with (19), we get
[TABLE]
So forms a partition of that attains (12). This completes the proof of the theorem. ∎
Remark 5.4**.**
Note that in the inductive proof of Lemma 5.3, it was sufficient to consider not all -subsets of the in the given partition, but rather simply on intervals . The same induction on works without change. Thus even after refinement, in the proof of this theorem we never need to apply the “general position” condition more than times. This will help us later bound the show that correctly computes for most (or generic) hyperplanes even when is finite and not too large.
We now generalize the theorem above to intersecting a family of subspaces with an arbitrary subspace. For this we need to extend the definition of “general position”.
Let be a family of subspaces in . Let be a set of vectors, and define that the subspaces . Note that is of codimension in , and that is a hyperplane in , for . We say that the subspace is in general position with respect to if for all we have that the hyperplane is in general position with respect to the family .
Theorem 5.5**.**
Let be a family of subspaces in . Let be a subspace in of codimension in general position with respect to . Then
[TABLE]
Proof.
We prove by induction on the codimension . The case is Theorem 5.2.
Let be vectors such that is in general position with respect to . We know that is in general position with respect to the family . By Theorem 5.2 again, we have
[TABLE]
where the minimum ranges over all partitions of . Note that induces a partition on , in the obvious way. Moreover, for every there exists such that . By induction, we get
[TABLE]
Thus,
[TABLE]
where the first minimum (the outer one) in this exprssion is taken over all partitions of , and, fixing and given , the inner minimum is taken over all partitions of the family .
Note that, for any partition of , the partitions induce a new partition which is a refinement of . Namely, . Note that taking for each , we get
[TABLE]
We now prove the inverse inequality. Fix a partition of , and, for , let be a partition of that attains the minimum in
[TABLE]
That is, the partitions satisfy
[TABLE]
Let be the partition of induced by . Observe that
[TABLE]
Combining the inequalities (20) and (21), we get . This completes the induction step, and therefore proves the theorem. ∎
6 Rank of symbolic matrices
In this section we show that the quantity can be interpreted as the generic rank, defined as the rank over , of a certain symbolic matrix associated with . More concretely, for let
[TABLE]
We prove that equals to the generic rank of a symbolic matrix whose entries are linear combinations of the coordinates of .
Our main result for the section is the following (note that this is Theorem 1.4 in the introduction).
Theorem 6.1**.**
Let be row vectors. Consider the symbolic matrix , with unknowns , whose th row is
[TABLE]
Then the (generic) rank of can be computed in polynomial time.
To prove the theorem we use the property established in Theorem 5.2, interpreting the quantity as the dimension of the space spanned by
[TABLE]
for any hyperplane in general position with respect to (see Definition 5.1). Taking we prove, in Lemma 6.2, that the intersection is the span of vectors with entries that are linear combinations of the coordinates of . We then prove, in Theorem 7.1, that, given a family , is in general position with respect to , for every generic (namely, for almost every ). Finally, we use the algorithm for computing from Section 4.
Lemma 6.2**.**
Let be an -dimensional subspace in and let be a basis of . Let and assume that . Then is spanned by vectors of the form
[TABLE]
*with .
Moreover, if (wlog) , then the set forms a basis of .*
Proof.
We first observe that . Indeed, by definition, each is a linear combination of basis vectors for , and thus . We also have
[TABLE]
Thus .
We now show that also span . Indeed, we prove the stronger “moreover” statement.
Let . Since we may write . Since , we have or
[TABLE]
If for every , then , contradicting our assumption. We may therefore assume, without loss of generality, that . In this case (22) can be rewritten as
[TABLE]
We conclude that
[TABLE]
This completes the proof of the lemma. ∎
We observe an interesting consequence of Lemma 6.2, asserting that computing for a family can be reduced to computing , for a certain family consisting only of planes (two-dimensional subspaces).
Corollary 6.3**.**
Let be a family of subspaces in and let be a basis of , for . Consider the family of two-dimensional subspaces
[TABLE]
where Then .
Proof.
It follows easily from Theorem 7.1 that is in general position with respect to both families and , for every generic . Fixing such and applying Lemma 6.2, we see that . By Theorem 5.2 this means that , as needed. ∎
The following lemma is a natural extension of Lemma 6.2 to a similar description of the intersection of a given subspace with a generic one, where the latter is not necessarily of co-dimension 1. If the co-dimension is , the basis elements of the intersection will be homogeneous polynomials of degree in the entries of the generic vectors. This connection, together with our algorithm for computing , will prove Theorem 1.5 from the introduction.
Lemma 6.4**.**
Let be integers. Let be an -dimensional subspace in and let be a basis of . Let be vectors in and define the subspace
[TABLE]
Assume that (this extends the assumption of the lemma above). Let be the matrix with as its th row. Let denote the matrix with as its th column. Put . So is a matrix with entry being . For every of cardinality , let denote the matrix received by restricting to the columns of with indices in . Then is the span of vectors of the form
[TABLE]
*where is of cardinality and .
Moreover, if (wlog, given our assumption above), assuming that the last columns of M are linearly independent, is spanned by the vectors with containing the last columns.
Proof.
We first show that , for every of cardinality . For fixed, we need to verify that is orthogonal to each of . For every we have
[TABLE]
Observe that the right-hand side is exactly the determinant of the matrix received by duplicating the th row of . Since the latter matrix is evidently singular, we conclude that , for every . Thus . Clearly, we also have . Thus , as needed.
We now turn to prove that the vectors generate . Indeed we prove the stronger “moreover” statement that already the vectors with of size that contain the last columns span . Recall that the last columns of are independent.
It will be convenient to add one more piece of (slightly informal) notation. Let be the matrix extending with one more (say, 0’th) row, that contains in the th coordinate the vector . Note that, up to a sign, the determinant of any minor of on columns is precisely .
Note also that column operations on , and replacing by the minors of the resulting matrix, do not change the span of the vectors . Moreover, note that column operations on the last columns of do not change the vectors , restricting to sets of size that contain the indices of the last columns. We may therefore assume, by performing such column operations, that the last columns of form the identity matrix.
We will prove the lemma by induction on . We already know that this statement holds for (and any ) by Lemma 6.2. Assume it holds for (and , this is all we need), and we will infer the statement for . Consider the subspace orthogonal to the vectors , and the subspace spanned by the vectors , and form the associated matrix, say . Add to the matrix the row to create . By induction, we know that the -minors containing the last columns of are vectors which span the . For , let denote the basis vector that corresponds to the columns . Note that
[TABLE]
Now add to a last column for and a last row for to form . Fix , and write , where . Due to the last columns of being the identity matrix, we have
[TABLE]
Moreover, one can check that in fact
[TABLE]
That is, . Applying Lemma 6.2, we get that the vectors , for , form a basis for , as needed. ∎
7 Generic vs. General Position
This section completes the cycle of connections, proving that most (namely, generic) hyperplanes, and indeed most subspaces, are in general position (in the Lovász sense of Section 5) with respect to any given family of subspaces. The proof will make use the explicit description we established in the previous section for a basis to the intersection of a family of subspaces and a hyperplane. Thus, computing the ranks of the symbolic matrices in Theorems 1.4 and 1.5 are equivalent to computing the functions and respectively, which we can do efficiently by the algorithm of Section 4.
Theorem 7.1**.**
Let be a family of subspaces in , and assume that either or . Then the hyperplane is in general position (see Definition 5.1) with respect to for almost every . More precisely, over finite fields all but - fraction of hyperplanes are not in general position, and for infinite fields they have measure zero.
The proof of this theorem turns out to be more intricate than we imagined. We will give below a linear-algebraic proof that is valid for all fields . In the appendix we give an alternative, geometric proof which is valid for the field of Real numbers.
Proof.
Fix subsets . Our goal is to show that for
[TABLE]
either generically, or generically. Indeed, we will prove that one of these alternative holds for every , except for those that vanish on a certain nontrivial linear equation. Thus, if is finite, the fraction of such exceptional values of is . Since the number of choices of is finite, we see that if is large enough this probability remains negligible. Being a bit more careful, (see Remark 5.4 at the end of the proof of Theorem 5.2), there are at most applications of the “general position” definition, and so the fraction of “bad” is at most as stated.
It is easy to see that replacing by and by does not affect the subspace . We may therefore assume that each of the families contains a single subspace of .
Suppose that , that is, that there exists , with . Clearly, we have and the linear form not identically zero. Thus, for almost every , is not contained in and there is nothing to prove in this case. We may therefore assume that . In this case, after a change of basis of , we may assume that and , where and stand for the standard basis vectors in .
From now on we will regard as a vector of variables, and work in the field of fractions . In particular this makes all subspaces under consideration, , and of course now subspaces of (by taking the span of their bases in ).
With this, our task becomes proving the following about these subspaces:
Claim 7.2**.**
Either , or .
We will break this task to two. Clearly, it will suffice to prove the claim for any spanning set replacing . So first we will prove that we can take to be the affine functions (of ) in , and then we will prove the claim for .
Lemma 7.3**.**
* is spanned by its elements which are affine functions of .*
Proof of Lemma 7.3.
Recall that we showed, in Lemma 6.2, that has a basis consisting of elements of the form , for some . Write for a basis of of this form.
Having bases for and we can express all elements of as linear combinations of these bases. Thus, elements in are described by solutions , , to the following system of linear equations.
[TABLE]
where (resp., ) is the th entry of (resp., ).
By basic theory of linear algebra, there exists a set of solutions, each of the form
[TABLE]
where are rational functions in the entries of , that together span the subspace . Moreover, these rational functions are of degree at most .
We will now strive to find a simpler spanning set for , and then use it to prove Claim 7.2.
The first simplification is realizing (via common denominators) that without loss of generality we can assume that all are in fact polynomials in the entries of . These elements of span the rest, after dividing by some fixed polynomial.
The next simplification (separating out homogeneous terms) shows that without loss of generality we can take all the polynomials in each of to be homogeneous of the same degree, which we may respectively call . These homogeneous solutions certainly span , and now we refine their structure further.
Indeed, inspecting the system of equations we know more: since each entry of , for every is of degree one, we know that for some fixed integer , they must satisfy and . We use this to stratify solutions by degree, and say that the associated has degree . Let be all solutions of degree (note that each is a subspace over , though we will not use this fact). We call solutions of degree 0 linear. Our main simplification will come from showing that linear elements span , which in this notation is a restatement of the lemma we are proving.
Claim 7.4**.**
**
We will prove this claim by induction on , using our stratifications of members of . It is clearly true for . So assume spans , and we need to prove that spans . By induction, it suffices to prove that spans . The plan for this will be as follows. We will assume we have some . We will take all partial derivatives of its constituent polynomials with respect to each variable , . From each of these we will generate an element , as the degree decreased by 1. Finally, we will show that is a linear combination, indeed a very simple one, of the form : . We now elaborate.
Fix . Let us take a derivative with respect to the variable of , of both sides of the identity (24). We get
[TABLE]
[TABLE]
To define we first define by appropriately collecting homogeneous terms, and making sure that are of degree , and that and are of degree :
- •
- •
,
- •
For , is
[TABLE]
- •
For , is
[TABLE]
here we used to denote the th entry of a vector . Now we can formally define as follows. We first observe that
[TABLE]
Indeed, note that (24), restricted to the th component of the equation, implies that for every, , we have
[TABLE]
From this it is straightforward to verify that the identity (25) indeed holds. Thus, letting
[TABLE]
for each , the identity (25) implies that is in . Moreover, by our definition, is of degree .
It remains to prove that is spanned by the vectors . For this, one basic fact we will need is that if is any homogeneous polynomial of degree , it satisfies
[TABLE]
The second fact we will need follows from identity (24), when restricted to the th component of the equation. For every ,
[TABLE]
Combining these two properties, we get
- •
- •
and this implies that
[TABLE]
Note that ; indeed, for with non-zero characteristic, we have . Thus the vectors span . This completes the induction step, and hence the proof of Lemma 7.3. ∎
To complete the proof of the theorem we now prove
Lemma 7.5**.**
Either is not contained in , or it is contained in .
As the elements in are affine functions of , a violation of the first possibility will imply that satisfy a linear equation, so the fraction of such vectors is at most as requested.
Proof of Lemma 7.5.
We first introduce some notation. Let be a vector in , such that each entry of is some linear combination of , the coordinates of . Then can be represented by a matrix , with constant entries, such that . Note that if is skew-symmetric, this means that or , which means that , unless the characteristic of the field is . Conversely, if for every and so is the zero polynomial (in variables), which implies that is skew-symmetric.
Consider such matrices , representing vectors , respectively. Then a linear combination is a matrix that corresponds to a vector which is a linear combination of , namely, . Thus lies in the span of the vectors .
Assume first that . We regard a matrix as a block matrix with (resp., , , ) denoting the top-left (resp., top-right, bottom-left, bottom-right) blocks. More precisely, (resp., , , ) stands for the submatrix induced by taking the first (resp., first , last , last ) rows and first (resp., last , first , last ) columns of .
With some abuse of notation, we write , for a subspace of , if . Recall that is in if and only if is skew-symmetric. In particular, , for every . Assume that for some , we have (and thus also ). We claim that in this case there exists a matrix . To see this it is sufficient to show that there exist matrices and such that which is not skew-symmetric (and therefore not in ). Indeed, let be defined by , , and . We define the matrix by , , . Clearly, , and . If is skew-symmetric, then we must have , contradicting our assumption on . Thus is in but not in . We conclude that in this case the general position requirement holds generically.
Assume next that for every , we have . Recall that is spanned by matrices of the form for some . Assume that for such a matrix. We claim that in this case at least one of or is the zero matrix. Indeed, put , and assume that . The for some we have . In particular, not both and are zero. Assume, without loss of generality, that . That is, . Suppose that for every . In this case it is clear that and the claim is proved. Therefore, we may assume that for some we have . Since we , we have in particular , for every . In particular, . Note that since and , we must have that also . Thus, we get and . Combining these equalities, we get that , contradicting our assumption. This proves the claim.
This implies that is a direct sum of matrices with entries supported only on for and matrices supported by for .
Now let . By the definition of , can be written as for some , , . Write , where and . Similarly, write . Then , or . But then, we must have and , which in particular implies that .
Since and , this implies that, without loss of generality, we may assume and . Thus also . We conclude that for every . Thus the general position requirement holds in this case.
We now prove the remaining case where , by reducing it to the case just discussed. Write , for some . Repeat the above argument ignoring the last rows and last columns of every matrix used along the proof. Note that for , is skew-symmetric, and adding a matrix or will result with a matrix which is either in or not in , independent of the last rows and columns of . Indeed, for and these rows and columns are zero, and therefore they cannot affect the skew-symmetry of or . ∎
This completes the proof of Theorem 7.1. ∎
Having established the connection between genericity and general position, we can now complete the proof of Theorem 6.1.
Proof of Theorem 6.1..
Consider the family of subspaces , where , for each . Let and consider . In view of Lemma 6.2, we have
[TABLE]
On the other hand, by Theorem 5.2, we have . Thus there exists a deterministic strongly-polynomial time algorithm to compute . ∎
We note that in the exact same way, our ability to efficiently compute for every integer by Theorem 1.6, and the characterization above, completes the proof of Theorem 1.5 from the introduction.
Acknowledgements We would like to thank Ze’ev Dvir for many illuminating discussions. We thank Amir Shpilka and Roy Meshulam for useful comments on an earlier version of the paper. We also thank Jan Vondrak for telling us about Dilworth truncation.
Appendix: Proof of Theorem 7.1 over
Here we provide an alternative proof of Theorem 7.1 which works over the field of Real numbers. One advantage of working over is that we have the notions of a manifold and of the dimension of a manifold available. In the proof below, we use the fact that the set of linear subspaces of can be viewed as a manifold. Then, to show that a certain set has measure zero, it is sufficient to show that this set has lower dimension. This allows us to obtain a more straightforward proof for the case .
Proof over :.
We first prove that property (i) in Definition 5.1 is a generic propery. Fix and put . For with , we have . If , this means that lies in a lower-dimensional sphere, which is a measure-zero subset of . Since is finite (and so the number of different sub-families is finite), we conclude that for every , excluding a finite union of certain lower-dimensional sub-spheres of , satisfies property (i) in Definition 5.1.
We now prove that property (ii) in Definition 5.1 is a generic property. Fix some subfamilies . We first handle certain degenerate cases. Note that if
[TABLE]
for some , then clearly satisfies property (ii). Using Lemma 6.2, condition (26) defines an algebraic subvariety of . In particular, (26) either holds for every or holds only for taken from a subset of of measure zero. In the former case this means that, with respect to the subfamilies , property (ii) in Definition 5.1 holds for for every and there is nothing to prove. Therefore we can assume that we are in the complementary case. Namely, we assume that for almost every we have
[TABLE]
Our next step is to identify the set of subspaces of the form , for some , and determine its dimension as a subset of the Grassmannian.
We need the following observation. Let
[TABLE]
We claim that , for almost every . Indeed, by Lemma 6.2, one can write a basis for with entries that are linear combinations in the coordinates of . In particular, can be expressed as the rank of a certain symbolic matrix, with entries depending linearly in the coordinates of . This implies that for every , excluding some subset of of measure zero, which proves our claim. (Here we used the fact that the maximal rank of a given symbolic matrix is the same as the generic rank of the matrix.)
Let denote the subset of such that either or (26) holds for . As argued above has measure zero. Let denote the Grassmannian of -dimensional subspaces of , regarded as an affine variety. We define a map by
[TABLE]
We claim that the image of is -dimensional. Indeed, let and let . By definition of the domain of , we have and thus . This means has maximal dimension. Observe that this guarantees that, for every , we have . (Indeed, certainly implies that and since , we have equality.) That is, and, in paticular,
[TABLE]
(dimension here is as a manifold). We conclude that
[TABLE]
as claimed.
Next, define
[TABLE]
Our goal is to show that has measure zero, as a subset of the sphere. For this, it suffices to show that has measure zero (since is of measure zero). Consider the restriction of to . Let and let . Set
[TABLE]
Since , we have (27) which means
[TABLE]
Since we assume also that , we have . So
[TABLE]
Clearly we also have , and thus, using (28),
[TABLE]
Combining (29) and (30), we get that
[TABLE]
This completes the proof of the lemma.∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Agrawal, C Saha, R. Saptharishi, and N. Saxena, Jacobian hits circuits: Hitting sets, lower bounds for depth-d occur-k formulas and depth-3 transcendence degree-k circuits, SIAM J. Comput. 45.4 (2016), 1533–1562.
- 2[2] L. Asimow and B. Roth, The rigidity of graphs, Trans. Amer. Math. Soc. 245 (1978), 279–289.
- 3[3] P. M. Brooksbank, and E. M. Luks, Testing isomorphism of modules, J. Algebra 320.11 (2008), 4020–4029.
- 4[4] A. Chistov, G. Ivanyos, and M. Karpinski, Polynomial time algorithms for modules over finite dimensional algebras, Proceedings of the 1997 ACM International Symposium on Symbolic and Algebraic Computation (ISSAC) (1997), 68–74.
- 5[5] J. Edmonds, Systems of distinct representatives and linear algebra, J. Res. Natl. Bur. Stand. 71 (1967), 241–245.
- 6[6] S. Fenner, R. Gurjar, and T. Thierauf, Bipartite perfect matching is in quasi-NC. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing (STOC) (2016), 754–763.
- 7[7] A. Frank and É. Tardos, Generalized polymatroids and submodular flows, Mathematicl Programming 42 (1988), 489–563.
- 8[8] A. Garg, L. Gurvits, R. Oliveira, and A. Wigderson, Operator scaling: theory and applications, in ar Xiv:1511.03730 v 3 .
