Large homogeneous submatrices
D\'aniel Kor\'andi, J\'anos Pach, Istv\'an Tomon

TL;DR
This paper proves that large zero-one matrices avoiding a specific small pattern necessarily contain large homogeneous submatrices, and characterizes which patterns guarantee submatrices of size nearly linear in the original matrix.
Contribution
It provides a new structural result linking pattern avoidance in matrices to the existence of large homogeneous submatrices, with a near-complete classification of such patterns.
Findings
Matrices avoiding certain patterns contain large homogeneous submatrices.
Characterization of patterns guaranteeing near-linear size homogeneous submatrices.
Applications to chordal bipartite graphs and other combinatorial structures.
Abstract
A matrix is homogeneous if all of its entries are equal. Let be a zero-one matrix that is not homogeneous. We prove that if an zero-one matrix does not contain as a submatrix, then has an homogeneous submatrix for a suitable constant . We further provide an almost complete characterization of the matrices (missing only finitely many cases) such that forbidding in guarantees an homogeneous submatrix. We apply our results to chordal bipartite graphs, totally balanced matrices, halfplane-arrangements and string graphs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Large homogeneous submatrices
Dániel Korándi University of Oxford. Email: [email protected]. Research supported by SNSF Postdoc.Mobility fellowship P400P2-186686.
János Pach Rényi Institute, Budapest. Email: [email protected]. Research partially supported by the National Research, Development and Innovation Office, NKFIH, project KKP-133864, and the Austrian Science Fund (FWF), grant Z 342-N31.MIPT Moscow. Research partially supported by the Ministry of Educational and Science of the Russian Federation in the framework of MegaGrant No. 075-15-2019-1926.
István Tomon 33footnotemark: 3 ETH Zurich. Email: [email protected]. Research supported by SNSF grant 200021-175573.
Abstract
A matrix is homogeneous if all of its entries are equal. Let be a zero-one matrix that is not homogeneous. We prove that if an zero-one matrix does not contain as a submatrix, then has an homogeneous submatrix for a suitable constant . We further provide an almost complete characterization of the matrices (missing only finitely many cases) such that forbidding in guarantees an homogeneous submatrix. We apply our results to chordal bipartite graphs, totally balanced matrices, halfplane-arrangements and string graphs.
1 Introduction
Zero-one matrices play an important role in discrete mathematics, as they can be used to represent (bipartite) graphs, hypergraphs, systems of incidences, and many other binary relations. In such settings, the circumstances often force structural restrictions. In this paper, we analyze the structure of matrices that do not contain a given submatrix , and show that forbidding often forces a large all-0 or all-1 submatrix. With a slight abuse of notation, the letter appearing in different statements stands for unrelated positive constants.
A matrix is homogeneous if all of its entries are equal, and inhomogeneous otherwise. We will also say that a matrix contains another matrix if is a submatrix of , and that is -free if is not a submatrix of . Our first result shows that if is a matrix whose entries are not all 0 or all 1, then every -free zero-one matrix contains a linear-size homogeneous submatrix.
Theorem 1.1**.**
Let be an inhomogeneous zero-one matrix. Then every -free zero-one matrix contains a homogeneous submatrix, for a suitable constant .
As we will see below, this result does not hold when is the all-0 or the all-1 matrix. We can, however, extend Theorem 1.1 to matrices by making a small sacrifice on the size of the homogeneous submatrix.
Theorem 1.2**.**
Let be a zero-one matrix that does not contain a homogeneous submatrix. Then every -free zero-one matrix contains a homogeneous submatrix, for a suitable constant .
Of course, one can obtain an analogous result for matrices by working with the transposes. In particular, we find a homogeneous submatrix for any or matrix with no homogeneous submatrix. Moreover, every matrix can be extended to such a matrix, so this also holds for and matrices.
We should also point out that, as permuting rows or columns does not affect homogeneous submatrices, the same results hold if we only assume that can be made -free by reordering its rows and columns.
Our problem arises naturally in numerous combinatorial and geometrical settings. When represents a bipartite graph, corresponds to a forbidden induced (ordered) subgraph, and a homogeneous submatrix is a complete or empty bipartite subgraph. When represents an incidence relation in geometry, is often a geometrically impossible pattern, and a homogeneous submatrix corresponds to two completely intersecting or disjoint families. When is the incidence matrix of a hypergraph, a homogeneous submatrix gives a set of hyperedges and a completely disjoint or completely contained set of vertices. We list a few specific applications to chordal bipartite graphs, totally balanced matrices, halfplane-arrangements and string graphs in Section 9.
Several closely related problems have been studied in the literature, including certain Erdős-Hajnal type questions and the Turán problem for ordered graphs and forbidden patterns. We discuss some connections and differences in Section 10.
As mentioned above, it is not true that forbidding any submatrix forces an almost linear-size homogeneous submatrix.
Definition 1.3**.**
A zero-one matrix is called acyclic if every submatrix of has a row or column containing at most one 1-entry. The complement of is the matrix obtained from by replacing the 1-entries with 0s and the 0-entries with 1s. We say that is simple if both and are acyclic.
It is easy to see that is acyclic if and only if the bipartite graph with biadjacency matrix is acyclic. If is not simple, then there are -free zero-one matrices with only small homogeneous submatrices. The (fairly standard) probabilistic construction will be given in Section 3.
Proposition 1.4**.**
Let be a zero-one matrix. If is not simple, then there is a -free zero-one matrix with no homogeneous submatrix for every large enough , where is a positive constant.
Proposition 1.4 shows that Theorems 1.1 and 1.2 are optimal in terms of the matrices covered: the remaining or matrices are not simple, so these statements cannot hold for them. In fact, Theorems 1.2 and 1.4 almost completely characterize which forbidden matrices force an almost-linear homogeneous submatrix, because they only miss a finite number of simple matrices. Indeed, a simple matrix can contain at most 0-entries and at most 1-entries, so it must satisfy , or, equivalently, . So, apart from the matrices treated in Theorem 1.2, only , and matrices can be simple.
We believe that a similar statement should hold for the remaining simple matrices, as well. In fact, we make the following stronger conjecture.
Conjecture 1.5**.**
Let be a simple zero-one matrix. Then every -free zero-one matrix contains an homogeneous submatrix, for a suitable constant .
Much of the difficulty in our results comes from the ordered structure of matrices. We can obtain better results if we relax the notion of matrices to “unordered” matrices, where the order of the rows and the columns does not matter. We can then say that a zero-one matrix is unordered -free if it does not contain any submatrix whose rows and columns can be permuted to obtain . We show that Theorem 1.2 holds for unordered matrices.
Theorem 1.6**.**
Let be a zero-one matrix that does not contain a homogeneous submatrix. Then every unordered -free zero-one matrix contains a homogeneous submatrix, for a suitable constant .
Results about unordered matrices can be thought of as results about bipartite graphs. In the language of graphs, Theorem 1.6 implies the following statement: let be a star of size and a star of size glued together at one of their leaves, and let be the union of and an isolated vertex. Let be an induced subgraph of , and let be an induced -free bipartite graph with . Then there are linear-size subsets and such that induces either a complete or an empty bipartite graph in . This latter statement has been proved independently by Axenovich, Tompkins, and Weber [2].
Our paper is organized as follows. In Section 2, we state a number of further results, which imply Theorems 1.1 and 1.2, but might be of interest on their own. The proof of these (positive) results are given in Sections 4, 5, 6 and 7. Our negative result, Proposition 1.4, is proved in Section 3. Finally, we prove Theorem 1.6 in Section 8.
We finish the paper with a few applications in Section 9 and some further connections and remarks in Section 10.
2 Forbidden submatrices
Our first result is about matrices that contain a 0-entry and a 1-entry in every column, establishing Theorems 1.1 and 1.2 in this special case.
Theorem 2.1**.**
Let be a zero-one matrix without any homogeneous column. Then every -free zero-one matrix contains a homogeneous submatrix, for a suitable constant .
As rotation and taking complements does not affect the problem, there is essentially one simple matrix not covered by this theorem: . When cannot be rotated into a matrix without homogeneous columns, the problem becomes more difficult. is the only such matrix where we can prove a linear lower bound.
Theorem 2.2**.**
Let . Then every -free zero-one matrix contains an homogeneous submatrix, for a suitable constant .
In the general case, we can show the following, somewhat weaker result.
Theorem 2.3**.**
Let be a simple zero-one matrix. Then every -free zero-one matrix contains an homogeneous submatrix, for a suitable constant .
Theorem 1.1 then follows from Theorems 2.1 and 2.2, and Theorem 1.2 is equivalent to Theorem 2.3. Theorems 2.1, 2.2 and 2.3 are proved in Sections 4, 5, and 6, respectively.
Note that Conjecture 1.5 is invariant under taking complements: the complement of a -free zero-one matrix is -free, and if is simple, then so is . We may therefore assume that 0 is the majority entry in , and then try to find a large all-0 submatrix in it. Indeed, this is the approach we take to prove Theorems 2.1, 2.2 and 2.3. More generally, we believe that the following strengthening of Conjecture 1.5 might also be true.
Conjecture 2.4**.**
Let be an acyclic zero-one matrix. Then for every , there is a , such that every -free zero-one matrix with at least 0-entries contains a all-0 submatrix.
Another immediate corollary of this conjecture would be the following:
Conjecture 2.5**.**
Let be an acyclic zero-one matrix. Then every zero-one matrix that is both -free and -free contains an homogeneous submatrix, for a suitable constant .
We can prove these conjectures in the special case when has no column with more than one 1-entries.
Theorem 2.6**.**
Let be a zero-one matrix such that every column of has at most one 1-entry. Then every zero-one matrix that is both -free and -free contains a homogeneous submatrix, for some .
Note that Theorem 2.1 can also be obtained, with slightly weaker constants, as a corollary of this result (by applying Theorem 2.6 to the concatenation of and ). The proof can be found in Section 7.
3 Notation, preliminaries–Proof of Proposition 1.4
Throughout this paper, we use the following notation. When is a matrix, denotes the entry in the ’th row and ’th column. Sometimes we make no distinction between rows and their indices, and refer to the ’th row as “row ” (and, in a similar manner, for columns). We denote the submatrix in the intersection of rows and columns (the submatrix induced by and ) by .
We use two natural correspondences between zero-one matrices and graphs. The biadjacency matrix of a bipartite graph is the zero-one matrix whose rows are indexed by , columns are indexed by , and the entry is 1 for and if and only if . The incidence matrix of a graph is the zero-one matrix whose rows are indexed by , columns are indexed by , and the entry is 1 if and only if is incident to .
For two subsets , we write to denote that for every . When , we may simply write . We systematically omit floor and ceiling signs whenever they are not essential.
We start by proving that only acyclic forbidden matrices can force large all-0 submatrices. This also shows that Conjecture 2.4 can only hold for acyclic .
Proposition 3.1**.**
Let be a zero-one matrix. If is not acyclic, then there is a -free zero-one matrix with at least 0-entries, but no homogeneous submatrices for every large enough , where is a positive constant.
Proof.
We may assume that every row and column of contains at least two 1-entries, as otherwise we can replace with a submatrix. In particular, we have , and contains at least 1-entries.
Let be a random matrix, where each entry is independently set to 1 with probability , and set to 0 otherwise. First of all, note that the expected number of 1-entries is if is large enough, so the probability that has more than 1-entries is at most . Also, the expected number of submatrices identical to in is at most . So with probability at least , there are at most such submatrices. Finally, the probability that contains a homogeneous matrix is at most
[TABLE]
if , which holds for whenever and is large enough. So there is an matrix that contains at most submatrices identical to and no homogeneous submatrix. Then we can delete columns to obtain the -free matrix we were looking for. ∎
Proof of Proposition 1.4.
Let us apply Proposition 3.1 to or (whichever is not acyclic) to get with no homogeneous submatrix. Then or (whichever is -free) will work. ∎
Definition 3.2**.**
We say that a zero-one matrix is -good if for all , every -free matrix with at least 0-entries contains a all-0 submatrix.
By convention, every matrix contains the all-0 submatrix, so every is -good for every . We prove our main results by showing that certain matrices are -good for some . Let us start with a simple case.
Proposition 3.3**.**
The all-1 matrix is -good.
Proof.
Without assuming anything about the density, we can find an all-0 matrix in any rows of an -free matrix. Indeed, as every row contains at most 1-entries, any rows induce at least columns with only 0-entries. ∎
Of course, if a matrix is -good, then it is also -good for any . The next lemma shows that adding an all-0 row or column at a border of a matrix does not change goodness.
Lemma 3.4**.**
Let be a zero-one matrix, and let be the matrix obtained from by appending a new last column of 0-entries. If is -good for some , then is -good.
Proof.
Let be a -free matrix with at least 0-entries. We will find a dense submatrix with an all-0 last column, and then apply the goodness property of to get the large homogeneous submatrix.
Define as the matrix obtained from by replacing the first 0-entries of each row by 1-entries (if a row has fewer than 0-entries, then it becomes a row with all 1’s). Then has at least 0-entries, so it must contain a column with at least 0-entries. If column is such a column, let be a set of rows with a 0-entry in the ’th column, and let be the first columns. By the definition of , every row of has at least 0-entries, so in total, contains at least 0-entries. Now let be the columns with the most 0-entries in them. Then is an matrix with at least 0-entries.
Note that is -free, since we could otherwise add 0’s in the ’th column to get a copy of in . As is -good, must contain a all-0 submatrix. ∎
4 Matrices with no homogeneous columns–Proof of Theorem 2.1
In this section, we prove Theorem 2.1 by showing that every matrix with no homogeneous columns satisfies Conjecture 2.4. We first prove this for a special class of “checkerboard” matrices. Let denote the matrix defined by if is even, and otherwise. The main concern of this section is to establish that for every , there is a such that is -good. The general case will follow easily by observing that every matrix with no homogeneous columns is a submatrix of .
Note that is the concatenation of copies of . We first consider -free families.
Lemma 4.1**.**
Let , and suppose that is an zero-one matrix with at least 0-entries. Then at least one of the following statements holds.
* contains an all-0 submatrix.* 2. 2.
At least different pairs of rows of contain as a submatrix.
Proof.
Let . First, we find a submatrix of such that its first row and column contain only 0’s, moreover, each of these 0-entries is preceded by other 0-entries in their rows and columns in .
Let be the matrix obtained from by replacing the first 0-entries of each row and column with 1-entries. As at most 0-entries are lost, still has at least 0-entries. Now let be the matrix obtained from by replacing the last 0-entries of each row and column with 1-entries. By the same argument, has at least 0-entries.
Take a 0-entry in , say in the ’th row and ’th column. By the definition of , we must have a set of columns such that the ’th row of contains a 0 in these columns, and similarly, there we must have a set of rows such that the ’th column of contains a 0 in these rows. So, the submatrix is all-0 in its first row and column. Also, by the definition of , each row contains 0-entries in in some columns preceding the columns of , and similarly, each column contains 0-entries in some rows .
If has rows without a 1-entry, then it contains a all-0 submatrix, establishing 1. Hence, we may assume that at least rows in contain a 1-entry.
Let be such that , and look at the submatrix . Again, if this has rows without a 1-entry, then contains a all-0 submatrix, and we are done. Otherwise, there are 1-entries in different rows of . However, if for some , the entry is 1, then . For any choice of , there is such an in different rows, so we find in at least different row pairs, establishing 2. ∎
Lemma 4.2**.**
For every , is -good.
Proof.
Suppose is a -free zero-one matrix. Let , and divide into blocks , where is the interval , for every . We say that is heavy if contains at least 0-entries. If denotes the number of heavy pairs, then we can bound the number of 0-entries in as follows:
[TABLE]
Consequently, .
This means that for some , there is a set of at least indices such that is heavy for every . Let be the set of pairs such that rows and in together contain . If is heavy, then by Lemma 4.1 (applied with parameters and ), either , or contains an all-0 submatrix. In the latter case, we are done, so we may assume the former holds for every . Now
[TABLE]
implies that some pair is contained in at least of the sets , say in . Then is a submatrix of the union of the matrices in the rows indexed by and , which is a contradiction. ∎
Proof of Theorem 2.1.
Every matrix with no homogeneous columns is contained in , so if a matrix is -free, then it is also -free.111Note that this observation combined with Lemma 4.2 also implies that every such is -good. Similarly, every -free matrix is -free, because also has no homogeneous columns.
If is -free, then is -free, so both and are -free. One of and will contain at least 0-entries, so we can apply Lemma 4.2 with to find an homogeneous submatrix in . ∎
Let , that is, is the largest such that for every , every -free matrix with 0-entries contains a all-[math] matrix. One might wonder what the order of is. Lemma 4.1 shows that (the upper bound is trivial), while Lemma 4.2 implies for . It might seem reasonable to conjecture that also holds for . However, this is not true, already for : Füredi and Hajnal [18] proved that for every positive integer , there is an matrix such that does not contain either of and as a submatrix (where can be either [math] or ), but contains 0-entries, where is the slowly growing inverse Ackermann function. For and every , we can construct the matrix by replacing each -entry of with an all-1 matrix, and each [math]-entry of with an all-0 matrix. Then is -free, it has at least 0-entries, but it does not contain any all-0 submatrix with more than rows and columns. As , we have .
It would be interesting to determine the true order of magnitude of . We believe the answer should be closer to the upper bound .
5 The matrix with one 1 in the corner–Proof of Theorem 2.2
In this section, we establish Theorem 2.2. As before, we achieve this by showing a density result: we prove that both and its complement satisfy Conjecture 2.4.
More generally, let be the the matrix such that for , and all other entries are 0. For example, , and . Proposition 3.3 and Lemma 3.4 easily imply that is -good for every . In this case, we can actually gain a factor of :
Lemma 5.1**.**
* is -good for every .*
Proof.
Let be a -free matrix with at least 0-entries, and let be the matrix obtained from by replacing the first 0-entries in each row and column with 1’s. It is easy to see that fewer than entries were changed, so for some . By the definition of , we then have sets of size such that and , and for every and , .
Now is an matrix, and as is -free, it does not contain a all-1 submatrix. Then, by Proposition 3.3, it has an all-0 submatrix. ∎
The difficult part is to show that for every , is also -good for some . We prove this in the next lemma.
Lemma 5.2**.**
Let be an zero-one matrix with at least 1-entries. If does not contain , then it has an all-1 submatrix.
Proof.
For an index , let denote the submatrix formed by the first columns of and let denote the submatrix of the last columns. Then for some , both and contain at least 1-entries. Note that this implies, in particular, that both and have at least columns. Also, has at least rows containing at least 1-entries. Indeed, otherwise would contain fewer than 1-entries in total, which is not the case. Let be the submatrix of consisting of such rows, and let be an submatrix of the same rows in .
Now let us define the graph on the 0-entries of as vertices, where we connect two 0-entries by an edge if they are in the same row or the same column of . For a vertex in , we define and as the row and column of , respectively. We say that a path in is row-monotone if . This notion is motivated by the following claim.
Claim 5.3**.**
Let be a vertex of , and let be the set of vertices that can be reached from via a row-monotone path. Then contains a all-1 submatrix, where is the number of different rows of .
Proof.
Let be a vertex that can be reached from via a row-monotone path , where and . We are going to show that if for some column of , then , as well. In fact, we will show for every , by induction.
Assume this holds for some (the case is trivial). If , then there is nothing to prove. Otherwise, and by the definition of the path. Let us look at the submatrix . The entries in the second column are 0 by the definition of , and the top left entry is 1 by assumption. But this submatrix cannot be , so the bottom left entry must also be 1, as needed.
This shows that in , the rows of have 1-entries wherever does. The row , like every row of , contains at least 1-entries, so the rows of together produce a all-1 submatrix. ∎
Claim 5.3 shows that it would be enough to find a vertex in that sends monotone paths to at least different rows. The next claim shows that each connected component of has a vertex that reaches the whole component via monotone paths.
Claim 5.4**.**
Let be a connected component of and let be a vertex such that is smallest. Then for every vertex , there is a row-monotone path from to .
Proof.
Let be a - walk in that minimizes . We will show that is a row-monotone path. First, we establish the following simple properties for every such minimal path:
has no three collinear vertices, i.e., there is no such that or . 2. 2.
There is no “bottom right corner” in , i.e., there is no such that and , and there is no with and .
The first property is clear: we would get a better - walk by simply deleting from . For the second property, suppose there is an satisfying and , and look at the submatrix . Using and , we see that contains all entries of this submatrix, except for the top left entry. All vertices in are 0-entries, so , as well, for otherwise . Then we could replace with the vertex corresponding to this top left entry, and get a new with smaller . The other case of property 2 can be proved analogously.
Now let be the smallest index such that . By the definition of , we have . We can show by induction that from on, alternately moves downwards and to the right. Indeed, property 1 shows that the path changes direction after each edge. Now suppose that at some point it moves downwards, i.e., (as is the case for ). Then according to property 2, we cannot move towards the left, so we must have . On the other hand, if the path moves to the right, i.e., , then the second case of property 2 forbids a move upwards in the next step, so we must have .
This means that the row coordinates never decrease along , so it is indeed a row-monotone - walk. In fact, it is a path because of its minimality. ∎
Now if a component of has vertices in at least rows, then Claims 5.4 and 5.3 together imply that contains an all-1 submatrix, as needed. The next claim shows that if there is no such component, then we can find a large all-1 submatrix in , without even forbidding .
Claim 5.5**.**
Suppose no component of has vertices in different rows. Then contains an all-1 submatrix.
Proof.
Let be the components of , and let and be the row and column sets of . Note that all the 0-entries of in rows or columns are inside .
Swapping rows and columns does not affect our statement, so let us reorder the rows and columns of so that are the first rows, followed by the rows , etc., and similarly for columns. This way we get a block-diagonal matrix with blocks , where each block has height less than and all the 0-entries are inside the blocks.
Consider the block that touches the ’th (essentially the middle) column of . If no such block exists, then the right half of is an all-1 submatrix, so we are done. We know that has fewer than rows, so this block cannot contain entries from both the ’th and the ’th rows of . If it is disjoint from the ’th row, then there is an all-1 submatrix in the top right corner of . Otherwise, we find such a submatrix in the bottom left corner of . ∎
This completes the proof of Lemma 5.2. ∎
Proof of Theorem 2.2.
Let be an -free zero-one matrix. If contains at least 0-entries, then by Lemma 5.1, it has an all-0 submatrix. Otherwise, contains at least 1-entries, so we can apply Lemma 5.2 to find an all-1 submatrix in . ∎
The above proof breaks completely if instead of we forbid an arbitrary simple matrix, although most of it (including a weakening of Claim 5.3) is salvageable in the special case when we forbid . Unfortunately, Claim 5.4 is false even in this case, and we do not see any meaningful way to circumvent it. The best we can do for -free matrices is to find a homogeneous submatrix of size using the methods of Section 6.
6 General matrices–Proof of Theorem 2.3
In this section, we prove Theorem 2.3 with the help of partial orders. A comparability graph is a graph whose edges correspond to comparable pairs in some partial order on . The key idea in our proof is to introduce partial orders on the rows of using the forbidden submatrix. To find the homogeneous submatrices, we need to analyze complete bipartite subgraphs in the comparability graphs and their complement. Our bound on the size of the homogeneous submatrix comes from the following result of Fox and Pach [14].
Theorem 6.1** (Fox, Pach).**
Let be the union of comparability graphs on the same vertices. Then either one of the graphs or the complement of contains a complete bipartite graph with parts of size .
For simplicity, we write . We show that if is acyclic, then we can find an all-0 matrix of almost linear size in any -free zero-one matrix, where the density of [math]-entries is positive.
Lemma 6.2**.**
Let be an acyclic zero-one matrix. For every , there is a such that every -free zero-one matrix with at least 0-entries contains an all-0 submatrix.
Proof.
Let . We will start with some preprocessing on to find a large submatrix with “nice” all-0 columns, such that every row contains many 0-entries between any two nice columns.
Let us call a -tuple nice for a row if , for every , and there are at least 0-entries in A\big{[}\{r\}\times[c_{i}+1,c_{i+1}]\big{]} for .
If the ’th row of contains at least 0-entries, then there are at least nice -tuples for . Indeed, if the columns of the 0-entries in the ’th row are , then every -tuple is a nice -tuple for , whenever
The number of rows with at least 0-entries is at least . Hence, there are at least -tuples in total (with multiplicities) that are nice for some row. As the number of different -tuples in is less than , some -tuple is nice for at least rows . Let be a set of such rows, and let be the interval for . Then each row of every matrix contains at least 0-entries, and the last column of every is all-0.
For every , define the graph on vertex set as follows. We join and in by an edge if the submatrix of induced by rows does not contain the ’th column of . As is -free, must be the complete graph on .
Let us make some observations about these graphs. First of all, if the ’th column of is all-0, then is empty because has an all-0 column. Note also that can have at most one all-1 column, otherwise it would not be acyclic. Finally (and crucially), if the ’th column of is not homogeneous, then is a comparability graph. Indeed, suppose that the ’th column of is . For a row , let be the set of columns such that . Then for , where , we have that and are joined by an edge in if and only if . As the relation is easily seen to be a poset, is indeed a comparability graph. A similar argument works if the ’th column of is .
Let be the set of inhomogeneous columns in , and let . By Theorem 6.1, either some or the complement of contains a complete bipartite graph with parts of size . First suppose that contains for some . We may assume by symmetry that the ’th column of is . Let be the first row in that appears in this . Then is adjacent to a set of rows below it, and for every . Recall that by the construction of , so is an all-0 submatrix of , as needed.
Now suppose that the complement of contains . As is empty for all-0 columns of and is the complete graph on , must have an all-1 column , and the must be a subgraph of . Let be the two vertex classes of this . By the definition of , each column of contains a 1-entry in at most one of and . As has at least columns, one of or contains at least all-0 columns, so has an all-0 submatrix, finishing the proof. ∎
Proof of Theorem 2.3.
Let be an -free matrix. As is simple, both and are acyclic. So, if has at least 0-entries, we can apply Lemma 6.2 to with and . Otherwise, we can apply the lemma to with and . Either way, we find an homogeneous submatrix in . ∎
Note that any improvement in Theorem 6.1 would also improve our theorem. However, this alone will not be sufficient to find a linear-size homogeneous submatrix. Indeed, as was shown recently by Korándi and Tomon [24], the size of the bipartite graph in Theorem 6.1 cannot be replaced by anything larger than .
On the other hand, one can find slightly larger all-0 submatrices in Lemma 6.2 by reducing the number of partial orders we use. For example, we may assume that in the proof has size at most , as otherwise there are no homogeneous columns in and we can apply Theorem 2.1. This immediately guarantees an homogeneous submatrix.
It is also enough to use just one matrix (and comparability graph ) for consecutive columns of if they are the same. For example, if consecutive columns equal , then one can take to be the comparability graph where two rows are joined by an edge if , and use it to embed all columns in . With this argument, one can find a homogeneous submatrix in any -free zero-one matrix.
7 Matrices without two ones in a column–Proof of Theorem 2.6
In this section, we prove Theorem 2.6. The main part of our proof is to prove a weaker variant of Conjecture 2.4 for zero-one matrices with no more than one 1-entry per column. Namely, we show that for some , every -free matrix with at least 0-entries contains an all-0 submatrix. This will be enough to obtain Theorem 2.6 when is very dense or very sparse in terms of 0-entries. For the range in between, we will use the following result of Alon, Fischer, and Newman [1].
Lemma 7.1** (Alon, Fischer, Newman).**
Let be a zero-one matrix. For every there is a such that every -free zero-one matrix has a submatrix that has either at most or at least 0-entries.
Lemma 7.1 is stated in [1, Lemma 1.6] in a much stronger form in a “removal lemma”-type setting, with strong quantitative bounds on . However, this weak corollary already serves our purposes. Also, let us remark that in the graph world, this lemma corresponds to the well known result of Rödl [33] that for any graph , induced -free graphs cannot have a uniform edge distribution.
Lemma 7.2**.**
Let be a zero-one matrix such that no column of contains more than one 1-entry. Then there is an such that every -free zero-one matrix with at least 0-entries has an all-0 submatrix.
Proof.
Suppose has rows and columns. Let be the identity matrix, and let be the matrix that is the concatenation of copies of , i.e., for every and , and all other entries of are 0. It is easy to see that contains every matrix with at most one 1-entry per column as a submatrix.222In fact, they are already contained in the first rows of . We use for the sake of a simpler presentation. In particular, every -free matrix is also -free, so it is enough to prove our theorem for instead of .
Let , and . We will show that if contains at least 0-entries but does not have an all-0 submatrix, then contains as a submatrix.
Let us split the first rows of into submatrices A_{i}=A\big{[}[m]\times[(i-1)m+1,im]\big{]} for . Let be the family of -element sets such that contains a copy of in the rows indexed by .
Claim 7.3**.**
* for every .*
Proof.
Let us consider the matrices
[TABLE]
for every . Then are submatrices along the diagonal of .
As , we know that does not have any all-0 submatrix. This easily implies that in each , there are at least 1-entries such that no two share a row or a column. Let be the set of coordinates of these 1-entries.
Let us pick an element for every (so one 1-entry from each ), and consider the submatrix . There are such submatrices . Also, has 1-entries in the diagonal, so , unless there is another 1-entry in . However, each such 1-entry of can appear in at most matrices , because it fixes the choice of for two ’s: if , then the 1-entry at can only appear in matrices for which for and for . As there are at most 1-entries in , we are left with at least
[TABLE]
choices where . ∎
Suppose that does not contain as a submatrix. Then every -element set can appear in at most of the sets . Indeed, if , then contains as a submatrix in the rows induced by .
Together with Claim 7.3, this gives
[TABLE]
This contradicts our choice of . ∎
Proof of Theorem 2.6.
By Lemma 7.2, there is an such that any -free zero-one matrix with at least 0-entries contains an all-0 submatrix. We can apply Lemma 7.1 with this to get some such that any -free zero-one matrix has a submatrix with at least entries that are all 0 or all 1.
Let be an zero-one matrix that is both -free and -free, and let be the submatrix with at least equal entries. If these entries are all 0, then contains an all-0 submatrix because it is -free. Otherwise, is a -free matrix with at least 0-entries, so contains an all-1 submatrix. ∎
8 Unordered matrices–Proof of Theorem 1.6
In this section, we prove Theorem 1.6. Again, we show that if an unordered -free matrix has a positive density of 0-entries, then it contains a linear-size all-0 submatrix.
Lemma 8.1**.**
Let be a simple zero-one matrix. Then every unordered -free zero-one matrix with at least 0-entries contains an all-0 submatrix.
Proof.
Let be the matrix whose first columns are , the next columns are , the ’st column is , and the last column is . Then contains an ordering of the columns of , so it is enough to prove our result for instead of .
Let be the matrix obtained from by deleting the rows with fewer than 0-entries. At most 0’s are deleted, so contains at least 0-entries. Hence, one can find a column in with 0 entries. Let be the submatrix of induced by the rows of these 0-entries.
By permuting rows and columns if necessary, we may assume that these 0-entries form an all-0 last column in , and that the rows of are in increasing order according to the number of 0-entries in them.
For , let denote the set of indices such that . Define the directed graph on vertex set by adding as an edge if and . Then is an acyclic directed graph.
Note that if is not an edge of for some , then we must have . Indeed, if , then . We also have , which further implies because . Therefore, if is a -element subset of , and is a -element subset of , then is a reordering of , contradicting our assumption.
For a set , we denote its complement by . Let be the set of minimal vertices in , that is, the set of vertices such that no edge points towards . Then the sets are pairwise disjoint for . Every element can be reached from a minimal vertex via a directed path. Let us assign each to the one such vertex in with the smallest label in .
Now we will show that there is a subset such that and at least of the elements in are assigned to vertices in . Note that by the construction of , we have and hence for every . If contains a vertex that is assigned to more than elements of , then we are done, as we can take . So we may assume that there is no such vertex. Starting with , add vertices of one by one to until the number of elements assigned to the vertices in is at least . At this point, the number of elements assigned to is between and . If , then set , otherwise, set . As , we must have . The number of elements assigned to an element of is at least in both cases.
Now let be the elements of assigned to , so . Also, for , we have . We show by induction on that . If , then is a minimal element, so , and we are done. Now suppose that . If , then , so
[TABLE]
and we are done. If , then must contain an edge for some . Indeed, if is assigned to , then all other vertices on a - directed path are assigned to , as well. Now we can use , and hence , to get
[TABLE]
Fix (for , take ). Then , so the submatrix of induced by the rows and columns is an all-0 matrix with at least rows, and at least columns. This finishes the proof. ∎
Proof of Theorem 1.6.
If has at least 0-entries, we can find a all-0 submatrix in by the previous lemma. Otherwise, we can apply Lemma 8.1 to to show that contains a all-1 submatrix. ∎
Lemma 8.1 shows that there is a genuine difference between the ordered and unordered case of our problem. Indeed, this result shows that 0-entries in an unordered -free matrix guarantee an all-0 submatrix. However, as we discussed at the end of Section 4, this is not true for every matrix in the ordered setting: there are -free matrices with 0-entries that do not have any all-0 submatrix of size .
By a result of Füredi [17], there is an matrix with 0-entries that does not contain (where can be either 1 or 0). With the same methods as before, we can use this to construct matrices with 0-entries that do not contain and have no all-0 submatrices of size .
9 Applications
Several matrix classes can be described by a finite set of forbidden submatrices (see, e.g., [22]), and our results show that in many cases they contain large homogeneous submatrices. We give three specific applications.
9.1 Chordal bipartite graphs and totally balanced matrices
A zero-one matrix is totally balanced if it does not contain any submatrix, whose columns are different and which has exactly two 1-entries in each of its rows and columns. In other words, none of its submatrices is the incidence matrix of a cycle of length at least 3.
Totally balanced matrices (first studied by Lovász [26] in connection with a hypergraph coloring problem) are well-examined objects in combinatorial optimization. Their importance comes from the fact that integer programs with totally balanced coefficient matrices can be easily solved. Indeed, the optimization problem can be solved greedily if the coefficient matrix does not contain as a submatrix, and as was shown in [5, 20, 28], a matrix is totally balanced if and only if its rows and columns can be rearranged to get a -free matrix. (For more on optimization properties of balanced matrices, see the book of Berge [7].) As rearranging rows and columns does not affect homogeneous submatrices, Theorem 2.2 shows that totally balanced matrices have large homogeneous submatrices.
Corollary 9.1**.**
Every totally balanced matrix contains an homogeneous submatrix with some .
A chordal bipartite graph is a bipartite graph with no induced cycle of length greater than 4. This class of graphs was introduced by Golumbic and Goss [19] as a bipartite analog to chordal graphs, with similar perfect elimination properties. Clearly, a bipartite graph is chordal if and only if its adjacency matrix is totally balanced. This immediately implies the following.
Corollary 9.2**.**
Every chordal bipartite graph with parts of size contains sets and of size , for some constant , such that is either empty or complete.
9.2 The Erdős-Hajnal conjecture and intersection graphs
A family of graphs is said to have the Erdős-Hajnal property, if there is a constant such that each member contains a clique or an independent set on at least vertices. The family has the strong Erdős-Hajnal property, if there is a constant such that every satisfies that either or its complement contains a complete bipartite graph with parts of size . By a result of Alon, Pach, Pinchasi, Radoičić and Sharir [3], the strong Erdős-Hajnal property implies the Erdős-Hajnal property in hereditary families. The famous Erdős-Hajnal conjecture [12, 13] asserts the following.
Conjecture 9.3** (Erdős, Hajnal).**
For every graph , the family of graphs not containing an induced copy of has the Erdős-Hajnal property.
This conjecture has attracted significant attention in the past decades, but is still wide open. For history and relevant results, we refer the reader to the survey of Chudnovsky [10].
The intersection graph of a family of sets is the graph with vertex set , where two vertices are joined by an edge if their intersection is nonempty. A curve in the plane is the image of an injective continuous function . In this paper, we assume that curves in our collections only meet at proper crossings, that is, if two curves and share a point in common, then passes to the other side of at this point. A string graph is a graph that is isomorphic to the intersection graph of a family of curves.
In a very recent paper, Tomon [35] showed that the family of string graphs has the Erdős-Hajnal property. However, this family does not satisfy the strong Erdős-Hajnal property [32], although Fox and Pach [15] proved that one can always find a complete bipartite graph of almost linear size in every string graph or its complement.
Theorem 9.4** (Fox, Pach).**
Let be a string graph on vertices. Then either contains with , or the complement of contains with .
A collection of curves is -intersecting, if any two curves in the collection intersect in at most points. Fox, Pach and Tóth [16] showed that the family of intersection graphs of -intersecting curves does have the strong Erdős-Hajnal property.
Theorem 9.5** (Fox, Pach, Tóth).**
For every positive integer , there is a constant such that the following holds. Let be the intersection graph of a -intersecting family of curves. Then either or its complement contains a complete bipartite graph of size .
Here, we are interested in a bipartite version of this problem. That is, given two families of curves, and , we would like to find large subfamilies, and , such that , and either every curve in intersects every curve in , or every curve in is disjoint from every curve in .
In general, we cannot hope for any bound on beating the Ramsey bound . Indeed, the complement of every comparability graph is a string graph [27, 32], therefore the complement of any bipartite graph is a string graph. Nevertheless, the question remains meaningful if we restrict ourselves to -intersecting collections of curves.
In fact, we believe that the condition that is -intersecting can be weakened to only requiring that and themselves are -intersecting.
Conjecture 9.6**.**
For every there is a constant such that the following holds. Let and be two families of curves each such that and are -intersecting. Then there are subfamilies and such that , and either every intersects every , or every is disjoint from every .
In some sense, this is the weakest condition one can impose on and to force any meaningful properties. Indeed, the complement of any bipartite graph can be realized as the intersection graph of a collection of curves , where is -intersecting, and any two curves and intersect in at most 2 points (but is not -intersecting for any bounded ), see [31].
A natural special case of the conjecture is when the curves are 0-1 curves. Here, a 0-1 curve is the drawing of a continuous function in . As a first step towards Conjecture 9.6, we prove the following statement.
Theorem 9.7**.**
Let and be two families of 0-1 curves each. If is -intersecting, and is 1-intersecting, then there are subfamilies and such that , and either every intersects every , or every is disjoint from every .
Proof.
By slightly perturbing our curves, we can assume that no 3 curves in go through the same point, and no two of them intersect the lines and in the same point. For two curves , let if intersects the vertical line below .
First, we claim that there are subfamilies and such that , and either for every , or for every . Indeed, in the total ordering defined by , pick the smallest element such that either elements of are , or elements of are . In the first case, set and let be an element subset of . In the second case, let be an element subset of and .
Without loss of generality, suppose that for every . Define the matrix by setting if the ’th smallest element of intersects the ’th smallest element of with respect to the ordering , and otherwise.
Claim 9.8**.**
Let be the matrix defined by , if is even, and if is odd. Then is -free.
Proof.
Let us start with introducing some notation. Each 0-1 curve cuts the strip into two parts, an upper and lower part. We say that a point set is above the curve if it is a subset of the upper part, and it is below, if it is a subset of the lower part. Also, if is a 0-1 curve and , let denote the subcurve of starting on the vertical line , and ending at . For , we define .
Suppose that in , and in induce . Let be the intersection points of the curves and , ordered by their -coordinates. As is -intersecting, we have . These intersection points cut both and into subcurves, let us denote them by and from left to right. Note that implies that if is even, then is below , and is above , while if is odd, then is above and is below . For , let denote the region in bounded by and , and call these regions lenses. If is even, say that is the top boundary of and is the bottom boundary, and if is odd, then is the top boundary of , and is the bottom boundary. Note that if intersects the lens , then intersects only the top boundary of , as and each of the curves intersect exactly one of and . Therefore, if and intersect, and must have the same parity.
For , let denote the smallest index for which intersects the lens . We show that , which contradicts . Suppose that for some . As and have different parities, we have and cannot intersect . Let denote the union of the top boundaries of all the lenses, and let denote the union of the bottom boundaries of the lenses, then and are 0-1 curves. Let be the first intersection point of and , and let
[TABLE]
In other words, we obtain the curve by following the bottom boundaries of the lenses until we reach the lens , where we follow the top boundary until we reach . Let be the region bounded by and , see Figure 1. The curve starts outside , but contains the lens , so must enter . However, does not intersect intersect , nor does it touch . Thus, cannot intersect . Hence, must enter through . Since also leaves , it must also exit through . Therefore, and intersect twice, contradiction. ∎
The matrix does not contain a homogeneous column, so we can apply Theorem 2.1 to conclude that contains a homogeneous submatrix of size at least . This corresponds to two collections and with the desired properties.
∎
9.3 Pseudohalfplanes
A bi-infinite -monotone curve is the graph of a continuous function . A collection of bi-infinite -monotone curves is a pseudoline-arrangement if any two elements of intersect in exactly one point. If is a pseudoline-arrangement, then is a pseudohalfplane-arrangement if every element is either the set of points below an element of , or the set of points above an element of .
Let be a set of points in the plane and let be a pseudohalfplane-arrangement. Consider the matrix whose rows are labeled with elements of , columns are labeled with the elements of , and
[TABLE]
It is proved in [21, Theorem 2.19, Proposition A.1] (see also [9]) that can be partitioned into two submatrices and such that the following holds: the rows and columns of and can be ordered such that and does not contain as a submatrix. But then Theorem 1.1 immediately implies that some linear set of pseudohalfplanes contains or avoids a positive proportion of the points.
Corollary 9.9**.**
Let be a set of points in the plane and let be a pseudohalfplane-arrangement with elements. Then there are subsets and of size for a suitable constant , such that either for every and we have , or for every and we have .
10 Concluding remarks
Our work establishes various bounds on the size of the largest homogeneous submatrix that can be found in a matrix, when a fixed submatrix is forbidden. A summary of our results for fixed small can be found in Figure 2. A number of questions remain unsolved, and it would be very interesting to obtain good bounds for simple or acyclic matrices. Perhaps the first open question is to decide if satisfies Conjecture 2.5, i.e., if forbidding the submatrix in an zero-one matrix guarantees the existence of a homogeneous submatrix.
These questions are also closely related to recent results on the Erdős-Hajnal theory of trees: Extending previous work in [8, 25], Chudnovsky, Scott, Seymour and Spirkl [11] proved the following variant of the Erdős-Hajnal conjecture.
Theorem 10.1** (Chudnovsky et al.).**
Let be a tree. Then the family of all graphs not containing an induced copy of and has the strong Erdős-Hajnal property.
Our problems can be thought of as an ordered bipartite version of the strong Erdős-Hajnal problem. For example, it is not hard to see that Theorem 10.1 would follow from Conjecture 2.5.
Indeed, we can think of our zero-one matrix as the biadjacency matrix of a bipartite graph with parts of size . Submatrices then correspond to induced subgraphs, and a homogeneous submatrix means a subgraph that is complete or empty between and . An important difference, though, is that a forbidden submatrix only forbids one ordering of the corresponding bipartite graph (where the vertices in the two parts are ordered according to the rows and columns of the matrix). This is a much weaker condition and adds considerable difficulty to our problem.
Approximate versions of our Conjectures 2.5 and 2.4, finding homogeneous submatrices, were very recently proved by Scott, Seymour and Spirkl [34].
Extremal questions about zero-one matrices have been extensively studied over the past decades, and it is worth mentioning a few that are loosely related to our problem.
A zero-one matrix contains a pattern , where is another zero-one matrix, if can be obtained from a submatrix of by changing some 1-entries to 0-entries. When is a biadjacency matrix, this corresponds to the subgraph relation (as opposed to submatrices corresponding to induced subgraphs). The Turán number is defined as the maximum number of 1-entries in an zero-one matrix that does not contain the pattern . A central problem in this area is a conjecture of Pach and Tardos [30] that whenever is acyclic. Although this is known for many such matrices [18, 29, 30, 23], the general conjecture remains open.
Another related question asks for , the maximum number of distinct columns in an unordered -free zero-one matrix with rows. When we think of as the incidence matrix of a hypergraph, finding is connected to certain hypergraph coloring problems (see, e.g., [26]), as well as other structural results. For example, in the special case when is the zero-one matrix with all different columns, the Sauer-Shelah lemma gives . An open conjecture of Anstee and Sali [6] asserts that for an implicitly defined integer function . For further partial results on this topic, we refer the reader to the survey [4].
Acknowledgments
We thank Balázs Keszegh and Dömötör Pálvölgyi for drawing our attention to their recent paper [21] and for pointing out that our Theorem 1.1 implies Corollary 9.9. We are also grateful to Maria Axenovich for sharing with us her manuscript [2].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Alon, E. Fischer, I. Newman, Efficient testing of bipartite graphs for forbidden induced subgraphs, SIAM Journal on Computing 37 (2007): 959–976.
- 2[2] M. Axenovich, C. Tompkins, L. Weber, Large homogeneous subgraphs in bipartite graphs with forbidden induced subgraphs, ar Xiv:1903.09725 preprint.
- 3[3] N. Alon, J. Pach, R. Pinchasi, R. Radoičić, M. Sharir, Crossing patterns of semi-algebraic sets, Journal of Combinatorial Theory Series A 111 (2) (2005): 310–326.
- 4[4] R.P. Anstee, A survey of forbidden configuration results, Electronic Journal of Combinatorics DS 20 (2013): pp 53.
- 5[5] R.P. Anstee, M. Farber, Characterizations of totally balanced matrices, Journal of Algorithms 5 (1984): 215–230.
- 6[6] R.P. Anstee, A. Sali, Small forbidden configurations IV, Combinatorica 25 (2005): 503–518.
- 7[7] C. Berge, Hypergraphs: Combinatorics of Finite Sets, North-Holland, Amsterdam (1989).
- 8[8] N. Bousquet, A. Lagoutte, S. Thomassé, The Erdős-Hajnal conjecture for paths and antipaths, Journal of Combinatorial Theory, Series B, 113 (2015): 261–264.
