Most binary matrices have no small defining set
Carly Bodkin, Anita Liebenau, and Ian M. Wanless

TL;DR
This paper demonstrates that most large binary matrices with fixed row and column sums lack small defining sets, implying that such matrices are typically not uniquely determined by a small subset of entries.
Contribution
The authors extend previous results by showing that, under certain conditions, large binary matrices almost surely have no small defining sets, generalizing earlier specific cases.
Findings
Most binary matrices lack small defining sets with high probability.
Critical sets in such matrices are typically large, close to the total number of entries.
Results generalize previous theorems for specific matrix sizes and proportions.
Abstract
Consider a matrix chosen uniformly at random from a class of matrices of zeros and ones with prescribed row and column sums. A partially filled matrix is a for if is the unique member of its class that contains the entries in . The of a defining set is the number of filled entries. A is a defining set for which the removal of any entry stops it being a defining set. For some small fixed , we assume that , and that , where is the proportion of entries of that equal . We also assume that the row sums of do not vary by more than , and that the column sums do not vary by more than . Under these assumptions we show that almost surely has…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Most binary matrices have no small defining set
Carly Bodkin [email protected] by an Australian Government Research Training Program (RTP) Scholarship. School of Mathematics
Monash University
Clayton Vic 3800 Australia
Anita Liebenau [email protected] by the Australian Research Council grant DE170100789 and DP180103684. School of Mathematics and Statistics, UNSW Sydney NSW 2052 Australia
Ian M. Wanless [email protected] by the Australian Research Council grant DP150100506. School of Mathematics
Monash University
Clayton Vic 3800 Australia
Abstract
Consider a matrix chosen uniformly at random from a class of matrices of zeros and ones with prescribed row and column sums. A partially filled matrix is a defining set for if is the unique member of its class that contains the entries in . The size of a defining set is the number of filled entries. A critical set is a defining set for which the removal of any entry stops it being a defining set.
For some small fixed , we assume that , and that , where is the proportion of entries of that equal . We also assume that the row sums of do not vary by more than , and that the column sums do not vary by more than . Under these assumptions we show that almost surely has no defining set of size less than . It follows that almost surely has no critical set of size more than . Our results generalise a theorem of Cavenagh and Ramadurai, who examined the case when and for an integer .
1 Introduction
Let and be integers, and let and be vectors of non-negative integers. Then is defined to be the set of all binary matrices with ones in row and ones in column , where and . We say almost all matrices in have a property if the probability that a matrix chosen uniformly at random from has the property tends to as .
A partial binary matrix is a matrix with entries [math], or , where we call a cell empty if its entry is . Let denote the set of all partial binary matrices with at most ones and zeros in row , and at most ones and zeros in column . Given and we write if , for all and .
Suppose and . Then we say is a defining set for if is the unique member of such that . Furthermore, for we define the partial matrix by
[TABLE]
The size of a partial binary matrix , denoted , is the number of nonempty cells. We define
[TABLE]
Also, define to be the maximum of amongst all binary matrices.
For integers and , let be the set of all binary matrices with constant row and column sum . In [3], Cavenagh and Ramadurai construct a matrix in with no defining set of size less than whenever is a power of . In §3, we prove a similar result for almost all matrices , for every pair of integers and , provided is not too far from square, and the number of ones in each row and column does not stray too far from the average values and . Our result is this:
Theorem 1**.**
Fix a sufficiently small . For integers with , let and be vectors of positive integers such that . Define and and suppose that uniformly for , and uniformly for . Suppose and that is bounded away from zero. Also suppose that
[TABLE]
Then almost all matrices in have no defining set of size less than .
This result significantly generalises the theorem of Cavenagh and Ramadurai mentioned above, albeit with a slightly worse error term. Taking and for all , Theorem 1 implies the following corollary.
Corollary 2**.**
For any integer , almost all matrices have no defining set of size smaller than .
We refer to the parameter in Theorem 1 as the density of . It is the proportion of entries in which equal one. Throughout this paper we require to be bounded away from zero. Our approach relies on an asymptotic formula from [1] for the number of bipartite graphs with a given degree sequence. Similar enumeration results do exist for the very sparse range [5], but the intermediate range is not yet covered. This is why we decided to not consider the case when . Furthermore, we will assume that . Without that assumption, our problem has symmetry between zeros and ones in the sense that we may switch zeros and ones without changing the size of the smallest defining set. We can easily form a defining set for any matrix by taking either all the ones or all the zeros in . The smaller of these two options turns out to provide a good upper bound on the size of the smallest defining set. The justification for legislating that is that it simplifies the exposition if we know that the number of ones does not exceed the number of zeros. We then get good estimates by observing that the minimum size of a defining set for cannot exceed , and maxsds. We note that the case where would be easily handled by replacing with in the appropriate places, but our statements are simpler if we do not need to say this each time. For similar reasons, we assume throughout that .
Since every matrix in has a defining set of size , another way to state the conclusion of Theorem 1 is that sds for almost all . It also follows that . These results are limited to the case when satisfy the hypotheses of our theorem. Since every binary matrix has a defining set of size at most , we can also say:
Corollary 3**.**
For , we have .
Cavenagh [2] and Cavenagh and Wright [4] studied critical sets, that is, defining sets which are minimal in the sense that the removal of any element destroys the property of being a defining set. They showed that the complement of a critical set is itself a defining set. Therefore Theorem 1 implies that almost all binary matrices contain no large critical set. More specifically:
Corollary 4**.**
Under the hypotheses of Theorem 1, almost all elements of have no critical set of size more than .
2 Preliminary results
In this section we provide some preliminary results used in the proof of Theorem 1. We utilise the following elegant characterisation of defining sets from [3]. It uses the idea of a South-East walk tracing through a matrix using steps to the right or downward. Such a walk separates the entries of the matrix into two classes: those above (and to the right of) the walk and those below (and to the left of) the walk. In particular, no entry lies on the walk itself. We say a partial matrix is in good form if whenever and then and whenever and then . In other words, a partial matrix is in good form if a South-East walk in exists with only ones (or empty cells) below the walk and only zeros (or empty cells) above it.
Theorem 5**.**
Let and let . Then is a defining set for if and only if and the rows and columns of the partial matrix can be permuted to be in good form.
The family of matrices constructed in [3] have the special property that within any rectangular subarray the difference between the number of ones and zeros is small. This property, combined with Theorem 5, guarantees no small defining set. In our more general setting, we are interested in the property that the difference between the number of ones and the expected number of ones in any subarray is small. Here, and henceforward, when we refer to the expected number of ones occupying a particular set of cells, the underlying distribution involves a matrix being chosen uniformly at random from all binary matrices with given dimensions and density.
Let and be any subsets of the rows and columns, respectively, of . Let be the density of . Then the subarray is the array of induced by and and denotes the number of ones in minus , which is the expected number of ones in .
Lemma 6**.**
Fix and let be a function of integers and . Let have a density satisfying . Let be a defining set for . If for every and , subsets of the rows and columns of , respectively, we have
[TABLE]
then
Proof.
Let be such that for any subsets and of the rows and columns, respectively. Let be a minimal defining set for . We show that the size of cannot be less than . By Theorem 5 we can assume that the rows and columns of have been permuted so that is in good form. That is, we can draw a South-East walk in the matrix so that all non-empty cells above are zeros and all non-empty cells below are ones. Since is minimal, must contain every one that occurs in below and every zero that occurs in above .
Let and denote the number of zeros and ones (respectively) in above , and let and denote the number of zeros and ones (respectively) in below . Hence, we have
[TABLE]
We now find an upper bound on , which is the number of ones minus the expected number of ones in below . For , define to be the number of cells in row to the left of and let . By definition, the sequence is weakly increasing. Let h=\big{\lceil}m^{3/4}\big{\rceil}. For , define a block where and C_{i}=\big{\{}f\big{(}(i-1)h\big{)}+1,\dots,f(ih)\big{\}}. Note that each block lies entirely below and is disjoint from for . Moreover, in any column there are at most cells that are below but are not in any of the . For these cells, the difference between the number of ones and the expected number of ones cannot exceed , the total number of cells involved. For each block , we then employ the bound (1) to give
[TABLE]
Now and with , so
[TABLE]
It follows that
[TABLE]
as claimed. ∎
Let be the number of labelled bipartite graphs with vertices on one side of the bipartition with degrees given by , and vertices on the other side with degrees given by . We utilise the following asymptotic estimate from [1].
Theorem 7**.**
Let , and be defined as in Theorem 1. Then we have
[TABLE]
Lastly, we need the following well-known results called the Chernoff bounds [6].
Theorem 8**.**
Let be independent Bernoulli random variables where with probability and with probability . Let and . Then
- (i)
\mathbb{P}\big{(}X\geqslant(1+\gamma)\mu\big{)}\leqslant\exp(-\frac{\gamma^{2}}{2+\gamma}\mu)* for all ,*
- (ii)
\mathbb{P}\big{(}|X-\mu|\geqslant\gamma\mu\big{)}\leqslant 2\exp(-\mu\gamma^{2}/3)* for all .*
3 Proof of the main result
An element of is the bi-adjacency matrix of a bipartite graph with vertices on one side of the bipartition with degrees given by , and vertices on the other side with degrees given by . We define the density of a bipartite graph to be the density of its bi-adjacency matrix.
Let and be subsets of the vertices of , each from a different side and denote the number of edges between and by . The property (1) is equivalent to the difference between the number of edges and the expected number of edges between and being at most . Therefore, by Lemma 6, the following theorem implies our main result, Theorem 1.
Theorem 9**.**
Let be chosen uniformly at random from the bipartite graphs with one side of the bipartition of size with degrees from and the other side of size with degrees from . Let be the density of and suppose that , , , and satisfy the hypotheses of Theorem 1. Then there is some constant such that, with probability ,
[TABLE]
for any two subsets and of the vertices, each from a different side.
Let be a random bipartite graph with sides of size and , in which each of the possible edges occurs independently with probability . Note that with respect to this graph, the expectation of is , for any two subsets and of the vertices of , each from a different side.
Proof of Theorem 9.
Fix a positive constant . We say a bipartite graph has property , if there exist subsets and , each from a different side, such that
[TABLE]
Let denote the probability that has property and let be the probability that has property . We define to be the event that has degree sequence . Then
[TABLE]
We claim that goes to zero as . Firstly we find a lower bound on . To simplify our calculations we let . Applying Stirling’s formula to the binomials given in Theorem 7, we have the following approximations, provided , , and satisfy the hypotheses of Theorem 1:
[TABLE]
[TABLE]
By assumption, for we have where and . Hence we have
[TABLE]
Similarly,
[TABLE]
Hence, we have
[TABLE]
A similar argument yields
[TABLE]
Combining all of the above approximations with Theorem 7, we find that
[TABLE]
There are labelled bipartite graphs with sides of size and and density , so
[TABLE]
We now need to find an upper bound on . Let N=\big{\lfloor}(c/\lambda)(mn^{1/2+\varepsilon}+nm^{1/2+\varepsilon})\,\big{\rfloor}. Then we have
[TABLE]
where the first inequality follows from the union bound and this sum is over all pairs of subsets and of the vertices of , each from a different side. By Theorem 8 (ii),
[TABLE]
where the last inequality is due to the fact that the number of pairs is bounded above by . We now move on to those subsets satisfying . Note that in this case, if then \big{|}e(A,B)-\lambda|A||B|\big{|}\leqslant\lambda|A||B|\leqslant\lambda N. Thus, by Theorem 8(i), we have
[TABLE]
Combining (3), (4) and (5), we have
[TABLE]
We can choose large enough that exceeds any fixed multiple of . In comparison, (2) is independent of . So for an appropriately large ,
[TABLE]
Hence tends to zero as and we are done. ∎
A topic for future research might be to try to identify which matrices have the largest sds and what structure those defining sets have. Our proofs do not give much insight into these questions. However, we do at least know that must be very close to in order to achieve .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. R. Canfield, C. Greenhill and B. D. Mc Kay, Asymptotic enumeration of dense 0-1 matrices with specified line sums, J. Combin. Theory Ser. A , 115 (2008), 32–66.
- 2[2] N. J. Cavenagh, Defining sets and critical sets in ( 0 , 1 ) 0 1 (0,1) -matrices, J. Combin. Designs , 21 (2013), 253–266.
- 3[3] N. J. Cavenagh and R. Ramadurai, Constructing ( 0 , 1 ) 0 1 (0,1) -matrices with large minimal defining sets, Linear Alg. Appl. , 537 (2018), 38–47.
- 4[4] N. J. Cavenagh and L. K. Wright, The maximum, spectrum and supremum for critical set sizes in ( 0 , 1 ) 0 1 (0,1) -matrices, J. Combin. Designs , 27 (2019), 522–536.
- 5[5] C. Greenhill, B. D. Mc Kay and X. Wang, Asymptotic enumeration of sparse 0–1 matrices with irregular row and column sums, J. Combin. Theory Ser. A , 113 (2006), 291–324.
- 6[6] M. Mitzenmacher and E. Upfal, Probability and computing – randomized algorithms and probabilistic analysis , Cambridge University Press, Cambridge UK, 2005, pp. 64–67.
