The Normalized Matching Property in Random and Pseudorandom Bipartite Graphs
Niranjan Balachandran, Deepanshu Kush

TL;DR
This paper investigates the conditions under which bipartite graphs exhibit the Normalized Matching Property (NMP), establishing thresholds in random graphs and showing that pseudorandom graphs can be made to have NMP after minor vertex removals.
Contribution
It introduces sharp thresholds for NMP in random bipartite graphs and demonstrates that pseudorandom graphs can attain NMP after removing a small fraction of vertices, extending understanding of graph matchings.
Findings
Identifies p=log(n)/k as a sharp threshold for NMP in G(k,n,p).
Shows pseudorandom bipartite graphs can have NMP after minor vertex deletions.
Proves an almost vertex decomposition into Euclidean trees with NMP.
Abstract
A simple generalization of the Hall's condition in bipartite graphs, the Normalized Matching Property (NMP) in a graph with vertex partition states that for any subset , we have . In this paper, we show the following results about having the Normalized Matching Property in random and pseudorandom graphs. 1. We establish as a sharp threshold for having NMP in , which is the graph with (assuming ), and in which each pair is an edge independently with probability . This generalizes a classic result of Erd\H{o}s-R\'enyi on the threshold for having a perfect matching in . 2. We also show that a pseudorandom bipartite graph - upon deletion of a vanishingly small fraction of vertices…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Normalized Matching Property in Random and Pseudorandom Bipartite Graphs
Niranjan Balachandran111Department of Mathematics, Indian Institute of Technology Bombay. Email: [email protected]. and Deepanshu Kush222Department of Computer Science, University of Toronto. Email: [email protected]. Work was performed while the author was at IIT Bombay.
Abstract
A simple generalization of the Hall’s condition in bipartite graphs, the Normalized Matching Property (NMP) in a graph with vertex partition states that for any subset , we have . In this paper, we show the following results about the Normalized Matching Property in random and pseudorandom graphs.
We establish as a sharp threshold for having NMP in , which is the graph with (assuming ), and in which each pair is an edge independently with probability . This generalizes a classic result of Erdős-Rényi on the threshold for having a perfect matching in . 2. 2.
We also show that a pseudorandom bipartite graph - upon deletion of a vanishingly small fraction of vertices - admits NMP, provided it is not too sparse. More precisely, a bipartite graph , with , is said to be Thomason pseudorandom (following A. Thomason (Discrete Math., 1989)) with parameters if each has degree at least and each pair of distinct has at most common neighbors. We show that for any large enough -Thomason pseudorandom graph , there are “tiny” subsets such that the subgraph has NMP, provided . En route, we prove an “almost” vertex decomposition theorem: Every such Thomason pseudorandom graph admits - excluding a negligible portion of its vertex set - a partition of its vertex set into graphs that we call Euclidean trees. These are trees that have NMP, and which arise organically through the Euclidean GCD algorithm.
1 Introduction
Consider the following problems:
Suppose are positive integers. By a * star-array* (or simply star-array), we mean a array whose entries are symbols from the set . Given a star-array, when is it possible to replace some of the entries of the array by non-negative integers such that in the resulting array all the row sums equal , and all the column sums equal for some integers ? 2. 2.
Let be a sufficiently large prime power and suppose with , . Is it possible to label each element of with some element of such that each element of appears as a label exactly times, and further, for each labeled , the sum is a quadratic residue? More generally, one can ask the same question with a subgroup instead of the set of quadratic residues.
In both the problems posed above, there is a natural bipartite graph that captures the problem in its essence: Given a star-array , let and denote the set of rows and columns of respectively, and a vertex is adjacent to in if and only if the entry of corresponding to a . For the second problem consider the bipartite graph where are the given sets, and the pair is an edge in if and only if .
In the rest of the paper, shall denote a bipartite graph with vertex partition ; we shall drop the in our notation for convenience. We say that has the Normalized Matching Property (NMP for short) if: For any , if we denote by , its set of neighbors in , then . In particular, if , then this is the familiar Hall’s condition for the existence of a perfect matching in .
The following theorem of Kleitman [16] gives us an equivalent formulation of NMP in bipartite graphs:
Theorem 1.1**.**
The following statements are equivalent:
- •
* with has NMP.*
- •
For any independent set in , .
- •
There exists a multiplicity function such that (resp. ) is equal for all (resp. for all ).
It is easy to see that the problems posed above simply ask if the associated bipartite graphs have NMP by virtue of the third part of Theorem 1.1.
The Normalized Matching Property in bipartite graphs was introduced by Graham and Harper [11] and subsequently has been a focus of study in bipartite graphs in several papers (for instance [16, 24]) and some monographs as well (for instance [4, 7]). The notion also extends very naturally to finite ranked posets; for a ranked poset , let denote the set of all elements of with rank . Then we say that has NMP if for each , the bipartite graph of poset covering relations between and has NMP. NMP posets are objects of great interest specifically in related decomposition problems (see [12, 13, 14, 22, 23] for some decompositions results). As a concrete instance, the Griggs conjecture which states that any unimodal NMP poset admits a nested chain decomposition (see [14] or [25] for more details on what the definitions are) is still open - even for posets of rank - despite several attacks on the problem.
As it turns out, many interesting finite ranked posets arising from finite geometric structures have NMP. Indeed, the Boolean poset, the poset of affine flats in a finite projective -dimensional space and the poset of the subgroup lattice of abelian -groups all have NMP (see [21, 22, 23] respectively), i.e., in each of these posets, the associated bipartite graphs on the sets of elements of successive ranks within these posets have NMP. As is the case with Hall’s theorem for bipartite graphs, it is clear that graphs with “high density” are more likely to possess NMP. But in each of the instances listed above, the associated bipartite graphs are very sparse. This raises the following natural question: At what density does a typical bipartite graph have NMP?
To formulate the above question more precisely, we set up some asymptotic terminology and notation. Given functions , we write (resp. ) if (resp. ). We also write to denote that . We write (resp. ) if there exists an absolute constant and such that for all (resp. if ). If the constant involves a related parameter , then we write (resp. ) to indicate the dependence of the implicit constant on the parameter .
To formalize the question posed above, we recall some standard terminology from the theory of random graphs. For a probability space we say that an event that depends on a parameter occurs with high probability (abbreviated as whp) if as . A graph property is simply a collection of graphs, and a graph property is called monotone if whenever and then as well. The Erdős-Rényi random graph model introduced in [9] is the random graph where the vertex set is the set and each pair is an edge with probability independently. A monotone graph property is said to have a threshold if whenever then has property whp, and if then whp does not have property . A property is said to have a sharp threshold if for and , has property whp and for , does not have property whp.
The seminal paper of Erdős and Rényi [9] established sharp thresholds for several very natural monotone graph properties. A theorem of Bollobas and Thomason [6] showed that every monotone graph property admits a threshold. However, not all graph properties admit sharp thresholds; for instance, the property “ contains a cycle” admits a threshold which is sharp on one side but not the other (see [15] for more on sharp thresholds). In fact, the problem of determining sharp thresholds (if the graph property admits one) is a very popular motif in the theory of random graphs.
For bipartite graphs, Erdős and Rényi also introduced the random bipartite model where the vertex set is partitioned into two sets of size each, and each pair with is in independently with probability . One of the first results in this model is the result that is a sharp threshold for the existence of a perfect matching in [10]. As observed earlier, if , NMP is the same as Hall’s condition for bipartite graphs, so it is natural to seek the threshold for NMP in a slightly more general model for bipartite random graphs, which is what the question previously posed seeks to do.
Suppose are positive integers, and let . Let denote the random bipartite graph with the vertex partition given by with , and each pair is an edge in independently with probability . Here both and should be thought of as parameters growing to infinity with being a function of that always satisfies . Our first main result in this paper establishes a sharp threshold for NMP in the sense stated above:
Theorem 1.2**.**
Suppose , and let . There exists such that for
If then . 2. 2.
If then .
In other words, has a sharp threshold for NMP at .
Note that if or equivalently, if , then the expression for our threshold exceeds one. Also, for each fixed , if and , then a simple computation shows that the probability that has at least one isolated vertex is bounded away from zero (this will be clear from the proof of Theorem 1.2; see Lemma 3.1). Hence, the range for in the statement of the theorem is essentially the widest possible one if one seeks a sharp threshold.
Let us now return to the problems at the beginning of this section. To check if a given bipartite graph has NMP is computationally simple: form a bigger new bipartite graph with with consisting of by copies of , consisting of copies of , and being an edge in if and only if was an edge in . Then it is straightforward to see that has NMP if and only if admits a perfect matching. Hence either problem admits a computationally simple solution. But let us relax our requirement and seek an answer only in an approximate sense: For the first problem, is it possible to replace each entry with a non-negative integer such that with the exception of a negligible proportion of the rows/columns, the remaining rows and columns satisfy the aforementioned property? Or in the second problem, can we ignore a negligible proportion of elements from both sets , so that the desired property holds for the remaining elements? Since either of the originally posed problems is equivalent to asking if a given bipartite graph has NMP, this approximate version asks if a given bipartite graph “almost” has NMP in a certain sense that we shall formalize below.
The bipartite graph considered in the second problem (with the subsets of ) possesses certain regularity properties that are best described as “random-like” - as we shall soon see. Taking a cue from this, we impose the following reasonable hypotheses on bipartite graphs that we shall consider: If all the vertices of have “almost” the same degree, and suppose that no two vertices of have “too many” common neighbors in (so that there isn’t a clustering of edges between some subsets of and ), is there an affirmative answer to the approximate version for these problems?
To formulate this in more precise terms, we need the notion of a pseudorandom bipartite graph. The notion of pseudorandomness was first introduced by Thomason in the 80s [20] and pseudorandomness in graphs is a well-studied notion (see [18] for a definitive survey). One of the more popular and well-understood models for pseudorandomness in graphs is the notion of an graph (see [2]). An graph is a graph on vertices which is -regular and which satisfies the following property: If are the eigenvalues of then for all .
Pseudorandom graphs, as the name suggests, have some properties very reminiscent of random graphs, and the most well-known is the Expander-Mixing Lemma (see [2]): Suppose is an graph. If then , where denotes the number of edges of the form with and .
As mentioned earlier, Thomason introduced the notion of pseudorandomness which is a little more general, and in particular, we shall - in this paper - confine our attention to the notion of pseudorandomness in bipartite graphs as proposed by Thomason in [21].
Definition 1.1**.**
Suppose , and . A bipartite graph with vertex classes and of sizes and respectively with is called Thomason pseudorandom with parameters if every vertex in has degree at least , and every pair of distinct vertices in have at most neighbors in common.
At this juncture, a few remarks are in order. Thomason’s original definition in [21] actually only considers bipartite graphs with . Secondly, Thomason’s definition in [21] is more in line with the original notion of pseudorandomness in [20]: A graph is pseudorandom with parameters for some where the second condition states that every pair of vertices in have at most common neighbors. The definition that we shall be using is a relaxation of the restriction that , but also a restriction to the more natural and intuitive case where .
Notions of pseudorandomness are usually “symmetric” or “global” in their definitions as in the definition in [20] or in the definition of an graph. This latter notion is at first glance somewhat asymmetric in the sense that the conditions imposed on the degrees and codegrees are only for the vertices of . However, it is a simple exercise (which we shall not get into here) to show that these conditions also imply certain restrictions on the degrees and codegrees of the vertices of as a consequence of the following analogue of the expander-mixing lemma (restricted to our setup):
Theorem 1.3** (Theorem 2 in [21]).**
Let be a bipartite graph with , which is Thomason pseudorandom with parameters . Then for every subset of size at least and every subset , with and ,
[TABLE]
Again, we remark that Thomason’s theorem in [21] is stated for pseudorandom bipartite graphs with and parameters . But a glance at the proof there immediately tells us that the same proof works in our general setup as well. The interesting point is that this asymmetric definition of pseudorandomness also yields the aforementioned theorem. A heuristic and somewhat simplistic explanation for this is that we are restricting ourselves to bipartite graphs, and it is precisely due to the bipartite structure of the graph that the arguments go through.
Another reason why we prefer to work with this notion of pseudorandomness is that it is combinatorial in its definition; it only considers the degrees of the vertices and codegrees of pairs of vertices of , which is computationally easy to verify. In addition, it is a reasonably robust notion which also allows us to generate several non-trivial examples of Thomason pseudorandom graphs. While it is true that many notions of pseudorandomness do pass onto subgraphs, we did not find any concrete statement in the literature that established the same here for this notion. So we took it on ourselves to prove its robustness; see the lemma in the Appendix for a precise statement.
Pseudorandom graphs enjoy several very interesting properties. It is not hard to show that graphs with are -edge connected and as a simple consequence, it follows that for even , graphs have a perfect matching [18]. In the more general context, it is conceivable that Thomason pseudorandom graphs admit “almost-perfect” matchings, i.e., admit a perfect matching on at least vertices under not-too-restrictive conditions. The second result of our paper proves a more general version of this statement for NMP for Thomason pseudorandom graphs.
Before we formally state our result, we need the following definition.
Definition 1.2** (NMP-Approximability).**
Suppose . For functions such that as , a bipartite graph is said to be -NMP approximable if there are subsets and such that:
- •
,
- •
The bipartite subgraph induced on the sets and has NMP.
We now state our second main result of the paper.
Theorem 1.4**.**
Suppose , and let be a non-negative valued function that satisfies as . There exists an integer such that the following holds. Suppose , with , and suppose is a Thomason pseudorandom bipartite graph with parameters . Then is -NMP-approximable with
- (a)
, if and 2. (b)
* if .*
Note that in the statement of Theorem 1.4 the bounds work for all . The first part of the theorem is a stronger conclusion when . At the level of generality of the statement of Theorem 1.4, it may in fact be necessary to delete some vertices from the graph in order to achieve NMP. Indeed, the definition of a Thomason pseudorandom graph does not preclude the existence of isolated vertices; in fact, one could add a few isolated vertices to to get another pseudorandom graph with only slightly worse parameters! Also, on a less frivolous note, suppose and , and consider ; a consequence of the proof of the second item of Theorem 1.2 (which appears later in the paper as Lemma 3.1) shows that there are isolated vertices in whp. Since is also Thomason pseudorandom whp it follows that over the sparser regime for (where Theorem 1.4 is applicable), the deletion of some vertices is indeed necessary to arrive at the conclusion of Theorem 1.4.
Theorem 1.4 essentially says that if we have a not-too-sparse pseudorandom bipartite graph, i.e., a Thomason pseudorandom graph with not too small, then we can remove a small fraction of vertices from both parts such that the graph induced by the remaining vertices has the normalized matching property. The sense of how small these sets are is described using the notion of NMP-Approximability defined above. As we shall see, the proof actually establishes an “approximate decomposition” theorem: the vertex set of any Thomason pseudorandom bipartite graph almost admits a decomposition into copies of what we call a Euclidean Tree - a small tree that arises canonically via the execution of the Euclidean algorithm. Furthermore, the entire process of obtaining and is algorithmic (and efficient) in nature and we consider this to be a major feature of our argument. After the publishing of this article, we have learned that this notion of Euclidean Trees has been defined prior to our work in the context of graphic matroids333We thank Attila Sali for bringing this to our attention. (see [26]). So we find it quite interesting to see it reappear in the context of a seemingly unrelated problem.
The rest of the paper is organised as follows. The next section gives some preliminaries and sets up terminology and tools that will be of use in the latter sections. In Section 3 we prove Theorem 1.2, and in Section 4, we prove Theorem 1.4. The paper concludes with some remarks and open questions in Section 5, and an Appendix. As mentioned earlier, the lemma in the Appendix can serve as a generator of several examples of Thomason-pseudorandom graphs for which Theorem 1.4 is applicable. The main reason for including the Lemma is that most of the standard and well-studied examples of pseudorandom graphs that arise from algebraic structures/posets tend to have , or even in the cases where , the corresponding bipartite graphs are much sparser than the ones we need in our hypothesis.
2 Preliminaries
Suppose is a bipartite graph. For , set , . For sets , by we shall mean the subgraph of induced by the vertex set . For a vertex , shall denote its degree, and for sets , shall denote the number of edges between and .
We shall repeatedly make use of the Chernoff bound:
Theorem 2.1**.**
[Chernoff Bound] (As in [15]) Suppose is a binomial random variable and . Then for
[TABLE]
A natural question that arises in the context of NMP is: If has NMP, then does also have NMP, i.e., is it true that for all ? This is not immediately obvious from the definition of NMP, but it is indeed the case, as can be immediately seen from the second characterization of Theorem 1.1 which is symmetric in and .
We begin with a simple proposition that will be instrumental in our proof of Theorem 1.2 in Section 3. For a graph that does not have NMP we say that a set of vertices witnesses the violation of NMP for if .
Lemma 2.1**.**
Suppose with , does not have NMP. Then, if witnesses the violation of NMP for , then witnesses the violation of NMP for . Moreover, either there exists that witnesses the violation of NMP for with , or there exists that witnesses the violation of NMP for with .
Proof.
If witnesses the violation of NMP for , then
[TABLE]
where we subtracted both sides from and used the simple fact that in the final inequality. Now, to see the “moreover” part, as does not have NMP, first let be a minimal set that witnesses the violation of NMP for . By the minimality of , we have . If , then we are through, so suppose that . Let . Then note that . But then by the argument above (which is symmetric in and ), witnesses the violation of NMP for .
∎
We also take note of a couple of facts from literature on random graphs that will be useful in the proof of Theorem 1.2. By (respectively ) we mean the degree of vertex into (respectively the degree of vertex into ) in .
Fact 2.2**.**
Let . For any fixed , in , for all and for all whp.
This follows from the following well known result (see [5] for instance, chapter 3) that in if for any function that goes to infinity with , then whp has minimum degree since the number of vertices of degree is approximately Poisson. The same argument extends to as well.
Fact 2.3**.**
Let and suppose . Then in , whp every has degree at least .
This is an easy consequence of the Chernoff bound (Theorem 2.1). Indeed, since , it follows that
[TABLE]
We now introduce an important ingredient that is vital to the proof of Theorem 1.4. Suppose are positive integers with . A tree will be called a left-right tree if the two color classes of its vertex set are labelled as “left” and “right” respectively. Since a connected bipartite graph admits a unique -coloring of its vertices, a left-right tree can be thought of a tree with a label on each vertex denoting its color class.
The Euclidean -tree which we shall denote by , is a left-right tree on vertices with left vertices, and right vertices that is defined recursively as follows. If , is simply a star on vertices with one left vertex and right vertices. If , then is the star on vertices with one right vertex, and left vertices. In general, suppose and are the left and right vertex sets respectively, and suppose . Let denote the matching consisting of the edges for . We define where denotes an edge disjoint union, and is the corresponding Euclidean tree with left vertex set and right vertex set . If then we define to be the matching for all and define where is the Euclidean tree with left vertex set and right vertex set . A picture is worth a thousand words; see Figure 1 that illustrates the Euclidean tree , and Figure 2 that illustrates .
The following lemma conveys why Euclidean trees are relevant to us.
Lemma 2.2**.**
Suppose is a Euclidean tree. Then if denote the sets of left and right vertices respectively, then as the bipartite graph has NMP. Moreover, so does the graph obtained by making several vertex-disjoint copies of i.e., the graph where , .
Proof.
First assume that . If , then is simply a star with leaves, and clearly, has NMP. Suppose by induction that Euclidean trees with fewer than vertices have NMP. Let . Then since , it follows that where is the set of neighbors of among . But since has NMP, we have , so that and that completes the proof. If , then the above argument works with swapped with throughout and the fact that has NMP if and only if does. Finally, the observation that has NMP follows immediately from the third (multiplicity function) characterization of NMP in Theorem 1.1. ∎
We now describe what we call the “Euclidean (-tree process” which details a realization of the graphs through a series of steps, which along with the corresponding terminology we build here will be relevant in Section 4 in the proof of Theorem 1.4. This description also justifies why we call them Euclidean trees.
Suppose . Consider the Euclidean algorithm on the pair as follows.
[TABLE]
If we set , then we may write the equalities above as for . is referred to as the complexity of the Euclidean algorithm for the parameters . The following fact is well-known (see for instance, [17], page 360).
Fact 2.4**.**
The complexity of the Euclidean algorithm with input parameters is at most .
We now describe as the evolution of an inductive sequence of trees through stages ( as above), and in order to do that, we need some additional terminology. By an * -fan*, we mean the tree and by a -fan, we mean . By an -thrill444The collective noun for fans is a thrill, so the nomenclature seemed appropriate. of size we mean a union of vertex disjoint -fans, and a -thrill is defined analogously. For a fixed graph , an -factor in a graph is a spanning subgraph of consisting of vertex disjoint copies of . As an example, an -thrill admits a factoring by -fans.
By definition, is inductively obtained through a sequence of edge disjoint unions of matchings, until we finally terminate in a tree or , for some . We now invert this process.
Suppose as described above in the Euclidean algorithm is even (the odd case is analogous). Let . Having inductively defined with left set , right set and edge set , we define as follows. If is even, then the vertex set of has left set , right set , and the edges of consist of the edges of along with an additional -thrill of size between the vertices of and the vertices of . If is odd, then has left vertex set , right vertex set and the edges of consist of the edges of along with an additional -thrill of size between the vertices of and the vertices of . In simpler terms, it is the same construction but with the roles of the left and right sets reversed as per the parity of . The main point is that the graphs are precisely the Euclidean trees (or depending on the parity of ) along with isolated vertices. While the inductive definition of the Euclidean tree appends one additional matching at each step, the Euclidean tree process accelerates this by adding a -thrill for an appropriate . In particular, is precisely and as we shall see in Section 4, it is particularly handy to think of as the end result of this evolving process. Figure 2 gives an illustration of this evolution for the Euclidean tree .
3 Threshold for NMP for
In this section we prove Theorem 1.2, restated below for convenience. Throughout this section, we shall write to denote . Unless stated otherwise, we shall assume .
Theorem .
Suppose , and let . There exists such that for
If then . 2. 2.
If then .
We establish item 2 first i.e., that if is below the threshold then whp, does not have NMP. The proof is straightforward as it simply shows the existence of an isolated vertex in whp.
Lemma 3.1**.**
Suppose be such that for all . Let . There exists such that for , if then does not have NMP whp.
Proof.
Let and let denote the number of isolated vertices in . Then .
Claim 3.1**.**
Given , there exists a unique such that for all , and equality holds only when . Moreover, as , .
The claim is a standard exercise in basic calculus, so we omit its proof.
Fix such that . Since , by the above claim, there exists sufficiently large such that . Consequently, which grows to infinity as does, where is defined to be . Now using the Chernoff bound (taking in the second inequality in Theorem 2.1), we have
[TABLE]
for large . This concludes the proof. ∎
Lemma 3.1 establishes that the right threshold for having NMP in must be at least as large as . The following is a heuristic argument that suggests that it is exactly . As mentioned in the Introduction, a classical result of Erdős-Rényi states that a sharp threshold for the existence of a perfect matching in a bipartite graph is . In our present situation, suppose divides . Replicate each vertex of by a factor of to obtain the set . Define the graph as follows. If arises from the replication of the vertex , then if and only if . It is a straightforward exercise to see that the original graph has NMP if and only if satisfies Halls’ condition, or equivalently, has NMP if and only if has a perfect matching. If this new bipartite graph behaves likes (which it isn’t), then we need for the existence of a perfect matching. But since each vertex of has been blown up to copies, it is intuitive to expect that each vertex of behaves like the union of all these vertices bundled together, which suggests a threshold of . While this argument is just a heuristic, it suggests what the correct threshold ought to be, as we next show is indeed the case by establishing the remaining (and main) item of Theorem 1.2.
Here is an overview of the proof. Lemma 3.2 proves the theorem when is large (i.e., grows to infinity with ), and this part of the proof only takes recourse to Theorem 1.1. The general case however is a little more delicate. The basic idea in the general case of the proof considers estimating the probability that there is a minimal set that violates the NMP condition. In that sense, our strategy follows a line of argument á la Erdős-Rényi but we need some additional ideas and more careful analysis to carry it through to fruition.
Lemma 3.2**.**
Suppose where the function for all and satisfies as . Let . Then there exists such that for , if , then .
Proof.
Let , and let . Since NMP is a monotone property, it suffices to establish the lemma for .
Suppose fails to have NMP. By Theorem 1.1, there exists an independent set in such that Thus, from the union bound, the probability that does not have NMP is at most where for , where
[TABLE]
Here, is an upper bound on the probability that there is a set of size and a set of size such that is an independent set. is an upper bound on the probability that contains an isolated vertex.
We define and split into three cases according to whether is “small”, “intermediate”, or “large” and repeatedly make use of the well-known bounds for all and the binomial coefficients for all .
Small Case: .
Here, using followed by standard binomial coefficient bounds, (1) yields
[TABLE]
where to derive (4), we use the bounds , and for large enough . This is where we crucially use our assumption that as . (5) follows by using the trivial fact that and taking out the common factor . (6) is obtained by using , plugging in and working out that the expression in the square brackets in (5) is at most for small . Finally, since for large enough , it follows that in this case.
Intermediate Case: .
Using the same expression for the upper bound on as in the previous case, we have
[TABLE]
Using the observation that in this case, and the trivial bound , we obtain
[TABLE]
where the last inequality follows - setting - from the fact that for all . Hence,
Large Case: .
This case is completely analogous to the small case. First, observe
[TABLE]
for large enough (again using as ) and we have that is at most
[TABLE]
where in the last step we use the bound for large enough . Consequently,
[TABLE]
To explain the last step, the expression within the square brackets evaluates to which is at most when . But for sufficiently large and since . Thus, we have and that completes the proof of the lemma. ∎
Note that the argument in the intermediate case does not require and in fact shows the following (in light of Theorem 1.1, switching from the independent set viewpoint to the violation of NMP viewpoint):
Corollary 3.2**.**
Given , for any large enough, and vertex sets and of sizes and respectively, the probability that there exists with for such that witnesses a violation of NMP for is at most .
Interestingly, the proof of Lemma 3.2 actually works out for all if one assumes in the hypothesis instead of the sharper assumption on . This, combined with Lemma 3.1, already establishes that is a threshold for NMP. The additional ideas employed in the remainder of this section are essentially only required to show that is a sharp threshold.
Proof of Theorem 1.2..
In light of Lemma 3.2, it suffices to prove the theorem assuming . here may be replaced by any slow-growing (but unbounded) function of or without much change to the rest of the argument, but we stick to for convenience.
By Lemma 2.1 either there exists with that witnesses a violation of NMP for , or there exists with that witnesses the violation of NMP for (of course, these cases need not be mutually exclusive; we merely use that combined, they exhaust the event that NMP is violated). The proof naturally splits into cases (labelled and respectively) according to whether the set winessing the violation is a subset of or . We shall show that either case occurs with low probability by exploiting certain properties of the minimal witness.
Case :
Define to be the constant if and if . In light of Facts 2.2 (for if ) and 2.3, it follows that any minimal that witnesses the violation of NMP for must have size at least whp where denotes the minimum degree of the vertices in . The choice of the peculiar constant will become clear later.
Suppose such that where . We first claim that every of size witnesses at least neighbors (as a set) in . Indeed, suppose there is a subset of vertices in which are the neighbors of only one vertex in . Then by the minimality of , it follows that the set satisfies which is a contradiction, and that proves the claim.
We divide case further into two subcases. First, we bound the probability that there exists of size for which (notice that this clearly implies ) which witnesses a violation of NMP for . So fix a choice for of size , and (which will represent ) of size equal to some integer in the interval . Fix a partition of into sets of size . By size considerations, there are at least such parts, and by the observation above, each such part admits at least two neighbors in . We conclude that the probability that there exists with which witnesses a violation of NMP for is at most
[TABLE]
To see why, observe that there are choices for , at most values for (since minimally witnesses a violation of NMP), each of which is at most . The probability that is at most , and finally, the last expression is a bound on the probability that each of the blocks of vertices has at least neighbors in . The condition on that we have imposed in this subcase simply translates to the observation that the quantity in the right-most parenthesis that is raised to is less than . So, we have
[TABLE]
for sufficiently large and where in the final step, we used the fact that an infinite geometric series is at most twice the first term, when the common ratio is small enough. This expression is clearly when (and so ). Further, it is at most when .
For the subcase , we simply bound (which we shall call ) the probability of a minimal whose size is in this range by the probability that is independent and sum over the entire range of again. First, observe that in this subcase,
[TABLE]
and thus,
[TABLE]
as before and we are through.
Finally, observe that the case follows immediately from Corollary 3.2.
Case :
There is a minimal witness with that witnesses the violation of NMP for . This time though, since it follows that , and that for every there are at least neighbors in . Now, define . As earlier, by Fact 2.2, the minimal that witnesses the violation of NMP for must have size at least whp. Again, we split this into two subcases: and where again .
Suppose . Analogous to how we divided Case into two subcases, let us first assume that which in particular, lets us assume that . Then the probability that such a witness exists of size in this range is at most
[TABLE]
where to derive (9), we use in the exponent and the more crude bound elsewhere, which is applicable since by assumption, . We also subsequently drop the range in the sum for convenience. Next, if , then we simply bound the probability of there being a witness of size in this range by the probability that is an independent set (i.e. the final parenthesis in the expression for above is dropped) and sum over this range of again. The calculations (for the accordingly defined expression ) are very similar to that of in case and are omitted here.
Finally, if , then note that has size , and by Lemma 2.1, witnesses the violation of NMP for and is covered by Corollary 3.2. ∎
4 Normalized Matching Property in Pseudorandom Graphs
In this section, we prove Theorem 1.4 which is restated below for convenience. Suppose and . Recall that a bipartite graph with is called Thomason pseudorandom with parameters if every vertex in has degree at least , and if every pair of vertices in have at most neighbors in common.
Theorem 1.4.
Suppose , and let be a non-negative valued function that satisfies as . There exists an integer such that the following holds. Suppose , with , and suppose is a Thomason pseudorandom bipartite graph with parameters . Then is -NMP-approximable with
- (a)
, if and 2. (b)
* if .*
In what follows, is a Thomason pseudorandom graph with parameters where and where denotes a function that satisfies as . As always, , and are sufficiently large (depending on the choice of and ). As in the proof of Theorem 1.2, we split the task of proving NMP-approximability into two cases: the first, in which is significantly larger than and the second, in which the two are comparable.
Here is a brief overview of the proof. Suppose that
[TABLE]
where the latter is the representation in reduced form i.e., and . Our strategy of proof is to show that we can find small sets such that admits a vertex decomposition into copies of the Euclidean tree . Since has NMP by Lemma 2.2, this establishes the NMP-approximability of . An essential ingredient in the proof of both cases is Lemma 4.1 (which appears below) which basically states: If satisfies that for every subset of size at least and every subset , we have , then all large enough subsets of admit an almost partition into -thrills or -thrills (as the case may be).
The application of this lemma in the first case ( large) is straightforward, but in the second case, it does not apply directly. The principal issue in the second case emanates from the possibility that in the reduced form are still large; for instance if are coprime, then and Lemma 4.1 does not apply. To circumvent this difficulty, we pre-process the graph, by deleting a small portion from both to get so that the reduced form for satisfies . Lemma 4.1 then applies in a multi-step process that we describe in Lemma 4.2.
Lemma 4.1**.**
Let and be such that or . Suppose satisfies the conclusion of Theorem 1.3. Let and and define . Then there exist subsets such that if and , then
- •
if , then is spanned by an -thrill where and ;
- •
if , then is spanned by a -thrill where and .
Proof.
First, assume that . Let be a maximal -thrill in and let , i.e., let denote the set of all those vertices in which belong to a -fan in . Similarly, let and set . Since is an -thrill, which gives . Note that we may assume that as otherwise, the bounds on and hold trivially since for either assumption on .
By the maximality of , no vertex in has more than neighbors in , implying . Since , the aforementioned observation coupled with Theorem 1.3 implies
[TABLE]
so that
[TABLE]
Plugging yields
[TABLE]
which upon further simplification, yields the following quadratic inequality in :
[TABLE]
Since for either assumption on for sufficiently large,
[TABLE]
It now suffices to show that (for either assumption on ) . Note that . If , then we have for large enough that and therefore, If , then for large enough , we have that and therefore,
Now, assume that . This case proceeds analogously to the previous one, with only minor changes at appropriate places. Let now be a maximal -thrill and let and . Define and as in the previous case. Then by the maximality of , no vertex in has more than neighbors in , implying . Further, we have . By Theorem 1.3, assuming as earlier, we have
[TABLE]
Upon plugging in and working out as before, we obtain the quadratic inequality
[TABLE]
which is identical to (15) except with in place of and in place of . Thus, it follows that , therefore . This implies the claimed bounds in terms of as before.∎
A few remarks are in order.
Though we have slightly stronger bounds on and in the second case (when ), we simply use the stated bounds for the sake of ease of calculations later. 2. 2.
When (for instance in the pseudorandom graphs that arise from the point-hyperplane incidences of projective geometries), the calculations above in fact yield when and something analogous when . In particular, the sizes of the deleted parts are considerably smaller in this case. 3. 3.
If then the conclusions of Lemma 4.1 hold even for the graph with the same parameters since the lemma directly applies to the pair as a subset of . This is vitally of use in the way we apply the Lemma in the proof of Theorem 1.4 part (b).
Proof of Theorem 1.4 part (a).
Suppose , where and is an integer such that . Choose an arbitrary subset of size and define . Apply Lemma 4.1 to the sets and to obtain and such that is spanned by an -thrill and therefore has NMP (by Lemma 2.2). Define and so that
[TABLE]
and
[TABLE]
∎
Lemma 4.2**.**
Suppose is representation in reduced form of , suppose and let . There exist subsets with and , such that admits a -factor. Here, is the complexity of the Euclidean algorithm for the parameters as defined in Section 2.
Proof of Lemma 4.2.
Partition both and arbitrarily into “blocks”, each of size . Let the blocks be denoted by and respectively. We shall refer to the blocks as left blocks and the blocks as right blocks. Let be the remainders and quotients as defined in Section 2. We shall now replicate the Euclidean- process with the vertices being replaced by these blocks, which we shall carry out in stages, beginning with stage .
In the rest of the proof of Lemma 4.2 we assume that is even; the odd case is completely analogous. We also define the sets and analogous to the sets and in the definition of the Euclidean tree (see Section 2) as follows. If is even,
[TABLE]
and if is odd, then
[TABLE]
We also assume that .
We induct on . At stage , we apply Lemma 4.1 to appropriately defined sets and to obtain sets and such that is spanned by an -thrill or a -thrill (depending on whether is even or odd respectively). In fact, it will turn out that and are large subsets of and respectively, when is even (and something analogous when is odd). We denote the set of deleted vertices from and at the end of stage by and respectively, and these are obtained by modifying and suitably, with the help of and . We then show that admits a -factor, where as was defined in Section 2. By controlling the sizes of and (which we denote by and respectively) the Lemma follows by plugging in because and .
Let us get to the details now. For starters, we apply Lemma 4.1 to the “first” right blocks (recall that ) and the “first” left blocks. More precisely, we apply Lemma 4.1 to and so that . We obtain sets and such that is spanned by a -thrill. This terminates stage with and ; consequently, by Lemma 4.1 and . This establishes the following:
admits a -factor, with and .
Suppose now that for some , admits a -factor, and
- (1)
if is even, then and . 2. (2)
if is odd, then and .
We shall show that there exist subsets and such that admits a -factor, and furthermore,
- (a)
if is even, then and , 2. (b)
if is odd, then and ,
which would establish the induction step.
Suppose is even. Let be an arbitrary subset of of size . Define
[TABLE]
Since we have , so by Lemma 4.1, we obtain sets and with and such that is spanned by an -thrill.
By assumption, admits a -factor i.e., is spanned by vertex-disjoint copies of . Define to be the set of all those vertices in which belong to one of the above copies of that also contains at least one vertex from . Obviously, . Similarly, we define as the set of vertices in which belong to a copy of that contains at least one vertex from . We refer to such copies of in (that contain at least one vertex from ) as corrupt copies. Define
[TABLE]
as the set of those vertices of that get “corrupted” due to the introduction of further deletions during stage (i.e. the set ). In other words, is the set of vertices touched by the corrupt copies. See Figure 3 for an illustration of the induction step.
Define
[TABLE]
and set . Note that every corrupt copy of in has vertices in and vertices in . Therefore, we have the bounds
[TABLE]
Putting things together, we obtain the recurrences
[TABLE]
By the induction hypothesis we have and . Therefore,
[TABLE]
and
[TABLE]
where in the final step, we use and the fact that .
We now prove that admits a -factor. Recall from the preliminaries that if is the Euclidean tree with left vertices and right vertices , then is constructed on left vertices and right vertices , by adding to an -thrill of size between and . By Lemma 4.1, is spanned by an -thrill. This, along with the copies of that span , gives us the desired -factoring of .
The proof of the inductive step when is odd i.e., is completely analogous ( swapped with everywhere). The only small difference that arises is in the recurrences for and because of the slightly different bounds for and given by Lemma 4.1 in this case. In particular, by following the same line of argument as in the proof of , we obtain, in this case
[TABLE]
But then, by using the trivial bound , we obtain the desired estimates and .
Thus, we have shown that there exist subsets such that admits a -factor and consequently has NMP. Furthermore we have
[TABLE]
∎
We are now in a position to prove Theorem 1.4 part (b).
Proof of Theorem 1.4 part (b).
Suppose is a Thomason pseudorandom bipartite graph with parameters and with vertex classes and of sizes and respectively with .
Set and and consider the interval . Since its length is , there is an integer such that is a multiple of . Also, since , there is an integer in the interval such that is a multiple of . With and as defined above (note that ), simply pick a subset of size and of arbitrarily and define a new graph . Observe that if is the representation in reduced from of , then . Applying Lemma 4.2 to (see Remark after Lemma 4.1), we obtain subsets and such that has NMP, where and . By Fact 2.4 and the trivial bounds and , we have
[TABLE]
and similarly,
[TABLE]
and that completes the proof. ∎
5 Concluding Remarks
- •
The main engine in the proof of Theorem 1.4 comes from Lemma 4.1 which is the place the pseudorandomness is used in an explicit form. The rest of the proof of the theorem including the inductive argument uses this in a black-box manner. Hence, if we had an equivalent statement to Lemma 4.1 for other models of pseudorandomness - call it Lemma 4.1* (say), then the rest of the proof of Theorem 1.4 can run through with the error estimates being dictated by Lemma 4.1* instead. The content of Lemma 4.1 uses the notion of Thomason pseudorandomess explicitly only when we evoke Theorem 1.3 which is basically a statement that estimates how much the difference between and the expected number of edges, if the graph were random, viz., can be. For graphs, the analogue of this theorem is the expander-mixing lemma which provides precisely such an estimate.
We illustrate this by returning to problem 2 that was stated in the introduction. For , and a sufficiently large prime power, let be a multiplicative subgroup of of order at least . Consider the Sum-Cayley graph whose vertex set is and vertices are adjacent if and only if . A result of Alon and Bourgain (see [1]) states that that is a graph, i.e., it is a regular graph on vertices, with degree , and every non-trivial eigenvalue of is at most . If is the bipartite graph described in the introduction following the description of problem 2, then it is not difficult to show that for any we have by using the expander-mixing lemma. Then, via the argument in the proof of Lemma 4.1 we have: If with , , and let is a subgroup of of size at least , then there exists with , and such that has NMP. Consequently, every element of can be labeled by some element of such that each label appears times, and further, for each labeled , the sum . This answers in the affirmative, the approximate version of problem 2. One could pose more general questions of the same kind, but without the additional constraint that is a multiple of . For instance, suppose and (say), with , and let be a subgroup of of size at least . Then one can similarly show that there exist subsets with such that if are the remaining sets, then one may form a star-array of dimension whose rows and columns are labeled by the elements of respectively with the property that if the element of is a star, then . Furthermore, each row of has precisely stars, and each column has precisely stars.
- •
For a bipartite graph with that admits a perfect matching, the Max-Min Greedy Matching problem that was introduced in [8] goes as follows. Given permutations of the vertices of and respectively, the vertices of are processed according to , and each is matched to its earliest available neighbor in according to . If denote the size of the resulting greedy matching, determine . This problem admits a natural generalization. Suppose is a bipartite graph, with , with , and suppose . As before, let be permutations of the vertices of and respectively. We process the vertices of according to and for each , we choose its first neighbors in that have not been already chosen by some previous vertex of according to . Let denote the number of vertices of for which one can choose such neighbors. Then determine . Our proof of Lemma 4.1 can easily be adapted to establish the following: Suppose , and let be a function such that as . Then there exists such that whenever and is a -Thomason pseudorandom bipartite graph with , and , then .
- •
Our proof of Theorem 1.2 on closer examination reveals that does not have NMP whp for for any arbitrary function that goes to infinity. However, to prove the existence of NMP with high probability, our proof cannot extend beyond . While it is possible to improve (using our methods) our result to prove that has NMP whp for for some , the question of whether there is a sharp threshold for NMP of the form remains open.
- •
As remarked in the Introduction, our proof of Theorem 1.4 shows that works uniformly for all pairs . Is it possible to improve this to uniformly over all ?
- •
We make a final remark pertaining to a remark following the statement of Theorem 1.4 in the Introduction. As we noted, the definition of Thomason pseudorandomness does not preclude the existence of isolated vertices unless a more symmetric definition of pseudorandomness is adopted. In that case, it would be interesting to see if one can arrive at a stronger conclusion than the statement of Theorem 1.4.
Appendix: Robustness of Thomason pseudorandomness
Lemma.
Let , and be positive integers. Suppose is a Thomason pseudorandom bipartite graph with parameters with , and suppose . Then, for a given integer satisfying for , there exist subsets and such that
- •
* and , where for some fixed constant ,*
- •
the subgraph induced by the sets and is Thomason pseudorandom with parameters where and .
Proof.
Let where shall be specified later. Let be a uniformly random subset of of size . Then by the tail bound of the hypergeometric distribution (see [19]) we have, for every ,
[TABLE]
for every vertex . Now, fix . Call a vertex bad with respect to if
[TABLE]
Then by equation 16, the expected number of bad vertices is at most . Fix a set of size for which the set of bad vertices (which we shall call ) has size at most .
Now, for a vertex , let . Then for , as is not a bad vertex,
[TABLE]
where the inequality follows from the hypothesis (see Definition 1.1) that is Thomason pseudorandom. Also note that for any distinct vertices ,
[TABLE]
which follows since
[TABLE]
where the last inequality follows from the fact that . The required codegree bound then follows from the given condition on .
It remains only to check is that . To see this, observe that and also note that . Thus,
[TABLE]
where we may take the constant in the definition of . ∎
One interesting consequence of the proof of the lemma is that if we seek then one has a randomized algorithm to choose a set and a related with such that deleting these sets from respectively results in another Thomason pseudorandom graph with only slightly worse parameters.
It is known (see [21]) that bipartite graphs arising from the point-hyperplane incidence structure of a projective geometry of dimension over a finite field is Thomason pseudorandom with parameters and . More generally, one can take the point-block incidence structure arising from a symmetric block design as the “seed” Thomason pseudorandom graph which upon the application of the lemma above gives us several other examples of Thomason pseudorandom graphs with parameters that are relevant in Theorem 1.4.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N Alon, J. Bourgain, Additive Patterns in Multiplicative Subgroups, Geom. Funct. Anal. , 24 (2014), No. 3, 721-739.
- 2[2] N. Alon, J. Spencer, The Probabilistic Method , 4th ed., Wiley, 2016.
- 3[3] I. Anderson, Some problems in combinatorial number theory, Ph D thesis, University of Nottingham, United Kingdom, (1967).
- 4[4] I. Anderson, Combinatorics of Finite Sets , Dover Publications, Mineola, NY (2002).
- 5[5] B. Bollobás, Random Graphs , 2nd ed., Cambridge University Press (2001).
- 6[6] B. Bollobás, A. Thomason, Threshold functions, Combinatorica , 7 (1986), 35-38.
- 7[7] K. Engel, Sperner theory in Encyclopedia of Mathematics , Cambridge University Press (1997).
- 8[8] A. Eden, U. Feige, M. Feldman, Max-min greedy matching, Net Econ’ 19 , Proc. 14th Workshop on the Economics of Networks, Systems and Computation, Article 10, (2019)
