The independence number of the Birkhoff polytope graph, and applications to maximally recoverable codes
Daniel Kane, Shachar Lovett, Sankeerth Rao

TL;DR
This paper establishes that the minimal dimension for labeling edges in a bipartite graph to ensure cycle sum conditions is linear in n, improving bounds and applying representation theory to analyze the Birkhoff polytope graph.
Contribution
It proves that the minimal label dimension is linear in n, refining previous bounds, and introduces a recursive construction alongside representation theory analysis.
Findings
The minimal label dimension d is linear in n.
A recursive construction outperforms random methods.
Tight bounds for the independence number of the Birkhoff polytope graph are provided.
Abstract
Maximally recoverable codes are codes designed for distributed storage which combine quick recovery from single node failure and optimal recovery from catastrophic failure. Gopalan et al [SODA 2017] studied the alphabet size needed for such codes in grid topologies and gave a combinatorial characterization for it. Consider a labeling of the edges of the complete bipartite graph with labels coming from , that satisfies the following condition: for any simple cycle, the sum of the labels over its edges is nonzero. The minimal d where this is possible controls the alphabet size needed for maximally recoverable codes in n x n grid topologies. Prior to the current work, it was known that d is between and . We improve both bounds and show that d is linear in n. The upper bound is a recursive construction which beats the random construction. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Cryptography and Data Security
The independence number of the Birkhoff polytope graph, and applications to maximally recoverable codes
Daniel Kane Shachar Lovett Sankeerth Rao
Department of Computer Science
University of California, San Diego email: [email protected]: [email protected]. Research supported by NSF CCF award 1614023 and a Sloan fellowship.email: [email protected]. Research supported by NSF CCF award 1614023.
Abstract
Maximally recoverable codes are codes designed for distributed storage which combine quick recovery from single node failure and optimal recovery from catastrophic failure. Gopalan et al [SODA 2017] studied the alphabet size needed for such codes in grid topologies and gave a combinatorial characterization for it.
Consider a labeling of the edges of the complete bipartite graph with labels coming from , that satisfies the following condition: for any simple cycle, the sum of the labels over its edges is nonzero. The minimal where this is possible controls the alphabet size needed for maximally recoverable codes in grid topologies.
Prior to the current work, it was known that is between and . We improve both bounds and show that is linear in . The upper bound is a recursive construction which beats the random construction. The lower bound follows by first relating the problem to the independence number of the Birkhoff polytope graph, and then providing tight bounds for it using the representation theory of the symmetric group.
1 Introduction
The Birkhoff polytope is the convex hull of doubly stochastic matrices. The Birkhoff polytope graph is the graph associated with its -skeleton. This graph is well studied as it plays an important role in combinatorics and optimization, see for example the book of Barvinok [2]. For us, this graph arose naturally in the study of certain maximally recoverable codes. Our main technical results are tight bounds on the independence number of the Birkhoff polytope graph, which translate to tight bounds on the alphabet size needed for maximally recoverable codes in grid topologies.
We start by describing the coding theory question that motivated the current work.
1.1 Maximally recoverable codes
Maximally recoverable codes, first introduced by Gopalan, Huang, Jenkins and Yekhanin [6], are codes designed for distributed storage which combine quick recovery from single node failure and optimal recovery from catastrophic failure. More precisely, they are systematic linear codes which combine two types of redundancy symbols: local parity symbols, which allow for fast recovery from single symbol erasure; and global parity symbols, which allow for recovery from the maximal information theoretic number of erasures. This was further studied in [12, 1, 11, 7, 9].
The present paper is motivated by a recent work of Gopalan, Hu, Kopparty, Saraf, Wang and Yekhanin [5], which studied the effect of the topology of the network on the code design. Concretely, they studied grid like topologies. In the simplest setting, a codeword is viewed as an array, with entries in a finite field , where there is a single parity constraint for each row and each column, and an additional global parity constraint. More generally, a maximally recoverable code has codewords viewed as an matrix over , with parity constraints per row, parity constraints per column, and additional global parity constraints. An important problem in this context is, how small can we choose the alphabet size and still achieve information theoretical optimal resiliency against erausers.
Gopalan et al. [5] gave a combinatorial characterization for this problem, in the simplest setting of and . Their characterization is in terms of labeling the edges of the complete bipartite graph by elements of , which satisfy the property that in every simple cycle, the sum is nonzero.
Let . Let be a labeling of the edges of the complete bipartite graph by bit vectors of length .
Definition 1.1**.**
A labeling is simple cycle free if for any simple cycle in it holds that
[TABLE]
Gopalan et al. [5] showed that the question on the minimal alphabet size needed for maximally recoverable codes, reduces to the question of how small can we take so that a simple cycle free labeling exists. Concretely:
- •
The alphabet size needed for codes is .
- •
The alphabet size needed for codes is at least .
Before the current work, there were large gaps between upper and lower bounds on . For upper bounds, as the number of simple cycles in is , a random construction with succeeds with high probability. There are also simple explicit constructions matching the same bounds, see e.g. [6]. In terms of lower bounds, it is simple to see that is necessary. The main technical lemma of Gopalan et al. [5] in this context is that in fact is necessary. This implies a super-polynomial lower bound on the alphabet size in terms of , which is one of their main results.
We improve on both upper and lower bounds and show that is linear in . We note that our construction improves upon the random construction, which for us was somewhat surprising. For convenience we describe it when is a power of two, but note that it holds for any with minimal modifications.
Theorem 1.2** (Explicit construction).**
Let be a power of two. There exists for which is simple cycle free.
Our main technical result is a nearly matching lower bound.
Theorem 1.3** (Lower bound).**
Let be simple cycle free. Then .
1.2 Labeling by general Abelian groups
The definition of simple cycles free labeling can be extended to labeling by general Abelian groups, not just . Let be an Abelian group, and let . We say that is simple cycle free if for any simple cycle ,
[TABLE]
where is an alternating sign assignment to the edges of (these are sometimes called circulations). We note that the analysis of Gopalan et al. [5] can be extended to non-binary alphabets , in which case their combinatorial characterization extends to the one above with .
Theorem 1.4**.**
Let be an Abelian group. Let be simple cycle free. Then .
As a side remark, we note that the study of graphs with nonzero circulations was instrumental in the recent construction of a deterministic quasi-polynomial algorithm for perfect matching in NC [4]. However, beyond some superficial similarities, the setup seems inherently different than ours. For starters, they study general bipartite graphs, while we study the complete graphs. Moreover, they need to handle certain families of cycles, not necessarily simple, while in this work we focus on simple cycles.
The proofs of Theorem 1.3 and Theorem 1.4 rely on the study of a certain Cayley graph of the permutation group, which encodes the property of simple cycle free labeling. Surprisingly, the corresponding graph is the Birkhoff polytope graph.
1.3 The Birkhoff polytope graph
Let denote the symmetric group of permutations on . A permutation is said to be a cycle if, except for its fixed points, it contains a single non-trivial cycle (in particular, the identity is not a cycle). We denote by the set of cycles. The Cayley graph is a graph with vertex set and edge set . Note that this graph is undirected, as if then also .
The graph turns out to be widely studied: it is the graph of the Birkhoff polytope, which is the convex hull of all permutation matrices. See for example [3] for a proof. Our analysis does not use this connection; we use the description of the graph as a Cayley graph.
The following claim shows that Theorem 1.4 reduces to bounding the size of the largest independent set in the Birkhoff polytope graph.
Claim 1.5**.**
Let be an Abelian group. Assume that is simple cycle free. Then contains an independent set of size .
Proof.
Define
[TABLE]
where is chosen to maximize the size of . Thus . We claim that is an independent set in .
Assume not. Then there are two permutations such that . Let denote the matching in associated with , and define analogously. Let denote their symmetric difference. The fact that has exactly one cycle, is equivalent to being a simple cycle. Let be an alternating sign assignment to the edges of . Then
[TABLE]
This violates the assumption that is simple cycle free. ∎
The construction of a simple cycle free labeling in Theorem 1.2, combined with Claim 1.5, implies that the Birkhoff polytope graph contains a large independent set.
Corollary 1.6**.**
Let be a power of two. Then contains an independent set of size .
We also give in the appendix a construction of a larger independent set in the Birkhoff polytope graph, not based on a simple cycle free labeling.
Theorem 1.7**.**
Let be a power of two. Then contains an independent set of size .
The best previous bounds we are aware of are by Onn [8] who proved that contains an independent set of size .
Our main technical result is an upper bound on the largest size of an independent set in the Birkhoff polytope graph.
Theorem 1.8**.**
The largest independent set in has size .
As a side remark, we note that general bounds on the independence number of graphs, such as the Hoffman bound, give much weaker bounds. A standard application of the Hoffman bound gives a much weaker bound for the independence number of of ; and if we restrict all permutations to have the same sign, the bound improves to . The reason is that the Hoffman bounds (at least in its simplest form) directly relates to the minimal eigenvalues of the graph. However, in our case the eigenvalues are controlled by the irreducible representations of , and the extreme eigenvalues are given by low dimensional representations. This prohibits obtaining strong bounds on the independence number directly.
In order to overcome this barrier, our analysis circumvents the effect of the low dimensional representations by appealing to a structure vs. randomness dichotomy specialized for our setting. It allows us to either reduce the dimension of the ambient group, or restrict to pseudo-random assumptions about the actions of the low dimensional representations.
Organization.
We prove Theorem 1.2 in Section 2 and Theorem 1.8 in Section 3. Theorem 1.7 is proved in Appendix A.
Acknowledgements.
We thank Ran Gelles and Sergey Yekhanin for useful discussions on the problem and comments on a preliminary version of this paper. We thank Igor Pak for bringing to our attention that the Cayley graph which we study is the Birkhoff polytope graph.
2 A construction of a simple cycle free labeling
We prove Theorem 1.2 in this section. We first introduce some notation. For denote by the unit vector with in coordinate and [math] in all other coordinates. We let denote the all zero vector.
Let be a power of two. We define recursively a labeling . For set (for example)
[TABLE]
Assume . Let and , where . Define recursively as
- (i)
The first bits of are if , and otherwise they are . 2. (ii)
The next bits of are if , and otherwise they are . 3. (iii)
The last bits of are defined recursively to be .
We claim that is indeed simple cycle free. For it is simple to verify this directly, so assume .
Let be a simple cycle in , and assume towards a contradiction that . Assume has nodes, for some , and let these be . We denote and . Define furthermore and .
Claim 2.1**.**
Either or .
Proof.
Assume that both and are nonempty. Then there must exist with and , where if then we take the subscript modulo . Recall that is the neighbour of in . Its contribution to the first bits of the sum is , since and . Note that no other edge in has a nonzero value in coordinate . Thus the coordinate in the sum over is , which contradicts the assumption that the sum over is zero. ∎
Thus we can assume from now on that either or .
Claim 2.2**.**
Either or .
Proof.
Assume that , and the case of is identical. Assume that both and are both nonempty. Then there must exist with and . Recall that is the neighbour of in . Its contribution to the 2nd batch (of bits) of the sum is , since and . Note that no other edge in has a nonzero value in coordinate , where we here we need the assumption that or . Thus the coordinate in the sum over is , which contradicts the assumption that the sum over is zero. ∎
Thus we have that or , and similarly or . Thus, is a simple cycle in embedded in in one of four disjoint ways: , , or . Observe that in each of these copies, the last coordinates of the sum are precisely , so by induction cannot have zero sum.
3 The independence number of the Birkhoff polytope graph
We prove Theorem 1.8 in this section. Let be an independent set in . We prove an upper bound on the size of . Concretely, we will show that for some absolute constants . As we will see at the end, the choice of works.
The proof relies on representation theory, in particular representation theory of the symmetric group. We refer readers to the excellent book of Sagan [10], which provides a thorough introduction to the topic. We will try to adhere to the notations in that book whenever possible.
Overall Strategy.
Our basic plan will be to break our analysis into two cases based on whether or not the action of on -tuples is nearly uniform for all . This will be in analogy with standard structure vs. randomness arguments. If the action on -tuples is highly non-uniform, this will allow us to take advantage of this non-uniformity to reduce to a lower-dimensional case. On the other hand, if acts nearly uniformly on -tuples, this suggests that it behaves somewhat randomly. This intuition can be cashed out usefully by considering the Fourier-analytic considerations of this condition, which will allow us to prove that some pair of elements of differ by a simple cycle using Fourier analysis on .
Non-Uniform Action on Tuples.
Let denote the family of ordered -tuples of distinct elements of . Its size is . A permutation acts on by sending to . Below when we write we always mean the probability of an event under a uniform choice of .
Notice that if for some pair , this will allow us to reduce to a lower dimensional version of the problem. In particular, if we let , we note that . On the other hand, after multiplying on the left and right by appropriate permutations (an operation which doesn’t impact our final problem), we can assume that . Then, if were an independent set for , would correspond to an independent set for . Then, if we could prove the bound that , we could inductively prove that .
Uniform Action on Tuples.
When the action of on -tuples is near uniform for all , we will attempt to show that two elements of differ by a simple cycle using techniques from the Fourier analysis of . In fact, we will show the stronger statement that some pair of elements of differ by a single cycle of length .
Some slight complications arise here when parity of the permutations here is considered. In particular, all -cycles have the same parity. This is actually a problem for even, as all such cycles will be odd, and thus our statement will fail if consists only of permutations with the same parity. Thus, we will have to consider our statement only in the case of odd. Even in this case though, parity will still be relevant. In particular, note that the difference between two permutations in can be a cycle of length only if the initial permutations had the same parity. Thus, we lose very little by restricting our attention to only elements of with the more common parity. This will lose us a factor of in the size of , but will make our analysis somewhat easier. We are now prepared to state our main technical proposition:
Proposition 3.1**.**
Let be an odd integer and let be a sufficiently small constant. Let be a set of permutations satisfying:
- (i)
All elements of are of the same sign. 2. (ii)
For any even and any , .
Then there exist two elements of that differ by a cycle of length . In particular, we can take .
Remark.
In the second condition above, we consider only even . This is because if this condition fails, we are going to use our other analysis to recursively consider permutations of , and would like to also be odd.
We prove Proposition 3.1 below, and then show that it implies Theorem 1.8.
Proof.
First, note that by replacing all by for some odd permutation if necessary, it suffices to assume that all are even. We will assume this henceforth.
Rephrasing the problem using class functions.
Let denote the set of -cycles in . Define two class functions as
[TABLE]
It is easy to see that our conclusion is equivalent to showing that .
Let denote a partition of , namely where and . The irreducible representations of are the Specht modules, which are indexed by partitions . Let denote their corresponding characters. Their action extends linearly to . Namely, if is given by where then .
As are class functions, their inner product equals
[TABLE]
Let be a fixed cycle of length . As all elements in are conjugate to , we have and we can simplify Equation (1) to
[TABLE]
Thus, we are lead to explore the action of the irreducible characters on the full cycle .
Characters action on the full cycle.
The Murnaghan-Nakayama rule is a combinatorial method to compute the value of a character on a conjugacy class, which in our case is . In this special case it is very simple. It equals zero unless is a hook, e.g. its corresponding tableaux has only one row and one column, and otherwise its either or . Concretely, let for denote the partition corresponding to a hook. Then
[TABLE]
Thus we can simplify Equation (2) to
[TABLE]
Bounding the characters on .
The character corresponds to the trivial representation, and by our definition of it equals . Observe that we can simplify as
[TABLE]
First, we argue that the evaluation of characters on is always nonnegative.
Claim 3.2**.**
* for all .*
Proof.
Let be given by . Then
[TABLE]
where for a matrix its Frobenius norm is given by . In particular it is always nonnegative. ∎
The following lemma bounds . Observe that in particular for it gives . However, we would use it to obtain effective bounds when .
Lemma 3.3**.**
Let . For any even it holds that .
Proof.
Let denote the (not irreducible) Young module associated with a partition . In the case of it corresponds to the action of on . That is, for any we have that is a matrix whose rows and columns are indexed by respectively, where . Observe that . We extend this action to linearly.
Recall that . By assumption (ii) in Proposition 3.1 we have
[TABLE]
Thus, we can bound the Frobenius norm of by
[TABLE]
This is useful as
[TABLE]
The Kostka numbers denote the multiplicity of the Specht module in the Young module . We can thus decompose
[TABLE]
We saw that for all . By Young’s rule, equals the number of semistandard tableaux of shape and content . In particular, it is always a nonnegative integer. In the special case of and for , Young’s rule is simple to compute and gives
[TABLE]
Recall that is the trivial representation, for which and . Thus
[TABLE]
∎
We next apply Lemma 3.3 to bound for all . If then we can apply Lemma 3.3 for and obtain the bound
[TABLE]
For we need the following claim, relating to .
Claim 3.4**.**
For any it holds that .
Proof.
For any partition let denote the transpose (also known as conjugate) partition. It satisfies for all , where is the sign representation. As all elements in are even permutations, it holds by the definition of that
[TABLE]
In particular if then . ∎
Next, we lower bound as follows. The dominant terms are . For any , the corresponding term in Equation (4) appears twice, once as and once as . The term for appears once.
Furthermore, as for all by Claim 3.2, the only negative terms correspond to odd . Thus we can lower bound
[TABLE]
It is not hard to show that this is positive if is small enough. If we take , the right hand side of Equation (6) is slightly negative for large enough (the limit as is ). However, when , the second term can be replaced by rather than , making our lower bound on at least . This completes our proof.
∎
We are now prepared to prove Theorem 1.8.
Proof.
We first prove that if is odd and if all permutations in have the same sign, then
We proceed by induction on . Firstly, we note that if , the bound follows trivially.
For odd , we note that unless there is some even and some with , then our result follows immediately from Proposition 3.1. Otherwise, we may assume without loss of generality that . It then follows that letting , we can think of as a set of permutations on . Also, note that being an independent set for , implies that is an independent set for . Therefore, by the inductive hypothesis:
[TABLE]
We now need to reduce to the case of odd and consisting only of permutations of the same sign. First, restricting to only permutations of the most common sign, we can assume that all permutations in have the same sign, losing only a factor of in . Now, if is odd, we are done. otherwise, let be the most likely value of for taken from . We have that . Without loss of generality, and we can let . Since is an independent set in , and since is odd, we have
[TABLE]
∎
Appendix A A construction of a larger independent set
We prove Theorem 1.7 in this section. Assume that . We construct of size , such that is an independent set in .
Let for . Note that is a partition of for every , that and that is a partition of .
We define a sequence of subsets of . For let . For any set of size let be a bijection between subsets of of size and . Define and
[TABLE]
Since each value mod occurs equally often as a for each , and since these values are independent of one another, . Finally set . The following claim (applied for ) shows that is an independent set in .
Claim A.1**.**
Let . Let be such that . Then there exists such that
. 2. 2.
* for all .*
Proof.
We prove the claim by induction on . The case of follows from the definition of . By assumption fix both and . However, as is a cycle, it must be contained in either or . This implies that for all or all .
Consider next the case of . By induction for all . Moreover, there exists such that for all . This implies that for all .
Next, the assumption that guarantees that
[TABLE]
For any we know that and , so . Thus we obtain that also . Moreover, as we also know that and that is a bijection to , it must be the case that and hence also . Thus we conclude that for all .
To conclude, as is a cycle, it must be contained in either or . Thus, must fix all points in or all points in . We set accordingly. ∎
Finally, we compute the size of . As and we obtain that
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Balaji and P. V. Kumar. On partial maximally-recoverable and maximally-recoverable codes. In Information Theory (ISIT), 2015 IEEE International Symposium on , pages 1881–1885. IEEE, 2015.
- 2[2] A. Barvinok. A course in convexity , volume 54. American Mathematical Society Providence, 2002.
- 3[3] L. J. Billera and A. Sarangarajan. The combinatorics of permutation polytopes. In Formal power series and algebraic combinatorics , volume 24, pages 1–23, 1994.
- 4[4] S. Fenner, R. Gurjar, and T. Thierauf. Bipartite perfect matching is in quasi-nc. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing , pages 754–763. ACM, 2016.
- 5[5] P. Gopalan, G. Hu, S. Kopparty, S. Saraf, C. Wang, and S. Yekhanin. Maximally recoverable codes for grid-like topologies. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms , pages 2092–2108. SIAM, 2017.
- 6[6] P. Gopalan, C. Huang, B. Jenkins, and S. Yekhanin. Explicit maximally recoverable codes with locality. IEEE Transactions on Information Theory , 60(9):5245–5256, 2014.
- 7[7] V. Lalitha and S. V. Lokam. Weight enumerators and higher support weights of maximally recoverable codes. In Communication, Control, and Computing (Allerton), 2015 53rd Annual Allerton Conference on , pages 835–842. IEEE, 2015.
- 8[8] S. Onn. Geometry, complexity, and combinatorics of permutation polytopes. Journal of Combinatorial Theory, Series A , 64(1):31–49, 1993.
