Proof of an entropy conjecture of Leighton and Moitra
H\"useyin Acan, Pat Devlin, Jeff Kahn

TL;DR
This paper proves a conjecture by Leighton and Moitra, establishing an entropy bound for probability distributions on permutations that favor certain tournament arcs, with improvements for transitive tournaments.
Contribution
The paper confirms the conjecture, providing a general proof and a sharper entropy bound for transitive tournaments, advancing understanding of permutation distributions in combinatorics.
Findings
Entropy of distributions exceeds a threshold if they favor tournament arcs.
For transitive tournaments, a shorter proof with a better entropy bound is provided.
The entropy bound depends on a fixed positive parameter related to arc bias.
Abstract
We prove the following conjecture of Leighton and Moitra. Let be a tournament on and the set of permutations of . For an arc of , let . For a fixed , if is a probability distribution on such that for every arc of , then the binary entropy of is at most for some (fixed) positive . When is transitive the theorem is due to Leighton and Moitra; for this case we give a short proof with a better .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
footnotetext: AMS 2010 subject classification: 05C20, 05D40, 94A17, 06A07footnotetext: Key words and phrases: entropy, permutations, tournaments, regularity
Proof of an entropy conjecture of
Leighton and Moitra
Hüseyin Acan 111Department of Mathematics, Rutgers University
[email protected] Supported by National Science Foundation Fellowship (Award No. 1502650).
Pat Devlin 11footnotemark: 1
[email protected] Supported by NSF grant DMS1501962.
Jeff Kahn 11footnotemark: 1 33footnotemark: 3
Abstract
We prove the following conjecture of Leighton and Moitra. Let be a tournament on and the set of permutations of . For an arc of , let .
Theorem. For a fixed , if is a probability distribution on such that for every arc of , then the binary entropy of is at most for some (fixed) positive .
When is transitive the theorem is due to Leighton and Moitra; for this case we give a short proof with a better .
1 Introduction
In what follows we use for and for binary entropy. The purpose of this note is to prove the following natural statement, which was conjectured by Tom Leighton and Ankur Moitra [6] (and told to the third author by Moitra in 2008).
Theorem 1**.**
Let be a tournament on and a random (not necessarily uniform) permutation of satisfying:
[TABLE]
Then
[TABLE]
where depends only on .
(We will usually think of permutations as bijections ). The original motivation for Leighton and Moitra came mostly from questions about sorting partially ordered sets; see [6] for more on this.
For the special case of transitive , Theorem 1 was proved in [6] with . Note that for a typical (a.k.a. random) , the conjecture’s hypothesis is unachievable, since, as shown long ago by Erdős and Moon [2], no agrees with on more than a -fraction of its arcs. In fact, it seems natural to expect that transitive tournaments are the worst instances, being the ones for which the hypothesized agreement is easiest to achieve. From this standpoint, what we do here may be considered somewhat unsatisfactory, as our ’s are quite a bit worse than those in [6]. For transitive it’s easy to see [6, Claim 4.14] that one can’t take greater than , which seems likely to be close to the truth. We make some progress on this, giving a surprisingly simple proof of the following improvement of [6].
Theorem 2**.**
For , , as Theorem 1 with transitive,
[TABLE]
The proof of Theorem 1 is given in Section 3 following brief preliminaries in Section 2. The underlying idea is similar to that of [6], which in turn was based on the beautiful tournament ranking bound of W. Fernandez de la Vega [1]; see Section 3 (end of “Sketch”) for an indication of the relation to [6]. Theorem 2 is proved in Section 4.
2 Preliminaries
Usage
In what follows we assume is large enough to support our arguments and pretend all large numbers are integers.
As usual is the subgraph of induced by ; we use for the bipartite subgraph induced (in the obvious sense) by disjoint and . For a digraph , and are used analogously. For both graphs and digraphs, we use for number of edges (or arcs).
Also as usual, the density of a pair of disjoint subsets of is , and we extend this to bipartite digraphs in which
[TABLE]
For a digraph , is the digraph gotten from by reversing its arcs.
Write for the set of permutations of . For , we use for the corresponding (transitive) tournament on (that is, iff ) and for a digraph (on ) define
[TABLE]
(e.g. when is a tournament, this is a measure of the quality of as a ranking of ).
Regularity
Here we need just Szemerédi’s basic notion [7] of a regular pair and a very weak version (Lemma 3) of his Regularity Lemma. As usual a bipartite graph on disjoint is -regular if
[TABLE]
whenever , , and , and we extend this in the obvious way to the situation in (3). It is easy to see that if a bigraph is -regular then its bipartite complement is as well; this implies that for a tournament on and , disjoint subsets of ,
[TABLE]
The following statement should perhaps be considered folklore, though similar results were proved by János Komlós, circa 1991 (see [5, Sec. 7.3]).
Lemma 3**.**
For each there is a such that for any bigraph on with , there is a -regular pair with and each of at least .
Corollary 4**.**
For each , as in Lemma 3 and digraph , there is a partition of such that is -regular and
Proof.
Let be an (arbitrary) equipartition of and apply Lemma 3 to the undirected graph underlying the digraph .∎
3 Proof of Theorem 1
We now assume that drawn from the probability distribution on satisfies (1) and try to show (2) (with TBA). We use for expectation w.r.t. and for uniform distribution on .
Sketch and connection with [6]
We will produce with for some disjoint , satisfying:
- (i)
with , (where the implied constant depends on );
- (ii)
each is -regular (with TBA);
- (iii)
for all , either or is contained in one of (note this implies the ’s are disjoint).
Let and . The main points are then:
- (a)
is bounded below by a positive function of . (This is just (i) together with a couple applications of Markov’s Inequality.)
- (b)
Regularity of implies .
- (c)
Under (iii), for any ,
[TABLE]
(a weak version of independence of the ’s under ).
And these points easily combine to give (2) (see (3) and (8)).
For the transitive case in [6] most of this argument is unnecessary; in particular, regularity disappears and there is a natural decomposition of into ’s: Supposing and (for simplicity) , we may take the ’s to be the sets with running over pairs
[TABLE]
with and . (As mentioned earlier, this decomposition of the (identity) permutation also provides the framework for [1].) After some translation, our argument (really, a fairly small subset thereof) then specializes to essentially what’s done in [6].∎
Set and let be half the of Lemma 3 and Corollary 4. We use the corollary to find a rooted tree each of whose internal nodes has degree (number of children) 2 or 3, together with disjoint subsets of (the arc set of) , corresponding to the internal nodes of . The nodes of will be subsets of (so the size, , of a node is its size as a set).
To construct , start with root and repeat the following for until each unprocessed node has size less than (say) . Let be an unprocessed node of size at least and apply Corollary 4 to to produce a partition , with and -regular of density at least 1/2. (Note (4) says we can reverse the roles of and if the density of is less than 1/2.) Add to as the children of and mark “processed.” (Note the ’s are the internal nodes of ; nodes of size less then are not processed and are automatically leaves. Note also that there is no restriction on and that, for , is equal to one of , , for some .)
Let be the number of internal nodes of (the final tree). Note that the leaves of have size at most and that the ’s satisfy (ii) and (iii) of the proof sketch; that they also satisfy (i) is shown by the next lemma.
Set
[TABLE]
this quantity will play a central role in what follows.
Lemma 5**.**
* *
Proof.
This will follow easily from the next general (presumably known) observation, for which we assume is a tree satisfying:
- •
the nodes of are subsets of , an -set which is also the root of ;
- •
the children of each internal node of form a partition of with at most blocks;
- •
the leaves of are , with (any ) and depth .
Lemma 6**.**
With the setup above, .
(Of course this is exact if is the complete -ary tree of depth and all leaves have size ).
Proof.
Recall that the relative entropy between probability distributions and on is
[TABLE]
(the inequality given by the concavity of the logarithm). We apply this with and the probability that the ordinary random walk down the tree ends at . In particular , which, with nonpositivity of and the assumption , gives
[TABLE]
The lemma follows.∎
This gives Lemma 5 since , with ranging over leaves of (and again denoting depth).∎
Lemma 7**.**
The number m of internal nodes of is less than .
Proof.
A straightforward induction shows that the number of leaves of a rooted tree is , where ranges over internal nodes and denotes number of children. The lemma follows since here the number of leaves is at most (actually at most ) and each is at least 2. ∎
Recalling that and that refers to , we have which with
[TABLE]
gives (essentially Markov’s Inequality applied to ).
Set and , and let be the event . Then implying and (since ) ; so using Markov’s Inequality as above gives .
Thus, with chosen from according to , we have
[TABLE]
(recall is the uniform measure on ).
Let
[TABLE]
and, for , let . Set
[TABLE]
(see (12) for the reason for the choice of ). We will show, for each ,
[TABLE]
which implies
[TABLE]
the second inequality following from together with Lemma 7. With c=\varepsilon^{\color[rgb]{0,0,0}{3}}\delta\beta^{3}/150<(b\varepsilon\log_{3}e)/4, this bounds (for large ) the r.h.s. of (3) by
[TABLE]
which proves Theorem 1 with . ∎
The rest of our discussion is devoted to the proof of (8). For a digraph with disjoint subsets of , say a pair of disjoint subsets of with , is safe for if
[TABLE]
for every bijection with (where has the obvious meaning). We also say is safe for if is. Note that since has density at least 1/2 in , the ’s in are unsafe for .
Lemma 8**.**
Assume the above setup with and , and set and . Let be the natural partition of into intervals of size . If D is -regular and
[TABLE]
then is safe for .
(Of course an interval of is one of the sets .)
Proof.
For as in the line after (9), let and (). Then
[TABLE]
Here the last term is an upper bound on the contribution of pairs contained in the ’s: if (so and ), then
[TABLE]
gives
[TABLE]
On the other hand, regularity and (10) (which implies () since , and similarly ) give, for all ,
[TABLE]
where is the density of . Combining this with (10) bounds each of the summands in (11) by
[TABLE]
[TABLE]
and the r.h.s. of (11) by
[TABLE]
(The main term on the l.h.s. is the one with , which, since , is less than half the r.h.s. The second and third terms are much smaller (the second since is much smaller than ).) ∎
Corollary 9**.**
For D and parameters as in Lemma 8, and uniform from ,
[TABLE]
Proof.
Let . Once we’ve chosen (determining ), is the usual Hoeffding bound [3, Eq. (2.3)] on the probability that violates (10) for a given . (The bound may be more familiar when elements of are in independently, but also applies to the hypergeometric r.v. ; see e.g. [4, Thm. 2.10 and (2.12)].) ∎
Proof of (8)..
Let
[TABLE]
and . Then (as noted above) and (therefore) . Moreover—perhaps the central point—the ’s are independent, since depends only on the relative positions of and within .
On the other hand, Corollary 9, applied with (so , , and ) gives
[TABLE]
(Recall was defined in (7); since we assume is large (), the choice leaves a little room to absorb the .) And of course (12) and the independence of the ’s give (8). ∎
4 Back to the transitive case
Theorem 2 is an easy consequence of the next observation.
Lemma 10**.**
Let a random -subset of satisfying
[TABLE]
Then .
To get Theorem 2 from this, let and, for simplicity, , and decompose as in (5). For each , say with () , let consist of the indices of positions within occupied by ; that is, if , then . Then Lemma 10 (its hypothesis provided by (1)) gives
[TABLE]
so, since is determined by the ’s, we have
[TABLE]
Remark. Note that the of Theorem 2 is the best one can do without more fully exploiting (1) (that is, beyond (13) for the ’s, which is all we are using).
Proof of Lemma 10..
For , set . Then
[TABLE]
(where the 2 could actually be ); so it is enough to show
[TABLE]
For a given -subset of , we have
[TABLE]
(the first sum counts pairs with and , and is the number of such pairs with also in ); so we have
[TABLE]
implying . Combining this with , we have and then, using Cauchy-Schwarz,
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. Fernandez de la Vega, On the maximal cardinality of a consistent set of arcs in a random tournament, J. Comb. Th. Series B (1983), 328-332.
- 2[2] P. Erdős and J. Moon, On sets of consistent arcs in a tournament, Canad. Math. Bull. 8 (1965), 269-271.
- 3[3] W. Hoeffding, Probability inequalities for sums of bounded random variables, J. Amer. Statistical Assoc. 58 (1963), 13-30.
- 4[4] S. Janson, T. Łuczak and A. Ruciński, Random Graphs , Wiley, New York, 2000.
- 5[5] J. Komlós and M. Simonovits, Szemerédi’s regularity lemma and its applications in graph theory, Combinatorics, Paul Erdős is eighty, Vol. 2 (Keszthely, 1993) , 295-352, Bolyai Soc. Math. Stud. 2, János Bolyai Math. Soc., Budapest, 1996.
- 6[6] T. Leighton and A. Moitra, On Entropy and Extensions of Posets, manuscript 2011. http://people.csail.mit.edu/moitra/docs/poset.pdf.
- 7[7] E. Szemerédi, Regular Partitions of Graphs, pp. 399-401 in Problémes Combinatoires et Théorie des Graphes (Colloq. Internat. CNRS, Univ. Orsay, Orsay, 1976) , Paris: Éditions du Centre National de la Recherche Scientifique (CNRS), 1978.
