The limiting distribution of the hook length of a randomly chosen cell in a random Young diagram
Ljuben Mutafchiev

TL;DR
This paper investigates the asymptotic distribution of the hook length of a randomly chosen cell in a random Young diagram, revealing a specific limiting distribution as the size of the partition grows large.
Contribution
It establishes the weak convergence of the scaled hook length distribution to a new explicit probability density function for large partitions.
Findings
The scaled hook length converges in distribution as partition size increases.
The limiting distribution has a density function proportional to y/(e^y-1).
The results provide insight into the typical shape and properties of large random Young diagrams.
Abstract
Let be the number of all integer partitions of the positive integer and let be a partition, selected uniformly at random from among all such partitions. It is known that each partition has a unique graphical representation, composed by non-overlapping cells in the plane called Young diagram. As a second step of our sampling experiment, we select a cell uniformly at random from among all cells of the Young diagram of the partition . For large , we study the asymptotic behavior of the hook length of the cell of a random partituion . This two-step sampling procedure suggests a product probability measure, which assigns the probability to each pair . With respect to this probability measure, we show that the random variable converges weakly, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Combinatorial Mathematics · Advanced Mathematical Identities
The Limiting Distribution of the Hook Length of a Randomly
Chosen Cell in a Random Young Diagram
**Ljuben Mutafchiev
**American University in Bulgaria, 2700 Blagoevgrad, Bulgaria
and Institute of Mathematics and Informatics of the
Bulgarian Academy of Sciences
Abstract
Let be the number of all integer partitions of the positive integer and let be a partition, selected uniformly at random from among all such partitions. It is known that each partition has a unique graphical representation, composed by non-overlapping cells in the plane called Young diagram. As a second step of our sampling experiment, we select a cell uniformly at random from among the cells of the Young diagram of the partition . For large , we study the asymptotic behavior of the hook length of the cell of a random partition . This two-step sampling procedure suggests a product probability measure, which assigns the probability to each pair . With respect to this probability measure, we show that the random variable converges weakly, as , to a random variable whose probability density function equals if , and zero elsewhere.
Mathematics Subject Classifications: 11P82, 05A17, 60F05, 60C05
Key words: integer partition, Young diagram, hook length, limiting distribution
1 Introduction and Statement of the Main Result
For a natural number , we say that is a partition of if is a sequence of positive integers satisfying and such that
[TABLE]
The summands in (1) are usually called parts of . The Young diagram of a partition is an array of square boxes, or cells, in the first quadrant of the plane, left-justified, with cells in the -th row counting from the bottom. We label these cells , with denoting the row number of the cell and - the column number in the Young diagram. For example, in the Young diagram of of the partition of as , the cell is the square whose vertices in the -plane have coordinates . Reading consecutively the numbers of cells in the columns of the array of the partition , beginning from the most left column, we get the conjugate partition , where . The Young diagram of is called conjugate of the Young diagram of . In our example, we have , which is the conjugate partition of , namely, . The hook length of the cell in the partition is defined by the formula
[TABLE]
that is, is the number of cells in the hook comprised by the -cell itself and by the cells in the -th row right of and -th column above . Recalling our example given above, we get since and .
Let be the set of all partitions of and let (further on, by we denote the cardinality of the set ). The number is determined asymptotically by the famous partition formula of Hardy and Ramanujan [14]:
[TABLE]
A precise asymptotic expansion for was found later by Rademacher [25] (more details may be also found in [2, Chapter 5]). Further on, we assume that, for fixed integer , a partition is selected uniformly at random (uar). In other words, we assign the probability to each . In this way, each numerical characteristic of can be regarded as a random variable defined on (or, a statistic in the sense of the random generation of partitions of ). The study of the asymptotic behavior of various partition statistics for large is a subject of intensive research in combinatorics, number theory and statistical physics. Erdös and Lehner [5] were apparently the first who established a probabilistic limit theorem related to integer partitions. As a matter of fact, they found an appropriate normalization for the number of parts in a random partition of and showed that it converges weakly to the extreme value (Gumbel) distribution as . By conjunction of the corresponding Young diagram, the same limit theorem holds true for the largest part of a random integer partition. For other typical distributional results and limit theorems of various integer partition statistics, we refer the reader, e.g., to [26], [27], [28], [6], [9], [24], [20], [31], [10].
Another subject of study in the asymptotic theory of integer partitions is related to the limit shape of the underlying Young diagrams. Here is a simple setting of the problem. Let
[TABLE]
be the multiplicity of the part equal to in a partition , . The upper boundary of the Young diagram of is a piecewise constant function given by
[TABLE]
If the set is endowed with a probability measure (e.g., is the uniform measure or the Plancherel distribution on the set of Young diagrams with cells), the limit shape, with respect to as , is understood as a function (curve) in the plane such that for every and any ,
[TABLE]
where and are suitable scaling constants satisfying . With respect to the uniform distribution on the limit shape in (5) exists under the scaling and is determined by the function
[TABLE]
or, in a more symmetric form, by the equation
[TABLE]
The limit shape (7) for the uniform distribution on was first identified by Temperley [29] in relation to a model of a growing crystal. Afterward Szalay and Turán [26] obtained essentially analogous estimates of those in (5) and (6), which were used later by Vershik [30] to establish (7) and generalize it to other types of probability measures and models. In the same paper [30], Vershik gave also an important probabilistic frame of the relationship between problems from statistical mechanics (models of ideal gas) and problems related to the asymptotic theory of partitions. An alternative proof of (7) was also given by Pittel in [24]. Recent developments in this area are presented in [3].
In the present paper we study a statistic produced by a random selection of a cell in a random Young diagram. The sampling procedure combines the outcomes of two experiments: first, we select a partition uar, and then we select a cell uar. A similar type of sampling was first proposed and studied in the context of part multiplicities of a random integer partition in [4], and then in the context of part sizes in [22] and [23]. The sampling procedure that we introduced leads to the product probability space
[TABLE]
and the product probability measure , defined by
[TABLE]
(For more details on the theory of product probability spaces, we refer the reader, e.g., to [18, Part I, Section 4.2]). We also denote by the expectation with respect to the probability measure .
Our aim in this paper is to determine asymptotically, as , the distribution of the hook length of a randomly chosen cell of the Young diagram of a partition chosen uar. More precisely, for any pair , we denote the underlying hook length by
[TABLE]
where the function was defined earlier by (2). We organize the paper as follows. Section 2 contains some auxiliary facts that we need further. In Section 3 we prove the following limit theorem for .
Theorem 1
Let . Then, we have
[TABLE]
Remark 1. The hook lengths of cells in the Young diagram of an integer partition play an important role in algebraic combinatorics thanks to the famous hook-length formula due to Frame et al. [8]. It represents the number of standard Young tableaux of shape by the hook lengths of the cells in the following way:
[TABLE]
A standard Young tableau of shape is a labelling of the cells of the Young diagram of with numbers to so that the labels are strictly increasing from bottom to top along columns and from left to right along rows. There exists a bijection between the partitions from the set and the irreducible representations of the symmetric group of letters (see, e.g., [17, Chapters 1 and 4]). The work of Young [32], [33] shows that gives the dimension of the irreducible representation of related to the partition . A probabilistic proof of formula (11) was given by Greene et al. [12]. It is based on a construction of a random walk on the cells of the Young diagram (called also a hook walk). In its first step, a cell of a Young diagram containing cells is selected uar. If this cell has coordinates , then the walk continues on the cells in the hook of the cell . A detailed discussion on the enumerational and algorithmic aspects of formula (11) may be found in [16, Section 5.1.4].
Remark 2. Our method of proof is based on:
- •
some combinatorial identities due to Han [13](see Lemmas 1 and 2 in the next section)
- •
the saddle point method in terms of admissibility in the sense of Hayman (see [7, Chapter VIII.5]);
- •
the method of moments in probability theory (the Fréchet-Shohat limit theorem; see, e.g., [18, Chapter IV, Section 11.4]).
Another possible approach to the problem could be built on Vershik’s ideas for the limit shape of a random Young diagram (see [30]). In this context, one has to deal with a family of probability measures , defined on the set of all partitions . The key feature in a study of this type is that, under certain conditions, the uniform probability measure on is recovered as a conditional distribution . Moreover, can be constructed as a product measure, resulting into mutually independent random part multiplicities (see their definition given by (4)). In the context of random partitions, this phenomenon was first observed and applied by Fristedt [9]. The computation of the expectation of the sum of the part sizes of a partition with respect to the measure suggests the proper choice of the parameter that approaches as becomes large. By the above property of the conditional distribution , we can consider as selected uar from the set . Dealing with the family of measures on the space avoids some technical difficulties because, under this setting, the multiplicities are independent. One can now introduce the usual scaling for random Young diagrams with cells () in order to obtain their limit shape (5)-(7) in the -plane as . (For more details, we refer the reader to [30] and [3]). Next, we can choose a point at random from the region bounded by the lines and by the curve (7). The computation of the horizontal and vertical distances between this point and the curve is easy. Their sum shows the typical hook length of a cell in a random Young diagram for large . Further on, a possible way to identify the required limiting distribution is, e.g., to apply some known asymptotic results from [24] or [22] for the part sizes of a random partition and of its conjugate partition . In our further study we prefer, however, to follow an approach, which, in our opinion, is more direct: we apply the Cauchy formula to a generating function identity and then employ the saddle point method for the asymptotic analysis of the coefficients representing the moments of the underlying statistic.
2 Preliminaries
2.1 Combinatorial Identities and Generating Functions
We start with the definition of another partition statistic, defined on the set of all partitions of . We equip this set with the uniform probability measure and, for any , we define its -th moment by
[TABLE]
In the case the sum in the right-hand side of (12) clearly counts the number of parts in , while, by (1), . Further on, will stand for the expectation with respect to the uniform probability measure on . From (12) it obviously follows that
[TABLE]
Furthermore, the definition of the product probability measure on the space (see (9) and (8), respectively) and definition (10) of the random variable imply that
[TABLE]
Our first preliminary facts are related to identities that are due to Han [13]. We present the first one in terms of the expectations introduced by (13) and (14).
Lemma 1
[13, Corollary 6.5]**. For , we have
[TABLE]
Han [13, Theorem 6.6] also obtained an identity for the generating function of . It will be the starting point in our asymptotic analysis. Let
[TABLE]
be the Euler partition generating function.
Lemma 2
For any and , we have
[TABLE]
where
[TABLE]
Remark 3. In fact, Han [13] has proved the following combinatorial identity:
[TABLE]
Lemma 1 translates it in terms of the expectations (13) and (14).
2.2 Analytic Combinatorics Background: Meinardus Theorem
on Weighted Partitions and Hayman Admissibility of Generating Functions
Lemma 2 implies that the coefficient of can be expressed by a Cauchy integral whose integrand contains the product . Its behavior heavily depends on the properties of the partition generating function whose infinite product representation (15) shows that its main singularity is at the point (see also [2, Chapter 5]). A complete asymptotic expansion of , for fixed and odd , was obtained by Grabner et al. [10]. We need here only the main asymptotic term of , , which can be obtained using the Hayman-Meinardus version of the saddle point method. Here, we will give a brief account on this subject. This approach was also demonstrated in [22, Sections 5 and 6].
We start with some general remarks on the Hardy-Ramanujan formula (3). The Cauchy integral stemming from (15) and representing can be analyzed with the aid of the Hardy, Ramanujan and Rademacher’s circle method developed in [14] and [25]. Subsequent generalizations of these results are related to extensions of the class of generating functions, for which an infinite product representation similar to (15) is valid. An important result in this direction is due to Meinardus [19] (see also [2, Chapter 6]) who obtained the asymptotic of the Taylor coefficients of infinite products of the form:
[TABLE]
under certain general assumptions on the sequence of non-negative numbers . Meinardus’ approach is based on considering the Dirichlet generating series
[TABLE]
We briefly describe here Meinardus’ assumptions avoiding their precise statements as well as some extra notations and concepts.
- •
The first assumption () specifies the domain in the complex plane, in which has an analytic continuation.
- •
The second one () is related to the asymptotic behavior of , whenever . A function of the complex variable which is bounded by , in certain domain of the complex plane is called a function of finite order. Meinardus’ second condition requires that is of finite order in the whole domain .
- •
Finally, the Meinardus third condition () implies a bound on the ordinary generating function of the sequence . It can be stated in a way simpler than the Meinardus’ original expression by the inequality:
[TABLE]
for sufficiently small and some constants () (see [11]).
It is known that the Euler partition generating function (which is obviously of the form (17) satisfies the Meinardus scheme of conditions () - () (see, e.g., [2, Theorem 6.3]).
In the asymptotic analysis of the Cauchy integral stemming from Lemma 2 we shall use a variant of the saddle point method given by Hayman’s theorem for admissible power series [15] (for more details, see, e.g., [7, Chapter VIII.5]). To present Hayman’s idea and show how it can be applied to the proof of Theorem 1, we need to introduce some auxiliary notations.
We consider here a function that is analytic for , . For , we let
[TABLE]
[TABLE]
In the statement of the Hayman’s result, we shall use the terminology given in [7, Chapter VIII.5]. We assume that for and satisfies the following three conditions.
- •
Capture condition. and .
- •
Locality condition. For some function defined over and satisfying , one has
[TABLE]
as , uniformly for .
- •
Decay condition.
[TABLE]
as , uniformly for .
Definition 1
A function satisfying the capture, locality and decay conditions is called a Hayman admissible function.
Hayman Theorem. Let be a Hayman admissible function and be the unique solution in the interval of the equation
[TABLE]
Then the Taylor coefficients of satisfy, as ,
[TABLE]
with and given by (19) and (20), respectively.
3 Proof of Theorem 1
The proof is divided into two parts.
(A) Proof of Hayman admissibility for .
(B) Obtaining an asymptotic estimate for the Cauchy integral stemming from Lemma 2.
3.1 Part (A)
This part of the proof follows the same line of reasoning given in [22, pp. 338-340]. For the sake of completeness, we present it here.
First, we need to show how Hayman theorem can be applied to find the asymptotic behavior of the Taylor coefficients of the partition generating function . Since in (17) we have , the Dirichlet generating series (18) is , where denotes the Riemann zeta function: . We set in (21) where is the unique solution of the equation
[TABLE]
Granovsky et al. [11] showed that the first two Meinardus conditions imply that the unique solution of (22) has the following asymptotic expansion:
[TABLE]
where is fixed constant (here we have also used that ; see [1, Chapter 23.2]. We also notice that (20) and (3.1) imply that
[TABLE]
(This is a particular case of Lemma 2.2 from [21] with .) Hence, by (22) and (24), and as , that is, Hayman’s ”capture” condition is satisfied with . To show next that Hayman’s ”decay” condition is satisfied, we set
[TABLE]
where is a function satisfying as arbitrarily slowly and in the second equality we have used (3.1). We can apply now an estimate for established in a general form in [21, Lemma 2.4] with the aid of all three Meinardus conditions. In our particular case it states that there exist two positive constants and , such that, for sufficiently large ,
[TABLE]
uniformly for . This, in combination with (3.1) and (24) implies that uniformly in the same range of , which is just Hayman’s ”decay” condition. Finally, by Lemma 2.3 of [21], established with the aid of Meinardus’ conditions () and (), Hayman’s ”locality” condition is also satisfied by . In fact, this lemma implies in the particular case that
[TABLE]
uniformly for , where and are determined by (24) and (25), respectively. Hence, all conditions of Hayman’s theorem hold, and we can apply it with and to find that
[TABLE]
Remark 4. To show that formula (28) yields (3), one has to replace (3.1) and (24) in the right-hand side of (28). The asymptotic of is determined by a general lemma due to Meinardus [19] (see also [2, Lemma 6.1]). Since and (see [1, Chapter 23.2]), in the particular case of , this lemma implies that
[TABLE]
where . The rest of the computation leading to (3) is based on simple algebraic manipulations and cancellations.
3.2 Part (B)
We are now ready to apply the Cauchy coefficient formula to the generating function identity of Lemma 2. We use the circle , as a contour of integration and obtain, for any fixed , that
[TABLE]
Then, we break up the range of integration as follows:
[TABLE]
where
[TABLE]
[TABLE]
where , and is defined by (25).
To estimate , we notice that, for fixed , by the definition of Riemann integrals, (16) and (3.1),
[TABLE]
The last two equalities follow from the estimate
[TABLE]
which is a simple consequence of (3.1). Combining (31)
[TABLE]
where .
For the asymptotic estimate of , we need to expand around the point . Thus, for any fixed and uniformly for any , we have
[TABLE]
As previously, we can consider the sum representing as a Riemann sum. So, for large , we can replace it by the value of the corresponding integral (see, e.g., [1, Section 27.1]). Hence, by (16), (3.1) and (33), we have
[TABLE]
In the same way, we can estimate the first derivative of :
[TABLE]
where the last -estimate follows from (33). Hence, by (25) and (3.2), the error term in (3.2) becomes
[TABLE]
Consequently, (3.2) - (3.2) imply that
[TABLE]
Inserting this estimate and (27) into (30) and applying the asymptotic of the partition function from (28), we obtain
[TABLE]
In the first asymptotic equivalence we have used (24) and (25) in order to get
[TABLE]
if as not too fast, so that . Combining (29) - (31), (3.2) and (3.2), we get
[TABLE]
which in turn shows that
[TABLE]
Now, we recall Lemma 1, (13) and (14). Combining these observations with (40), we obtain
[TABLE]
Consequently,
[TABLE]
Hence, the Frechet-Shohat limit theorem [18, Chapter IV, Section 11.4] implies that the sequence converges in distribution to a random variable with probability density function , and zero elsewhere, which completes the proof of the theorem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Abramovitz and I. A. Stegun, Handbook of Mathemathical Functions with Formulas, Graphs and Mathematical Tables , Dover Publ. Inc. (New York, 1965).
- 2[2] G. E. Andrews, The Theory of Partitions, Encyclopedia Math. Appl. 2 , Addison-Wesley (Reading, MA, 1976).
- 3[3] L. V. Bogachev, Unified derivation of the limit shape formultiplicative ensembles of random integer partitions with equiweighted parts, Random Struct. Alg. 47 (2015), 227-266.
- 4[4] S. Corteel, B. Pittel, C. D. Savage, H. S. Wilf, On the multiplicity of parts in a random partition, Random Stuct. Alg. 14 (1999), 185-197.
- 5[5] P. Erdös and J. Lehner, The distribution of the number of summands in the partitions of a positive integer, Duke Math. J. 8 (1941), 335-345.
- 6[6] P. Erdös and M. Szalay, On the statistical theory of partitions, In: Topics in Classical Number Theory, vol. I (G. Halasz ed.) , North-Holland, Amsterdam, pp. 397-450, 1984.
- 7[7] P. Flajolet and R. Sedgewick, Analytic Combinatorics , Cambridge Univ. Press (Cambridge, 2009).
- 8[8] J. S. Frame, G. de B. Robinson and R.M. Thrall, The hook graphs of the symmetric group, Canad. J. Math. 6 (1954), 316-324.
