\Delta-cumulants in terms of moments
Matthieu Josuat-Verg\`es

TL;DR
This paper explores -cumulants related to -convolution, providing new formulas for -cumulants using Lagrange inversion and Schr46der trees, enhancing understanding of their combinatorial structure.
Contribution
It introduces two novel formulas for -cumulants, expanding the combinatorial and analytical tools available for studying -convolution.
Findings
Derived a simple Lagrange inversion formula for -cumulants.
Developed a combinatorial inversion involving Schr46der trees.
Provided explicit expressions linking moments and -cumulants.
Abstract
The \Delta-convolution of real probability measures, introduced by Bo\.zejko, generalizes both free and boolean convolutions. It is linearized by the \Delta-cumulants, and Yoshida gave a combinatorial formula for moments in terms of \Delta-cumulants, that implicitly defines the latter. It relies on the definition of an appropriate weight on noncrossing partitions. We give here two different expressions for the \Delta-cumulants: the first one is a simple variant of Lagrange inversion formula, and the second one is a combinatorial inversion of Yoshida's formula involving Schr\"oder trees.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Combinatorial Mathematics · Statistical Mechanics and Entropy
-cumulants in terms of moments
Matthieu Josuat-Vergès
CNRS and Institut Gaspard Monge
Université Paris-Est Marne-la-Vallée
France
[email protected] Supported by ANR CARMA (ANR-12-BS01-0017).
Abstract
The -convolution of real probability measures, introduced by Bożejko, generalizes both free and boolean convolutions. It is linearized by the -cumulants, and Yoshida gave a combinatorial formula for moments in terms of -cumulants, that implicitly defines the latter. It relies on the definition of an appropriate weight on noncrossing partitions. We give here two different expressions for the -cumulants: the first one is a simple variant of Lagrange inversion formula, and the second one is a combinatorial inversion of Yoshida’s formula involving Schröder trees.
1 Introduction
The additive convolution of two real probability measures and is usually defined as the law of where and are two independent random variables of law and , respectively. Other operations, that we can see as deformed convolutions, are obtained by replacing the classical notion of independence with other ones coming from noncommutative probability theories. Two important examples are the free convolution (see Voiculescu [13]) and the boolean convolution (see Speicher and Woroudi [12]).
The -convolution was introduced by Bożejko [1] as a special case of the conditionally free convolution from [2], and further studied in [3, 9, 14]. See [10] for the general context. This operation, denoted , depends on another measure and specializes at (respectively, ) when is the Dirac distribution at (respectively, [math]). It can be defined analytically as follows. First, is characterized its Cauchy transform:
[TABLE]
Then, the function is implicitly defined by:
[TABLE]
where is the multiplicative convolution of and (defined like the additive convolution above but with instead of ). This function also characterizes the measure , and it is called its -transform. This is a deformation of Voiculescu’s -transform (itself being the analog of the logarithm of the Fourier transform in classical probability), see [13]. Then is characterized by the fact that it is linearized by the -transform:
[TABLE]
See [14] for details.
Rather than the functional equation in (1), one can consider the relations between the moments and -cumulants , which are the coefficients in the following expansions, when they exist:
[TABLE]
near and , respectively. Alternatively, we have . These relations depend on the moments of , denoted , that we also assume to exist. Yoshida [14] proved that for an appropriate weight function on the set of noncrossing partitions of (defined in the next section), we have:
[TABLE]
This generalizes the free and boolean cases, where we have an unweighted sum over noncrossing partitions, and interval partitions, respectively. But in the weighted case of (2), inverting the relation cannot be done via a Möbius inversion of a poset, since the weight depends on the ’s.
In this work, we provide two different expressions for the -cumulants. The first one (Theorem 3.1) is based on the functional equation in (1), and is a variant of Lagrange inversion formula (see [4]) where a Hadamard product is involved. The second one (Theorem 4.8) is a combinatorial formula that is the inverse of (2), proved by inverting a matrix which is the multiplicative extension of Yoshida’s weight. The solution is in terms of Schröder trees, and relies on related notions taken from [7]. In that work, Schröder trees appeared naturally because the relations between moments and free cumulants are interpreted in the group of an operad of trees, or also in terms of characters of Hopf algebras of trees (building on [5, 6]). However, we don’t have such an algebraic interpretation for the case of -cumulants.
2 Definitions
When is a set partition of a set , we denote the equivalence relation defined by iff for some block .
Let denote the set of noncrossing partitions of , i.e. set partitions of where there exist no such that , , and . For example, . To lighten the notation, the same is written , and it is represented as:
[TABLE]
Endowed with the reverse refinement order, is a lattice, first defined by Kreweras [8]. Its minimal element is , and its maximal element is .
Let denote the set of arcs of , i.e. pairs such that , , and there is no such that and . They are indeed arcs in the graphical representations as in (3), for example those of are , , . Note that there are arcs inside a block of , and it follows that .
Definition 2.1**.**
Yoshida’s weight of is:
[TABLE]
Remark 2.2**.**
If we allow to be a positive measure (i.e. is any positive number instead of for a probability measure), we see in Equation (2) that is homogeneous of degree in and (by the relation ). So there is no loss of generality when we assume .
Let denote the set of interval partitions of , i.e. set partitions where each block is an interval of consecutive integers. Equivalently, is in iff it has no arc with .
In the free case ( for all ), we have for all . So Equation (2) is the known relation for free cumulants [11]. In the boolean case ( and for ), we have if and [math] otherwise. So Equation (2) is the known relation for boolean cumulants [12].
The Hadamard product of two series is defined by:
[TABLE]
This operation makes sense either for formal power series, or functions that are analytic at a specified point. If two measures and have all their moments, their Cauchy transforms are analytic near , and we have:
[TABLE]
Indeed, let and be two independent random variables of law and , respectively. Then we have , so .
From now on, we write for the moments and for the -cumulants, dropping the dependence in , and consider their generating functions:
[TABLE]
And to avoid confusion, we take specific notations for the two specializations of : and are respectively the free cumulants and boolean cumulants associated with . Their generating functions are:
[TABLE]
Moreover, the generating function of the moments of is similar to :
[TABLE]
In fact, since our results are essentially of algebraic or combinatorial nature, we don’t need to assume that or is the moment sequence of some measure, we can treat them as formal variables and their generating functions as formal power series. In this setting, and are related by (2) if and only if their generating functions are related by:
[TABLE]
This is a rewriting of (1), using (5) and changing to .
In particular, in the free case we have , and the relation between and is the definition of Voiculescu’s -transform [13]:
[TABLE]
And in the boolean case, , and we recover the analytic definition of boolean cumulants [12]:
[TABLE]
3 Lagrange inversion for cumulants
In this section, we denote by the coefficient of in a formal Laurent series . A formal power series with has a unique compositional inverse , such that . Lagrange inversion formula is the identity:
[TABLE]
It comes in a wide range of different forms and has a lot of variants and generalizations, see Comtet’s book [4, Chapter III]. Let us review how to use it in the case of free cumulants, following Speicher [11]. From (7), we get:
[TABLE]
and then:
[TABLE]
i.e. . Applying (9) gives
[TABLE]
Another identity is Hermite’s formula [4, p. 150, Theorem D)]:
[TABLE]
from which we get:
[TABLE]
One might prefer a formula only involving and not its derivative, and this is possible at the condition of working with Laurent series. Indeed, the previous formula also gives:
[TABLE]
and since for , we have:
[TABLE]
In the case of our series related by Equation (6), we can adapt a classical proof of Lagrange inversion formula to get the following:
Theorem 3.1**.**
For , the th -cumulant is given by:
[TABLE]
Proof.
From (6), we have:
[TABLE]
Taking the derivative, we have:
[TABLE]
Divide on both sides by \big{(}M(z)\odot\Delta(z)\big{)}^{k} to get:
[TABLE]
Then, take the coefficient of . To deal with the right hand side, note that if , we have:
[TABLE]
Since for any Laurent series , it remains:
[TABLE]
From , we easily obtain [z^{-1}]\big{(}M(z)\odot\Delta(z)\big{)}^{\prime}\big{(}M(z)\odot\Delta(z)\big{)}^{-1}=1. We thus obtain a formula for and Equation (11) follows. ∎
We end this section by a few remarks about the previous theorem. In the boolean case, we have and , so it says:
[TABLE]
After multiplying by and summing for , we get:
[TABLE]
where the term is needed to remove negative powers of from . This agrees with the analytic definition of boolean cumulants in (8).
In the free case, , so we get:
[TABLE]
Since , we have . So the formula gives
[TABLE]
and we recover (10).
It is worth writing the previous theorem in a more analytic way, using Cauchy transforms. We have:
[TABLE]
Since , and , this gives:
[TABLE]
For a function which is analytic near , its residue at is given by and can be calculated by a contour integral. So the analytic formulation of the previous theorem is:
[TABLE]
We do not know if there exists another variant of Lagrange inversion that would give the moments in terms of and .
4 Inverting the relation
We now present how to inverse the relation in Equation (2) to get a formula for in terms of . For small values of , (2) gives:
[TABLE]
From that, we successively get the values:
[TABLE]
It appears that each coefficient between parentheses is a polynomial in with positive coefficients. This property will be a consequence of our general formula for .
To present the multiplicative extension of Yoshida’s weight, we first need some definitions. If is finite, there is a natural notion of noncrossing partitions of , using the same condition as in the definition of (the only property that we need is the total order on ). They form a lattice denoted . The unique order preserving bijection induces a bijection , called standardization. If , its weight is defined as .
Also, if with and , we define the restriction of to as: . More generally, is well defined as soon as is the union of some blocks of .
Definition 4.1**.**
The map on is given by:
[TABLE]
It is a refinement by the parameters of the poset theoretic function of .
Proposition 4.2**.**
If , we have:
[TABLE]
Proof.
Let us first show that for any finite and , we have:
[TABLE]
If , we have and we recover the definition of the weight. The right hand side of (14) is clearly unchanged by the standardization process, so we get (14) in general.
Let in , then we have:
[TABLE]
and we get (13). ∎
Lemma 4.3**.**
We have:
[TABLE]
Proof.
Using (2) with instead of , we can write:
[TABLE]
Then we expand the product. Using the fact that the map is an order preserving bijection from to , and the definition of as a product of weights, we get the announced formula. ∎
We can see as a matrix whose rows and columns are indexed by , and define its inverse . It is a refinement by the parameters of the Möbius function of . It follows from (15) that:
[TABLE]
So it remains to make explicit. To this end, we use some definitions taken from [7]. Schröder trees themselves are classical objects in combinatorics but it was showned there that they are an alternative to noncrossing partitions for dealing with free cumulants.
Definition 4.4** (cf. [7]).**
Let denote the set of Schröder trees with leaves, defined as plane trees where each internal vertex has at least descendants. Among edges issued from an internal vertex, we have a left edge, a right edge, and other ones are called middle edges. Let denote the set of prime Schröder trees, defined as those such that the right edge issued from the root is attached to a leaf. Also let denote that set of internal vertices of a tree .
When drawing a tree, we take the convention that all leaves are at the same level. For example, the Schröder trees with leaves are:
[TABLE]
and the first only are prime. Those with leaves are:
[TABLE]
and the first only are prime.
Definition 4.5** (cf. [7]).**
The map is given by the following rule. Let . First, we place labels such that is placed between the th and st leaves, from left to right. Then, we have iff we can draw a path from label to label that stays above the level of the leaves, and cross only middle edges of .
For example,
[TABLE]
Another property that we will need and is elementary to check is that
[TABLE]
Definition 4.6**.**
The left branch of a tree is the path going from the root down to the leftmost leaf. Let denote the set of internal vertices that are not in the left branch of . The degree of is its number of descendants. And the weight of is:
[TABLE]
We have now all necessary definitions to state:
Theorem 4.7**.**
For any , we have:
[TABLE]
This will be proved in the next section. Together with Equations (16) and (20), it immediately follows:
Theorem 4.8**.**
The th -cumulant is given combinatorially by
[TABLE]
For example, one can check that the trees in (18) (in this order) gives the formula for given at the beginning of this section.
In the free case ( for all , hence for all ), this was obtained in [7]. It was proved there that this formula in terms of prime Schröder trees implies Speicher’s one involving the Möbius function of [11].
In the boolean case (, for ), we have if all internal vertices of are in the left branch, and [math] otherwise. Such trees with are in bijection with interval partitions, via the map (suitably restricted). For example,
[TABLE]
Moreover, the factor is easily seen to be the Möbius function of evaluated at , so we recover the known formula for boolean cumulants [12].
5 Proof of Equation (22)
Let denote the right hand side of (22), and for , let
[TABLE]
Our goal is to show that and if . Indeed, these equations precisely say that is the column vector of indexed by , i.e. .
First note that is straightforward. The sum defining is reduced to the unique term . Moreover because there is a unique such that , that having one internal vertex whose descendants are the leaves. So, from now on we assume and we want to prove .
Let us first rewrite the formula for in terms of other combinatorial objects, also taken from [7].
Definition 5.1**.**
Let denote the set of noncrossing arrangements of binary trees with leaves, defined as follows. Given dots on a horizontal axis, is a set of binary trees such that: each of the dots is a leaf of exactly one of the trees, and edges do not cross when the trees are drawn in the canonical way (formally described by the fact that the edges issued from an internal vertex go in the South East and South West directions). Also, for , we define a noncrossing partition as follows: label the leaves by from left to right, then each block of is the set of labels of the leaves in some tree of .
For example, an element is in the right part of Figure 1, and the associated noncrossing partition is . Note that the map is surjective but not injective.
Proposition 5.2** (cf. [7]).**
There is a bijection such that for , is obtained from by removing the root and its incident edges, and removing every middle edge of the tree.
See Figure 1 for an example, and note that there is an obvious identification of the vertices of with vertices of different from the root and the rightmost leaf. For we will denote where is the element of such that .
Definition 5.3**.**
We extend the weight function to by the rule that for any . For two internal vertices , we say that covers if, in the unique such that , is a descendant of via a middle edge. For , we denote by the number of vertices covered by .
Lemma 5.4**.**
For , we have:
[TABLE]
Proof.
This is a simple reformulation of Definition 4.6 using the bijection . Note that for , the number of vertices it covers is , since these are all its descendants except the left and right ones. This explains why the index in (21) becomes here. ∎
To state the next lemma, we need the classical notion of Kreweras complement [8]. Let . Suppose we have labels , , , , , , on a horizontal line, in this order, and that is drawn as in (3) (only using the labels ). Then the Kreweras complement of is defined by the condition that iff we can connect to by a path that stays above the level of the labels, and do not cross the arches of . For example, Figure 2 shows that . The map is a poset anti-isomorphism from to itself, and its inverse is denoted . We refer to [8] for details.
Lemma 5.5** (cf. [7]).**
If , we have .
Using the bijection and the previous lemma, we can write in terms of noncrossing arrangements of binary trees:
[TABLE]
A property of Kreweras complementation is that . Note also that we have clearly for . So if . Plugging the previous formula for in the definition of , it follows:
[TABLE]
Kreweras complementation being a poset anti-automorphism, we can change the condition in the summation to get:
[TABLE]
Then, let us define a map by . Here we exchange the arguments to keep the fact that if , just as . We get the following equality:
[TABLE]
We will show that this quantity is [math] by pairing terms, but we need another lemma before doing that.
If is finite, we denote the smallest interval containing , i.e. the set of consecutive integers . Note that if and , is the union of some blocks of .
If , there is an interval partition which is minimal among interval partitions above , and its number of blocks is denoted . It is easily seen that this number can be computed as follows: consider with , then with , and so on until we find with , and , this last condition meaning that cannot be defined and the process stops. Then . More precisely the smallest interval partition above is .
We also extend this map to if by the requirement .
Lemma 5.6**.**
If , we have:
[TABLE]
Proof.
We will use the following fact, which is straightforward from the definition of Kreweras complementation: asumming , is an arch of if and only if there is a block such that and .
Our goal is as follows: to each factor in , associate a factor in the right hand side of (25), and reciprocally.
Such a factor in means we can find such that , , and . This follows from Equation (13).
From , we get that contains a block with , and . Similarly, there exist such that and (for ).
This block shows that there is a factor in the right hand side of (25). Indeed, we have since . We have and so . The sets are blocks of , and the relations between their maxima and minima show that . So we get a factor in the right hand side of (25), as needed.
In the other direction, we can check that starting from and the blocks , we find as above. ∎
The next step is to define a fixed point free involution on the set , such that
[TABLE]
It will show that the right hand side of Equation (24) vanishes, since terms indexed by and cancel each other out, hence it will complete the proof of Equation (22).
To begin, we denote the (unique) block of such that , and if is another block of such that . Since , we have , so exists. To define , we distinguish two cases, whether or not.
- •
If , there is a tree in the arrangement , two of whose leaves are labelled by and . Let denote the root of . Then, is defined by removing (as well as the two edges issued from it).
- •
In the other case, , it is the reverse operation. Let and be the trees in that respectively contain and . Then is obtained from by adding a new internal vertex , whose two descendants are the roots of and .
To check that we can add the two new edges without creating a crossing in the latter case, observe that since , and , there exists a noncrossing partitions obtained from obtained by merging the block containing with that containing . This shows that is a well-defined pairing on the set . An example is given in Figure 3, with (for example).
Also, the number of trees in is one more or one less than that of , so . It remains only to show:
[TABLE]
Indeed, has then all the required properties to show that the right hand side of Equation (24) vanishes.
We can assume that we are in the first case above, i.e. , since the two cases are exchanged under the involution .
First, if , we have:
[TABLE]
Indeed, in the product of (13), does not appear since it contains , and the other factors cannot change. We also have:
[TABLE]
Indeed, is in the left branch of , so we can remove it without changing the product in (23). So (26) holds.
Now suppose . We have from (25):
[TABLE]
On the other side, we have:
[TABLE]
Multiplying the previous two equations gives (26), at the condition that
[TABLE]
This is therefore the last equality to check to complete the proof of the required properties of , hence of .
To prove (27), let us first check on the example of Figure 3. The vertices covered by are and , and the smallest interval partition above is , so (27) holds. In general, let and be the two descendants of . Then, for each vertex which is either or or covered by , consider the tree of containing , then denote the set of its leaf labels. Then it is straightforward to see that the intervals form the smallest interval partition above . This proves (27).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Bożejko : Deformed Free Probability of Voiculescu. RIMS Kokyuroku 1227, 96–113 (2001).
- 2[2] M. Bożejko, M. Leinert, R. Speicher : Convolution and limit theorems for conditionally free random variables. Pacific J. Math. 175 (1996), 357–388.
- 3[3] M. Bożejko, A. D. Krystek, Ł. J. Wojakowski : Remarks on the r 𝑟 r - and Δ Δ \Delta -convolutions. Math. Z. 253 (2006), 177–196.
- 4[4] L. Comtet : Advanced combinatorics (english translation). Reidel Publishing Company, Dordrecht, Holland, 1974.
- 5[5] K. Ebrahimi-Fard, F. Patras : Cumulants, free cumulants and half-shuffles . Proc. Royal Soc. London A 471 (2015).
- 6[6] K. Ebrahimi-Fard, F. Patras : The splitting process in free probability theory . Internat. Math. Res. Notices 2015.
- 7[7] M. Josuat-Vergès, F. Menous, J.-C. Novelli, J.-Y. Thibon : Noncommutative free cumulants. Adv. Appl. Math., to appear.
- 8[8] G. Kreweras : Sur les partitions non croisées d’un cycle. Disc. Math. 1(4) (1972), 333–350.
