Finite free convolutions via Weingarten calculus
Jacob Campbell, Zhi Yin

TL;DR
This paper introduces a new combinatorial approach using Weingarten calculus to analyze finite free convolutions of polynomials, providing an alternative proof of their equivalence with matrix operations.
Contribution
It offers an innovative method based on Weingarten calculus to establish the equivalence of different descriptions of finite free convolutions, highlighting a quadrature property of certain subgroup series.
Findings
Established a combinatorial Weingarten approach for finite free convolutions
Identified a quadrature property for specific subgroup series of unitary groups
Provided explicit convolution formulas via this new method
Abstract
We consider the three finite free convolutions for polynomials studied in a recent paper by Marcus, Spielman, and Srivastava. Each can be described either by direct explicit formulae or in terms of operations on randomly rotated matrices. We present an alternate approach to the equivalence between these descriptions, based on combinatorial Weingarten methods for integration over the unitary and orthogonal groups. A key aspect of our approach is to identify a certain \emph{quadrature property}, which is satisfied by some important series of subgroups of the unitary groups (including the groups of unitary, orthogonal, and signed permutation matrices), and which yields the desired convolution formulae.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Finite free convolutions via Weingarten calculus
Jacob Campbell
Department of Pure Mathematics, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
and
Zhi Yin
Institute of Advanced Study in Mathematics, Harbin Institute of Technology, Harbin 150006, China
Abstract.
We consider the three finite free convolutions for polynomials studied in a recent paper by Marcus, Spielman, and Srivastava. Each can be described either by direct explicit formulae or in terms of operations on randomly rotated matrices. We present an alternate approach to the equivalence between these descriptions, based on combinatorial Weingarten methods for integration over the unitary and orthogonal groups. A key aspect of our approach is to identify a certain quadrature property, which is satisfied by some important series of subgroups of the unitary groups (including the groups of unitary, orthogonal, and signed permutation matrices), and which yields the desired convolution formulae.
1. Introduction
In [10], A. Marcus, D. A. Spielman, and N. Srivastava studied three convolution-type operations on polynomials of degree :
Definition 1.1** ([10, Definitions 1.1 & 1.4 & 1.7]).**
For polynomials with degree at most , say
[TABLE]
we make the following definitions:
- (1)
the symmetric additive convolution of and is defined by
[TABLE] 2. (2)
the symmetric multiplicative convolution of and is defined by
[TABLE] 3. (3)
the asymmetric additive convolution of and is defined by
[TABLE]
These are also called finite free convolutions, due to their connection with free convolution in free probability theory, which is expounded upon in [8]. The finite free convolutions can be defined explicitly as above, in terms of formulae involving the coefficients, but the motivation is their forthcoming description in terms of expected characteristic polynomials of the corresponding operations with randomly rotated matrices. We use the following notation for characteristic polynomials:
[TABLE]
is the characteristic polynomial of the matrix in the variable , and is the -th elementary symmetric function of the eigenvalues of .
Theorem 1.2** ([10, Theorems 1.2 & 1.5 & 1.8]).**
Let be polynomials with degree .
**symmetric additive: **
If are normal with and , then
[TABLE]
where is a random unitary matrix.
**symmetric multiplicative: **
If are normal with and , then
[TABLE]
where is a random unitary matrix.
**asymmetric additive: **
If are matrices with and , then
[TABLE]
where and are independent random unitary matrices.
In this paper we present an alternate proof of 1.2, and we also prove the analogous results for random orthogonal matrices instead of random unitary matrices. Our approach revolves around the Weingarten calculus for integration on the unitary and orthogonal groups; in particular, we take advantage of the connection between Weingarten functions and the combinatorial representation theory of symmetric groups. Using the orthogonality relations for irreducible characters of in the unitary case and for the zonal spherical functions of the Gelfand pair in the orthogonal case, we obtain combinatorial identities for the relevant Weingarten functions which allow us to greatly reduce the apparent complexity of direct computations of the relevant expected characteristic polynomials.
A very interesting aspect of [10] is what one might call a quadrature phenomenon: the random unitary or orthogonal matrices in the definitions of the finite free convolutions can be replaced with random signed permutation matrices, thus discretizing these operations. They isolate a property that they call minor-orthogonality, which makes this quadrature work and which is satisfied by the group of signed permutation matrices. Our techniques also yield quadrature results, with a quadrature property which plays a similar role as minor-orthogonality:
Definition 1.3**.**
A compact subgroup satisfies the quadrature property if
[TABLE]
for , for all .
In these terms, the main results are that various groups have the quadrature property:
Theorem 1.4**.**
The following groups have the quadrature property:
- (1)
the group of unitary matrices; 2. (2)
the group of orthogonal matrices; 3. (3)
the groups , for . (Here, is just , and is the group of permutation matrices.) In particular, the case recovers the group of signed permutation matrices.
We prove 1.2 in terms of this quadrature property; specifically, we prove the following theorem for the expected elementary symmetric functions of the eigenvalues of the randomly rotated matrices, which is easily seen to recover 1.2 with :
Theorem 1.5** ([T]heorems 2.10 & 2.12 & 2.14).**
*MSS2019]
Let be a compact subgroup with the quadrature property. Then*
- (1)
for , we have
[TABLE]
for normal ; 2. (2)
for , we have
[TABLE]
for normal ; 3. (3)
for , we have
[TABLE]
for .
Since the first version of this paper was posted, there has been a substantial update to [10], and we have incorporated some of the changes. At some points we still refer to the first version [9], namely for some technical details about the asymmetric additive convolution which do not differ in our proofs.
The paper is organized as follows. In Section 2 we review some preliminaries on permutations and partitions, the combinatorial representation theory of and the Gelfand pair , and the Weingarten calculus for integration on the unitary and orthogonal groups. In Section 3 we isolate, from the definitions of the finite free convolutions, the computations we need to handle, and we arrive at our quadrature property. Sections 4, 5, and 6 are devoted to the proofs that , , and , respectively, have the quadrature property. In the final Section 7 we prove 1.5, thus finishing the proof of 1.2 in terms of the quadrature property.
2. Preliminaries
In this section we review some necessary preliminaries from combinatorics and random matrix theory. We mostly follow the notation of [4].
2.1. Elementary combinatorics
For an integer partition , we may either write , or when convenient, where is the multiplicity of in ; write for the length of . We denote by the cycle type of a permutation , and for , we write . Let us collect some properties that we will use:
Lemma 2.1**.**
Let .
- (1)
The number of permutations with is . 2. (2)
We have , where . 3. (3)
For , we have .
Proof.
For (1), see e.g. [11, Proposition 1.3.2]. For (2), we have
[TABLE]
since . For (3), a cycle of length is a product of transpositions:
[TABLE]
So if , then can be written as a product of
[TABLE]
transpositions, which gives . ∎
The number of permutations in with exactly cycles is called the unsigned Stirling number of the first kind with parameters and and is denoted by . These numbers are related to the -th rising factorial, which is
[TABLE]
by the following lemma:
Lemma 2.2**.**
We have
[TABLE]
Proof.
See e.g. [11, Proposition 1.3.7]. ∎
Recall that a partition of a set is a collection of disjoint subsets, called blocks of the partition, whose union is . Let be the set of partitions of the set and for , write for the number of blocks of . Write for the set of partitions of whose blocks are all of size ; note that this is only non-empty when is even.
The ordering of is defined by letting if each block of is a subset of a block of . This ordering makes a lattice. For , write for the element of whose blocks are the equivalence classes of the relation defined by if and only if ; in other words is the partition of whose blocks are the “level sets” of the multi-index . With this notation, if and only if whenever and are in the same block of ; in other words the multi-index labels the blocks of in a consistent way.
There is a natural embedding of into which we will use extensively as in [4]: each can be written uniquely in the form
[TABLE]
with for and , and the embedding is
[TABLE]
2.2. Combinatorial representation theory
The representation theory of symmetric groups is well known to be described by the combinatorics of integer partitions and Young diagrams; a nice reference on this theory is [2]. In particular, the irreducible representations of are canonically labeled, say as , by the integer partitions . Write for the character of , and for the value of on the conjugacy class in labeled by .
2.2.1. The symmetric group and Schur functions
An important general feature of the representation theory of finite groups is that the characters of irreducible representations are orthogonal, and for this takes the following form:
Theorem 2.3** (Orthogonality relations for ).**
For , we have
[TABLE]
Proof.
See e.g. [2, Corollary 1.3.7]. ∎
The graded algebra whose -th component is the space of class functions on is identified with the graded algebra of symmetric functions, see e.g. [7, Section I.7], and the symmetric functions corresponding to the irreducible characters are called the Schur functions. We just need a particular value of :
Proposition 2.4** (Hook-content formula).**
We have
[TABLE]
for and .
Proof.
See e.g. [2, Theorem 4.3.3]. ∎
2.2.2. The Gelfand pair
If is a finite group and is a subgroup of , recall that is called a Gelfand pair if the trivial representation of induces a multiplicity-free representation of . Write for the centralizer of ; this is called the hyperoctahedral group and has order . The hyperoctahedral group may be alternately described as the group of signed permutations of symbols, as the symmetry group of a -dimensional hypercube, or as the wreath product . It is well known (see e.g. [7, VII.2.2]) that is a Gelfand pair.
Associated to a Gelfand pair is its family of zonal spherical functions. These are defined by taking the characters of the irreducible representations contained in and averaging them over , which we will make more precise below. By [7, VII.2.4] the irreducible representations of contained in are precisely the ones labeled by for , so we make the following concrete definition:
Definition 2.5** (Zonal spherical functions).**
For , define by
[TABLE]
for . This is called the zonal spherical function of the Gelfand pair corresponding to .
Let us single out the values of a particular :
Lemma 2.6**.**
For , we have
[TABLE]
Proof.
See e.g. [7, Example VII.2.2.b]. ∎
In the analogy between and , the relation of with the conjugacy classes of corresponds to the relation of with the double cosets of in :
Definition 2.7**.**
For , define a graph as follows:
- •
the vertices are ;
- •
the edges connect with and with for .
The connected components of are cycles of even lengths, and dividing those lengths by , we get an integer partition of which is called the coset type of .
By e.g. [7, VII.2.1.i], the coset type labels the double cosets of in . So for we write for the corresponding double coset and then . By [7, VII.2.3] the cardinality of a double coset is
[TABLE]
Clearly the zonal spherical functions are constant on double cosets, so we write for the value of on . For us, the critical property of the zonal spherical functions is that we still have the orthogonality relations analogous to 2.3:
Theorem 2.8** (Orthogonality relations for ).**
For , we have
[TABLE]
where is the product of the hook lengths in .
Proof.
See e.g. [7, VII.2.15’]. ∎
We need an analogue of the relationship between the irreducible characters and the Schur functions . To this end, one identifies the graded algebra whose -th component is the space of functions on which are constant on the double cosets of , with the graded algebra of symmetric functions. Then, the zonal polynomial is the symmetric function corresponding to in this identification. Again, we only need a particular value of ; write for the -th falling factorial.
Proposition 2.9**.**
We have
[TABLE]
for and . In particular, we have .
Proof.
See e.g. [7, VII.2.25]. ∎
2.3. Random matrices and the Weingarten calculus
We denote by and the compact groups of unitary and orthogonal matrices, respectively; a random unitary or orthogonal matrix is a random element of or , respectively, sampled according to the groups’ respective Haar probability measures. For , let be the -th matrix coordinate function, i.e. the function which picks out the -th entry of a matrix.
The Weingarten calculus is a family of combinatorial techniques for integration over certain classical matrix groups, named after D. Weingarten due to his pioneering work [12] which concerned the group . Ideas which first appeared there were systematized and developed for the unitary group by Collins in [3], and later for the orthogonal and symplectic groups by Collins-Śniady in [5]. One may prefer to take the perspective of e.g. Banica-Speicher in [1], in which the Weingarten calculus follows from the construction of combinatorial models of the representation categories of so-called easy groups.
2.3.1. Integration on the unitary group
The main theorem on integration over is the following, which is due to Collins and Śniady in [3, 5]. The matrix , indexed by , is constructed from the invariant theory of .
Theorem 2.10** (Unitary Weingarten calculus).**
For and and , the integral
[TABLE]
is
[TABLE]
when , and it is [math] otherwise.
We single out an expression for in terms of the characters of which we use in a critical way:
Theorem 2.11** ([5, Proposition 2.3]).**
For , we have
[TABLE]
In particular, only depends on the cycle type of .
2.3.2. Integration on the orthogonal group
Now let us state the main theorem on integration over , which is due to Collins-Śniady in [5, Corollary 3.4]. The matrix , indexed by , is constructed from the invariant theory of , although in a somewhat different way from the unitary case.
Theorem 2.12** (Orthogonal Weingarten calculus).**
For and , we have
[TABLE]
where the integral is with respect to the Haar probability measure of .
Again, we require an expression for in terms of the representation theory of ; here, the analogue of 2.11 is in terms of the zonal spherical functions of the Gelfand pair . Recall that the value of at , which is the dimension of the irreducible representation of labeled by , is
[TABLE]
where the last equality is by the hook-length formula [2, Theorem 4.2.14].
Theorem 2.13** ([4, Theorem 3.1]).**
For we have
[TABLE]
where is embedded into as in (4). In particular, only depends on the coset type of .
3. Quadrature property and motivation
In this section we will explain our quadrature property and where it comes from. Recall
See 1.3
This is best motivated by just diving in to the symmetric additive case and seeing what Haar integrals must be handled. We can assume without loss of generality that and are diagonal: since they are normal, they can be diagonalized, say as and for some unitary and and some diagonal and . Then we have
[TABLE]
and by invariance of Haar measure, we have
[TABLE]
So write and , and .
To approach this expected characteristic polynomial, observe that
[TABLE]
and
[TABLE]
where for we denote by the submatrix of consisting of the rows indexed by and the columns indexed by . So we want to look at for with .
Lemma 3.1**.**
We have
[TABLE]
for with , for .
Proof.
We have
[TABLE]
Switching the product and sum, we have
[TABLE]
and putting this back into the sum above, we get the desired formula. ∎
Thus we want to work with
[TABLE]
for , for , and the quadrature property does exactly this.
4. The unitary case
In this section we show that itself has the quadrature property, i.e. that
[TABLE]
for , for all . To this end we use 2.11 to reduce the computation to the following simple lemma:
Lemma 4.1**.**
We have
[TABLE]
for .
Proof.
Observe that is just the sign character of . So if , we have
[TABLE]
and otherwise, if , then by 2.3 we have
[TABLE]
so we are done. ∎
Proof of (1) in 1.4.
By 2.10 we have
[TABLE]
If is not injective, say there are some with and , we want to identify pairs of summands which cancel each other out, i.e. for each we want a corresponding with and
[TABLE]
To this end let , so that . Moreover, since only depends on the cycle type of , we have
[TABLE]
as the condition is invariant under translation of by . Thus we have shown that when is not injective, the summands in Eq. 5 cancel each other out and the sum is [math].
If is injective, then the only with is so by 2.11 we have
[TABLE]
and we are done. ∎
5. The orthogonal case
In this section we show that has the quadrature property, i.e. that
[TABLE]
for , for all . To this end we use 2.13 to reduce the computation to the following lemma:
Lemma 5.1**.**
We have
[TABLE]
for .
Proof.
If , then by (3) in 2.1, 2.2, and 2.6, we have
[TABLE]
On the other hand, if , then by 2.1, 2.6, and 2.8, for any we have
[TABLE]
thus . ∎
Proof of (2) in 1.4.
By 2.12, with and , we have
[TABLE]
since the condition forces equality. If is not injective, say there are some with , we want to identify pairs of summands which cancel each other out, i.e. for each we want a corresponding with and
[TABLE]
To this end let , which obviously satisfies . Moreover, we have in the embedding (4), so with , and have the same coset type. Since the condition is invariant under translation of by , by 2.13 we have
[TABLE]
Thus we have shown that when is not injective, the summands in Eq. 6 cancel each other out and the sum is [math].
If is injective, then the condition forces equality, so since
[TABLE]
and , by 2.13 and 5.1, we have
[TABLE]
and we are done. ∎
6. The hyperoctahedral case
In this brief section we show the elementary proof that the so-called hyperoctahedral series of groups for , which consist of “signed” permutation matrices where the “signs” are -th roots of unity (or in the case , the entire circle), have the quadrature property. This is similar to e.g. [10, Lemma 2.6] which is the case , and the quadrature results for these groups in general are already known from [6]. What we want to show is that
[TABLE]
for , for .
Proof of (3) in 1.4.
For , we have
[TABLE]
and the non-zero summands are the ones with and for . If is not injective, then there is no with for , so the sum is [math]. On the other hand, if is injective, there are permutations with for , i.e. ; similarly the condition forces for , i.e. . So the sum above becomes
[TABLE]
since the last sum gives summands, which are copies of . The case is similar, except with an integral over the -torus instead of a sum over copies of . ∎
7. Convolution formulae and quadrature
Finally we prove the convolution formulae; recall the generalization
See 1.5
7.1. Symmetric additive convolution
We have already done a large portion of the proof for the symmetric additive convolution in Section 3.
Proof of (1) in 1.5.
Let us pick back up from the computations in Section 3. Notice that since we assume and , we have
[TABLE]
and similarly for . So we have
[TABLE]
and then
[TABLE]
so we are done. ∎
7.2. Symmetric multiplicative convolution
The computations for the symmetric multiplicative convolution are somewhat simpler:
Proof of (2) in 1.5.
As for (1), assume without loss of generality that and are diagonal with and . Write and , so the -th entry of is and for a subset with , we have
[TABLE]
Switching the product and sum, we have
[TABLE]
thus
[TABLE]
and we are done. ∎
7.3. Asymmetric additive convolution
The computations here, for the asymmetric additive convolution, are more involved than for the symmetric convolutions; we refer to [9, Section 2.3.2] at some points for details which do not concern our techniques. Let us first make some simplifying notation:
Notation 7.1**.**
For , write
[TABLE]
and
[TABLE]
Proof of (3) in 1.5.
Assume without loss of generality – passing to singular value decompositions if necessary – that and are diagonal with and . We have
[TABLE]
so with , we have
[TABLE]
If is odd, then for any with and for any , there is some such that . So we may assume is even, say . The coefficient of is
[TABLE]
and we have
[TABLE]
since the cross terms vanish in the second-last line.
Putting this back into the larger sum, we get
[TABLE]
since only the only non-zero summands are the ones with or . Now by the quadrature property, this is equal to
[TABLE]
where the last equality follows as in [9, Section 2.3.2]. ∎
Acknowledgements
The authors wish to thank Alexandru Nica and Daniel Perales for introducing them to finite free convolutions, and for several enlightening discussions around these topics. J.C. would also like to thank Gavin Orok for his feedback on various aspects of this project. Z.Y wish to thank Benoit Collins and Ping Zhong for helpful discussions, and he also grateful to the Pure Mathematics Department of Waterloo University, where provided a fruitful environment to work on the project. This project was partially supported by NSFC No. 11771106.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Teodor Banica and Roland Speicher. Liberation of orthogonal Lie groups. Adv. Math. , 222(4):1461–1501, 2009.
- 2[2] Tullio Ceccherini-Silberstein, Fabio Scarabotti, and Filippo Tolli. Representation theory of the symmetric groups. The Okounkov-Vershik approach, character formulas, and partition algebras , volume 121 of Cambridge Studies in Advanced Mathematics . Cambridge University Press, Cambridge, 2010.
- 3[3] Benoît Collins. Moments and cumulants of polynomial random variables on unitary groups, the Itzykson-Zuber integral, and free probability. Int. Math. Res. Not. , (17):953–982, 2003.
- 4[4] Benoît Collins and Sho Matsumoto. On some properties of orthogonal Weingarten functions. J. Math. Phys. , 50(11):113516, 14, 2009.
- 5[5] Benoît Collins and Piotr Śniady. Integration with respect to the Haar measure on unitary, orthogonal and symplectic group. Comm. Math. Phys. , 264(3):773–795, 2006.
- 6[6] Chris Hall, Doron Puder, and William F. Sawin. Ramanujan coverings of graphs. Adv. Math. , 323:367–410, 2018.
- 7[7] I. G. Macdonald. Symmetric functions and Hall polynomials . Oxford Mathematical Monographs. The Clarendon Press, Oxford University Press, New York, second edition, 1995.
- 8[8] Adam Marcus. Polynomial convolutions and (finite) free probability. https://web.math.princeton.edu/~amarcus/papers/ff_main.pdf , 2018.
