The Birkhoff theorem for unitary matrices of prime-power dimension
Alexis De Vos, Stijn De Baerdemacker

TL;DR
This paper extends the unitary Birkhoff theorem for matrices of prime-power dimension, showing that a smaller set of permutation matrices suffices for the decomposition, linked to the affine group GA(w,p).
Contribution
It demonstrates that for prime-power dimensions, the Birkhoff decomposition can be achieved with a subset of permutation matrices related to the affine group, reducing complexity.
Findings
Decomposition uses epicirculant permutation matrices.
Permutation matrices form a group isomorphic to GA(w,p).
Reduction from n! to a smaller group of permutation matrices.
Abstract
The unitary Birkhoff theorem states that any unitary matrix with all row sums and all column sums equal unity can be decomposed as a weighted sum of permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights are equal to unity. If the dimension~ of the unitary matrix equals a power of a prime , i.e.\ if , then the Birkhoff decomposition does not need all possible permutation matrices, as the epicirculant permutation matrices suffice. This group of permutation matrices is isomorphic to the general affine group GA() of order only .
| no | no | |
| yes | yes | |
| no | yes |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 6 | 12 | 20 | 360 | 42 | 1,344 | 216 | 1,814,400 | 110 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsgraph theory and CDMA systems · Coding theory and cryptography · Finite Group Theory Research
The Birkhoff theorem
for unitary matrices of prime-power dimension
Alexis De Vos and Stijn De Baerdemacker
Abstract
The unitary Birkhoff theorem states that any unitary matrix with all row sums and all column sums equal unity can be decomposed as a weighted sum of permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights are equal to unity. If the dimension of the unitary matrix equals a power of a prime , i.e. if , then the Birkhoff decomposition does not need all possible permutation matrices, as the epicirculant permutation matrices suffice. This group of permutation matrices is isomorphic to the general affine group GA() of order only .
1 Introduction
Let D() be the semigroup of doubly stochastic matrices; let P() be the group of permutation matrices. Birkhoff [1] has demonstrated
Theorem 1
Every D() matrix can be written
[TABLE]
with all P() and the weights real, satisfying both and .
The question arises whether a similar theorem holds for matrices from the unitary group U(). This question is discussed by De Baerdemacker et al. [2] [3]. For this purpose, the subgroup XU() of U() is introduced [4] [5]. It consists of all U() matrices with all line sums (i.e. all row sums and all column sums) equal to 1. Whereas U() is an -dimensional Lie group, the group XU() is only -dimensional. A unitary Birkhoff theorem has been proved for XU() matrices [2] [3]. Remarkable is the fact that the case with an arbitrary prime [3] has been treated in a very different way from the case where is an arbitrary integer [2]. As a result, the decomposition, tailored to prime numbers [3], can be restricted to terms, whereas the general case [2] leads to a summation over all (or at least over ) permutation matrices, albeit with a large number of degrees of freedom. In the present paper, we will treat the two cases in a unified way. Moreover, the unified approach will be applied to the case , i.e. equal to an arbitrary power of an arbitrary prime .
In general, the Birkhoff theorem for unitary matrices is easily proved as follows. Let G() be a finite subgroup of XU().
Lemma 1
If an XU() matrix can be written
[TABLE]
with all G(), then the weights satisfy .
The proof is trivial: all line sums of equal unity; therefore, all line sums of the matrix equal and thus all line sums of the matrix are equal to . As all line sums of are equal to 1, we thus need .
Lemma 2
If every XU() matrix can be written
[TABLE]
with all G(), then there exists a decomposition
[TABLE]
such that not only , but also .
This fact follows from the Klappenecker–Rötteler theorem [6].
2 The group XU()
Remark 1
For sake of convenience, in the present paper, the rows and colums of a matrix are not numbered starting from 1, but instead starting from 0. Thus the upper-left entry of any square matrix is and its lower-right entry is .
We recall that the group XU() is an -dimensional subgroup of the -dimensional unitary group U(). Any member of XU() can be written
[TABLE]
where is a member of U() and where the constant unitary matrix is times a dephased complex Hadamard matrix [7]. Thus (1) constitutes a 1-to-1 mapping between and . Because of
[TABLE]
eqn (1) leads to
[TABLE]
With being unitary, i.e. with , this becomes
[TABLE]
We thus can write the matrix as a sum of matrices:
[TABLE]
where is the van der Waerden matrix, i.e. the doubly stochastic matrix with all entries equal to , and where is an matrix defined by
[TABLE]
The labels and of the matrix run from 1 to , in contrast to the indices and of its entries, which run from 0 to . We thus have such matrices, each having entries. Each entry of the matrix equals the leftmost entry of its row times the uppermost entry of its column. Taking into account (2), one indeed easily checks
[TABLE]
Both the first row and the first column of equal a line of the Hadamard matrix (up to complex conjugation and up to the factor ):
[TABLE]
Because is times a Hadamard matrix, we have and , such that and , and thus, because of (5), we conclude that all entries have unit modulus.
3 Underlying framework
In the present section, we consider an arbitrary doubly transitive group G() of permutation matrices. We denote by the order of the group. We generalize the ideas and computations in Reference [2], where G() is equal to the group P() of all permutation matrices, thus G() being isomorphic to the symmetric group Sn and being equal to .
In the next three sections, we will apply the Lemmas 1 and 2 to three different choices of G():
- •
In case of arbitrary , we choose the group of all permutation matrices (i.e. a group isomorphic to the symmetric group Sn). See Section 4.
- •
In case of equal to some prime , we choose the group of all supercirculant permutation matrices (i.e. a group isomorphic to a semidirect-product group Cn : Cn-1). See Section 5.
- •
In case of equal to some power of some prime (i.e. equal to ), we choose the group of all epicirculant permutation matrices (i.e. a group isomorphic to the general affine group GA()). See Section 6.
The meaning of the words ‘supercirculant’ and ‘epicirculant’ will be made clear below. The mentioned groups are doubly transitive, as it is known that the symmetric group Sn is -transitive, the alternating group An is -transitive, and the affine groups are -transitive [8], in contrast to e.g. the cyclic group Cn, which is only 1-transitive.
In each of the three cases, we will prove below that every XU() matrix can be written as
[TABLE]
with all member of the appropriate group G(). Because of Lemmas 1 and 2, we are then allowed to put the case that both and . For the explicit computation of the weights , we note that the G() matrices form an -dimensional reducible representation of some abstract group G. We assume that G has different irreducible representations. According to Lemma (29.1) of [9], because G is 2-transitive, the -dimensional natural representation is the sum of the 1-dimensional trivial representation and an -dimensional irreducible representation, which we will call the standard representation.
We replace eqn (7) by an eqn concerning one of the irreducible representations of G:
[TABLE]
where is the label of the irrep (), where is the th irreducible representation of , and is an appropriate unitary matrix, with a special mentioning for anf (see further). Here, is the dimension of the th representation. We have such matrix equations (8). Each matrix eqn constitutes scalar equations. We thus have a total of scalar equations with unknowns :
[TABLE]
Solution of this set of equations is:
[TABLE]
We choose for the trivial representation, i.e. the 1-dimensional irreducible representation with all characters equal to 1. We choose for the standard representation, i.e. the -dimensional irreducible representation obtained by applying (1) to the permutation matrix :
[TABLE]
and thus
[TABLE]
In (9), the matrix equals the unit matrix and the matrix equals the lower-right block of
[TABLE]
For the remaining matrices with , we are allowed to choose any unitary matrix of the right dimension . This usually allows a large number of degrees of freedom. Here, we propose two different strategies to take advantage of this freedom.
3.1 First strategy
For each matrix with , we choose the unit matrix. Then (9) becomes
[TABLE]
We take advantage of Shur’s orthogonality relation:
[TABLE]
where is the trivial identity permutation and where while if . Because moreover and , we obtain the explicit expression for the weight:
[TABLE]
The number denotes the character of the element of the group G according to the th representation. It is equal to . In particular, we have .
3.2 Second strategy
The second strategy is only applicable if the group G has an anti-standard irreducible representation, non-equivalent to the standard representation. The anti-standard representation, which we will assign the label (if it exists), has the same characters as the standard representation (with label ), except for a factor if the corresponding permutation is an odd permutation. A necessary condition for the second strategy is
[TABLE]
As in the first strategy, we again choose the unit matrix for and the matrix for . However, in this second strategy, we also choose the matrix for each matrix . For each matrix with , we choose the unit matrix. Then (9) becomes
[TABLE]
Again taking advantage of Shur’s orthogonality relation and , we obtain
[TABLE]
In the second strategy, the group G An thus takes over the role of G and takes over the role of .
4 The case of arbitrary dimension
Lemma 3
Every XU() matrix can be written
[TABLE]
with all P().
The proof is provided by [3], by means of induction on . Combining Lemmas 1, 2, and 3 leads to the unitary Birkhoff theorem:
Theorem 2
Every XU() matrix can be written
[TABLE]
with all P(), such that both and .
4.1 First strategy
We can apply result (11) with . The only possible values of are Tr(Pσ) and thus , with exception of .
4.2 Second strategy
The character tables of the groups S2 and S3 show no anti-standard representation. For , the group Sn has an anti-standard representation. In this case, we can apply result (14) with . The restriction is not surprising, as (12) with is fulfilled neither if nor if .
5 The case of prime dimension
We call an matrix supercirculant iff each row equals row shifted positions to the right. Thus , where addition and subtraction are modulo . We equivalently may write
[TABLE]
We call the pitch of the matrix. If , then the supercirculant matrix is called circulant; if , then the supercirculant matrix is called anticirculant.
If denotes a prime, then the supercirculant permutation matrices are denoted , where is the pitch and (called the shift) is the column with the unit entry in the upper row (i.e. row 0). The unit entries of such permutation matrix thus are located at the positions , , , …, and , where sums are to be taken modulo . Because and are co-prime, the consecutive columns with a 1, i.e. the columns , , , …, and , are all different.
If equals some prime , then we choose for the Hadamard matrix of Section 2 the discrete Fourier transform , with entries
[TABLE]
where is equal to the th root of unity. Thus (4) becomes
[TABLE]
From [3], we know that can be written as a weighted sum of supercirculant permutation matrices:
[TABLE]
where the pitch of the matrix is a function of and . Indeed, the condition
[TABLE]
yields
[TABLE]
and thus . Thus has to satisfy the eqn
[TABLE]
This eqn has one solution:
[TABLE]
where is the inverse of modulo . As is prime, each non-zero integer has exactly one inverse. With , we finally obtain
[TABLE]
The supercirculant permutation matrices form a group S(), subgroup of P() (proof in Appendix A), isomorphic to the semidirect product of the cyclic group of order and the multiplicative group of integers modulo . The group thus is isomorphic to the semidirect product of two cyclic groups:
[TABLE]
a non-Abelian group of order .
Lemma 4
If is prime, then every XU() matrix can be written
[TABLE]
with all S().
The proof is as follows. If is a prime , then all matrices are supercirculant with a pitch modulo . Also the van der Waerden matrix is supercirculant, as it is circulant:
[TABLE]
Hence, according to (3), is a weighted sum of supercirculant permutation matrices.
Combining Lemmas 1, 2, and 4 leads to
Theorem 3
If is prime, then every XU() matrix can be written
[TABLE]
with all S(), such that both and .
5.1 First strategy
We can apply result (11) with . The only possible values of are , [math], and , as demonstrated in Appendix B. Thus we find a unitary Birkhoff decomposition with only terms. For a prime exceeding 3, this number is substantially smaller than the number of Subsection 4.2. The resulting unitary Birkhoff theorem is also slightly stronger than the theorem in [3], where the Birkhoff decomposition consists of terms.
5.2 Second strategy
The group S(2), isomorphic to the cyclic group C2, has only two irreducible representations: the trivial one and the standard one. Also the group S() with equal to an odd prime , has no inequivalent anti-standard representation. Indeed, because all odd supercirculant permutations have non-unit pitch (see Appendix C) and thus have unit trace (see Appendix B) and hence have zero character , all characters of the anti-standard representation equal the corresponding characters of the standard representation. Therefore, the standard and anti-standard representations are equivalent. We conclude that we cannot apply the second strategy of Subsection 3.2. The absence of any inequivalent anti-standard representation is no surprise, as does not satisfy (12).
6 The case of prime-power dimension
For with arbitrary positive , we can choose for of Section 2 the Kronecker product of small (i.e. ) Fourier matrices :
[TABLE]
The matrix has following entries:
[TABLE]
where is the sum of the ditwise product of the -ary numbers and :
[TABLE]
As a consequence, we have
[TABLE]
Among the entries of this matrix, are equal to 1, are equal to , …, and are equal to .
Remark 2
For sake of convenience, below, the rows and the colums of a matrix will sometimes be pointed at, not by a number, but instead by a vector. This will allow matrix computations for the row and column numbers. For this purpose, any number has an associated boldfaced vector , consisting of the dits of the number .
We call a matrix epicirculant if row equals row 0, ‘shifted to the right’ according to
[TABLE]
where a is the vector associated with the column number and where x is a matrix called the pitch matrix, consisting of entries, all . A matrix of the form (16) is automatically epicirculant. It is a weighted sum of epicirculant permutation matrices : we have
[TABLE]
Here, is an appropriate pitch matrix, depending on and . Proof is in Appendix D. We note that vector and matrix constitute a pair, fully specifying an affine transformation [10].
If is a prime power, say , then the epicirculant permutation matrices form a group E(), subgroup of P() (proof in Appendix E), isomorphic to the general affine group GA(), a semidirect product of the direct product of cyclic groups of order and the general linear group GL():
[TABLE]
of order
[TABLE]
We note that GA() is a maximal subgroup of the symmetric group S (O’Nan–Scott theorem) [11].
Each of the subgroups Cp consists of matrices, each a Kronecker product with a total of factors:
[TABLE]
where denotes the unit matrix and a circulant permutation matrix .
Lemma 5
If is a prime power, then every XU() matrix can be written
[TABLE]
with all E().
The proof is as follows. If is a prime power , then all matrices are epicirculant with an invertible pitch matrix x. Also the van der Waerden matrix is epicirculant, as it is circulant:
[TABLE]
where the pitch matrix denotes the unit matrix. Hence, according to (3), is a weighted sum of epicirculant permutation matrices.
Combining Lemmas 1, 2, and 5 leads to
Theorem 4
If is a prime power, then every XU() matrix can be written
[TABLE]
with all E(), such that both and .
6.1 First strategy
We can apply result (11) with given by (18). The only possible values of are , [math], , , , …, and , as demonstrated in Appendix F.
6.2 Second strategy
For and , the general affine groups have, besides the standard representation, also an inequivalent anti-standard representation. For a proof, it suffices to point to a single example of an odd epicirculant permutation matrix with trace different from unity. We choose the matrix
[TABLE]
i.e. the Kronecker product of matrices (i.e. the unit matrix) and the supercirculant matrix . The pitch matrix associated with is the diagonal matrix .
On the one hand, we have the following property of the Kronecker product of two square matrices:
[TABLE]
Therefore, we have . We choose the number such that and thus . This is always possible. Suffice it to choose equal to , where is a generator of the modulo multiplication group [12]. Unfortunately, there is no algorithm known for finding such generator except brute force [13]. Nevertheless, we can prove that Det, without a priori knowing the value of : see Appendix C.
On the other hand, we have . Because , we have and thus . We thus conclude that we can apply result (14) with according to (18).
The above reasoning is not valid for , because, in that case, does not imply . For the case , we will prove that all epicirculant matrices are even permutations. For this purpose, it is sufficient to demonstrate that all group generators are even. From reversible computing [14] [15] [16], it is known that the group GA() is generated by following matrices:
[TABLE]
with a total of (for ) or (for and ) factors . In the context of computing, these matrices represent NOT gates, respectively controlled NOT gates. Applying (19), we have:
[TABLE]
except if . Thus, for , all members of GA() represent even permutations and the second strategy (Subsection 3.2) is not applicable.
This leaves us with the case and . The epicirculant matrices form a group E(4) isomorphic to the symmetric group S4. As stated in Section 4.2, the second strategy is applicable. The results on the applicability of the second strategy are summarized in Table 1.
7 Conclusion
According to [2], every unit-linesum unitary matrix can be decomposed as a weighted sum of the permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights equal unity. Such Birkhoff sum contains terms. In the present paper, we demonstrate the following:
- •
If , then terms suffice.
- •
If with an arbitrary prime and an arbitrary integer, then suffice.
- •
If with an arbitrary odd prime and an integer , then suffice.
For numerical examples, see Table 2.
The case of equal to the product of two different primes is left for further investigation.
Appendix A The group of supercirculant permutation matrices
The supercirculant permutation matrices form a group. Indeed, the product of two such matrices (say and ) yields a third such matrix. In order to prove this fact, we compute the matrix entry at position :
[TABLE]
and hence
[TABLE]
If is a prime , each non-zero number has an inverse number . Applying (26), we find
[TABLE]
The right-hand side being the unit matrix, the result proves that each supercirculant matrix has an inverse matrix that also is supercirculant:
[TABLE]
We conclude by considering two applications of eqn(26):
- •
choosing leads to
[TABLE]
illustrating that the matrices are isomorphic to the addition modulo ;
- •
choosing leads to
[TABLE]
illustrating that the matrices are isomorphic to the multiplication modulo .
Each supercirculant matrix can be decomposed as the product of a zero-shift matrix and a unit-pitch matrix:
[TABLE]
Appendix B The trace of a supercirculant permutation matrix
We compute the trace of the supercirculant permutation matrix :
[TABLE]
If the eqn
[TABLE]
is fulfilled, then the corresponding number points to a unit entry in position of the matrix . We notice:
- •
If , then is the one and only solution;
- •
if and , then the eqn has no solution ;
- •
if and , then may have any value from .
Thus we conclude:
- •
Tr, if ,
- •
Tr, if , and
- •
Tr.
Appendix C The determinant of a supercirculant permutation matrix
As mentioned in Appendix A, each supercirculant matrix can be decomposed as follows:
[TABLE]
Hence:
[TABLE]
We have and therefore . If is odd, then , such that . In other words: for odd primes, all of the circulant permutation matrices have unit determinant. The situation is different for the matrices . Half of them have unit determinant and half of them have determinant equal to . In order to prove this fact, the key observation is the fact that the cyclic group is Abelian; so there exists a similarity transformation that diagonalizes all matrices . We now prove that the following matrix serves our purpose:
[TABLE]
where is the th root of unity, and the function gives the ‘position’ of the number in the cyclic group (multiplicative group modulo ), as a power of the (a priori unknown) generator , i.e.
[TABLE]
From this definition, the following interesting properties of can be deduced:
[TABLE]
These properties are key in the following derivation. We compute the similarity transformation given by . Because both and are block diagonal with a single 1 in the upper-left corner, we only need to compute the lower-right part:
[TABLE]
This result leads to two conclusions:
- •
By choosing , we find that and thus that is unitary.
- •
By choosing arbitrary, we find that the matrix has the eigenvalues plus an additional 1 from the upper-left matrix block.
The determinant is just the product of all eigenvalues:
[TABLE]
Now, if is an odd prime, then , such that , which proves that the sign of the determinant of alternates in the chain of successive elements of Cp-1. More in particular, the position of always is , so we have .
We note that the above results for both and are only valid for odd primes . If is even, i.e. if , then there exist only two supercirculant matrices S_{0,1}=\tiny\left(\begin{array}[]{cc}1&0\\ 0&1\end{array}\right), with determinant equal to , and S_{1,1}=\tiny\left(\begin{array}[]{cc}0&1\\ 1&0\end{array}\right), with determinant equal to .
Appendix D The pitch matrix
In (17), the epicirculant matrix needs a unit entry in position if
[TABLE]
implying
[TABLE]
or
[TABLE]
and thus
[TABLE]
or
[TABLE]
and thus
[TABLE]
We fulfil this condition by the set of non-coupled eqns
[TABLE]
For each eqn, we expect solutions (as we can choose out of the dits arbitrarily from ). However, many solutions have to be rejected. Indeed, each column of the matrix in (17) should contain one and only one unit entry. For this purpose, it is necessary and sufficient that the matrix x is invertible. Proof is as follows. We require that for any two different row numbers () the unit entry of the permutation matrix is in another column:
[TABLE]
and thus . This requires that for any non-zero number we have
[TABLE]
This, in turn, requires that the rows of x are linearly independent and thus that the matrix x is invertible.
We now prove that, for any pair , the set (27) has at least one acceptable solution, i.e. a solution such that the matrix x is invertible. Indeed:
- •
Because both and are non-zero, at least one dit is non-zero and at least one dit is non-zero. Let be the least-significant non-zero dit of ; let be the least-significant non-zero dit of .
- •
We choose all dits , except the dits , , and . Thus eqns (27) become
[TABLE]
- •
For and , we choose . Further we choose and . Thus eqns (28) become
[TABLE]
which lead to a single solution set .
The resulting pitch matrix x consists of a non-zero diagonal, one non-zero row, and one extra unit entry. E.g. for , , and , we have:
[TABLE]
We note that here Det(x) equals . In general, we have
[TABLE]
Because Det(x) , we have that x is invertible.
Appendix E The group of epicirculant permutation matrices
The epicirculant permutation matrices form a group. An arbitrary entry (at location ) of such matrix is . The product of two such matrices yields a third such matrix. Indeed:
[TABLE]
and hence
[TABLE]
Straightforward application of this result leads to
[TABLE]
The right-hand side being the unit matrix, the result proves that each epicirculant matrix has an inverse matrix that also is epicirculant:
[TABLE]
Each epicirculant matrix can be decomposed as the product of a matrix with zero shift vector a and a matrix with unit pitch matrix x:
[TABLE]
Appendix F The trace of an epicirculant permutation matrix
We compute the trace of the epicirculant permutation matrix :
[TABLE]
If the eqn
[TABLE]
is fulfilled, then the corresponding number points to a unit entry in position of the matrix . Here, 1 denotes the unit matrix. We notice:
- •
If is invertible, then is the one and only solution;
- •
if and , then the eqn has no solutions ;
- •
if and , then may have any value from ;
- •
if is neither invertible nor zero, then has rank with and can have as many values as there are solutions of the eqn , i.e. as the size of the kernel of , i.e. .
Thus we conclude:
- •
Tr, if ,
- •
Tr, and
- •
Tr, if has rank .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Birkhoff, “Tres observaciones sobre el algebra lineal”, Universidad Nacional de Tucumán: Revista Matemáticas y Física Teórica , vol. 5 (1946), pp. 147-151.
- 2[2] S. De Baerdemacker, A. De Vos, L. Chen, and L. Yu, “The Birkhoff theorem for unitary matrices of arbitrary dimension”, Linear Algebra and its Applications , vol. 514 (2017), pp. 151-164.
- 3[3] A. De Vos and S. De Baerdemacker, “The Birkhoff theorem for unitary matrices of prime dimension”, Linear Algebra and its Applications , vol. 493 (2016), pp. 455-468.
- 4[4] A. De Vos and S. De Baerdemacker, “The NEGATOR as a basic building block for quantum circuits”, Open Systems & Information Dynamics , vol. 20 (2013), 1350004.
- 5[5] A. De Vos and S. De Baerdemacker, “On two subgroups of U( n 𝑛 n ), useful for quantum computing”, Journal of Physics: Conference Series: Proceedings of the 30 th International Colloquium on Group-theoretical Methods in Physics, Gent (July 2014) , vol. 597 (2015), 012030.
- 6[6] A. Klappenecker and M. Rötteler, “Quantum software reusability”, International Journal of Foundations of Computer Science , vol. 14 (2003), pp. 777-796.
- 7[7] W. Tadej and K. Życzkowski, “A concise guide to complex Hadamard matrices”, Open Systems & Information Dynamics , vol. 13 (2006), pp. 133-177.
- 8[8] mathworld.wolfram.com/Transitive Group.html (2018).
