The Birkhoff theorem for unitary matrices of prime-power dimension

Alexis De Vos; Stijn De Baerdemacker

arXiv:1812.08833·math-ph·December 24, 2018

The Birkhoff theorem for unitary matrices of prime-power dimension

Alexis De Vos, Stijn De Baerdemacker

PDF

Open Access

TL;DR

This paper extends the unitary Birkhoff theorem for matrices of prime-power dimension, showing that a smaller set of permutation matrices suffices for the decomposition, linked to the affine group GA(w,p).

Contribution

It demonstrates that for prime-power dimensions, the Birkhoff decomposition can be achieved with a subset of permutation matrices related to the affine group, reducing complexity.

Findings

01

Decomposition uses epicirculant permutation matrices.

02

Permutation matrices form a group isomorphic to GA(w,p).

03

Reduction from n! to a smaller group of permutation matrices.

Abstract

The unitary Birkhoff theorem states that any unitary matrix with all row sums and all column sums equal unity can be decomposed as a weighted sum of permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights are equal to unity. If the dimension~ $n$ of the unitary matrix equals a power of a prime $p$ , i.e.\ if $n = p^{w}$ , then the Birkhoff decomposition does not need all $n!$ possible permutation matrices, as the epicirculant permutation matrices suffice. This group of permutation matrices is isomorphic to the general affine group GA( $w, p$ ) of order only $p^{w} (p^{w} - 1) (p^{w} - p) ... (p^{w} - p^{w - 1}) ≪ (p^{w})!$ .

Tables2

Table 1. Table 1: Applicability of the second strategy for the Birkhoff decomposition of an XU( n 𝑛 n ) matrix with n = p w 𝑛 superscript 𝑝 𝑤 n=p^{w} .

	$p = 2$	$p \geq 3$
$w = 1$	no	no
$w = 2$	yes	yes
$w \geq 3$	no	yes

Table 2. Table 2: Number of Birkhoff terms in the decomposition of an arbitrary n × n 𝑛 𝑛 n\times n unit-linesum unitary matrix.

$n$	1	2	3	4	5	6	7	8	9	10	11
	1	2	6	12	20	360	42	1,344	216	1,814,400	110

Equations195

D = σ \sum c_{σ} P_{σ}

D = σ \sum c_{σ} P_{σ}

X = σ \sum c_{σ} G_{σ}

X = σ \sum c_{σ} G_{σ}

X = σ \sum a_{σ} G_{σ}

X = σ \sum a_{σ} G_{σ}

X = σ \sum b_{σ} G_{σ},

X = σ \sum b_{σ} G_{σ},

X=T\ \left(\begin{array}[]{cc}1&\\ &U\end{array}\right)\ T^{-1}\ ,

X=T\ \left(\begin{array}[]{cc}1&\\ &U\end{array}\right)\ T^{-1}\ ,

T_{j, 0} = T_{0, k} = 1/ n,

T_{j, 0} = T_{0, k} = 1/ n,

X_{k, l} = \frac{1}{n} + r = 1 \sum n - 1 s = 1 \sum n - 1 T_{k, r} U_{r - 1, s - 1} (T^{- 1})_{s, l} .

X_{k, l} = \frac{1}{n} + r = 1 \sum n - 1 s = 1 \sum n - 1 T_{k, r} U_{r - 1, s - 1} (T^{- 1})_{s, l} .

X_{k, l} = \frac{1}{n} + r = 1 \sum n - 1 s = 1 \sum n - 1 U_{r - 1, s - 1} T_{k, r} \overline{T_{l, s}} .

X_{k, l} = \frac{1}{n} + r = 1 \sum n - 1 s = 1 \sum n - 1 U_{r - 1, s - 1} T_{k, r} \overline{T_{l, s}} .

X = W + \frac{1}{n} r = 1 \sum n - 1 s = 1 \sum n - 1 U_{r - 1, s - 1} M_{r, s},

X = W + \frac{1}{n} r = 1 \sum n - 1 s = 1 \sum n - 1 U_{r - 1, s - 1} M_{r, s},

(M_{r, s})_{k, l} = n T_{k, r} \overline{T_{l, s}} .

(M_{r, s})_{k, l} = n T_{k, r} \overline{T_{l, s}} .

(M_{r, s})_{0, l} (M_{r, s})_{k, 0} = (M_{r, s})_{k, l} .

(M_{r, s})_{0, l} (M_{r, s})_{k, 0} = (M_{r, s})_{k, l} .

(M_{r, s})_{0, l}

(M_{r, s})_{0, l}

(M_{r, s})_{k, 0}

X = σ \sum c_{σ} G_{σ}

X = σ \sum c_{σ} G_{σ}

U^{(ν)} = σ \sum c_{σ} D_{σ}^{(ν)},

U^{(ν)} = σ \sum c_{σ} D_{σ}^{(ν)},

σ \sum c_{σ} (D^{(ν)} (σ))_{k, l} = (U^{(ν)})_{k, l} .

σ \sum c_{σ} (D^{(ν)} (σ))_{k, l} = (U^{(ν)})_{k, l} .

c_{σ}

c_{σ}

P_{\sigma}=T\ \left(\begin{array}[]{cc}1&\\ &D^{(1)}(\sigma)\end{array}\right)\ T^{-1}

P_{\sigma}=T\ \left(\begin{array}[]{cc}1&\\ &D^{(1)}(\sigma)\end{array}\right)\ T^{-1}

\left(\begin{array}[]{cc}1&\\ &D^{(1)}(\sigma)\end{array}\right)=T^{-1}P_{\sigma}T\ .

\left(\begin{array}[]{cc}1&\\ &D^{(1)}(\sigma)\end{array}\right)=T^{-1}P_{\sigma}T\ .

\left(\begin{array}[]{cc}1&\\ &U\end{array}\right)=T^{-1}XT\ .

\left(\begin{array}[]{cc}1&\\ &U\end{array}\right)=T^{-1}XT\ .

c_{σ} = \frac{1}{N} [n_{0} \mbox T r (D^{(0) †} (σ)) + n_{1} \mbox T r (D^{(1) †} (σ) U) + ν = 2 \sum μ - 1 n_{ν} \mbox T r (D^{(ν) †} (σ))] .

c_{σ} = \frac{1}{N} [n_{0} \mbox T r (D^{(0) †} (σ)) + n_{1} \mbox T r (D^{(1) †} (σ) U) + ν = 2 \sum μ - 1 n_{ν} \mbox T r (D^{(ν) †} (σ))] .

ν \sum n_{ν} \mbox T r (D^{(ν) †} (σ)) = ν \sum n_{ν} \mbox T r (D^{(ν) †} (σ) D^{(ν)} (ϵ)) = δ_{σ} N,

ν \sum n_{ν} \mbox T r (D^{(ν) †} (σ)) = ν \sum n_{ν} \mbox T r (D^{(ν) †} (σ) D^{(ν)} (ϵ)) = δ_{σ} N,

c_{σ} = δ_{σ} + \frac{n - 1}{N} \mbox T r (D^{(1)} (σ^{- 1}) U) - \frac{n - 1}{N} χ^{(1)} (σ^{- 1}) .

c_{σ} = δ_{σ} + \frac{n - 1}{N} \mbox T r (D^{(1)} (σ^{- 1}) U) - \frac{n - 1}{N} χ^{(1)} (σ^{- 1}) .

N \geq 2 + 2 (n - 1)^{2} .

N \geq 2 + 2 (n - 1)^{2} .

c_{σ} = \frac{1}{N} [n_{0} \mbox T r (D^{(0) †} (σ))

c_{σ} = \frac{1}{N} [n_{0} \mbox T r (D^{(0) †} (σ))

c_{σ} = δ_{σ}

c_{σ} = δ_{σ}

= 0

X = σ \sum c_{σ} P_{σ}

X = σ \sum c_{σ} P_{σ}

X = σ \sum c_{σ} P_{σ}

X = σ \sum c_{σ} P_{σ}

A_{0, a} = A_{k, a + k x} .

A_{0, a} = A_{k, a + k x} .

F_{k, l} = \frac{1}{p} ω^{k l},

F_{k, l} = \frac{1}{p} ω^{k l},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicsgraph theory and CDMA systems · Coding theory and cryptography · Finite Group Theory Research

Full text

The Birkhoff theorem

for unitary matrices of prime-power dimension

Alexis De Vos and Stijn De Baerdemacker

Abstract

The unitary Birkhoff theorem states that any unitary matrix with all row sums and all column sums equal unity can be decomposed as a weighted sum of permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights are equal to unity. If the dimension $n$ of the unitary matrix equals a power of a prime $p$ , i.e. if $n=p^{w}$ , then the Birkhoff decomposition does not need all $n!$ possible permutation matrices, as the epicirculant permutation matrices suffice. This group of permutation matrices is isomorphic to the general affine group GA( $w,p$ ) of order only $p^{w}(p^{w}-1)(p^{w}-p)...(p^{w}-p^{w-1})\ll\left(p^{w}\right)!$ .

1 Introduction

Let D( $n$ ) be the semigroup of $n\times n$ doubly stochastic matrices; let P( $n$ ) be the group of $n\times n$ permutation matrices. Birkhoff [1] has demonstrated

Theorem 1

Every D( $n$ ) matrix $D$ can be written

[TABLE]

with all $P_{\sigma}\in$ P( $n$ ) and the weights $c_{\sigma}$ real, satisfying both $0\leq c_{\sigma}\leq 1$ and $\sum_{\sigma}c_{\sigma}=1$ .

The question arises whether a similar theorem holds for matrices from the unitary group U( $n$ ). This question is discussed by De Baerdemacker et al. [2] [3]. For this purpose, the subgroup XU( $n$ ) of U( $n$ ) is introduced [4] [5]. It consists of all U( $n$ ) matrices with all line sums (i.e. all row sums and all column sums) equal to 1. Whereas U( $n$ ) is an $n^{2}$ -dimensional Lie group, the group XU( $n$ ) is only $(n-1)^{2}$ -dimensional. A unitary Birkhoff theorem has been proved for XU( $n$ ) matrices [2] [3]. Remarkable is the fact that the case $n=p$ with $p$ an arbitrary prime [3] has been treated in a very different way from the case where $n$ is an arbitrary integer [2]. As a result, the decomposition, tailored to prime numbers [3], can be restricted to $n^{2}$ terms, whereas the general case [2] leads to a summation over all $n!$ (or at least over $n!/2$ ) permutation matrices, albeit with a large number of degrees of freedom. In the present paper, we will treat the two cases in a unified way. Moreover, the unified approach will be applied to the case $n=p^{w}$ , i.e. $n$ equal to an arbitrary power $w$ of an arbitrary prime $p$ .

In general, the Birkhoff theorem for unitary matrices is easily proved as follows. Let G( $n$ ) be a finite subgroup of XU( $n$ ).

Lemma 1

If an XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $G_{\sigma}\in$ G( $n$ ), then the weights $c_{\sigma}$ satisfy $\sum_{\sigma}c_{\sigma}=1$ .

The proof is trivial: all line sums of $G_{\sigma}$ equal unity; therefore, all line sums of the matrix $c_{\sigma}G_{\sigma}$ equal $c_{\sigma}$ and thus all line sums of the matrix $\sum_{\sigma}c_{\sigma}G_{\sigma}$ are equal to $\sum_{\sigma}c_{\sigma}$ . As all line sums of $X$ are equal to 1, we thus need $\sum_{\sigma}c_{\sigma}=1$ .

Lemma 2

If every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $G_{\sigma}\in$ G( $n$ ), then there exists a decomposition

[TABLE]

such that not only $\sum_{\sigma}b_{\sigma}=1$ , but also $\sum_{\sigma}|b_{\sigma}|^{2}=1$ .

This fact follows from the Klappenecker–Rötteler theorem [6].

2 The group XU( $n$ )

Remark 1

For sake of convenience, in the present paper, the rows and colums of a matrix are not numbered starting from 1, but instead starting from 0. Thus the upper-left entry of any $m\times m$ square matrix $A$ is $A_{0,0}$ and its lower-right entry is $A_{m-1,m-1}$ .

We recall that the group XU( $n$ ) is an $(n-1)^{2}$ -dimensional subgroup of the $n^{2}$ -dimensional unitary group U( $n$ ). Any member $X$ of XU( $n$ ) can be written

[TABLE]

where $U$ is a member of U( $n-1$ ) and where the constant unitary matrix $T$ is $1/\sqrt{n}$ times a dephased complex Hadamard matrix [7]. Thus (1) constitutes a 1-to-1 mapping between $X$ and $U$ . Because of

[TABLE]

eqn (1) leads to

[TABLE]

With $T$ being unitary, i.e. with $T^{-1}=T^{\dagger}$ , this becomes

[TABLE]

We thus can write the matrix $X$ as a sum of $1+(n-1)^{2}$ matrices:

[TABLE]

where $W$ is the $n\times n$ van der Waerden matrix, i.e. the doubly stochastic matrix with all entries equal to $\frac{1}{n}$ , and where $M_{r,s}$ is an $n\times n$ matrix defined by

[TABLE]

The labels $r$ and $s$ of the matrix $M_{r,s}$ run from 1 to $n-1$ , in contrast to the indices $k$ and $l$ of its entries, which run from 0 to $n-1$ . We thus have $(n-1)^{2}$ such matrices, each having $n^{2}$ entries. Each entry of the matrix $M_{r,s}$ equals the leftmost entry of its row times the uppermost entry of its column. Taking into account (2), one indeed easily checks

[TABLE]

Both the first row and the first column of $M_{r,s}$ equal a line of the Hadamard matrix $T$ (up to complex conjugation and up to the factor $\sqrt{n}\,$ ):

[TABLE]

Because $T$ is $1/\sqrt{n}$ times a Hadamard matrix, we have $|T_{l,s}|=1/\sqrt{n}$ and $|T_{k,r}|=1/\sqrt{n}$ , such that $|(M_{r,s})_{0,l}|=1$ and $|(M_{r,s})_{k,0}|=1$ , and thus, because of (5), we conclude that all entries $(M_{r,s})_{k,l}$ have unit modulus.

3 Underlying framework

In the present section, we consider an arbitrary doubly transitive group G( $n$ ) of $n\times n$ permutation matrices. We denote by $N$ the order of the group. We generalize the ideas and computations in Reference [2], where G( $n$ ) is equal to the group P( $n$ ) of all $n\times n$ permutation matrices, thus G( $n$ ) being isomorphic to the symmetric group Sn and $N$ being equal to $n!$ .

In the next three sections, we will apply the Lemmas 1 and 2 to three different choices of G( $n$ ):

•

In case of arbitrary $n$ , we choose the group of all $n\times n$ permutation matrices (i.e. a group isomorphic to the symmetric group Sn). See Section 4.

•

In case of $n$ equal to some prime $p$ , we choose the group of all $n\times n$ supercirculant permutation matrices (i.e. a group isomorphic to a semidirect-product group Cn : Cn-1). See Section 5.

•

In case of $n$ equal to some power $w$ of some prime $p$ (i.e. equal to $p^{w}$ ), we choose the group of all $n\times n$ epicirculant permutation matrices (i.e. a group isomorphic to the general affine group GA( $w,p$ )). See Section 6.

The meaning of the words ‘supercirculant’ and ‘epicirculant’ will be made clear below. The mentioned groups are doubly transitive, as it is known that the symmetric group Sn is $n$ -transitive, the alternating group An is $(n-2)$ -transitive, and the affine groups are $2$ -transitive [8], in contrast to e.g. the cyclic group Cn, which is only 1-transitive.

In each of the three cases, we will prove below that every XU( $n$ ) matrix $X$ can be written as

[TABLE]

with all $G_{\sigma}$ member of the appropriate group G( $n$ ). Because of Lemmas 1 and 2, we are then allowed to put the case that both $\sum_{\sigma}c_{\sigma}=1$ and $\sum_{\sigma}|c_{\sigma}|^{2}=1$ . For the explicit computation of the weights $c_{\sigma}$ , we note that the G( $n$ ) matrices form an $n$ -dimensional reducible representation of some abstract group G. We assume that G has $\mu$ different irreducible representations. According to Lemma (29.1) of [9], because G is 2-transitive, the $n$ -dimensional natural representation is the sum of the 1-dimensional trivial representation and an $(n-1)$ -dimensional irreducible representation, which we will call the standard representation.

We replace eqn (7) by an eqn concerning one of the $\mu$ irreducible representations of G:

[TABLE]

where $\nu$ is the label of the irrep ( $0\leq\nu\leq\mu-1$ ), where $D^{(\nu)}_{\sigma}$ is the $\nu$ th irreducible representation of $G_{\sigma}$ , and $U^{(\nu)}$ is an appropriate $n_{\nu}\times n_{\nu}$ unitary matrix, with a special mentioning for $\nu=0$ anf $\nu=1$ (see further). Here, $n_{\nu}$ is the dimension of the $\nu$ th representation. We have $\mu$ such matrix equations (8). Each matrix eqn constitutes $n_{\nu}^{2}$ scalar equations. We thus have a total of $\sum_{\nu=0}^{\mu-1}n_{\nu}^{2}=N$ scalar equations with $N$ unknowns $c_{\sigma}$ :

[TABLE]

Solution of this set of equations is:

[TABLE]

We choose for $\nu=0$ the trivial representation, i.e. the 1-dimensional irreducible representation with all characters equal to 1. We choose for $\nu=1$ the standard representation, i.e. the $(n-1)$ -dimensional irreducible representation obtained by applying (1) to the permutation matrix $P_{\sigma}$ :

[TABLE]

and thus

[TABLE]

In (9), the matrix $U^{(0)}(\sigma)$ equals the $1\times 1$ unit matrix and the matrix $U^{(1)}(\sigma)$ equals the $(n-1)\times(n-1)$ lower-right block of

[TABLE]

For the remaining matrices $U^{(\nu)}(\sigma)$ with $2\leq\nu\leq\mu-1$ , we are allowed to choose any unitary matrix of the right dimension $n_{\nu}$ . This usually allows a large number of degrees of freedom. Here, we propose two different strategies to take advantage of this freedom.

3.1 First strategy

For each matrix $U^{(\nu)}(\sigma)$ with $2\leq\nu\leq\mu-1$ , we choose the $n_{\nu}\times n_{\nu}$ unit matrix. Then (9) becomes

[TABLE]

We take advantage of Shur’s orthogonality relation:

[TABLE]

where $\epsilon$ is the trivial identity permutation and where $\delta_{\epsilon}=1$ while $\delta_{\sigma}=0$ if $\sigma\neq\epsilon$ . Because moreover $D^{(1)\,\dagger}(\sigma)=D^{(1)}(\sigma^{-1})$ and $n_{1}=n-1$ , we obtain the explicit expression for the weight:

[TABLE]

The number $\chi^{(\nu)}(G)$ denotes the character of the element $G$ of the group G according to the $\nu$ th representation. It is equal to $\mbox{Tr}(D^{(\nu)}(G))$ . In particular, we have $\mbox{Tr}(D^{(1)}(G))=\mbox{Tr}(G)-1$ .

3.2 Second strategy

The second strategy is only applicable if the group G has an anti-standard irreducible representation, non-equivalent to the standard representation. The anti-standard representation, which we will assign the label $\nu=2$ (if it exists), has the same characters as the standard representation (with label $\nu=1$ ), except for a factor $-1$ if the corresponding permutation is an odd permutation. A necessary condition for the second strategy is

[TABLE]

As in the first strategy, we again choose the $1\times 1$ unit matrix for $U^{(0)}(\sigma)$ and the $(n-1)\times(n-1)$ matrix $U$ for $U^{(1)}(\sigma)$ . However, in this second strategy, we also choose the matrix $U$ for each matrix $U^{(2)}(\sigma)$ . For each matrix $U^{(\nu)}(\sigma)$ with $3\leq\nu\leq\mu-1$ , we choose the $n_{\nu}\times n_{\nu}$ unit matrix. Then (9) becomes

[TABLE]

Again taking advantage of Shur’s orthogonality relation and $n_{1}=n_{2}=n-1$ , we obtain

[TABLE]

In the second strategy, the group G $\cap$ An thus takes over the role of G and $N/2$ takes over the role of $N$ .

4 The case of arbitrary dimension $n$

Lemma 3

Every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $P_{\sigma}\in$ P( $n$ ).

The proof is provided by [3], by means of induction on $n$ . Combining Lemmas 1, 2, and 3 leads to the unitary Birkhoff theorem:

Theorem 2

Every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $P_{\sigma}\in$ P( $n$ ), such that both $\sum_{\sigma}c_{\sigma}=1$ and $\sum_{\sigma}|c_{\sigma}|^{2}=1$ .

4.1 First strategy

We can apply result (11) with $N=n!$ . The only possible values of $\chi^{(1)}$ are Tr(Pσ) $-1$ and thus $-1,0,1,2,...,n-1$ , with exception of $n-2$ .

4.2 Second strategy

The character tables of the groups S2 and S3 show no anti-standard representation. For $n>3$ , the group Sn has an anti-standard representation. In this case, we can apply result (14) with $N=n!$ . The restriction $n>3$ is not surprising, as (12) with $N=n!$ is fulfilled neither if $n=2$ nor if $n=3$ .

5 The case of prime dimension $n=p$

We call an $n\times n$ matrix $A$ supercirculant iff each row $k$ equals row $k-1$ shifted $x$ positions to the right. Thus $A_{k,l}=A_{k-1,l-x}$ , where addition and subtraction are modulo $n$ . We equivalently may write

[TABLE]

We call $x$ the pitch of the matrix. If $x=1$ , then the supercirculant matrix is called circulant; if $x=n-1$ , then the supercirculant matrix is called anticirculant.

If $p$ denotes a prime, then the $p\times p$ supercirculant permutation matrices are denoted $S_{a,x}$ , where $x$ is the pitch and $a$ (called the shift) is the column with the unit entry in the upper row (i.e. row 0). The unit entries of such $p\times p$ permutation matrix thus are located at the $p$ positions $(0,a)$ , $(1,a+x)$ , $(2,a+2x)$ , …, and $(p-1,a+(p-1)x)$ , where sums are to be taken modulo $p$ . Because $x$ and $p$ are co-prime, the consecutive columns with a 1, i.e. the columns $a$ , $a+x$ , $a+2x$ , …, and $a+(p-1)x$ , are all different.

If $n$ equals some prime $p$ , then we choose for the $p\times p$ Hadamard matrix $T$ of Section 2 the $p\times p$ discrete Fourier transform $F$ , with entries

[TABLE]

where $\omega$ is equal to the $p\,$ th root of unity. Thus (4) becomes

[TABLE]

From [3], we know that $M$ can be written as a weighted sum of $p$ supercirculant permutation matrices:

[TABLE]

where the pitch $x$ of the matrix $S_{a,x}$ is a function of $r$ and $s$ . Indeed, the condition

[TABLE]

yields

[TABLE]

and thus $r-xs=0$ . Thus $x$ has to satisfy the eqn

[TABLE]

This eqn has one solution:

[TABLE]

where $s^{-1}$ is the inverse of $s$ modulo $p$ . As $p$ is prime, each non-zero integer has exactly one inverse. With $(M_{r,s})_{0,a}=\omega^{-as}$ , we finally obtain

[TABLE]

The supercirculant $p\times p$ permutation matrices form a group S( $p$ ), subgroup of P( $p$ ) (proof in Appendix A), isomorphic to the semidirect product of the cyclic group of order $p$ and the multiplicative group of integers modulo $p$ . The group thus is isomorphic to the semidirect product of two cyclic groups:

[TABLE]

a non-Abelian group of order $p(p-1)$ .

Lemma 4

If $n$ is prime, then every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $S_{\sigma}\in$ S( $n$ ).

The proof is as follows. If $n$ is a prime $p$ , then all matrices $M_{r,s}$ are supercirculant with a pitch $x=rs^{-1}$ modulo $p$ . Also the van der Waerden matrix $W$ is supercirculant, as it is circulant:

[TABLE]

Hence, according to (3), $X$ is a weighted sum of supercirculant permutation matrices.

Combining Lemmas 1, 2, and 4 leads to

Theorem 3

If $n$ is prime, then every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $S_{\sigma}\in$ S( $n$ ), such that both $\sum_{\sigma}c_{\sigma}=1$ and $\sum_{\sigma}|c_{\sigma}|^{2}=1$ .

5.1 First strategy

We can apply result (11) with $N=p(p-1)$ . The only possible values of $\chi^{(1)}$ are $-1$ , [math], and $p-1$ , as demonstrated in Appendix B. Thus we find a unitary Birkhoff decomposition with only $p(p-1)$ terms. For a prime exceeding 3, this number is substantially smaller than the number $p!/2$ of Subsection 4.2. The resulting unitary Birkhoff theorem is also slightly stronger than the theorem in [3], where the Birkhoff decomposition consists of $p^{2}$ terms.

5.2 Second strategy

The group S(2), isomorphic to the cyclic group C2, has only two irreducible representations: the trivial one and the standard one. Also the group S( $n$ ) with $n$ equal to an odd prime $p$ , has no inequivalent anti-standard representation. Indeed, because all odd supercirculant permutations have non-unit pitch (see Appendix C) and thus have unit trace (see Appendix B) and hence have zero character $\chi^{(1)}$ , all characters of the anti-standard representation equal the corresponding characters of the standard representation. Therefore, the standard and anti-standard representations are equivalent. We conclude that we cannot apply the second strategy of Subsection 3.2. The absence of any inequivalent anti-standard representation is no surprise, as $N=n(n-1)$ does not satisfy (12).

6 The case of prime-power dimension $n=p^{w}$

For $n=p^{w}$ with arbitrary positive $w$ , we can choose for $T$ of Section 2 the Kronecker product of $w$ small (i.e. $p\times p$ ) Fourier matrices $F$ :

[TABLE]

The $n\times n$ matrix $T$ has following entries:

[TABLE]

where $f(x,y)$ is the sum of the ditwise product of the $p$ -ary numbers $x$ and $y$ :

[TABLE]

As a consequence, we have

[TABLE]

Among the $n^{2}$ entries of this matrix, $n^{2}/p$ are equal to 1, $n^{2}/p$ are equal to $\omega$ , …, and $n^{2}/p$ are equal to $\omega^{p-1}$ .

Remark 2

For sake of convenience, below, the rows and the colums of a matrix will sometimes be pointed at, not by a number, but instead by a vector. This will allow matrix computations for the row and column numbers. For this purpose, any number $z=z_{0}+z_{1}p+z_{2}p^{2}...+z_{w-1}p^{w-1}$ has an associated boldfaced $w\times 1$ vector ${\bf z}=(z_{0},z_{1},z_{2},...,z_{w-1})^{T}$ , consisting of the $w$ dits of the number $z$ .

We call a matrix $A$ epicirculant if row $k$ equals row 0, ‘shifted to the right’ according to

[TABLE]

where a is the $w\times 1$ vector associated with the column number $a$ and where x is a $w\times w$ matrix called the pitch matrix, consisting of $w^{2}$ entries, all $\in\{0,1,...,p-1\}$ . A matrix of the form (16) is automatically epicirculant. It is a weighted sum of epicirculant permutation matrices $E$ : we have

[TABLE]

Here, ${\bf x}$ is an appropriate $w\times w$ pitch matrix, depending on $r$ and $s$ . Proof is in Appendix D. We note that vector ${\bf a}$ and matrix ${\bf x}$ constitute a pair, fully specifying an affine transformation [10].

If $n$ is a prime power, say $n=p^{w}$ , then the epicirculant $p^{w}\times p^{w}$ permutation matrices form a group E( $n$ ), subgroup of P( $n$ ) (proof in Appendix E), isomorphic to the general affine group GA( $w,p$ ), a semidirect product of the direct product of cyclic groups of order $p$ and the general linear group GL( $w,p$ ):

[TABLE]

of order

[TABLE]

We note that GA( $w,p$ ) is a maximal subgroup of the symmetric group S ${}_{p^{w}}$ (O’Nan–Scott theorem) [11].

Each of the $w$ subgroups Cp consists of $p$ matrices, each a Kronecker product with a total of $w$ factors:

[TABLE]

where $I$ denotes the $p\times p$ unit matrix and $M$ a $p\times p$ circulant permutation matrix $S_{a,1}$ .

Lemma 5

If $n$ is a prime power, then every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $E_{\sigma}\in$ E( $n$ ).

The proof is as follows. If $n$ is a prime power $p^{w}$ , then all matrices $M_{r,s}$ are epicirculant with an invertible pitch matrix x. Also the van der Waerden matrix $W$ is epicirculant, as it is circulant:

[TABLE]

where the pitch matrix ${\bf 1}$ denotes the $w\times w$ unit matrix. Hence, according to (3), $X$ is a weighted sum of epicirculant permutation matrices.

Combining Lemmas 1, 2, and 5 leads to

Theorem 4

If $n$ is a prime power, then every XU( $n$ ) matrix $X$ can be written

[TABLE]

with all $E_{\sigma}\in$ E( $n$ ), such that both $\sum_{\sigma}c_{\sigma}=1$ and $\sum_{\sigma}|c_{\sigma}|^{2}=1$ .

6.1 First strategy

We can apply result (11) with $N$ given by (18). The only possible values of $\chi^{(1)}$ are $-1$ , [math], $p-1$ , $p^{2}-1$ , $p^{3}-1$ , …, and $p^{w}-1$ , as demonstrated in Appendix F.

6.2 Second strategy

For $w>1$ and $p>2$ , the general affine groups have, besides the standard representation, also an inequivalent anti-standard representation. For a proof, it suffices to point to a single example of an odd epicirculant permutation matrix with trace different from unity. We choose the $p^{w}\times p^{w}$ matrix

[TABLE]

i.e. the Kronecker product of $w-1$ matrices $I$ (i.e. the $p\times p$ unit matrix) and the $p\times p$ supercirculant matrix $M=S_{\,0,q}$ . The $w\times w$ pitch matrix associated with $E$ is the diagonal matrix $\mbox{diag}(q,1,1,...,1)$ .

On the one hand, we have the following property of the Kronecker product of two square matrices:

[TABLE]

Therefore, we have $\mbox{Det}(E)=\mbox{Det}(M)^{(p^{w-1})}$ . We choose the number $q$ such that $\mbox{Det}(M)=-1$ and thus $\mbox{Det}(E)=-1$ . This is always possible. Suffice it to choose $q$ equal to $g(p)$ , where $g$ is a generator of the modulo $p$ multiplication group [12]. Unfortunately, there is no algorithm known for finding such generator except brute force [13]. Nevertheless, we can prove that Det $(S_{0,g(p)})=-1$ , without a priori knowing the value of $g(p)$ : see Appendix C.

On the other hand, we have $\mbox{Tr}(E)=p^{w-1}\,\mbox{Tr}(M)=p^{w-1}\,1=p^{w-1}$ . Because $w>1$ , we have $\mbox{Tr}(E)>1$ and thus $\chi^{(1)}>0$ . We thus conclude that we can apply result (14) with $N$ according to (18).

The above reasoning is not valid for $p=2$ , because, in that case, $\mbox{Det}(M)=-1$ does not imply $\mbox{Det}(E)=-1$ . For the case $p=2$ , we will prove that all $2^{w}\times 2^{w}$ epicirculant matrices are even permutations. For this purpose, it is sufficient to demonstrate that all group generators are even. From reversible computing [14] [15] [16], it is known that the group GA( $w,2$ ) is generated by following matrices:

[TABLE]

with a total of $w-1$ (for $A$ ) or $w-2$ (for $B$ and $C$ ) factors $I$ . In the context of computing, these matrices represent NOT gates, respectively controlled NOT gates. Applying (19), we have:

[TABLE]

except if $w=2$ . Thus, for $w>2$ , all members of GA( $w,2$ ) represent even permutations and the second strategy (Subsection 3.2) is not applicable.

This leaves us with the case $p=2$ and $w=2$ . The epicirculant matrices form a group E(4) isomorphic to the symmetric group S4. As stated in Section 4.2, the second strategy is applicable. The results on the applicability of the second strategy are summarized in Table 1.

7 Conclusion

According to [2], every unit-linesum $n\times n$ unitary matrix can be decomposed as a weighted sum of the $n\times n$ permutation matrices, such that both the sum of the weights and the sum of the squared moduli of the weights equal unity. Such Birkhoff sum contains $n!$ terms. In the present paper, we demonstrate the following:

•

If $n\geq 4$ , then $n!/2$ terms suffice.

•

If $n=p^{w}$ with $p$ an arbitrary prime and $w$ an arbitrary integer, then $p^{w}(p^{w}-p^{w-1})(p^{w}-p^{w-2})...(p^{w}-p)(p^{w}-1)$ suffice.

•

If $n=p^{w}$ with $p$ an arbitrary odd prime and $w$ an integer $\geq 2$ , then $p^{w}(p^{w}-p^{w-1})(p^{w}-p^{w-2})...(p^{w}-p)(p^{w}-1)/2$ suffice.

For numerical examples, see Table 2.

The case of $n$ equal to the product of two different primes is left for further investigation.

Appendix A The group of supercirculant permutation matrices

The supercirculant $n\times n$ permutation matrices form a group. Indeed, the product of two such matrices (say $S_{a,x}$ and $S_{b,y}$ ) yields a third such matrix. In order to prove this fact, we compute the matrix entry at position $(u,v)$ :

[TABLE]

and hence

[TABLE]

If $n$ is a prime $p$ , each non-zero number $x$ has an inverse number $x^{-1}$ . Applying (26), we find

[TABLE]

The right-hand side being the $p\times p$ unit matrix, the result proves that each supercirculant matrix has an inverse matrix that also is supercirculant:

[TABLE]

We conclude by considering two applications of eqn(26):

•

choosing $x=y=1$ leads to

[TABLE]

illustrating that the $p$ matrices $S_{a,1}$ are isomorphic to the addition modulo $p$ ;

•

choosing $a=b=0$ leads to

[TABLE]

illustrating that the $p-1$ matrices $S_{0,x}$ are isomorphic to the multiplication modulo $p$ .

Each supercirculant matrix can be decomposed as the product of a zero-shift matrix and a unit-pitch matrix:

[TABLE]

Appendix B The trace of a supercirculant permutation matrix

We compute the trace of the supercirculant permutation matrix $S_{a,x}$ :

[TABLE]

If the eqn

[TABLE]

is fulfilled, then the corresponding number $u$ points to a unit entry in position $(u,u)$ of the matrix $S_{a,x}$ . We notice:

•

If $x\neq 1$ , then $u=a(1-x)^{-1}$ is the one and only solution;

•

if $x=1$ and $a\neq 0$ , then the eqn has no solution $u$ ;

•

if $x=1$ and $a=0$ , then $u$ may have any value from $\{0,\ 1,\ 2,\ ...,\\ p-1\}$ .

Thus we conclude:

•

Tr $(S_{a,x})=1$ , if $x\neq 1$ ,

•

Tr $(S_{a,1})=0$ , if $a\neq 0$ , and

•

Tr $(S_{0,1})=p$ .

Appendix C The determinant of a supercirculant permutation matrix

As mentioned in Appendix A, each supercirculant matrix can be decomposed as follows:

[TABLE]

Hence:

[TABLE]

We have $S_{a,1}=(S_{1,1})^{a}$ and therefore $\mbox{Det}(S_{a,1})=(\mbox{Det}(S_{1,1}))^{a}$ . If $p$ is odd, then $\mbox{Det}(S_{1,1})=1$ , such that $\mbox{Det}(S_{a,1})=1$ . In other words: for odd primes, all of the $p$ circulant permutation matrices have unit determinant. The situation is different for the $p-1$ matrices $S_{0,x}$ . Half of them have unit determinant and half of them have determinant equal to $-1$ . In order to prove this fact, the key observation is the fact that the cyclic group is Abelian; so there exists a similarity transformation that diagonalizes all matrices $S_{0,x}$ . We now prove that the following matrix $F$ serves our purpose:

[TABLE]

where $\omega=\exp(\frac{2\pi i}{p-1})$ is the $(p-1)$ th root of unity, and the function $\varphi(a)$ gives the ‘position’ of the number $a$ in the cyclic group ${\bf C}_{p-1}$ (multiplicative group modulo $p$ ), as a power of the (a priori unknown) generator $g$ , i.e.

[TABLE]

From this definition, the following interesting properties of $\varphi$ can be deduced:

[TABLE]

These properties are key in the following derivation. We compute the similarity transformation given by $F^{\dagger}S_{0,x}F$ . Because both $F$ and $S_{0,x}$ are block diagonal with a single 1 in the upper-left corner, we only need to compute the lower-right part:

[TABLE]

This result leads to two conclusions:

•

By choosing $x=1$ , we find that $(F^{\dagger}F)_{u,v}=\delta_{u,v}$ and thus that $F$ is unitary.

•

By choosing $x$ arbitrary, we find that the matrix $S_{0,x}$ has the eigenvalues $\omega^{v\varphi(x)}$ plus an additional 1 from the upper-left matrix block.

The determinant is just the product of all eigenvalues:

[TABLE]

Now, if $p$ is an odd prime, then $e^{\pi ip}=-1$ , such that $\mbox{Det}(S_{0,x})=(-1)^{\varphi(x)}$ , which proves that the sign of the determinant of $S_{0,x}$ alternates in the chain of successive elements of Cp-1. More in particular, the position of $x=g$ always is $\varphi(g)=1$ , so we have $\mbox{Det}(S_{0,g})=-1$ .

We note that the above results for both $S_{a,1}$ and $S_{0,x}$ are only valid for odd primes $p$ . If $p$ is even, i.e. if $p=2$ , then there exist only two supercirculant matrices $S_{0,1}=\tiny\left(\begin{array}[]{cc}1&0\\ 0&1\end{array}\right)$ , with determinant equal to $1$ , and $S_{1,1}=\tiny\left(\begin{array}[]{cc}0&1\\ 1&0\end{array}\right)$ , with determinant equal to $-1$ .

Appendix D The pitch matrix

In (17), the epicirculant matrix $E_{{\bf a},{\bf x}}$ needs a unit entry in position $({\bf k},{\bf a}+{\bf x}{\bf k})$ if

[TABLE]

implying

[TABLE]

or

[TABLE]

and thus

[TABLE]

or

[TABLE]

and thus

[TABLE]

We fulfil this condition by the set of $w$ non-coupled eqns

[TABLE]

For each eqn, we expect $p^{w-1}$ solutions (as we can choose $w-1$ out of the $w$ dits $x_{j,v}$ arbitrarily from $\{0,1,...,p-1\}$ ). However, many solutions have to be rejected. Indeed, each column of the matrix $E_{{\bf a},{\bf x}}$ in (17) should contain one and only one unit entry. For this purpose, it is necessary and sufficient that the matrix x is invertible. Proof is as follows. We require that for any two different row numbers ( $k^{\prime}\neq k$ ) the unit entry of the permutation matrix is in another column:

[TABLE]

and thus ${\bf x}({\bf k^{\prime}}-{\bf k})\neq{\bf 0}$ . This requires that for any non-zero number $K$ we have

[TABLE]

This, in turn, requires that the rows of x are linearly independent and thus that the matrix x is invertible.

We now prove that, for any pair $(r,s)$ , the set (27) has at least one acceptable solution, i.e. a solution such that the matrix x is invertible. Indeed:

•

Because both $r$ and $s$ are non-zero, at least one dit $r_{u}$ is non-zero and at least one dit $s_{j}$ is non-zero. Let $r_{\alpha}$ be the least-significant non-zero dit of $r$ ; let $s_{\beta}$ be the least-significant non-zero dit of $s$ .

•

We choose all dits $x_{j,v}=0$ , except the dits $x_{v,v}$ , $x_{\beta,v}$ , and $x_{\alpha,\beta}$ . Thus eqns (27) become

[TABLE]

•

For $v\neq\alpha$ and $v\neq\beta$ , we choose $x_{v,v}=1$ . Further we choose $x_{\alpha,\alpha}=0$ and $x_{\alpha,\beta}=1$ . Thus eqns (28) become

[TABLE]

which lead to a single solution set $x_{\beta,v}$ .

The resulting pitch matrix x consists of a non-zero diagonal, one non-zero row, and one extra unit entry. E.g. for $w=7$ , $\alpha=2$ , and $\beta=4$ , we have:

[TABLE]

We note that here Det(x) equals $x_{4,2}$ . In general, we have

[TABLE]

Because Det(x) $\neq 0$ , we have that x is invertible.

Appendix E The group of epicirculant permutation matrices

The epicirculant permutation matrices form a group. An arbitrary entry (at location $({\bf k},{\bf l})$ ) of such matrix $E_{{\bf a},{\bf x}}$ is $\delta_{{\bf l},\,{\bf a}+{\bf xk}}$ . The product of two such matrices yields a third such matrix. Indeed:

[TABLE]

and hence

[TABLE]

Straightforward application of this result leads to

[TABLE]

The right-hand side being the $p^{w}\times p^{w}$ unit matrix, the result proves that each epicirculant matrix has an inverse matrix that also is epicirculant:

[TABLE]

Each epicirculant matrix can be decomposed as the product of a matrix with zero shift vector a and a matrix with unit pitch matrix x:

[TABLE]

Appendix F The trace of an epicirculant permutation matrix

We compute the trace of the epicirculant permutation matrix $E_{{\bf a},{\bf x}}$ :

[TABLE]

If the eqn

[TABLE]

is fulfilled, then the corresponding number $u$ points to a unit entry in position $(u,u)$ of the matrix $E_{{\bf a},{\bf x}}$ . Here, 1 denotes the $w\times w$ unit matrix. We notice:

•

If ${\bf(1-x)}$ is invertible, then ${\bf u}={\bf(1-x)}^{-1}{\bf a}$ is the one and only solution;

•

if ${\bf(1-x)}={\bf 0}$ and ${\bf a}\neq{\bf 0}$ , then the eqn has no solutions ${\bf u}$ ;

•

if ${\bf(1-x)}={\bf 0}$ and ${\bf a}={\bf 0}$ , then $u$ may have any value from $\{0,1,2,...,\\ p^{w}-1\}$ ;

•

if ${\bf(1-x)}$ is neither invertible nor zero, then ${\bf(1-x)}$ has rank $\lambda$ with $1\leq\lambda\leq w-1$ and ${\bf u}$ can have as many values as there are solutions of the eqn ${\bf(1-x)u}={\bf 0}$ , i.e. as the size of the kernel of $({\bf 1}-{\bf x})$ , i.e. $p^{w-\lambda}$ .

Thus we conclude:

•

Tr $(E_{{\bf a},{\bf 1}})=0$ , if ${\bf a}\neq{\bf 0}$ ,

•

Tr $(E_{{\bf 0},{\bf 1}})=p^{w}$ , and

•

Tr $(E_{{\bf a},{\bf x}})=p^{w-\lambda}$ , if $({\bf 1-x})$ has rank $\lambda\neq 0$ .

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] G. Birkhoff, “Tres observaciones sobre el algebra lineal”, Universidad Nacional de Tucumán: Revista Matemáticas y Física Teórica , vol. 5 (1946), pp. 147-151.
2[2] S. De Baerdemacker, A. De Vos, L. Chen, and L. Yu, “The Birkhoff theorem for unitary matrices of arbitrary dimension”, Linear Algebra and its Applications , vol. 514 (2017), pp. 151-164.
3[3] A. De Vos and S. De Baerdemacker, “The Birkhoff theorem for unitary matrices of prime dimension”, Linear Algebra and its Applications , vol. 493 (2016), pp. 455-468.
4[4] A. De Vos and S. De Baerdemacker, “The NEGATOR as a basic building block for quantum circuits”, Open Systems & Information Dynamics , vol. 20 (2013), 1350004.
5[5] A. De Vos and S. De Baerdemacker, “On two subgroups of U( n 𝑛 n ), useful for quantum computing”, Journal of Physics: Conference Series: Proceedings of the 30 th International Colloquium on Group-theoretical Methods in Physics, Gent (July 2014) , vol. 597 (2015), 012030.
6[6] A. Klappenecker and M. Rötteler, “Quantum software reusability”, International Journal of Foundations of Computer Science , vol. 14 (2003), pp. 777-796.
7[7] W. Tadej and K. Życzkowski, “A concise guide to complex Hadamard matrices”, Open Systems & Information Dynamics , vol. 13 (2006), pp. 133-177.
8[8] mathworld.wolfram.com/Transitive Group.html (2018).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The Birkhoff theorem

Abstract

1 Introduction

Theorem 1

Lemma 1

Lemma 2

2 The group XU(nnn)

Remark 1

3 Underlying framework

3.1 First strategy

3.2 Second strategy

4 The case of arbitrary dimension nnn

Lemma 3

Theorem 2

4.1 First strategy

4.2 Second strategy

5 The case of prime dimension n=pn=pn=p

Lemma 4

Theorem 3

5.1 First strategy

5.2 Second strategy

6 The case of prime-power dimension n=pwn=p^{w}n=pw

Remark 2

Lemma 5

Theorem 4

6.1 First strategy

6.2 Second strategy

7 Conclusion

Appendix A The group of supercirculant permutation matrices

Appendix B The trace of a supercirculant permutation matrix

Appendix C The determinant of a supercirculant permutation matrix

Appendix D The pitch matrix

Appendix E The group of epicirculant permutation matrices

Appendix F The trace of an epicirculant permutation matrix

2 The group XU( $n$ )

4 The case of arbitrary dimension $n$

5 The case of prime dimension $n=p$

6 The case of prime-power dimension $n=p^{w}$