The Quantum Cocktail Party Problem

Xiao Liang; Yadong Wu; Hui Zhai

arXiv:1904.06411·quant-ph·May 12, 2020

The Quantum Cocktail Party Problem

Xiao Liang, Yadong Wu, Hui Zhai

PDF

TL;DR

This paper introduces a quantum analog of the classical cocktail party problem, focusing on recovering pure quantum states from mixed states using classical and quantum optimization methods.

Contribution

It formulates the quantum cocktail party problem, proposes physical realizations, and offers solution strategies including classical optimization and quantum Hamiltonian mapping.

Findings

01

Formulation of the quantum cocktail party problem.

02

Proposed physical realization methods.

03

Solution approaches using classical and quantum techniques.

Abstract

The cocktail party problem refers to the famous selective attention problem of how to find out the signal of each individual sources from signals of a number of detectors. In the classical cocktail party problem, the signal of each source is a sequence of data such as the voice from a speaker, and each detector detects signal as a linear combination of all sources. This problem can be solved by a unsupervised machine learning algorithm known as the independent component analysis. In this work we propose a quantum analog of the cocktail party problem. Here each source is a density matrix of a pure state and each detector detects a density matrix as a linear combination of all pure state density matrix. The quantum cocktail party problem is to recover the pure state density matrix from a number observed mixed state density matrices. We propose the physical realization of this problem, and…

Figures3

Click any figure to enlarge with its caption.

Tables1

Table 1. Table 1: A comparison between the classical and the quantum cocktail party problem in term of different definition of source, different role of detector and different loss functions.

	Classical CPP	Quantum CPP
Source	Voice of each speaker	Each pure state
Detector	Mixed voices	Mixed state
Loss function	Minimizing entropy	Minimizing $\| ρ^{2} - ρ \|$

Equations23

x_{j} (t) = j i \sum A_{j i} s_{i} (t) .

x_{j} (t) = j i \sum A_{j i} s_{i} (t) .

ρ_{j} = j i \sum A_{j i} ∣ ϕ_{i} ⟩ ⟨ ϕ_{i} ∣.

ρ_{j} = j i \sum A_{j i} ∣ ϕ_{i} ⟩ ⟨ ϕ_{i} ∣.

∣Φ ⟩ = i = 1 \sum N φ_{i} (r) ∣ ϕ_{i} ⟩ ∣ s_{i} ⟩ .

∣Φ ⟩ = i = 1 \sum N φ_{i} (r) ∣ ϕ_{i} ⟩ ∣ s_{i} ⟩ .

F = mn \sum ∣ (ρ^{2} - ρ)_{mn} ∣^{2}

F = mn \sum ∣ (ρ^{2} - ρ)_{mn} ∣^{2}

= mn \sum ∣ j = 1 \sum M (ρ_{j}^{2} w_{j}^{2} - ρ_{j} w_{j})_{mn} + i \neq = j = 1 \sum M (ρ_{i} ρ_{j})_{mn} w_{i} w_{j} ∣^{2} .

w (t + Δt) = w (t) - \frac{F ^{'} [ w ( t )]}{F ^{''} [ w ( t )]},

w (t + Δt) = w (t) - \frac{F ^{'} [ w ( t )]}{F ^{''} [ w ( t )]},

∣ ϕ ⟩ = \frac{1}{\sum _{k} ∣ c _{k} ∣ ^{2}} k \sum c_{k} ∣ k ⟩,

∣ ϕ ⟩ = \frac{1}{\sum _{k} ∣ c _{k} ∣ ^{2}} k \sum c_{k} ∣ k ⟩,

\hat{H} = ij k l \sum A_{ij k l} σ_{i}^{z} σ_{j}^{z} σ_{k}^{z} σ_{l}^{z} + ij k \sum B_{ij k} σ_{i}^{z} σ_{j}^{z} σ_{k}^{z} + ij \sum C_{ij} σ_{i}^{z} σ_{j}^{z},

\hat{H} = ij k l \sum A_{ij k l} σ_{i}^{z} σ_{j}^{z} σ_{k}^{z} σ_{l}^{z} + ij k \sum B_{ij k} σ_{i}^{z} σ_{j}^{z} σ_{k}^{z} + ij \sum C_{ij} σ_{i}^{z} σ_{j}^{z},

A_{ij k l} = Tr (ρ_{i} ρ_{j}) (ρ_{k} ρ_{l});

A_{ij k l} = Tr (ρ_{i} ρ_{j}) (ρ_{k} ρ_{l});

B_{ij k} = - Tr (ρ_{i} ρ_{j}) ρ_{k} - Tr ρ_{i} (ρ_{j} ρ_{k});

C_{ij} = Tr ρ_{i} ρ_{j};

M \to \infty lim ∣ y - s_{k} ∣_{min}^{M} \to 0.

M \to \infty lim ∣ y - s_{k} ∣_{min}^{M} \to 0.

P_{accept} = {1 e^{- β (E_{t^{'}} - E_{t})} E_{t^{'}} < E_{t} E_{t^{'}} \geq E_{t},

P_{accept} = {1 e^{- β (E_{t^{'}} - E_{t})} E_{t^{'}} < E_{t} E_{t^{'}} \geq E_{t},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The Quantum Cocktail Party Problem

Xiao Liang

Laboratory of Quantum Information, University of Science and Technology of China, Hefei, 230026, China

Institute for Advanced Study, Tsinghua University, Beijing, 100084, China

Yadong Wu

Institute for Advanced Study, Tsinghua University, Beijing, 100084, China

Hui Zhai

[email protected]

Institute for Advanced Study, Tsinghua University, Beijing, 100084, China

Abstract

The cocktail party problem refers to the famous selective attention problem of how to find out the signal of each individual sources from signals of a number of detectors. In the classical cocktail party problem, the signal of each source is a sequence of data such as the voice from a speaker, and each detector detects signal as a linear combination of all sources. This problem can be solved by a unsupervised machine learning algorithm known as the independent component analysis. In this work we propose a quantum analog of the cocktail party problem. Here each source is a density matrix of a pure state and each detector detects a density matrix as a linear combination of all pure state density matrix. The quantum cocktail party problem is to recover the pure state density matrix from a number observed mixed state density matrices. We propose the physical realization of this problem, and how to solve this problem through either classical Newton’s optimization method or by mapping the problem to the ground state of an Ising type of spin Hamiltonian.

Introducation.

The cocktail party problem refers to the phenomenon that the brain of a listener can focus on a single voice while filtering out a range of other voices in a multi-talker situation, say, in a cocktail party Review . This selective attention problem is first defined as the “cocktail party problem” (CPP) by C. Cherry in 1953 Cherry . For several decades, it is an important research subject for both neuroscience to understand how human or animals solve this problem and computer science to design algorithms to solve this problem. During recent years, machine-learning based approach to solve the CPP is essential for many industrial applications such as automated speech recognization. The independent component analysis (ICA) is such an algorithm particularly suitable for the CPP. The CPP can also found its application in physical science such as astrophysics data analysis CPP_physics1 ; CPP_physics2 . Recently, it has also been proposed to use the spirt of CPP and ICA method to extract the eigen frequency of a quantum system from a dynamical probe Wu .

Let us first briefly review the classical CPP (c-CPP). Considering $N$ -independent speakers in a room, they speak simultaneously and the voice of each speaker is a source denoted by a sequence $s_{i}(t)$ ( $i=1,\dots,N$ ). There are also $M$ detectors in the room. Each detector detects a signal $x_{j}(t)$ ( $j=1,\dots,M$ ) that is considered to be a linear combination of all sources $s_{i}(t)$ , as schematically shown in Fig. 1(a). That is to say, we have a $M\times N$ -dimensional matrix $A$ and

[TABLE]

To concentrate on one of the speakers, it means that we should find out $A^{-1}$ such that we can determine $s_{i}(t)$ as $s_{i}(t)=\sum_{j}(A^{-1})_{ij}x_{j}(t)$ from the signals of all detectors. For human, we only have two ears which means the number of detectors is two. But for computer algorithm problem, we make the situation simpler by considering that there are more detectors than sources, that is, $M>N$ . Even though, this is still an ill-defined problem if no information of the source is known. In practices, we utilize the information that being voice of an individual speaker, each source $s_{i}(t)$ displays certain feature and is more regular than a mix of several voices. By performing statistics over $t$ for each sequence $s_{i}(t)$ , we can determine the entropy of the sequence and we use the criterion that the entropy of each sequence should be minimized to determine each $s_{i}(t)$ . This is how we solve the CPP with the ICA method ICA .

In this work we will propose a quantum analogy of the CPP, termed as the quantum cocktail party problem (q-CPP). We will discuss how to solve the q-CPP with an analogy of the ICA method. We should also present a mathematical statement that can help us to map the loss function to a Hamiltonian of Ising spins. Though by classical Monte Carlo, we show that the ground state spin configuration can solve the q-CPP, we point out that this spin Hamiltonian can be solved more efficiently by quantum methods, for example, by quantum simulation and quantum annealing.

Results.

Quantum CPP. Here we first propose the q-CPP. We consider $N$ different sources, and each source $s_{i}$ to be a density matrix of a pure state as $|\phi_{i}\rangle\langle\phi_{i}|$ , where $|\phi_{i}\rangle$ is a normalized quantum state in a Hilbert space with dimension $d$ . There are $M$ number of detectors, and the signal $x_{j}$ detected by each detector is a density matrix of a mixed state denoted by $\rho_{j}$ as

[TABLE]

We also normalize $\rho_{j}$ to be trace unity, which require $\sum_{i}A_{ji}=1$ for all $j$ . The q-CPP is defined as that, suppose that we know sufficient number of $\rho_{j}$ , whether one can find out $A_{ji}$ to recover each $|\phi_{i}\rangle\langle\phi_{i}|$ .

Here we will briefly discuss the uniqueness of the solution. First of all, we should emphasize that in order to ensure the solution is unique, it is important to require that different $|\phi_{i}\rangle$ are not orthogonal to each other. Secondly, when the number of detector increases by one, the constraints increases by $d^{2}$ and the free parameters increases by $N-1$ , so we will consider the situation that $d^{2}>N$ . Lastly, it is always good to have sufficient number of detectors, normally we consider $M>N$ . We do not rigorously prove the uniqueness of the solution, but we find that in practices, generically we always find unique solution when these conditions are satisfied.

A physical realization of the q-CPP can be proposed as follows. Let us consider a particle whose internal Hilbert space is a product of two degrees of freedom as $\mathcal{H}=\mathcal{H}_{A}\times\mathcal{H}_{B}$ . The dimensionality of $\mathcal{H}_{A}$ is $d$ and the dimensionality of $\mathcal{H}_{B}$ is $N$ , and $\{|s_{i}\rangle\}$ ( $i=1,\dots,N$ ) forms a complete set of bases in $\mathcal{H}_{B}$ . A wave function $|\Phi\rangle$ can be generally expanded in these bases as

[TABLE]

For a more physical picture, one can consider Eq. 3 as particle emitted from $N$ different sources, and the wave function in $\mathcal{H}_{B}$ is $|s_{i}\rangle$ for particle emitted from the source- $i$ , as schematically shown in Fig. 1(b). If we place $M$ detectors in different places ${\bf r}_{i}$ and the quantum measurement traces out the Hilbert space $\mathcal{H}_{B}$ , it results in $M$ different density matrix as Eq. 2 with $A_{ji}=|\varphi_{i}({\bf r}_{j})|^{2}$ , thus, it is also natural to require $A_{ji}$ to be positive numbers. In practices, these density matrices can be constructed by the quantum state tomography. We consider the situation that both the wave function $\varphi({\bf r})$ and $|\phi_{i}\rangle$ are unknown. The q-CPP is to determine them from $\rho_{j}$ ( $j=1,\dots,M$ ).

To find out the pure state, the most important information we use here is that the density matrix of a pure state has the property that $\rho^{2}=\rho$ . Thus, the scheme is to find out a proper combination of $\rho_{j}$ as $\rho=\sum_{j=1}^{M}w_{j}\rho_{j}$ with the normalization condition $\sum_{j}w_{j}=1$ that can minimize $|\rho^{2}-\rho|$ . This is equivalent to say, we define the loss function as

[TABLE]

A comparison between c-CPP and q-CPP is summarized in the Table 1.

Optimization with Newton’s Method. We firstly use the classical Newton’s method to optimize the loss function $\mathcal{F}$ , and the update rule of w is:

[TABLE]

where ${\bf w}=\{w_{1},\dots,w_{M}\}$ is a $M$ -dimensional vector. Here $\mathcal{F}^{\prime}$ and $\mathcal{F}^{\prime\prime}$ are respectively the first order and the second order gradient of the loss function $\mathcal{F}$ . When the loss function reaches the minimum, it should yield $\mathcal{F}=0$ , thus the optimization is completed. This process does not require any information of $A_{ij}$ and $|\phi_{i}\rangle$ as a prior.

To test this algorithm, we first randomly generate a set of pure state in the form of

[TABLE]

where $\{|k\rangle\}$ is a complete set of basis in the Hilbert space $\mathcal{H}_{A}$ with dimension chosen as $d_{A}=8$ , and $c_{k}$ is a real coefficient uniformly sampled in the range of $[-5,5]$ . To generate $\rho_{i}$ , we randomly sample $A_{ij}$ in the range of $[0,1]$ , then normalized under the constraint $\sum_{j}A_{i,j}=1$ . For the example shown in Fig. 2, we choose three pure states $\rho_{i}=|\phi_{i}\rangle\langle\phi_{i}|$ and the fidelities between the three pure states are $F(\rho_{1},\rho_{2})\doteq 0.56$ , $F(\rho_{1},\rho_{3})\doteq 0.54$ and $F(\rho_{2},\rho_{3})\doteq 0.88$ , where the fidelity is defined as $F(\rho_{a},\rho_{b})=\text{Tr}\sqrt{\sqrt{\rho_{a}}\rho_{b}\sqrt{\rho_{a}}}$ .

We then use the Newton’s method to solve the q-CPP. $\bf{w}$ is initialized in such a way that $w_{i}$ ( $i=1,\dots,M-1$ ) are uniformly sampled in the range of $[-2,2]$ and $w_{M}$ is determined by the constraint $\sum_{i}w_{i}=1$ . Then, we can reach a convergent solution following Eq. 5. In the example of Fig.2, three different $\rho_{f}$ can be found by the Newton’s method depending on different initialization, and their fidelities with $\rho_{i}$ ( $i=1,2,3$ ) are shown in Fig. 2(a-c). One can see that there is always one fidelity equalling unity. For instance, for the case Fig. 2(a), $F(\rho_{f},\rho_{1})\doteq 1$ , and $F(\rho_{f},\rho_{2})$ , $F(\rho_{f},\rho_{3})$ are consistent with $F(\rho_{1},\rho_{2})$ and $F(\rho_{1},\rho_{3})$ , respectively. This means that the resulting $\rho_{f}$ recovers $\rho_{1}$ . Similarly, in the cases of Fig. 2(b) and (c), the resulting $\rho_{f}$ recovers $\rho_{2}$ and $\rho_{3}$ , respectively. We have also tried different number of detectors. For the case with three sources, we find that the performance is good as long as the number of detectors is equal or greater than three.

Mapping to a Hamiltonian Problem. Since Eq. 4 is a function of $\{w_{j}\}$ , and if we restrict the value of all $w_{j}$ to be $\pm 1$ , minimizing Eq. 4 can be regarded as finding the ground state of a Hamiltonian of the Ising spins. If we replace $w_{j}$ as $\sigma^{z}_{j}$ , Eq. 4 can be written into a Hamiltonian form as

[TABLE]

where

[TABLE]

Here we set the energy unit of the Hamiltonian as unity. In order to satisfy the constraint $\sum_{j}w_{j}=1$ , we require the number of spin to be odd and the total magnetization to be unity. Note that this Hamiltonian contains four, three and two-body interactions. In this Hamiltonian, the number of sites are equal to the number of detectors. Here it is worth emphasizing that only computing the coefficients listed in Eq. 8-10 depends on the pure state Hilbert dimension $d$ of the original quantum problem, and the complicity of the Hamiltonian Eq. 7 itself will not increase as $d$ increases. Given that in the previous discussion of uniqueness of the solution, we prefer to have a large $d$ , this is a great advantage of this approach.

Now the question is whether we can restrict all $w_{j}$ to be $\pm 1$ . Here we make the following statement:

Statement: We consider each source ${\bf s}_{i}$ is a vector, and $M$ -number of signal ${\bf x}_{j}$ ( $j=1,\dots,M$ ) as a mixing of $N$ -number of sources ${\bf s}_{i}$ written as ${\bf x}_{j}=\sum_{ji}A_{ji}{\bf s}_{i}$ , where all $A_{ji}$ are positive numbers ranging between zero and unity without any other restrictions. We construct ${\bf y}=\sum_{j=1}^{M}w_{j}{\bf s}_{j}$ where $w_{j}$ can only take $\pm 1$ . For each given $M$ and for a specified target ${\bf s}_{k}$ , we optimize $\{w_{j}\}$ ( $j=1,\dots,M$ ) to minimize $|{\bf y}-{\bf s}_{k}|$ and the minimized value is denoted by $|{\bf y}-{\bf s}_{k}|^{M}_{\text{min}}$ . We state that

[TABLE]

The meaning of this statement is that, as long as the number of the detector is sufficient, we can always restrict $w_{j}$ to be $\pm 1$ .

We have verified this statement with numerical simulations. As an example, we consider five sources and each of them is an eight-dimensional vector. As shown in Fig. 3, we plot $|{\bf y}-{\bf s}_{k}|^{M}_{\text{min}}$ as a function of $1/M$ and find that it does converge to zero as $M$ increases.

Now we show the ground state spin configuration of this Hamiltonian can determine the solution of the q-CPP. As $M$ becomes large, the Hilbert space dimension of the Hamiltonian increases and it is hard to solve the ground state by the exact diagnolization. Hence, here we use the classical Monte-Carlo method to find the ground state approximately. In our simulations, the initial temperature is unity which is the same as the energy unit of the Hamiltonian. During the annealing process, the temperature is reduced epoch by epoch, and in each epoch the temperature is reduced by $\frac{1}{n(n+1)}$ , where $n$ is the epoch number. In each epoch, we randomly flip the spin for 12000 times with the acceptance probability $P_{\text{accept}}$ given by

[TABLE]

where $E_{t^{\prime}}$ and $E_{t}$ are the eigenenergies after and before randomly flipping the spins, respectively. $\beta=1/(k_{\text{b}}T)$ is the inverse temperature. In the simulation, it suffers from the problem of trapped into local minimum. If so, when we regard $\sigma^{z}_{j}$ as $w_{j}$ and reconstruct density matrix $\rho_{f}=\sum_{j}w_{j}\rho_{j}$ , $\rho_{f}$ may not be a positive definite matrix. Hence, during the annealing process, we simultaneously run two criterions. We require the von Neumann entropy of the reconstructed density matrix, defined as $-\text{Tr}\rho_{f}\text{log}\rho_{f}$ to be close to zero, and we also require the reconstructed density matrix to be positive definite. We stop the annealing process at a temperature when these two criterions are well satisfied.

The results of the Monte Carlo calculation are presented in Fig. 2(d-f). Here we also choose three different input density matrices with fidelity mutually as $F(\rho_{1},\rho_{2})\doteq 0.13$ , $F(\rho_{1},\rho_{3})\doteq 0.70$ and $F(\rho_{2},\rho_{3})\doteq 0.45$ . Similar as the results from the Newton’s method, depending on different initialization, we can find different spin configurations that can construct three different density matrices $\rho_{f}$ . For instance, for the case shown in Fig. 2(d), one can find the fidelity $F(\rho_{f},\rho_{1})$ is very close to unity, and $F(\rho_{f},\rho_{2})$ and $F(\rho_{f},\rho_{3})$ are consistent with $F(\rho_{1},\rho_{2})$ and $F(\rho_{1},\rho_{3})$ , respectively. We can also see that the results get improved as the number of detectors increases. For instance, in Fig. 2(d), for $M=19$ , $23$ , $27$ , $F(\rho_{f},\rho_{1})$ are $0.991$ , $0.998$ and $0.999$ , which gradually approaches unity, and $F(\rho_{f},\rho_{2})$ are $0.187$ , $0.143$ and $0.136$ , which gradually approaches the input value $F(\rho_{1},\rho_{2})$ .

Finally, we should remark that both exact diagnoalization and classical Monte Carlo have limitations for solving the Hamiltonian like Eq. 7. The most efficient way to find out its ground state is through quantum method, either by quantum simulation or by quantum annealing annealing .

Conclusion and Outlook.

In summary, we have proposed a quantum analogy of the cocktail party problem. The essential point is to replace the classical signal from each source with quantum data. In the classical problem, it is the independent classical signals that are emitted from different sources, which are mixed and detected by the detectors. In the quantum problem, instead, it is the quantum wave functions that are emitted from different sources, and here the “independent” means that the wave functions are orthogonal in part of the Hilbert space. The wave functions interfere and the reduced density matrix are observed by detectors. In both cases, the goal is to recover the individual sources from the information of detectors only.

We also show that solving this problem can be mapped to finding the ground state of an Ising type spin Hamiltonian, and we propose to solve this Hamiltonian by quantum simulator or quantum annealing. We note that, to ensure the uniqueness of the solution, it is preferred to keep both the Hilbert space dimension $d$ of the pure state wave function and the number of detector $M$ large enough. Under this situation, this quantum approach has advantage that on one hand, the complicity of the Hamiltonian does not increase with the increasing of $d$ ; and on the other hand, the number of Ising spins equal to $M$ and the quantum approach can exhibit its advantage when $M$ is large.

We envision that one can generalize this quantum version to more complicated situation, such as time-dependent wave function. Similar as that the classical CPP is very useful in classical information processing, we believe the quantum version of CPP can also be useful in quantum information processing.

Competing Interests

The authors declare that there are no competing interests.

Author Contribution

All authors contribute extensively to the entire project.

Data Availability

Data sets were generated or analyzed during the current study.

Acknowledgment. This work is supported MOST under Grant No. 2016YFA0301600 and NSFC Grant No. 11734010.

Bibliography7

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) M. A. Bee and C. Micheyl, The cocktail party problem: what is it? How can it be solved? And why should animal behaviorists study it? , J. Comp. Psychol. 122 235-251 (2008).
2(2) E. C. Cherry, Some experiments on the recognition of speech, with one and with two ears , J. Acoust. Cos. Am. 25 975-979 (1953).
3(3) I. P. Waldmann, Of Cocktail Parties and Exoplanets , Astrophys. J. 747 12 (2012).
4(4) J. Crowder and N. J. Cornish, Solution to the galactic foreground problem for LISA , Phys. Rev. D 75 043008 (2007).
5(5) Y. Wu and H. Zhai, Generalized Independent Component Analysis for Extracting Eigen-Modes of a Quantum System , ar Xiv: 1904.05067
6(6) A. Hyvärinen and E. Oja, Independent component analysis: algorithms and applications , Neural Netw. 13 411-430 (2000).
7(7) S. E. V.-Andraca, W. C.-Santos, C. Mc Geoch and M. Lanzagorta, A cross-disciplinary introduction to quantum annealing-based algorithms , Contemp. Phys, 59 174-197 (2018).