The Quantum Cocktail Party Problem
Xiao Liang, Yadong Wu, Hui Zhai

TL;DR
This paper introduces a quantum analog of the classical cocktail party problem, focusing on recovering pure quantum states from mixed states using classical and quantum optimization methods.
Contribution
It formulates the quantum cocktail party problem, proposes physical realizations, and offers solution strategies including classical optimization and quantum Hamiltonian mapping.
Findings
Formulation of the quantum cocktail party problem.
Proposed physical realization methods.
Solution approaches using classical and quantum techniques.
Abstract
The cocktail party problem refers to the famous selective attention problem of how to find out the signal of each individual sources from signals of a number of detectors. In the classical cocktail party problem, the signal of each source is a sequence of data such as the voice from a speaker, and each detector detects signal as a linear combination of all sources. This problem can be solved by a unsupervised machine learning algorithm known as the independent component analysis. In this work we propose a quantum analog of the cocktail party problem. Here each source is a density matrix of a pure state and each detector detects a density matrix as a linear combination of all pure state density matrix. The quantum cocktail party problem is to recover the pure state density matrix from a number observed mixed state density matrices. We propose the physical realization of this problem, and…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3| Classical CPP | Quantum CPP | |
|---|---|---|
| Source | Voice of each speaker | Each pure state |
| Detector | Mixed voices | Mixed state |
| Loss function | Minimizing entropy | Minimizing |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Quantum Cocktail Party Problem
Xiao Liang
Laboratory of Quantum Information, University of Science and Technology of China, Hefei, 230026, China
Institute for Advanced Study, Tsinghua University, Beijing, 100084, China
Yadong Wu
Institute for Advanced Study, Tsinghua University, Beijing, 100084, China
Hui Zhai
Institute for Advanced Study, Tsinghua University, Beijing, 100084, China
Abstract
The cocktail party problem refers to the famous selective attention problem of how to find out the signal of each individual sources from signals of a number of detectors. In the classical cocktail party problem, the signal of each source is a sequence of data such as the voice from a speaker, and each detector detects signal as a linear combination of all sources. This problem can be solved by a unsupervised machine learning algorithm known as the independent component analysis. In this work we propose a quantum analog of the cocktail party problem. Here each source is a density matrix of a pure state and each detector detects a density matrix as a linear combination of all pure state density matrix. The quantum cocktail party problem is to recover the pure state density matrix from a number observed mixed state density matrices. We propose the physical realization of this problem, and how to solve this problem through either classical Newton’s optimization method or by mapping the problem to the ground state of an Ising type of spin Hamiltonian.
Introducation.
The cocktail party problem refers to the phenomenon that the brain of a listener can focus on a single voice while filtering out a range of other voices in a multi-talker situation, say, in a cocktail party Review . This selective attention problem is first defined as the “cocktail party problem” (CPP) by C. Cherry in 1953 Cherry . For several decades, it is an important research subject for both neuroscience to understand how human or animals solve this problem and computer science to design algorithms to solve this problem. During recent years, machine-learning based approach to solve the CPP is essential for many industrial applications such as automated speech recognization. The independent component analysis (ICA) is such an algorithm particularly suitable for the CPP. The CPP can also found its application in physical science such as astrophysics data analysis CPP_physics1 ; CPP_physics2 . Recently, it has also been proposed to use the spirt of CPP and ICA method to extract the eigen frequency of a quantum system from a dynamical probe Wu .
Let us first briefly review the classical CPP (c-CPP). Considering -independent speakers in a room, they speak simultaneously and the voice of each speaker is a source denoted by a sequence (). There are also detectors in the room. Each detector detects a signal () that is considered to be a linear combination of all sources , as schematically shown in Fig. 1(a). That is to say, we have a -dimensional matrix and
[TABLE]
To concentrate on one of the speakers, it means that we should find out such that we can determine as from the signals of all detectors. For human, we only have two ears which means the number of detectors is two. But for computer algorithm problem, we make the situation simpler by considering that there are more detectors than sources, that is, . Even though, this is still an ill-defined problem if no information of the source is known. In practices, we utilize the information that being voice of an individual speaker, each source displays certain feature and is more regular than a mix of several voices. By performing statistics over for each sequence , we can determine the entropy of the sequence and we use the criterion that the entropy of each sequence should be minimized to determine each . This is how we solve the CPP with the ICA method ICA .
In this work we will propose a quantum analogy of the CPP, termed as the quantum cocktail party problem (q-CPP). We will discuss how to solve the q-CPP with an analogy of the ICA method. We should also present a mathematical statement that can help us to map the loss function to a Hamiltonian of Ising spins. Though by classical Monte Carlo, we show that the ground state spin configuration can solve the q-CPP, we point out that this spin Hamiltonian can be solved more efficiently by quantum methods, for example, by quantum simulation and quantum annealing.
Results.
Quantum CPP. Here we first propose the q-CPP. We consider different sources, and each source to be a density matrix of a pure state as , where is a normalized quantum state in a Hilbert space with dimension . There are number of detectors, and the signal detected by each detector is a density matrix of a mixed state denoted by as
[TABLE]
We also normalize to be trace unity, which require for all . The q-CPP is defined as that, suppose that we know sufficient number of , whether one can find out to recover each .
Here we will briefly discuss the uniqueness of the solution. First of all, we should emphasize that in order to ensure the solution is unique, it is important to require that different are not orthogonal to each other. Secondly, when the number of detector increases by one, the constraints increases by and the free parameters increases by , so we will consider the situation that . Lastly, it is always good to have sufficient number of detectors, normally we consider . We do not rigorously prove the uniqueness of the solution, but we find that in practices, generically we always find unique solution when these conditions are satisfied.
A physical realization of the q-CPP can be proposed as follows. Let us consider a particle whose internal Hilbert space is a product of two degrees of freedom as . The dimensionality of is and the dimensionality of is , and () forms a complete set of bases in . A wave function can be generally expanded in these bases as
[TABLE]
For a more physical picture, one can consider Eq. 3 as particle emitted from different sources, and the wave function in is for particle emitted from the source-, as schematically shown in Fig. 1(b). If we place detectors in different places and the quantum measurement traces out the Hilbert space , it results in different density matrix as Eq. 2 with , thus, it is also natural to require to be positive numbers. In practices, these density matrices can be constructed by the quantum state tomography. We consider the situation that both the wave function and are unknown. The q-CPP is to determine them from ().
To find out the pure state, the most important information we use here is that the density matrix of a pure state has the property that . Thus, the scheme is to find out a proper combination of as with the normalization condition that can minimize . This is equivalent to say, we define the loss function as
[TABLE]
A comparison between c-CPP and q-CPP is summarized in the Table 1.
Optimization with Newton’s Method. We firstly use the classical Newton’s method to optimize the loss function , and the update rule of w is:
[TABLE]
where is a -dimensional vector. Here and are respectively the first order and the second order gradient of the loss function . When the loss function reaches the minimum, it should yield , thus the optimization is completed. This process does not require any information of and as a prior.
To test this algorithm, we first randomly generate a set of pure state in the form of
[TABLE]
where is a complete set of basis in the Hilbert space with dimension chosen as , and is a real coefficient uniformly sampled in the range of . To generate , we randomly sample in the range of , then normalized under the constraint . For the example shown in Fig. 2, we choose three pure states and the fidelities between the three pure states are , and , where the fidelity is defined as .
We then use the Newton’s method to solve the q-CPP. is initialized in such a way that () are uniformly sampled in the range of and is determined by the constraint . Then, we can reach a convergent solution following Eq. 5. In the example of Fig.2, three different can be found by the Newton’s method depending on different initialization, and their fidelities with () are shown in Fig. 2(a-c). One can see that there is always one fidelity equalling unity. For instance, for the case Fig. 2(a), , and , are consistent with and , respectively. This means that the resulting recovers . Similarly, in the cases of Fig. 2(b) and (c), the resulting recovers and , respectively. We have also tried different number of detectors. For the case with three sources, we find that the performance is good as long as the number of detectors is equal or greater than three.
Mapping to a Hamiltonian Problem. Since Eq. 4 is a function of , and if we restrict the value of all to be , minimizing Eq. 4 can be regarded as finding the ground state of a Hamiltonian of the Ising spins. If we replace as , Eq. 4 can be written into a Hamiltonian form as
[TABLE]
where
[TABLE]
Here we set the energy unit of the Hamiltonian as unity. In order to satisfy the constraint , we require the number of spin to be odd and the total magnetization to be unity. Note that this Hamiltonian contains four, three and two-body interactions. In this Hamiltonian, the number of sites are equal to the number of detectors. Here it is worth emphasizing that only computing the coefficients listed in Eq. 8-10 depends on the pure state Hilbert dimension of the original quantum problem, and the complicity of the Hamiltonian Eq. 7 itself will not increase as increases. Given that in the previous discussion of uniqueness of the solution, we prefer to have a large , this is a great advantage of this approach.
Now the question is whether we can restrict all to be . Here we make the following statement:
Statement: We consider each source is a vector, and -number of signal () as a mixing of -number of sources written as , where all are positive numbers ranging between zero and unity without any other restrictions. We construct where can only take . For each given and for a specified target , we optimize () to minimize and the minimized value is denoted by . We state that
[TABLE]
The meaning of this statement is that, as long as the number of the detector is sufficient, we can always restrict to be .
We have verified this statement with numerical simulations. As an example, we consider five sources and each of them is an eight-dimensional vector. As shown in Fig. 3, we plot as a function of and find that it does converge to zero as increases.
Now we show the ground state spin configuration of this Hamiltonian can determine the solution of the q-CPP. As becomes large, the Hilbert space dimension of the Hamiltonian increases and it is hard to solve the ground state by the exact diagnolization. Hence, here we use the classical Monte-Carlo method to find the ground state approximately. In our simulations, the initial temperature is unity which is the same as the energy unit of the Hamiltonian. During the annealing process, the temperature is reduced epoch by epoch, and in each epoch the temperature is reduced by , where is the epoch number. In each epoch, we randomly flip the spin for 12000 times with the acceptance probability given by
[TABLE]
where and are the eigenenergies after and before randomly flipping the spins, respectively. is the inverse temperature. In the simulation, it suffers from the problem of trapped into local minimum. If so, when we regard as and reconstruct density matrix , may not be a positive definite matrix. Hence, during the annealing process, we simultaneously run two criterions. We require the von Neumann entropy of the reconstructed density matrix, defined as to be close to zero, and we also require the reconstructed density matrix to be positive definite. We stop the annealing process at a temperature when these two criterions are well satisfied.
The results of the Monte Carlo calculation are presented in Fig. 2(d-f). Here we also choose three different input density matrices with fidelity mutually as , and . Similar as the results from the Newton’s method, depending on different initialization, we can find different spin configurations that can construct three different density matrices . For instance, for the case shown in Fig. 2(d), one can find the fidelity is very close to unity, and and are consistent with and , respectively. We can also see that the results get improved as the number of detectors increases. For instance, in Fig. 2(d), for , , , are , and , which gradually approaches unity, and are , and , which gradually approaches the input value .
Finally, we should remark that both exact diagnoalization and classical Monte Carlo have limitations for solving the Hamiltonian like Eq. 7. The most efficient way to find out its ground state is through quantum method, either by quantum simulation or by quantum annealing annealing .
Conclusion and Outlook.
In summary, we have proposed a quantum analogy of the cocktail party problem. The essential point is to replace the classical signal from each source with quantum data. In the classical problem, it is the independent classical signals that are emitted from different sources, which are mixed and detected by the detectors. In the quantum problem, instead, it is the quantum wave functions that are emitted from different sources, and here the “independent” means that the wave functions are orthogonal in part of the Hilbert space. The wave functions interfere and the reduced density matrix are observed by detectors. In both cases, the goal is to recover the individual sources from the information of detectors only.
We also show that solving this problem can be mapped to finding the ground state of an Ising type spin Hamiltonian, and we propose to solve this Hamiltonian by quantum simulator or quantum annealing. We note that, to ensure the uniqueness of the solution, it is preferred to keep both the Hilbert space dimension of the pure state wave function and the number of detector large enough. Under this situation, this quantum approach has advantage that on one hand, the complicity of the Hamiltonian does not increase with the increasing of ; and on the other hand, the number of Ising spins equal to and the quantum approach can exhibit its advantage when is large.
We envision that one can generalize this quantum version to more complicated situation, such as time-dependent wave function. Similar as that the classical CPP is very useful in classical information processing, we believe the quantum version of CPP can also be useful in quantum information processing.
Competing Interests
The authors declare that there are no competing interests.
Author Contribution
All authors contribute extensively to the entire project.
Data Availability
Data sets were generated or analyzed during the current study.
Acknowledgment. This work is supported MOST under Grant No. 2016YFA0301600 and NSFC Grant No. 11734010.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) M. A. Bee and C. Micheyl, The cocktail party problem: what is it? How can it be solved? And why should animal behaviorists study it? , J. Comp. Psychol. 122 235-251 (2008).
- 2(2) E. C. Cherry, Some experiments on the recognition of speech, with one and with two ears , J. Acoust. Cos. Am. 25 975-979 (1953).
- 3(3) I. P. Waldmann, Of Cocktail Parties and Exoplanets , Astrophys. J. 747 12 (2012).
- 4(4) J. Crowder and N. J. Cornish, Solution to the galactic foreground problem for LISA , Phys. Rev. D 75 043008 (2007).
- 5(5) Y. Wu and H. Zhai, Generalized Independent Component Analysis for Extracting Eigen-Modes of a Quantum System , ar Xiv: 1904.05067
- 6(6) A. Hyvärinen and E. Oja, Independent component analysis: algorithms and applications , Neural Netw. 13 411-430 (2000).
- 7(7) S. E. V.-Andraca, W. C.-Santos, C. Mc Geoch and M. Lanzagorta, A cross-disciplinary introduction to quantum annealing-based algorithms , Contemp. Phys, 59 174-197 (2018).
