Analog quantum algorithms for the mixing of Markov chains
Shantanav Chakraborty, Kyle Luh, J\'er\'emie Roland

TL;DR
This paper introduces an analog quantum algorithm for Markov chain mixing that improves sampling efficiency, and explores quantum mixing times, extending previous results with new theoretical insights and numerical validation.
Contribution
It presents a novel analog quantum algorithm for Markov chain mixing and extends understanding of quantum mixing times using advanced random matrix theory tools.
Findings
Quantum algorithm scales with the sum of square roots of classical mixing and hitting times.
Provided an intuitive understanding of eigenvalue statistics in random matrices.
Numerical simulations support the analytical results and extend to general Markov chains.
Abstract
The problem of sampling from the stationary distribution of a Markov chain finds widespread applications in a variety of fields. The time required for a Markov chain to converge to its stationary distribution is known as the classical mixing time. In this article, we deal with analog quantum algorithms for mixing. First, we provide an analog quantum algorithm that given a Markov chain, allows us to sample from its stationary distribution in a time that scales as the sum of the square root of the classical mixing time and the square root of the classical hitting time. Our algorithm makes use of the framework of interpolated quantum walks and relies on Hamiltonian evolution in conjunction with von Neumann measurements. There also exists a different notion for quantum mixing: the problem of sampling from the limiting distribution of quantum walks, defined in a time-averaged sense. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Analog quantum algorithms for the mixing of Markov chains
Shantanav Chakraborty1
Kyle Luh2
Jérémie Roland1
1QuIC, Ecole Polytechnique de Bruxelles, Université libre de Bruxelles
2Center for Mathematical Sciences and Applications, Harvard University
Abstract
The problem of sampling from the stationary distribution of a Markov chain finds widespread applications in a variety of fields. The time required for a Markov chain to converge to its stationary distribution is known as the classical mixing time. In this article, we deal with analog quantum algorithms for mixing. First, we provide an analog quantum algorithm that given a Markov chain, allows us to sample from its stationary distribution in a time that scales as the sum of the square root of the classical mixing time and the square root of the classical hitting time. Our algorithm makes use of the framework of interpolated quantum walks and relies on Hamiltonian evolution in conjunction with von Neumann measurements.
There also exists a different notion for quantum mixing: the problem of sampling from the limiting distribution of quantum walks, defined in a time-averaged sense. In this scenario, the quantum mixing time is defined as the time required to sample from a distribution that is close to this limiting distribution. Recently we provided an upper bound on the quantum mixing time for Erdös-Renyi random graphs [Phys. Rev. Lett. 124, 050501 (2020)]. Here, we also extend and expand upon our findings therein. Namely, we provide an intuitive understanding of the state-of-the-art random matrix theory tools used to derive our results. In particular, for our analysis we require information about macroscopic, mesoscopic and microscopic statistics of eigenvalues of random matrices which we highlight here. Furthermore, we provide numerical simulations that corroborate our analytical findings and extend this notion of mixing from simple graphs to any ergodic, reversible, Markov chain.
I Introduction
Markov chain-based algorithms are applied in a plethora of fields ranging from statistical physics [1], combinatorial optimization [2] to network science [3] and form the basis of Markov chain Monte Carlo-based methods [4]. In many of these applications, the underlying task is often to sample from the so-called steady state (also known as the stationary distribution) of the associated Markov chain.
One way to sample from a stationary distribution is by mixing. The Markov chain, which is represented by a stochastic matrix is applied repeatedly to some initial distribution. The resultant random walk reaches a final distribution that is close to a stationary distribution of , irrespective of the initial distribution. For most applications, the Markov chain is ergodic, implying that it has a unique stationary distribution and, reversible, i.e. it satisfies detailed balance. (We refer the reader to Sec. II for details on the definitions of these terms related to Markov chains). Henceforth, unless stated otherwise, we shall restrict our attention to ergodic, reversible Markov chains. For a given Markov chain , the minimum time after which the distribution is -close to the stationary distribution is known as the mixing time of the random walk on . It is well known that the mixing time is related to the spectral gap of . For an ergodic, Markov chain with spectral gap , the mixing time is in 111Throughout the article, we use and to suppress polylogarithmic factors, i.e. and ..
The stationary distribution, by definition, is the limiting distribution of the resultant random walk on , i.e. once the stationary state is reached, the random walk ceases to evolve. This implies that as , applied to any initial distribution converges to the stationary distribution. Thus the classical mixing time is also the time required to sample from the limiting distribution of the underlying random walk.
In the context of quantum algorithms, there arise two notions of mixing and hence of mixing time. First, it is natural to consider whether, given a Markov chain , a quantum algorithm can allow us to prepare a coherent encoding of the stationary distribution of . We shall refer to this problem as . Measuring the output state of such an algorithm would enable us to sample from the (classical) stationary state of . Preparing such a coherent encoding also has other applications which we discuss later.
The other notion of mixing arises from considering the limiting distribution of the underlying quantum walk itself. As quantum evolutions are unitary and hence distance-preserving, there is no inherent limiting stationary distribution for quantum walks. However, it turns out that one can define a limiting distribution of the quantum walk in a time-averaged sense.
Starting from some initial state, one can obtain the probability that the walker is in some final state after a time which is picked uniformly at random in the interval . This gives a time-averaged probability distribution at any time and also a limiting probability distribution as . The problem of sampling from this time-averaged limiting distribution of a quantum walk gives rise to another notion of mixing and we shall refer to this problem as . The mixing time of a quantum walk is then defined as the time after which the time-averaged probability distribution is close to the limiting probability distribution, i.e. the time required to solve .
In this article, we deal with both and problems. We provide the first purely analog quantum algorithm to solve the problem while for the problem, we expand and extend upon the results of Ref. [6], where we prove an upper bound for the quantum mixing time for almost all graphs.
Aharonov and Ta-Shma [7] demonstrated that the existence of an efficient quantum algorithm for would imply that problems in the complexity class Statistical Zero Knowledge (SZK) such as Graph Isomorphism would be solvable in polynomial-time using a quantum computer (BQP), i.e. . This would be a surprising result as such a generic algorithm would be oblivious to the specific structure of the underlying problem. For example, consider the problem of Graph Isomorphism [8] (deciding whether two graphs are isomorphic to each other). Given graphs and , a quantum algorithm for mixing could be used to prepare states that are a uniform superposition of all graphs that are isomorphic to them. If these states are equal, then and are isomorphic. A simple SWAP Test could then be used in conjunction with a quantum algorithm for to solve Graph Isomorphism. Thus, generic quantum algorithms for are unlikely to be efficient.
Having said that, there do exist quantum algorithms that solve this problem [9, 10, 11], some of which have even been instrumental in obtaining speedups for quantum machine learning [12, 13, 14]. Richter [15] conjectured that one could construct a quantum algorithm for this problem that has a running time that is in , yielding a quadratic speedup over its classical counterpart. Developing quantum algorithms that match this conjectured bound have been challenging. Most of the existing quantum algorithms are based on Szegedy’s framework for discrete-time quantum walks [16].
The key idea that encompasses all existing algorithms for is to make use of the so called quantum spatial search algorithm [17]. Given an ergodic, reversible Markov chain with a set of marked nodes, a spatial search algorithm finds an element from this marked set. Classically, this task requires a time known as the hitting time of the corresponding random walk on . It has been shown that a discrete-time quantum walk-based quantum algorithm for spatial search can accomplish this task quadratically faster (up to logarithmic factors) [18, 19]. Such quantum algorithms start from the coherent encoding of the stationary state of (it inherently assumes that this state can be prepared efficiently) and end up in a state that has a constant overlap with an element from the marked set. Thus, intuitively, quantum spatial search algorithms can be run in reverse to obtain quantum mixing algorithms. However, simply obtaining a constant overlap with the stationary state is not enough and these mixing algorithms require the use of quantum phase estimation [20] and quantum amplitude amplification [21] to solve the problem. Recently, Apers and Sarlette provided a quantum algorithm that can quadratically fast-forward the dynamics of Markov chains which can also be used to solve the problem [22]. The running time of these algorithms scale as the square root of the hitting time of the corresponding quantum walk on the underlying Markov chain.
To the best of our knowledge, there do not exist any analog quantum algorithm for solving the problem. In this framework, key algorithmic primitives such as quantum phase estimation and quantum amplitude amplification are missing as they are inherently discrete-time. In order to construct an analog quantum algorithm for we assume that, given an ergodic, reversible Markov chain , we have access to a time-independent Hamiltonian that encodes the connectivity of . This Hamiltonian, defined in Sec. IV, corresponds to a quantum walk on the edges of . Furthermore it has been recently used to design continuous-time quantum walk-based quantum algorithms for spatial search that can find a single marked node on any ergodic, reversible Markov chain in square root of the hitting time [23]. We use the time-evolution of this Hamiltonian as the key primitive to our algorithm. The second key primitive is to use von Neumann measurements [24] for quantum state generation. Childs et al. used a sequence of such von Neumann measurements as an alternative to adiabatic quantum computation and for solving combinatorial search algorithms [25]. In Sec. III we demonstrate that this scheme can be used to prepare eigenstates of Hamiltonians.
We show (Sec. V) that these two primitives allow us to develop a continuous-time quantum walk based algorithm for spatial search. This algorithm differs from the one developed in Ref. [23] which makes use of quantum phase randomization [26]. It provides an alternative scheme by which one can find an element in a marked set of states of any ergodic, reversible Markov chain in square root of the extended hitting time. Although this algorithm has the same running time as that of Ref. [23], it provides useful intuition about how to build an analog quantum algorithm for .
Our quantum algorithm for mixing, explained in detail in Sec. VI, avoids the need for amplitude amplification by making use of the framework of interpolated Markov chains and switching between two different values of the interpolation parameter. The running time scales as the sum of the square root of the classical mixing time and the square root of the hitting time.
We also discuss the problem of on generic graphs. The limiting distribution of quantum walks can be quite different from that obtained from a quantum algorithm for solving . Unlike its classical counterpart, for , the limiting distribution is dependent on the initial state of the quantum walk. Moreover, instead of being dependent on the spectral gap , the quantum mixing time depends on all eigenvalue gaps of the underlying Hamiltonian. Aharonov et al. [27] were the first to study this problem. They showed that a discrete-time quantum walk on the cycle graph mixes faster than its classical counterpart. Since then several works have considered the mixing time of both continuous and discrete-time quantum walks on specific graphs [28, 29, 30, 31, 32, 33]. The upper bound for the mixing time of quantum walks have been proven to be slower than its classical counterpart for some graphs while a quadratic speedup has been obtained for others.
Recently, we proved an upper bound for the mixing time of quantum walks for almost all graphs [6]. This implies that the fraction of graphs of nodes for which our upper bound holds, goes to 1 as goes to infinity or equivalently, if a graph is picked uniformly at random from the set of all graphs, our result provides an upper bound on the quantum mixing time almost surely, i.e. with probability . Throughout the article, we shall use the phrase almost all graphs to signify precisely this.
We proved this by obtaining the mixing time for quantum walks on Erdös-Renyi random graphs: graphs of nodes such that the probability of an edge existing between any two nodes is , typically denoted as . Here, we expand upon the results of the letter [6]. In particular, our goal is to offer an intuitive explanation of our proof techniques with an emphasis on the several recently developed random matrix theory tools that were used to derive the aforementioned results. We also corroborate our analytical findings numerically and also extend the notion of to any ergodic, reversible Markov chain. In fact, our numerical findings confirm the fact that the mixing time for quantum walks on is in for dense random graphs (constant ). Additionally they also show that the limiting probability distribution is close to the uniform distribution.
This article is organized as follows: In Sec. II, we explain some basic concepts and quantities related to Markov chains that we shall use in subsequent sections. In Sec. III we show how von Neumann measurements can be used for preparing eigenstates of Hamiltonians. In Sec. IV, we define a Hamiltonian corresponding to a quantum walk on the edges of any ergodic, reversible Markov chain. In Sec. V, we make use of von Neumann measurements and Hamiltonian evolution to provide a quantum algorithm for spatial search. This provides an intuitive understanding of our analog quantum algorithm for solving , which we describe in Sec. VI. Next, in Sec. VII, we deal with solving the problem. Finally, we conclude with a brief discussion and summary in Sec. VIII.
II Preliminaries
In this section we state some basic definitions about Markov chains which we shall use subsequently.
II.1 Basics of Markov chains
A Markov chain on a discrete state space , such that , can be described by a stochastic matrix [34]. Each entry of this matrix represents the probability of transitioning from state to state . Any distribution over the state space of the Markov chain is represented by a stochastic row vector.
A Markov chain is irreducible if any state can be reached from any other state in a finite number of steps. Any irreducible Markov chain is aperiodic if there exists no integer greater than one that divides the length of every directed cycle of the graph. A Markov chain is ergodic if it is both irreducible and aperiodic. By the Perron-Frobenius Theorem, any ergodic Markov chain has a unique stationary state such that . The stationary state is a stochastic row vector and has support on all the elements of . Let us denote it as
[TABLE]
such that . Starting from any initial probability distribution over the state space , the repeated application of leads to convergence to the stationary distribution , i.e. . This is known as the mixing of a Markov chain. It follows from the Perron-Frobenius theorem that other than , all eigenvectors have eigenvalues of absolute value strictly less than . Thus, is the unique eigenvector with eigenvalue and all other eigenvalues lie between and . Throughout the paper we shall be working with the Markov chain corresponding to the lazy walk, i.e. we shall map . This transformation ensures that all the eigenvalues of lie between [math] and . This transformation will not affect our results other than by a factor of two, which is irrelevant in the asymptotic limit. Throughout the article, we shall denote the gap between the two highest eigenvalues of (the spectral gap) by .
Let denote the -entry of the ergodic Markov chain with stationary state . Then the entry of the time-reversed Markov chain of , denoted by , is
[TABLE]
We shall concern ourselves with ergodic Markov chains that are also reversible, i.e. Markov chains for which . Any reversible satisfies the detailed balance condition
[TABLE]
This can also be rewritten as
[TABLE]
where is a diagonal matrix with the diagonal entry being . In other words, the reversibility criterion implies that the matrix is symmetric. Henceforth we shall only deal with reversible (and hence ergodic) Markov chains.
Interpolated Markov chains: Let us assume that a subset of the elements of the state space of the Markov chain is marked. Let denote the set of marked elements. Given any , we define as the absorbing Markov chain obtained from by replacing all the outgoing edges from to by self-loops. If we re-arrange the elements of such that the unmarked elements appear first, then we can write
[TABLE]
where and are square matrices of size and respectively. On the other hand and are matrices of size and respectively. Then the interpolated Markov chain is defined as
[TABLE]
where . The interpolated Markov chain thus has a block structure
[TABLE]
Clearly, and . Notice that if is ergodic, so is for . This is because any edge in is also an edge of and so the properties of irreducibility and aperiodicity are preserved. However when , has outgoing edges from replaced by self-loops and as such the states in are not accessible from , implying that is not ergodic. The spectral gap of is denoted by .
Now we shall see how the stationary state of is related to that of . Since , the stationary state can be written as
[TABLE]
where and are row-vectors of length and respectively. As mentioned previously, is not ergodic and does not have a unique stationary state. In fact, any state having support over only the marked set is a stationary state of .
On the other hand is ergodic for . Let be the probability of obtaining a marked element in the stationary state of . Then it is easy to verify that the unique stationary state of is
[TABLE]
**Discriminant matrix: ** We denote by
[TABLE]
the symmetric matrix whose entry is . Here indicates the Hadamard product.
For any as is reversible, the detailed-balance condition is satisfied. So, each entry of can be expressed as
[TABLE]
This leads us to the following fact:
Fact 1
For any ergodic, reversible Markov chain , we have that for
[TABLE]
where is a row vector with its -entry being .
From Fact 1, it follows that is similar to , i.e. they have the same set of eigenvalues 222Note that throughout this paper, we shall be dealing with values of and so the properties of are not relevant here..
Let the spectral decomposition of be
[TABLE]
where is an eigenvector of with eigenvalue . Furthermore, .
Fact 2
For , the eigenstate of with eigenvalue is given by
[TABLE]
where is a row vector with its -entry being .
This fact follows from the reversibility condition stated in Fact 1, i.e. for we have
[TABLE]
The -eigenvector of , can also be expressed in a different form.
Proposition 3
The eigenstate of eigenvalue of can be expressed as
[TABLE]
where and are defined as
[TABLE]
This follows directly from Fact 2.
II.2 Some quantities related to Markov chains: Hitting and mixing times
In this subsection, we define certain quantities related to Markov chains which we shall use in subsequent sections for our analysis.
Spatial search problem and hitting time: Consider a graph with vertices and edges. Consider a subset of vertices that are marked. Then the spatial search problem involves finding any of the marked vertices in . This problem can be solved by both classical random walks and quantum walks.
Given an ergodic, reversible Markov chain with a stationary state , the random walk based algorithm to solve the spatial search problem is described in Algorithm 1.
The hitting time of with respect to is the expected number of times step of Algorithm 1 is executed. Let us denote this by . Thus, the random walk based algorithm finds a marked vertex in time . Note that the random walk algorithm stops as soon as a marked element is reached. Thus, this is equivalent to applying an absorbing Markov chain that is obtained by replacing all the outgoing edges from the marked vertices of by self loops. From this we can define .
Hitting time of a Markov chain: The hitting time of any Markov chain with respect to a set of marked elements can be expressed as
[TABLE]
where and are the eigenvalues and eigenvectors of the matrix and
[TABLE]
where is the probability of sampling a marked vertex from the stationary state of .
Interpolated hitting time and Extended hitting time: For any interpolated Markov chain , in refs. [18, 23], the authors define a quantity known as the interpolated hitting time in the context of spatial search which will also be useful here for subsequent analysis. This is defined as
[TABLE]
There is a relationship between the spectral gap of the Markov chain and since
[TABLE]
For the spatial search algorithm, we shall find that the quantity of interest is the extended hitting time. The extended hitting time of with respect to a set of marked elements is given by
[TABLE]
Clearly for , we have that . Krovi et al. proved an explicit relationship between and [18]. They showed that
[TABLE]
Combining Eqs. (18) and (20), we have
[TABLE]
Mixing-time of a Markov chain: Given a reversible Markov chain , any initial probability distribution over the state space converges to the stationary distribution , i.e. , for any initial distribution . Given and an initial state , the mixing-time of a classical random walk is defined as the minimum time such that we have that
[TABLE]
for some , where is the total variation distance.
That is, is the minimum time required for the Markov chain to converge to a distribution that is -close to the stationary distribution which implies that [36]:
[TABLE]
where is the spectral gap of and .
Thus, given an ergodic, reversible Markov chain with stationary state and spectral gap , one can sample from a distribution that is -close to in time . Next we discuss how one can use von Neumann measurements to prepare eigenstates of Hamiltonians, a tool which will help us provide an analog quantum algorithm for solving .
III Quantum state generation by von Neumann measurements
In this section, we make use of von Neumann measurements to prepare eigenstates of a Hamiltonian. The goal would be to use this technique to prepare the eigenstate of the quantum walk Hamiltonian (encoding an ergodic reversible Markov chain ) that corresponds to a coherent encoding of the stationary distribution of .
In this framework, in order to measure any observable , the system of interest is coupled to a pointer, which is simply a free particle in one dimension. If represents the Hamiltonian of the system and the momentum operator corresponding to the pointer, then the total Hamiltonian corresponding to the coupling between the system and the pointer is given by
[TABLE]
where is the mass of the free particle and is the interaction strength between the observable and the pointer. Since we are interested in measuring the energy of the system, we have . We consider the particle as “massive”, thereby enabling us to neglect the free Hamiltonian of the particle. Furthermore, we assume that we are working with units such that the interaction strength . These imply that
[TABLE]
It is well known that the momentum operator, is a generator of translation in the position of the particle. In other words, the operator applied to a wavepacket whose wavefunction is results in
[TABLE]
Thus the wavepacket is translated in position by . Now consider that the system Hamiltonian has eigenvalues
[TABLE]
such that . Furthermore, suppose that we initialize the pointer to a state , a wavepacket centred around [math]. Then,
[TABLE]
That is, the wavepacket is translated in position by and as such, measuring the displacement of the pointer register can in principle reveal information about the eigenstate of in the first register. By linearity, for any initial state , we have
[TABLE]
In order to implement this on a quantum computer, we assume that the pointer register is of qubits. The choice of is crucial as it determines the precision up to which the position of the pointer is obtained. In fact, if we measure the position of the pointer with a high enough precision to resolve all eigenvalue gaps, , a measurement of the position of the pointer results in a measurement of the system Hamiltonian .
For our purposes, we shall show how this formalism can be used to prepare the [math]-eigenstate of , i.e. , in a purely analog fashion. To that end, we formally state via Lemma 4 and Corollary 5.
Lemma 4
Let be a Hamiltonian with eigenvalues such that . Let represent the momentum operator corresponding to a free particle in one dimension with its mass large enough so that its free Hamiltonian can be neglected and so that it can be represented in qubits as
[TABLE]
where
[TABLE]
for some . Furthermore let
[TABLE]
Then starting from the state and evolving for a time according to the Hamiltonian , results in a state
[TABLE]
where and for .
In particular, for
[TABLE]
we have and for .
**Proof. **
If represents the momentum eigenstates, then the momentum operator is represented by
[TABLE]
Note that the position and momentum states are equivalent up to a Fourier transform and so the localized wavepacket centred at is completely delocalized in the momentum basis. That is,
[TABLE]
Therefore,
[TABLE]
Since we ultimately want to read off the position of the pointer variable, we re-express the pointer register in the position-basis to obtain
[TABLE]
The pointer register has a measure of the displacement of the wavepacket which was initially centered at . In fact, as shown previously, the shift will be proportional to the eigenvalue corresponding to the eigenstate in the first register (expressed in qubits). That is, we will have states of the form . We are interested in preparing the [math]-eigenstate . We first observe that the amplitude of obtaining in the pointer register when the first register is in the state is one, i.e.
[TABLE]
On the other hand, for any other eigenstate , the amplitude corresponding to measuring in the second register is
[TABLE]
Since for any and for any , we can bound this amplitude as
[TABLE]
where we have used the fact that so .
Thus, after the time evolution for a time , the state of the system and the pointer is given by
[TABLE]
where and for .
Since , for we have and for .
We shall use this lemma to derive the following corollary.
Corollary 5
Let where and suppose that the pointer register contains
[TABLE]
qubits initialized in the state , where .
Then repeating the Hamiltonian evolution of Lemma 4 with a total of -times using a fresh block of pointer qubits each time, followed by post-selecting on the pointer register to be in , succeeds with probability at least in constructing a quantum state such that
[TABLE]
in time
[TABLE]
**Proof. ** After the application of a total of -times using blocks of qubits in the pointer register each time, observe that for any , the amplitude for observing in the pointer register when there is in the first register is bounded by
[TABLE]
This implies that the resulting state after this procedure is given by
[TABLE]
where and . This takes time
[TABLE]
The state in Eq. (44) can be re-written as
[TABLE]
where the state
[TABLE]
has norm with
[TABLE]
This implies that post-selecting on obtain in the pointer register we obtain the state
[TABLE]
with probability , such that
[TABLE]
Thus Lemma 4 and corollary 5 can be used to prepare the eigenstate with success probability at least , and repeating this procedure times allows to boost this probability to , which leads to a protocol using total time
[TABLE]
Note that it would have been possible to use quantum amplitude amplification to reduce quadratically the dependency on . However, we are interested in developing analog algorithms, assuming that we have access to a time-independent Hamiltonian. Provided that the cost of preparing the initial state is small, the cost of the algorithm is the total time of Hamiltonian evolution. Moreover our protocol (Sec. VI) to prepare the stationary state of any reversible Markov chain ensures that , thereby resulting in at most a constant slowdown with respect to amplitude amplification.
IV Hamiltonian for quantum walk on any reversible Markov chain
Given any ergodic, reversible Markov chain, we shall make use of the Hamiltonian introduced by Somma and Ortiz [26] and subsequently used in Refs. [37, 23]. We recall the Hamiltonian and its spectral properties here for completeness and it will be used in our quantum algorithm for .
IV.1 Defining the Hamiltonian
Let denote the -entry of and let be the set of edges of . Furthermore let . Then one can define a unitary such that for all ,
[TABLE]
where the state represents a fixed reference state in . Let us also define the swap operator
[TABLE]
Observe that . Then, if , we have,
[TABLE]
so that . We define the search Hamiltonian as
[TABLE]
In Ref. [23], we have shown that , in a rotated basis, corresponds to a quantum walk on the edges of . That is, the rotated Hamiltonian
[TABLE]
corresponds to a quantum walk on the edges of . If the walker is localized in a directed edge from node to node , i.e. , then the walker can move to a superposition of outgoing edges from node of the form . Note that our algorithms (See Algorithm 2 and Algorithm 3) could be implemented using the Hamiltonian instead of . In such a case, we need to apply the same rotation to the initial state of the algorithm and the final state of the algorithm. However, subsequently we shall be working with as it simplifies the analysis considerably. In the next subsection, we will characterize the spectrum of .
IV.2 Spectrum of
As discussed in Sec. II.1, the spectrum of is related to that of and in particular, the state is an eigenstate of with eigenvalue zero. The spectrum of has been explicitly described in Ref. [37] and we mention it here for completeness. The total Hilbert space of can be divided into the following set of invariant subspaces:
For ,
[TABLE]
Now, observe that
[TABLE]
This implies
[TABLE]
i.e. is an eigenstate with eigenvalue [math].
On the other hand, note that for , the eigenstates and eigenvalues of in are
[TABLE]
where is a quantum state that is in such that . Thus, if the underlying Markov chain has a spectral gap , then in this subspace has a quadratically amplified spectral gap given by
[TABLE]
Now, there are eigenvalues of out of which belong to . The remaining eigenvalues are [math] and belong to which is the orthogonal complement of the union of the invariant subspaces. We need not worry about this subspace as we start from a state that has no support on which is an invariant subspace of . Thus, throughout the evolution under , our dynamics will be restricted to .
The following connection, first observed in [37], between the eigenstates of in and the interpolated hitting time introduced in Sec. II.2 will be useful to the complexity analysis of our algorithms
Proposition 6
Let be defined as
[TABLE]
Then, we have
[TABLE]
**Proof. ** From Eqs. (59-60), we have
[TABLE]
so that
[TABLE]
The bounds on then follow from the definition of and the fact that
[TABLE]
V Spatial search by continuous-time quantum walk using von-Neumann measurements
We first show how to make use of the state-generation scheme described in Sec. III to provide a continuous-time quantum walk based algorithm to solve the spatial search problem. This algorithm will provide an intuitive understanding of our analog quantum algorithm for .
Suppose we are given an ergodic, reversible Markov chain with the marked set denoted by . The spatial search algorithm on involves finding a node within this marked set and is often tackled by the formalism of random walks. We have seen previously in Sec. II.2 that the expected number of steps taken by the walker to find a node within this marked set is known as the hitting time of with respect to . Quantum walks provide a natural framework to tackle this problem. A natural question to ask is whether a quantum walk can offer any speedup over its classical counterpart in order to solve the spatial search problem. Here, we concentrate on the continuous-time quantum walk framework to tackle this problem.
The spatial search algorithm by continuous-time quantum walk on involves evolving a time-independent Hamiltonian (which encodes the connectivity of ), starting from some initial state, for some time, and then measuring in the basis spanned by the states of .
Childs and Goldstone [38] introduced the first continuous-time quantum walk-based algorithm to tackle the spatial search problem for simple, unweighted graphs. They showed that this algorithm, defined as a quantum walk on the nodes of the underlying graph, could find a marked node in time for certain graphs with nodes such as the complete graph, hybercube and -dimensional lattices with . This offered a quadratic speedup over classical random walks for the spatial search problem on these graphs. When , the running time of the Childs and Goldstone algorithm is , offering a less than quadratic speedup whereas there is no substantial speedup for . Since then, a plethora of results have been published exhibiting a running time on certain specific graphs each requiring an ad-hoc analysis [39, 40, 41, 42, 43, 44, 45, 46, 47]. Recently in Ref. [48], the authors provided the necessary and sufficient conditions for this algorithm to be optimal under very general conditions on the spectrum of the quantum walk Hamiltonian. They showed that attaining a generic quadratic speedup is impossible using this algorithm.
In Ref. [23], the authors provided a different spatial search algorithm by continuous-time quantum walk which finds a marked element on any ergodic, reversible Markov chain in square-root of the extended hitting time, thereby matching the running time of best known algorithms in the DTQW-framework in the case of where a single node is marked, i.e. [18, 19]. Given any , their algorithm made use of the framework of interpolating quantum walks
[TABLE]
where is the absorbing Markov chain such that any state having support only over the marked vertices is its stationary state. In fact, they used the Somma-Ortiz Hamiltonian (described in Sec. IV), to define quantum walk on the edges of . The underlying technique is to use a procedure called quantum phase randomization to (approximately) prepare a (mixed) state that has a constant overlap with the [math]-eigenstate of . For some specific value of , this eigenstate has a constant overlap with the marked subspace . This required measurement at a time chosen uniformly at random between , where is the extended hitting time, defined in Eq. (19).
In this section, we provide an alternative spatial search algorithm (Algorithm 2) by continuous-time quantum walk that finds an element from a marked set in time that scales as the square root of the extended hitting time. Algorithm 2 is similar in spirit to that of Ref. [23] in that both make use of the Somma-Ortiz Hamiltonian defined in Eq. (50).
However, motivated by the problem of quantum state generation using von Neumann measurements, Algorithm 2 prepares the [math]-eigenstate of by coupling this Hamiltonian to a free-particle in one dimension. Unlike the algorithm of Ref. [23], we evolve the Hamiltonian for a fixed time , where
[TABLE]
Furthermore, the [math]-eigenstate can be written as
[TABLE]
where and are as defined in Proposition 3. Thus it has a constant overlap with both and . The intuition behind the algorithm is as follows. In step 1, it either detects a marked node, in which case it succeeds with probability 1, or it prepares state . Since this state has overlap with , step 2 allows to prepare a state such that, if we were to project the third register on , we would obtain with probability close to a state close to . This state therefore has overlap close to over , so that step 3 will yield a marked node with probability close to .
We now formally state Algorithm 2 and prove its correctness in Lemma 7.
Lemma 7
Algorithm 2 outputs a marked node with probability at least in time
[TABLE]
**Proof. ** We shall make use of Lemma 4. Observe that for , the [math]-eigenstate of is simply with
[TABLE]
where and are as defined in Proposition 3. Also, the initial state in the first register is
[TABLE]
so at the end of step 1, we project on with probability (in which case we will immediately obtain a marked node in step 3), or we project on .
From Lemma 4, the time evolution of the Hamiltonian , starting from the state
[TABLE]
where and , prepares a state
[TABLE]
where and for .
Consider the measurement operator
[TABLE]
where is a projection on the states of the Markov chain. Since , the probability to measure [math] in the third register is
[TABLE]
From and , we can bound as
[TABLE]
where we have used Prop. 6 and the fact that .
In summary, we prepare with probability at least the state
[TABLE]
where, from the calculation above, the second term has norm at most . This state is therefore close to , more precisely, we have
[TABLE]
Let us finally consider the measurement in the state-space of the Markov chain. If the state we had prepared was we would obtain a marked node with probability since . Equivalently, if we define the projector to be the projector over marked nodes of the Markov chain, we have . The triangle inequality then implies
[TABLE]
This implies that measuring in the state-space of the Markov chain yields a marked node with probability at least . Since, in the previous step we had prepared this state with probability at least , overall the algorithm outputs a marked node with probability at least .
Finally, Eq. (20) implies that for , we have , so that the time to execute Step 2 of the algorithm is , which concludes the proof.
Algorithm 2 can be thought of as an analog version of the quantum spatial search algorithm by Krovi et al [18]. Although this algorithm requires additional ancillary qubits unlike the algorithm of Ref. [23], it will help create an intuitive understanding of our algorithm for solving , discussed in Sec. VI. In fact, the cost of preparing the initial state of the spatial search algorithm corresponds to the problem.
In Ref. [23], it was shown that can be simulated be using only query access to the discrete-time quantum walk unitary introduced in Ref. [18]. This connection would allow us to quantify the running time of our quantum algorithm in terms of basic Markov chain operations.
To that end, given a Markov chain , let us define the following oracular operations:
- •
Check (): Cost of checking whether a given node is marked. We denote this by .
- •
Update (): Cost of applying one step of the walk , which we denote by .
- •
Setup (): The cost of preparing the initial state , denoted by .
As from Refs. [18, 19], the cost of implementing , and consequently, the cost of evolving for constant time, is in , the running time of Algorithm 2 is
[TABLE]
The problem helps quantify the cost and intuitively, a quantum algorithm for this problem can be obtained by running the spatial search algorithm in reverse.
VI Analog quantum algorithm to prepare coherent encoding of the stationary state of a Markov chain
In this section we describe our algorithm which, given a reversible Markov chain with stationary state , prepares a state that is -close to the state
[TABLE]
A measurement in the basis spanned by the states of the Markov chain will allow us to sample from , thereby solving the problem. From Fact 2 and Proposition 3, we have that
[TABLE]
Thus, this is simply the highest eigenstate of the discriminant matrix or equivalently, the [math]-eigenstate of . Therefore, given , the problem of preparing boils down to the state-generation problem just as in the case of spatial search.
Following Lemma 4 and Corollary 5, one can think of an algorithm to prepare as follows.
Starting from some initial localized state where , one can evolve according to the Hamiltonian for a time that scales as to prepare with probability . Then by using -rounds of (fixed-point) amplitude amplification [49], one can prepare .
However, amplitude amplification is a discrete quantum algorithm and to the best of our knowledge it has no analog counterpart. As such, while constructing an analog quantum algorithm for this problem we cannot make use of amplitude amplification. We shall switch the value of to get around the need for amplitude amplification.
Consider the scenario where, given , one marks a single state , i.e. all the outgoing edges from are replaced with self-loops. We denote the absorbing Markov chain corresponding to this . Then the resulting interpolated Markov chain is
[TABLE]
If the entry of the stationary state of , corresponding to the marked element is , then we find that and so for
[TABLE]
and from Eq. (13) we have that
[TABLE]
Thus, the state has a constant overlap with . We can use this fact to establish the following connection between the average hitting time and the eigenstates of , following Proposition 6.
Proposition 8
Let be as defined in Proposition 6 and be the average hitting time of with respect to the vertex . Then, for ,
[TABLE]
Furthermore,
[TABLE]
**Proof. ** For , we have that for any , and substituting the expression of , we obtain that for any such that ,
[TABLE]
So, we have:
[TABLE]
Following proposition 6, we have
[TABLE]
Since we have a single marked node , the extended hitting time with respect to is equal to . The final expression of the proposition can be obtained by observing that from Eq. (20), we have .
Recall that the initial state of Algorithm 2 contained in the first register and our state-generation scheme resulted in the preparation of a state that has a constant overlap of with . For our algorithm, we assume that for any , the state is easy to prepare. The idea of the algorithm (See Algorithm 3) is to first invoke Lemma 4 to prepare an intermediate state that has a constant overlap with . We will prove that this can be achieved by setting , and choosing the initial state to be . For the second stage, we set and start from the state obtained in stage 1. By invoking Corollary 5, we prepare a state that is -close to with a constant probability. By this two-stage procedure, we can avoid the need to use amplitude amplification. We formally state the algorithm in Algorithm 3 and prove its correctness in Lemma 9.
Lemma 9
Algorithm 3 outputs a quantum state such that
[TABLE]
in time
[TABLE]
*with success probability at least . *
**Proof. **
First note that the 0-eigenstate of is given by Eq. (13) and so for , we have that
[TABLE]
Thus, the state can be written in the eigenbasis of as
[TABLE]
where and .
Now, we shall evolve the state according to the Hamiltonian for a time . From Lemma 4, we have that the resulting state
[TABLE]
where and for .
The probability to measure in the third register is
[TABLE]
From , and the analysis in Lemma 7, we can bound as
[TABLE]
where we have used Prop. 8 and the fact that . Thus, after stage one, with probability , we have prepared the state
[TABLE]
in the first register, where . The running time of stage one of the algorithm is
[TABLE]
For any constant , we have .
We shall use this state as the initial state for stage two of our algorithm. In this stage, we set and prepare a state that is -close to . Observe that the overlap , so
[TABLE]
We begin by expressing in the eigenbasis of . We have
[TABLE]
where and for any , .
Let , be the eigenvalue gap between the [math]-eigenstate of and the rest of the spectrum. So, if is the spectral gap of , .
From Lemma 4, by choosing , we obtain a state
[TABLE]
where now, for any such that , , and .
We can now use Corollary 5, with , and . By choosing , we prepare a quantum state that is -close to with probability at least . The running time of the second stage of the algorithm is in
[TABLE]
Now putting things together, Algorithm 3 prepares a state which is -close to . The first stage succeeded with probability and so the overall success probability is . The overall running time
[TABLE]
Interestingly, by Eq. (22) this also implies that the running time of our algorithm is actually the sum of the square root of the classical hitting time and the square root of the classical mixing time, i.e.
[TABLE]
Note that in general, the hitting time is at least as large as the mixing time of an ergodic, reversible Markov chain. Thus the running time is in fact,
[TABLE]
As mentioned in the previous section, the problem helps quantify the Setup cost of the spatial search problem (See Eq. (70)). As such, Algorithm 3 implies that the setup cost of Algorithm 2 is given by Eq. (81).
VII Time-averaged quantum mixing: Limiting distribution and mixing time
Now we shall deal with the problem and the notion of mixing time that arises from this problem. For any ergodic, reversible Markov chain , we have seen from Sec. II.2 that it is possible to sample from its distribution at (limiting distribution) after a time , known as the mixing time of , where is the spectral gap of . In fact, any initial distribution converges to the stationary distribution after applications of . In a strict sense, such a limiting distribution is absent for quantum walks as the underlying dynamics is unitary and hence, distance preserving.
However, one can define the mixing of quantum walks on a graph is defined in a time-averaged sense: the probability that the walker is at some node after some time , picked uniformly at random in the interval [27]. This gives a time-averaged probability distribution at any time and also a limiting probability distribution as . The mixing time of a quantum walk on any ergodic, reversible Markov chain is the time after which the time-averaged probability distribution is -close to the limiting probability distribution.
Consider any ergodic, reversible Markov chain with . Given , suppose denotes the underlying Hamiltonian corresponding to a quantum walk on . We require that the eigenvalues of lie between and , i.e. . Let the spectral decomposition of where is the eigenstate corresponding to the eigenvalue , . Furthermore, suppose that the initial state of the walker is .
Consequently, the state of the walker after a time is governed by the Schrödinger equation, i.e.
[TABLE]
In order to define a limiting distribution for quantum walks, one obtains a Césaro-average of the probability distribution, i.e. one evolves for a time chosen uniformly at random between [math] and followed by a measurement. The average probability that the state of the walker is some localized node is given by
[TABLE]
Thus, as , this leads to a limiting probability distribution, i.e.
[TABLE]
where the sum is over all pairs of degenerate eigenvalues. So, if has a simple spectrum, i.e. all its eigenvalues are distinct, then the sum is over all its eigenvalues.
In order to calculate how fast the instantaneous time-averaged distribution of the quantum walk converges to this limiting distribution, i.e. we need to bound the quantity .
In fact, it is easy to verify that they are -close, i.e.
[TABLE]
as long as
[TABLE]
This naturally leads to the following upper bound on the quantum mixing time
[TABLE]
There do exist differences between the quantum and classical limiting distributions. For example, in the quantum case, the limiting distribution is dependent on the initial state of the quantum walk. Also, unlike classical random walks, the quantum mixing time depends on all the eigenvalue gaps of as opposed to only the spectral gap.
Ignoring the numerator in the right hand side of Eq. (86), we need to evaluate the following quantity in order to upper-bound :
[TABLE]
As such we intend to obtain the best possible bounds for this quantity. To this end, let us define as the minimum eigenvalue gap of , over all pairs of distinct eigenvalues, i.e.
[TABLE]
Note that this is different from the spectral gap , which is the difference between the two highest eigenvalues of . We prove the following:
Lemma 10
If and are defined as in Eqs. (87) and (88) then
[TABLE]
**Proof. ** The lower bound is straightforward by noting that such that .
For the upper bound, we have that
[TABLE]
where we have used the fact that for any , . This implies that if , we obtain
[TABLE]
The upper bound on obtained in Lemma 10 leads directly to an upper bound on the quantum mixing time . This can be seen from the fact that the numerator in Eq. (86) is less than one and so
[TABLE]
for any .
All prior works hitherto have analyzed the problem for simple, unweighted graphs. Given a graph of nodes and edges, the underlying quantum walk is defined on the nodes of the graph with corresponding Hamiltonian being the (normalized) adjacency of the graph. That is, , where is an matrix such that each entry
[TABLE]
and is the spectral norm of .
In this section, we elaborate on the results of Ref. [6] and provide numerical evidence to back up our analytical bounds. In particular, we focus on the random matrix theory aspects of our proof, elaborating on the underlying concepts. Finally, by defining a quantum walk on the edges as in Sec. IV.1, we extend the notion of to any ergodic, reversible Markov chain.
VII.1 Erdös-Renyi random graphs
Let us consider a graph with a set of vertices . We restrict ourselves to simple graphs, i.e. unweighted graphs which do not contain self-loops or multiple edges connecting the same pair of vertices. The maximum number of edges that a simple graph can have is . Thus, there are graphs of edges and the total number of (labelled) graphs is [50]. We consider the random graph model , a graph with vertices where we have an edge between any two vertices with probability , independently of all the other edges [51, 52, 53] (See Fig. 1). In this model, a graph with edges appears with probability . In particular, if we consider the case , each of the graphs appears with equal probability . We shall refer to random graphs having a constant as a dense random graph. On the other hand, random graphs for which , i.e. when decreases with shall be referred to as sparse random graphs.
In their seminal papers, Erdös and Rényi introduced this model of random graphs and studied the probability of a random graph to possess a certain property [51, 52]. For example, they investigated properties such as the connectedness of the graph, the probability that a certain subgraph is present, etc. They stated that almost all graphs have a property if the probability that a random graph has goes to as . Equivalently, it can be stated that almost surely has property , i.e. property holds with probability .
Interestingly, certain properties of random graphs arise suddenly for a certain critical probability , where this probability depends typically on . More precisely, if grows faster than , the probability that the random graph has property goes to in the asymptotic limit, whereas if it grows slower than it goes to [math]. For example when the graph is almost surely connected, whereas if the graph has almost surely isolated nodes.
Here we shall concern ourselves with random graphs above the percolation threshold and calculate an upper bound on the quantum mixing time for quantum walks on such graphs. Observe that for a random graph, , its adjacency matrix, which we denote as , is an symmetric matrix with each non-diagonal entry being with probability and [math] with probability . All diagonal entries of are [math]. Thus is a discrete random matrix and knowledge of its eigenvalues and eigenvectors is crucial to obtaining the quantum mixing time.
Finally, from the aforementioned discussion, obtaining the quantum mixing time for can be interpreted as holding for almost all graphs.
VII.2 Random matrices: Spectral properties of
Here we look at the eigenvalues and eigenvectors of the random matrix .
As mentioned earlier, the Hamiltonian corresponding to the quantum walk on is simply the normalized Adjacency matrix of . The highest eigenvalue of , converges to a Gaussian distribution with mean and standard deviation , as . This fact was first shown in Ref. [54] for constant and was later improved for sparse random graphs () in Ref. [55]. In fact, as we shall show shortly it suffices to consider the matrix
[TABLE]
as the quantum walk Hamiltonian.
Let the eigenvalues of be , such that is the eigenvector corresponding to the eigenvalue , , i.e. . Then we have that for ,
[TABLE]
with probability , which implies that [55, 6].
It can also be shown that for the same range of , the second highest eigenvalue can be upper bounded as
[TABLE]
with probability [54, 56, 6]. This immediately implies that the spectral gap of , . Consequently, a classical random walk on mixes quite fast - in time.
However, it is clear from the expression for in Eq. (86) that the knowledge of all eigenvalue gaps are crucial in obtaining the quantum mixing time. As such require the knowledge of the spacings between all the eigenvalues of the random matrix .
**Semicircle law: ** It is well known that as , the spectral density of the bulk of the spectrum of converges to the well known semicircle distribution given by
[TABLE]
This implies that eigenvalues of lie within where
[TABLE]
is the radius of the semicircle. On applying the appropriate normalization, we find that the spectral density of converges to a semicircle of radius
[TABLE]
The fraction of eigenvalues of lying in some spectral window converges to the area of the semicircle within as . However, the semicircle law provides only a macroscopic description of the eigenvalues of , i.e. the aforementioned result holds only when . However, in order to obtain the quantum mixing time, we need information about all eigenvalue gaps including consecutive gaps where , which renders the semicircle law useless. As a result, for our purposes we need to look at mesoscopic and microscopic statistics of eigenvalues of .
However, for subsequent analysis, we shall require two results that can be obtained from the semicircle law itself which we state now. Note that there are eigenvalues with a radius of . This directly gives information about the average eigenvalue gap of given by
[TABLE]
Also from the semicircle law itself, one can define the so-called classical eigenvalue locations of . For each , we can define the classical location as the solution to the following equation
[TABLE]
Thus, the position of the classical location is obtained by filling up -area of the semicircle. From this condition one obtains that for , and some universal constant ,
[TABLE]
An identical estimate holds for the other half of the spectrum by symmetry.
**Eigenvalue rigidity criterion: ** The semicircle law was shown to hold for smaller spectral windows in Refs. [57, 55]. An immediate consequence of this fact is that the every eigenvalue (with the exception of ) of is located close to their classical eigenvalue positions. Formally, they showed that for and any the eigenvalues of satisfy
[TABLE]
with probability , where
[TABLE]
Eigenvalue rigidity does reveal information about eigenvalue gaps of . Note that for any one obtains
[TABLE]
As a result, whenever scales larger than , eigenvalue rigidity kicks in and an accurate estimate of is given by the difference between their classical eigenvalue locations, . That is, there exists some , such that for all ,
[TABLE]
From Eq. (103) and Eq. (104), we obtain that
[TABLE]
As such we cannot exploit eigenvalue rigidity to estimate gaps of the form as long as . Thus, eigenvalue rigidity does not provide information about the smallest eigenvalue gaps (for a pictorial representation of this fact, see Fig. 2) and reveals eigenvalue statistics of at a mesoscopic scale. Thus in order to obtain information about consecutive eigenvalue gaps of , we shall need to go to a microscopic scale.
Microscopic eigenvalue statistics of : At the microscopic scale, results are notoriously difficult to obtain. Tao and Vu [58] showed that has a simple spectrum for dense graphs, resolving a long-standing conjecture due to Babai [59]. Recently, this was resolved also for sparse graphs [60]. We state their results formally.
Fact 11
There exists a constant such that for , has a simple spectrum with probability .
The fact that every eigenvalue gap is non-zero implies that the expression for , the double sum in Eq. (87) can now be re-written as
[TABLE]
while the limiting probability distribution is
[TABLE]
Furthermore, Nguyen, Tao and Vu [61] proved that all the eigenvalue gaps of for dense random graphs are not only non-zero, but also separated. This was improved for the case of sparse random graphs by Lopatto and Luh [62]. In fact they asked the following question: how likely is it for any eigenvalue gap to be less than some times the average gap ? They proved that there exists a constant such that for ,
[TABLE]
for all .
Applying a union bound to this gives a lower bound on the minimum eigenvalue gap (defined in Eq. (88)) for . We prove that here.
Lemma 12
Lower bound on : For ,
[TABLE]
with probability .
**Proof. ** Let be the event that . Then using the union bound and Eq. (110), we obtain
[TABLE]
This implies that the probability that at least one of the gaps is less than is upper bounded by the right hand side of Eq. (112). By choosing
[TABLE]
we have that
[TABLE]
i.e. with probability , no is less than . This in turn implies that,
[TABLE]
with probability .
We are now equipped with the random matrix theory results, and in the next subsection, we derive a tight upper bound on the double sum , defined in Eq. (108).
VII.3 Upper bound on
As mentioned previously, in order to obtain the quantum mixing time, we first obtain bounds for the double sum . Recall that we an obtain lower and upper bounds for as
[TABLE]
In this subsection, our goal is to obtain an upper bound for that is as close as possible to its lower bound. To that end, our strategy would be to make use of the results on the eigenvalue statistics of at macroscopic, mesoscopic and microscopic levels. In particular, in order to evaluate , for , we shall make use of the tail-bounds on consecutive eigenvalue gaps in Eq. (110). On the other hand for , the eigenvalue rigidity criterion (See Eq. (104)) kicks in and we can replace with .
**Upper bound on : ** We first obtain an upper bound on the sum of the inverse of consecutive eigenvalue gaps, i.e.
[TABLE]
In the Supplemental Material of Ref. [6], we have explicitly derived an upper bound for . We restate the result here
Lemma 13** (Upper bound on [6])**
[TABLE]
with probability .
The key idea is that we count the number of consecutive eigenvalue gaps () lying within an interval of times the average gap and find that a high fraction of the -s lie within this window around the average gap. For a detailed derivation, we refer the readers to Ref. [6].
Now we can derive an upper bound on by combining mesoscopic and microscopic eigenvalue statistics of at different scales of . In particular, we use the upper bound on along with the eigenvalue rigidity condition. We state the upper bound on that we obtained in Ref. [6].
Lemma 14** (Upper bound on [6])**
For , the eigenvalues of satisfy
[TABLE]
with probability .
We provide an an intuition of the proof techniques and we refer the reader to the Supplemental Material of Ref. [6] for details. We first split into two different parts:
[TABLE]
For the first sum in the right hand side of Eq. (118), we are dealing with small eigenvalue gaps and hence we make use of the microscopic eigenvalue statistics, namely the upper bound on , i.e. we replace this sum with with the upper bound . On the other hand, for the second double sum, eigenvalue rigidity provides kicks in and the gaps between the classical eigenvalue locations , is a better estimate of than the tail bounds. In fact, an upper bound is obtained by replacing each eigenvalue gap with the lower bound from Eq. (103).
Observe that for dense random graphs, the upper bound on is quite close to its lower bound of . Having obtained this bound, we shall now upper bound the quantum mixing time for .
VII.4 Mixing of continuous-time quantum walks on
Here, we shall obtain the (i) limiting distribution of the quantum walk and the time after which the quantum walk converges (in a time-averaged sense) to this distribution, the quantum mixing time.
In order to obtain both these results, we make use of the fact that all the eigenvectors of are completely delocalized. In fact, it was conjectured in Ref. [63], that for dense random graphs, the eigenstates of are completely delocalized. This implies that when any of its eigenvectors is expressed in the basis of the nodes of the underlying graph, the absolute value of each entry is at most (up to logarithmic factors). Erdös et al. [55] answered this optimally even for sparse and the results therein were subsequently extended for any above the percolation threshold recently by He et al. [64]. This implies that as long as , for all
[TABLE]
with probability .
To the expression for the limiting probability distribution in Eq. (109), we substitute the delocalization of eigenvectors from Eq. (119) to obtain
[TABLE]
independent of , i.e. the limiting distribution converges to a (nearly) uniform distribution.
Observe that the upper bound on already provides an upper bound on the quantum mixing time. However, we can improve the bound it further if we assume that the quantum walk commences from an easy to prepare initial state. By this we mean that the initial state is a superposition over atmost a number of nodes. In fact, generally it is assumed that the initial state is localized at some node of the underlying graph, i.e. , which is standard.
If the quantum walk commences from an easy to prepare state,
[TABLE]
where is in , we can use Eq. (119) to obtain
[TABLE]
with probability .
Thus for ,
[TABLE]
for .
Observe that for dense Erdös-Renyi random graphs,
[TABLE]
Also, as decreases the upper bound on the mixing time increases. Unfortunately for sparser random graphs, i.e. for , such that , we cannot make use of eigenvalue rigidity. However simply using Lemma 13 along with the observation that
[TABLE]
for , gives us a weaker upper bound for the quantum mixing time in such regimes of sparsity. We obtain that
[TABLE]
In fact, the breakdown of rigidity estimates in [55] is not an artifact of the proof. For extremely sparse graphs, the optimal rigidity estimates that hold in dense graphs are known to break down [65].
Note that there exist weaker forms of rigidity of sparse graphs when which may lead to modest improvements of the exponent of in the mixing time. However, we have not expended too much effort optimizing the exponent as we are fundamentally limited by the smallest gap, (see lower bound of Eq. (114)) for which the bounds in [62] are still quite far from the conjectured behaviour. Obtaining the conjectured smallest gap behaviour appears to be a difficult problem in random matrix theory.
Finally, we numerically verify the analytical results obtained. Fig. 3 shows that for , the instantaneous time-averaged probability distribution () converges to a distribution that is close to the uniform distribution (horizontal dashed black line). While in Fig. 4 we plot with time and the inset plot depicts the exponent for the quantum mixing time () for random graphs of various sizes and . The numerical results conform with the analytically obtained upper bound for the quantum mixing time in Eq. (127).
VII.5 Mixing time for continuous-time quantum walks on any ergodic, reversible Markov chain
Our results thus far have provided an upper bound on the quantum mixing time for almost all simple unweighted graphs. Now we address the quantum mixing time for any ergodic, reversible Markov chain . Any symmetric matrix that captures the local connectivity of can be a used as a Hamiltonian for performing a CTQW on . As need not be symmetric in general, one cannot consider a quantum walk on directly. Given any such Markov chain , one can define the Hamiltonian as stated in Sec. IV (for ). In this section we consider the limiting distribution of a continuous-time quantum walk under , on the edges of .
Here, we shall explore whether any generic speedup is obtained for the problem. Note that the time evolution of some initial state , under the action of is given by
[TABLE]
The limiting probability distribution (note that now we are projecting on obtaining in the second register) is given by
[TABLE]
This implies,
[TABLE]
Also, the upper bound on the quantum mixing time is given by,
[TABLE]
where recall from Eq. (59) in Sec. IV.2 that .
Now the generic upper bound on the quantum mixing time is upper bounded by the double sum and as such
[TABLE]
where is the minimum eigenvalue gap of the Hamiltonian . We now need bound in terms of the minimum eigenvalue gap of , . To that end we have the following lemma:
Lemma 15
Suppose is an ergodic, reversible Markov chain with eigenvalues . Suppose is the spectral gap of and the minimum of all gaps between distinct eigenvalues of be . Then the minimum eigenvalue gap of the Hamiltonian , is bounded as
[TABLE]
**Proof. ** We know that for , in the relevant subspace, each eigenvalue of , , maps to . Thus if , then we have
[TABLE]
We are concerned with the minimum eigenvalue gap . Without loss of generality, we assume that has a simple spectrum (consequently, so does ) and for some , the eigenvalue gap is minimum for two consecutive distinct eigenvalues and . That is, for some value of , and henceforth we consider that value of . Observe that in such a case, the second term inside the square-root
[TABLE]
So expanding Eq. (135) according to Taylor series, we have
[TABLE]
This expression implies that the minimum eigenvalue gap of is mapped to the minimum eigenvalue gap of multiplied by the ratio of the corresponding eigenvalues of and . The upper and lower bounds follow from observing that for all , and , respectively.
So from Lemma 15, we have that for any ergodic, reversible Markov chain
[TABLE]
Let us now consider that is a symmetric, i.e. . Then the underlying quantum walk can also be performed on itself. Assuming that the eigenvalues of are ordered, for a continuous-time quantum walk on , from Eq. (132) and Lemma 15, we observe that the upper bound for the quantum mixing time may be faster or slower than a quantum walk performed on depending on where the minimum eigenvalue gap appears (See Fig. 5 for a pictorial representation).
If happens to be between two eigenvalues that are close to , and hence the upper bound on the quantum mixing time is in which is faster than the bound in Eq. (94). On the other hand, if is in the vicinity of , the upper bound on the quantum mixing time is given by Eq. (139).
For generic ergodic, reversible Markov chains however, this comparison is inapplicable as may not be symmetric and cannot be used as a CTQW Hamiltonian.
This is in contrast to the problem, where using offers a generic quadratic speedup over using as the Hamiltonian in Algorithm 3. This shows a fundamental difference between the two different notions of mixing for quantum algorithms as elucidated by and problems.
VIII Discussion
In this article we have discussed the two notions of quantum mixing and designed analog quantum algorithms to tackle these problems. First, using Hamiltonian evolution and von Neumann measurements, we have presented an analog quantum algorithm that, given an ergodic, reversible Markov chain outputs a coherent encoding of its stationary state. The running time of our algorithm matches that of its discrete-time counterparts. Secondly, we have also discussed the problem of sampling from the limiting distribution of a (time-averaged) continuous-time quantum walk. We have offered an intuitive explanation of the tools used in Ref. [6] to derive upper bounds on the mixing time for random graphs. We have also backed up the analytical results therein with numerical simulations and extended the time-averaged notion of mixing to any ergodic, reversible Markov chain.
Our results could pave the way for further research. For example, quantum state-generation using von Neumann measurements can be used to develop novel analog quantum algorithms. Note that our methods could be used to obtain other analog quantum algorithms for solving the problem. One could reverse the spatial search algorithm by Childs and Goldstone [38, 48] and use von Neumann measurements to prepare a coherent encoding of the highest eigenstate of underlying Hamiltonian. In the case of state-transitive graphs, this will allow for uniform sampling.
It would be interesting to explore whether using our framework, one can construct an analog quantum algorithm to fast-forward the dynamics of any ergodic, reversible Markov chain much like the results of Apers and Sarlette in discrete-time [22]. The challenge is that most of the underlying techniques that enable this, such as the recently developed techniques in the context of quantum simulation [66, 67, 68], are absent in continuous-time. However, the fact that the Hamiltonian defining our continuous-time quantum walk can be efficiently simulated using query access to the unitary defining the discrete-time quantum walk of Ref. [18] might offer useful insights towards designing such algorithms.
Our algorithm can also be used to prepare stationary states of slowly-evolving Markov chains, i.e. given a sequence of Markov chains , such that there is a significant overlap between the stationary distributions of any two consecutive Markov chains, meaning is large [7, 10, 13]. Given that one can prepare efficiently, the task is to prepare . Such situations arise in a host of approximation algorithms for counting as has been pointed out in Ref. [7]. Our algorithm will provide a quadratic speedup over that of Ref. [7] as given any , the spectral gap of the Hamiltonian defined in Sec. IV is amplified quadratically over the corresponding discriminant matrix, which acts as the Hamiltonian for the approach in [7].
For the problem of time-averaged mixing, it would be interesting to explore the possibility of obtaining better bounds on the quantum mixing time for any ergodic, reversible Markov chain. Furthermore, this notion of quantum mixing is closely related to the problem of equilibration of isolated quantum systems, a widely studied problem in quantum statistical mechanics [69]. As a result, our results can help obtain better upper bounds for the equilibration times of isolated quantum systems defined by random Hamiltonians.
Acknowledgements.
S.C. acknowledges funding from F.R.S.-FNRS. S.C. and J.R. are supported by the Belgian Fonds de la Recherche Scientifique - FNRS under grants no F.4515.16 (QUICTIME) and R.50.05.18.F (QuantAlgo). K.L. has been partially supported by NSF postdoctoral fellowship DMS-1702533.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Kurt Binder, Dieter Heermann, Lyle Roelofs, A John Mallinckrodt, and Susan Mc Kay. Monte carlo simulation in statistical physics. Computers in Physics , 7(2):156–157, 1993.
- 2[2] Emile HL Aarts and Peter JM Van Laarhoven. Statistical cooling: A general approach to combinatorial optimization problems. Philips J. Res. , 40(4):193–226, 1985.
- 3[3] Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Info Lab, 1999.
- 4[4] Walter R Gilks, Sylvia Richardson, and David Spiegelhalter. Markov chain Monte Carlo in practice . Chapman and Hall/CRC, 1995.
- 5[5] Throughout the article, we use \mathaccent V t i l d e 07 E 𝒪 ( . ) \mathaccent V{tilde}07E{\mathcal{O}}(.) and \mathaccent V t i l d e 07 E Ω ( . ) \mathaccent V{tilde}07E{\Omega}(.) to suppress polylogarithmic factors, i.e. \mathaccent V t i l d e 07 E 𝒪 ( f ( n ) ) = 𝒪 ( f ( n ) polylog ( f ( n ) ) ) \mathaccent V 𝑡 𝑖 𝑙 𝑑 𝑒 07 𝐸 𝒪 𝑓 𝑛 𝒪 𝑓 𝑛 polylog 𝑓 𝑛 \mathaccent V{tilde}07E{\mathcal{O}}(f(n))=\mathcal{O}(f(n)\mathrm{polylog}(f(n))) and \mathaccent V t
- 6[6] Shantanav Chakraborty, Kyle Luh, and Jérémie Roland. How fast do quantum walks mix? Physical Review Letters , 124(5):050501, 2020.
- 7[7] Dorit Aharonov and Amnon Ta-Shma. Adiabatic quantum state generation and statistical zero knowledge. In Proceedings of the thirty-fifth annual ACM symposium on Theory of computing , pages 20–29. ACM, 2003.
- 8[8] Alfred V Aho and John E Hopcroft. The design and analysis of computer algorithms . Pearson Education India, 1974.
