Quantum Approach to Fast Protein-Folding Time
Li-Hua Lu, You-Quan Li

TL;DR
This paper proposes a quantum walk framework for protein folding, showing that quantum strategies can significantly reduce folding times compared to classical methods, offering new insights into the Levinthal paradox.
Contribution
It introduces a quantum approach to model protein folding as a quantum walk, avoiding prior hypotheses and demonstrating faster folding times.
Findings
Quantum walk model yields shorter folding times.
Quantum approach outperforms classical random walks.
Framework provides new insights into protein folding dynamics.
Abstract
In the traditional random-conformational-search model, various hypotheses with a series of meta-stable intermediate states were often proposed to resolve the Levinthal paradox. Here we introduce a quantum strategy to formulate protein folding as a quantum walk on a definite graph, which provides us a general framework without making hypotheses. Evaluating it by the mean of first passage time, we find that the folding time via our quantum approach is much shorter than the one obtained via classical random walks. This idea is expected to evoke more insights for future studies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Quantum Approach to Fast Protein-Folding Time
Li-Hua Lu
Zhejiang Province Key Laboratory of Quantum Technology Device and Department of Physics, Zhejiang University, Hangzhou 310027, P.R. China
You-Quan Li
Zhejiang Province Key Laboratory of Quantum Technology Device and Department of Physics, Zhejiang University, Hangzhou 310027, P.R. China
Collaborative Innovation Center of Advanced Microstructure, Nanjing University, Nanjing 210008, R.R. China
Abstract
In the traditional random-conformational-search model various hypotheses with a series of meta-stable intermediate states were often proposed to resolve the Levinthal paradox. Here we introduce a quantum strategy to formulate protein folding as a quantum walk on a definite graph, which provides us a general framework without making hypotheses. Evaluating it by the mean of first passage time, we find that the folding time via our quantum approach is much shorter than the one obtained via classical random walks. This idea is expected to evoke more insights for future studies.
Understanding how proteins fold spontaneously into their native structures is both a fascinating and fundamental problem in interdisciplinary fields involving molecular biology, computer science, polymer physics as well as theoretical physics etc.. Since Harrington and Schellman discovered that protein-folding reactions are very fast and often reversible processesSchellman1956 , there has been progressively more investigations on protein folding in both aspects of theory and experiment. Levinthal Levinthal noted early in 1967 that a much larger folding time is inevitable if proteins are folded by sequentially sampling of all possible conformations. Thus the protein was assumed to fold through a series of meta-stable intermediate states and the random conformational search does not occur in the folding process. The questions about what are the energetics of folding and how does the denature cause unfolding motivates one to think that the protein folding proceeds energetically downhill and loses conformational entropy as it goes. Based on such a hypothesis, the free-energy landscape framework was one way to describe the protein folding Malla ; Wolynes ; Passway:Jackson , where the energy funnel landscape provided a first conjecture of how the folding begins and continues Wolynes2012 .
As we known, there have been substantial theoretical models with different simplifying assumptions, such as Ising-like model Ising1 ; Ising2 , foldon-dependent protein folding model Englander , diffusion-collision model Karplus ; Sali , and nucleation-condensation mechanism Thirumalai ; Fersht etc.. Theoretical models are useful for understanding the essentials of the complex self-assembly reaction of protein folding, but till now, they often rely on various hypotheses Wolynes2005 ; Shakhnovich2006 ; Wolynes2012 ; Dill2012 ; Thirumalai2013 . This often brings in certain difficulties in connecting analytical theory to experimental results because some hypotheses can not be easily put into a practical experimental measurement. As it introduces less hypotheses in comparison to those theoretical models, the atomistic simulations PianaS ; HenryR ; Snow were used to investigate the protein folding along with nowadays’ advances in computer science. Recently, a high-throughput protein design and characterization method was reported that allows one to systematically examine how sequence determines the folding and stability Rocklin2017 . However, quantitatively achieving the folding time and accurately understanding how the sequence determines the protein folding remain to be a key challenge.
Here we propose a quantum strategy to formulate protein folding as a quantum walk on a definite graph, which provides us a general scheme without artificial hypotheses. In terms of the first-passage probability, one can calculate the folding time as the mean of the first-passage time. The obtained folding time in terms of our quantum scheme is much shorter than the one obtained via classical random walks. This idea is expected to open a new avenue for investigating the protein folding theoretically, which may motivate a necessary step toward developing technology for protein engineering and designing protein-based nanodevices Munoz .
Theoretical consideration: We describe the protein structure by the frequently adopted lattice model GoModel ; HP:Dill ; LiTang1996 ; LiJi2004 , namely, a protein is regarded as a chain of non-own intersecting unit (usually referring an amino-acid residue) of a given length on the two-dimensional square lattice. For a protein with amino-acid residues, we can calculate the total number of distinct lattice conformations that distinguish various protein intermediate structures. For instance, we have and . This provides us a set with objects, which we call structure set and denote it by hereafter.
In order to study the protein folding process, we propose a concept of one-step folding. On the basis of the lattice model, we can naturally define the one-step folding by one displacement of an amino acid in one of the lattice sites. This makes us to establish certain connections between distinct points in the set and to have a connection graph . In other words, two structures are connected via one-step folding if their conformation differs in one site only. As a conceptual illustration, we plotted the structure set , the connection graphs and in Fig. 1 (the in Fig. S1 in supplementary material). Such a graph is described by the so-called adjacency matrix that characterizes a classical random walk Kampen1997 on the graph.
Folding as a quantum walk: Letting denote the state of a protein structure in the shape of the -th lattice conformation, we will have a quantum Hamiltonian in a -dimensional Hilbert space,
[TABLE]
where refers to the connection between different points in the structure set, i.e., is nonzero only if the -th protein structure can be transited into the -th structure by a one-step folding. With these physics picture one can also investigate quantum walk Aharonov1993 ; Farhi1998 ; WangJB2014 on the aforementioned graph.
From the coarse grained point of view, the 20 amino acids are classified HP:Dill as hydrophobic and hydrophilic (it is also called polar) groups according to their contact interaction. As and represent the hydrophobic and polar amino acids conventionally, a sequence of amino acids can be labeled by where with refers to either or . Thus there will be totally a set of possible sequences. Let us call the entire of the whole random sequences as the sequence set denoted by . For any definite sequence- specified by a , we can calculate the total contact energy LiTang1996 ; LiJi2004 for each structure in ,
[TABLE]
where labels different structures, and denote the successive labels of the amino-acid residues in the sequence (i.e., the order in the chain), while stands for the coordinate position of the -th residue in the -th structure and refers to either or . Here the notation of Kronecker delta is adopted, i.e., if and if . It is widely believed that the native structure of a protein possesses the lowest free energy Anfinsen . This can be interpreted by the hydrophobic force that drives the protein to fold into a compact structure with as many hydrophobic residues inside as possible HP:Dill . Thus the - contacts are more favourite in the lattice model HP:Dill ; HP:Onuchic ; HP:Shakhnovich ; HP:Wolynes , which can be characterized by choosing , , and as adopted in Ref. LiTang1996, .
With the contact energy (2) for every structure, the potential term can be expressed as
[TABLE]
Thus the total Hamiltonian for a definite sequence- is given by . Clearly, the kinetic term is determined by the connection graph merely while the potential term defined on the structure set is related to the concrete sequence- under consideration. This means that we have a hierarchy of Hamiltonian actually for a theoretical study of the protein folding problem.
Note that one may obtain the same contact energy for several different sequences. In this case, the dynamical properties are the same although those sequences may differ. Such a dynamical degeneracy implies a partition within the sequence set . There are totally 16 possible sequences in which is partitioned into three subsets, i.e., , thus there will be three situations in the discussion on the time evolution. For , there are totally 64 possible sequences in which is partitioned into 45 subsets, i.e., (see Tables SI SII in supplementary material).
Random walk with sticky vertices: As we known, the continuous time classical random walk Montroll1965 on a graph is described by the time evolution of the probability distribution that obeys the master equation
[TABLE]
where with being the probability-transition matrix. In the conventional classical random walk, the probability-transition matrix is determined by the adjacency matrix of an undirected graph, namely, where represents the degree of vertex- in the graph . However, we ought to reconsider the random walk if there are some “sticky” vertices in the graph. This corresponds to the case when we take account of the contact energy in the protein conformations. Thus, the probability-transition matrix should be modified so that the strength hopping into differs from that hopping out of those sticky vertices. The modified transition matrix is given by
[TABLE]
Here , , and where a notation is adopted for simplifying the expression. The newly added two terms in (5) together guarantee the probability conservation. Therefore, in the presence of sticky vertices, one needs to solve the master equation (4) with the modified in the discussion of classical random walks.
The quantum dynamics: To accomplish a quantum mechanical understanding, we take account of the energy dissipation caused by the medium in which the folding occurs. This is governed by the Lindblad equation Lindblad
[TABLE]
where
[TABLE]
reflects the effect of dissipation. Here and is called the Lindblad operator which can be determined from the analyses of random walks in the presence of sticky vertices. The aforementioned off-diagonal part in (5) provides this operator i.e., . Actually, equation (7) presents a general expression, which becomes the traditional one in terms of Pauli matrices, with for a two level system that can be regarded as the two-vertices graph with a sticky vertex.
We solve the density matrix from Eq. (6) with the initial condition . Here refers to the completely unfolded straight-line structure. To illustrate our theory intuitively, we start from the simplest model of where the protein-folding problem becomes a task to investigate the quantum walk on the graph . We solve the numerically for the three situations , and respectively. In the calculation, we set and to be unity and take the time step as . For the initial condition: and the other matrix elements vanish when , we solve Eq. (6) by means of Runge-Kutta method and obtain the magnitude of at any later time, with . We plot the time dependence of the diagonal elements of the solved density matrix for the case in Fig. 2(a) and the other cases in the supplementary material Fig. S3. Likewise, we solve the density matrix for another initial condition again so that the first-passage probability can be determined later on. The population of the most compact structure is evaluated by the diagonal element . For instance, in , and and in . We can see that the probability of the state referring to the most compact structure increases much more rapidly in the quantum folding process Fig. 2(a) than in the classical process Fig. 2(d). Toward a genuine understanding, we further study the quantum walk on the graph by solving the density matrix numerically one by one for the aforementioned forty-five situations.
The folding time: Now we are in the position to define the protein folding time which can be formulated with the help of the concept of the mean first-passage time FirstPassageTime1969 ; FirstPassageTime2001 ; FirstPassageTime2004 ; FirstPassageTime2007 ; FirstPassageTime2016 . The mean first-passage time from a starting state to a target state is given by where represents the time period when the first-passage probability vanishes which really occurs for the aforementioned quantum walk. For example in Fig. 2(b), the solved first-passage probability becomes negative after . The first-passage probability from a state to another state after time obeys the known convolution relation
[TABLE]
Here denotes the probability of a state being the basis state at time if starting from the state at initial time . Quantum mechanically, it is evaluated by the diagonal elements of the density matrix, i.e., where is solved from Eq. (6) with the initial condition , while is solved with another initial condition . Here the superscripts are introduced to distinguish the solution from different initial conditions. In the classical case, and refer to the solved from Eq. (4), respectively, with initial conditions and .
As protein folding is the process that proteins achieve their native structure, the folding time is the case that the starting state is chosen as and the target states are the most compact states. For example, they are , or for . The formula for the calculation of the folding time is thus given by
[TABLE]
To calculate the folding time we need to solve the first-passage probability as a function of from the convolution relation (8). As an illustration, we first consider the case of . For the classical folding process, we plot in Fig. 2(e) the and . With these two time-dependent functions, the first passage-probability can be further solved from the convolution relation (8) by numerical iterations (see Fig. 2(e)). It is nonnegative and approaches to zero when goes to infinity. This can be understood without difficulty because the classical probability distribution changes monotonously and approaches to its steady solution at the infinity time. However, for a quantum walk the probability distribution oscillates in time. We can see that the solved density matrix shown in Fig. 2(a) and Fig. S3 oscillates in time. With this new characteristics in quantum walk, the value of the first-passage probability solved directly from (8) appears to be negative in certain time region (see Fig. 2(b)) that is unphysical.
The zero point of determines the upper limit of the integration in the formula (9). In the simplest model with 4 residues, the classical folding times for the sequence subsets , and are , and respectively. Their corresponding quantum folding times are , and respectively. Clearly, the quantum folding is faster than the classical folding with about four to six times even for the simplest model. In the same way, we calculate the quantum folding time for the forty-five situations for the case with 6 residues (see Tables SIII, SIV SV in supplementary material). One can see that the quantum folding is faster than the classical folding with almost ten to hundred times or more. The experimental observation FoldingTimeExp ever exhibited that the protein folding is much faster than the theoretical prediction based on a random conformation search process. To visualize more easily we plot the quantum folding times in Figs. 2 (g) to (i). As a comparison, we also plot the ratios of classical folding time to the quantum folding time on the same panels. In those three histograms, the longest folding time takes place for the sequence subsets , and while the shortest folding time occurs for the sequence subset , and . The largest ratios occur for the subsets , and but the smallest ratios occur for , and .
In the above, we proposed a self-contained general theory to investigate protein folding problem quantum mechanically. In terms of - lattice model, one can always have a structure set for an amino-acid chain of any given number of residues. With such a structure set, one can naturally define a connection graph by means of our definition of one-step folding. Thus either a classical random walk or a quantum walk on the graph can be solved with standard procedures. The former implies a random conformational search while the latter involves in fact a parallel search due to the quantum mechanical coherence Christopher . The application of quantum walk has attracted more attentions XuPeng to study various contemporary topics in recent years, our present strategy may open a new avenue in the area of the application of quantum walks. We have known if proteins were folded by sequentially sampling of all possible conformations, the calculated folding time would be inevitably very large because there is a very large number of degrees of freedom in an unfolded polypeptide chains. We elucidated that the quantum evolution naturally helps us to understand a faster protein folding. In terms of the concept of first-passage probability, we can calculate the quantum protein folding time as the mean first-passage time. It is worthwhile to mention that the first-passage probability solved from the conventional convolution relation may take negative value in some time domain. This is very important for the application of the quantum approach to an investigation of protein folding time. According to our results for and , the quantum folding time is much shorter than that obtained from classical random walk. The presented theory is expected to bring in new insight features of protein folding process.
The work is supported by National Key R & D Program of China, Grant No. 2017YFA0304304, and partially by the Fundamental Research Funds for the Central Universities.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) W. F. Harrington, and J. A. Schellman, Evidence for the instability of hydrogen-bonded preptide structures in water, based on studies of ribonuclease and oxidized ribonuclease, C R Trav Lab Carlsberg Chim 30 , 21 (1956).
- 2(2) C. Levinthal, Are there passways for protein folding, J. Chem. Phys. 65 , 44 (1968).
- 3(3) F. Mallamace, C. Corsaro, D. Mallamace, et al. , Energy landscape in protein folding and unfolding, PNAS 113 , 3159 (2016).
- 4(4) J. J. Portman, S. Takada, and P. G. Woylnes, Variational theory for site resolved protein folding free energy surfaces, Phys. Rev. Lett. 81 , 5237 (1998).
- 5(5) S. E. Jacksom, How do small single-domain proteins fold? Folding Des. 3 , R 81 (1998).
- 6(6) P. G. Wolynes, W. A. Eaton, and A. R. Fersht, Chemical physics of protein folding, PNAS 109 , 17770 (2012).
- 7(7) V. M u ~ ~ 𝑢 \tilde{u} noz, and W. A. Eaton, A simple model for calculating the kinetics of protein folding from three-dimensional structures, PNAS 96 , 11311 (1999).
- 8(8) E. R. Henry, and W. A. Eqton, Combinatorial modeling of protein folding kinetics: Free energy profiles and rates, Chem Phys 307 , 163 (2004).
