Generic Dynamical Phase Transition in One-Dimensional Bulk-Driven Lattice Gases with Exclusion
Alexandre Lazarescu

TL;DR
This paper demonstrates a universal dynamical phase transition in one-dimensional bulk-driven exclusion processes, showing different scaling behaviors for high and low current deviations, and extends analysis methods to more general models.
Contribution
It generalizes the understanding of dynamical phase transitions in exclusion processes by including spatial inhomogeneities and short-range interactions, beyond exactly solvable models.
Findings
High current deviations scale extensively with system size.
Low current deviations are independent of system size.
Transition relates to pushing current beyond hydrodynamic limits.
Abstract
Dynamical phase transitions are crucial features of the fluctuations of statistical systems, corresponding to boundaries between qualitatively different mechanisms of maintaining unlikely values of dynamical observables over long periods of time. They manifest themselves in the form of non-analyticities in the large deviation function of those observables. In this paper, we look at bulk-driven exclusion processes with open boundaries. It is known that the standard asymmetric simple exclusion process exhibits a dynamical phase transition in the large deviations of the current of particles flowing through it. That phase transition has been described thanks to specific calculation methods relying on the model being exactly solvable, but more general methods have also been used to describe the extreme large deviations of that current, far from the phase transition. We extend those methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Generic Dynamical Phase Transition
in One-Dimensional Bulk-Driven Lattice Gases with Exclusion
Alexandre Lazarescu
Complex Systems and Statistical Mechanics Group, University of Luxembourg
Abstract
Dynamical phase transitions are crucial features of the fluctuations of statistical systems, corresponding to boundaries between qualitatively different mechanisms of maintaining unlikely values of dynamical observables over long periods of time. They manifest themselves in the form of non-analyticities in the large deviation function of those observables.
In this paper, we look at bulk-driven exclusion processes with open boundaries. It is known that the standard asymmetric simple exclusion process exhibits a dynamical phase transition in the large deviations of the current of particles flowing through it. That phase transition has been described thanks to specific calculation methods relying on the model being exactly solvable, but more general methods have also been used to describe the extreme large deviations of that current, far from the phase transition.
We extend those methods to a large class of models based on the ASEP, where we add arbitrary spatial inhomogeneities in the rates and short-range potentials between the particles. We show that, as for the regular ASEP, the large deviation function of the current scales differently with the size of the system if one considers very high or very low currents, pointing to the existence of a dynamical phase transition between those two regimes: high current large deviations are extensive in the system size, and the typical states associated to them are Coulomb gases, which are highly correlated ; low current large deviations do not depend on the system size, and the typical states associated to them are anti-shocks, consistently with a hydrodynamic behaviour. Finally, we illustrate our results numerically on a simple example, and we interpret the transition in terms of the current pushing beyond its maximal hydrodynamic value, as well as relate it to the appearance of Tracy-Widom distributions in the relaxation statistics of such models.
driven exclusion process; dynamical phase transition; current fluctuations; large deviations.
pacs:
05.40.-a; 05.60.-k; 02.50.Ga
Contents
-
IV.3.4 Conclusion: asymptotics of the large deviation function of the currents
-
IV.4.2 Non-hydrodynamic behaviour for long-range interactions
I Introduction
The main mission of statistical physics is to bridge the gap between different levels of description of physical systems, from a microscopic level where we know the laws governing each of the myriad of components of the system and their interactions, to a macroscopic level where the whole system is described through emergent laws relating only a handful of global observables. The law of ideal gases is an extreme example of that, where three quantities are enough to describe the typical state of a somewhat caricatural system in a rather specific set-up, but it is essentially the same question that is asked when one want to describe, for instance, a model of particles moving and interacting on a lattice, by a Langevin equation on just the local average density of particles with a conserved Gaussian noise. Obtaining such a hydrodynamic description, and determining whether the Gaussian noise does reproduce faithfully the fluctuations of the lattice model at that scale, is in general an extremely challenging problem.
For systems that are close to equilibrium, where detailed balance is broken only infinitesimally in the large size limit, such as boundary-driven or weakly bulk-driven models, the aforementioned macroscopic hydrodynamic description is known to hold quite generally, under the names of macroscopic fluctuation theory (MFT, Bertini2007 ) and additivity principle PhysRevLett.92.180601 . The question becomes more interesting when one considers systems that are far from equilibrium, where no such general result exists. There, from an analytical perspective, one is mostly restricted to study specific models or classes of models which are amenable to calculations, which often means exactly solvable, such as zero-range processes Harris2005 ; Chleboun2016 or exclusion processes Derrida199865 ; 0034-4885-74-11-116601 . Moreover, since the question involves not only the typical behaviour of the systems at a macroscopic scale, but their fluctuations as well, the relevant framework is that of large deviation functions Touchette20091 , which are the dynamical equivalent of equilibrium free energies.
The most interesting feature of those large deviation functions are so-called dynamical phase transitions. Just like their equilibrium equivalents, they appear as non-analyticities which correspond to boundaries between qualitatively different behaviours of the system when changing its parameters. Unlike for equilibrium phase transitions, however, dynamical control parameters are usually abstract quantities which do not correspond to actual tunable parameters of the physical model Jack2015a , and those different qualitative behaviours correspond to the best way in which the system can fluctuate to maintain an atypical value of some observable for a long time, rather than stable state changes that could be observed experimentally by tuning the environment. This is not to say that dynamical phase transitions have no experimentally measurable consequences, as for instance models which are critical in that respect (i.e. where the stationary state sits precisely at a dynamical phase transition) will show non-Gaussian fluctuations, anomalous diffusive behaviours and special dynamical exponents, a famous example being models in the KPZ universality class Spohn2016 ; Corwin2016 . Identifying those dynamical phase transitions is quite crucial when attempting to describe the macroscopic behaviour of stochastic systems: they define the domains of applicability of any proposed description.
In this paper, we focus on one-dimensional bulk-driven open simple exclusion processes (SEP), where particles jump from site to neighbouring site on a one dimensional lattice, with a bias towards one side (e.g. the right), and can enter or exit the system only at the boundaries. The simple exclusion constraint means that the particles interact through hard-core repulsion: at most one particle can be on a given site at a given time. These models are among the most studied in non-equilibrium statistical physics, both analytically for the exactly solvable versions derrida1993exact ; 1751-8121-40-46-R01 ; Lazarescu2015 (which can also have periodic boundary conditions derrida1998exact ; Prolhac2009 and symmetric or weakly asymmetric jumps Enaud2004 ; Prolhac2009a ), and numerically in the case of non-solvable generalisations 1742-5468-2008-06-P06009 ; Greulich2008 ; Ciandrini2010 .
For such transport models, the most important observables are usually the macroscopic currents of particles (or whatever charges are being transported), which are a direct consequence of the system being driven out of equilibrium, and which can be expected to play an important role in its macroscopic behaviour. In particular, any dynamical phase transition that the system might undergo will most likely be visible in the large deviation function of those currents, which is a contraction of the full joint large deviation function of currents and densities. Indeed, such transitions have been found, for instance in the weakly asymmetric version of the model, both for periodic boundary conditions PhysRevE.72.066110 ; Bodineau2008 ; appert2008universal ; Simon2011 ; Espigares2013 and open ones Lecomte2010 ; Baek2016a .
A dynamical phase transition has also been found in the large deviations of the current of the asymmetric simple exclusion process, both for total asymmetry (TASEP, where particles jump only to the right) and partial asymmetry (PASEP or simply ASEP, where particles can jump backwards as well), and both in the periodic and the open geometry. Those models are integrable (in the sense of quantum integrability Faddeev1996 ; Lazarescu2014 , through its connexion with the XXZ spin chain sandow1994partially ), meaning that one can in principle obtain an exact expression for the large deviation function, and describe the transition analytically. This was first done in a periodic and totally asymmetric setting in derrida1998exact and derrida1999universal , which was then extended to the partially asymmetric case Prolhac2009 ; prolhac2010tree . In the open setting, part of the large deviation function was correctly conjectured in Bodineau2006 and later confirmed in DeGier2011a , and the full large deviation function for the TASEP was obtained soon afterwards Lazarescu2011 and later extended to the ASEP gorissen2012exact ; Lazarescu2014 . In all of these works, the results were obtained using integrability methods such as the Bethe Ansatz, and are in principle restricted to small fluctuations of the current because of approximations made towards the large size limit, except for Popkov2010 which deals with the limit of very high currents in the periodic ASEP.
Finally, the complete dynamical phase diagram of the current for the open ASEP was obtained in Lazarescu2013 ; Lazarescu2015 , by combining the aforementioned results for small fluctuations with exact diagonalisation methods for extreme values of the current. It was found that in the very high current limit, the system effectively behaves like a discrete Coulomb gas, as it does in the periodic case Popkov2010 , with a large deviation function proportional to the size of the system ; in the very low current limit, the system has an effective dynamics involving only anti-shock states, and the large deviation function does not depend on . The whole phase diagram is then obtained by interpolating between those tree regimes, and by conjecturing that the MFT can be used to obtain a large part of it (which is compatible with all the exact results). A dynamical phase transition is identified, which includes the stationary maximal current phase, and separates the two different scalings in .
As we mentioned, a large part of those results are obtained by integrability methods, which are unfortunately entirely specific to that very special model. Luckily, the aforementioned exact diagonalisation methods for extreme currents do not require the model to be integrable, which means that they can be applied in more general settings. Indeed, in this paper, we consider a much broader class of models, by adding two features to the ASEP: arbitrary spatial inhomogeneities in the jump rates, and an arbitrary short range interaction between the particles. We will see that the same methods apply (albeit in a much more involved way) and yield essentially the same results: the large deviation function of the current is proportional to for high currents, and independent of for low currents. This points to the existence of a dynamical phase transition separating those two regimes, although this time no description of the transition itself is available, and in particular no information on the critical exponents associated to it, which leads us to qualify the transition as generic rather than universal.
The structure of this paper is as follows. In section II, we first introduce the reader to the models and formalism we will be using throughout the paper, along with a few useful standard results. In section III, we focus on the high current limit, where we first recall the results pertaining to the regular ASEP, before extending them rather straightforwardly to our generalised versions. In section IV, which contains the main technical result of this paper, we do the same for the low current limit, by finding bounds on the behaviour of both the large deviation function of the current, and the shape of the typical states associated to those fluctuations. We then provide numerical illustrations for those results in section V, as well as an interpretation of the origin of the dynamical phase transition in terms of maximal hydrodynamic currents, and discuss its relation to the KPZ universality class BenArous2011 and third order phase transitions LeDoussal2016 . We finally conclude and discuss a few extensions that our result could receive in the future.
II Models and formalism
In this first section, we define the models that we intend to study, and present the formalism in which they will be treated, as well as the mathematical tools we will use in order to access the fluctuations of the current of particles flowing through them.
II.1 Markov matrix and master equation
The basic model upon which we aim to build is the asymmetric simple exclusion process (ASEP). It is a Markov process in continuous time defined on a finite one-dimensional lattice of size , be it periodic or open and connected to reservoirs at both ends. Each site can be either empty, or holding one particle, and particles hop from site to site with a rate to the right and to the left (as if driven by a field ). In the open case, particles can enter the system on the first site with rate and on the last with rate , and exit from the first site with rate and from the last with rate . In all these cases, if the target site is already occupied, the exclusion rule means the particle cannot jump. The rates for the open case are summarised on fig.1.
We will also be considering a simpler version called the totally asymmetric simple exclusion process (TASEP), where the particles can only jump to the right, which is to say that . Note that we will be focusing on the open case, but that everything we will say is easily transposed to the periodic case, as will be regularly pointed out.
To put that into equations: states of the system are written as configurations , where if site is empty and if it is occupied. The probability vector , of which an entry gives the probability to be in configuration at time , obeys the master equation
[TABLE]
where is the Markov matrix of the open ASEP:
[TABLE]
with
[TABLE]
It is implied that acts as written on site (and is represented in basis for the occupancy of the first site), and as the identity on all the other sites. Likewise, acts as written on site , and on sites and (and is represented in basis for the occupancy of those two sites). Each of the non-diagonal entries represents a transition between two configurations that are one particle jump away from each other. Note that we will be using the convention where is a transition from to , consistently with acting to the right on .
We will be generalising this model in two ways. First, by adding an interaction potential to configurations, which does not have to be two-body or translation-invariant (note that we will later assume to be short-range and bounded in order to prove the results in section IV). This is done by multiplying the transition rate from to by . If the system were in equilibrium (if for instance in the periodic case), the stationary probability of would then be up to a normalisation. Note that in order not to introduce too many irrelevant quantities, the Boltzmann inverse temperature is not written explicitly (i.e. either taken equal to , or as an inverse unit of energy).
Secondly, by adding on-site inhomogeneities in the jump rates, i.e. by having them depend on space: and become and , where is the label of the bond involved in the transition, starting at between the first and second site (so that a particle on site jumps to the right with rate and to the left with rate ), consistently with the notation used for the boundary rates. This second generalisation merely adds an index to the entries of , but the first one modifies the structure of : the entries of can now depend on all the details of the initial and final configurations, so that they are not effectively of dimension any more. Note that part of the inhomogeneity can be absorbed in , so that these two generalisations are not orthogonal, but this will not be important for us.
We will be generically writing the transition rates of our process as , and we will take to mean that there is a transition from to . It will be useful for future calculations to decompose the Markov matrix of this generalised simple exclusion process into three pieces:
- •
, containing the rates for jumps to the right, of the form ;
- •
, containing the rates for jumps to the left, of the form ;
- •
, containing the escape rates, i.e. the diagonal part of , of the form .
Note that the addition of the potential to and is a conjugation (division to the left, multiplication to the right) of the potential-less rates with a diagonal matrix , but this is not the case for .
II.2 Deformed Markov matrix for the current
Now that we have defined our processes, we can see how to access the statistics of the stationary current. The simplest, most tractable way to do that, for our purposes, is through the cumulants of the current.
Let us say that we want to know, for instance, the stationary statistics of the time-averaged current of particles across the bond between the left reservoir and the first site. We should, in principle, from a given initial condition, look at all the possible histories of the system for a certain runtime , count the number of particles crossing that bond in either direction, and compute the probability that the difference of these two numbers is close to . A simpler way to proceed is to introduce a fugacity (or counter or Lagrange multiplier) for that algebraic number of ingoing jumps, and multiply, in , the rate which increases that count by one (i.e. in ) by and the rate which decreases it (i.e. in ) by . By using these deformed rates instead of the original ones in our process (which is not a proper Markov process any more, since we have not modified ), every history will be multiplied by as many fugacities as there was jumps, i.e. by a factor . Summing these weighted probabilities and taking derivatives with respect to will then yield the moments of , which we can then relate to their probability distribution.
This can be rephrased neatly in a few equations. For the sake of compactness, we will consider the case where , so that we can write in a simple way, but it is straightforward to include the potential as well. Let us now assume that we want to monitor all of the time-averaged currents , across each of the bonds in the system, independently, in order to obtain their large deviation function defined through
[TABLE]
in the large limit, where is the algebraic number of jumps made between sites and from time [math] till . We associate a fugacity to each current (fig.2), and write simply for the vector containing the ’s.
Consider then the following matrix
[TABLE]
with
[TABLE]
(where, as before, it is implied that acts as written on site [math] in the basis and as the identity on the other sites, and the same goes for on site ; similarly, is expressed by its action on sites and in the basis and acts as the identity on the rest of the system).
As we said earlier, summing the weights of histories obtained by using this matrix as a generator for a time from an initial state yields the generating function of the joint moments of all the currents up to a trivial dependence on . Taking the logarithm of that generating function and dividing by therefore gives the (rescaled) generating function of the cumulants of the current , in the limit of large times:
[TABLE]
It is straightforward to show from that expression that is in fact the largest eigenvalue of : for large, we can write
[TABLE]
where is the largest eigenvalue of and and are the corresponding eigenvectors. Injecting this in (7) yields . All of these elements hold important information regarding the stationary fluctuations of the current and the fluctuating states themselves. We can show, by making a saddle-point approximation for large in the definition of , that Touchette20091 ; Lazarescu2015 :
- •
Gärtner-Ellis theorem: the large deviation function of the currents, , where is the vector holding all the individual currents , is the Legendre transform of , i.e.
[TABLE]
- •
The right eigenvector holds probabilities of final configurations conditioned on the currents that have been observed, up to a normalisation, i.e.
[TABLE]
- •
The left eigenvector holds probabilities of initial configurations conditioned on the currents that will be observed, up to a normalisation, i.e.
[TABLE]
- •
The product of these two probabilities gives the stationary probability of configurations (far from the initial and final time) conditioned on the currents that are observed, up to a normalisation, i.e.
[TABLE]
In all of these cases, the currents we consider are time-averaged, but for a finite-sized system, they also correspond to stationary instantaneous currents (in other words, a given average current is best realised, over a long period, by a constant instantaneous current of the same value). These properties are not specific to one-dimensional simple exclusion processes, but are in fact valid for any currents in any Markov process on a finite configuration space.
We can significantly simplify these expressions for our models, due to the fact that they are one-dimensional, with conservative dynamics in the bulk (no particles are created or destroyed). This means that stationary currents cannot depend on space: there can be no prolonged build-up or depletion of particles anywhere in the system. Perhaps the easiest way to see this formally is through the following procedure.
Consider the diagonal matrix (with ) with an entry for all configurations for which site is occupied, and otherwise. One may easily check that the matrix similarity simply replaces and by, respectively, and , and leaves the rest of unchanged. That is to say that part of the deformation is transferred from to . Using combinations of these transformations for any sites and parameters , we conclude that all the Markov matrices deformed with respect to the currents are similar, and therefore have the same eigenvalues, as long as the sum of the deformation parameters is fixed. Note that the eigenvectors are different, but related to each other through those simple transformations, and that the product of the two eigenvectors, , is invariant as well.
In short, the two quantities that give information on the stationary fluctuations of the current in the system, and , only depend on the sum of the ’s, which we will write as , and all the ’s have the same value . We may then rewrite eq.(9) as
[TABLE]
where all the variables are now scalars. The freedom we have in distributing the ’s for a given is quite useful in practice for calculations. In this paper, we will always choose .
II.3 Current and entropy production
A physically meaningful consequence of this simplification, which also highlights the importance of the current as an observable, is that this space-independent current is exactly proportional to the entropy production in the system. Consider the particular set of weights defined by , for which becomes :
[TABLE]
which is the deformed Markov matrix measuring the entropy production. We see immediately that
[TABLE]
which implies the Gallavotti-Cohen symmetry PhysRevLett.74.2694 ; Lebowitz99agallavotti-cohen for the eigenvalues and between the left and right eigenvectors of with respect to the transformation .
Considering that \mu=\nu\log{\Bigl{(}\frac{\prod_{i}p_{i}}{\prod_{i}q_{i}}\Bigr{)}}, we also obtain the Gallavotti-Cohen symmetry related to the current, namely
[TABLE]
which is also valid for the other eigenvalues of , and the corresponding relations between the right and left eigenvectors, as well as a simple relation between the microscopic entropy production , conjugate to , and the macroscopic current , conjugate to :
[TABLE]
Two remarks are to be made here. First of all, those weights are ill-defined for the TASEP: micro-reversibility (i.e. the fact that for any allowed transition, the reverse transition is also allowed) is essential to have a fluctuation theorem. Moreover, if we take either the or the limit (with all the ’s being finitely smaller than the ’s, so that the logarithm is of order ), the centre of the Gallavotti-Cohen symmetry \mu^{\star}=-\frac{1}{2}\log{\Bigl{(}\frac{\prod_{i}p_{i}}{\prod_{i}q_{i}}\Bigr{)}} is rejected to , so that the ‘negative current’ part of the fluctuations is lost. This will in fact be useful to us: assuming that we can exchange the two limits (the validity of this assumption will be discussed in the conclusion), we will be able to get information about the limit by taking to in a totally asymmetric situation, which is much simpler than taking it to -\frac{1}{2}\log{\Bigl{(}\frac{\prod_{i}p_{i}}{\prod_{i}q_{i}}\Bigr{)}} in a partially asymmetric one.
Secondly, we may consider the detailed balance case, where . In that case, there is no entropy production whatsoever, i.e. that , as we see from eq.(17). This does not mean, however, that : the deformations through and are in that case not equivalent. The only implication this has on is that it is an even function: , all the odd cumulants are zero, and positive and negative currents of the same amplitude are equiprobable.
Note that, apart from the precise expressions of and , all of this holds equally well for the case with an extra interaction potential .
II.4 A note on asymptotics and Legendre transforms
In the rest of this paper, we will see how analysing the asymptotic behaviour of for can give us information on the nature of extreme fluctuations of the current, and indicate the existence of a generic dynamical phase transition. This will be done by obtaining asymptotic expressions or bounds on the generating function of cumulants from diagonalising for , and taking their Legendre transforms. In order to do that, one has to be careful to check that the Legendre transform of the asymptotic equivalent of is indeed an asymptotic equivalent of the Legendre transform of the real . This will be done in appendix A.
Moreover, all calculations will be done at a size large but finite, so that the limits are taken before . These two limits do not commute in principle, meaning that our results will not be a proof of the existence of a dynamical phase transition but rather a strong indication of it. In particular, in the case of a very inhomogeneous system (possibly with quenched disorder), or a very unphysical potential, the behaviour of and the presence of a sharp dynamical transition will depend on how the limit is taken on and . On the contrary, for a well-behaved and local , and slow-varying disorder , the phase transition is likely to be similar to that which can be observed in the simple ASEP derrida1999universal ; Lazarescu2015 , although it is unclear which of its features are universal. This will be illustrated with a few numerical plots in section V.
III High current limit
We will first consider the limit where , which corresponds to .
As before, we can write
[TABLE]
Note that the diagonal part of is not deformed. The generic form of the entries of and for a given transition and its reverse is
[TABLE]
From what we saw in section II.2, we can find a function such that the matrices and have rates
[TABLE]
so that in the limit, and become negligible, and the problem reduces to the analysis of .
III.1 Mapping to an XX spin chain
We are now considering only the right-moving part of , with entries
[TABLE]
between configurations which differ by one jump to the right. Using matrix similarities, we can significantly simplify the problem. First of all, it is clear that we can get rid of through the similarity , so that has no influence on large positive fluctuations of the current (as long as takes finite values). We are left with entries of the form , which is to say that we can treat the inhomogeneity in the rates as an inhomogeneity in the deformations . Since only their sum matters, only will appear in the fluctuations of the current, so that the inhomogeneity is also virtually irrelevant.
Putting all this together, we are left with a much simpler matrix to analyse : the right-moving part of a standard ASEP, with all , and with a pre-factor . It turns out that it is in fact more convenient to keep a factor in the boundary matrices, so that we are left to study the matrix
[TABLE]
where and are matrices which respectively add and remove a particle at site .
We may recognise this to be the upper half of the Hamiltonian of an open XX spin chain Bilstein1999 . Moreover, it happens to commute with its transpose, thanks to the factors on each side. We know, from the Perron-Frobenius theorem, that the highest eigenvalue of that matrix is real and non-degenerate. It is therefore also the highest eigenvalue of its transpose, with the same eigenvectors (because they commute). This allows us to define their average , which has the same highest eigenvalue and the same eigenvectors as . Forgetting about the constant pre-factor for the time being, is given by:
[TABLE]
which is the Hamiltonian for the open XX chain with spin and extra boundary terms and (with ). Luckily, we can diagonalise it exactly.
III.2 Largest eigenvalue of H
We will not give here the full derivation of the diagonalisation of , which involves standard free fermion techniques, but only the results relevant to this paper. For more details, the reader can refer to section V B of Lazarescu2015 , or to Bilstein1999 , where this spin chain is studied with more general boundary conditions.
This system has independent excitations, with energies
[TABLE]
and a vacuum energy of
[TABLE]
The highest eigenvalue is therefore obtained by adding all the positive excitation energies, which yields
[TABLE]
Remembering the global pre-factor which we removed earlier, and after simplification, we finally get:
[TABLE]
Depending on how the ’s are chosen, the first factor in this expression might be negligible or not. However, we can simply decide to always impose that or is of order , which means choosing the average time it takes for one free particle to go through the system as a natural time scale. Without a great loss of generality, we therefore have
[TABLE]
and
[TABLE]
which is proportional to , and depends neither on nor on the ’s (apart from a trivial re-scaling).
This expression, and its scaling with respect to , are what we were mainly after, but it will also be informative to look at the corresponding stationary state.
III.3 Stationary state conditioned on a high current
As stated before, the state that we want to examine is the highest energy state of a free fermions system, so that it will not come as a big surprise that the probability of each configuration can be expressed as a Vandermonde determinant. Once more, we will only state the relevant results, and let the curious readers refer to section V B of Lazarescu2015 for the details of the calculations.
The un-normalised stationary probability of a configuration can be expressed, as we saw, as a product of the right and left eigenvectors corresponding to the eigenvalue . In the limit, the eigenvectors of do not depend on except up to a matrix similarity which leaves this product invariant, so that we will simply write it as . We have
[TABLE]
conditioned on a current . We find that this probability can be expressed as:
[TABLE]
where or is the occupancy of site , and . Note that all these probabilities are still un-normalised.
This distribution is that of a Dyson-Gaudin gas Gaudin1973 , which is a discrete version of the Coulomb gas, on a periodic lattice of size , with two defect sites (at [math] and ) that have no occupancy, and a reflection anti-symmetry between one side of the system and the other (fig.-3). The first (upper) part of the gas is given by the configuration we are considering, and the second (lower) is deduced by anti-symmetry. The interaction potential between two particles at angles and is then given by:
[TABLE]
An important feature of that state is the centred density correlation between two sites and , i.e.
[TABLE]
Using the probability distribution we just obtained, we can compute it to be given by
[TABLE]
The correlations are therefore exactly [math] for sites which are an even number of bonds apart (as is the case for a half-filled periodic chain Popkov2010 ), and behave as
[TABLE]
otherwise, if the two sites are far away enough from the boundaries. Note that those correlations do not vanish with the size of the system, in contrast with the steady state of the ASEP at , where they behave as in the maximal current phase and vanish exponentially in the high and low density phases Derrida1993a .
In the periodic case, it was shown in Popkov2010 that the large current limit of the steady state of the ASEP of size converges to a simple periodic Dyson-Gaudin gas (without defects or symmetry). The inhomogeneous interacting version of that model can be treated in the exact same way as we did here, yielding the same result.
We should note that the trick consisting in taking the sum of and its transpose to reconstruct an XX spin chain is not in fact necessary. All the results can be obtained, in a slightly different way, on directly, which has the added advantage that the imaginary part of the other eigenvalues is not lost Karevski2016 .
IV Low current limit
We now consider the limit.
This case is somewhat more subtle than the previous one. As we saw in section II.3, the centre of the Gallavotti-Cohen symmetry is at \mu^{\star}=-\frac{1}{2}\log{\Bigl{(}\frac{\prod_{i}p_{i}}{\prod_{i}q_{i}}\Bigr{)}}, and is also the point where , by symmetry. Trying to analyse the behaviour of around that point is, in general, not much simpler that the complete problem. However, as we remarked, if we consider the TASEP instead, where all the backward rates are set to [math], this point becomes , which greatly simplifies the problem, as we will shortly demonstrate.
The question then remains of the relevance of this special case. Notice that even without taking , the value of will go to in the large size limit, as long as the amplitude of the inhomogeneities is bounded (i.e. does not get arbitrarily small or large when goes to ), so that grows exponentially with . This suggests that the two limits might be consistent (first , then , or the reverse), but in no way proves it. We will come back on this assumption in the conclusion of this paper, but in all rigour the computations in this section apply only to totally asymmetric models.
We will first present the problem, and the tools appropriate for solving it. We will then look at the simple case of the TASEP (which can also be found in Lazarescu2015 but is reproduced here, in less detail, for pedagogical purposes), where everything can be done fairly explicitly. We will generalise the proof to interacting inhomogeneous systems, where the calculations cannot be done in detail, but where we can still obtain bounds on that will be sufficient for the main result to hold ; that subsection is particularly technical, and the reader interested in the results rather than the methods and proofs may jump directly to section IV.3.4. Finally, we will give a few illustrative examples of cases where the result holds (IV.4.1) or doesn’t (IV.4.2). The results and techniques presented in this section constitute the core of this paper, and our main new contribution to the study of dynamical phase transitions.
IV.1 Matrix perturbation and resolvant method
Our starting point here is the deformed Markov matrix for a totally asymmetric model, with all the deformations set to for simplicity, as in the previous section. In the limit , we have . We can then write the deformed Markov matrix as
[TABLE]
We need to extract the largest eigenvalue of this matrix. Unlike for , we cannot simply keep only the leading term, which is this time , because it is constant, and therefore not sufficient to give us the behaviour of through a Legendre transform. We will need to treat the non-diagonal part perturbatively in order to obtain the first correction to that constant term as well.
We will go into the details of how this can be obtained, but the reader familiar with perturbation theory may jump directly to the last paragraph of this sub-section, where these preliminary calculations are summarised.
The first step is to find the eigenspace of with the largest eigenvalue. Since is a diagonal matrix containing the opposite of the escape rate from every configuration (i.e. minus the sum of all out-going rates), that eigenspace will be the space of all configurations having the smallest possible escape rate. In other terms, the best states to be stuck into to produce an extremely small current are the ones that have the longest life-time. Let us write that space as , the configurations that live in it as , and the corresponding eigenvalue of as . Once we have that space, we need to find the largest eigenvalue of the perturbation restricted to it.
While that perturbative calculation can be performed without too much effort in the simplest cases (where the dominant eigenvalue of is not degenerate), we will need to cover all possible complications. This can be done in a very compact way using the so-called resolvent formalism Fredholm1903 . Not only is it a clean and systematic way to deal with perturbative expansions in degenerate spaces, but it also provides with a rigorous definition of an effective dynamics within that space.
This formalism can be stated as follows: for a diagonalisable matrix with eigenvalues and eigenvectors and , we may write
[TABLE]
where the sum is over the eigenvalues of which lie inside of the contour .
Since we are only interested in the eigenvalues of which are close to the dominant eigenvalue of , we can apply this formula to with a small contour around to get our effective biased Markov matrix, which we will write as (keeping the scalar term out for convenience). Shifting the origin of the complex plane to for simplicity, we get
[TABLE]
where is a small circle centred at [math]. We can now expand this expression in powers of :
[TABLE]
which is a sum over paths of length , with transitions given by and a configuration weight given by . We see that, if is small enough to contain only the poles of which are at [math], the only terms which contribute to the integral (i.e. that give first order poles which yield non-zero residues) are those for which is taken at at least twice. In other terms, they are the paths which go through at least two of the ’s.
Moreover, since we are only interested in the leading term in the largest eigenvalue of , the corrections to the eigenvectors will not be relevant, so that is equivalent to its projection onto . This means that we can restrict the expression (39) to paths that start and end in , from some to , accounting for the previous requirement.
Finally, notice that the weight of each of those paths is proportional to to the power of the number of jumps performed, with a pre-factor which is the ratio of all the jump rates used on that path over all the escape rates of the intermediate configurations minus . Since we are only interested in the leading order in , we only need to consider the paths with the least number of jumps between the initial and final configuration. The corresponding entry in will have a pre-factor, being the sum of the ones for each of the paths that have that minimal length, and which we can calculate in the simplest cases, but which will ultimately be inessential to our result.
Let us summarise this first section before getting into specific calculations. If has a highest eigenvalue with eigenspace generated by configurations , then the highest eigenvalue of is given by plus a first correction which is the largest eigenvalue of a matrix with entries , where is the smallest number of jumps connecting to , and is a numerical pre-factor which can be obtained from expression (39) although it will not be necessary.
IV.2 TASEP conditioned on low current
We first look at the simple case of the regular TASEP, with all bulk rates equal to , and boundary rates and . It is customary to consider and , which does not restrict the behaviour of the system by much. We will focus on those case and merely comment on the remaining ones, which are covered in all generality by section IV.3.
For this model, we can easily narrow down the set of candidates for : since all the transition rates are independent (i.e. the rate of a jump does not depend on which other jumps are possible), the escape rate of a state with several possible jumps is the sum of escape rates of states which have only one possible jump, and is therefore larger than each one of those. We only need to consider states with a single allowed jump, which is to say states that are completely full up to a given site and then completely empty.
We then see that there are three qualitatively different situations (fig.-4):
- •
if and , then the best configuration is empty ( for all ’s), with . If and , we have the same in reverse: the best configuration is full ( for all ’s) and (those two first cases are symmetric to one another, and we will only be considering the first one) ;
- •
if , then , and we have two competing configurations: empty or full ;
- •
if and , any configuration with a block of ’s followed by a block of [math]’s has an eigenvalue of , which is the highest, except possibly the completely full and empty configurations depending on whether those inequalities are strict or saturated ; all those situations being essentially identical in the large size limit, we will focus on , in which case there are states in .
We will now construct in each of these cases, and analyse its largest eigenvalue.
IV.2.1 Empty/full phases
This first case, where , is fairly straightforward: contains only the empty configuration, and has only one entry, equal to the weight of a path going from the empty configuration to itself. That is achieved in the least number of steps, which is , by having one single particle go through the whole system from left to right. The contribution of is therefore , and the numerical pre-factor is simply given by .
This gives us
[TABLE]
and
[TABLE]
Finally, note that in this case, the second largest eigenvalue is for (and corresponds to a completely full system), so that the gap between the first two eigenvalues of is finite and equal, at leading order, to .
The corresponding results for (the ‘full’ phase) can be obtained by exchanging with .
IV.2.2 Empty-full line
We now consider the slightly more complex case where . This time, there are two states with equal eigenvalues for :
[TABLE]
As in the previous case, the diagonal part of is given by the shortest paths going from these configurations to themselves, with the same weight as before (them being equal to each other because of the particle-hole symmetry). For the off-diagonal elements, we have to consider the shortest way to go from to , or the opposite. This means completely filling or emptying the system, which can be done in steps, but in this case the pre-factor contains contributions from many paths and is not straightforward to calculate. We therefore have something of the form
[TABLE]
where is said pre-factor.
From this, we see that the difference between the two highest eigenvalues is of order . For symmetry reasons, the main eigenvector is then , and the second one is . The leading terms of the largest eigenvalue are the same as before:
[TABLE]
and
[TABLE]
but this time, the gap behaves as .
IV.2.3 Anti-shock zone
For the last case, where all the jumping rates are equal (), we find states with an eigenvalue equal to for . Those states are given by , i.e. configurations made of a block of ’s followed by a block of [math]’s. Those are called anti-shocks, being symmetric to the usual shocks observed in the steady state of the TASEP for , which have a low density region followed by a high density one.
Using the resolvent formalism, we find:
[TABLE]
as well as terms of the type
[TABLE]
and so on. We can check those last terms to be of sub-leading order in , and we will neglect them right away, which will allow us to complete our calculation. A more rigorous approach will be taken in section IV.3.
We are left with
[TABLE]
This matrix is similar to
[TABLE]
and can be diagonalised exactly (see Lazarescu2015 for more details). This time the dominant contribution to the largest eigenvalue turns out to come from the non-diagonal part, and be equal to , which yields
[TABLE]
and
[TABLE]
In this case, the gap is equal to
[TABLE]
The cases where and/or are almost identical, with the first state and/or the last state removed, and show the same large-size behaviour.
IV.3 Interacting inhomogeneous TASEP
In this section, which contains the main new result of the present paper, we generalise the ones obtained for the standard TASEP to the inhomogeneous and interacting processes defined in section II. That result can be stated as follows: in the limit of small currents (), the first correction to the generating function of the cumulants of the current scales exponentially with , with a rate that does not vanish in the limit of large sizes (as it does in the high current limit):
[TABLE]
where can depend on but not .
The fact that it scales exponentially with simply comes from it being an eigenvalue of a finite matrix whose elements are powers of (with unimportant numerical pre-factors). The proof of the bound can then be achieved in three steps:
- •
we will first show that, under certain assumptions, the states in , which have the longest lifetime, are similar to anti-shocks, with a region transiting from completely full to completely empty which can have any occupancy but whose size cannot grow with ;
- •
we will then show that a simple cycle of such states takes a number of steps that is at least one order of larger than the number of states in the cycle ;
- •
finally, we will use that bound to estimate the principal minors of , in order to show that the leading power of in the largest eigenvalue of is linear in , which will complete the proof.
Note that all of the estimates we will be making are broad enough to account for the worst cases, but can certainly be made more precise for specific models.
IV.3.1 Longest-lived states
We will first show that all the states in are of the form with where is independent of , i.e. anti-shocks with a maximal width . In this expression, is the site of the first particle that can jump, and that of the last one (it is assumed that site is empty and that site is occupied, unless ).
In order to prove that first step, we need to make two assumptions:
- •
first, that the values of the inhomogeneous rates do not scale with the size of the system, i.e. that the values of are bounded on both sides for any value of :
[TABLE]
- •
secondly, that the potential is such that any difference of over a transition (i.e. if ) is bounded on both sides:
[TABLE]
and that it is local, in the sense that the potential difference for a jump between sites and doesn’t depend strongly on the part of the configuration that is far enough from the jump:
[TABLE]
where is the portion of that is between sites and , and the is there for later convenience ; these two conditions are for instance verified if is a short-range two-body potential, and are not verified for typical long-range potentials, but are less restrictive than that.
From assumptions (58) and (59), we get that the transition rates are bounded independently of :
[TABLE]
We can then prove the following crucial statement: any state with the minimal escape rate has at most \bigl{\lfloor}w_{max}/w_{min}\bigr{\rfloor} possible transitions. The proof is straightforward: if a longest-lived state has possible transitions, then, from the left side of the previous inequality, its escape rate is larger than ; from the right side of the inequality, it also has to be smaller than , because there are, for instance, states with only one transition whose escape rate is one single jump rate and which is smaller than that bound ; therefore
[TABLE]
Moreover, those transitions cannot be too far from each other, because of the locality assumption (60) which ensures that particles cannot stabilise each-other from afar. Consider for instance a configuration with two successive possible jumps involving particles at sites and such that . This means that the first of the two particles has more than holes in front of it, or that the second has more than particles behind it. Let us assume it is the former (but the latter can be treated in the exact same way). Consider also the configurations resulting from one of the jumps, from site to , the configuration which is the same as up to site and empty afterwards, and the configurations resulting from the jump from site to in (cf. fig.5). has at most possible jumps.
Choosing now for , we have, according to (60):
[TABLE]
because and are identical up to a distance from any transitions that they have in common. We can now compare the escape rates from and :
[TABLE]
The right-hand side is smaller than the escape rate from , since there is at least one transition to the right of . Therefore, it cannot be one of the longest-lived states, because has a strictly longer lifetime.
From this, we conclude that longest-lived states cannot have transitions that are further apart than with .
Combining this with the previous result, we find that all the transitions from a longest-lived state have to be within a region of size K=2l_{\alpha}\bigl{(}\bigl{\lfloor}w_{max}/w_{min}\bigr{\rfloor}-1\bigr{)}, which is independent of . All the sites to the left of that region have to be full, and all those to the right have to be empty. This concludes this first part of the proof.
IV.3.2 Length of simple cycles in
For this second step, we will be estimating the minimal number of jumps performed along a simple cycle of states from : , where no state is visited more than once. Let us write the total minimal number of steps along that cycle. As we saw in the previous section, the states in can be indexed by the position of the first jump, the position of the last jump (with ), and the configuration of the sites in between. We will also write the number of particles in a state as .
We want to show that there is a constant independent of such that .
Consider first a cycle of any length, however small. Since we are coming back to the initial state at the end, the total number of jumps has to be a multiple of (the jumps can be reordered so that every particle does an integral number of loops around the system before coming back to its initial position). Since at least one step is taken, we have then . This is true in particular for cycles of length , so that for any .
To obtain a bound on the number of steps of a larger cycle, we can first simplify the problem by noticing that any state is a finite number of steps away from the totally ordered state with as many particles, which we will call , with particless followed by holes (i.e. the state that we called for the simple TASEP). In the worst case, it takes steps to go from to , which happens if is even and if is of the form . Considering now the number of steps from to , we know that it is larger than that from to plus that from to plus that from to . Writing as the minimal number of steps from to , we have therefore that
[TABLE]
Moreover, depending on and , it is straightforward to obtain :
- •
if , then the last particles have to leave the system through the right boundary, which can be done with
[TABLE]
- •
if , then the last particles have to enter the system from the left boundary, which can be done with
[TABLE]
Note that in all cases, , so that
[TABLE]
It follows that, for a cycle with and , the total number of steps has to be larger than the direct path back and forth between two states that realise those extrema, so that
[TABLE]
One final remark to be made is that a certain value of can correspond to at most different states from . This can be seen by considering a state with , with the other particles being confined to the sites following the first hole at . There are at most such states, for from to , plus the state with , which all sum up to .
Consider now a cycle of length . From what we just saw, takes at least different values along the cycle, so that . Combining this with the previous inequality (69) gives
[TABLE]
which, for , finally gives
[TABLE]
with .
IV.3.3 Equivalent and bound for
We will now find a bound on the eigenvalues of using the bounds we have on the weight of its cycles. This can be done by looking at the characteristic polynomial of :
[TABLE]
where and . It is well known that can be expressed as a sum of the principal minors of size of , which is to say a sum of weights of all composite cycles from of total length . Each is therefore a polynomial in , the valuation (smallest exponent) of which we will write as . Let us also define
[TABLE]
where, as in the previous section, is the number of steps in a cycle of length . We have shown that .
Consider now the rescaled polynomial
[TABLE]
which has at least one finite coefficient (), all the others being infinitesimal in . The roots of this polynomial are therefore finite in the limit , and at least one of them is non-vanishing. It is not obvious that the highest root, which is the one we are interested in, is among those roots, as they could in principle be all negative. We will however see that it is the case here.
Consider the matrix where only the entries that contribute to the cycles that realise the minimum (73) are kept:
[TABLE]
where means that the transition is on at least one cycle such that . In particular, for transitions which can be done in the same number of steps through different paths, only the paths with the most intermediate states (the highest ) will be kept, which explains why some terms were sub-dominant in in section IV.2.3. By construction, the characteristic polynomial of is : we can obtain it by rescaling , taking to [math] so that only the finite terms survive, and finally taking it back to the original scaling. Consider also \hat{M}_{eff}=\tilde{M}_{eff}\big{|}_{\varepsilon=1}
[TABLE]
whose characteristic polynomial is therefore . Both and are diagonalisable, because all of their entries are on at least one cycle, and they have the same characteristic polynomial, from which we conclude that they are similar. In particular, the largest eigenvalue of is the same as that of , which is a positive matrix. It is therefore strictly positive.
Moreover, the matrix contains all the information relative to the effective process at low current. We will describe that process in two simple cases in section IV.4.1 (fig.8).
Going back to , we conclude that its largest eigenvalue scales as with
[TABLE]
for . This concludes our proof.
IV.3.4 Conclusion: asymptotics of the large deviation function of the currents
We conclude by re-stating the assumptions and the result from this section, and examining its consequence on the large deviation function of the current.
We have seen that, for a generalised TASEP with bounded inhomogeneities and a short-range potential, we can find a quantity independent of the size of the system such that
[TABLE]
for and . Moreover, is equivalent to the largest eigenvalue of a matrix who behaves as
[TABLE]
for with bounded as a function of . The states in , which are the ones involved in the dynamics in that limit, are anti-shocks, of the form with where the maximal width of the anti-shock is independent of
The statements and proofs did not require any specific form for the inhomogeneous rates and the potential, other than conditions (58-60), which means that there is no reason for to have a limit for . However, if is local and well-behaved and is slowly varying in space, we can expect that limit to exist.
Having then an equivalent of that form, we can deduce that
[TABLE]
which does not scale with the size of the system. This and the localised nature of the states in , which are the typical states occupied by the system at low current, are compatible with a hydrodynamic description of the fluctuating current Bodineau2006 ; Lazarescu2015 , where the cost of maintaining such a fluctuation in the system comes from one localised defect and hence is independent of .
IV.4 Illustrative examples
In this section, we give a few simple examples to illustrate our result beyond the excessively simple case of the TASEP. We start with a family of models with finite-range interaction for which the width of the anti-shock region can be tuned to any value, and the number of relevant states can be adjusted as well. We also explicitly build the effective dynamics in the simplest non-trivial case. Finally, to illustrate the necessity of having local interactions, we give an example of a system with long-range interactions where our result does not hold.
IV.4.1 Anti-shock regime for finite-range interactions
We will see here how we can easily build models with a prescribed maximal width of the anti-shock and various numbers of longest-lived states in .
We consider a homogeneous system, with except for and at the boundaries, and a symmetric two-bodies interaction which we will write as
[TABLE]
Note that we count every pair of sites only once. To make sure that is short-range, we will take for . With that notation, the transition rates from anti-shock states in the bulk of the system take a simple form, as seen on fig.6: for a configuration with domain walls at positions and at positions , the jump rate of the particle at position is given by
[TABLE]
By choosing , we get that every escape rate from states with anti-shocks of width less than is exactly , and all other escape rates are larger than (the boundary rates also need to be tuned to insure that, which is straightforward). This surprising identity can be easily checked on the examples shown in fig.6, and a formal proof can be found in appendix B. This choice of rates yields a number of longest-lived states of order . Considering, for instance, the cycle of states shown on fig.7, we see that (as defined in eq.(73)). The full effective process in this case is still quite complicated, so we will not go into more detail.
A simpler example is obtained by choosing and otherwise. In this case, only the anti-shocks of width [math] or of width with two possible jumps have an escape rate equal to , all the others being larger. We then have a number of longest-lived states of order , and an effective process with a structure given in fig.8. Note that, except for , only three types of anti-shocks end up contributing to the effective dynamics, as all the others only contribute to sub-dominant terms in . In all cases, we find that , so that and .
IV.4.2 Non-hydrodynamic behaviour for long-range interactions
We now exhibit an system with long-range interactions where our result is not valid, to illustrate the importance of that condition. We will construct that example step by step starting from the TASEP with , changing one element at a time in order to obtain a model where there is a cycle of longest-lived states which makes only steps in total.
The simplest such cycle that one could think of is that of one-particle states plus the empty state , where one particle enter from the left, jumps through the whole system, and exits from the right. The escape rates for those states in the case of the TASEP are from and , but from the other states. Moreover, all perfect (i.e. of width [math]) anti-shock states have an escape-rate of as well, which we don’t want (except for and ). We need to correct those two issues.
First, we increase the escape-rate of unwanted anti-shocks by adding a repulsive nearest-neighbour interaction
[TABLE]
with , so that particles leaving a neighbour behind do it with a rate , thus disqualifying those states. This has the adverse consequence of facilitating the first jumps from states and , giving them smaller escape-rates. We correct this by setting and , so that those states and now have an escape-rate larger than . At this stage, all states have an escape-rate of , except for which has an escape-rate of , and all other states have an escape-rate of or more.
It only remains for us to set the transition rate for to without modifying the other rates. That is equivalent to adding an interaction potential
[TABLE]
which is non-zero only for the empty state . It is clear that this term does not satisfy the locality condition (60), which allows the escape rate from to be the same as that from , even though the two possible jumps out of are as far away from each-other as possible and one of them is the same as the jump out of .
It follows from that cycle of states that , as defined in eq.(73), and that , so that
[TABLE]
The factor means that the probability of observing a small current scales with the size of the system, as expected. This can be seen clearly on numerical evaluations of for small system sizes. On fig.9, we plot that quantity for systems of various sizes and with or without the non-local interaction . As we can see, it is independent of even for small negative values of when is not introduced, and depends strongly on when it is.
V Illustration and discussion
In this final section, we illustrate our results with a variety of numerical plots, and discuss a possible physical interpretation of the dynamical transition in terms of maximum hydrodynamic current as well as a possible connexion to the KPZ universality class.
Considering the broad class of models that we have been looking at, constrained only by (58), (59) and (60), we need to somewhat restrict ourselves in order to obtain something meaningful in the large size limit (i.e. an identifiable phase transition). Above all, that limit itself has to make sense, which restricts the sequence of models of increasing size (or possibly the sequence of ensembles of models of increasing size) that we may consider. We first have to distinguish between disordered models, for which and/or might be drawn from a distribution, and models with fixed parameters. We then have to define and so that a large size limit can be taken, which might involve taking to be a discretisation of a fixed smooth function , and to be a combination of simple short- or finite-range body interactions, with perhaps a slow space-dependence.
In this section, we will only be considering (and conjecturing about) homogeneous systems with simple short-range potentials, of which the ASEP is the simplest example (and we will use it as a guide throughout, with all the related results taken from Lazarescu2015 ).
We start by simply looking at the qualitative behaviour of the spectrum of as a function of . For this, we choose a simple case where all , with next-to-nearest-neighbour interactions in the notation of section IV.4.1, and a system size . We plot, on fig.10, the real part of the eigenvalues of , where the rescaling is introduced so that those eigenvalues converge to a constant for rather than diverge, which makes the plot clearer. We also plot on fig.11 the rescaled complex spectrum of the same model with for a few values of .
As we can clearly see, the aspect of the spectrum is quite different between negative and positive values of . For , we have a Fermionic spectrum, where the eigenvalues are distributed according to the semi-circle law (as is clear on fig.11 ; the highest and lowest eigenvalues on fig.10 seem to be separated from the rest of the spectrum by a rather large gap, but this is due to the small system size and would not be the case in the large size limit). In the limit, on the other hand, we have a quasi-discrete real part of the spectrum where eigenvalues accumulate around specific values with a high degeneracy (this is helped by the fact that we chose an homogeneous system with a simple next-to-nearest-neighbour potential, in order to have a high degeneracy of escape rates even for a small size ; in the large size limit, we would observe such a high degeneracy even for more complex potentials and slowly-varying jump rates). The transition between these two regimes is clearly visible on fig.10 as an area dense with bifurcations and crossings, extending between and , and is an indication of the potential existence of a phase transition in the large size limit, even though for such a small size no sign of a non-analiticity can be yet observed for the highest eigenvalue . It is unclear how that area itself behaves in the large size limit, but it would be reasonable to expect that it converge to a single non-analiticity at under the proper rescaling.
That behaviour is similar to that of the open ASEP, as analysed in Bodineau2006 ; Lazarescu2015 . In the low current regime, we have a hydrodynamic phase where the large deviation function of the current is consistent with applying the macroscopic fluctuation theory (MFT, Bertini2007 ) or the additivity principle PhysRevLett.92.180601 to the continuous limit of the model, with a diffusion constant of order , which is to say that the long time large deviation function of the current and mean local density has the form
[TABLE]
with boundary conditions and and can be minimised over in order to obtain . In the case of the ASEP, we have that , and (this proportionality being a far-from-equilibrium version of Einstein’s relation, which might be specific to the ASEP and a few other models and is not to be expected in general). Moreover, the minimisation produces not only the most probable density , which is associated to through the Legendre transform of , but also a family of metastable states , which turn out to be related to the other eigenvalues in the same way (c.f. Lazarescu for more details). The structure of thus obtained is similar to what is observed on fig.10 for low enough. The first group of eigenvalues then account for the effective process discussed in section IV.1 at lowest order, and subsequent groups account for higher orders. Moreover, those eigenvalues undergo many first-order phase transitions as the boundary parameters and are varied.
In the high current regime, as we have seen, the behaviour of the ASEP is exactly the same as that of the inhomogeneous interacting versions that we have considered: we find a correlated free Fermion phase characterised by an average density of with long-range correlations, which is sometimes called a hyperuniform phase Jack2015 , and which can be described by a conformal field theory Karevski2016 .
Between those two regimes is a dynamical phase transition, which occurs when the fluctuating current goes through the critical value . That transition is of second order, and can be observed directly on the minimisation of eq.(86), since a value yields a minimum of order (the integrand is finite for every ) whereas for it is of order (the integrand is non-zero only on a fraction of order of ). However, the behaviour of for is wrongly predicted by the MFT as , whereas exact calculations using integrability methods Lazarescu2014 ; Lazarescu2015 yield instead a true scaling as .
The dynamical phase diagram of is shown on fig.12.a, as a function of the boundary parameters and . The purple surface corresponds to the hydrodynamic/correlated phase transition (the distinction between the light and dark purple zones will be made later). The region above, marked in blue, is the hydrodynamic phase corresponding to and contains a variety of first-order phase transitions between different typical states . The region below, marked in red, is the correlated phase, and does not contain other phase transitions as far as we know, although a precise description of the bulk of the phase is yet to be obtained.
In light of these similarities, we are led to give the same interpretation to the physical origin of the dynamical phase transition as in Lazarescu2015 for the ASEP: for low enough currents, the system behaves, in the large size limit, in accordance with a Langevin equation with conserved noise
[TABLE]
where is a Gaussian white noise. The so-called transport coefficients , and are in general very difficult to obtain from the microscopic process Arita2014 ; Arita2016 , but one important property of can be deduced from the exclusion property alone: it vanishes at and at , since those densities cannot sustain any current, which means that is bounded from above, with a maximum value .
For a fluctuating current smaller than that maximum (as well as for the steady state, which lies somewhere along ), there are hydrodynamic states which produce it through localised defects at a cost which doesn’t grow with . In the limit of very low currents, for instance, the typical densities are [math] and , and the localised defects in question are the anti-shocks that bridge those two densities.
For a current larger than that maximum, the best hydrodynamic states have a cost proportional to , but it becomes more efficient to introduce correlations everywhere in the system, producing a hyperuniform state which is optimised to produce a large current by forcing the particles and holes to alternate more than at random. The dynamical phase transition therefore corresponds to the appearance of correlations in the system when the current is pushed beyond its hydrodynamic regime.
This is illustrated schematically on fig.12.b, where a hypothetical is represented (as would appear for instance with nearest-neighbour interactions, in a KLS-type model PhysRevB.28.1655 ; Popkov1999 ), along with a trajectory obtained by varying from to for certain fixed boundary densities and (which is to say a vertical line from the diagram on fig.12.a): the blue curve represents the fluctuating hydrodynamic regime, from a completely empty state to a state (purple dot) sustaining the maximal hydrodynamic current (purple dashed line), passing through the typical stationary state (blue dot) at ; the red dotted curve corresponds to the correlated regime for currents larger than that maximum.
A more recent study Baek2016 makes a very similar conjecture for boundary-driven systems, consistent with ours: for systems obeying Einstein’s relation , one can observe a hydrodynamic behaviour for currents even far from equilibrium, as long as they are lower than the maximum of if that maximum exists. When is unbounded, as for some zero-range processes, no such dynamical phase transition can be found.
Finally, we should note that the hydrodynamic/correlated dynamical phase transition for the ASEP is closely related to the appearance of Tracy-Widom distributions in the statistics of the relaxation of towards stationarity, as seen in models from the KPZ universality class Spohn2016 ; Corwin2016 . The context there is quite different: the models are of infinite size and observed at long times, whereas we put ourselves at infinite time and then increase the size. However, it would make sense that both approaches be at least somewhat connected, and a first confirmation of this is the fact that the phase diagrams of small deviations around the typical currents are identical (although the current in question and the two parameters are not exactly the same in both cases).
More precisely, we compare:
- •
- the vicinity of the central slice (marked by a green dashed line) of fig.12.a, which corresponds to small fluctuations of the time-averaged stationary current of the open ASEP with boundary densities and , in the large size limit (we refer to section VI.B.5 of Lazarescu2015 for the names and full description of the phases) ;
- •
- fig.2 from BenArous2011 , which corresponds to the fluctuations of the time-integrated current across the middle bond of an ASEP on an infinite line with a product state initial condition with densities on the left () and on the right (), in the large time limit, on a certain time-scale (which is outside of the maximal current phase , and inside of it, including the boundaries).
The comparison is quite straightforward:
- •
the parts of 1) which do not sit at a phase transition, i.e. the LD phase and HD phase ), correspond to Gaussian fluctuations on a diffusive time-scale in 2) ;
- •
the line which corresponds to a first order phase transition between two states in 1), i.e. the S line , corresponds to the maximum of two Gaussian distributions on a diffusive time-scale in 2) ;
- •
most importantly, the parts of 1) which sit at the hydrodynamic/correlated dynamical phase transition, i.e. the MC phase and its boundaries, correspond to three different Tracy-Widom distributions on a subdiffusive time-scale in 2).
On this last point, in both cases, the special statistics of the current arise from the fact that it is much more difficult for the system to accommodate for currents higher than average than for lower ones. It is then natural to wonder which features of these statistics are universal, and which are model-dependent, or even parameter-dependent within the same model. Considering even the standard ASEP, the transition surface naturally splits in four areas, corresponding to the four sectors , of which we have mentioned only one so far, namely . It turns out that, because of a very special symmetry of the model (proven in appendix C, extending on a result from Torkaman2015 ), the sector has the same properties. On the other hand, the line (which is, mysteriously, equivalent to a half-filled periodic system derrida1999universal ), on the boundary of the MC phase, and the point , have slightly different properties (same exponents but different pre-factors and distributions), and we expect the sector and its symmetric to make for yet another universality subclass.
All that being said, we expect models with more complex hydrodynamic currents to all be in the same universality class. However, situations where several densities produce the same maximal current (for instance a version of fig.12.b with a symmetric ) would most probably show different behaviours in the appropriate regimes, and there is undoubtedly much more to be understood about the hydrodynamic/correlated dynamical phase transition. In particular, we should be able to find some correspondence between the exponents and pre-factors found around the transition in the stationary case and those obtained in the infinite volume case from the perspective of so-called third order phase transitions Majumdar2014 ; LeDoussal2016 , which we believe to be the long-time relaxation equivalent to our stationary dynamical phase transition.
VI Conclusion
In this paper, we have analysed the large deviations of the current in extreme limits for a very general class of models based on the TASEP, with inhomogeneities and short-range interactions. After defining the models and formalism relevant to our endeavour, we reduced the problem to that of obtaining the approximate behaviour of the largest eigenvalue of the Markov matrix deformed by a counting parameter , in the limits of .
In the limit, corresponding to a high current, the deformed Markov matrix is equivalent to a free Fermions Hamiltonian and depends only trivially on the disorder and interactions. We found that is proportional to the size of the system and that the typical states are Coulomb gases, with an average density equal to and strong nearest-neighbour anti-correlations, resulting in a correlated phase. In the limit, corresponding to a low current, the deformed Markov matrix is a high-order perturbation of a diagonal matrix. We were able to show that the typical states are of the anti-shock type, with a block of particles followed by a block of holes, separated by an area no larger than a certain constant, and in particular not growing with the system size . Moreover, was also shown not to scale with , consistently with being in a hydrodynamic phase. Those two very different limits, and in particular the different scaling of with respect to , indicating the possible existence of a dynamical phase transition in between.
We then looked at a specific model for illustration, with homogeneous rates and next-to-nearest-neighbour interactions. We saw how the transition between a hydrodynamic regime and a correlated one manifested itself on the whole spectrum of the deformed Markov matrix, and interpreted that transition in terms of pushing beyond the maximal hydrodynamic current at the price of introducing correlations in the system. We also showed a connexion between that dynamical phase transition and the appearance of Tracy-Widom distributions, as for all models in the KPZ universality class, in the infinite volume case.
These results are a quite encouraging step towards understanding the large size limit of interacting particle models far from equilibrium, especially because of the non-solvable nature of the models we have considered: the behaviour that we have described comes only from the geometric structure of the models (i.e. a lattice gas, with physical inhomogeneities and interactions), and not from a very special algebraic structure of the Markov matrix. This makes it likely that our methods could be applied for many other models with different components or geometries (it would for instance be quite straightforward to apply them to the totally asymmetric partial exclusion process, where the number of particle per site is not limited to one but to some integer Arita2014 , or to a multispecies TASEP Crampe2016 ; Crampe ). Of course, we were only able to perform calculations in extreme limits where the problem is greatly simplified, and it is more than likely that we will have to rely at least partly on numerics if we want to go further, for instance in describing the dynamical transition itself rather than the phases on each side of it. Luckily, a lot of progress is being made on the variety and effectiveness of the numerical methods available for that purpose Gorissen2009 ; giardina2011simulating ; Espigares2013 ; Nemoto2016 .
There is of course a lot more to be done on the subject. One of the outstanding problems in this context, for instance, is to obtain the hydrodynamic transport coefficients, as seen in eq.(87), from the microscopic dynamics of the systems, even in an equilibrium setting Arita2014 ; Arita2016 . Once those coefficients are known, and the MFT is assumed to be valid, one can analyse the full spectrum of the model in the bulk-driven case, and identify the effective dynamics to any order (this will be the subject of a future work Lazarescu ), as well as relate them to the relaxation paths and the pseudopotential which have already been studied Bahadoran2010 . On the other side of the transition, a good description of the correlated phase remains to be found except in the infinite current limit Karevski2016 , and in particular the appropriate order parameters have not been clearly identified, although it seems likely that they would consist of correlation functions (as the local density becomes irrelevant in that phase, and the correlations grow from [math] to a finite value). As for the transition itself, we have already mentioned that different situations will give rise to different pre-factors to the scaling of , related to different sub-classes within KPZ universality, and a precise classification of those does not exist yet as far as we know. Finally, we have focused on models with well-behaved jump rates and potentials, but our general result holds for disordered systems as well, although the consequences are more mysterious in that case and remain to be analysed. It is for instance unclear under which conditions we can expect a hydrodynamic phase to survive, as it does for dilute disorder Bahadoran2016 .
We conclude by examining in more detail a few of the natural extensions of our results as well as how they relate to other existing works.
- •
Partially asymmetric models: In order to take the limit through , we had to restrict ourselves to totally asymmetric models. It is quite likely that the phenomenology of partially asymmetric models would be exactly the same, as was shown for the ASEP in Lazarescu2015 , where the only difference is a simple factor which rescales the current (meaning that, by some miracle, Einstein’s relation is valid even far from equilibrium), although in general we would expect to depend on the backwards rates in a more complicated way. However, extending our method to that case does not seem easy: first of all, the expansion around now has to be done at a finite value of , which means that the deformed Markov matrix is not a perturbation of a diagonal matrix any more ; and secondly, the possibility of backward jumps makes eq.(39) much more complex, since every term now contains an infinity of paths that needs to be re-summed.
- •
Close-to-equilibrium systems: The case of boundary-driven or weakly bulk-driven systems is more significantly different. Unlike the asymmetric case, the MFT is rigorously proven for low currents (note that in that case, the stationary current is itself “low”, as it is of order if we measure it on one bond only, and it is the space-integrated current which converges to a finite value), but the typical states are not discontinuous, as there is no difference in scaling between the drift and diffusion terms in the MFT equation equivalent to eq.(87). On the other hand, the large current limit is exactly the same as here since the limit is identical. We therefore expect to find a dynamical phase transition, and we already mentioned consistent recent results for large fluctuating currents in close-to-equilibrium models Baek2016 . However, the nature of the transition might well be different : in periodic systems, a dynamical phase transition has been identified PhysRevE.72.066110 ; Bodineau2008 ; appert2008universal ; Simon2011 , in which travelling waves states seem to play an important role PhysRevE.72.066110 ; Espigares2013 , although that might be an effect of the total density constraint in periodic systems, as no such states are found in open systems Lecomte2010 ; Baek2016a . Moreover, the appropriate scaling for the current is not the same as in the bulk-driven case (i.e. the space-integrated current is finite, not the one-bond current), which means that the high current part of the large deviation function loses its scaling with respect to .
- •
Large deviations of the activity: Another quite natural extension would be to consider observables other than the current, such as for instance the dynamical activity, defined as a symmetric average of the number of jumps in both directions, rather than an antisymmetric one. Note that, unlike the current, the activity is not conserved throughout the system, and so every different weighting of the single bond activities is a different observable, the standard one being the uniform average (or unweighted sum). For totally asymmetric models, the current and average activity are one and the same, but they play different roles close to equilibrium (as the current appears explicitly in the MFT action, and the activity does not). The activity is known to undergo dynamical phase transitions for exclusion processes Bodineau2008 ; Lecomte2012 ; Jack2015 as well as for kinetically constrained models Garrahan2009 ; Nemoto2016 ; Nemoto2014 . For the former, the transition must be somewhat linked to that of the current, since they are identical in the totally asymmetric limit, although one should note that the choice of scaling of the observable (total activity or average activity, the latter being divided by the system size ) will have an effect on the aspect of the phase transition, which will appear to be of first order for the total activity, as it would for the total current Jack2015 ; 1751-8121-44-11-115005 . For more general models, or more general definitions of the activity, we expect our methods to be applicable, although perhaps not as straightforwardly as for the current. In the high activity limit, the deformed Markov matrix would be equivalent to an inhomogeneous XX spin chain, solvable in principle, and the same scaling would be found for the large deviation function. In the low activity limit, we would still have a perturbation around the diagonal, but with backward jumps allowed, which seems to result in terms being present in , meaning that the large deviation function of the activity would scale linearly in (excluding the constant part) as it does in the high activity regime. That would invalidate our scaling argument for the existence of a dynamical phase transition. A more careful analysis is in order.
- •
Higher dimensions: Finally, we might wonder if our methods can be extended to other geometries. The first step would be to consider the model on a tree, where it is known that the stationary density to current relation is consistent with a hydrodynamic behaviour at least in some cases Mottishaw2013 . We expect to be able to extend the low current method without major issues. In the high current limit, although the standard free Fermion techniques we used here are expected to fail, one might still be able to perform calculations using auxiliary spins at every fork, as done in Crampe2013 for a star graph. In the case of higher-dimensional regular lattices, things might be less straightforward, as there are more ways to be far for equilibrium than in one dimension: the system can be driven along loops rather than from one side to the opposite one. Moreover, the stationary current itself will be more complex than in one dimension, as the zero divergence condition allows for vortices in addition to a constant overall flux. In the relatively simple case of a system driven along one of the lattice directions, between reservoirs, with periodic or closed boundary conditions in the other directions, we expect our methods to be applicable but to produce highly degenerate dominant states at leading order, which then have to be separated by a perturbation to a higher order (especially in the high current limit, where the system essentially splits into disconnected one-dimensional chains at leading order). The low current limit can then be compared to the MFT approach, where a dynamical phase transition has already been found close to equilibrium Hurtado2011 .
Acknowledgements: I would like to thank M. Esposito and his group, as well as C. Maes, G. Schutz and D. Karevski, for interesting and useful discussions. I am grateful to R. Jack for helping me correct a mistake in eq.(80). This work was supported by the Interuniversity Attraction Pole - Phase VII/18 (Dynamics, Geometry and Statistical Physics) at KU Leuven and the AFR PDR 2014-2 Grant No. 9202381 at the University of Luxembourg.
Appendix A Asymptotics of the Legendre transform
In this appendix, we check that the Legendre transform of the asymptotic equivalent of is an asymptotic equivalent of the Legendre transform of the real .
Fist of all, we may note that this is far from being guaranteed. Consider for instance:
[TABLE]
for and , where means . The functions and are not equivalent to each other, even though their Legendre transforms are. The issue comes from not being algebraic.
Let us now focus on in particular. It is the largest eigenvalue of a matrix whose entries are algebraic functions of (either proportional or inversely proportional to it, or constant), so it is itself an algebraic function of . We need to consider both and .
Consider first (i.e. ) and examine the large asymptotics of : since it is algebraic, we have for some constants and . We also have that , and more importantly that the inverses of these functions are also algebraic, with for instance \bigl{[}xF^{\prime}(x)\bigr{]}^{-1}(y)=(y/A\alpha)^{1/\alpha}+\mathcal{O}(y^{1/\alpha-1}). The Legendre transform of is then
[TABLE]
so that
[TABLE]
yielding finally
[TABLE]
so that the Legendre transform of is indeed an equivalent of .
Consider now (i.e. ) and for some constants and . We also have that , and \bigl{[}xF^{\prime}(x)\bigr{]}^{-1}(y)=(y/A\alpha)^{1/\alpha}+\mathcal{O}(y^{1/\alpha+1}). The Legendre transform of is then
[TABLE]
so that
[TABLE]
yielding finally
[TABLE]
so that the Legendre transform of is indeed an equivalent of .
Appendix B Identity on escape rates for two-body interactions
In this appendix, we prove the claim made in section IV.4.1 that by choosing for and otherwise, all the escape rates of anti-shocks of width or less are equal to , and all escape rates from other states are larger.
Consider an integer , complex numbers and , and the complex function
[TABLE]
That function being rational, the sum of all its residues is [math]. It has poles at all the ’s and at infinity, and the residue at infinity is clearly , which yields:
[TABLE]
Consider now to be integers, corresponding to the positions of alternating and boundaries in an anti-shock state of width or less, as seen on fig.6. Every term in the sum then corresponds precisely to the jump rate of the particle at position , so that the whole sum is the escape rate from that configuration, equal to . Moreover, the escape rate from an anti-shock with a width larger than can be obtained from that expression by replacing every term or which is larger than in absolute value by , making the corresponding ratio, and hence the whole sum, strictly larger.
Appendix C Left-right duality for the biased ASEP
In this appendix, we derive a surprising symmetry for the large deviation function of the standard one-dimensional open ASEP. Consider once more the deformed Markov matrix
[TABLE]
with
[TABLE]
It was noticed in Torkaman2015 that for certain constrained boundary rates, which can be written as , , and , the deformed Markov matrix for was, up to a constant, the same as the non-deformed Markov matrix with and , which can be seen as either a left/right or a particle/hole transformation. We will see here that this identity can be extended to a symmetry of the whole deformed Markov matrix, and by extension of the cumulant generating function and the large deviation function . In order to do that, we will need to add , , and as parameters to our notations, i.e. for instance write the deformed Markov matrix as .
We now consider the following identities (remembering that we note the occupancy of site as ):
[TABLE]
Summing those identities, we see that all the deltas cancel one-another, and we are left with
[TABLE]
with (remember that ), which leads to the same identity for :
[TABLE]
Moreover, we have that
[TABLE]
The first equality is obtained through a left/right transformation , and the second through a particle/hole transformation . In both cases, the current is reversed, hence the . Finally, the Gallavotti-Cohen symmetry reads
[TABLE]
To see the consequence of those symmetries on the dynamical phase diagram (as shown on fig.12.a), we have to take the limit, which leads to two simplifications: first, the part of the phase diagram corresponding to , i.e. \mu\leq\frac{1}{2}\log\Bigl{(}\frac{q^{L+1}(1-\rho_{a})\rho_{b}}{p^{L+1}\rho_{a}(1-\rho_{b})}\Bigr{)}, is rejected to and disappears from the diagram ; secondly, as is remarked in section IV.C of Lazarescu2015 , the value of does not depend on all four of the boundary rates, but only on two combinations which are here precisely equal to and , meaning that the special case we considered is generic in the large size limit. We can therefore describe the symmetries of the full phase diagram by combining all the finite-size relations that we have just written, keeping only those for the forward ASEP ( in that order, which we can remove from the parameters) and for finite. The left/right particle/hole symmetry is well known and clearly visible on the diagram, but we also get a new and surprising symmetry which exchanges the two reservoirs:
[TABLE]
This symmetry exchanges the shock and anti-shock phases as seen on fig.20 of Lazarescu2015 , and moreover, since all those calculations can be done in much the same way directly on the deformed Markov matrices (which we did not do here simply for the sake of readability), we find that the typical profiles associated to shocks and anti-shocks are symmetric to one-another. In particular, the phase transitions shown in dark purple on fig.12.a are equivalent.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) L. Bertini, A. De Sole, D. Gabrielli, G. Jona-Lasinio and C. Landim. Stochastic interacting particle systems out of equilibrium . Journal of Statistical Mechanics: Theory and Experiment 2007(07) , P 07014 (2007).
- 2(2) T. Bodineau and B. Derrida. Current fluctuations in nonequilibrium diffusive systems: An additivity principle . Physical Review Letters 92(18) , 180601–1 (2004).
- 3(3) R. J. Harris, A. Rákos and G. M. Schütz. Current fluctuations in the zero-range process with open boundaries . Journal of Statistical Mechanics: Theory and Experiment 2005(08) , P 08003–P 08003 (2005).
- 4(4) P. Chleboun, S. Grosskinsky and A. Pizzoferrato. Current large deviations for zero-range processes on a ring (2016).
- 5(5) B. Derrida. An exactly soluble non-equilibrium system: The asymmetric simple exclusion process . Physics Reports 301(1-3) , 65–83 (1998).
- 6(6) T. Chou, K. Mallick and R. K. P. Zia. Paradigmatic Model To Biological Transport . Reports on Progress in Physics 74(11) , 116601 (2011).
- 7(7) H. Touchette. The large deviation approach to statistical mechanics . Physics Reports 478(1-3) , 1–69 (2009).
- 8(8) R. L. Jack and P. Sollich. Effective interactions and large deviations in stochastic processes . The European Physical Journal Special Topics 224(12) , 2351–2367 (2015).
