Density large deviations for multidimensional stochastic hyperbolic conservation laws
Julien Barr\'e (1), Cedric Bernardin (2), Rapha\"el Chetrite (2) ((1), MAPMO, IUF, (2) JAD)

TL;DR
This paper studies the probability of rare density fluctuations in multidimensional hyperbolic conservation laws, deriving explicit large deviation functions and analyzing the structure of optimal currents, especially when conductivity and diffusivity are not proportional.
Contribution
It provides explicit calculations of the large deviation function for step-like profiles and explores the structure of optimal currents without the proportionality assumption.
Findings
Explicit large deviation function for step-like density profiles.
Optimal current exhibits non-trivial structure when conductivity and diffusivity are not proportional.
Lower bound established for the large deviation function in general cases.
Abstract
We investigate the density large deviation function for a multidimensional conservation law in the vanishing viscosity limit, when the probability concentrates on weak solutions of a hyperbolic conservation law conservation law. When the conductivity and dif-fusivity matrices are proportional, i.e. an Einstein-like relation is satisfied, the problem has been solved in [4]. When this proportionality does not hold, we compute explicitly the large deviation function for a step-like density profile, and we show that the associated optimal current has a non trivial structure. We also derive a lower bound for the large deviation function, valid for a general weak solution, and leave the general large deviation function upper bound as a conjecture.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Density large deviations for multidimensional stochastic hyperbolic conservation laws
J. Barré
MAPMO - UMR CNRS 7349, Fédération Denis Poisson
Université d’Orléans, Collegium Sciences et Techniques
Bâtiment de mathématiques - Route de Chartres
B.P. 6759 - 45067 Orléans cedex 2 FRANCE
et Institut Universitaire de France
,
C.Bernardin
Université Côte d’Azur, CNRS, LJAD
Parc Valrose
06108 NICE Cedex 02, France
and
R. Chetrite
Université Côte d’Azur, CNRS, LJAD
Parc Valrose
06108 NICE Cedex 02, France
(Date: .)
Abstract.
We investigate the density large deviation function for a multidimensional conservation law in the vanishing viscosity limit, when the probability concentrates on weak solutions of a hyperbolic conservation law. When the mobility and diffusivity matrices are proportional, i.e. an Einstein-like relation is satisfied, the problem has been solved in [4]. When this proportionality does not hold, we compute explicitly the large deviation function for a step-like density profile, and we show that the associated optimal current has a non trivial structure. We also derive a lower bound for the large deviation function, valid for a general weak solution, and leave the general large deviation function upper bound as a conjecture.
Key words and phrases:
Large Deviations Principle, Stochastic conservation laws, Kawasaki dynamics, Active particles.
1. Introduction
Systems of interacting particles are often described on large scales by a partial differential equation. This PDE results in some sense from a Law of Large Number, and it is sometimes important to go beyond this level, and to include a description of the fluctuations around the most probable evolution [46], expressed by this PDE. Many studies have been devoted to this kind of description for driven diffusive systems, leading to what is called ‘Macroscopic Fluctuation Theory”, a cornerstone of modern out of equilibrium statistical physics (for a recent review, see [8]). By contrast, less is known for systems whose macroscopic description involves a transport equation, which is the main set up here. In this article, we approach this second set-up with the first one and for this we consider a generic multi-dimensional transport-diffusion system in the limit of small diffusion: the hydrodynamic for the density111Depending on the context has to be interpreted as a density, an energy … , , with , is described by the parabolic equation
[TABLE]
Here is a square symmetric matrix called diffusivity, is a -dimensional vector called hyperbolic flux. The parameter regulates the strength of the diffusion and will tend to infinity later on: in this limit, (1.1) becomes a scalar hyperbolic conservation law. The solution of (1.1) will be denoted by .
To take into account the fluctuations around this typical behavior we have to replace the previous PDE by the SPDE [34, 28, 46]
[TABLE]
Here is a space-time Gaussian white noise, is a symmetric matrix valued function called mobility and is the number of particles in the interacting particle system which tends to infinity. Examples of interacting particle systems, among many others, which are described by stochastic conservation laws (1.2) are the following :
- •
Driven Kawasaki exchange dynamics: These dynamics [32] are jump Markov processes with state space , a sub-lattice of . For a configuration we interpret as the presence of a particle at site and as its absence. The dynamics are such that the number of particles is locally conserved. They were first introduced as reversible dynamics w.r.t. the Gibbs measure with some Hamiltonian by imposing the corresponding detailed balance condition on the rates , , . Yet, adding some external constant and homogeneous electric field to such a system 222This means that the initial jump rates are modified into such that the “local detailed balance” , with the configuration obtained by exchanging with , is satisfied. results in a nonequilibrium stationary state with a non-zero average flux of particles, which moreover is usually not Gibbsian. If the rates are anisotropic then the matrices and are not proportional and long-range correlations are expected, despite the fact that the dynamics is only local. In the case of isotropic jump rates, the matrices and are proportional and the stationary state has short range correlations [25], [38],[9].
- •
Active particles: Different biological systems, from bacteria to flocks of mammals, are described by interacting particles, each one of them being self-propelled. The macroscopic description of such systems is in general more complicated than (1.1), including several PDEs [7, 17]; [6] provides an example where the finite size noise is kept in the final equations. Nevertheless, simplified models can fit exactly in the framework (1.2) [2]. For these systems, there is no reason that noise and diffusion satisfy an Einstein relation, and and are not proportional in general.
Due to the nonlinearity, the SPDE (1.2) is in general ill defined and need to be properly renormalized [16, 21, 27]. Here, a precise meaning is given by restricting to the small noise limit, and by interpreting (1.2) in the large deviation framework [40, 47, 23, 53, 8]. Let us fix a horizon time and define . The couple satisfies then a large deviation principle with speed on the time window . Moreover the Large Deviations Function (LDF) can be obtained formally as [40, 47, 23, 53, 8]
[TABLE]
if the constraint
[TABLE]
is satisfied and equal to infinity otherwise. Hereafter denotes the usual scalar product of . This LDF describes the cost to observe during a time window a density profile and a current profile for the underlying microscopic system.
We are interested333A natural problem to investigate is also the LDF for the current. See [41, 48, 49] for studies in this direction. in the density LDF which describes the cost to observe an atypical density profile over a time interval ; it is related to by a contraction principle [51, 20, 51], over the admissible currents :
[TABLE]
We will focus on the limiting form of as , i.e. for systems whose typical behavior is a scalar hyperbolic conservation law. Taking formally the limit , it is easy to convince oneself that the density LDF vanishes for any such that . Indeed, for such a , the choice fulfills the constraint equation (1.4) in the formal limit of the equation (1.1), and the integral appearing in the definition of vanishes in the limit. In other words, the probability concentrates on all weak solutions of the hyperbolic conservation law [39]. There are many such weak solutions, and from the point of view of fluctuations, this limit of the LDF misses some interesting physical properties. To go further we consider the limit of the scaled LDF: we look for a large deviation principle with speed , and define
[TABLE]
Clearly, is infinite if is not a weak solution of the conservation law. Our goal is to compute for such a weak solution. In full rigor, the above limit shall be understood in the sense of -convergence444 A sequence of functional defined on some topological space -converges to if 1) for any and any sequence , (-liminf inequality) and 2) there exists a sequence such that (-limsup inequality). which is the right notion to deal with convergence of variational problems [14].
As far as we know, this problem has been investigated mainly in the one dimensional case ([10] considers a specific example with concave; [5] treats the general case). The -dimensional case is solved in [4] under the restrictive hypothesis that the matrices and are proportional: we will see that far from being a technical condition, this is a fundamental hypothesis. Motivated in particular by active particles systems, the aim of this paper is to make progresses in the generic -dimensional case.
As explained above, we expect to be infinite if is not a weak solution (see Section 2.1) of the scalar conservation law Hence we take to be such a weak solution. Weak solutions are usually continuous functions apart from some codimension 1 time-space manifold . A point is classified as a shock or an anti-shock according to the fact that it dissipates or produces entropy 555Notice that the notion of entropy production discussed here is a PDE concept different from the corresponding physical concept. , see Section 2. In the one dimensional case, it has been proved [30, 52, 39] that only the anti-shocks give a contribution to , and that the entropy production has to be measured using the inverse of the susceptibility . Furthermore, these contributions add up, i.e. must be integrated along , and the infinitesimal contribution at can be obtained by approximating the weak solution by a moving step propagating in the normal direction to at .
Note that, since in the dimensional case we are dealing now with matrices, in general makes no sense, so that we have to introduce a new measure of entropy production. Our main result is to derive a formula (3.13) giving when the weak solution is a moving step. If the additivity principle proved in the one-dimensional case also holds in the multi-dimensional case, which is not obvious, we can deduce a formula (3.1) for .
The article is organized as follows: in Section 2, we gather the definitions and results on hyperbolic scalar conservation laws which will be needed later on; although this material is by no means original, it is not necessarily well-known to statistical physicists. Then, in Section 3, we derive our main results: i) a (quite formal) lower bound for the probability of observing a generic weak solution in terms of the entropy production in Section 3.1; ii) a lower and upper bound for the probability to observe a given moving step function, which coincides in this particular case with the previous lower bound, in Section 3.3; iii) the most probable current associated to this moving step function in Section 3.3.1: contrary to the case when an Einstein relation holds, it has a non trivial structure. The upper bound for a generic weak solution, which, together with point i), would give the probability to observe a generic weak solution, is left as a conjecture: it depends on the validity of an additive principle, which we state explicitly in Section 3.2.3. We emphasize the differences with the case where an Einstein relation holds. Some technical points are detailed in the appendices.
Finally, we note that there has been a number of studies on “stochastic scalar conservation laws” (see for instance [33, 22, 15]); in these references, the noise term is non conservative, and the questions addressed are quite different.
In the rest of the paper we only consider the case for simplicity but our results extend in the multidimensional case.
2. Solutions of hyperbolic conservation laws: Formulation, Lyapounov functions and Entropy production
The aim of this section is to present the mathematical concepts of weak and entropic solutions of a scalar conservation law and to explain the relevance of these notions from a physical viewpoint.
2.1. Generalities and Entropy production
Consider the -scalar conservation law
[TABLE]
Since this equation does not admit classical (smooth) solutions 666In general, smooth solutions only exist for a short time and develop shocks. For particular initial conditions, smooth solutions exist but this is quite exceptional., it has to be interpreted in a weak sense, i.e. by integrating w.r.t. smooth test functions , , and transporting the partial derivatives on the test function by a formal integration by parts:
[TABLE]
Therefore we say that a function is a weak solution of (2.1) if it satisfies (2.2) for any smooth test function . But since there exist several weak solutions to select the “physical” one we shall impose an extra condition to weak solution to restore uniqueness. Here “physical” means that it can be derived by a space-time coarse graining procedure from the underlying microscopic model as the typical macroscopic profile observed in a suitable time scale.
In the PDE’s literature, a scalar function on is called “entropy” for (2.1) if (2.1) is compatible with an extra conservation law in the form
[TABLE]
In the context of systems of conservation laws, entropies are quite difficult to obtain but in the scalar case, any function is an entropy. Indeed, it is sufficient to define , which is a vector-valued function on , such that . The vector is called the conjugated entropy flux to the entropy .
A weak solution is called entropic if for each entropy-entropy flux pair with convex, the inequality
[TABLE]
holds in the sense of distributions, i.e. for any positive test function
[TABLE]
Observe that if is a smooth (classical) solution then the previous inequality becomes an equality. Existence and uniqueness of an entropic solution has been proved under generic conditions [13, 45]. Thus in this sense the entropic solution is the only weak solution dissipating entropy.
The relevance of the entropic solution for an asymmetric microscopic system with one conservation law is that among all the weak solutions, this is the solution which describes the typical behavior of the microscopic system. This has been rigorously proved for only few asymmetric Kawasaki exchange dynamics in dimension such that the Asymmetric Exclusion Process ([44]). Weak solutions are usually considered as irrelevant but as proved in [30, 52, 5] they play a special role to understand the fluctuations of the density at the macroscopic level.
We extend the notion of entropy-entropy flux pair by defining an entropy sampler and its conjugated entropy flux sampler as scalar and vector-valued functions such that for any , is an entropy-entropy flux pair. This means that for all :
[TABLE]
where ′ denotes the derivative with respect to the argument. An example is the factorized case , where and is an arbitrary smooth function. Then the -sampled entropy production 777Observe that the notion of entropy production introduced here is a PDE concept which is quite different from the corresponding “physical” concept [31, 18, 37, 24]. In particular, following the mathematical convention, entropy typically increases. on the time interval of a function , weak solution of (2.1), is defined as the real number with non-definite sign
[TABLE]
where the gradient with respect to the third (space) variable of . -sampled entropy production will play an important role in the large deviation function for the density studied in the next sections.
2.2. Viscosity solution
The entropic solution of the hyperbolic conservation law (2.1) is denoted by . It can be obtained as the vanishing viscosity limit of the smooth solution of (see [13, 45])
[TABLE]
where is a uniformly elliptic matrix-valued function, which means that for any density . In particular, under the ellipticity assumption on , the solution of (1.1), converges to the entropic solution .
The vanishing viscosity approach also explains the inequality (2.3) since (2.3) holds for up to a term vanishing in the limit; the inequality will thus persist in this limit.
2.3. Quasi-potential and Lyapounov function
For (resp. ), the quasi-potential (resp. ) associated to the dynamical LDF (resp. ) is defined by
[TABLE]
where is a time independent density profile and the infimum is carried on the set of time-space density profiles such that and , with the stationary profile of (1.1) (resp. (2.1)). The quasi-potential is the LDF of the empirical density in the stationary state of the interacting particle system whose macroscopic behavior is described by (1.1) (resp. (2.1)):
[TABLE]
The quasi-potential is usually called entropy or free energy in the physics literature. Since converges to we have that
[TABLE]
For , the quasi-potential is solution to the Hamilton-Jacobi equation [26, 23, 8]
[TABLE]
It follows that is a Lyapounov function for the parabolic equation (1.1). Indeed we have that if is solution of (1.1), then
[TABLE]
We recall that we denote by the entropic solution of the conservation law (2.1) (or equivalently of (1.1) with ). It follows from (2.6) and from the fact that that is a Lyapounov function for the entropic solution :
[TABLE]
The latter fact can be seen as a form of the second principle. For real physical systems like a gas, described by the laws of the classical mechanics, the scalar conservation law would be replaced by a system of conservation laws (e.g. Euler equations) and the quasipotential, i.e. the entropy, would furnish a non-trivial Lyapounov function of the system.
The computation of the quasipotential ( or ) is of high interest but usually very difficult to perform. A particular case where an explicit formula is available is when the Einstein relation holds (see [9]):
[TABLE]
It turns out that in this case the quasipotential is local and take a simple form (see below). If (ER) does not hold, the functional is non local, i.e. it cannot be written in the form
[TABLE]
for a suitable function . Indeed, assume that takes this form then by (2.7) we have
[TABLE]
Since there exists a vector valued function such that the RHS of the previous expression is equal to [math]. Since the equality is valid for any profile we have that . Therefore a necessary and sufficient condition to have a local quasi-potential is that (ER) holds. For Kawasaki dynamics (ER) is satisfied if the dynamics is isotropic but usually not for anisotropic ones. In the context of active particles, (ER) is also rarely satisfied.
It is important to notice that the scalar conservation law (2.1) describes only the typical behavior of the microscopic system by forgetting many details of the dynamics. Therefore it describes a priori many different microscopic systems. The quasipotential retains more information of the underlying microscopic dynamics and may be different for microscopic systems whose typical behavior is described by the entropic solution of the same scalar conservation law.
Let us also notice that in one dimension, the quasipotential associated to a hyperbolic conservation law with Dirichlet boundary conditions has been investigated in [1].
2.4. Kinetic formulation and associated Entropy production
A kinetic formulation of the PDE theory of hyperbolic conservation laws has been proposed and developed since [12, 43, 36, 42]. This interpretation will be useful in the sequel. We introduce an auxiliary variable and we define . Then is a weak solution if, in the sense of distributions, is solution of
[TABLE]
for some locally finite measure in the form . To see this, just integrate (2.8) with respect to (the RHS becomes [math]) and observe that . In this picture we can imagine that the variable plays the role of an (artificial) velocity and the measure plays the role of the collision term in Boltzmann equation.
Entropy production has a nice form within the kinetic formulation. Assume first that so that with be the entropy flux associated to and a smooth test function. Then, by multiplying (2.8) by , integrating in and using that , we get
[TABLE]
In particular (2.9) shows that an entropic solution is such that is a negative measure for any . It also shows that
[TABLE]
for an entropy sampler in the form . Since a generic entropy sampler can be approximated by a sequence of linear combinations of entropy samplers in the previous form, (2.10) is valid for any entropy sampler .
2.5. Explicit formula
Our aim is now to give a more explicit formula for and thus for . Let be a weak solution and denote by its jumps set that we assume to be such that for any , is a smooth curve 888The regularity of the jump sets of weak solutions is studied in [35]. It is not smooth in general but is sufficiently regular to define almost everywhere a unit normal to the jumps set. . Then we have that is a -dimensional manifold parameterized by . Let be the two connected components of . A unit normal vector to at is
[TABLE]
with a normalization factor. For any let be the limit of as . Observe that is regular in and discontinuous on . See Figure 1. An integration by parts and Green’s theorem shows that if is a smooth function vanishing at time [math] and time ,
[TABLE]
where is the Lebesgue measure on and , . Since is regular in the two first integrals on the RHS of the last equality of (2.11) are zero and we conclude by (2.9) that, as space-time measures,
[TABLE]
Remark that up to now we did not use any convexity property of and that the choice is valid. Since is a weak solution, this choice implies that the LHS of (2.11) is then [math] and this gives the Rankine-Hugoniot condition:
[TABLE]
on . Now we write
[TABLE]
and
[TABLE]
We plug this in (2.12) and use the Rankine-Hugoniot condition (2.13) to get
[TABLE]
Since this is true for any sufficiently regular we get (see Appendix A)
[TABLE]
with
[TABLE]
where , is defined by
[TABLE]
Observe that \|{\bf n}^{\mathbf{x}}\|d\gamma_{J}=\|\big{[}\tfrac{ds_{t}}{d\alpha}\big{]}^{\perp}\|d\alpha\,dt=\|\tfrac{ds_{t}}{d\alpha}\|d\alpha\,dt=dt\,ds_{t} where is the Lebesgue measure on the curve .
2.6. Entropy splittable solution, shocks and anti shocks
From (2.15), we see that the measure is concentrated on the jump set of the weak solution . For a general weak solution we denote by
[TABLE]
the (time-space) positive and negative parts of the measure and by their support. Observe that roughly is the set of for which entropy is produced (resp. dissipated) by the weak solution when .
Let us first state precise definitions of shocks and antishocks:
Definition 2.1*.*
Let be a weak solution. A point is said to be a shock if it always dissipate entropy:
[TABLE]
it is said to be an antishock if it may produce entropy:
[TABLE]
Observe that a point may belong to and at same time belong to for some . Following [5], we now introduce a special class of weak solutions for which this does not happen.
Definition 2.2*.*
A weak solution is said to be entropy splittable if for any discontinuity, characterized by and a local unit vector , the quantity has a constant sign when varies in . In other words,
[TABLE]
The entropy splittable weak solutions will play a technical role later on. Entropy splittable solutions only have shocks and “perfect antishocks”, that always produce entropy (ie points ).
Remark 2.3*.*
In dimension , if the flux function is convex or concave, any weak solution is entropy splittable.
3. Main Results: Density Large Deviation Function
We turn now to the main object of our study, the large deviation function for a density profile . The density LDF of the considered weakly drifted interacting particle system is given by
[TABLE]
where the infimum is carried over currents satisfying the constraint
[TABLE]
We are interested in the behavior of as . It is conjectured that converges999More exactly, -converges. to a functional which is the LDF of the empirical density for the strongly drifted underlying system of interacting particles.
In this section we argue that in the -case, the functional is
[TABLE]
where is the positive part of the function which is defined by (2.17) and is the Lebesgue measure on . The notations are those of Figure 1 and Section 2.5.
3.1. Large deviation function lower bounds in terms of the entropy production
The aim of this section is to prove the following lower bound for
[TABLE]
In order to prove it we first show a non-optimal lower bound whose derivation will be however useful for our purpose. This non-optimal lower bound is the following: for “any” weak solution of the scalar conservation (2.1)
[TABLE]
We recall that the entropy production has been defined in (2.10) and we denote by the set of convex entropy samplers (, where ′ denotes the derivative with respect to the first argument) satisfying the relaxed Einstein condition
[TABLE]
for any .
More precisely, by definition of the -convergence (see footnote 4), in order to show this lower bound we have to prove that
[TABLE]
for any sequence of smooth functions converging to the weak solution in a suitable topology. This proof is a simple extension 101010We thank C. Bahadoran for having brought this to our attention. of the proofs of Theorem 2.5, (i) of [5] or Theorem 2.1 in [4]. To prove (3.5) we will need the following dual variational characterization of which is proved in Appendix D:
[TABLE]
with
[TABLE]
Let be an entropy- entropy flux couple sampler, and let (we recall that ′ denotes the derivative with respect to the argument and the gradient with respect to the space variable). Then
[TABLE]
Thus
[TABLE]
Apply first this inequality in the particular case and for some constant sufficiently small to ensure that . Then, the last RHS of (3.7) contains only one non zero term and we get
[TABLE]
Therefore we can always assume (otherwise extract a subsequence) that the approximation satisfies
[TABLE]
Now, the last three terms in (3.7) vanish in the limit, because they contain strictly less than two (apply Cauchy-Schwarz inequality and use the previous bound). The first term in the RHS of the last equality of (3.7) is exactly , which tends to . Furthermore, if is chosen such that
[TABLE]
then the second term is positive. One possibility is to choose convex (w.r.t. ), and satisfying (3.4); this leads to the bound (3.5).
Let us now explain why this lower bound is not optimal and how it can be improved. Let us recall that around any , the weak solution looks like a step function propagating in the direction of the unit vector with a velocity prescribed by the Rankine-Hugoniot condition (see Figure 1). Moreover is actually parallel to , up to subdominant terms. Thus, if we repeat the previous computations but with a convex entropy samplers such that
[TABLE]
we will still have (because is almost parallel to )
[TABLE]
Let be the set of convex entropy samplers such that for any the inequality (3.9) is satisfied. We deduce that
[TABLE]
We now choose if (an ”entropic point” does not cost anything), and
[TABLE]
Such a choice may violate the regularity requirements for an entropy sampler; in this case, it is necessary to consider a regularization of the above choice, introducing serious mathematical complications that we disregard. Using (2.10) and (2.18), we see that the RHS of (3.10) coincides with the RHS of (3.2), which is then proved. Note that since is in general a larger set than , the lower bound (3.3) is in general not optimal.
3.2. The one-dimensional generalized Jensen-Varadhan functional and the additive principle
We have proved in the previous section the lower bound part of the conjecture (3.1). We now review the results obtained in the one-dimensional case in [30, 52] (for a particular flux) and in [5] (for a general non convex hyperbolic flux). This will further substantiate conjecture (3.1), and emphasize what is missing to prove it.
3.2.1. The one-dimensional generalized Jensen-Varadhan functional
In [5] it is proved rigorously111111In fact, in [5], the -convergence of to is only proved for “entropy splittable” weak solutions and the extension to generic weak solution would require a density argument which appears as a very difficult problem (see the comments after Theorem 2.5 there). that conjecture (3.1), simplified according to the fact that and are scalars, is correct; hence, with
[TABLE]
Notice that (3.11) is precisely the one dimensional equivalent of (3.1) when and are scalars. This can be seen by replacing by its explicit expression given by the one dimensional versions of (2.15),(A.1),(2.17). In particular, implicitly contains an integral over the jump set of the weak solution .
We call this functional the generalized Jensen-Varadhan functional. It is important to notice that the only points which contribute in are points in which produce entropy, i.e. the anti-shocks. The integral formula (3.11) shows that all the contributions of anti-shocks simply add up: we now make this remark more precise.
3.2.2. Case of a step function in one dimension
First, consider the particular case where is a step function separated by a discontinuity moving at velocity and taking the values for , for . We assume and let . Observe that the velocity has to be equal to to ensure that is a weak solution (Rankine-Hugoniot condition, see (2.13) ). We have where is the position of the discontinuity at time . Let be the one-dimensional version of the function introduced in (2.17). If is convex or concave on then has a constant sign, i.e. is entropy splittable, so that the discontinuity corresponds either to a shock or to a “perfect” antishock in the sense that it belongs to . Then by (2.15), (3.11) becomes (see also Remark 2.7 in [5])
[TABLE]
if is not an entropic solution and [math] if it is. Here is the length of the discontinuity set on . As noticed in [10], the term coincides with the cost to produce a time averaged current equal to in the large limit. In the case where the flux is neither convex, nor concave, the function may change sign on the interval ; in this case, the discontinuity may be an anti-shock but it is not a “perfect antishock”: the solution is not “entropy splittable”. Then, the term in (3.12) has to be replaced by [39]. From a mathematical point of view, all this has been rigorously derived by a LDP ([52]) for the asymmetric simple exclusion process, which corresponds to the case and .
3.2.3. Additive principle
It is interesting to notice that the formula (3.12) is in fact sufficient to recover (3.11) by assuming a (space-time) additive principle [11]:
Additive principle:* For a generic weak solution only antishocks contribute to and simply add up. Moreover, the contribution of each antishock can be evaluated by approximating locally the weak solution by a moving step function.*
In the -case, it is tempting to follow this route: solve the problem for a simple step profile; then use the additive principle to obtain an expression for for a general profile. This is our aim in the following section.
3.3. Case of a moving step function
We consider a moving step profile between and moving in some arbitrary direction , , with some velocity . The RHS of (3.1) is then infinite since the jumps set of is a strip of and that the RHS is roughly extensive in the area of the jump set. Therefore our aim will consist to evaluate
[TABLE]
where to define we replace in (1.5) the set by the set . We then show that the previous limit is equal to
[TABLE]
To simplify the notation we omit the index .
The lower bound does not follow directly from (3.1), since the RHS of (3.1) is infinite for a step function; we then actually have to repeat the arguments of Section 3.1, using samplers supported in , see Section 3.3.3. Also, serious mathematical difficulties have been disregarded in Section 3.1; by contrast, the arguments presented below for a step function in Section 3.3.3 are essentially rigorous. We first derive the upper bound121212In the sense of the -convergence, see footnote 4. by obtaining a smooth approximation of such that
[TABLE]
To be more precise, following the strategy of [5], we first prove the upper bound of (3.13) for an “entropy splittable” moving step. We then argue (in a not fully rigorous manner) that this upper bound holds for general moving steps.
For an entropic moving step, (3.13) is obviously valid, since both LHS and RHS vanish. For a non entropic entropy splittable moving step, the RHS of (3.13) can be rewritten
[TABLE]
This is the upper bound we shall show in the next section.
3.3.1. Upper bound for an entropy splittable step
A weak solution of (2.1) in the form of a moving step profile between and moving in some arbitrary direction , , with velocity , is given, and we want to approximate it in an optimal way. Let us look for a traveling wave propagating with velocity in the direction given by
[TABLE]
solution of
[TABLE]
on with boundary conditions
[TABLE]
A simple computation shows that shall satisfy for any that
[TABLE]
for some constant . Since this equation is invariant by for any constant , in order to fix uniquely , we impose that . We then consider the restriction of on , and introduce as the time reversal of :
[TABLE]
We have that converges to an entropic solution with a step function with a shock located at :
[TABLE]
Then converges as to a non-entropic moving step profile . Observe that the anti-shock of is present in at time when
[TABLE]
Otherwise it is not. By (3.16) and (3.15), we have that
[TABLE]
The second condition is nothing but the Rankine-Hugoniot condition that we could have assumed ab initio. Since
[TABLE]
any such that is in the form
[TABLE]
for some function . We have then
[TABLE]
where we recall that . The optimal is a solution131313Two different solutions differ by a function of time which is irrelevant in the variational formula. of the PDE
[TABLE]
The solution is in the form of a traveling wave
[TABLE]
where shall satisfy
[TABLE]
It follows that
[TABLE]
where is a constant that has to be optimized. To simplify notation we write for . We get then
[TABLE]
As shown in Appendix C, optimizing over yields . Therefore
[TABLE]
where
[TABLE]
The limit of the RHS of this expression as and then is postponed to Appendix C. We obtain then
[TABLE]
This proves the upper bound (3.13) for an entropy splittable moving step. It is important to remark that the term which contains in the optimal current (3.21) does not vanish in general: it gives a non trivial contribution along the shock. However, it does vanish in the special case when and are proportional, as is clear from (3.26) using that is a scalar. This is a qualitative difference between the cases when the Einstein relation is satisfied and when it is not.
3.3.2. Upper bound for a general moving step
We have shown the upper bound (3.13), or equivalently (3.14), for an entropy splittable moving step. We now argue that (3.13) is valid for a general moving step. Let be a moving step between and , with direction given by a unit vector , which is not entropy splittable, but can be “decomposed” into one totally anti-entropic step, between and , and one entropic step, between and . This means that there exists a unique such that and
[TABLE]
It implies in particular that
[TABLE]
Note that the Rankine-Hugoniot condition together with (3.31) impose that the velocity of the steps , , and are all equal. As shown in the appendix B, it is then relatively straightforward to construct a profile which approximates the non entropic moving step, and such that
[TABLE]
where denotes the (entropy splittable) step function between and propagating at velocity in the direction . Here the limsup refers to the ordered limits , , . The equality between and is a consequence of (3.30).This proves the limsup bound for the non entropy splittable step . Clearly, this extends to any step such that the segment can be decomposed into entropic and totally anti-entropic intervals.
3.3.3. Lower bound for a moving step
In order to get a lower bound for we have to compute
[TABLE]
for a moving step function, with direction , , and velocity , i.e.
[TABLE]
Here is the set of entropy samplers vanishing outside of (see Section 3.1 for a definition of ). Exploiting that for any in the discontinuity set and that we have
[TABLE]
It follows that
[TABLE]
The inequalities above become equalities with the choice
[TABLE]
which is possible in ; notice that such a choice is not necessarily admissible in , hence the lower bound (3.5) may not be strong enough. Therefore we have that
[TABLE]
It follows that for a moving step function propagating in the direction we have
[TABLE]
where is the area of the discontinuity set of the weak solution in . Taking the limit this proves the formula for the lower bound of .
Finally, this result together with that of Section 3.3.2 shows the announced formula for a general moving step.
3.4. The upper bound in (3.1) and the additive principle in two dimensions
Now that we have proved the upper bound for all moving step functions, we imagine we can approximate a general weak solution as a superposition of local moving steps. Using the additive principle of Section 3.2.3, we can deduce the upper bound in (3.1) for a general weak solution.
Nevertheless, there is a potential difficulty here. The key point is that the optimal current (3.21) associated to a step has a trivial part (which is the flux in the large limit), plus a non trivial part which essentially vanishes except close to the step. Thus, it is possible that there is no interference between different discontinuities, and that the costs simply add up. This picture seems correct for two steps propagating in the same direction at the same speed (Section 3.3.2); nevertheless, since the non trivial part of the optimal current does not vanish along the step (see (3.24) and (3.26)), it is a bit less clear for more general discontinuities, where everything (shock direction, height, velocity…) varies continuously.
Acknowledgements
We acknowledge very useful discussions with C. Bahadoran, T. Bodineau, M. Mariani and C. Nardini. This work has been supported by the Brazilian-French Network in Mathematics and the project EDNHS ANR-14-CE25-0011 of the French National Research Agency (ANR) and the project LSD ANR-15-CE40-0020-01 LSD of the French National Research Agency (ANR). This research was supported in part by the International Centre for Theoretical Sciences (ICTS) during a visit for participating in the program Non-equilibrium statistical physics (Code: ICTS/Prog-NESP/2015/10).
Appendix A Proof of (A.1)
[TABLE]
where , is defined by (2.17).
Appendix B Approximating a non entropy splittable moving step
The goal is here to provide an approximating profile for a non entropy splittable moving step as in Section 3.3.2, and compute its cost. We introduce the approximating profiles for each one of the smaller steps , , as traveling waves , solutions of the equations
[TABLE]
with boundary conditions
[TABLE]
[TABLE]
We choose the such that the strong gradient of is around . For , the are exponentially close to their asymptotic value. To be more precise, we have for and which tends to infinity:
[TABLE]
for some positive constant of order . A similar exponential estimate holds for negative .
We also introduce a new small parameter , and the shifted approximating profiles:
[TABLE]
Let be a step function increasing from [math] to around , regularized at scale , with compact support in ; we take such that . We write now an approximation for the whole step:
[TABLE]
If is much larger than , is thus a ”double step”, see Figure 2, which varies at scale around , and around . In the following, we shall always bear in mind this ordering: .
Notice that (resp. ) when (resp. ). For , and are both very close to , that is exponentially in : these are the tails of the traveling waves profiles. We have that
[TABLE]
Making use of the fact that , we see that between the two sub-steps, that is for , is exponentially small in . For (resp. ), reduces to the contribution coming from (resp. ). To summarize
[TABLE]
Similarly, we have that
[TABLE]
We can now estimate , starting from the expression
[TABLE]
with the constraint . We take and want to construct the optimal . Using (B.1), the constraint becomes:
[TABLE]
The idea is now to choose
[TABLE]
where is the optimal non trivial contribution to the current computed in Section 3.3.1 for the totally anti-entropic moving step, shifted by . The vector field contains the gradient-like part , plus a rotational part. Notice that both and vanish exponentially in the intermediate region ; hence as constructed above can be chosen to be smooth. Inserting in (B.3), the part contributes the cost of the totally anti-entropic step (up to small terms), and contributes only small terms (that is essentially the cost of the entropic step). Then, by denoting the (entropy splittable) step function between and propagating at velocity in the direction , we get
[TABLE]
where the limsup refers to the ordered limits , and then . This proves the limsup bound for this simple non entropy splittable step.
Appendix C Proof of (3.29)
We first show that the choice is optimal. Introducing the notation , we notice that
[TABLE]
Hence
[TABLE]
When (3.27) is introduced into (3.22), the term which is linear in contains precisely the scalar product (C.2); thus it vanishes. The term which is quadratic in is , a strictly positive quantity. Hence it is minimized for , as announced.
Now, the integral of (3.28) is equal to
[TABLE]
By multiplying (3.16) by and integrating between and we conclude that
[TABLE]
This converges as to
[TABLE]
Observe this is non zero if and only if the shock is in the interval . Therefore we get that
[TABLE]
To conclude we observe that
[TABLE]
Making use of (C.1) and of , we have
[TABLE]
We conclude that
[TABLE]
We then use (3.19) to get that
[TABLE]
Observe also that
[TABLE]
Finally, since we are considering a totally anti entropic moving step, coincides with , and (C.4) coincides with (3.29), which we wanted to prove.
Appendix D Proof of (3.6)
To get (3.6) observe that the supremum appearing there is realized for solution to
[TABLE]
with zero boundary conditions at infinity. The RHS of (3.6) is then equal to
[TABLE]
Observe also that satisfies . On the other hand we have that
[TABLE]
since any divergence free vector field is a rotational. Since
[TABLE]
we deduce that the previous infimum is realized for constant and equal to (D.1). This proves the claim.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Bahadoran, A quasi-potential for conservation laws with boundary conditions, eprint ar Xiv:1010.3624 (2010).
- 2[2] J. Barré, C. Bernardin, R. Chétrite, C. Nardini, F. Peruani. In preparation.
- 3[3] J. Barr é, D. Mukamel, S. Ruffo, Inequivalence of ensembles in a system with long-range interactions, Phys. Rev. Lett. 87, 030601-1-4 (2001).
- 4[4] G. Bellettini, M. Mariani. Variational convergence of multidimensional conservation laws. Bulletin of the Greek Mathematical Society 57, 31-45 (2010).
- 5[5] G. Bellettini, L. Bertini, M. Mariani, M. Novaga. Γ Γ \Gamma -entropy cost for scalar conservation laws. Arch. Ration. Mech. Anal. 195, no. 1, 261-309 (2010).
- 6[6] E. Bertin, H. Chaté, F. Ginelli, S. Mishra, A. Peshkov, S. Ramaswamy. Mesoscopic theory for fluctuating active nematics. New Journal of Physics 15, 085032 (2013).
- 7[7] E. Bertin, M. Droz, G. Grégoire. Boltzmann and hydrodynamic description for self-propelled particles. Phys. Rev. E 74, 022101 (2006).
- 8[8] L. Bertini, A. De Sole, D. Gabrielli, G. Jona-Lasinio, C. Landim. Macroscopic fluctuation theory. Rev. Mod. Phys. 87, 593 (2015).
