Response Formulae for $n$-point Correlations in Statistical Mechanical Systems and Application to a Problem of Coarse Graining
Valerio Lucarini, Jeroen Wouters

TL;DR
This paper derives general response formulae for n-point correlations and spectral properties in statistical mechanical systems, with applications to coarse graining in multiscale systems, enhancing understanding of system responses to perturbations.
Contribution
It extends previous response theories to include n-point correlations and spectral properties, providing explicit formulae for coarse graining in multiscale systems.
Findings
Derived response formulae for n-point correlations under perturbations
Computed how spectral properties respond to perturbations
Applied results to parameterization in coarse graining, affecting all Mori-Zwanzig terms
Valerio Lucarini
Department of Mathematics and Statistics, University of Reading, Reading, RG6 6AX, UK
Centre for the Mathematics of the Planet Earth, University of Reading, Reading, RG6 6AX, UK
CEN - Institute of Meteorology, University of Hamburg, Hamburg, 20144, Germany
Jeroen Wouters
Department of Mathematics and Statistics, University of Reading, Reading, RG6 6AX, UK
Centre for the Mathematics of the Planet Earth, University of Reading, Reading, RG6 6AX, UK
School of Mathematics and Statistics, The University of Sydney, Sydney, Australia
Abstract
Predicting the response of a system to perturbations is a key challenge in mathematical and natural sciences. Under suitable conditions on the nature of the system, of the perturbation, and of the observables of interest, response theories allow one to construct operators describing the smooth change of the invariant measure of the system of interest as a function of the small parameter controlling the intensity of the perturbation. In particular, response theories can be developed both for stochastic and chaotic deterministic dynamical systems, where in the latter case stricter conditions imposing some degree of structural stability are required. In this paper we extend previous findings and derive general response formulae describing how $n$-point correlations are affected by perturbations to the vector flow. We also show how to compute the response of the spectral properties of the system to perturbations. We then apply our results to the seemingly unrelated problem of coarse graining in multiscale systems: we find explicit formulae describing the change in the terms describing parameterisation of the neglected degrees of freedom resulting from applying perturbations to the full system. All the terms envisioned by the Mori-Zwanzig theory - the deterministic, stochastic, and non-Markovian terms - are affected at first order in the perturbation. The obtained results provide a more comprehensive understanding of the response of statistical mechanical systems to perturbations and contribute to the goal of constructing accurate and robust parameterisations. They are of potential relevance for fields like molecular dynamics, condensed matter, and geophysical fluid dynamics. We envision possible applications of our general results to the study of the response of climate variability to anthropogenic and natural forcing and to the study of the equivalence of thermostatted statistical mechanical systems.
Contents
I Introduction
I.2 Parameterisation of a Coarse Grained Model: Stochasticity and Memory Effects
II A Simple Extension of the Standard Response Theory
II.1 Derivation of Response Formulae for $n$-point Correlations
III.1 Constructing the Projected Evolution Equations for Coarse Grained Variables
I Introduction
I.1 Response Theories
Understanding how a system responds to perturbations is a key challenge in mathematical and natural sciences and has long been the subject of extensive analysis through formal, experimental, and numerical investigations. A fundamental step in the direction of developing a comprehensive response theory can be found in the early work of Kubo (1957) (see also Kubo et al. (1988)), who studied the impact of imposing weak perturbations on a statistical mechanical system originally at thermodynamic equilibrium as described by the canonical ensemble. While the proposed theory was criticised from an early stage - see the famous argument by van Kampen (1971) as discussed in Marconi et al. (2008) - it has been extremely successful in describing many physical phenomena Lucarini et al. (2005); Marconi et al. (2008). The Kubo response theory leads to response formulae that express the change in the expectation value of a given observable of the system as a perturbative series. The zeroth order term is the expectation value of the observable in the unperturbed system, while the first order term, corresponding to the linear response, is expressed in terms of an explicitly determined causal Green's function, which contains comprehensive information on the interplay between the background dynamics of the system and the applied perturbation. It is important to note that the Green's function itself is constructed as an expectation value of an observable on the unperturbed measure, with the ensuing effect that the unperturbed system contains the information needed for estimating its response to general forcings. This provides the basis for the cornerstone of Kubo's response theory, the so-called fluctuation-dissipation theorem (FDT), which links forced and free fluctuations in the linear perturbative regime. This structure extends to higher order terms with a simple generalization, see e.g. Lucarini and Colangeli (2012).
A basic pitfall of Kubo's approach in terms of physical applicability is the impossibility of dealing with perturbations resulting from non-conservative forces. In fact, Kubo's theory does not allow for a consistent treatment of the energy budget of the perturbed system: in general, the external field will inject or subtract energy, so that in order to reach a well-defined steady state it is necessary to add a thermostat Gallavotti (1997); Cohen and Rondoni (1998); Ruelle (2000). The natural question is then whether a specific choice of the thermostat alters the computed linear response. Fortunately, as shown in Evans and Morriss (2008), in the thermodynamic limit of a system with an infinite number of particles, the choice of the thermostat does not alter the predictions of linear response theory: the sensitivity of macroscopic observables does not depend on the details of the microscopic dynamics.
What is also unsatisfactory about the Kubo response theory is that mathematical rigour has been missing in establishing whether the many limits involved in constructing the response formulae are well defined. Additionally, no provision is given for computing the response of nonequilibrium systems to perturbations.
Ruelle (1997, 1998, 2009) showed that it is possible to establish a rigorous response theory for Axiom A maps and flows, which possess invariant Sinai-Ruelle-Bowen (SRB) measures. In other terms, Ruelle showed that in the case of Axiom A systems the invariant measure is differentiable with respect to the parameters controlling small modifications to the flow of the system, and provided explicit expressions for the linear and higher order contributions to the response.
Axiom A systems are indeed far from being typical dynamical systems, but, according to the chaotic hypothesis of Gallavotti and Cohen Gallavotti and Cohen (1995); Gallavotti (1996), they can be taken as effective models for chaotic dynamical systems with many degrees of freedom. Specifically, this means that when looking at macroscopic observables in sufficiently chaotic (to be intended in a qualitative sense) high-dimensional systems, it is expected that it is extremely hard to distinguish their properties from those of an Axiom A system, including some degree of structural stability. Note that the chaotic hypothesis can be seen as the natural extension of the ergodic hypothesis, which is the fundamental heuristic step needed to apply results of equilibrium statistical mechanics to interpret and predict the properties of real systems at equilibrium. Linear response is therefore expected to hold in practice for very general dynamical systems, while the known counter-examples are currently limited to low-dimensional non-uniformly expanding maps Baladi and Smania (2008); Gottwald et al. (2016).
Axiom A systems corresponding to equilibrium physical systems possess an invariant measure that is absolutely continuous with respect to the Lebesgue measure because the phase space neither contracts nor expands, as the flow is nondivergent. Axiom A systems featuring - on the average - a contraction in the phase space provide excellent mathematical models for nonequilibrium systems Gallavotti (2006). In this case, the invariant measure lives on a set with a Hausdorff dimension lower than the number of degrees of freedom of the system and is singular with respect to the Lebesgue measure, as a result of the contraction taking place in the stable manifold Eckmann and Ruelle (1985). Despite the geometrical complexity associated with the attractors of nonequilibrium systems, the Ruelle response theory, somewhat surprisingly, ensures that differentiability can be established also in this case.
In the case of an equilibrium system, the Ruelle response theory allows for deriving the FDT. In nonequilibrium systems, instead, there is no one-to-one correspondence between forced and free fluctuations, as already suggested by Lorenz (1979): Ruelle (1997, 1998, 2009) provides a mathematical explanation of this property, while a physical interpretation is given in, e.g., Lucarini (2008, 2009); Lucarini and Sarno (2011). The basic idea is that while the natural fluctuations are able to substitute for the effect of the components of the forcing along the unstable manifold of the system, the impact of the components of the forcing along the stable manifold has no counterpart in the unperturbed system.
Interestingly, while on one side there have been positive examples of applications of the FDT in nonequilibrium systems, like the climate, it is clear that, for a given class of forcing, the quality of the obtained response operator depends substantially on the chosen observable Gritsun and Branstator (2007); Gritsun et al. (2008). In a recent paper, Gritsun and Lucarini (2017) have provided examples in a system of geophysical relevance of various scenarios supporting or not the applicability of the FDT to reconstruct the response of the system to perturbations. They have clearly shown that, indeed, when the applied forcing has a relevant projection on the stable manifold of the unperturbed system, the forced variability can have little resemblance to the natural one. In particular, the forcing can in some cases excite resonances corresponding to special dynamical features that are virtually unexplored by the unperturbed system, so that one can observe so-called climatic surprises.
The difficulties in constructing ab initio the response operator using Ruelle's formulae come from the extremely different behaviour of the contributions coming from the unstable and stable manifolds Abramov and Majda (2007). The computation of the contributions coming from the stable directions gives neither numerical nor conceptual problems. When the unstable directions are considered, problems emerge from the fact that contributions to the response come from integrals over time of exponentially growing functions, resulting from the presence of sensitive dependence on initial conditions. The ill-posedness of this operation is at the core of the van Kampen (1971) criticism mentioned above. On the other side, response operators, as described in the next section, are constructed by integrating over the statistical ensemble of the (unperturbed) system. Such an operation - under suitable conditions - regularises the previous divergences and explains why linear response is indeed well-posed. Nonetheless, obtaining in practice a stable estimate of the response operators from a finite number of ensemble members and from finite numerical simulations is far from obvious. We note that algorithms based on adjoint methods seem to partially ease these issues Eyink et al. (2004); Wang (2013).
Convincingly good results in terms of climate prediction performed using the linear response theory have instead been obtained through bypassing the problem of constructing the response operator and using, instead, the formal properties of the Green's function Lucarini and Sarno (2011); Lucarini et al. (2014); Ragone et al. (2016); Lucarini et al. (2017). Tests in simple models have emphasized that also the nonlinear response theory is extremely solid and amenable to numerical verification Lucarini (2009).
Modern methods of spectral theory have provided different and elegant proofs and further generalizations of Ruelle's results. The response theory can be developed by comparing the Perron-Frobenius transfer operator Baladi (2000) of the unperturbed and of the perturbed system, thus focussing on the evolution of distributions rather than of individual trajectories - see e.g. Liverani and Gouëzel (2006); Baladi and Smania (2008); Baladi et al. (2014). This approach has allowed the extension of Ruelle's results to systems more general than the Axiom A case, by focusing on constructing suitable Banach spaces of anisotropic distributions. The practical applicability of transfer operator-based methods for studying the response in high dimensional systems is still not entirely clear, as a result of the curse of dimensionality, even if some optimism comes from the overall positive results obtained when severely reduced order models are considered Tantet et al. (2015, 2015). Additionally, ideas borrowed from the theory of the transfer operator have proved extremely useful for studying the behaviour of geophysical systems in the vicinity of critical transitions, where the response theory breaks apart, decorrelation times become very long, and the presence of Ruelle-Pollicott resonances leads to the appearance of rough dependence of the system properties on the perturbation parameter Chekroun et al. (2014). Recently, explicit formulae based on simple matrix algebra have been proposed for computing the response of a finite state Markov chain to perturbations, thus providing a model for studying finer and finer partitions of actual phase spaces Lucarini (2016).
A different way to approach the problem of constructing a response theory can be followed by taking the point of view of stochastic dynamics, as proposed initially by Hänggi and Thomas (1975, 1977); see a recent review by Baiesi and Maes (2013). Adding (suitably chosen, typically Gaussian white) noise on top of the deterministic dynamics makes it possible to deal with invariant measures that are absolutely continuous with respect to Lebesgue and ensures that the decay of correlations in the system is fast. As a result, some of the mathematical issues discussed above are automatically sorted out and, in particular, the FDT holds in all cases. Thanks to the presence of noise it is possible to set a general framework for linear response theory in much greater regularity, including the case of infinite dimensional systems; see Hairer and Majda (2010) for a mathematically accurate study of linear response for stochastic systems, where many subtleties are sorted out. One needs to note, though, that while the presence of noise smooths the invariant measure of the system, the weaker the noise, the harder it is for a numerical model to appreciate such smoothness given the finite length of numerical simulations and the finite size of the ensemble of performed simulations.
I.2 Parameterisation of a Coarse Grained Model: Stochasticity and Memory Effects
Adding stochastic forcings on top of the deterministic dynamics should be justified on physical grounds and not used just as an ad hoc assumption. A convincing way to motivate the introduction of a random component to the dynamics comes from the need to take into account the effect of microscopic, unresolved scales; see a mathematically rigorous and complete treatment in Chekroun et al. (2015a, b). Along the lines of the early results by Mori and Zwanzig Zwanzig (1961); Mori (1965), Chekroun et al. (2015a, b) also clearly show that the construction of reduced order models unavoidably leads to introducing non-Markovian terms in the surrogate dynamics of the variables of interest.
The problem of constructing accurate and robust parameterisations for degrees of freedom that are hard to simulate explicitly is a crucial problem in a variety of scientific fields, and most notably in condensed matter physics Bhalla et al. (2016), molecular dynamics Shinoda et al. (2007); Baron et al. (2007); Kmiecik et al. (2016), and in geophysical fluid dynamics Franzke et al. (2015); Berner et al. (2016).
The situation in the case of atmospheric, ocean, and climate models is particularly complex because there is no clear gap (in terms of temporal and spatial scales) in the variability of the fluid motions Ghil and Childress (1987); Peixoto and Oort (1992); Lucarini et al. (2014). As a result, the approximation of infinite time separation between resolved and unresolved scales is unsatisfactory, so that the standard homogenization theory Pavliotis and Stuart (2008) cannot be safely applied in this case. Consequently, the stochastic terms in the parameterisation cannot be represented as white noise, and the presence of memory effects additionally leads to the need to incorporate, in principle, non-Markovian terms in the dynamics.
Additionally, given the available numerical resolution at hand, one always faces the problem of dealing with the so-called grey zone, a range of scales where physical processes are only partially resolved Gerard (2007). Further, the parameterisation depends on where one defines the cutoff between resolved and unresolved scales of motion (practically often determined by the computational facilities at hand or the required length or number of the model runs), so that a painstaking process of tuning is in principle necessary each time the resolution of the model needs to be changed. As a result, the quest for self-adaptive parameterisation has been recently emphasized in the literature, see e.g. Arakawa et al. (2011); Park (2014); Sakradzija et al. (2016). Self-adaptivity is crucial for the goal of constructing models able to perform seamless prediction, i.e. to be used for weather forecast, seasonal prediction, and climate modelling Palmer et al. (2008).
As for the scope of this paper, it is relevant to note that one can use the Ruelle response theory to compute explicitly the effect of small scale, fast degrees of freedom on the macroscopic ones. In this case, the perturbation one studies using the results by Ruelle is exactly the coupling between the dynamics occurring at the different scales. One discovers that it is possible to derive an explicit parameterisation providing a deterministic, a stochastic, and a non-Markovian contribution to the dynamics of the variables of interest, thus obtaining a perturbative yet self-consistent closure to the problem Wouters and Lucarini (2012, 2013, 2016). The various terms are constructed in terms of specific response operators at first and second order. Some first promising examples of applications of the theory and investigation of the skills of the parameterisation schemes have been recently presented in models of various degrees of complexity Wouters et al. (2016); Vissio and Lucarini (2016); Demaeyer and Vannitsem (2017).
I.3 This Paper
In this paper we set ourselves in the context of (possibly high-dimensional) chaotic deterministic dynamical systems, assume the chaotic hypothesis and, consequently, the applicability of the Ruelle response theory. We expect, nonetheless, that our results should apply also in the case of stochastic dynamics, apart from obvious changes in the notation. This paper has a twofold purpose and addresses an interdisciplinary audience.
We first take a rather general point of view and note that most of the theoretical results presented in the literature focus on assessing the response of the system to perturbations in terms of changes of the expectation values of suitably defined observables or, equivalently, of the invariant measure. This statement applies to both more heuristic and more rigorous studies, and both to approaches based on the framework of deterministic or stochastic dynamics. The elephant in the room is, in our view, the lack (at least to the authors' knowledge) of general explicit formulae predicting how the time-lagged correlations of observables change as a result of perturbations to the dynamics. Therefore, in this paper we provide explicit linear response formulae for $n$-point time correlations of observables. As discussed below, in the general case treated here the response formulae become more involved than in the usual case of observables and one derives new terms that cannot be framed, even in the case of unperturbed systems possessing a smooth invariant measure, in terms of the FDT. The possibility of having formulae for studying the response of higher order moments is quite attractive because it paves the way to asking how the statistical properties of the fluctuations of the system change as a result of the applied perturbation. In the specific case of climate dynamics, which is an application of special interest for the authors, this amounts to being able to address the question of how the climate variability changes in response to climate forcing Ghil (2015). This is a major and indeed open problem in the climate literature.
We then discuss a - seemingly unrelated - problem of interdisciplinary relevance, which was, in fact, the original driver of the investigation presented in this paper. We look into the problem of constructing reduced order models for multiscale systems and take advantage of the fact that, as mentioned above, it can be framed as an indeed nontrivial exercise that can be studied using response theory. Finding an accurate and efficient way to perform coarse graining in multiscale systems amounts to constructing a parameterised dynamics for the variables of interest (usually the large scale, slow ones) and is key to supporting the development of practically usable numerical models. A much desired quality of a parameterisation is its adaptivity with respect to changes in the properties of the system. In previous publications Wouters and Lucarini (2012, 2013, 2016) we have introduced a general method for constructing parameterisations whose main advantage is its adaptivity to the parameters describing the coupling and/or the time scale separation between the slow and fast scales of motion, whose lack is, instead, a key drawback of many other methods, and especially of the empirical ones. A basic issue, both at practical and at theoretical level, is to assess the robustness of a parameterisation with respect to small changes in the dynamics of the system. In this paper, using the general results mentioned above, we are able to construct a response theory for the reduced order, coarse grained model, and derive explicit formulae for the change of the various terms composing the parameterisation. This has relevance for the goal of constructing parameterisations able to adjust to small changes in the dynamics of the full system. Note that such perturbations can also be considered as a representation of the model error: in this case, our results address the problem of understanding how the model error translates into the formulation of the reduced order model.
Since the numerical implementation and analysis of the response-based parameterisation is a topic in full development, the current extension of the theory consists mostly of formal calculations at this stage. Numerical analysis will be the subject of future investigations.
The paper is organised as follows. In Section II we show how the response formulae are changed when the observable we are considering is also a function of the small parameter controlling the intensity of the forcing. In Section II.1 we use the result of Sect. II to present the extension of the response theory for the case of $n$-point correlations. We show in detail the calculations needed to reach general formulae that include, as a special case, the usual response formulae for observables. The results contained in Sect. II might be of interest for experts in dynamical systems and statistical mechanics. In Section III we recapitulate how to construct parameterisations allowing for performing consistently coarse graining on multiscale systems and we show how the theory developed in Sect. II.1 allows for finding explicit formulae for the corrections to the parameterisations due to a perturbation applied to the full system. The results contained in Sect. III might be additionally of interest for scientists interested in specific applications of coarse graining methods, such as those working on the development of parameterisations for describing the coarse grained dynamics of systems of interest for, e.g., molecular dynamics or geophysical fluid dynamics. In Section IV we discuss our results and present our concluding remarks.
II A Simple Extension of the Standard Response Theory
Let's consider a continuous time Axiom A dynamical system Eckmann and Ruelle (1985); Ruelle (1989) defined on a compact $d$-dimensional manifold $\mathcal{M}$ of the form
$$\dot{x} = F(x), \tag{1}$$
possessing an invariant measure $\rho_0$. We frame our results below in the setting of deterministic dynamical systems but we stress that equivalent equations will hold for stochastic differential equations.
The expectation value of a general observable $\Psi$ on such a measure can be written as $\int \rho_0(dx)\,\Psi(x)$. We can also write the expectation value in a more compact form as $\rho_0(\Psi)$ or as $\langle\Psi\rangle_0$, where we stress that the expectation value is the result of applying a linear functional (the measure $\rho_0$) to the measurable function $\Psi$.
Let $f^t(x)$ be the flow from an initial condition $x$, i.e. $x(t)=f^t(x)$ and $x(t)$ satisfies (1). Then the Koopman operator $U_0^t$ is the composition of an observable with the flow: $U_0^t\Psi(x)=\Psi(f^t(x))$. Under suitable conditions, one can express the Koopman operator as $U_0^t=\exp(tL_0)$, where $L_0$ is such that $L_0\Psi=F\cdot\nabla\Psi$ for all differentiable functions $\Psi$. The Perron-Frobenius-Ruelle operator $P_0^t$ is the adjoint of the Koopman operator and defines the push-forward of an initial measure $\rho$ so that $\rho(t)=P_0^t\rho$, defined as follows:
$$\int (P_0^t\rho)(dx)\,\Psi(x) = \int \rho(dx)\,U_0^t\Psi(x). \tag{2}$$
Note that we have $P_0^t=\exp(tL_0^+)$, with $L_0^+\rho=-\nabla\cdot(F\rho)$. Additionally, by definition, we have $P_0^t\rho_0=\rho_0$ and, correspondingly, $L_0^+\rho_0=0$.
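Let's now consider a small perturbation to the vector flow of the form
$$\dot{x} = F(x) + \epsilon G(x), \tag{3}$$
so that the perturbed flow possesses an invariant measure $\rho_\epsilon$, and one can define the perturbed Liouville operator as $L_\epsilon=L_0+\epsilon L_1$, where $L_1\Psi=G\cdot\nabla\Psi$, with adjoint $L_\epsilon^+=L_0^++\epsilon L_1^+$. We also define the perturbed evolution and Perron-Frobenius-Ruelle operators as $U_\epsilon^t=\exp(tL_\epsilon)$ and $P_\epsilon^t=\exp(tL_\epsilon^+)$, respectively.
It is of clear relevance to be able to say under which conditions for small values of $\epsilon$ it is possible to expand $\rho_\epsilon(\Psi)$ as follows:
$$\rho_\epsilon(\Psi) = \rho_0(\Psi) + \epsilon\,\frac{d\rho_\epsilon(\Psi)}{d\epsilon}\bigg|_{\epsilon=0} + \text{h.o.t.}, \tag{4}$$
where h.o.t. indicates higher order terms, and to find an explicit expression for the key quantity $d\rho_\epsilon(\Psi)/d\epsilon|_{\epsilon=0}$, which controls the first order correction of the expectation value. The Ruelle response theory says that if the unperturbed dynamical system is Axiom A and we consider a sufficiently smooth observable $\Psi$, one can write
$$\frac{d\rho_\epsilon(\Psi)}{d\epsilon}\bigg|_{\epsilon=0} = \int_0^\infty d\tau\,\rho_0\left(L_1U_0^\tau\Psi\right) = \int_0^\infty d\tau\int\rho_0(dx)\,G(x)\cdot\nabla\left(\Psi\circ f^\tau\right)(x), \tag{5}$$
so that one can alternatively write $\rho_\epsilon=\rho_0+\epsilon\rho_1+\text{h.o.t.}$, where
$$\rho_1 = \int_0^\infty d\tau\,P_0^\tau L_1^+\rho_0; \tag{6}$$
we write in this case $d\rho_\epsilon(\Psi)/d\epsilon|_{\epsilon=0}=\rho_1(\Psi)$.
Note that if $G=F$, so that the perturbation is just a linear change in the time variable $t\to(1+\epsilon)t$, we have that $\rho_1=0$ because $L_1^+\rho_0=L_0^+\rho_0=0$, from the definition of $\rho_0$. Note that rescaling time does not affect the expectation value of any observable at all orders of perturbations.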
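The structure of the first order correction to the measure can be checked on a toy problem. The following is a minimal sketch, assuming a discrete-time, finite-state analogue of the setting above: a column-stochastic matrix `M` plays the role of the unperturbed Perron-Frobenius-Ruelle operator, a zero-column-sum matrix `m` plays the role of the perturbation operator, and the time integral becomes a sum over powers of `M`. The matrices and all function names are hypothetical choices made for illustration and are not taken from the paper.

```python
# Sketch: linear response of the invariant measure of a 3-state Markov chain.
# M, m and every function name below are illustrative assumptions.

def matvec(A, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def stationary(M, n_iter=2000):
    """Invariant vector of a column-stochastic matrix by power iteration."""
    v = [1.0 / len(M)] * len(M)
    for _ in range(n_iter):
        v = matvec(M, v)
    return v

def first_order_measure(M, m, pi0, n_terms=500):
    """pi1 = sum_{k>=0} M^k (m pi0), truncated after n_terms terms."""
    term = matvec(m, pi0)
    pi1 = [0.0] * len(pi0)
    for _ in range(n_terms):
        pi1 = [a + b for a, b in zip(pi1, term)]
        term = matvec(M, term)
    return pi1

def perturbed(M, m, eps):
    """Entrywise M + eps * m."""
    return [[a + eps * b for a, b in zip(rM, rm)] for rM, rm in zip(M, m)]

# Column-stochastic transition matrix: every column sums to 1.
M = [[0.5, 0.2, 0.3],
     [0.3, 0.6, 0.1],
     [0.2, 0.2, 0.6]]
# Perturbation: every column sums to 0, so M + eps*m stays stochastic.
m = [[0.1, -0.1, 0.0],
     [-0.1, 0.0, 0.1],
     [0.0, 0.1, -0.1]]

pi0 = stationary(M)
pi1 = first_order_measure(M, m, pi0)

# Centred finite difference of the perturbed invariant measure.
eps = 1e-5
pi_plus = stationary(perturbed(M, m, eps))
pi_minus = stationary(perturbed(M, m, -eps))
fd = [(a - b) / (2 * eps) for a, b in zip(pi_plus, pi_minus)]
max_error = max(abs(a - b) for a, b in zip(pi1, fd))

print("response formula :", pi1)
print("finite difference:", fd)
```

The two printed vectors should agree up to the finite-difference error, and the entries of `pi1` sum to zero because the perturbed measure remains normalised.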
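It is easy to generalise the problem to the case where the observable $\Psi$ is a function of $\epsilon$ so that one can write the following expansion for small values of $\epsilon$: $\Psi_\epsilon=\Psi_0+\epsilon\Psi_1+\text{h.o.t.}$ In this case, we have that
$$\rho_\epsilon(\Psi_\epsilon) = \rho_0(\Psi_0) + \epsilon\,\frac{d\rho_\epsilon(\Psi_\epsilon)}{d\epsilon}\bigg|_{\epsilon=0} + \text{h.o.t.}, \tag{7}$$
where the linear sensitivity can be expressed as:
$$\frac{d\rho_\epsilon(\Psi_\epsilon)}{d\epsilon}\bigg|_{\epsilon=0} = \int_0^\infty d\tau\,\rho_0\left(L_1U_0^\tau\Psi_0\right) + \rho_0(\Psi_1) = \rho_1(\Psi_0)+\rho_0(\Psi_1), \tag{8}$$
where the first term corresponds to the usual response theory, and comes from the change of the dynamics of the system, while the second term comes from the change of the definition of the observable as a function of $\epsilon$.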
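The two contributions to the linear sensitivity can be recovered with a short product-rule argument. The sketch below assumes that both the measure and the observable admit first order expansions, and uses $\rho_0,\rho_1$ and $\Psi_0,\Psi_1$ as labels for the zeroth and first order terms of the measure and of the observable:

```latex
\begin{align}
\rho_\epsilon(\Psi_\epsilon)
  &= \left(\rho_0+\epsilon\rho_1\right)\left(\Psi_0+\epsilon\Psi_1\right) + \text{h.o.t.} \\
  &= \rho_0(\Psi_0) + \epsilon\left[\rho_1(\Psi_0)+\rho_0(\Psi_1)\right] + \text{h.o.t.}
\end{align}
```

The cross term $\epsilon^2\rho_1(\Psi_1)$ is of second order and is absorbed into the higher order terms.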
Let's take a first simple and relevant example to illustrate the meaningfulness of this result. We consider as observable the divergence of the flow in Eq. 3, $\Psi_\epsilon=\nabla\cdot(F+\epsilon G)$. The expectation value of this observable is equal to the sum of the Lyapunov exponents of the system and can be interpreted as the opposite of its entropy production Ruelle (1989); Gallavotti (2014). We have that
$$\frac{d\rho_\epsilon\left(\nabla\cdot(F+\epsilon G)\right)}{d\epsilon}\bigg|_{\epsilon=0} = \int_0^\infty d\tau\,\rho_0\left(L_1U_0^\tau\,\nabla\cdot F\right) + \rho_0\left(\nabla\cdot G\right). \tag{9}$$
If the expectation value on the unperturbed measure of the divergence of the perturbation flow is zero (or a fortiori if the perturbation flow is divergence-free), the second term vanishes. See Appendix A for a discussion on the physical interpretation of Eq. 9.
II.1 Derivation of Response Formulae for $n$-point Correlations
II.1.1 Two-point Correlations
We now consider as observable the product of the values of two observables $\Psi$ and $\Phi$ taken at different times, i.e., without loss of generality, $\Psi(x(\tau))\Phi(x(0))$ with $\tau\geq0$. The expectation value of $\Psi(x(\tau))\Phi(x(0))$ is $\rho_0(\Phi\,U_0^\tau\Psi)$, the lagged correlation between $\Phi$ and $\Psi$. The local quantity $\Phi(x)\,U_0^\tau\Psi(x)$ measures the joint fluctuations of the two observables $\Phi$ and $\Psi$ at different times but along the same orbit.
We consider the perturbed flow given in Eq. 3. The product can be written as $\Phi\,U_\epsilon^\tau\Psi$, so that we must add a lower index $\epsilon$ to the observable as well as to the measure $\rho_\epsilon$.
In order to obtain an expression for $d\rho_\epsilon(\Phi\,U_\epsilon^\tau\Psi)/d\epsilon|_{\epsilon=0}$, we need to expand the Koopman operator $U_\epsilon^\tau$ for small values of $\epsilon$. Using the Dyson formalism, we have:
$$U_\epsilon^\tau = \exp\left(\tau(L_0+\epsilon L_1)\right) = U_0^\tau + \epsilon\int_0^\tau ds\,U_0^{\tau-s}L_1U_0^{s} + \text{h.o.t.}, \tag{10}$$
where h.o.t. indicates terms featuring higher powers of the parameter $\epsilon$. Note that the term proportional to $\epsilon$ in the right hand side of the previous equation is instrumental for deriving the desired result. We then have that the linear response of the lagged time correlation between the two observables $\Phi$ and $\Psi$ can be written as:
$$\frac{d\rho_\epsilon\left(\Phi\,U_\epsilon^\tau\Psi\right)}{d\epsilon}\bigg|_{\epsilon=0} = \rho_1\left(\Phi\,U_0^\tau\Psi\right) + \rho_0\left(\Phi\int_0^\tau ds\,U_0^{\tau-s}L_1U_0^{s}\Psi\right). \tag{11}$$
The first term on the right hand side gives the correction of the local (in phase space) fluctuations computed according to the unperturbed dynamics due to the fact that the perturbation flow modifies the trajectories, and corresponds to what one would obtain with a naive application of the response theory for studying the change in the correlations of the system. The second term corresponds to the expectation value on the unperturbed dynamics of the change in the evolution law due to the presence of the perturbation.
In particular, we can write the first term as:
[TABLE]
Comparing with ColangeliĀ andĀ Lucarini (2014), we observe that this expression resembles a second order response term for regular observables, but, thanks to the presence of a slightly simpler functional form, can be brought to a FDT-like form by applying the operator to the unperturbed invariant measure :
[TABLE]
where we have an integral over one time variable of a three-point correlation.
Instead, the second term in Eq. 11 can be written as:
[TABLE]
Note that this term vanishes if because in this case the function is no longer a function of , and the usual response theory formulae apply. Due to the presence of a different time ordering in the operators, we cannot reframe Eq. 14 in an FDT-like form.
We also wish to note that if the system is mixing and has rapid decay of correlations, both terms given in the right hand side of Eqs. 12-14 will tend to zero for large values of .
In order to have a simple consistency test of our results, let us also take the special case seen above where , i.e., we rescale the time variable . In this case, the first term given in Eq. (12) vanishes, because . This corresponds to what was discussed before when looking at the response theory for observables.
Instead, the second term reads . The (trivial) fact that rescaling time leads to a change in the correlation functions can be immediately derived by observing that
[TABLE]
just as obtained above.
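The time-rescaling check can also be illustrated numerically. The sketch below assumes that the perturbation field coincides with the unperturbed vector field, so that time simply runs faster by a factor $(1+\epsilon)$ and the perturbed lagged correlation is the unperturbed one evaluated at lag $(1+\epsilon)\tau$. The function `c0` is a hypothetical model correlation chosen for illustration only; it does not come from the paper.

```python
import math

def c0(tau):
    """Hypothetical unperturbed lagged correlation (assumed model, not from the paper)."""
    return math.exp(-0.5 * tau) * math.cos(2.0 * tau)

def c_eps(tau, eps):
    """Correlation of the time-rescaled dynamics: time runs faster by a factor (1 + eps)."""
    return c0((1.0 + eps) * tau)

def response_fd(tau, eps=1e-6):
    """Central finite difference of c_eps with respect to eps, evaluated at eps = 0."""
    return (c_eps(tau, eps) - c_eps(tau, -eps)) / (2.0 * eps)

def response_formula(tau, h=1e-6):
    """tau * dC_0/dtau, with the tau-derivative taken by central differences."""
    return tau * (c0(tau + h) - c0(tau - h)) / (2.0 * h)

taus = [0.0, 0.5, 1.0, 2.0]
max_gap = max(abs(response_fd(t) - response_formula(t)) for t in taus)
```

Under these assumptions the linear response of the correlation at lag $\tau$ is $\tau$ times the lag derivative of the unperturbed correlation, and `max_gap` measures how closely the two numerical estimates agree.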
II.1.2 The General Case of $n$-point Correlations
We now consider the case of general correlation functions. Take
[TABLE]
and define the $n$-point correlation function for the perturbed system as:
[TABLE]
We can then construct the first order expansion of the $n$-point correlation as follows:
[TABLE]
The term proportional to is given by the sum of $n$ terms, the first one resulting from the linear correction to the measure, which corresponds to what one would naively obtain by applying the standard response theory, and the other terms resulting from the linear correction to each of the Koopman operators appearing in the definition of the $n$-point correlation function. We have:
[TABLE]
As seen in the case of two-point correlations, the first term can be brought to an FDT-like form by applying the operator to the unperturbed invariant measure , while the other terms have a more convoluted structure.
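The counting of terms follows from the product rule, and can be sketched as below. The notation is an assumption of this sketch, not necessarily that of the paper: $\Psi_1,\dots,\Psi_n$ are the observables, $t_1 \le \dots \le t_n$ the times, $\tau_k = t_{k+1}-t_k$ the lags, $\Pi_\epsilon^{\tau}$ the Koopman operator of the perturbed flow, $\rho_\epsilon$ its invariant measure, and $L_1$ the contribution of the perturbation field to the generator.

```latex
\begin{align}
C_\epsilon(\tau_1,\dots,\tau_{n-1})
  &= \rho_\epsilon\Big(\Psi_1\, \Pi_\epsilon^{\tau_1}\big[\Psi_2\,
     \Pi_\epsilon^{\tau_2}\big[\cdots \Pi_\epsilon^{\tau_{n-1}}\Psi_n\big]\big]\Big), \\
\frac{\mathrm{d}C_\epsilon}{\mathrm{d}\epsilon}\Big|_{\epsilon=0}
  &= \frac{\mathrm{d}\rho_\epsilon}{\mathrm{d}\epsilon}\Big|_{\epsilon=0}
     \Big(\Psi_1\, \Pi_0^{\tau_1}\big[\Psi_2\, \Pi_0^{\tau_2}\big[\cdots
     \Pi_0^{\tau_{n-1}}\Psi_n\big]\big]\Big) \nonumber \\
  &\quad + \sum_{k=1}^{n-1} \rho_0\Big(\Psi_1\, \Pi_0^{\tau_1}\big[\cdots \Psi_k\,
     \delta\Pi^{\tau_k}\big[\Psi_{k+1}\, \Pi_0^{\tau_{k+1}}[\cdots]\big]\big]\Big), \\
\delta\Pi^{\tau} &= \int_0^{\tau}\mathrm{d}s\; \Pi_0^{\tau-s}\, L_1\, \Pi_0^{s}.
\end{align}
```

In this form one contribution comes from the measure and one from each of the $n-1$ Koopman operators, and setting $n=2$ gives back the two-term structure of the two-point case.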
II.1.3 Change in the Spectral Properties of the System
We can use the results presented before to draw interesting conclusions on how the spectral properties of the system under investigation change as a result of the perturbation. Under suitable conditions of integrability, we have that , where is the Fourier transform of and is the complex conjugate of . With we indicate the co-spectrum of the two functions and (note the effect of the time lag). In particular, we have that if , , which corresponds to the Khinchin-Wiener theorem. Thanks to the linearity of the Fourier transform, we can then derive the following expression from Eq. 11:
[TABLE]
where we have added a lower index to the cross-spectrum in order to keep track of the presence of the -perturbation to the dynamics. Equation 19 provides the answer to the quite relevant question of how the spectral properties of the system change as a result of the presence of perturbations. Note that the first term on the right hand-side of Eq. 19 can be interpreted as cross-spectrum of the same observables and where the time statistics is computed according to the measure (instead of the original invariant measure ). A simple dynamical-statistical interpretation for the second term is harder to provide, as the time-dependent operator appearing between the two observables leads to computing correlations (with respect to the unperturbed invariant measure ) between points in the phase space having no obvious dynamical link. See also the previous discussion around Eqs. 12-13.
Note also that the linear response of higher order spectral properties of the system to the perturbation can be derived by applying the dimensional Fourier transform in Eq. 18. This shows that our results allow for a more comprehensive understanding of the response of the system to perturbations than usual response theory.
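The link between correlations and spectra used in this subsection can be checked on a discrete, periodic toy series. The sketch below is only an illustration of the Khinchin-Wiener relation in its discrete circular form; the signal is a hypothetical stand-in for an observable sampled along an orbit and is not taken from the paper.

```python
import cmath
import math

def dft(x):
    """Plain O(N^2) discrete Fourier transform."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * math.pi * j * k / n) for j in range(n))
            for k in range(n)]

def circular_autocorrelation(x):
    """c[m] = sum_j x[j] * x[(j + m) mod N] for a real periodic series."""
    n = len(x)
    return [sum(x[j] * x[(j + m) % n] for j in range(n)) for m in range(n)]

# Hypothetical real-valued signal standing in for an observable along an orbit.
signal = [math.sin(0.7 * j) + 0.3 * math.cos(2.1 * j) for j in range(32)]

# Squared modulus of the transform of the signal.
power_spectrum = [abs(z) ** 2 for z in dft(signal)]
# Transform of the lagged autocorrelation of the same signal.
spectrum_of_correlation = dft(circular_autocorrelation(signal))

max_mismatch = max(abs(p - s) for p, s in zip(power_spectrum, spectrum_of_correlation))
```

Because the Fourier transform is linear, any first-order correction to the lagged correlation maps directly onto a first-order correction to the spectrum, which is the property exploited in the text.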
We note that in Lucarini (2012) the problem of looking at the change of the spectral properties of a system had been approached from a different angle, studying the effect of stochastic perturbations applied on top of deterministic chaotic dynamics. The main result obtained there is that one can establish a simple algebraic link between the change of the power spectrum of an observable (corresponding to the specific choice in terms of what is presented here) and the squared modulus of the susceptibility referred to the same observable.
III Response Formulae for Reduced Order Models
We find a useful application of the results detailed above in the special case of constructing parameterisations for reduced order models, along the lines of Wouters and Lucarini (2012, 2013, 2016). Let us first recapitulate the main results obtained there; we shall then see how to apply the extended response theory described above to derive some new results. The idea is to derive formulae able to describe how the parameterisation changes as a result of perturbations applied to the full system, or, in other terms, how applying a perturbation changes the properties of the Mori-Zwanzig projection operator.
III.1 Constructing the Projected Evolution Equations for Coarse Grained Variables
We consider a high-dimensional chaotic dynamical system where belongs to a compact manifold , and then rewrite the dynamics by separating into two subsets of variables, with . Such a separation typically comes from the fact that we are interested in studying the properties of the variables only, corresponding to the coarse grained quantities of interest. Typically, the number of variables is much larger than the number of variables, and one would like to have a time-scale separation (or spectral gap) between the two sets of variables. Without loss of generality one can write:
[TABLE]
where we have separated the part of the vector field () coupling the and the variables from the part of the vector field () that drives independently the two groups of variables. We have also introduced the bookkeeping parameter , which measures the strength of the coupling between the and variables. We wish to derive a reduced model for the variables able to reproduce accurately (in some sense to be defined later) its statistical properties resulting from the full dynamics given in Eqs. 20-21. The Mori-Zwanzig theory allows for an exact and powerful yet implicit solution to this problem, obtained by formally removing the evolution of the variables. As a result, one obtains that it is possible to write the projected dynamics of the variables as follows:
[TABLE]
where contains both Markovian and non-Markovian components and provides the so-called parameterisation of the effect of the variables on the variables. The vector field contains information on the average effect of the coupling between the and variables, on the impact of the fluctuations of the variables, and on the memory effects due to nonlinear cross-correlations between the two groups of variables.
Unfortunately, the explicit form of is not in general available. In the limit of infinite time scale separation between the and variables, such that the variables fluctuate infinitely faster than the variables, it is instead possible to derive explicit results using the homogenization technique Pavliotis and Stuart (2008).
One obtains that the term is given by the sum of a deterministic term, corresponding to the intuitive mean field effect, plus a white noise stochastic term, which describes the effect of the fluctuations, while the memory term disappears. Following Pavliotis and Stuart (2008), one has that in physical systems the white noise should be interpreted in the sense of Stratonovich, as it should be considered as the limiting case of a red noise having vanishing decorrelation time.
This approach is extremely powerful and physically appropriate in all the situations where a substantial time-scale separation can be found between the two sets of variables. In situations, like in the case of climate dynamics, where there is no real spectral gap, the assumption of infinite time scale separation is risky.
In Wouters and Lucarini (2012, 2013, 2016) we have shown that, assuming that is small (weak coupling hypothesis), it is possible to find an explicit expression of the Mori-Zwanzig corrections to the dynamics by performing a formal expansion of the Koopman operator in powers of and retaining the first two orders. The idea is to treat the coupling as a perturbation to the otherwise uncoupled dynamics of the and variables. One obtains that the surrogate dynamics of the variables can be written as follows:
[TABLE]
where is a deterministic vector field, is a stochastic term constructed from the statistics of the fluctuations of the variables, and is a non-Markovian term describing the fact that in the fully coupled dynamics the current state of the variables contains information on the state of the variables at previous times. This result is in agreement with the general theory on model reduction proposed by Chekroun et al. (2015a, b).
The explicit expressions for the terms on the right hand side of Eq. 23 are obtained as follows. We start by defining as the invariant measure of the dynamical system , where in the lower index refers to the fact that the dynamics of is uncoupled from the dynamics of , so that the expectation value of a measurable observable .
We then take the simplifying assumption that and . As discussed in Wouters and Lucarini (2013, 2016), such an assumption leads to simpler and easier to interpret formulae; yet, it does not really lead to a loss of generality, if one takes into account the possibility of expanding a function of both and variables as a sum of products of functions of separately and variables only, using a Schauder decomposition Lindenstrauss and Tzafriri (1996).
The deterministic mean field term is given by:
[TABLE]
We introduce now the anomalies for and . We have that the second term of the parameterisation can be written as:
[TABLE]
where is a centered random process with time correlation given by
[TABLE]
where indicates the Koopman operator of the variables in the uncoupled case with , such that for any function of the phase space . Note that the random process is not unique, as, at the desired level of precision in terms of , we only require that the noise is centered and with the above mentioned correlation properties. Finally, the third term in the parameterisation provides the non-Markovian contribution to the reduced model and is given by
[TABLE]
where the integration kernel is written as
[TABLE]
A thorough interpretation of the three terms is reported in Wouters and Lucarini (2012, 2013, 2016).
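The statistical ingredients of the first two terms, namely an average over the uncoupled dynamics and a lagged correlation of the corresponding anomalies, can be estimated from a long orbit of the uncoupled system. The sketch below is a minimal illustration under loudly stated assumptions: the logistic map is a hypothetical discrete-time stand-in for the uncoupled dynamics of the neglected variables, `coupling_factor` is a hypothetical choice for the part of the coupling that depends on those variables, and ergodic time averages are used in place of expectation values. None of these choices comes from the paper.

```python
def logistic_orbit(y0, n_steps, n_discard=100):
    """Orbit of y -> 4 y (1 - y), a hypothetical stand-in for the uncoupled dynamics."""
    y = y0
    for _ in range(n_discard):
        y = 4.0 * y * (1.0 - y)
    orbit = []
    for _ in range(n_steps):
        orbit.append(y)
        y = 4.0 * y * (1.0 - y)
    return orbit

def coupling_factor(y):
    """Hypothetical factor of the coupling that depends on the neglected variables."""
    return y

def mean_and_lagged_covariance(samples, max_lag):
    """Time-average estimates of the mean and of the lagged covariance of the anomalies."""
    n = len(samples)
    mean = sum(samples) / n
    anomalies = [s - mean for s in samples]
    covariance = []
    for lag in range(max_lag + 1):
        pairs = n - lag
        covariance.append(
            sum(anomalies[k] * anomalies[k + lag] for k in range(pairs)) / pairs
        )
    return mean, covariance

orbit = logistic_orbit(y0=0.1234, n_steps=20000)
values = [coupling_factor(y) for y in orbit]
mean_factor, noise_covariance = mean_and_lagged_covariance(values, max_lag=5)
```

In this toy setting `mean_factor` plays the role of the average entering the mean field term, while `noise_covariance` plays the role of the time correlation that the centered random process is required to reproduce. The memory kernel involves additional derivatives and is not covered by this sketch.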
We note that, using the Ruelle response theory, one also proves that up to second order in the invariant measure of the dynamical system given in Eq. 23 is the same as the projection of the measure of the full dynamics given in Eqs. 20-21. Therefore, the parameterisation given in Eq. 23 is effective in reproducing both the dynamical and the statistical properties of the full system.
Furthermore, as opposed to more common heuristic approaches, it performs - in the limit of small - consistently well no matter which observable we are considering; it is, in this sense, universal and not targeted to a specific measure of skill. In Wouters et al. (2016); Vissio and Lucarini (2016); Demaeyer and Vannitsem (2017) the properties of parameterisations of models of different level of complexity obtained following this strategy are studied in detail. Note that in the limit of infinite time-scale separation between the and variables, the homogenization theory results are recovered and the non-Markovian term drops out.
III.2 Impact of the Perturbations on the Parameterisation
A basic problem often encountered when constructing parameterisations for unresolved processes is assessing the robustness of the reduced model with respect to small changes of the dynamics of the full system. When the dynamics of the full system is weakly perturbed with respect to reference conditions, one expects that the reduced model also undergoes small changes. In what follows, we define a set of response formulae able to predict how the various terms in Eqs. 24-27 defining the parameterisation change as a result of such a perturbation. One needs to note that the presence of a small perturbation to the dynamics is usually interpreted as resulting from changes in the applied forcing or from changes in the value of some internal parameters. Alternatively, the small perturbation can be interpreted as caused by model error due to our incomplete knowledge of the system. We then consider the following system:
[TABLE]
where we have included on the right hand side of the evolution equations a (small) perturbation vector field, whose intensity is controlled by the bookkeeping parameter , while leaving the coupling unaltered with respect to the original system shown in Eqs. 20-21. In this case, the uncoupled model reads as
[TABLE]
The reduced model, following Eq. 23, can be written as:
[TABLE]
where the dependence on is implicit for all terms except the trivial one. We now wish to expand the terms , , and in powers of and retain the 0th and 1st terms. This will lead us to the response formulae for the reduced order model. In order to do so, we define the invariant measure of the dynamical system in Eq. 32, so that clearly , and take advantage of the results contained in Sect. II in order to compute the linear response of expectation values of observables and correlations to the perturbation proportional to . Let us first look at the deterministic term introduced in Eq. 24. We use Eqs. 4-5 to derive:
[TABLE]
where is given in Eq. 24, , and . Note that the correction term is proportional to so that, when we insert it in Eq. 33, it brings a contribution proportional to the product of the two perturbation parameters and .
When looking at the modifications of the stochastic term given in Eq. 25, we have that the -correction to the dynamics of the variables leads to a change in the correlation properties of the random process . We obtain that
[TABLE]
where we have that:
[TABLE]
with given in Eq. 26; using Eqs. 12-14 we have
[TABLE]
The previous formula shows that the changes in the correlation of the noise due to the -perturbation of the dynamics are non-trivial. In the limit of infinite time separation between the and the variables, such that the noise correlation is proportional to a Dirac delta in both the unperturbed and perturbed system, the correction above results in a change of the constant in front of the by a factor proportional to .
Finally, in order to construct the response formula for the term responsible for the non-Markovian part of the parameterisation, we need to evaluate the first order correction to the memory kernel , where
[TABLE]
By definition we have:
[TABLE]
and we wish to construct the following expansion:
[TABLE]
where is given in Eq. 28. On the r.h.s. of Eq. 39 the parameter appears, reading from left to right, in the Koopman operator of the variables, in the definition of the invariant measure, and in the Koopman operator of the variables, thus implying that the term proportional in Eq. 40 includes the sum of three separate corresponding contributions. The three terms are reported below in Eqs. 41, 42, and 43, respectively:
[TABLE]
It is interesting to note that the first contribution above in Eq. 41 is the only one involving the perturbation to the Liouville operator for the variables . Correspondingly, it leads to a memory term in the definition of the kernel, which makes the overall non-Markovian term of the parameterisation more cumbersome; compare with Eq. 38.
The results presented here, albeit admittedly convoluted, show how it is in principle possible to construct the response theory for a reduced order model resulting from the coarse graining of a higher dimensional system. In other terms, we find how one can construct a flexible parameterisation that can be explicitly adapted when the background system is altered, as a result of perturbations to the dynamics or taking into account the model error.
IV Summary and Conclusions
Response formulae are extremely useful tools for predicting how the properties of statistical mechanical systems change as a result of perturbations. In practice, such perturbations can result from changes in the forcing applied to the system or to the internal parameters. Mathematically solid response theories can be constructed both taking the point of view of chaotic deterministic dynamical systems - see e.g. Ruelle (2009); Liverani and Gouëzel (2006) - and of stochastic dynamical systems - see e.g. Hairer and Majda (2010). The deterministic point of view faces the difficulty of requiring relatively stringent conditions on the nature of the flow, while the stochastic point of view permits deriving the desired results under more general conditions. The unavoidable price we pay in this latter case is that we should be able to justify the nature of the noise we use in our mathematical construction. For any practical use, the deterministic and the stochastic formulations of the problem are virtually equivalent.
In this paper we have extended the usual results of linear response theory by computing how the $n$-point correlations at different times of general smooth observables of the system under investigation change as a result of adding a weak perturbation to the vector flow. The obtained response formulae entail exactly $n$ different terms. The first term results from the change in the invariant measure of the system, and is what one would guess from a naive use of response theory. The additional terms result from the linear correction to the Koopman operator of the system evaluated at all the consecutive intervals defining the ordering of the time variables in the argument of the correlation function. Such terms cannot be framed in any form similar to the FDT, as opposed to the first term. By taking advantage of the linearity of the Fourier transform, we are able to derive expressions describing how the spectral properties of the system are altered as a result of the presence of the perturbation. Formulae for second or higher order response to perturbations can also be obtained but are not presented here as they are rather complicated and do not add much for the scopes of this paper.
We have then applied the general findings above to a problem of specific interest in the theory of coarse graining of multi-scale dynamical systems. From a truncation of the Mori-Zwanzig projection operator we can derive a parameterisation of the neglected degrees of freedom such that the resulting invariant measure of the surrogate system is identical to the projected measure of the full system up to second order in the parameter controlling the intensity of the coupling between the degrees of freedom of interest and the ones we want to neglect Wouters and Lucarini (2012, 2013, 2016). One obtains that the parameterisation contains a deterministic component, a stochastic component, and a non-Markovian component, in agreement with the general theory of Chekroun et al. (2015a, b), and derives explicit expressions for the three terms. In this paper we have derived explicit expressions describing how the parameterisation changes as a result of a perturbation applied to the full system, or, in other terms, we have computed how the additional forcing projects in the reduced order model. Alternatively, one can see our results as a way to predict how the model error in the full system is translated as error in the reduced order model.
One has to note that all the terms in (34)-(43) are expectation values w.r.t. , the uncoupled measure. Therefore, if we have access to such statistics, it is possible not only to construct a reduced model, but also to adapt it to account for small perturbations. Our results thus provide a basis for constructing general parameterisations for reduced order models that can be modified in order to account for changes in the dynamics of the full system. We suggest that this might be of relevance for fields such as condensed matter, molecular dynamics, and geophysical fluid dynamics, where the construction of accurate, flexible, and adaptive coarse graining procedures is of the utmost relevance and urgency. In particular, in the case of geophysical fluid dynamics, our results might be useful for the construction of robust scale-aware parameterisations, i.e., parameterisations that can be automatically or easily adapted to a changing grid resolution of the numerical model, which determines which physical processes can be explicitly resolved.
We will delve into the problem of implementing these results in specific numerical models and testing their accuracy in future investigations.
The formulae presented provide an overarching framework for understanding how higher order statistical moments of the systems are impacted by changes in the dynamics, and appear to be of general interest. In previous papers we showed that the Ruelle response theory is a tool of practical utility for approaching the problem of predicting climate change Ragone et al. (2016); Lucarini et al. (2017). Among the many possible applications of the results presented in this paper, we would like to emphasise that the generalised response formulae introduced here allow for framing the question of how the climate variability responds to anthropogenic and natural forcings. This is a major and indeed open problem in the climate literature Ghil (2015) and we will try to approach it in future studies.
An application of possible interest in the area of statistical mechanics deals with the study of the equivalence of perturbed Hamiltonian systems that are allowed to reach a steady state thanks to the coupling with thermostats described by different microscopic dynamics. In Appendix A we have briefly described the motivations behind the introduction of thermostats in physical systems. The formulae presented here allow for computing explicitly the linear response of the correlations of the macroscopic physical variables of differently thermostatted perturbed Hamiltonian systems and then for checking whether an equivalence in the thermodynamic limit of such corrections exists, and, if so, how fast in terms of , thus extending the results of Evans and Morriss (2008) for the case of linear response for physical observables.
Acknowledgments
The results contained in this paper have been drafted during the Workshop "Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again" held on January 16-20, 2017 at the Banff International Research Station, Banff, Canada. We thank the organizers and the institution for having made this possible. The authors wish to thank Judith Berner, Tamas Bodai, Mickael Chekroun, Matteo Colangeli, Giovanni Gallavotti, Michael Ghil, Georg Gottwald, Cecile Penland, David Ruelle, Stephane Vannitsem, and Gabriele Vissio for many stimulating conversations on the topics discussed in this paper. VL wishes to thank the DFG TRR181 "Energy Transfers in the Atmosphere and the Ocean" for partial financial support. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement No. PIOF-GA-2013-626210.
Appendix A: Thermostatted Systems
A short note should be added in the case we are studying the response to perturbations of an -particle system described by a Hamiltonian where and , where are the canonical variables, is the mass of the particles, and is the internal potential describing the interaction between the particles. The unperturbed system obeys the following equations of motion, for :
[TABLE]
If we want to study the problem of deviations from equilibrium due to the application of an external (in general, non conservative) force acting on each particle, in order to keep physical well-posedness, we need to alter the vector flow as follows:
[TABLE]
where is a nontrivial friction coefficient describing the action of a thermostat Gallavotti (1997); Cohen and Rondoni (1998); Ruelle (2000) that avoids the long-term accumulation or depletion of energy in the system and allows for the set up of a well-defined steady state. We consider here the case of deterministic thermostats.
As an example, choosing , one obtains that the function is an invariant of the system given in Eq. 45. Using such thermostatted equations of motion and considering as perturbation flow in Eq. 3, where the perturbation affects only the evolution equations for the momentum variables, one recovers in Eq. 9 the correspondence between change in the phase space contraction rate and entropy production of the system mentioned above Cohen and Rondoni (1998). Neglecting the term responsible for the thermostatting, one instead derives from Eq. 9 the physically wrong result that the entropy production of an equilibrium system driven out of equilibrium by an external field vanishes.
Many functional forms can be given for , describing different ways of realising microscopically such long term balance. The equivalence of the thermostats means that in the thermodynamic limit the expectation values of macroscopic physical observables do not depend on the choice of , with differences between the results obtained using different thermostats typically going to zero as Gallavotti (1997); Cohen and Rondoni (1998); Ruelle (2000); Evans and Morriss (2008); Gallavotti (2014); Gallavotti and Lucarini (2014). This property persists also when the sensitivity of the system is considered: in the thermodynamic limit the linear response of observables to perturbations is also independent of the choice of Evans and Morriss (2008).
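How a deterministic thermostat keeps a chosen quantity invariant can be illustrated with a toy computation. The specific friction coefficient used in the text is not recoverable here, so the sketch below assumes one standard choice, the Gaussian isokinetic thermostat, for which $\alpha = (F \cdot p)/(p \cdot p)$ keeps the kinetic energy constant. It further assumes a single particle, no internal potential, and a constant external force; all numerical values are hypothetical.

```python
def thermostatted_rhs(p, force):
    """dp/dt = F - alpha * p with the Gaussian isokinetic choice alpha = (F . p) / (p . p)."""
    alpha = sum(f * q for f, q in zip(force, p)) / sum(q * q for q in p)
    return [f - alpha * q for f, q in zip(force, p)]

def rk4_step(p, force, dt):
    """One classical fourth-order Runge-Kutta step for the momentum equation."""
    k1 = thermostatted_rhs(p, force)
    k2 = thermostatted_rhs([q + 0.5 * dt * k for q, k in zip(p, k1)], force)
    k3 = thermostatted_rhs([q + 0.5 * dt * k for q, k in zip(p, k2)], force)
    k4 = thermostatted_rhs([q + dt * k for q, k in zip(p, k3)], force)
    return [q + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for q, a, b, c, d in zip(p, k1, k2, k3, k4)]

def kinetic_energy(p, mass=1.0):
    """Kinetic energy p . p / (2 m) of a single particle."""
    return sum(q * q for q in p) / (2.0 * mass)

momentum = [0.3, 1.0]          # hypothetical initial momentum of a single free particle
external_force = [1.0, 0.0]    # hypothetical constant external field
initial_energy = kinetic_energy(momentum)
for _ in range(2000):
    momentum = rk4_step(momentum, external_force, dt=0.005)
final_energy = kinetic_energy(momentum)
energy_drift = abs(final_energy - initial_energy)
```

In this toy model the external field rotates the momentum towards its own direction while the friction term removes exactly the power injected by the field, so the kinetic energy stays constant up to the integration error. Dropping the friction term would instead let the energy grow without bound, which is the accumulation that the thermostat is meant to prevent.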
References
- Abramov, R. V., and A. J. Majda (2007), "Blended response algorithms for linear fluctuation-dissipation for complex nonlinear dynamical systems," Nonlinearity 20 (12), 2793-2821.
- Arakawa, A., J.-H. Jung, and C.-M. Wu (2011), "Toward unification of the multiscale modeling of the atmosphere," Atmospheric Chemistry and Physics 11 (8), 3731-3742.
- Baiesi, M., and C. Maes (2013), "An update on the nonequilibrium linear response," New Journal of Physics 15 (1), 013004.
- Baladi, V. (2000), Positive Transfer Operators and Decay of Correlations (World Scientific, Singapore).
- Baladi, Viviane, Michael Benedicks, and Daniel Schnellmann (2014), "Linear response, or else," ICM Seoul 2014 talk.
- Baladi, Viviane, and Daniel Smania (2008), "Linear response formula for piecewise expanding unimodal maps," Nonlinearity 21 (4), 677.
- Baron, Riccardo, Daniel Trzesniak, Alex H. de Vries, Andreas Elsener, Siewert J. Marrink, and Wilfred F. van Gunsteren (2007), "Comparison of thermodynamic properties of coarse-grained and atomic-level simulation models," ChemPhysChem 8 (3), 452-461.
- Berner, J., et al. (2016), "Stochastic parameterization: Towards a new view of weather and climate models," Bulletin of the American Meteorological Society, http://dx.doi.org/10.1175/BAMS-D-15-00268.1.
