Drift-diffusion processes from elimination of fast variables under inhomogeneous conditions
Paul E. Lammert

TL;DR
This paper develops a unified method to eliminate fast variables in inhomogeneous environments, deriving effective drift-diffusion processes for particles, applicable to both passive and active motion, with detailed generator calculations.
Contribution
It introduces a straightforward approach to derive effective drift-diffusion models from complex systems with position-dependent velocities and fast variables, enhancing understanding of environmental effects on particle motion.
Findings
Unified framework for eliminating fast variables
Explicit drift-diffusion generators for various models
Insights into inhomogeneity-induced drift in passive and active particles
Abstract
The problem of eliminating fast-relaxing variables to obtain an effective drift-diffusion process in position is solved in a uniform and straightforward way for models with velocity a function jointly of position and fast variables. A more unified view is thereby obtained of the effect of environmental inhomgeneity on the motion of a diffusing particle, in particular, whether a drift is induced, covering both passive and active particles. Infinitesimal generators (equivalently, drift-diffusion fields) for the contracted processes are worked out in detail for several models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Drift-diffusion processes from elimination of fast variables
under inhomogeneous conditions
Paul E. Lammert
Department of Physics, Pennsylvania State University, University Park, PA 16802
(Nov. 1, 2019)
Abstract
The problem of eliminating fast-relaxing variables to obtain an effective drift-diffusion process in position is solved in a uniform and straightforward way for models with velocity a function jointly of position and fast variables. A more unified view is thereby obtained of the effect of environmental inhomgeneity on the motion of a diffusing particle, in particular, whether a drift is induced, covering both passive and active particles. Infinitesimal generators (equivalently, drift-diffusion fields) for the contracted processes are worked out in detail for several models.
1 Introduction
Thirty years ago, van Kampenvan Kampen (1988) investigated the effects of environmental inhomogeneity in models of random motion of passive particles. In general, the problem continues to attract interestChristensen and Pedersen (2003); Bringuier (2009); Yang and Ripoll (2013); Livshits (2016); Bhattacharyay (2019); Baldovin et al. (2019); Abdoli et al. (2019), and has been especially revived for active matterHong et al. (2007); Butler et al. (2015); Byun et al. (2017); Zhao et al. (2018); Popescu et al. (2018). Cates and TailleurCates and Tailleur (2013) and Vuijk et aliaVuijk et al. (2019), in particular, studied variants of a model to be taken up here. Consider, for instance, a micron-sized particle propelling itself via chemical reactions using fuel from the environment, at a speed increasing with fuel concentration. Random torques cause its direction of motion to wander (time scale sec), so that its motion is diffusive on longer time scales. Will a fuel concentration gradient cause drift? If so, the mechanism would seem to be rather different than for a passive particle.
This paper presents a unified perspective for velocity models — models for which velocity is given as a function of continuous position and fast-relaxing variables. I show how to eliminate the fast variables to obtain the infinitesimal generator for a positional drift-diffusion in the markovian limit by a uniform and natural method. Drift is just a part of this contraction problem. Although the infinitesimal generator is interconvertible with a Fokker-Planck-type equation, being a dual, or adjoint formulation, the method given here seems significantly simpler than methods which work with a probability density from the beginning. Significant generalizations of both the active particle model mentioned above and the Langevin-Kramers model of a brownian particle discussed by van Kampen are treated in detail, together with a coupling of them. All those belong to the class of quasilinear velocity models, for which practical formulas for drift-diffusion fields are given, requiring a handful of tensor fields on position space expressing simple properties of the original velocity model and its fast equilbria. Intuition suggests that, all other things being equal, a passive brownian particle drifts toward lower velocity damping while the simple active particle drifts toward higher fuel concentration. Suitably embellished, these are the only two general mechanisms of gradient-induced drift for quasilinear models, and the same is arguably roughly true for general velocity models.
Zoom out now to situate that contraction problem in a broader idea of coarse-graining. Drift-diffusions, characterized by drift () and diffusion () coefficients, and markovian random processes with continuous sample paths, are essentially the same thing. Therefore, to make a drift-diffusion approximation (of a simple random walk, for an elementary example) is to approximate by a markovian process with continuous sample paths. If the process as given lacks markovianity, it may display it approximately over longer time scales — a temporal coarse-graining is called for. The processes treated in this paper have a velocity determined jointly by position and some fast variables, that is, with quickly decaying correlations. They belong to this first general category. If the process as given has discontinuous sample paths (a so-called hopping modelvan Kampen (1988, 1992); Christensen and Pedersen (2003); Smith et al. (2017), for instance) it might be approximable as continuous after a spatial coarse-graining. Those are outside the purview of this paper. A seeming third category comprises discrete time markovian processes, such as the archetypal random walk. However, viewed as time-homogeneous continuous-time processes, as they must be to prepare for a drift-diffusion approximation, they are nonmarkovian. Therefore, they too conform to the rule of temporal/spatial coarse-graining to obtain markovianity/continuous paths.
Here is a content guide. Section 2 reviews the aforementioned equivalence of drift-diffusions with markovian with continuous paths, and the observable-centered (“Heisenberg picture”) semigroup formalism we will use. Section 3 solves the contraction problem — elimination of fast variables — for a general velocity model, leading to expressions for the infinitesimal generator of the contracted process in terms of time correlation functions in fast-equilibrium. [See Eqs. (15), (16), (17)]. Section 4 elaborates the Langevin-Kramers model, to one with arbitary tensor parameters, and a simple model of an active particle, to one with fully anisotropic rotational diffusion characteristics, as well as a coupling of the two. The elaborated models are fit into the framework of (quasilinear) models describable in terms of intrinsic drift (), decay rate (), velocity transformation (), and kinetic (), tensor fields. A general conclusion here is that drift which is not already implied by fast equilibrium is due to derivatives of either or . Section 5 makes contact with the more common Fokker-Planck perspective, discussing the relationship of with drift and diffusive currents is discussed, as well as position-conditional mean instantaneous velocity.
2 Background: Feller processes and semigroups on manifolds
This Section is a quick review of some standard loreGardiner (1985); Kallenberg (1997), with two principal objectives. First, to introduce the semigroup formulation which will be used subsequently. This is a description which focuses on the evolution of observables rather than a probability density. It stands in relation to what might loosely be called a Fokker-Planck-type formulation as the Heisenberg picture in quantum mechanics stands to the Schrödinger picture. Secondly, and concurrently, to recall the equivalence between Markov processes with continuous sample paths and drift-diffusions.
For a Markov process on a manifold , and with denoting the state of the system (a point in ) at time , operators are defined via
[TABLE]
this being the expectation of assuming initial state . The markovian no-memory property is expressed by the Chapman-Kolmogorov equation (), which can also be read as saying that comprises a semigroup. A Feller semigroup is one which maps (continuous functions tending to zero at infinity) into itself and is strongly continuous with respect to supremum norm, and its (infinitesimal) generator is defined by , with domain the subspace of on which the limit exists. The generator and semigroup are related by , solution to which is formally written as . Assuming as well that (so far, all is all consistent with a jump process) and that all sample paths are continuous, is guaranteed to be a second-order differential operator. In a coordinate chart ,
[TABLE]
where
[TABLE]
and equals one on some neighborhood of . Here and throughout, summation convention is followed for matching upper and lower indices. Eqs. (2) provide the interpretation of . Exactness of the second-order form goes back to a 1931 paper of KolmogorovKolmogoroff (1931). Now, while is covariant, is not. This has the slightly confusing consequence that transforms as a twice-contravariant tensor, but the drift has an anomalous term in its coordinate-transformation rule:
[TABLE]
This curious fact indicates that should be thought of as a unit, not two fully distinct things.
In the converse direction to the conclusion represented by (2), suppose given a differential operator as generator of a semigroup . If bears interpretation as the semigroup associated with a stochastic process on — a probability measure or coherent system of starting-point-indexed probability measures on paths — then the process is perforce markovian (Chapman-Kolmogorov equation) and has continuous sample paths since whenever vanishes on any neighborhood of . Further, as already seen, must in that case be second order.
The semigroup formulation is convenient for calculating multi-time correlation functions. For example, with initial measure and ,
[TABLE]
In the final expression, time ordering must be observed, and acts on everything to its right (i.e., ).
A general Markov process on can be thought of as a continuous motion occasionally interupted by unanticipated jumps. Later, we will discuss adding jumps to our prototype models, but will do no concrete calculations, hence do not introduce any notation for them.
3 Contraction of velocity models
This section constructs the contraction of a general velocity model, with special attention to the markovian limit. The generator , equivalently the drift-diffusion field of the contracted process, are expressed in terms of time correlation functions in the fast equilibrium. As concrete examples to keep in mind, simple prototypes will be described first.
3.1 Prototypes
We introduce some very simple concrete models as examples. They live on product spaces , where the first factor is for position and the second for some internal, fast degrees of freedom.
: is velocity space and the generator is
[TABLE]
where is the -dimensional euclidean laplacian. This is nothing but the Langevin-Kramers model for a particle in a thermal bath, except that there is no assumption that anything is thermal, so the fluctuation-dissipation link is broken and and are functions of (as emphasized in the notation). The drift in Model was investigated already decades agoErmak and McCammon (1978); van Kampen (1988).
: , the unit sphere in , so a point in can be thought of as a unit vector in . has infinitesimal generator
[TABLE]
where is the Laplace-Beltrami operator on the sphere. As motivation, consider that this can model a self-powered colloidal particle which propels itself by consuming fuel in its environment. The concentration of fuel, hence its speed varies with the position-dependent concentration. At the same time, its direction of motion is subject to Brownian wandering represented by the first term of . Variants of this model were investigated relatively recentlyCates and Tailleur (2013); Vuijk et al. (2019).
In Section 4 these models will be significantly elaborated. But the simple examples are a good thing to keep in mind during the following abstract development.
3.2 Slow and fast degrees of freedom
The general and natural geometrical setting for our investigation is as follows. is a manifold, the configuration, or base, manifold, representing slow degrees of freedom of the system — position in the concrete models. is a fiber bundle over with typical fiber . represents fast degrees of freedom. Conventionally, coordinates or points in the fibers will be denoted by . In the case of models and , is just a product bundle, with . Several motivations can be adduced for working in the manifold setting. Certainly there are interesting cases of particles moving in noneuclidean geometries, in a lipid bilayer membrane, for example. Fast variables, such as orientational variables, are perhaps even more likely to inhabit noneuclidean spaces. Model of this paper is an example. Also, since the questions at issue are local in position, the mathematical overhead involved is fairly light, while nonuniform parameters are very natural in arbitary coordinates.
What determines the rate of change of position is the velocity, of course. may simply be velocity, as for Model . In any case, the velocity is a function of the fast and slow coordinates. Thus, is a bundle map. The basic assumption made about the fundamental Markov process is that it have a generator of the form
[TABLE]
where generates a fiber-preserving semigroup (the state never moves out of the fiber it started in) in each fiber. The structure of (8) explains the name “velocity model”. In addition, the fast processes are assumed to attain equilbrium on the time scale :
[TABLE]
That is, represents a fiber-by-fiber equilibrium depending on base position (in ).
The smaller compared to our observation time scale for the contracted process, the closer the latter is to markovian. In order to access the markovian limit in a formal way, we scale up by a large factor, scaling down the time required to reach fast equilibrium. That alone would have the effect of rendering all fluctuations negligible, so a rescaling of the velocity is also needed. With temporary “old/new” labels for clarity, we take
[TABLE]
For infinitesimal , the contracted process is markovian, and therefore, according to Section 2, fully characterised by drift-diffusion fields on . An overbar on a symbol indicates that it pertains to a contracted process.
3.3 Contracted semigroup
The fast generator generates a fast semigroup and drives the system toward the fast equilibrium . These are all black boxes to us. In such generality, there are very few options of how to proceed. Take the clutching term
[TABLE]
from , and put it into the Duhamel-Dyson formula as a perturbation:
[TABLE]
So far, this resembles perturbation expansions for evolution operators as they occur in many fields, quantum mechanics notably. However, we wish to apply this to functions of position (in ) alone, thereby constructing a contracted semigroup with generator . To that end, examine the structure of the th term of the expansion:
[TABLE]
The integrand is a string of operators, and , each of which potentially acts on everything to its right. If , that action is effectively the same as taking the fast-equilibrium value. Thus, the integrand reduces to a product of clusters of the form . The entire perturbation expansion thus reduces, formally, to a sum over all (ordered) products of clusters as the cluster positions range over the interval . That is, , with
[TABLE]
Insofar as we are interested in the markovian limit, however, this is overkill. Each factor of carries a factor in the , and each integration brings another factor . Formally, then, , and, for , . Therefore, in the limit , we obtain the markovian generator
[TABLE]
using the abbreviation
[TABLE]
(The subtraction of the first on the second line is allowed since what it multiplies has zero fast expectation value.) Henceforth, we are interested only in this markovian limit and therefore drop the subscript.
Now, recalling once again that the derivatives in (15) act on everything to the right and matching against the general form (2), the drift-diffusion fields of the contracted process in the markovian limit are revealed:
[TABLE]
and
[TABLE]
where ‘Sym’ indicates symmetrization on the tensor indices. Verification that these formulas conform to the coordinate-transformation law (4) is straightforward.
4 Prototypes elaborated
This Section explores various ways to elaborate the prototype models and . Rather than doing so in an unconstrained fashion, the aim is to preserve a simplicity of the relevant correlation functions. Thus, in preparation we define a restricted class of velocity models with a convenient and fairly intuitive set of parameters: intrinsic drift (), decay (), velocity transformation (), and kinetic () tensors. Definitions follow.
4.1 Quasilinear models
quasilinear models are defined by two conditions.
(a) The velocity function has a factorized form:
[TABLE]
(b) decays exponentially, on average:
[TABLE]
where is a position-dependent linear operator with spectrum in the right complex half-plane .
Condition (b) implies that
[TABLE]
The calculations for and under these assumptions are similar, the latter being slightly easier. For (16), we have
[TABLE]
Now, according to the factorization condition (a), here is oblivious to , so we can rearrange factors and put it before the derivative. The result will be correct in any specific coordinate system. (Vide infra for comment about transformation properties.) With the definition
[TABLE]
we obtain
[TABLE]
The expression for the generator corresponding to this pair is actually hardly less compact than either. It is
[TABLE]
Here, the alternative notation for derivatives is meant to emphasize that is an operator that operates on everything to its right, not just the parenthesized expression immediately following. Written in terms of fast-equilibrium correlations, this is
[TABLE]
This expression is valid with only condition (b) imposed, but that is not the reason it is written here. (26) exhibits general coordinate covariance, whereas (25) does not. The reason is that, in the latter, a factor was moved in front of a derivative so that we could write things in terms of . itself is not position-dependent, but with a change of coordinates, it will generate a jacobian factor which may be, but which is invisible in (25).
Before moving on to concrete models, a few observations are in order about the significance of (23) and (24). (i) Apart from , there are two basic mechanisms to produce drift in a quasilinear model: gradients of either the decay tensor or the velocity scaling tensor . These two are precisely the mechanisms whereby drift is generated in the prototypes and , respectively. (ii) In contrast to the case of a generic velocity mdoel, the quasilinear parametrization allows a precise statement of that sort. Still, it seems reasonable to say that something roughly like that holds in general. (iii) Derivatives of do not enter. Therefore, we cannot say that inhomogeneity generically induces drift. This is not in conflict with the existence of thermophoresisPiazza and Parola (2008). Temperature-dependent free energy of interaction with the surrounding medium can contribute to . (iv) The only information required directly about the fast equilibria is that encoded in and , which is to say, not very much.
4.2 Model
Now, we briefly investigate what sorts of physically plausible elaborations can be made of Model (later, ) from Section 3.1 without spoiling the quasilinearity conditions (18) and (19). The first, most obvious way to modify Model is simply to make all the scalar coefficients tensors. Thus, Model A is defined by the fast generator (, , and are functions of alone)
[TABLE]
Here, has the interpretation of some kind of external force. is allowed to be anisotropic and may have partially rotational character, and is symmetric and positive, but otherwise unconstrained. With regard to the anisotropies, this is not written down with any specific physical system in mind, but one may imagine motion in an anisotropic medium, e.g. nematic liquid crystal, a magnetic field, and random excitation coming from an anisotropic nonthermal source such as ultrasonic pulses.
Three of the quasilinear parameters follow immediately:
[TABLE]
Finally, demanding gives the equation
[TABLE]
which determines implicitly, and has a unique solution given the spectral restrictions on . In fact, writing and as bilinear forms, (29) says . The solution is , which is manifestly symmetric and positive, since is.
Multiplying (29) by and from left and right, respectively, yields
[TABLE]
bypassing . Unfortunately, is still needed for , and no tidy formula appears available in general. Three special cases will be considered.
Model : is proportional to the identity (subscript ‘’ for ‘scalar’). Then, the foregoing quickly leads to
[TABLE]
We might try to further specialize by supposing is scalar. That makes no sense in general, but if has a riemannian metric , we may suppose and proportional to . Write , so ; this can be viewed as a definition of . Essentially, this is Model with an external force. The result here is easily seen to agree with that obtained by van Kampen in Section 4 of Ref. van Kampen, 1988 after notational translation.
Model : is equipped with a riemannian metric and . The interpretation of this is that fast-equilibrium is thermal equilibrium at position-dependent temperature , and relies on something outside the model itself for its justification. Thus, the formula for is given by fiat, and implies the restriction
[TABLE]
where one index of is raised with in the usual way. From this, we easily obtain
[TABLE]
Model : has a full complement of smoothly varying position-dependent (complex!) eigenvectors labelled by :
[TABLE]
With projection operators defined as
[TABLE]
by use of the dual basis to the basis ,
[TABLE]
we obtain
[TABLE]
Now, we can proceed to write everything else in terms of the eigenvalues and projections . The results are not particularly transparent, but from (40) below, one at least gets some sense of how derivatives of both the eigenvalues and the eigendirections contribute to the drift. The kinetic tensor is
[TABLE]
the contracted diffusion tensor is
[TABLE]
and the contracted drift is
[TABLE]
Consider briefly some additional elaborations that would preserve quasilinearity without writing formulas for the parameters. First, can be made velocity dependent. Secondly, a variety of jumps can be included — a mean zero jump distribution would contribute to , while jumps with probability density depending on a scattering angle and fractional decrease of would contribute to .
4.3 Model
The elaboration of Model is perhaps more interesting. As for Model , we assume here that is a riemannian manifold. The prototype, , was defined on the sphere bundle ; was just a unit vector in that case. Implicitly, the particle was supposed to have only a single distinguished axis, that along which it propels itself. The full Model will be defined on the associated orthonormal frame bundle; will be an entire orthonormal frame keeping track of all the orientational degrees of freedom. Consider a particle shaped like a brick. This is easy to talk about and visualize, but an ellipsoid, or even more complicated shape with appropriate reflection symmetries, will have the same type of rotational characteristics. A brick has three principal axes for its hydrodynamic properties with, as a result, distinct rotational diffusion coefficients about each of them. A dynamically inert dot painted on one of each pair of opposite faces of the brick allows unambiguous definition of an oriented axis and hence the association of a distinct orthonormal frame with each particle configuration. In the -dimensional case, the body-frame plane has its own distinct orientational diffusion coefficient (, diagonal entries zero).
This picture is intuitively simple. However, some notation needs to be set up to write down formulas. Given a reference frame , each body frame can be identified with the rotation matrix in which brings the reference frame into coincidence with it. That is, is identified with the frame , where is the th column of with components . Now, let denote the rotation by angle in the reference plane, and define the operators for via
[TABLE]
This differentiates with respect to rotation about the local, configuration-dependent, plane for each . For computational purposes, it is useful to note that is a derivation: ; and
[TABLE]
where is a matrix with component equal to , component equal to and all other components zero.
Finally, we can write down the generator for fully anisotropic orientational diffusion:
[TABLE]
To finish the specification of the model, assume that the velocity is along in the reference configuration, so that its direction in configuration is , and the velocity map is
[TABLE]
It may seem that there is no possible physical interpretation for nonscalar , but it will be put to good use in Section 4.4.
For Model , the fast process induced on the tangent space via the velocity map is not generally markovian. To see that, suppose that is much larger than all other orientational diffusion coefficients. And, suppose that we have observations of at two times with a delay of order . Then, we have a good idea of where the local -axis is and can expect the velocity to keep moving approximately in the - plane for the near future. Nevertheless, the induced process is quasilinear, as follows from a straightforward calculation yieldng
[TABLE]
with the definition
[TABLE]
In case for , the motion of the orientation is ergodic, other rotational degrees of freedom are decoupled, and the model effectively reduces to the prototype .
Along with , given a priori, the quasilinear parameters are then
[TABLE]
Substitution into the quasilinear formula (25) yields
[TABLE]
The formula for follows from , or more strongly from equidistribution of over in fast equilibrium, which the reader may consider evident enough. For a more formal derivation, consider . This implies that is independent of (each of a set of expectations is a weighted average of the others). Two appeals to orthogonality of after summing, first over and then over , show that .
Due to the interest for run-and-tumble particlesCates and Tailleur (2013), addition of jumps to the model are now briefly considered. In keeping with the operative symmetries, the jump rate measure (rate of transitions ) is required to have a well-defined form with respect to the body frame, and to be invariant under reflection through the hyperplane normal to the body axes , :
[TABLE]
With denoting the total jump rate, the associated contribution to the generator is defined via
[TABLE]
In particular, using reflection symmetry in the second equality,
[TABLE]
where . For scattering which sends uniformly over the sphere or pure reversal, or , respectively. In the markovian limit, therefore, jumps simply make an additive contribution to the effective diffusion coefficient (and thence to ). is already an effective quantity insofar as the various are not necessarily equal. With jumps, it should be replaced by
[TABLE]
4.4 A two-stage contraction
Now, consider combining two velocity models, 1 and 2. A trivial way to do this is to define a (sum) velocity model via , where the fast parts of models 1 and 2 are independent. Then the sum model is not quasilinear, but that is not a problem, because each of models 1 and 2 makes its own separate contribution to and (i.e., we should add and ).
If models 1 and 2 have very different relaxation times, however, a more interesting possibility — two-stage contraction — emerges. Let us consider an example constructed from Models and . It will be essentially the same model as treated by Vuijk et aliaVuijk et al. (2019), except that we allow inhomogeneous and . Following that source, the model is motivated as follows. Consider an active colloidal particle (m) a self-generated thrust in the direction moving in a liquid. Ordinarily, the velocity would simply be some scalar multiple of that, in fact , where is the viscous relaxation time for velocity, which, for a micron-sized particle in aqueous medium, is of order 1 s. Really, we should understand that velocity as the output of a Langevin-Kramers model with the external driving force being the particle’s self-generated thrust. But, what if the damping tensor is not a simple scalar? The scenario envisaged by Vuijk et alia (loc. cit.) is a charged particle in a magnetic field, but we do not commit to that and allow general .
Since the relaxation time of the Model velocity in this scenario is of order times that of the thrust direction, we first eliminate the Model variables. According to Eq. (33) (), that gives
[TABLE]
The second term does not depend on so we set it aside temporarily. The coefficient in the first term is now taken as the velocity function of Model : , equivalently, , where both and are position-dependent. Inserting this data into (47) immediately yields
[TABLE]
Adding this to the set-aside piece of gives the fully contracted generator
[TABLE]
Note that all parameters here (, , , , and ) are potentially position-dependent. In the case of 3-dimensional euclidean space with uniform , , and , this agrees with Eq. (12) of Ref. Vuijk et al., 2019, where it was obtained by very different methods. (Note: here corresponds to of Vuijk et alia.)
5 Contacting the Fokker-Planck perspective
This section makes contact with the Fokker-Planck perspective. Section 5.1 takes a look at the current and asks what, if anything, the approach taken here says about splitting it into drift and diffusion parts. Section 5.2 returns to the methods of Section 3 to derive the conditional (on position) mean instantaneous velocity. Throughout, is assumed to be equipped with a riemannian metric . That allows the definition of a scalar probability density with respect to volume measure . Thus, is independent of coordinates.
5.1 Fokker-Planck equation and currents
This subsection considers a drift-diffusion which may not have come from contraction, so the overbars on have been dropped.
Via integration by parts, the time evolution of the expectation of test function is found to be
[TABLE]
This is simply an implicit way of writing the the Fokker-Planck equation . Here,
[TABLE]
is the probability current density with the unit vector field along the th coordinate direction and denotes the covariant divergence of the contravariant vector field . A possibly more aesthetically pleasing way to express the current density is implicitly via
[TABLE]
where denotes a covariant derivative and is a Christoffel symbol. Since the anomalous terms in the transformations of and cancel, the object in square brackets is a genuine covariant vector field.
The expression for the current is trivially rearranged as
[TABLE]
The literature seems to show a preference for rearranging the right side of (53) as (55) and identifying the first term as the total drift, with being an “extra drift”van Kampen (1988). I argue now against that identification, recognizing that, insofar as only the total current is physically relevant, it is a relatively low-stakes issue. Let us move to euclidean space to keep matters simple, and visualize the density as an ensemble of noninteracting particles. “Drift” ought to refer to a propensity of any one of these particles to move in a particular direction. According to the defintion (2), provides us with an unbiased estimate of where a randomly selected member of the ensemble will go, while quantifies the uncertainty. Furthermore, there is nothing strange about a diffusive current arising from a gradient of with uniform . Consider the familiar, elementary explanation of ordinary diffusive current: a plane separates a low-density region on the left from a high-density region on the right. Particles move randomly away from wherever they are. More particles randomly move across from right-to-left than vice versa, hence a diffusive current. If now we have equal densities, but more vigorous random motion on the right (larger ), a right-to-left current will again arise. Thus, it is perfectly proper to consider a contribution to diffusive current. All this may have a whiff of “Itô versus Stratonovich”van Kampen (1981) about it. However, since the approach taken here has neither the need for an interpretational rule that a Langevin equation does, not the complication of having a general probability distribution from the beginning, it is difficult to see any interpretational wiggle room.
5.2 Conditional mean instantaneous velocity
Now suppose a velocity model with contracted probability density which is well-aged (i.e., has been freely evolving for many fast-relaxation times ). What is the joint distribution of position and velocity? Refs. Vuijk et al., 2019; Cates and Tailleur, 2013 used a direction calculation of that joint distribution to find Fokker-Planck equations for Model variants. The method of Section 3 avoided it, but it is nevertheless a question of independent interest. One’s first thought might be that the joint distribution takes the factorized form
[TABLE]
where denotes the fast-equilibrium distribution at . Actually, the distribution over is slightly skewed from this, even in the markovian limit. We will not work out the full distribution, but only the first moment . Since is a distribution only over , this requires some explanation. In accordance with the preceding discussion, the expectation with respect to of a function of both and should be read as
[TABLE]
while for a function on , is the corresponding distribution conditioned on position being . The operator in (57) means that the velocity has had a chance to equilibrate with the “environment” described by the position distribution . A little reflection should convince one that the current density is related to the instantaneous velocity expectation via
[TABLE]
(For one-stage contraction; two stages is a bit more complicated.) The only real question is whether the framework of Section 3 is able to deliver this conclusion. In order to calculate the first moment of the conditional mean instantaneous velocity, it will be enough to compute
[TABLE]
for an arbitrary test function of position only. Now, recall (12)
[TABLE]
is oblivious to , so the first term gives a contribution to (59). The next term is more interesting. Since is infinitely large compared to the fast relaxation time, we obtain (with an integration-by-parts in the final step)
[TABLE]
Since is an arbitrary test function, we conclude that
[TABLE]
Comparing this to Eq. (53) for the probability current density, one see that (58) is indeed verified.
All this prompts one to ask which kind of measurement would be expected to yield the drift velocity , which the instantaneous velocity . The calculations used here to find them both involved evolving the system for a time , however, that evolution served very different purposes in the two cases. Suppose the system is started (or found) at . Then, is the expected average velocity of the system over the subsequent time . If now, the system has a probability density (perhaps we have calculated this for a time via the Fokker-Planck equation from an initial point mass at ), then our expectation of the instantaneous velocity of the system, regardless of its actual position, is . “Instantaneous” here means that the measurement is done over a time small (or at worst, comparable to) .
6 Conclusion
By means of a “Heisenberg picture” approach, elimination of fast variables was carried out in general for velocity models, resulting in a general expression for the infinitesimal generator , equivalently, the drift-diffusion field , of the contracted drift-diffusion process on position space in terms of time correlation functions in fast-equilibrium. Compared to Chapman-Enskog type expansions, which work in the “Schrödinger picture”, the method seems to be simpler and more automatically general. Information about the distribution of fast variables, which comes naturally with Chapman-Enskog expansions, is also available in the current approach, if desired (Sec. 5.2). Examination of the subclass of quasilinear models showed that drift, other than that already present in fast-equilbrium, traces to derivatives of either velocity decay () or the dependence of velocity on fast variables (). The first of these is exemplified by the humble brownian particle in a viscosity gradient, and the second by the active particle in a fuel concentration gradient.
Acknowledgements.
I thank Prof. Vincent H. Crespi for critical reading and suggestions. This work was funded by the Penn State MRSEC, Center for Nanoscale Science, under award National Science Foundation DMR-1420620.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1van Kampen (1988) N. G. van Kampen, Journal of Physics and Chemistry of Solids 49 , 673 (1988) . · doi ↗
- 2Christensen and Pedersen (2003) M. Christensen and J. B. Pedersen, Journal of Chemical Physics 119 , 5171 (2003) . · doi ↗
- 3Bringuier (2009) E. Bringuier, Physica A-Statistical Mechanics and its Applications 388 , 2588 (2009) . · doi ↗
- 4Yang and Ripoll (2013) M. Yang and M. Ripoll, Physical Review E 87 , 062110 (2013), 10.1103/Phys Rev E.87.062110 . · doi ↗
- 5Livshits (2016) A. I. Livshits, Physics Letters A 380 , 1891 (2016) . · doi ↗
- 6Bhattacharyay (2019) A. Bhattacharyay, Physica A-Statistical Mechanics and its Applications 515 , 665 (2019) . · doi ↗
- 7Baldovin et al. (2019) M. Baldovin, A. Vulpiani, A. Puglisi, and A. Prados, Physical Review E 99 , 060101(R) (2019), 10.1103/Phys Rev E.99.060101 . · doi ↗
- 8Abdoli et al. (2019) I. Abdoli, H. D. Vuijk, J.-U. Sommer, J. M. Brader, and A. Sharma, ar Xiv e-prints (2019), ar Xiv:1908.03101 [cond-mat.stat-mech] .
