Efficiency of one-dimensional active transport conditioned on motility

Francesco Cagnetta; Emil Mallmin

arXiv:1907.01383·cond-mat.stat-mech·March 4, 2020

Efficiency of one-dimensional active transport conditioned on motility

Francesco Cagnetta, Emil Mallmin

PDF

TL;DR

This paper explores how conditioning active particles on their motility alters their interactions and energy efficiency, revealing potential for topological interactions and efficiency gains in simplified models.

Contribution

It introduces a formal framework for analyzing conditioned active matter, deriving emergent interactions and efficiency effects in toy models like TASEP and run-and-tumble particles.

Findings

01

Run-and-tumble particles develop alignment interactions upon conditioning.

02

Conditioning can significantly increase energy efficiency, especially with large mobility fluctuations.

03

Emergent interactions may be topological rather than metric, indicating a screening effect.

Abstract

By conditioning a stochastic process on the value of an observable, one obtains a new stochastic process with different properties. We apply this idea in the context of active matter, and condition interacting self-propelled particles on their individual motility. Using the effective process formalism from dynamical large deviations theory, we derive the interactions that actuate the imposed mobility against jamming interactions in two toy models---the totally asymmetric exclusion process and run-and-tumble particles, \emil{in the case of two or three particles}. We provide a framework which takes into account the energy-consumption required for self-propulsion, and address the question of how energy-efficient the emergent interactions are. Upon conditioning, run-and-tumble particles develop an alignment interaction and achieve a higher gain in efficiency than TASEP particles. A point…

Equations26

P (Γ ∣ O) = \frac{P ( Γ and O )}{P ( O )} .

P (Γ ∣ O) = \frac{P ( Γ and O )}{P ( O )} .

W_{C^{'}, C} = W (C \to C^{'}) - δ_{C^{'}, C} C^{''} \neq = C \sum W (C \to C^{''}),

W_{C^{'}, C} = W (C \to C^{'}) - δ_{C^{'}, C} C^{''} \neq = C \sum W (C \to C^{''}),

P (N_{t}) ≍ e^{- t I (N_{t} / t)}, I (σ) = s \in R sup {s σ - c (s)} .

P (N_{t}) ≍ e^{- t I (N_{t} / t)}, I (σ) = s \in R sup {s σ - c (s)} .

\frac{W ^{eff} ( C \to C ^{'} , s )}{W ( C \to C ^{'} )} = \frac{ℓ ( C ^{'} , s )}{ℓ ( C , s )} exp {s α (C \to C^{'})},

\frac{W ^{eff} ( C \to C ^{'} , s )}{W ( C \to C ^{'} )} = \frac{ℓ ( C ^{'} , s )}{ℓ ( C , s )} exp {s α (C \to C^{'})},

V (C, s) \equiv - lo g ℓ (C, s) .

V (C, s) \equiv - lo g ℓ (C, s) .

I (σ) = s \in R^{N} sup {(i = 1 \sum N s_{i}) σ - c (s_{1}, \dots, s_{N})},

I (σ) = s \in R^{N} sup {(i = 1 \sum N s_{i}) σ - c (s_{1}, \dots, s_{N})},

I (σ) = s \in R sup {N s σ - c (s)},

I (σ) = s \in R sup {N s σ - c (s)},

η_{naive}^{TASEP} = \frac{1 - N / L}{1 - 1/ L} .

η_{naive}^{TASEP} = \frac{1 - N / L}{1 - 1/ L} .

η_{eff}^{TASEP} (σ) \equiv \frac{σ}{e ^{s (σ)}},

η_{eff}^{TASEP} (σ) \equiv \frac{σ}{e ^{s (σ)}},

η_{naive}^{RTP} (γ) ≃ \frac{1 + γ / L}{1 + 2 γ / L} (N = 2) .

η_{naive}^{RTP} (γ) ≃ \frac{1 + γ / L}{1 + 2 γ / L} (N = 2) .

η_{eff}^{RTP} (σ, γ) = \frac{σ}{γ} e^{- s_{γ} (σ)},

η_{eff}^{RTP} (σ, γ) = \frac{σ}{γ} e^{- s_{γ} (σ)},

η^{'} (\overset{σ}{ˉ}) = \frac{1}{γ} 1 - \frac{σ ˉ}{N ( σ ^{2} - σ ˉ ^{2} )},

η^{'} (\overset{σ}{ˉ}) = \frac{1}{γ} 1 - \frac{σ ˉ}{N ( σ ^{2} - σ ˉ ^{2} )},

γ^{*} = γ_{R} arg max {s_{γ_{R}}^{- 1} (lo g \frac{γ}{γ _{R}})} .

γ^{*} = γ_{R} arg max {s_{γ_{R}}^{- 1} (lo g \frac{γ}{γ _{R}})} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Efficiency of one-dimensional active transport conditioned on motility

F. Cagnetta*, E. Mallmin*

SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom

Abstract

By conditioning a stochastic process on the value of an observable, one obtains a new stochastic process with different properties. We apply this idea in the context of active matter, and condition interacting self-propelled particles on their individual motility. Using the effective process formalism from dynamical large deviations theory, we derive the interactions that actuate the imposed mobility against jamming interactions in two toy models—the totally asymmetric exclusion process and run-and-tumble particles, in the case of two or three particles. We provide a framework which takes into account the energy-consumption required for self-propulsion, and address the question of how energy-efficient the emergent interactions are. Upon conditioning, run-and-tumble particles develop an alignment interaction and achieve a higher gain in efficiency than TASEP particles. A point of diminishing returns in efficiency is reached beyond a certain level of conditioning. With recourse to a general formula for the change in energy efficiency upon conditioning, we conclude that the most significant gains occur when there are large fluctuations in mobility to exploit. From a detailed comparison of the emergent effective interaction in a two- versus a three-body scenario, we discover evidence of a screening effect which suggests that conditioning can produce topological rather than metric interactions.

I Introduction

How should a single biological entity—a macromolecule, a cell, an organism—act to efficiently fulfil its functions in the presence of restrictive collective effects? This question inverts the usual aim of active matter theories, which is to derive the ‘macroscopic’ consequences of a postulated ‘microscopic’ dynamics where fluxes and forces are generated by consuming energy Ramaswamy (2010); Vicsek and Zafeiris (2012). In populations of self-propelled particles, for instance, the efficiency with which the particles convert energy into motion is reduced by collisions Helbing (2001). However, in biological systems equipped with sensing and feedback mechanisms between constituents, we expect well-adapted, or smart, interactions to reduce inefficient behaviour like jamming. This suggests that smart interactions of active systems, such as an alignment rule à la Vicsek Vicsek et al. (1995), might emerge as solutions to physically motivated optimization problems Cavagna et al. (2014); Nemoto et al. (2019); Tociu et al. (2019).

The idea central to the present work is that smart interactions such as collision avoidance and alignment, can be obtained by conditioning an active matter model on high values of individual motility. This generalizes to the field of active matter an idea due to R. M. L. Evans Evans (2004, 2005), of deriving driven nonequilibrium models with a specific steady-state current from a subensemble of atypical trajectories of an equilibrium process. We summarize below how conditioning in this way has been made operational using modern mathematical tools (I.1) and how, in this work, we apply it to simple few-particle active matter models in one dimension (I.2).

I.1 Model-making by conditioning

To condition a stochastic process is to build a conditioned probability in the classic Kolmogorov sense. If $\Gamma$ is a realization of the stochastic dynamics (a full specification of the trajectories of all constituents) and $\mathcal{O}(\Gamma)=\mathcal{O}$ denotes a constraint on some trajectory-dependent observable, the conditioned process is defined via

[TABLE]

$P(\Gamma\text{ and }\mathcal{O})$ is the probability of observing a specific constraint-fulfilling trajectory $\Gamma$ among all possible trajectories. Dividing by the probability $P(\mathcal{O})$ of realizing the constraint with any consistent trajectory, we obtain a new normalized ensemble $P(\Gamma\,|\,\mathcal{O})$ were every trajectory satisfies the constraint. The problem of translating this formal construction into an explicit stochastic dynamics has only recently been solved with some generality. The key assumptions needed are that (1) the observation-time $t$ of trajectories is large compared to the characteristic time-scale(s) of the original dynamics, (2) the dynamical observable $\mathcal{O}(\Gamma)$ is time-additive, i.e. all its cumulants scale linearly with $t$ , and (3) that the original process is Markovian and time-homogeneous. Based on the theory of large deviations, one can extract an effective process 111This process has variously been referred to as effective, driven, auxiliary in the literature whose typical realizations (asymptotically) coincide with the trajectory ensemble implied by the conditioning (1) Popkov et al. (2010); Jack and Sollich (2010, 2015); Chetrite and Touchette (2015a)—i.e., the effective process describes how a fluctuation, meaning an atypical value of $\mathcal{O}$ , is generated. Remarkably, the effective process is Markovian and time-homogeneous too, and general expressions for its transition rates (discrete case) Jack and Sollich (2010) or drift and diffusion functions (continuous case) Chetrite and Touchette (2015a) have been derived. How such a process is generated is illustrated in Fig. 1.

The same effective process emerges from the so-called Maximum Caliber (MaxCal) method Dixit et al. (2018); Monthus (2011); Chetrite and Touchette (2015b). Based on the constrained maximisation of a path-wise entropy, MaxCal extends the maximum entropy principle of Jaynes Jaynes (1957) and yields the effective process when the constraints are made on long-time averages of time-additive observables (non-Markovian ensembles may otherwise result). MaxCal has been applied with success in active matter problems, in order to infer from empirical data the interactions that govern bird flocking Bialek et al. (2012); Cavagna et al. (2014).

For a Markov process describing a system of interacting particles, the additional effective interactions that emerge upon conditioning can be directly appreciated from the rates of the effective process—if one can find them. That amounts to solving an eigenvalue problem in the dimension of the state space, wherefore analytical results are scarce. There are nonetheless a few notable successes for integrable models, including a range of results for current fluctuations in the TASEP Derrida and Lebowitz (1998); Derrida (2007); Prolhac and Mallick (2008); Gorissen et al. (2012); Popkov et al. (2010) and zero-range processes Harris et al. (2013); Hirschberg et al. (2015), as well as kinetically constrained models Garrahan et al. (2009); Jack and Sollich (2013). These, together with numerical and analytical evidence from nonequilibrium liquids Cagnetta et al. (2017); Nemoto et al. (2019); Tociu et al. (2019); Fodor et al. (2019), indicate that effective interactions are capable of driving a constrained system towards novel phases. There is, however, still a limited understanding of what features of the unconditioned process and conditioning variables lead to interactions that have physical plausibility—firstly, in the sense of their qualitative features, like the range of interactions; secondly, in terms of their energy efficiency. This work is an basic study of this question through a detailed comparison of simple active matter models.

I.2 Description of models and results

We consider two one-dimensional toy models of active matter: the totally asymmetric exclusion process (TASEP), and interacting run-and-tumble particles (RTPs). In the TASEP on an $L$ -periodic lattice, $N$ particles hop clockwise each with a rate $\gamma$ , unless the arrival site is already occupied. The RTP model differs only in that each particle has a variable direction $+$ (clockwise) or $-$ (anti-clockwise), alternating or tumbling between the two with a rate $\omega$ Thompson et al. (2011). The TASEP has several active matter interpretations, including motor protein transport and DNA transcription Chou et al. (2011); Chowdhury et al. (2005). The RTP dynamic is a simplistic model of microswimmer motility, e.g., the motility patterns of bacteria such as E. Coli Berg (2004); Schnitzer (1993). We refer to the two processes just described as naive, to emphasize that these interactions have not been optimized with respect to motility.

When interpreted as active agents, both TASEP and RT particles accomplish their function, i.e. self-propulsion, with some level of efficiency. In the energetic picture we have in mind, a unit of energy is consumed at a rate $\gamma$ (e.g., from the hydrolysis of one ATP molecule Schnitzer and Block (1997)), and is converted into a hop on the lattice if possible; otherwise it is wasted. Therefore, we identify the total number of steps per particle on the lattice as an output. The efficiency is the ratio of output to input, with the input being the number of energy units consumed. The output coincides with the particle current for the TASEP, while for the RTPs it is an undirected particle traffic.

For both the TASEP and RTPs, exclusion interactions reduce the energy efficiency of motion as defined above. By conditioning on the number of steps of each particle, we aim to recover the equivalence between energy units consumed and steps taken. However, as clarified below, conditioning alters not only the interactions between particles, but also the individual base rate of energy consumption. Therefore, the conditioned process is not guaranteed to be more efficient than the naive one, and the efficiency needs to be assessed in both cases and compared.

Due to the lack of general analytical solutions, we proceed numerically but exactly with the conditioning problems, and limit our scope to two and three particles. While the effective interactions have been derived exactly for the $N$ -particle TASEP in the limit of a large current Popkov et al. (2010), this limit alone is insufficient for our analysis which encompasses also moderate levels of conditioning. In fact, we find that when conditioning the TASEP on higher currents a state of diminishing returns quickly sets in. Further increase in output is accompanied by negligible increase in efficiency. In simpler terms, the main effect of the conditioning is to make the particles jump faster, and at a proportionally higher energy consumption. The effect of the emergent effective interaction—long-range and repulsive Popkov et al. (2010)—is small in comparison.

The outcome is remarkably different in the RTP model, whose effective process has not been considered elsewhere for more than one particle Mallmin et al. (2019a). Given a high active Péclet number ( $\text{Pe}=\gamma/\omega$ ), there is a window of fluctuations of the naive process for which the corresponding effective process exhibits directional alignment interactions, with little increase in the base hopping rate. Therefore, the gain in efficiency upon conditioning on higher-than-average motility is substantial. A similar alignment phenomenon was also observed in rare event simulations of active particles Nemoto et al. (2019).

Building on the comparison between the two examples studied, we give a general quantitative argument that a large variance-to-mean ratio in the output, as, for example, afforded by slowly evolving internal states coupled to the output, implies a high attainable gain in efficiency. Furthermore, we present a formal construction of an interaction potential that is guaranteed to increase the efficiency of a naive process whilst keeping the energy consumption fixed. Finally, comparing the two- and three-particle conditioned processes allows us to make some concrete statements on open questions regarding the factorization of the effective interactions. For example: when do emergent $N$ -body interactions reduce to simpler (e.g., 2-body), and what is the nature of many-body contributions?

II Dynamical large deviations formalism

We begin with an overview of the mathematical machinery of dynamical large deviations theory, which allows the explicit construction of the effective process introduced above. We then illustrate its application to interacting particle systems. The theory concerns the asymptotic probability distributions of time-integrated observables of Markov processes Ruelle (2004); Chetrite and Touchette (2015a); Touchette (2018). A Markov jump process, specifically, is characterised by a vector of probabilities $P$ (with a component for each configuration) evolving by the master equation $\partial_{t}P=\mathbb{W}P$ . The matrix $\mathbb{W}$ has elements van Kampen (2007)

[TABLE]

where $W(\mathcal{C}\to\mathcal{C}^{\prime})$ denotes the transition rate from configuration $\mathcal{C}$ to $\mathcal{C}^{\prime}$ . Consider a time-additive observable $N_{t}(\Gamma)$ , e.g., the total number of steps of an active particle for the realization $\Gamma$ . To determine the exact time-dependent distribution $P_{t}(N_{t})$ is a daunting task. It is nonetheless often possible to characterise its fluctuations via a large deviation principle Ellis (2007); Touchette (2009):

[TABLE]

The symbol $\asymp$ means equality of the logarithms in the $t\rightarrow\infty$ limit. The rate function $I(\sigma)$ is a non-negative function which vanishes at the average $\bar{\sigma}\equiv\lim_{t\to\infty}\langle N_{t}\rangle/t$ . For $N_{t}=\sigma t\neq\bar{\sigma}t$ , it gives the decay rate of the likelihood of sustaining the fluctuation. The scaled variable $\sigma$ is the effective hopping rate observed over time $t$ . When convex and differentiable, $I(\sigma)$ is the Legendre-Fenchel (LF) transform of the scaled cumulant generating function (SCGF) $c(s)$ , defined as the long-time limit of $t^{-1}\ln{\left\langle e^{sN_{t}}\right\rangle}$ .

According to the Donsker-Varadhan theory Donsker and Varadhan (1975), $c(s)$ coincides with the principal eigenvalue of the tilted transition matrix $\mathbb{W}^{\text{tilt}}(s)$ , defined by multiplying each off-diagonal element of $\mathbb{W}$ by $e^{s\alpha(\mathcal{C}\rightarrow\mathcal{C^{\prime}})}$ , where $\alpha$ measures the increase in $N_{t}$ across the transition $\mathcal{C}\rightarrow\mathcal{C^{\prime}}$ , e.g., 1 if the transition is a hop, else 0. By the Perron-Frobenius theorem Seneta (1981), $c(s)$ is a real function of $s$ . The spectral elements of $\mathbb{W}^{\text{tilt}}(s)$ also furnish the construction of the effective process—the process whose typical value of $N_{t}/t$ can be any chosen $\sigma$ , and whose typical trajectories coincide with those atypical trajectories generating $\sigma$ as a fluctuation in the original process Jack and Sollich (2010); Chetrite and Touchette (2015a).

In the first step of its construction, the effective process is parametrized by the bias parameter $s$ , rather than the desired fluctuation $\sigma$ . Its transition rates $W^{\text{eff}}$ are given by

[TABLE]

where $\ell(s)$ is the left eigenvector or $\mathbb{W}^{\text{tilt}}(s)$ corresponding to the eigenvalue $c(s)$ . The factor $\ell(\mathcal{C}^{\prime},s)/\ell(\mathcal{C},s)$ can be cast in the form of an ‘effective’ potential difference via the definition

[TABLE]

The function $\alpha(\mathcal{C}\to\mathcal{C}^{\prime})$ enters as a non-conservative driving force, since it cannot in general be written as a potential difference. In the last step of this construction, any chosen fluctuation $N_{t}/t=\sigma$ of the original process is made typical in the effective process by substituting for $s$ the saddle point value $s(\sigma)=I^{\prime}(\sigma)$ , i.e. the maximiser of the LF transform in Eq. (missing) 3. This last step requires convexity of $I$ at $\sigma$ .

The whole procedure generalizes painlessly when more than one observable is considered, as when we condition a system of $N$ interacting active particles on each particle’s output simultaneously. As the observable $N_{t}$ becomes an $N$ -component vector $\mathbf{N}_{t}$ , so do $s$ and $\sigma$ , and the product $s\sigma$ in Eq. (missing) 3 is replaced by a scalar product $\bm{s}\cdot\bm{\sigma}$ . However, we will ultimately set the conditional outputs of the different constituents to the same vale $\sigma$ , i.e. $\sigma_{i}=\sigma$ , $i=1,\dots,N$ . The multidimensional LF transform then becomes

[TABLE]

where $I(\sigma)$ denotes $I(\sigma_{1},\dots,\sigma_{N})$ computed at $\sigma_{1}=\dots=\sigma_{N}=\sigma$ . For the limited number of particles considered in this paper, i.e. $N\leq 3$ , it is safe to assume due to the particle’s indistinguishability that the supremum of Eq. (missing) 6 is attained on the line $s_{1}=s_{2}=\dots=s_{N}$ . In this case, Eq. (missing) 6 can be replaced with the simpler

[TABLE]

where $c(s)$ is a shorthand for $c(s,\dots,s)$ . Although we have verified this assumption a posteriori in all the cases examined here, a symmetry breaking for permutations of the particle labelling cannot be excluded in the general case, so that Eq. (missing) 6 would not reduce to Eq. (missing) 7.

III The Two-body conditioning problem

III.1 The Two-Particle TASEP

We come now to the two-body TASEP conditioning problem. We set the hopping rate $\gamma=1$ without loss of generality by rescaling time. For the TASEP, the efficiency $\eta$ reduces to the ratio of steady state currents of the (effective or naive) interacting and non-interacting processes. Concerning the efficiency of the naive process, we may in fact suppose arbitrary particle number $N$ and (periodic) lattice size $L$ . Since all configurations are equally likely in the TASEP steady state, one finds (cf. 2.1.1 of Blythe and Evans (2007))

[TABLE]

As noted in the introduction, and now demonstrated with reference to Eq. (missing) 4, we see that conditioning carries two effects: the addition of an effective interaction potential $V(\mathcal{C},s)$ , and a renormalization of the ‘bare’ hopping rate $1\to e^{s}$ ( $\alpha=1$ for all allowed transitions). Therefore, the effective-process efficiency for a given level of conditioning $\sigma$ is

[TABLE]

where the dependence on $L$ and $N$ is left implicit. To obtain the saddle-point $s(\sigma)$ , we first compute $c(s)$ via a ‘tilted’ Bethe ansatz of the dynamics Derrida and Lebowitz (1998); Popkov et al. (2010), then solve the maximization Eq. (missing) 3. In Fig. 2 we plot the resulting efficiency for $N=2$ against $\sigma$ and $L$ . The efficiency gain with respect to the naive process is small, and rapidly diminishes with larger system size $L$ (see inset). Just as in the analytically tractable case of large current fluctuations, the effective interaction for moderate conditioning is still a weak long-range repulsion. It is ‘smart’ in the sense of reducing the tendency to jam, but it does not contribute substantially to the hopping rate (at $\sigma=1$ , $e^{\Delta V}\lesssim 1.03$ for $L=16$ and decreases with $L$ ). Rephrasing, the most probable way for the two particles to be as active as in the absence of crowding (i.e., choosing $\sigma=\gamma$ ) is to simply ‘push harder’. As this requires more energy input, the efficiency quickly reaches a point of diminishing returns.

III.2 Two Run-And-Tumble Particles

The conclusions are substantially different for the RTP model, which we now consider for $N=2$ . Upon rescaling time so as to set the tumbling rate $\omega=1$ , the hopping rate $\gamma$ can be interpreted as the active Péclet number $\text{Pe }=\gamma/\omega$ , which quantifies the ratio of self-propulsion to diffusion. At any given time, the directions $\tau_{i}\in\{+,-\}$ , $i=1,2$ , of the particles may be either aligned or anti-aligned—crucially, a pair of particles may be found in a jammed configuration where each obstructs the other. The exact nonequilibrium steady state of the RTP model is only known for $N=2$ Slowman et al. (2016); there, the jammed configuration carries an anomalously large weight. From this solution we obtain an explicit expression for the efficiency of the naive two-particle process. In particular, it has a simple scaling form for large $L$ ,

[TABLE]

For smaller $L$ , as shown in the top panel of Fig. 3, the exact efficiency curve collapses approximately onto the scaling form Eq. (missing) 10, provided the RTP efficiency is normalised by the $L$ -dependent TASEP efficiency, Eq. (missing) 8. As one would expect, the efficiency drops with increasing Péclet number: when $\gamma\gg L$ , the particles will with equal probability be either jammed or in an aligned TASEP configuration, thus mustering only half the TASEP efficiency on average.

Next, we construct the effective process and determine its efficiency. Consider first the large deviations of the total number of steps $N_{t}$ per particle. As shown in bottom panel Fig. 3, the naive process average $\bar{\sigma}=\langle N_{t}\rangle/t$ (i.e., the zero of $I(\sigma)$ ) decreases relative to $\gamma$ as this parameter becomes large. The SCGF can be calculated numerically either directly from the tilted transition matrix or by solving a tilted version of the ‘root-paramterized eigenvalue equations’ derived in Mallmin et al. (2019b). The resulting rate function has a Gaussian profile for fluctuations larger than $\sigma\simeq\gamma$ , whereas the complementary regime of fluctuations smaller than the free-particle speed becomes almost flat for large Péclet number. This large variance stems from the particles’ ability to either align and produce a large current, or anti-align and then quickly reach the jammed state Cagnetta et al. (2017). This feature proves instrumental in increasing the efficiency of the RTP process. As the inset of Fig. 3 shows, in the approximate window $\sigma\in[\bar{\sigma},\gamma]$ where the rate function is flat, the saddle point (which does depends on $\gamma$ for the RTPs) $s_{\gamma}(\sigma)=I^{\prime}(\sigma)$ is close to zero. Conditioning the process on $\sigma$ is this range will, beyond the potential Eq. (missing) 5, only weakly alter the hopping rates, as $e^{s\alpha}\approx 1$ . In this way, the output can be increased without immediately encountering diminishing returns.

For each orientation sector $(\tau_{1},\tau_{2})$ we get via Eq. (missing) 4 the $\sigma$ -dependent effective potential $V_{\tau_{1}\tau_{2}}(x_{2}-x_{1})$ (with $V_{++}=V_{--}$ and $V_{+-}(d)=V_{-+}(L-d)$ by symmetry). The largest and most relevant potential difference is the alignment affinity $\Delta E_{\text{align}}(d)\equiv V_{+-}(d)-V_{++}(d)$ shown in Fig. 4. The alignment interaction is strongest at short face-to-face distance, and is superimposed on a weak long-range repulsion similar to that of the TASEP for large $L$ and/or small $\gamma$ . In addition, Fig. 4 suggests that stronger interactions emerge, together with the ‘flat’ branch of the rate function, when $\gamma$ exceeds the ring size. The present result for two particles should then be relevant for systems with many interacting particles where the typical inter-particle distance replaces the ring size—the features of the large deviations Cagnetta et al. (2017); Nemoto et al. (2019) do not seem to vary qualitatively by this generalisation.

The efficiency depends separately on $\gamma$ and $\sigma$ (since the saddle point does) as

[TABLE]

which we plot in Fig. 5 versus $\sigma$ . As anticipated, most of the possible gain in efficiency occurs before $\sigma\approx\gamma$ , i.e., with little jump rate renormalization, after which diminishing returns sets in and the efficiency plateaus.

IV Beyond the two-body conditioning problem

Eq. (missing) 11 is not limited to active transport problems. Its formulation presupposes a collection of $N$ entities, each independently receiving an input quantity at a rate $\gamma$ . In a (Markovian) collective process, this quantity is converted into an output $\sigma$ (per entity) that obeys a dynamical large deviation principle. We first explore the general implications of this setting. Then we will specialize on the three-body TASEP and RTP problems.

The derivative of $\eta$ with respect to $\sigma$ , evaluated at the naive average $\bar{\sigma}$ , quantifies the immediate improvement in efficiency upon conditioning:

[TABLE]

where we have used the general identities $s(\bar{\sigma})=0$ , $Ns^{\prime}(\bar{\sigma})=I^{\prime\prime}(\bar{\sigma})=1/\text{Var }\sigma$ 222 $I^{\prime\prime}(\bar{\sigma})$ gives the reciprocal variance, as fluctuations close to the mean are Gaussian by the central limit theorem., with $N$ the number of constituents. Note that since $\sigma$ is defined per entity, it scales as $1/N$ . Therefore the subtracting term in Eq. (missing) 12 is not ensured to vanish in the large- $N$ limit. If the input-to-output conversion follows strictly Poisson statistics (as for non-interacting particles on a lattice), $\eta^{\prime}(\bar{\sigma})=0$ .

The general conclusion afforded by Eq. (missing) 12 is that a large variance-to-mean ratio in the output implies high possible gains in efficiency by conditioning. In effect, when there is ample variance in output, conditioning may produce a more optimized process by chiefly retaining the high-perfomance trajectories of the original process and discarding low-performance ones. Consider again RTPs at high Péclet number, as in the above numerical study for $N=2$ . The large variance in mobility is afforded by the separation of time-scales between the re-orientation and hopping events. We therefore expect that the efficiency of self-propelled particle systems can be increased by exploiting fluctuating internal states coupled to the current-generating dynamics Pietzonka et al. (2016); Mallmin et al. (2019a). As Eq. (missing) 12 holds also for large $N$ , it could be directly applied to active models for which the large deviation functions have been determined from simulation or by other means, e.g. Nemoto et al. (2019).

We now put three particles on the lattice, and study numerically the same conditioning problem as for two RTPs or TASEP-particles in section III. The lattice size is fixed to $L=16$ . The apposite questions to ask are, firstly, whether the observation for two particles (viz., the alignment interaction for the RTPs) generalizes to higher particle numbers; secondly, if the effective three-body potentials are the sum of the pairwise potentials obtained from the two-body conditioning. Regarding the first question, Fig. 6 shows the SCGF $c(s)$ for three RTPs, for a range of $\gamma$ . By scaling $c(s)$ with $\gamma$ , we achieve an approximate superposition of the curves in the $s>0$ half-plane. Therefore, according to Eq. (missing) 7, the rate functions superimpose for $\sigma>\bar{\sigma}$ , as they do for $N=2$ (cf. Fig. 3). Additionally, the second derivative at $s=0$ , $c^{\prime\prime}(0)$ (Fig. 6, inset), increases quite steeply with $\gamma$ . By Legendre duality, the rate functions will become progressively flatter, as it does for $N=2$ . The peculiar large deviations of interacting RTPs are then preserved in the passage from $N=2$ to $N=3$ . Furthermore, we can use Eq. (missing) 12 to predict the expected efficiency gain. The inset of Fig. 6 shows both the mean $\bar{\sigma}$ and the variance $\text{Var }{\sigma}$ . Their ratio is already in the hundreds for $\gamma=16$ , indicating an efficiency derivative close to the upper bound $1/\gamma$ .

The TASEP large deviations also do not change appreciably from $N=2$ to $N=3$ , especially for large $L$ . Nevertheless, it is interesting to compare the effective potentials obtained in the two cases. For the potential to be pairwise, the total potential of each $N=3$ configuration must coincide with the sum of the $N=2$ potentials of the three particle pairs. A conceptually important issue immediately arises of what respective levels of conditioning make two- and three-body systems meaningfully comparable. It may at first seem physically intuitive to compare the $N=2$ and $N=3$ systems for the same output per particle $\sigma$ . However, although the large deviations are qualitatively similar, there is a quantitative dependence on the particle density such that, for instance, $\bar{\sigma}_{N=2}>\bar{\sigma}_{N=3}$ , especially for small $L$ . Importantly, the saddle-point function $s(\sigma)$ is $N$ -dependent, giving differing renormalizations $e^{s(\sigma)}$ of the base hopping rates in the $N=2$ and $N=3$ processes conditioned on the same $\sigma$ . This suggests that processes with different particle numbers should be compared at fixed $s$ rather than $\sigma$ . In addition, as the effective potential is determined by the left eigenvector $\ell(\mathcal{C},s)$ , it is likely to have a simpler algebraic dependence on $s$ than it does on $\sigma$ via $s(\sigma)=I^{\prime}(\sigma)$ . In fact, in the one and only known case where the effective potential factorises, it does so as function of $s$ . Nonetheless, the clearest results will be found in the limits of small/large $s$ , equivalent to the small/large $\sigma-\bar{\sigma}$ limits.

IV.1 TASEP three-body potential

Let us then consider the effective potential $V^{(3)}(d_{12},d_{23})$ of the three-particle TASEP, where $d_{ij}$ is the distance (in number of lattice sites) from particle $i$ to $j$ , with periodicity demanding $d_{31}=L-d_{12}-d_{23}$ . As in the $N=2$ problem, the potential is generally repulsive, with maxima at $d_{12},d_{23}=1,L-2$ . This is clearly manifest in Fig. 7. We now compare, for $s$ and $L$ fixed, the potential $V^{(3)}$ to the pairwise potential $\widetilde{V}^{(3)}$ constructed as the sum of the effective 2-body potentials $V^{(2)}$ found in the previous section; $\widetilde{V}^{(3)}(d_{12},d_{23})=V^{(2)}(d_{12})+V^{(2)}(d_{12})+V^{(2)}(L-d_{12}-d_{23})$ . The difference $\Delta=V^{(3)}-\widetilde{V}^{(3)}$ indicates the extent to which the 3-body interaction is reducible to pairwise interactions—which we refer to as factorization (of the left principal eigenvector of $\mathbb{W}^{\text{tilt}}_{s}$ ). Fig. 8 shows $\Delta$ as a function of $d_{12}$ and $d_{23}$ , for $L=16$ and two values of $s$ . The top-right colour plot refers to the high- $s$ case, the bottom-left to the low- $s$ case considered also in Fig. 7. For the larger $s$ , $\Delta\simeq 0$ , following the large- $s$ factorisation of the TASEP effective potential, demonstrated in Popkov et al. (2010). As $s$ is reduced, $\Delta$ decreases, signalling that three-body interactions play a significant role in optimally achieving the fluctuation.

We find this reduction to be more pronounced when the three particles are all next to each other (corners of Fig. 8). Our result indicates the cost in (effective) potential of keeping the three particles from colliding to be less than that of keeping the three particle pairs singularly disjoint. This is caused by a sort of screening effect due to the third particle. If, e.g., $d_{12}=d_{23}=1$ (as in the bottom-left corner of Fig. 7), then the repulsion between $1$ and $3$ is already effected by the repulsion between 1–2 and 2–3. Alternatively, the result can be resolved with an interaction whose strength decreases not only with the metric distance, i.e. the length in lattice units of the shortest path between two particles, but also with the topological distance, that is the number of other particles located on this shortest path Ballerini et al. (2008).

The difference $\Delta$ also reveals the directional asymmetry of $V^{(3)}$ , a purely three-body effect caused by the unidirectional motion of TASEP. Imagine, for instance, fixing the first two particles and moving the third along the lattice, thus exploring the $d_{12}=1$ vertical line of the potential landscape shown in Fig. 7. In moving from $d_{23}=1$ to $L-2$ (the maximum distance for a given $L$ ), the pairwise potential $V^{(2)}$ reaches its minimum at the midpoint and is symmetrical. The minimum of $V^{(3)}$ , instead, is slightly shifted towards the $d_{23}=L-2$ end. In simple terms, $V^{(3)}$ favours configurations where particle $3$ lies behind the small cluster formed by $1$ and $2$ , rather than that where $d_{23}=d_{31}$ . This effect, which resembles slipstreaming (in the absence of any fluid), is clearer in the difference $\Delta$ (cf. Fig. 8) than in the three-body potential itself.

IV.2 RTP three-body potential

We now discuss the three-body effective potential for a system of RTPs. The effective potential depends on the orientational as well as translational degrees of freedom. We then write the potential as $V_{\tau_{1}\tau_{2}\tau_{3}}^{(3)}(d_{12},d_{23})$ , where $d_{ij}$ is the distance from particle $i$ to $j$ and $\tau_{i}\in\{+,-\}$ is the orientation of the $i$ th particle. For the TASEP(subsection IV.1), we only had the single orientation sector $\tau_{1}\tau_{2}\tau_{3}={{+}{+}{+}}$ ; for the RTPs, we consider only the sectors ${+}{+}{+}$ and ${-}{+}{+}$ , as the rest are related to these by permutation of particle labelling and spatial inversion.

Plotting $V_{+++}^{(3)}(d_{12},d_{23})$ would reveal the same weak repulsive interaction found for the 3-particle TASEP and shown in Fig. 7; the difference $V_{+++}^{(3)}-V_{-++}^{(3)}$ shows an alignment interaction reminiscent of that discussed in subsection III.2. Additional three-body contributions are better understood by resorting to the difference with respect to the sum of pairwise interactions, $\Delta_{\tau_{1}\tau_{2}\tau_{3}}$ —notice the dependence on particle orientations for RTPs. This difference is shown for both the ${+}{+}{+}$ (bottom-left triangle) and ${-}{+}{+}$ (top-right triangle) sectors in Fig. 9, at $\gamma=\omega=1$ and $L=16$ . The left and right panels are representative of the low $s$ and high $s$ regimes, respectively. Let us begin with the former, i.e. compare $\Delta_{+++}$ and $\Delta_{-++}$ at low $s$ .

$\Delta_{+++}$ , though generally small, is larger in modulus at the corners of the colour plot, implying a weaker repulsion than in the two-body case. This observation, as in the TASEP, can be explained by a screening effect due to the third particle. $\Delta_{-++}$ displays a similar landscape, apart from two differences. First, the well at $d_{12}=L-2,d_{23}=1$ is deeper than for $\Delta_{+++}$ (see Fig. 9, left panel, bottom-right corner). This is a jammed configuration, which is obtained by the $d_{12}=L-2,d_{23}=1$ configuration shown in the top-right corner of Fig. 7 by flipping the arrow of particle $1$ : the rightmost particle, then, is in the $-$ state, and faces the two $+$ particles on its left. The two outer particles (namely $1$ and $2$ ), whose interaction is screened by the middle particle, are pointing against each other. Their two-body potential, then, is higher than if they were parallel, so that the reduction in the three-body potential is greater than in the ${+}{+}{+}$ sector. Second, the well at $d_{12}=d_{13}=1$ (Fig. 9, left panel, bottom-left and top-right corners) is shallower than for $\Delta_{+++}$ . The outer particles of this configuration ( $1$ and $3$ ) are indeed aligned outwards, so that their two-body potential is minimal. Following this argument, it is natural that at $d_{12}=1,d_{23}=L-2$ , where the two outer particles ( $3$ and $2$ ) are aligned with each other, $\Delta_{-++}$ is similar to $\Delta_{+++}$ .

Upon increasing $s$ , the three-RTP potential of the ${+}{+}{+}$ sector becomes closer and closer to a pairwise potential in analogy with the TASEP potential (see the bottom-left corners of the colour plots of Fig. 9). Conversely, in the ${-}{+}{+}$ sector, the difference between three-body and pairwise potential increases with $s$ . This observation holds for $\gamma>1$ , i.e., in general, $\gamma>\omega$ . For $\gamma\ll\omega$ , i.e. approaching the limit of a symmetric simple exclusion process (SSEP), factorisation is achieved in both the ${+}{+}{+}$ and the ${-}{+}{+}$ sectors.

V Discussion

Is conditioning a route to ‘smart’ matter? The simplest example of just two interacting particles demonstrates that smart interaction can indeed emerge in this way: run-and-tumble particles develop an effective alignment interaction in order to sustain atypically large mobilities. This result provides a microscopic basis to the observation of aligned states in large work fluctuations of two-dimensional active Brownian particle systems Nemoto et al. (2019). It also points towards a generality which extends beyond the one-dimensional continuous-time processes considered in this paper. To judge whether conditioning yields an actual improvement on the individual energetics, we have proposed an efficiency framework which takes into account the energy-consuming nature of forces in active systems. Additionally, we have discussed the relationship between the effective potential in a two- and three-body scenario, which serves as a prototype for the generalization to higher particle counts.

In terms of the efficiency, we discover in both the RTP and TASEP models that there is a phenomenon of rapidly diminishing returns, such that a relatively small window of conditioning values accounts for most of the range of possible efficiency gain. Furthermore, the relative amount of gain in efficiency differs substantially between the models. Conditioning can only ‘act’ on naturally occurring fluctuations of the original dynamics, which are limited for the TASEP to fluctuations in hopping speed fortuitously correlated with inter-particle distances. In contrast, the RTP model, whose initial efficiency is lower due to head-to-head jamming, displays a broader repertoire of fluctuations to be exploited by conditioning (both speed and direction), which explains the larger efficiency gain compared to the TASEP. Formula Eq. (missing) 12 encapsulates this finding by providing a quantitative basis for the claim that a large variance in the output results in high gains in efficiency upon conditioning. In simple terms, when there are large and relatively likely fluctuations, conditioning can exploit them. At a mathematical level, high variance in output is equivalent to near-flatness of the saddle-point $s(\sigma)$ . In a sense, this amounts to being close to a dynamical phase transition, at which the saddle-point would become truly flat, signalling the break-down of the large-deviation principle. However, as that happens, the ensemble equivalence that underlies the effective process construction is moot. We stress that studying the so-called $s$ -ensemble, as is common, without relating it back to the value of the constraint $\sigma$ , misses a qualitatively important aspect of conditioning, namely how the structure of the rate function itself determines the outcome of conditioning.

By comparing the two- and three-particle scenarios, we confirm that, in the general case, many-body interactions emerge that are not simple to extrapolate from the knowledge of the two-body interactions. While this may be perceived as a fundamental limitation of the conditioning approach, our detailed study of the three-body cases demonstrates that these many-body interactions need not be overly complicated. In the cases we examine, for instance, they can be ascribed to a topological screening effect: by placing an intermediate particle between two nearby ones, they are effectively screened, making the pairwise interaction across the intermediate particle superfluous. Thus, the 1D setup may be a main contributive factor to the lack of factorisation of the interaction. However, there are certainly situations in which factorization of the many-body interaction does occur, as in the high-current TASEP phase. To this we add the observation that, in the SSEP-limit of the RTP, the effective interaction factorises for arbitrary $s$ . Conversely, for large Péclet number, three-body contributions to the RTP potential increase rather than decrease with the bias $s$ . Future research may investigate more systematically what aspects of the dynamics lead to factorization (e.g., integrability and/or reversibility) while giving a thorough characterisation of three-body contributions when factorization is not expected.

Let us also stress that the conditioning framework is not limited to the arena of statistical mechanics. One may think of diverse practical scenarios where a specific potential or force is sought to achieve some outcome—this is the subject of optimal control theory, with which the concepts here discussed have been rigorously linked (see Chetrite and Touchette (2015a) and references therein). As in active and driven systems, it may be desired that the chosen constraint be satisfied only by adding a potential-like interaction, as the ‘tilt’ factor $e^{s\alpha}$ implies an increased energy injection. To fix, in our language, the base hopping rate $\gamma$ , one could consider a replica ( $R$ ) of the naive process with hopping rate $\gamma_{R}\leq\gamma$ and choose a conditioning value $\sigma$ such that $R$ when conditioned on it attains a renormalized hopping rate $\gamma_{R}e^{s_{\gamma_{0}}(\sigma)}=\gamma$ , i.e. $\sigma=s^{-1}_{\gamma_{R}}(\log(\gamma/\gamma_{R}))$ . Finally, take $\gamma_{R}=\gamma^{*}$ as the value that optimizes the efficiency $\sigma/\gamma$ of this effective process,

[TABLE]

Construct the potential Eq. (missing) 5 from the tilt of the transition matrix with hopping rate $\gamma^{*}$ and with tilt parameter $s^{*}=\log(\gamma/\gamma^{*})$ . This interaction potential added to the rates of the naive process with hopping rate $\gamma$ will have a higher efficiency (or at least not lower) while keeping energy input fixed. The price paid is that the resulting effective process is not strictly speaking representing the most probable fluctuations of the naive process it is compared to.

In closing, conditioning remains an intriguing framework to derive non-trivial interactions. It is intimately linked to inference from data Bialek et al. (2012); Cavagna et al. (2014), and to optimal control. In the active matter context, the way conditioning exploits beneficial fluctuation is suggestive of an evolutionary point-of-view Nemoto et al. (2019), furthered by the similarity shared by rare-events sampling techniques Giardinà et al. (2006) and gene selection. We have here only taken elementary steps in setting out the main ideas behind the framework and applying it to toy models. Our results nonetheless point to the effective process having a certain structure and robustness that generalizes with larger system sizes, provided the system parameters and conditioning value are chosen in a physically plausible way. While presently calculating large deviation elements of large systems is prohibitively costly, we expect concurrent developments of advanced approximations Tizón-Escamilla et al. (2019) and numerical methods Jacobson and Whitelam (2019); Whitelam (2018) to overcome this hurdle.

Acknowledgements.

FC acknowledges studentship support from SFC; EM from EPSRC grant no. EP/N509644/1. FC and EM contributed equally to this work. The authors thank M. R. Evans for valuable feedback on the manuscript.

Bibliography53

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ramaswamy (2010) S. Ramaswamy, Annu. Rev. Condens. Matter Phys. 1 , 323 (2010) . · doi ↗
2Vicsek and Zafeiris (2012) T. Vicsek and A. Zafeiris, Phys. Rep. 517 , 71 (2012) . · doi ↗
3Helbing (2001) D. Helbing, Rev. Mod. Phys. 73 , 1067 (2001) . · doi ↗
4Vicsek et al. (1995) T. Vicsek, A. Czirók, E. Ben-Jacob, I. Cohen, and O. Shochet, Phys. Rev. Lett. 75 (1995), 10.1103/Phys Rev Lett.75.1226 . · doi ↗
5Cavagna et al. (2014) A. Cavagna, I. Giardina, F. Ginelli, T. Mora, D. Piovani, R. Tavarone, and A. M. Walczak, Phys. Rev. E 89 , 042707 (2014) . · doi ↗
6Nemoto et al. (2019) T. Nemoto, É. Fodor, M. E. Cates, R. L. Jack, and J. Tailleur, Phys. Rev. E 99 , 022605 (2019) . · doi ↗
7Tociu et al. (2019) L. Tociu, E. Fodor, T. Nemoto, and S. Vaikuntanathan, Phys. Rev. X 9 , 041026 (2019) . · doi ↗
8Evans (2004) R. M. L. Evans, Phys. Rev. Lett. 92 , 150601 (2004) . · doi ↗