Parrondo games as disordered systems

J. M. Luck

arXiv:1905.04140·cond-mat.stat-mech·August 20, 2019

Parrondo games as disordered systems

J. M. Luck

PDF

TL;DR

This paper explores Parrondo's paradox by mapping simple stochastic games onto disordered systems, analyzing how different rule patterns and parameters influence the paradoxical winning strategies.

Contribution

It introduces a systematic analogy between Parrondo games and 1D disordered systems, focusing on gain dependence and weak-contrast regimes in various game classes.

Findings

01

Gain depends non-linearly on parameters.

02

Weak-contrast regimes identified and analyzed.

03

Game pattern influences the paradoxical outcome.

Abstract

Parrondo's paradox refers to the counter-intuitive situation where a winning strategy results from a suitable combination of losing ones. Simple stochastic games exhibiting this paradox have been introduced around the turn of the millennium. The common setting of these Parrondo games is that two rules, $A$ and $B$ , are played at discrete time steps, following either a periodic pattern or an aperiodic one, be it deterministic or random. These games can be mapped onto 1D random walks. In capital-dependent games, the probabilities of moving right or left depend on the walker's position modulo some integer $K$ . In history-dependent games, each step is correlated with the $Q$ previous ones. In both cases the gain identifies with the velocity of the walker's ballistic motion, which depends non-linearly on model parameters, allowing for the possibility of Parrondo's paradox. Calculating the…

Tables2

Table 1. Table 1: Exact rational and numerical expressions of the gain amplitude g W subscript 𝑔 𝑊 g_{W} of the capital-dependent Parrondo game with all periodic rules W 𝑊 W with primitive period P ≤ 6 𝑃 6 P\leq 6 . For each period P 𝑃 P , unit cells W 𝑊 W are ordered according to increasing gains. Last column: corresponding rational rotation number ω 𝜔 \omega of the cut-and-project sequence (see Section 3.4.2 ), when applicable.

$P$	$W$	$g_{W}$	$ω$
2	$A B$	$0$	$1 / 2$
3	$A A B$	$16 / 81 = 0.197530$	$1 / 3$
	$A B B$	$56 / 81 = 0.691358$	$2 / 3$
4	$A B B B$	$2 / 25 = 0.080000$	$3 / 4$
	$A A A B$	$4 / 25 = 0.160000$	$1 / 4$
	$A A B B$	$6 / 25 = 0.240000$	-
5	$A A B B B$	$56 / 605 = 0.092561$	-
	$A A A A B$	$80 / 605 = 0.132231$	$1 / 5$
	$A A A B B$	$184 / 605 = 0.304132$	-
	$A A B A B$	$208 / 605 = 0.343801$	$2 / 5$
	$A B B B B$	$232 / 605 = 0.383471$	$4 / 5$
	$A B A B B$	$488 / 605 = 0.806611$	$3 / 5$
6	$A A A B B B$	$324 / 3969 = 0.081632$	-
	$A B A B B B$	$332 / 3969 = 0.083648$	-
	$A A A A A B$	$440 / 3969 = 0.110859$	$1 / 6$
	$A A B B B B$	$548 / 3969 = 0.138070$	-
	$A B B B B B$	$604 / 3969 = 0.152179$	$5 / 6$
	$A A A B A B$	$712 / 3969 = 0.179390$	-
	$A A A A B B$	$820 / 3969 = 0.206601$	-
	$A A B A B B$	$1404 / 3969 = 0.353741$	-

Table 2. Table 2: Exact expressions of P 𝑃 P times the gain amplitude g W subscript 𝑔 𝑊 g_{W} for all periodic history-dependent Parrondo games W 𝑊 W with primitive period P ≤ 6 𝑃 6 P\leq 6 .

$P$	$W$	$P g_{W}$
2	$A B$	1
3	$A A B$	$1 - μ$
	$A B B$	$1 - μ^{3}$
4	$A A A B$	$1 - μ$
	$A A B B$	$1 - μ^{2}$
	$A B B B$	$1 + μ$
5	$A A A A B$	$1 - μ$
	$A A A B B$	$1 - μ^{2}$
	$A A B A B$	$(1 - μ) (2 + μ)$
	$A A B B B$	$(1 - μ) {(1 + μ)}^{2}$
	$A B A B B$	$(1 - μ^{2}) (2 + μ^{2})$
	$A B B B B$	$(1 - μ^{5}) (1 + μ)$
6	$A A A A A B$	$1 - μ$
	$A A A A B B$	$1 - μ^{2}$
	$A A A B A B$	$(1 - μ) (2 + μ)$
	$A A A B B B$	$(1 - μ) {(1 + μ)}^{2}$
	${}^{⋆}A A B A B B$	$2 (1 - μ^{2})$
	${}^{⋆}A A B B A B$	$(1 - μ) (2 + μ + μ^{2})$
	$A A B B B B$	$(1 - μ^{3}) (1 + μ)$
	$A B A B B B$	$2 + μ$
	$A B B B B B$	$1 + μ + μ^{2}$

Equations323

G = t \to \infty lim \frac{n _{t}}{t} .

G = t \to \infty lim \frac{n _{t}}{t} .

Parrondo’s paradox: {G > 0, G_{A} \leq 0, G_{B} \leq 0} .

Parrondo’s paradox: {G > 0, G_{A} \leq 0, G_{B} \leq 0} .

⟨ n_{t} ⟩ = n_{0} + s = 1 \sum t (2 p_{s} - 1) .

⟨ n_{t} ⟩ = n_{0} + s = 1 \sum t (2 p_{s} - 1) .

G = 2 \overline{p} - 1.

G = 2 \overline{p} - 1.

d = K \mbox or d = 2^{Q},

d = K \mbox or d = 2^{Q},

{\bm{M}}=\pmatrix{0&q_{1}&p_{2}\cr p_{0}&0&q_{2}\cr q_{0}&p_{1}&0},

{\bm{M}}=\pmatrix{0&q_{1}&p_{2}\cr p_{0}&0&q_{2}\cr q_{0}&p_{1}&0},

ϕ_{t} = \pmatrix X_{t} \cr Y_{t} \cr Z_{t},

ϕ_{t} = \pmatrix X_{t} \cr Y_{t} \cr Z_{t},

ϕ_{t} = M_{A} ϕ_{t - 1},

ϕ_{t} = M_{A} ϕ_{t - 1},

{\bm{M}}_{A}=\pmatrix{0&q&p\cr p&0&q\cr q&p&0}.

{\bm{M}}_{A}=\pmatrix{0&q&p\cr p&0&q\cr q&p&0}.

ϕ_{A} = M_{A} ϕ_{A},

ϕ_{A} = M_{A} ϕ_{A},

ϕ_{A} = ϕ_{uni} = \frac{1}{3} \pmatrix 1 \cr 1 \cr 1 .

ϕ_{A} = ϕ_{uni} = \frac{1}{3} \pmatrix 1 \cr 1 \cr 1 .

G_{A} = J_{A} \cdot ϕ_{A},

G_{A} = J_{A} \cdot ϕ_{A},

{\bm{J}}_{A}=(2p-1)\pmatrix{1&1&1},

{\bm{J}}_{A}=(2p-1)\pmatrix{1&1&1},

G_{A} = 2 p - 1.

G_{A} = 2 p - 1.

ϕ_{t} = M_{B} ϕ_{t - 1},

ϕ_{t} = M_{B} ϕ_{t - 1},

{\bm{M}}_{B}=\pmatrix{0&q_{1}&p_{1}\cr p_{0}&0&q_{1}\cr q_{0}&p_{1}&0}.

{\bm{M}}_{B}=\pmatrix{0&q_{1}&p_{1}\cr p_{0}&0&q_{1}\cr q_{0}&p_{1}&0}.

ϕ_{B} = M_{B} ϕ_{B} .

ϕ_{B} = M_{B} ϕ_{B} .

ϕ_{B} = \pmatrix X_{B} \cr Y_{B} \cr Z_{B},

ϕ_{B} = \pmatrix X_{B} \cr Y_{B} \cr Z_{B},

X_{B} = \frac{1 - p _{1} q _{1}}{D}, Y_{B} = \frac{1 - q _{0} p _{1}}{D}, Z_{B} = \frac{1 - p _{0} q _{1}}{D}

X_{B} = \frac{1 - p _{1} q _{1}}{D}, Y_{B} = \frac{1 - q _{0} p _{1}}{D}, Z_{B} = \frac{1 - p _{0} q _{1}}{D}

D = 3 - p_{0} q_{1} - q_{0} p_{1} - p_{1} q_{1} .

D = 3 - p_{0} q_{1} - q_{0} p_{1} - p_{1} q_{1} .

G_{B} = J_{B} \cdot ϕ_{B},

G_{B} = J_{B} \cdot ϕ_{B},

{\bm{J}}_{B}=\pmatrix{p_{0}-q_{0}&p_{1}-q_{1}&p_{1}-q_{1}},

{\bm{J}}_{B}=\pmatrix{p_{0}-q_{0}&p_{1}-q_{1}&p_{1}-q_{1}},

G_{B} = \frac{3 ( p _{0} p _{1}^{2} - q _{0} q _{1}^{2} )}{D} .

G_{B} = \frac{3 ( p _{0} p _{1}^{2} - q _{0} q _{1}^{2} )}{D} .

\left[{\bm{M}}_{A},{\bm{M}}_{B}\right]=(p_{1}-p_{0})\pmatrix{2p-1&0&0\cr q&q&p\cr-p&-q&-p}.

\left[{\bm{M}}_{A},{\bm{M}}_{B}\right]=(p_{1}-p_{0})\pmatrix{2p-1&0&0\cr q&q&p\cr-p&-q&-p}.

p = \frac{1}{2},

p = \frac{1}{2},

p_{0} = \frac{( 1 - p _{1} ) ^{2}}{1 - 2 p _{1} ( 1 - p _{1} )},

p_{0} = \frac{( 1 - p _{1} ) ^{2}}{1 - 2 p _{1} ( 1 - p _{1} )},

p_{0} = \frac{1}{2} - \frac{v}{1 + v ^{2}}, p_{1} = \frac{1}{2} (1 + v),

p_{0} = \frac{1}{2} - \frac{v}{1 + v ^{2}}, p_{1} = \frac{1}{2} (1 + v),

\left[{\bm{M}}_{A},{\bm{M}}_{B}\right]=\frac{v(3+v^{2})}{4(1+v^{2})}\pmatrix{0&0&0\cr 1&1&1\cr-1&-1&-1}.

\left[{\bm{M}}_{A},{\bm{M}}_{B}\right]=\frac{v(3+v^{2})}{4(1+v^{2})}\pmatrix{0&0&0\cr 1&1&1\cr-1&-1&-1}.

A B A B B A B B B ⟷ B B B A B B A B A .

A B A B B A B B B ⟷ B B B A B B A B A .

\overline{ϕ}_{t} = \overline{M} \overline{ϕ}_{t - 1},

\overline{ϕ}_{t} = \overline{M} \overline{ϕ}_{t - 1},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

11institutetext: Institut de Physique Théorique, Université Paris-Saclay, CEA and CNRS, 91191 Gif-sur-Yvette, France.

11email: [email protected]

Parrondo games as disordered systems

Jean-Marc Luck

Abstract

Parrondo’s paradox refers to the counter-intuitive situation where a winning strategy results from a suitable combination of losing ones. Simple stochastic games exhibiting this paradox have been introduced around the turn of the millennium. The common setting of these Parrondo games is that two rules, $A$ and $B$ , are played at discrete time steps, following either a periodic pattern or an aperiodic one, be it deterministic or random. These games can be mapped onto 1D random walks. In capital-dependent games, the probabilities of moving right or left depend on the walker’s position modulo some integer $K$ . In history-dependent games, each step is correlated with the $Q$ previous ones. In both cases the gain identifies with the velocity of the walker’s ballistic motion, which depends non-linearly on model parameters, allowing for the possibility of Parrondo’s paradox. Calculating the gain involves products of non-commuting Markov matrices, which are somehow analogous to the transfer matrices used in the physics of 1D disordered systems. Elaborating upon this analogy, we study a paradigmatic Parrondo game of each class in the neutral situation where each rule, when played alone, is fair. The main emphasis of this systematic approach is on the dependence of the gain on the remaining parameters and, above all, on the game, i.e., the rule pattern, be it periodic or aperiodic, deterministic or random. One of the most original sides of this work is the identification of weak-contrast regimes for capital-dependent and history-dependent Parrondo games, and a detailed quantitative investigation of the gain in the latter scaling regimes.

1 Introduction

Parrondo’s paradox refers to the counter-intuitive situation where a winning strategy results from a suitable combination of losing ones. Simple stochastic games exhibiting this paradox have been introduced by Parrondo and collaborators around the turn of the millennium [1, 2, 3, 4, 5]. References [6, 7, 8, 9] provide comprehensive reviews of early developments of Parrondo games, including historical aspects and extensive discussions of their paradoxical nature. Parrondo games were originally devised as discrete analogues of Brownian ratchets. The latter ratchets are extensions of Feynman’s celebrated thermal ratchet [10] to the microscopic scale, aimed at modeling the force-free motion of molecular motors [11, 12, 13]. Flashing Brownian ratchets consist of a point particle undergoing Brownian diffusion on the line under the effect of a periodic potential which is both spatially asymmetric and periodically modulated in time. The interplay of these two properties breaks detailed balance. Under generic circumstances, it yields a rectification of thermal noise and induces a steady ballistic motion of the particle (see [14, 15] for reviews).

Parrondo games belong to the realm of Markovian games of chance. The usual setting is that two stochastic rules, denoted as $A$ and $B$ , are played at discrete time steps in a specific order, following a periodic pattern such as $ABBABB\dots$ or an aperiodic one, either deterministic or random. It is advantageous to describe Parrondo games within the framework of a random walker occupying the sites of an infinite 1D lattice and moving to neighboring sites at discrete time steps according to the above stochastic rules. The discrete position $n_{t}$ of the walker at integer time $t$ identifies with the capital of the player. In the generic situation where the walker’s motion is ballistic, its velocity yields the gain $G$ of the player per time step:

[TABLE]

Parrondo’s paradox holds whenever the chosen game (rule pattern) yields a positive gain, whereas each rule, when played alone, either is fair or has a negative gain:

[TABLE]

There are two main classes of Parrondo games. The first class is referred to as capital-dependent games. The rules, either $A$ or $B$ or both, depend explicitly on the walker’s position (i.e., the player’s capital) $n_{t}\mathop{\;\rm mod\;}\nolimits K$ , where $K$ is some fixed integer111 $n\mathop{\;\rm mod\;}\nolimits K=0,\dots,K-1$ is the rest of the Euclidean division of $n$ by $K$ .. The game originally proposed by Parrondo [1, 2, 3, 4] corresponds to $K=3$ . Parrondo’s paradox also holds for some specific models with $K=2$ , where Rule $B$ depends on the parity of the player’s capital [16, 17]. A second class of Parrondo games, referred to as history-dependent games [5, 18, 19], has also been considered, even though it has not become as popular as capital-dependent games. There, the complexity of the dynamics originates in a memory effect between successive steps. The probability for the walker to move right or left is now independent of its position $n_{t}$ , but it depends on the $Q$ previous steps, in a way that is different for Rules $A$ and $B$ . Parrondo’s paradox already holds in some cases for $Q=1$ , and more generally for $Q=2$ [16].

Consider for the time being a random walker on an infinite 1D lattice, with time-dependent probabilities of moving to neighboring sites. Let $p_{t}$ (resp. $q_{t}=1-p_{t}$ ) be the probability that the walker moves to the right (resp. to the left) at time $t$ . The mean position of the walker at time $t$ reads

[TABLE]

This expression only depends on the sum of the probability differences $p_{s}-q_{s}=2p_{s}-1$ , and not on the order in which single steps are performed. In other words, elementary steps commute with each other. In the case of an annealed disorder, where the time-dependent probabilities $p_{t}$ are themselves drawn from some distribution, the velocity of the walker is self-averaging and reads

[TABLE]

The notations for averages used throughout this paper follow the usual conventions of the theory of disordered systems. Brackets, $\langle\dots\rangle$ , denote an average over realizations of the Markov process, i.e., over histories of the random walker, whereas a bar, ${\overline{\dots\vphantom{m}}}$ , denotes an annealed average over the distribution of the probabilities defining the Markov process, whenever the latter are themselves random.

In the case of Parrondo games, the existence of internal degrees of freedom (the walker’s position $n\mathop{\;\rm mod\;}\nolimits K$ for capital-dependent games, or the $Q$ previous steps for history-dependent games) makes the corresponding random walk non-trivial. The gain $G$ , i.e., the walker’s velocity, depends non-linearly on model parameters, allowing for the possibility of Parrondo’s paradox, defined by the inequalities (2). Parrondo games can be viewed as inhomogeneous Markov chains [6, 7, 8], whose study involves products of non-commuting Markov matrices acting on a finite-dimensional linear space with dimension

[TABLE]

encoding internal degrees of freedom. These products of Markov matrices are somehow temporal analogues of the spatial products of non-commuting transfer matrices that are ubiquitous in investigations of 1D disordered systems (see [20, 21, 22, 23, 24, 25] for reviews).

The goal of the present work is to elaborate on this analogy and to study Parrondo games by means of various analytical techniques freely inspired by the theory of 1D disordered systems. This line of thought allows us to deal with capital-dependent and history-dependent games on the same footing, and yields a wealth of new results on both classes of Parrondo games. We consider capital-dependent games in Sections 2 and 3 and history-dependent games in Sections 4 and 5. We choose for definiteness to work with one paradigmatic example of each class. Most of the time, we focus our attention onto the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). The main emphasis of this systematic approach is on the dependence of the gain $G$ on the remaining free parameters and, more importantly, on the game, i.e., the rule pattern, be it periodic or aperiodic, deterministic or random. One of the most original sides of this work is the identification of a weak-contrast scaling regime and its systematic investigation for both classes of games (Sections 3 and 5). Section 6 contains a brief overview.

2 Capital-dependent games

2.1 Generalities

The game originally proposed by Parrondo [1, 2, 3, 4] is a prototypical example of a capital-dependent game with $K=3$ , where rules depend on the player’s capital (i.e., of the walker’s position) mod 3. It is sufficient to monitor the dynamics of the walker in the three-dimensional internal space parametrized by its position $n\mathop{\;\rm mod\;}\nolimits 3=0,~{}1,~{}2$ . Within this framework, the most general Markovian stochastic rule is depicted in Figure 1 and corresponds to the Markov matrix

[TABLE]

with the notation $q_{n}=1-p_{n}$ .

The standard body of knowledge on Markov chains can be found in the classical references [26, 27, 28, 29, 30, 31]. Hereafter we not pretend at any mathematical rigor. We shall only need the following general result: the unique ergodicity of a discrete-time Markov chain, i.e., essentially the uniqueness of its stationary state, is ensured by the fact that the corresponding Markov matrix ${\bm{M}}$ has a simple (i.e., non-degenerate) unit eigenvalue, while all other eigenvalues are strictly less than unity in modulus.

We introduce the time-dependent state vector

[TABLE]

where $X_{t}$ , $Y_{t}$ and $Z_{t}$ are the probabilities that the walker’s position $n\mathop{\;\rm mod\;}\nolimits 3$ at time $t$ is respectively 0, 1 and 2.

Parrondo’s historical game consists of a combination of the following rules [1, 2, 3, 4].

$\bullet$

Rule $A$ . The three probabilities are equal: $p_{0}=p_{1}=p_{2}=p$ . If Rule $A$ is played at time $t$ , we have

[TABLE]

with

[TABLE]

If Rule $A$ is played alone, the walker executes a uniformly biased random walk. Its stationary state ${\bm{\phi}}_{A}$ , such that

[TABLE]

is uniform:

[TABLE]

We have

[TABLE]

where the current vector reads

[TABLE]

and so

[TABLE]

$\bullet$

Rule $B$ . It is defined by setting $p_{2}=p_{1}$ , keeping $p_{0}$ and $p_{1}$ as free parameters. If Rule $B$ is played at time $t$ , we have

[TABLE]

with

[TABLE]

If Rule $B$ is played alone, the stationary state of the system is described by the normalized eigenvector ${\bm{\phi}}_{B}$ associated with the unit eigenvalue of ${\bm{M}}_{B}$ , such that

[TABLE]

We thus obtain

[TABLE]

with

[TABLE]

and

[TABLE]

We have

[TABLE]

where the current vector reads

[TABLE]

and so

[TABLE]

The Markov matrices ${\bm{M}}_{A}$ and ${\bm{M}}_{B}$ generically do not commute with each other. We have indeed

[TABLE]

The commutator vanishes only for $p_{1}=p_{0}$ , i.e., when each rule corresponds to a uniformly biased random walk, so that the dynamics in internal space can be forgotten.

Hereafter the main focus will be on the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). In this situation, a given game, such as e.g. the periodic game $ABBABB\dots$ , exhibits Parrondo’s paradox whenever the corresponding gain, denoted $G_{ABB}$ , is positive (see (2)). The condition that Rule $A$ is fair reads

[TABLE]

expressing that the corresponding random walk is unbiased, i.e., symmetric. The condition that Rule $B$ is fair yields a relation between $p_{0}$ and $p_{1}$ ,

[TABLE]

leaving one free parameter. It is advantageous to choose the parametrization

[TABLE]

where the contrast parameter $v$ in the range $-1<v<1$ provides a measure of the difference between both rules. The expression (24) becomes

[TABLE]

We close this section by a discussion of symmetries.

$\bullet$

Parity, i.e., the change of sign of the walker’s position ( $n\longleftrightarrow-n$ ), corresponds to changing the orientation of the circle shown in Figure 1. It therefore amounts to exchanging the probabilities as $p\longleftrightarrow q$ for Rule $A$ , and $p_{0}\longleftrightarrow q_{0}$ , $p_{1}\longleftrightarrow q_{1}$ for Rule $B$ . In the neutral situation, this amounts to changing $v$ into its opposite ( $v\longleftrightarrow-v$ ). The gain $G$ is therefore an odd function of $v$ , irrespective of the game.

$\bullet$

Time reversal amounts to the sole reversal of the order of letters for a general game of finite duration, such as

[TABLE]

The model is indeed simple enough to ensure that each rule is reversible, i.e., coincides with its own time-reversed, as soon as it is fair. This is obvious for Rule $A$ . For Rule $B$ , the expression (23) shows that the condition for $G_{B}$ to vanish is $p_{0}p_{1}^{2}=q_{0}q_{1}^{2}$ . This is nothing but Kolmogorov’s criterion for the Markov chain defining Rule $B$ to be reversible (see e.g. [30, 31]). There is indeed only one non-trivial cycle (see Figure 1), and so Kolmogorov’s criterion amounts to one single equation. As a consequence of the above, the gain $G$ is left unchanged under a reversal of the game, i.e., of the rule pattern, such as (29).

2.2 Random games

The first situation demonstrating Parrondo’s paradox is that of an (infinitely long) random game, where at each time step Rule $B$ is chosen with probability $\rho$ and Rule $A$ with the complementary probability $1-\rho$ . In the following, we are only interested in the average gain ${\overline{G}}$ of this random game, and so it is sufficient to know the average state vector ${\overline{{\bm{\phi}}}}$ . The present problem is therefore easier than the investigation of usual 1D disordered systems, which requires the evaluation of the Lyapunov exponent of a matrix product (see [20, 21, 22, 23, 24, 25] for reviews). The time-dependent average state vector ${\overline{{\bm{\phi}}}}_{t}$ obeys a recursion of the form

[TABLE]

where the average Markov matrix,

[TABLE]

has the same functional form as ${\bm{M}}_{B}$ , albeit with effective parameters [7, 8]

[TABLE]

The average gain ${\overline{G}}$ of the random game is obtained by replacing in (23) $p_{0}$ and $p_{1}$ by the above effective values.

For the uniformly random game ( $\rho=1/2$ ), where at each time step Rules $A$ and $B$ are chosen with equal probabilities, we obtain

[TABLE]

with

[TABLE]

The expression (33) allows one to measure how rare is Parrondo’s paradox. In the present setting, it is natural to define the probability of observing Parrondo’s paradox as the volume of the three-dimensional domain in $(p,p_{0},p_{1})$ space such that the inequalities (2) hold, with $G$ given by (33). A numerical integration yields

[TABLE]

This very small number is in perfect agreement with an earlier estimate [32].

From now on, until the end of Section 3, we restrict the analysis to capital-dependent Parrondo games in the neutral situation where both rules, when played alone, are fair ( $G_{A}=G_{B}=0$ ), Using the parametrization (25), (27), we obtain the following expression for the average gain:

[TABLE]

The above result exhibits several features of interest. It is an odd function of the contrast parameter $v$ , as expected from the above considerations on parity. The average gain has the sign of $v$ , irrespective of $\rho$ . Parrondo’s paradox therefore holds for all $v>0$ and all non-trivial probabilities ( $0<\rho<1$ ). There is no discrepancy with the tininess of the probability (35), since we have fixed two of the three model parameters by focussing our attention onto the neutral situation. The average gain vanishes as $\rho\to 0$ and $\rho\to 1$ , where random games respectively degenerate to Rule $A$ and Rule $B$ . It reaches its absolute maximum,

[TABLE]

for

[TABLE]

and $v\to 1$ . The latter limit is however singular, as it corresponds to $p_{0}\to 0$ and $p_{1}\to 1$ . In this limit, the Markov matrix ${\bm{M}}_{B}$ looses the property of unique ergodicity, as its eigenvalues become 0 and $\pm 1$ .

In the weak-contrast regime ( $v\to 0$ ), the average gain vanishes cubically. We shall see in Section 3 that this cubic law holds for arbitrary games. We are thus led to introduce the gain amplitude (or amplitude, for short)

[TABLE]

For random games, the expression (36) yields

[TABLE]

For the uniformly random game ( $\rho=1/2$ ), the average amplitude reads

[TABLE]

When the probability $\rho$ of choosing Rule $B$ varies between 0 and 1, the amplitude (40) reaches its maximum

[TABLE]

for

[TABLE]

2.3 Periodic games

In this section we consider periodic games, i.e., periodic rule patterns, defined by the infinite repetition of a unit cell $W$ of length $P$ , like e.g. $W=ABB$ , which has period $P=3$ . We shall alternatively consider $W$ as a word consisting of $P$ letters, $A$ or $B$ , and introduce the symbols

[TABLE]

according to whether the $n$ th letter in $W$ is $A$ or $B$ . The stationary state of the game has the same period $P$ as the game itself. It is encoded in $P$ state vectors ${\bm{\phi}}_{n}$ obeying

[TABLE]

with periodic boundary conditions ( ${\bm{\phi}}_{P}={\bm{\phi}}_{0}$ ). The associated gain reads

[TABLE]

where the current vectors ${\bm{J}}_{A}$ and ${\bm{J}}_{B}$ are evaluated in the neutral situation, with parameters (25), (27), i.e.,

[TABLE]

The recursion (45) amounts to a system of $3P$ linear equations, whose solution may be obtained by means of a computer algebra system such as MACSYMA. The complexity of the expressions of the gain $G$ however grows very rapidly with the period $P$ . We recall that the gain is invariant under cyclic permutations and reversal of the unit cell. Its expressions for all games with periods 2 and 3 are given below.

$\bullet$

$P=2$ . There is only one non-trivial unit cell with period 2, namely $W=AB$ . The corresponding gain vanishes [19]:

[TABLE]

This result comes as a surprise, as it is not dictated by any obvious symmetry.

$\bullet$

$P=3$ . There are two inequivalent unit cells with period 3. The corresponding gains read

[TABLE]

The above expressions demonstrate that the gain vanishes cubically in the weak-contrast regime ( $v\to 0$ ), which will be the subject of Section 3. The corresponding amplitudes $g_{AB}=0$ , $g_{AAB}=16/81$ and $g_{ABB}=56/81$ (see (39)) are listed in the first three lines of Table 1.

3 Weak-contrast scaling regime of capital-dependent games

3.1 Generalities

In the weak-contrast scaling regime ( $v\to 0$ ), both rules are close to symmetric random walks, so that state vectors are expected to become close to the uniform one, given by (11). It can indeed be checked, in full generality, that the differences between $Y_{n}$ or $Z_{n}$ and $1/3$ are of order $v$ , whereas the difference between $X_{n}$ and $1/3$ is of order $v^{2}$ , and the resulting gain is of order $v^{3}$ .

Hereafter we use the shorthand notation

[TABLE]

Let us focus for a while our attention onto periodic games, considered in Section 2.3. The matrix recursion (45) between state vectors ${\bm{\phi}}_{n}$ boils down to two coupled linear recursions for the rescaled co-ordinates

[TABLE]

namely

[TABLE]

with periodic boundary conditions ( $y_{P}=y_{0}$ , $x_{P}=x_{0}$ ). The gain amplitude (see (39)) reads

[TABLE]

The recursions (54), (55) are instrumental in the investigation of the weak-contrast regime. Their key property is the occurrence of the uniform damping factor $\kappa$ , whereas the rule pattern, encoded in the symbol $\sigma_{n}=0$ or 1, according to (44), enters linearly. The above formalism extends to aperiodic games, either deterministic or random (see Section 3.4).

3.2 Random games

As a first application of the above formalism, let us revisit random games, already considered in Section 2.2. In (55), $\sigma_{n}$ and $y_{n-1}$ are statistically independent, and we have ${\overline{\sigma_{n}}}=\rho$ . The stationary averages ${\overline{y}}$ and ${\overline{x}}$ therefore obey

[TABLE]

hence

[TABLE]

and

[TABLE]

The result (40) is thus recovered.

3.3 Periodic games

We now turn to the case of periodic games, already considered in Section 2.3. The explicit solution to (54), (55) with periodic boundary conditions reads

[TABLE]

Inserting the latter expression for $x_{n}$ into (56), we obtain after some algebra

[TABLE]

In the above, all indices of $\sigma$ symbols are to be understood mod $P$ .

The result (63) provides an explicit expression of the gain of Parrondo’s historical game for an arbitrary periodic rule pattern in the weak-contrast regime. The cyclic and reversal invariance of the gain appear manifestly. The extension of the above result to aperiodic games will be considered in Section 3.4.

For the time being we keep the focus onto periodic games. For a given period $P$ , (63) shows that all amplitudes are rational numbers whose denominator divides $P(2^{P}-(-1)^{P})^{2}$ . In the case where the unit cell $W$ consists of only two blocks,

[TABLE]

with arbitrary integers $M$ , $N\geq 1$ , so that $P=M+N$ , the expression (63) simplifies to

[TABLE]

When both block lengths $M$ and $N$ become large, the amplitude falls off as

[TABLE]

up to exponentially small corrections. This decay law in $1/P$ can be interpreted as follows. Both rules $A$ and $B$ are fair, and so only the interfaces between blocks yield some gain. More generally, when one of the block lengths gets large, the other one being kept finite, (3.3) yields

[TABLE]

for $N\to\infty$ at fixed $M$ , and

[TABLE]

for $M\to\infty$ at fixed $N$ . Both sequences $a_{M}$ and $b_{N}$ converge to $8/9$ , consistently with (66), with exponentially damped oscillations. The smallest of them are $a_{2}=2/3$ and $b_{3}=1/2$ , whereas the largest read $a_{1}=b_{2}=4/3$ .

We now turn to general features of interest exhibited by the gain amplitudes of periodic games. The dependence of $g_{W}$ on the unit cell $W$ defining the periodic game appears to be very intricate in general. The result (3.3) indeed virtually exhausts all cases where (63) yields manageable closed-form expression.

Table 1 gives the exact rational and numerical expressions of the gain amplitude $g_{W}$ for all periodic games with primitive222The primitive period $P$ of a periodic sequence is its smallest positive period. period $P\leq 6$ . The explicit result (3.3) yields 15 of the 20 expressions given there, whereas the remaining five cases need a specific evaluation of the triple sum entering (63). The last column gives the corresponding rotation number $\omega$ of the cut-and-project sequence (see Section 3.4.2), when applicable.

For a given – not necessarily primitive – period $P$ , the $2^{P}$ possible unit cells $W$ of length $P$ can be enumerated by means of a computer routine, and the associated amplitudes $g_{W}$ evaluated by using (63). The finite-size average amplitude $g_{P}^{\rm ave}$ , obtained as a flat average of the $2^{P}$ values of $g_{W}$ thus generated, is shown in Figure 2 against period $P\leq 30$ . The last point involves $2^{30}=1\,073\,741\,824$ different games. The plotted quantity oscillates as a function of the period. These finite-size effects are however exponentially damped, and so $g_{P}^{\rm ave}$ converges very fast to the asymptotic limit $1/4$ , consistently with (41).

Let us now investigate which game yields the largest Parrondo effect, i.e., the largest gain amplitude. The maximal amplitude $g_{P}^{\rm max}$ among all $2^{P}$ periodic games with given period $P$ is shown in Figure 3 against $P$ . For the sake of clarity, the plotted range has been limited to $5\leq P\leq 30$ . The amplitude of the periodic game with period 5 and unit cell $W=ABABB$ , i.e.,

[TABLE]

(see Table 1), appears as the absolute maximum of the gain amplitudes of all games, irrespective of their periods. Whenever the period $P$ is a multiple of 5, the absolute maximum $g^{\rm max}$ is reached for the game whose unit cell is a repetition of $p/5$ times $W$ . If $P$ is not a multiple of 5, there are suboptimal periodic games whose gains converge, albeit rather slowly, to (69).

We make a digression out of the weak-contrast regime to mention that the periodic game $ABABB$ yields the highest gain for all values of the contrast parameter $v$ . Its gain in the $v\to 1$ limit, i.e.,

[TABLE]

is the absolute maximal gain of the model in the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). The $v\to 1$ limit is however singular (see below (38)). The universal optimality of the game $ABABB$ was already demonstrated by Dinis [33] by means of an algorithmic approach based upon backward induction.

It is interesting to notice that the values of $\rho$ yielding the maximal gain of random games, given by (38) for $v\to 1$ and (43) for $v\to 0$ , are very close to $3/5$ , characteristic of the optimal periodic game $ABABB$ . The gains achieved by those optimal random games are however far below the truly optimal values, given by (70) for $v\to 1$ and (69) for $v\to 0$ .

3.4 Aperiodic games

The expression (63) for the gain amplitude extends to any aperiodic game, either deterministic or random. Taking formally the $P\to\infty$ limit, forgetting about boundary conditions, we obtain

[TABLE]

In this expression,

[TABLE]

is the density of letters $B$ , i.e., the fraction of steps where Rule $B$ is chosen, whereas

[TABLE]

are the three-point correlation functions of the distribution of letters $B$ , depending on two distances $l$ and $m$ . The damping factor $\kappa^{l+m}$ ensures an exponential convergence of (71) for all aperiodic games with well-defined translationally invariant correlations.

Hereafter we consider two examples of aperiodic games in more detail. Games generated by chaotic dynamical systems have already been considered in the past [34]. The following examples are more directly inspired by the physics of 1D systems. The first example (Section 3.4.1) consists of an enrichment of the random games considered in Section 3.2 by the introduction of a memory kernel. The gain amplitude exhibits a smooth dependence on parameters (see Figure 5). The second example (Section 3.4.2) is based on quasiperiodic cut-and-project sequences. The amplitude has an irregular dependence on parameters (see Figure 7).

3.4.1 Random games with Markovian memory

In Sections 2.2 and 3.2 we have considered random games where at each time step the rule is chosen at random, irrespective of past and future. In other words, the symbols $\sigma_{n}$ introduced in (44) are independent random variables.

The goal of this section is to consider a richer type of random games based on random sequences with Markovian memory, where at each step the rule is chosen with probabilities depending on the rule at the previous step. This setting allows two free parameters, namely the probabilities $\alpha$ and $\beta$ , such that333Here and throughout the following, w. p. is a shorthand for ‘with probability’.

[TABLE]

In other words the game, i.e., the rule pattern, is generated by an auxiliary Markov chain, whereas each rule, either $A$ or $B$ , itself amounts to a Markov chain – as before. The above setting can be encoded into the Markov matrix

[TABLE]

The stationary state of the auxiliary Markov process is described by the eigenvector ${\bm{r}}$ such that ${\bm{r}}={\bm{m}}{\bm{r}}$ , i.e.,

[TABLE]

We have therefore

[TABLE]

The second eigenvalue of the Markov matrix ${\bm{m}}$ , characterizing the range of the memory effect, reads

[TABLE]

In order to determine correlation functions, an explicit representation of powers of ${\bm{m}}$ is required. We have

[TABLE]

with $\alpha_{k+1}=\alpha+\lambda\alpha_{k}$ and $\beta_{k+1}=\beta+\lambda\beta_{k}$ , and so

[TABLE]

The Markovian property of the sequence defining the random game implies

[TABLE]

Inserting this expression into (71), the double sum boils down to geometric series. We are thus left with the explicit result

[TABLE]

The amplitude vanishes as $\rho\to 0$ and $\rho\to 1$ , where random games respectively become Rule $A$ and Rule $B$ . The random games considered in Sections 2.2 and 3.2 correspond to an absence of memory, i.e., $\lambda=0$ . The result (40) is thus recovered for the third time.

Figure 4 shows the parameter space of random sequences with Markovian memory. Allowed values of density $\rho$ and memory rate $\lambda$ lie inside the black curve. For $\lambda>0$ , where successive symbols are positively correlated, all values of the density $\rho$ can be realized. The gain vanishes linearly as $\lambda\to 1$ , i.e., when the mean block length diverges. For $\lambda<0$ , where successive symbols are negatively correlated, only a limited range of densities, i.e.,

[TABLE]

can be realized. The upper (resp. lower) bound corresponds to $\alpha=1$ (resp. $\beta=1$ ), where letters $A$ (resp. $B$ ) are isolated. In the $\lambda\to-1$ limit, the range shrinks to the single point $\rho=1/2$ , where the random game reduces to the periodic game $AB$ .

Figure 5 shows the dependence of the amplitude $g$ on the density $\rho$ of letters $B$ , as given by (82), for several values of the memory rate $\lambda$ .

For fixed $\lambda$ , $g$ reaches its maximum

[TABLE]

for

[TABLE]

The dependence of this optimal density on $\lambda$ is shown in Figure 4 as a red curve. The latter leaves the range of allowed densities as it hits the $\alpha=1$ boundary for $\beta=1/2$ , i.e., $\lambda=-1/2$ and $\rho=2/3$ , where $g=32/81=0.395061$ .

The gain amplitude however reaches a slightly higher absolute maximum,

[TABLE]

somewhere further along the $\alpha=1$ boundary, i.e., for

[TABLE]

This optimal point is shown as blue square symbols in Figures 4 and 5.

3.4.2 Cut-and-project quasiperiodic games

Our second example of aperiodic games is very different in spirit. It is generated by the deterministic quasiperiodic cut-and-project sequences. These sequences, investigated first by de Bruijn [35], are in correspondence with irrational numbers $\omega$ . They have been extensively used to build model quasiperiodic structures that are 1D analogues of quasicrystals. In particular, for $\omega=1/\tau$ and $\omega=1/\tau^{2}$ , where $\tau=(1+\sqrt{5})/2=1.618033$ is the golden mean, Fibonacci sequences are obtained, which are germane to the first icosahedral quasicrystals, discovered in 1984 [36] (see [37, 38] for overviews). Since then, much attention has been paid to cut-and-project and other deterministic aperiodic sequences and to various physical models based upon these structures (see [39, 40] for reviews).

The cut-and-project sequence is based on an irrational rotation number in the range $0<\omega<1$ . Consider the points obtained by rotating around the unit circle in discrete steps by the angle $\omega$ , measured in revolutions, i.e., in units of $2\pi$ . The angle reached after $n$ steps reads

[TABLE]

where $\mathop{\rm Frac}\nolimits(x)=x-\mathop{\rm Int}\nolimits(x)$ is the fractional part of a real number $x$ , with $\mathop{\rm Int}\nolimits(x)$ being its integer part. The binary cut-and-project sequence of symbols $\sigma_{n}$ is defined by setting

[TABLE]

where

[TABLE]

In other words, we have $\sigma_{n}=1$ if the angle $x_{n}$ is in the interval $[0,\,\omega[$ , and $\sigma_{n}=0$ otherwise.

We consider the infinitely long Parrondo game defined by choosing Rule $A$ (resp. Rule $B$ ) at step $n$ if $\sigma_{n}=0$ (resp. $\sigma_{n}=1$ ), consistently with (44). For all irrational rotation numbers $\omega$ , the sequence $x_{n}$ is uniformly distributed over $[0,\;1]$ , so that the density of letters $B$ , i.e., the fraction of steps where Rule $B$ is chosen, reads

[TABLE]

The fluctuations in the letter numbers, measured by the differences

[TABLE]

belong to the interval $-1\leq\delta_{n}\leq 0$ . They are therefore bounded, whereas they would typically grow as $\sqrt{n}$ for a random sequence.

The correlation function $C_{l,m}$ is given by the length of the set of values of $x$ such that the three numbers $x$ , $\mathop{\rm Frac}\nolimits(x+l\omega)$ and $\mathop{\rm Frac}\nolimits(x-m\omega)$ all belong to $[0,\;\omega]$ . The construction of this set is sketched in Figure 6, with the notations

[TABLE]

The expression

[TABLE]

synthesizes the six different possible orders between the four points $s_{l}$ , $t_{l}$ , $u_{m}$ and $v_{m}$ (we have always $s_{l}<t_{l}$ and $u_{m}<v_{m}$ ).

Figure 7 shows the gain amplitude $g$ against the rotation number $\omega$ of the cut-and-project game, as obtained by inserting the expressions (92) and (95) into (71), evaluating individual terms and performing the sum numerically.

The amplitude $g$ appears to be a continuous function of $\omega$ , exhibiting cusps at rational values of $\omega$ , around which it varies linearly, albeit with two different slopes to the left and to the right. If $\omega$ goes to a rational $Q/P$ , assumed irreducible, the corresponding sequence becomes periodic, with period $P$ . Only a very specific subset of periodic sequences is attained in this way. The last column of Table 1 gives the values of $\omega$ corresponding to all periodic games thus obtained with primitive periods $P\leq 6$ . The corresponding data points are shown as blue symbols in Figure 7. The amplitude vanishes only for $\omega=0$ (Rule $A$ ), $\omega=1$ (Rule $B$ ) and $\omega=1/2$ (periodic game $AB$ ). It reaches its maximum (see (69)) for $\omega=3/5$ .

The amplitude vanishes linearly in the vicinity of both endpoints ( $\omega\to 0$ and $\omega\to 1$ ), up to exponentially small deviations. For $\omega\to 0$ , the smallest distances yielding a non-zero three-point correlation function $C_{l,m}$ are $m=l-1=\mathop{\rm Int}\nolimits(1/\omega)$ . A similar line of reasoning applies to $\omega\to 1$ as well. We thus obtain the estimates

[TABLE]

4 History-dependent games

4.1 Generalities

We now turn to history-dependent Parrondo games [5, 18, 19]. In this second class of games, the walker moves either right or left at step $t$ , i.e., its $t$ th step

[TABLE]

is chosen to be either $\varepsilon_{t}=+1$ or $\varepsilon_{t}=-1$ , with probabilities which are independent of its position $n_{t}$ , but depend on the $Q$ previous steps, in a way that is different for Rules $A$ and $B$ .

Hereafter we restrict the analysis to the smallest relevant memory range, i.e., $Q=2$ . It is sufficient to characterize the system by the four-dimensional time-dependent state vector

[TABLE]

with

[TABLE]

The mean displacement during the $t$ th step reads

[TABLE]

where the displacement vector reads

[TABLE]

The expression (1) of the gain therefore translates to

[TABLE]

The usual class of history-dependent Parrondo games consists of a combination of the following rules [5, 18, 19].

$\bullet$

Rule $A$ . This rule coincides with Rule $A$ in capital-dependent games. In the present setting, each step is chosen according to

[TABLE]

irrespective of the past, where the notation $p$ is consistent with Sections 2 and 3. Therefore, if Rule $A$ is played at time $t$ , we have

[TABLE]

with

[TABLE]

If Rule $A$ is played alone, the walker executes a uniformly biased random walk. Its stationary state reads

[TABLE]

We have (see (103))

[TABLE]

i.e.,

[TABLE]

consistently with (14).

$\bullet$

Rule $B$ . This is the most general rule with memory range $Q=2$ , If Rule $B$ is played at time $t$ , the displacement $\varepsilon_{t}=\pm 1$ is chosen according to the following stochastic rules, depending on the two previous steps $(\varepsilon_{t-2},\varepsilon_{t-1})$ :

[TABLE]

with the notation $q_{i}=1-p_{i}$ . The $p_{i}$ are considered as four free parameters.

We have therefore

[TABLE]

with

[TABLE]

If Rule $B$ is played alone, the stationary state of the system reads

[TABLE]

with

[TABLE]

and

[TABLE]

We have (see (103))

[TABLE]

i.e.,

[TABLE]

Hereafter the main focus will again be on the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). The condition for Rule $A$ to be fair is again (25), expressing that the corresponding random walk is symmetric. The condition that Rule $B$ is fair reads

[TABLE]

This non-linear relation leaves three free parameters. We choose the parametrization

[TABLE]

and introduce for further convenience the logarithmic co-ordinates

[TABLE]

Figure 8 shows the parameter space of the neutral situation in the ( $u,v$ ) plane, for a fixed value of $a$ in the range $0<a<1$ . Allowed parameter values lie inside a square with vertices C $(\lambda,0)$ , E $(0,\lambda)$ , F $(-\lambda,0)$ and H $(0,-\lambda)$ . The edges of the square correspond to limiting cases: we have $p_{4}=1$ along CE, $q_{2}=1$ along EF, $p_{3}=1$ along FH and $q_{1}=1$ along HC. Symbols $+$ and $-$ refer to the sign of the gain (see below (126)). The midpoints D ( $q_{1}=q_{2}=a$ , $p_{3}=a^{2}$ , $p_{4}=1$ ) and G ( $q_{1}=q_{2}=a$ , $p_{3}=1$ , $p_{4}=a^{2}$ ) of the edges CE and FH play a part in the subsequent discussion.

Parity, i.e., the change of sign of the walker’s position ( $n\longleftrightarrow-n$ ), amounts to exchanging parameters according to $p\longleftrightarrow q$ for Rule $A$ , and for Rule $B$ $p_{4}\longleftrightarrow q_{1}$ , $p_{3}\longleftrightarrow q_{2}$ , i.e., $c\longleftrightarrow 1/c$ or $v\longleftrightarrow-v$ . Parity therefore amounts to a reflection of Figure 8 with respect to its horizontal $u$ -axis. No symmetry is associated with the reflection of Figure 8 with respect to its vertical $v$ -axis. Moreover, at variance with the capital-dependent games considered in Sections 2 and 3, the history-dependent Parrondo games considered here do not exhibit any simple transformation under time reversal.

4.2 Random games

The first situation of interest demonstrating Parrondo’s paradox is again that of random games, where at each time step Rule $B$ is chosen with probability $\rho$ and Rule $A$ with the complementary probability $1-\rho$ . In order to determine the average gain ${\overline{G}}$ of random games, it is sufficient to know the stationary average state vector ${\overline{{\bm{\phi}}}}$ . The average Markov matrix ${\overline{{\bm{M}}}}$ (see (31)) again has the same functional form as ${\bm{M}}_{B}$ , with effective parameters

[TABLE]

The average gain ${\overline{G}}$ is obtained by replacing all parameters entering (117) by the above effective values.

For the uniformly random game ( $\rho=1/2$ ), where at each time step Rules $A$ and $B$ are chosen with equal probabilities, we thus obtain

[TABLE]

with

[TABLE]

The expression (122) again allows one to measure the rarity of Parrondo’s paradox. We define the probability of observing Parrondo’s paradox as the volume of the five-dimensional domain in $(p,p_{1},p_{2},p_{3},p_{4})$ space such that the inequalities (2) hold, with $G$ given by (122). A numerical integration again yields a very small number (see (35))

[TABLE]

From now on, we restrict the analysis to history-dependent Parrondo games in the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). Using the parametrization (25), (119), we obtain the following expression for the average gain:

[TABLE]

with

[TABLE]

The expression (125) shows that the gain has the sign of the product $(b^{2}-1)(c^{2}-1)$ , i.e., equivalently, of the product $uv$ , irrespective of $a$ and of the probability $\rho$ . Therefore, in the neutral situation under consideration, Parrondo’s paradox holds in one half of parameter space, i.e., in the two regions marked by $+$ signs in Figure 8. The average gain vanishes as $\rho\to 0$ and $\rho\to 1$ , where random games respectively degenerate to Rule $A$ and Rule $B$ . It reaches its absolute maximum,

[TABLE]

in the limit where $a\to 0$ and $\rho\to 1$ simultaneously. More precisely, for a fixed small value of $a$ , the average gain ${\overline{G}}$ reaches its maximum with respect to $\rho$ , $b$ and $c$ for

[TABLE]

The corresponding point in Figure 8 is along the edge FH and close to its midpoint G. This maximum reads

[TABLE]

so that (127) is attained in the $a\to 0$ limit. This limit is however singular – irrespective of the parameters $b$ and $c$ , provided they remain in the allowed range – as another eigenvalue of the Markov matrix ${\bm{M}}_{B}$ goes to unity, so that the latter matrix loses its property of unique ergodicity.

The weak-contrast scaling regime is defined by the conditions that both parameters $b$ and $c$ are close to unity, i.e., that $u$ and $v$ are simultaneously small. This scaling regime therefore corresponds to zooming on the center of Figure 8. At variance with the situation of capital-dependent games, in the present case the weak-contrast regime keeps one free parameter, $a$ . For random games, the expression (125) for the average gain vanishes proportionally to $uv$ . We shall see in Section 5 that a similar scaling holds for arbitrary games. We are thus led to introduce the gain amplitude

[TABLE]

For random games, (125) yields

[TABLE]

For the uniformly random game ( $\rho=1/2$ ), this reads

[TABLE]

When the probability $\rho$ of choosing Rule $B$ varies between 0 and 1, the amplitude (131) reaches its maximum

[TABLE]

for

[TABLE]

4.3 Periodic games

We now turn to periodic games, defined by the periodic repetition of a unit cell $W$ of length $P$ . Here, too, the stationary state of the game has the same period $P$ as the game itself. It is encoded in $P$ state vectors ${\bm{\phi}}_{n}$ obeying

[TABLE]

with the notation (44), and with periodic boundary conditions ( ${\bm{\phi}}_{P}={\bm{\phi}}_{0}$ ). The associated gain reads

[TABLE]

(see (103)). The recursion (135) amounts to a system of $4P$ linear equations. The complexity of the expressions of the gain $G$ again grows very rapidly with the period $P$ . The gain is invariant under cyclic permutations, but not under reversal of the unit cell. Its expressions for periods 2 and 3 are as follows.

$\bullet$

$P=2$ . There is only one unit cell with period 2. The corresponding gain reads

[TABLE]

$\bullet$

$P=3$ . There are two unit cells with period 3. The corresponding gains read

[TABLE]

with

[TABLE]

The above expressions demonstrate that the gain vanishes proportionally to $(b^{2}-1)(c^{2}-1)$ , i.e., to $uv$ in the weak-contrast regime. The corresponding gain amplitudes (see (130)) are listed in the first three lines of Table 2.

5 Weak-contrast scaling regime of history-dependent games

5.1 Generalities

The problem again simplifies in the weak-contrast regime ( $u,v\to 0$ ). Hereafter we use the shorthand notation

[TABLE]

so that $0<a<1$ translates to $|\mu|<1$ .

For the periodic games considered in Section 4.3, the matrix recursion (135) boils down to two coupled linear recursion relations for the rescaled co-ordinates

[TABLE]

namely, with the notation (44):

[TABLE]

with periodic boundary conditions ( $y_{P}=y_{0}$ , $x_{P}=x_{0}$ ). The gain amplitude (see (130)) reads

[TABLE]

Here, too, the above formalism extends to aperiodic games (see Section 5.4).

There are analogies and differences between the studies of the weak-contrast regimes exposed in Sections 3.1 and 5.1. The main difference is that in (54), (55) the damping factor $\kappa$ is uniform and the variable $\sigma_{n}$ encoding the rule applied at step $n$ enters linearly, whereas the full structure of the recursions (144), (145) depends on $\sigma_{n}$ .

5.2 Random games

As a first application of the above formalism, let us revisit random games, considered in Section 4.2. As a consequence of (144), (145), the stationary averages ${\overline{x}}$ and ${\overline{y}}$ obey

[TABLE]

hence

[TABLE]

The result (131) is thus recovered.

5.3 Periodic games

We now revisit the situation of periodic games, considered in Section 4.3. At variance with (54), (55), where the variable $\sigma_{n}$ enters linearly, allowing for the explicit solution (63), in the present situation (144), (145) cannot be solved in closed form for periodic games with arbitrary unit cell $W$ .

An explicit formula for the gain amplitude can however be obtained in the case where the unit cell consists of only two blocks (see (64)), i.e.,

[TABLE]

with $M$ , $N\geq 1$ and $P=M+N$ . The form of the result depends on whether $M$ is one or larger, and on the parity of $N$ . Omitting details, we obtain

[TABLE]

When both blocks lengths $M$ and $N$ become simultaneously large, the amplitude falls off as

[TABLE]

up to exponentially small corrections. This $1/P$ fall-off can again be interpreted by stating that only the interfaces between blocks yield some gain.

We now turn to general features of interest exhibited by the amplitudes of periodic games. The dependence of the amplitude $g_{W}$ on the unit cell $W$ again appears to be very intricate in general. Table 2 gives the product $Pg_{W}$ for all periodic games with primitive period $P\leq 6$ . The explicit results (152)–(155) yield 15 of the 21 expressions given there, whereas the remaining six cases require a specific solution of the recursion (144), (145).

The following characteristics emerge from the results listed in Table 2. For all periodic games, the product $Pg_{W}$ is a polynomial in $\mu$ with integer coefficients. At variance with the case of capital-dependent games, the gain amplitude is not invariant under time reversal. The two unit cells of period 6 marked by asterisks are the shortest ones exhibiting this lack of symmetry. They are time-reversed of each other and have different amplitudes.

The situation where $\mu=0$ , i.e., $a=1/2$ , is very special. Indeed, for $u=v=0$ both Rule $A$ and Rule $B$ correspond to symmetric random walks. This is the only case where an exact expression of the gain amplitude $g_{W}$ can be obtained for all periodic games, namely

[TABLE]

where $\nu$ is the number of blocks of letters $A$ (or, equivalently, of blocks of letters $B$ ) in the unit cell $W$ . In other words, $2\nu$ is the number of interfaces between blocks per period.

The maximal gain amplitude is reached for either the first or the second of the periodic games listed in Table 2, according to values of $a$ , namely

[TABLE]

It has been checked by means of an exhaustive enumeration that no higher gain is reached for periods up to $P=30$ . For $a=1/2$ , the above result is a consequence of (157), as the ratio $\nu/P$ reaches its maximum $1/2$ for the periodic game $AB$ . It however comes as a surprise that $AB$ remains the optimal game over three quarters of the range of the parameter $a$ .

We again make a digression out of the weak-contrast regime in order to look at the maximal gain of the history-dependent Parrondo game all over its parameter space. For fixed $a$ , the periodic games $AB$ and $AAB$ reach their respective highest gain, namely

[TABLE]

at both midpoints D and G (see Figure 8). For fixed $a$ in the range $a>1/2$ , the maximal gain – over $b$ and $c$ and over all possible rule patterns – is always the larger of both expressions given in (159). The situation is however different for $a<1/2$ . There, the optimal periodic game undergoes an infinite sequence of transitions towards longer and longer periods as $a$ becomes smaller and smaller. The absolute maximal gain is given by

[TABLE]

This limiting value was already encountered in the framework of random games (see (127)). It is approached in the coupled singular limit where $a\to 0$ , whereas the periods of optimal rule patterns diverge.

5.4 Aperiodic games

The formalism of Section 5.1 extends to any aperiodic game, either deterministic or random. We do not have any analytical result such as (71). Nevertheless, the recursions (144), (145) can be iterated by numerical means for any given aperiodic sequence. Because of the exponential damping property of these recursions, very accurate numerical values of the amplitude $g$ can be obtained, especially in situations where the fluctuations $\delta_{n}$ defined in (93) are small.

We again consider the cut-and-project aperiodic game introduced in Section 3.4.2. Figure 9 shows plots of the gain amplitude $g$ against the rotation number $\omega$ defining the cut-and-project sequence, for several values of the parameter $a$ . Curves for $a\leq 1/2$ and $a\geq 1/2$ are shown in two separate panels, for the sake of clarity.

For $a=1/2$ , the result (157) translates to

[TABLE]

The corresponding triangular shape is shown in black in both panels of Figure 9. For $\omega\leq 1/3$ , all letters $B$ are isolated and separated from each other by at least two letters $A$ . Setting $k=0$ in the expression (155), we predict that each letter $B$ in the sequence brings a contribution $1-\mu=2a$ to the gain. We thus obtain the linear law

[TABLE]

that is clearly visible to the left of the vertical dashed lines in both panels of Figure 9. As a general rule, the dependence of the amplitude $g$ on the rotation number $\omega$ exhibits more and more pronounced fine details as $|\mu|$ grows, i.e., as $a$ departs from $1/2$ on both sides. Red curves correspond to the largest values of $|\mu|$ , namely $\mu=4/5$ ( $a=1/10)$ in the upper panel, and $\mu=-4/5$ ( $a=9/10)$ in the lower panel.

Figure 10 shows the dependence of the amplitude $g$ on the parameter $a$ for four typical irrational rotation numbers: $\omega_{1}=1/\tau=(\sqrt{5}-1)/2$ , $\omega_{2}=1/\tau^{2}=(3-\sqrt{5})/2$ , $\omega_{3}=\sqrt{2}-1$ , $\omega_{4}=2-\sqrt{2}$ . The first two numbers are related to Fibonacci (or golden-mean) sequences, the last two to octonacci (or silver-mean) sequences (see [37, 38] for overviews). The amplitude ${\overline{g}}$ of the uniformly random game (see (132)) and the maximal amplitude $g^{\rm max}$ (see (158)) are also shown for comparison.

6 Overview

This paper is aimed at being part of a special issue on the theory of disordered systems. It has been written in a fully self-contained manner. Of course, we have no claim to compete with either historical [6, 7, 8, 9] or very recent [41] reviews on Parrondo games and Parrondo’s paradox. Our motivation was to draw on the analogy between the temporal products of non-commuting Markov matrices involved in the study of Parrondo games and the spatial products of non-commuting transfer matrices which are ubiquitous in the physics of 1D disordered systems. There are many similarities as well as differences between both situations. The most salient common feature is that the non-commutativity of the matrix products ascribes a crucial role to the order of factors, representing either the rule pattern in Parrondo games or the positions of impurities in disordered chains. Markov matrices however enjoy a very specific property. They conserve probability, and so the entries of products of Markov matrices are bounded by unity. The concept of Lyapunov exponent, which is otherwise central in most situations involving products of random matrices, is therefore virtually useless in the present setting.

The investigations of Parrondo games reported here have been freely inspired by the theory of 1D disordered systems. We have dealt with both capital-dependent and history-dependent Parrondo games on the same footing in a systematic way, by means of a mapping onto a random walker on the 1D lattice. Within this unifying framework, the gain $G$ of the player identifies with the velocity of the walker’s ballistic motion. For definiteness, we have chosen one paradigmatic game in each class, and focussed our attention onto the neutral situation where each rule, when played alone, is fair ( $G_{A}=G_{B}=0$ ). The main emphasis is on the dependence of the gain on the remaining free parameters and, more importantly, on the game, i.e., the rule pattern, be it periodic or aperiodic, deterministic or random.

One of the most original sides of this work is the identification of weak-contrast regimes for both classes of Parrondo games considered here, and a detailed quantitative investigation of the gain in the latter scaling regimes. For the capital-dependent game mod 3 introduced in Section 2, encompassing Parrondo’s historical example, one single asymmetry parameter $v$ characterizes the neutral situation. The weak-contrast regime, studied in Section 3, corresponds to $v\to 0$ , where the gain of a generic game scales as $G\approx gv^{3}$ . For the two-step history-dependent game introduced in Section 4, the neutral situation is richer, as it depends on three parameters. The weak-contrast regime, studied in Section 5, corresponds to both relevant asymmetry parameters $u$ and $v$ being simultaneously small. The gain of a generic game now scales as $G\approx guv$ . For both classes of games, the determination of the gain amplitude $g$ has been reduced to the solution of two coupled linear recursions. This reduction allowed us to derive a wealth of novel results on both classes of Parrondo games. It is expected that more complex Parrondo games, with either $K>3$ for capital-dependent games or $Q>2$ for history-dependent games, admit weak-contrast scaling regimes in full generality, even though the number of remaining relevant parameters in those regimes grows very fast with the complexity of the game.

Bibliography41

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] G.P. Harmer, D. Abbott, Nature 402 , 864 (1999)
2[2] P.V.E. Mc Clintock, Nature 401 , 23 (1999)
3[3] G.P. Harmer, D. Abbott, Statist. Sci. 14 , 206 (1999)
4[4] G.P. Harmer, D. Abbott, P.G. Taylor, Proc. R. Soc. Lond. A 456 , 247 (2000)
5[5] J.M.R. Parrondo, G.P. Harmer, D. Abbott, Phys. Rev. Lett. 85 , 5226 (2000)
6[6] G.P. Harmer, D. Abbott, P.G. Taylor, J.M.R. Parrondo, Chaos 11 , 705 (2001)
7[7] G.P. Harmer, D. Abbott, Fluc. Noise Lett. 2 , R 71 (2002)
8[8] J.M.R. Parrondo, L. Dinis, Contemp. Phys. 45 , 147 (2004)