Quantitative evaluation of an active Chemotaxis model in Discrete time

Abhishek Pal Majumder

arXiv:1701.02064·math.PR·January 10, 2017


TL;DR

This paper develops a discrete-time, nonlinear model for active chemotaxis involving interacting particles and medium concentration, providing stability analysis and convergence results for large particle systems.

Contribution

It introduces a new discrete-time formulation of an active chemotaxis model with non-linear interactions, extending previous work by removing restrictive domain assumptions.

Findings

01 Established conditions for unique fixed points in the dynamical system.

02 Proved uniform convergence rates of particle empirical measures to the limit.

03 Extended stability analysis to unbounded domain settings.

Abstract

A system of $N$ particles in a chemical medium in $R^{d}$ is studied in a discrete time setting. Underlying interacting particle system in continuous time can be expressed as \begin{eqnarray} dX_{i}(t) &=&[-(I-A)X_{i}(t) + \bigtriangledown h(t,X_{i}(t))]dt + dW_{i}(t), \,\, X_{i}(0)=x_{i}\in \mathbb{R}^{d}\,\,\forall i=1,\ldots,N\nonumber\\ \frac{\partial}{\partial t} h(t,x)&=&-\alpha h(t,x) + D\bigtriangleup h(t,x) +\frac{\beta}{n} \sum_{i=1}^{N} g(X_{i}(t),x),\quad h(0,\cdot) = h(\cdot).\label{main} \end{eqnarray} where $X_{i} (t)$ is the location of the $i$ th particle at time $t$ and $h (t, x)$ is the function measuring the concentration of the medium at location $x$ with $h (0, x) = h (x)$ . In this article we describe a general discrete time non-linear formulation of the aforementioned model and a strongly coupled particle system approximating it. Similar models have been studied…

Equations

dX_{i}(t)

\frac{\partial}{\partial t} h(t,x)

\|\mu\|_{k} := \Big(\int |x|^{k}\,d\mu(x)\Big)^{\frac{1}{k}} < \infty.

X^{+} = Ax + \delta f(\nabla\eta(x),\mu,x,\epsilon) + B(\epsilon),

X_{n+1}^{i}

\eta^{+}(y) = \int_{\mathbb{R}^{d}} \eta(x) R_{\mu}^{\alpha}(x,y)\,l(dx)

R_{\mu}^{\alpha}(x,C) := (1-\alpha)P(x,C) + \alpha\,\mu P^{\prime}(C), \quad x\in\mathbb{R}^{d},\ C\in\mathcal{B}(\mathbb{R}^{d}).

\eta_{n+1}^{N}(y) = \int_{\mathbb{R}^{d}} \eta_{n}^{N}(x) R_{\mu_{n}^{N}}^{\alpha}(x,y)\,l(dx).

\mathcal{W}_{1}(\mu_{0},\gamma_{0}) := \inf_{X,Y} E|X-Y|, \quad \mu_{0},\gamma_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d}),

\mathcal{W}_{1}(\mu_{0},\gamma_{0}) = \sup_{f\in\mbox{Lip}_{1}(\mathbb{R}^{d})} |\langle f,\mu_{0}-\gamma_{0}\rangle|, \quad \mu_{0},\gamma_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d}).

\mathcal{L}(Y_{1},Y_{2},\ldots,Y_{N}) = \mathcal{L}(Y_{\pi(1)},Y_{\pi(2)},\ldots,Y_{\pi(N)})

\lim_{N\to\infty} \langle f_{1}\otimes f_{2}\otimes\cdots\otimes f_{k}\otimes 1\cdots\otimes 1,\nu_{N}\rangle = \prod_{i=1}^{k}\langle f_{i},\nu\rangle.

\nabla_{x}f(x,y) := \Big(\frac{\partial f}{\partial x_{1}},\frac{\partial f}{\partial x_{2}},\ldots,\frac{\partial f}{\partial x_{d}}\Big)^{\prime}.

Q^{\rho,\mu}(x,C) = \int_{\mathbb{R}^{m}} 1_{\{Ax+\delta f(\nabla\rho(x),\mu,x,z)+B(z)\in C\}}\,\theta(dz), \quad (x,C)\in\mathbb{R}^{d}\times\mathcal{B}(\mathbb{R}^{d}).

Q^{\rho,\mu}\phi(x) = \int_{\mathbb{R}^{d}} \phi(y)\,Q^{\rho,\mu}(x,dy), \quad \phi\in BM(\mathbb{R}^{d}),\ x\in\mathbb{R}^{d}.

\mu Q^{\rho,\mu_{1}}(C) = \int_{\mathbb{R}^{d}} Q^{\rho,\mu_{1}}(x,C)\,\mu(dx), \quad C\in\mathcal{B}(\mathbb{R}^{d}).

\Psi(\mu,\eta)

\begin{cases} P(X_{k}(N)\in C \mid \mathcal{F}_{k-1}^{N}) = \bigotimes_{i=1}^{N}\big(\delta_{X_{k-1}^{i}} Q^{\eta_{k-1}^{N},\mu_{k-1}^{N}}\big)(C) \quad \forall C\in\mathcal{B}(\mathbb{R}^{dN}), \\ \mu_{k}^{N} = \frac{1}{N}\sum_{i=1}^{N}\delta_{X_{k}^{i}}, \\ \eta_{k}^{N} = \eta_{k-1}^{N}R_{\mu_{k-1}^{N}}^{\alpha}, \\ \mathcal{F}_{k}^{N} = \sigma\{\eta_{k}^{N},X_{k}(N)\}\vee\mathcal{F}_{k-1}^{N}. \end{cases}

\mu_{n+1} = \mu_{n}Q^{\eta_{n},\mu_{n}}, \quad \eta_{n+1} = \eta_{n}R_{\mu_{n}}^{\alpha}, \quad n\geq 0.

(\mu_{n+1},\eta_{n+1}) = \Psi(\mu_{n},\eta_{n}), \quad n\in\mathbb{N}_{0}.

A_{1}(\epsilon)

|f(y,\mu,x,\epsilon)| \leq (|y|+\|\mu\|_{1}+|x|)A_{1}(\epsilon) + A_{2}(\epsilon)

\int_{\mathbb{R}^{m}}\Big(A_{2}(z)+|B(z)|\Big)\theta(dz) < \infty.

|\nabla_{y}P(x,y) - \nabla_{y}P(x^{\prime},y^{\prime})|

|\nabla_{y}P^{\prime}(x,y) - \nabla_{y}P^{\prime}(x^{\prime},y^{\prime})|

\sup_{x\in\mathbb{R}^{d}} \{|\nabla_{y}P(x,0)| \vee |\nabla_{y}P^{\prime}(x,0)|\} < \infty.

\sup_{x\in\mathbb{R}^{d}} |\nabla_{y}P(x,y)| \leq M_{P}^{\nabla}(1+|y|).

\sup_{f\in\mbox{Lip}_{1}(\mathbb{R}^{d})}\ \sup_{x\neq y\in\mathbb{R}^{d}} \frac{Pf(x)-Pf(y)}{|x-y|} := l(P) < \infty

Pf(\cdot) := Ef(g_{1}(\cdot,\varepsilon_{1})), \quad P^{\prime}f(\cdot) := Ef(g_{2}(\cdot,\varepsilon_{2}))

E(G_{1}(\varepsilon_{1})) \leq l(P) \text{ and } E(G_{2}(\varepsilon_{2})) \leq l(P^{\prime}),


Taxonomy

Topics: Micro and Nano Robotics · 3D Printing in Biomedical Research · Mathematical Biology Tumor Growth

Full text

Quantitative evaluation of an active Chemotaxis model in Discrete time.

Abhishek Pal Majumder University of Copenhagen

Abstract

A system of $N$ particles in a chemical medium in $\mathbb{R}^{d}$ is studied in a discrete time setting. The underlying interacting particle system in continuous time can be expressed as

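$$\begin{aligned} dX_{i}(t) &= [-(I-A)X_{i}(t)+\nabla h(t,X_{i}(t))]\,dt + dW_{i}(t), \quad X_{i}(0)=x_{i}\in\mathbb{R}^{d}\ \ \forall i=1,\ldots,N, \\ \frac{\partial}{\partial t}h(t,x) &= -\alpha h(t,x) + D\Delta h(t,x) + \frac{\beta}{n}\sum_{i=1}^{N}g(X_{i}(t),x), \quad h(0,\cdot)=h(\cdot), \end{aligned} \tag{0.1}$$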

where $X_{i}(t)$ is the location of the $i$th particle at time $t$ and $h(t,x)$ is the function measuring the concentration of the medium at location $x$, with $h(0,x)=h(x)$. In this article we describe a general discrete time non-linear formulation of the model (0.1) and a strongly coupled particle system approximating it. Similar models have been studied before (Budhiraja et al. (2010)) under a restrictive compactness assumption on the domain of particles. In the current work the particles take values in $\mathbb{R}^{d}$ and consequently the stability analysis is particularly challenging. We provide sufficient conditions for the existence of a unique fixed point for the dynamical system governing the large $N$ asymptotics of the particle empirical measure. We also provide uniform in time convergence rates for the particle empirical measure to the corresponding limit measure under suitable conditions on the model.

AMS 2010 subject classifications: Primary 60J05, 60K35, 60F10.

Keywords: Weakly interacting particle system, propagation of chaos, nonlinear Markov chains, Wasserstein distance, McKean-Vlasov equations, exponential concentration estimates, transportation inequalities, metric entropy, stochastic difference equations, long time behavior, uniform concentration estimates.

1 Introduction

There has been a surge of research activity aimed at understanding the collective behavior of multi-agent systems in the long time limit. Motivation for such problems comes from various examples of self-organizing systems, such as consensus formation in opinion dynamics [11], active chemotaxis [3], self-organized networks [13], large communication systems [12], multi-target tracking [6] and swarm robotics [14] (additional applications can be found in [15]). One of the basic challenges is to understand how a large group of autonomous agents with decentralized local interactions gives rise to coherent behavior.

In this paper we consider a reduced model motivated by both [3],[5] for a system of interacting agents in a stochastic diffusing environment, variations of which have been proposed (see [3],[14] and references therein). Consider for each $i=1,\ldots,N$ $X_{i}(0)=x_{i}\in\mathbb{R}^{d}$

[TABLE]

Here $W_{i},i=1,...,N$ are independent Brownian motions that drive the state process $X_{i}$ of the $N$ interacting particles. The interaction between the particles arises directly from the evolution equation (1.1) and indirectly through the underlying potential field $h$, which changes continuously according to a diffusion equation and through the aggregated input of the $N$ particles. One example of such an interaction is chemotaxis, where cells preferentially move towards a higher chemical concentration and themselves release chemicals into the medium in response to local information on the environment, thus modifying the potential field dynamically over time. In this context, $h(t,x)$ represents the concentration of a chemical at time $t$ and location $x$. Diffusion of the chemical in the medium is captured by the Laplacian in (1.1), and the constant $\alpha>0$ models the rate of decay or dissipation of the chemical. The first equation in (1.1) describes the motion of a particle as a diffusion process with a drift consisting of three terms. The first term models a restoring force towards the origin, where the origin represents the natural rest state of the particles. The second term is the gradient of the chemical concentration and captures the fact that particles tend to move towards regions of higher chemical concentration. Finally, the third term captures the interaction (e.g. attraction or repulsion) between the particles. The contribution of the agents to the chemical concentration field is given through the last term in the second equation. The function $g$ captures the agent response rules and can be used to model a wide range of phenomena [15].

In [3] the authors considered a discrete time model which captures some of the key features of the dynamics in (1.1) and studied several long time properties of the system. One aspect that greatly simplified the analysis of [3] is that the state space of the particles is taken to be a compact set in $\mathbb{R}^{d}$. However, this requirement is restrictive and may be unnatural for the time scales at which the particle evolution is being modeled. In [14] the authors considered a number of variations of (1.1). The theoretical properties obtained in this work on the long time behavior of the particle system can also be applied to such systems with some minor modifications.

We now give a general description of the $N$-particle system that gives a discrete time approximation of the mechanism outlined above. The space of real valued bounded measurable functions on $S$ is denoted as $BM(S)$. The Borel $\sigma$-field on a metric space $S$ will be denoted as $\mathcal{B}(S)$. $\mathcal{C}_{b}(S)$ denotes the space of all bounded and continuous functions $f:S\to\mathbb{R}$. For a measurable space $S$, $\mathcal{P}(S)$ denotes the space of all probability measures on $S$. For $k\in\mathbb{N},$ let $\mathcal{P}_{k}(\mathbb{R}^{d})$ be the space of $\mu\in\mathcal{P}(\mathbb{R}^{d})$ such that

$$\|\mu\|_{k}:=\Big(\int|x|^{k}\,d\mu(x)\Big)^{\frac{1}{k}}<\infty.$$

Consider a system of $N$ interacting particles that evolve in $\mathbb{R}^{d}$, governed by a random dynamic chemical field, according to the following discrete time stochastic evolution equation given on some probability space $(\Omega,\mathbb{F},P)$. Suppose that the chemical field at time instant $n$ is given by a nonnegative $C^{1}$ (i.e. continuously differentiable) real function $\eta$ on $\mathbb{R}^{d}$ satisfying $\int_{\mathbb{R}^{d}}\eta(x)dx=1$. Then, given that the particle state at time instant $n$ is $x$ and the empirical measure of the particle states at time $n$ is $\mu,$ the particle state $X^{+}$ at time $(n+1)$ is given as

$$X^{+}=Ax+\delta f(\nabla\eta(x),\mu,x,\epsilon)+B(\epsilon), \tag{1.2}$$

where $A$ is a $d\times d$ matrix, $\delta$ is a small parameter, $\epsilon$ is an $\mathbb{R}^{m}$ valued random variable with probability law $\theta$ and $f:\mathbb{R}^{d}\times\mathcal{P}(\mathbb{R}^{d})\times\mathbb{R}^{d}\times\mathbb{R}^{m}\longrightarrow\mathbb{R}^{d}$ is a measurable function. Here we consider a somewhat more general form of dependence of the particle evolution on the concentration profile than the additive form that appears in (1.1). Additional assumptions on $A,\theta,f$ will be introduced shortly. The nonlinearity of the system (modeled by $f$ and $B$) can be very general, as described below. Denote by $X_{n}^{i}\equiv X_{n}^{i,N}$ (an $\mathbb{R}^{d}$ valued random variable) the state of the $i$-th particle $(i=1,\ldots,N)$ and by $\eta_{n}^{N}$ the chemical concentration field at time instant $n$. Let $\mu_{n}^{N}:=\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{n}^{i}}$ be the empirical measure of the particle values at time instant $n$. The stochastic evolution equation for the $N$-particle system is given as

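$$X_{n+1}^{i}=AX_{n}^{i}+\delta f(\nabla\eta_{n}^{N}(X_{n}^{i}),\mu_{n}^{N},X_{n}^{i},\epsilon_{n+1}^{i})+B(\epsilon_{n+1}^{i}),\quad i=1,\ldots,N,\ n\in\mathbb{N}_{0}. \tag{1.3}$$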

In (1.3) $\{\epsilon_{n}^{i},i=1,...,N,\quad n\geq 1\}$ is an i.i.d array of $\mathbb{R}^{m}$ valued random variables with common probability law $\theta$ . Here $\{X_{0}^{i},i=1,...,N\}$ are assumed to be exchangeable with common distribution $\mu_{0}$ where $\mu_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ . Note that in the notation we have suppressed the dependence of the sequence $\{X_{n}^{i}\}$ on $N$ .
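The particle update described above can be sketched in a few lines. This is only an illustration in dimension $d=1$ under stated assumptions: the drift `f`, the noise map `B`, the Gaussian stand-in for the chemical field and all parameter values are hypothetical choices made for the example, not the ones used in the paper.

```python
import math
import random


def grad_gaussian_density(x, mean=0.0, std=1.0):
    """Derivative at x of the N(mean, std^2) density (hypothetical stand-in for grad eta)."""
    z = (x - mean) / std
    density = math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))
    return -(x - mean) / (std * std) * density


def f(grad_eta_x, particles, x, eps):
    """Hypothetical drift: follow the chemical gradient and move toward the empirical mean.

    The noise argument eps is accepted to match the general form but is unused here.
    """
    mean_mu = sum(particles) / len(particles)
    return grad_eta_x + (mean_mu - x)


def B(eps):
    """Hypothetical additive noise term."""
    return eps


def particle_step(particles, a=0.5, delta=0.1, noise_std=0.1, rng=random):
    """One synchronous update X^+ = a*x + delta*f(grad eta(x), mu, x, eps) + B(eps) for every particle."""
    new_particles = []
    for x in particles:
        eps = rng.gauss(0.0, noise_std)  # independent noise per particle
        drift = f(grad_gaussian_density(x), particles, x, eps)
        new_particles.append(a * x + delta * drift + B(eps))
    return new_particles
```

Every particle sees the same empirical measure (the list `particles` from the current step), which is what couples the system; the field is kept fixed here, whereas in the model it is updated alongside the particles.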

We now describe the evolution of the chemical field approximating the second equation in (1.1) and its interaction with the particle system. A transition probability kernel on $S$ is a map $P:S\times\mathcal{B}(S)\to[0,1]$ such that $P(x,\cdot)\in\mathcal{P}(S)\quad\forall x\in S$ and for each $A\in\mathcal{B}(S),$ $P(\cdot,A)\in BM(S)$ . Given the concentration profile at time $n$ is a $C^{1}$ probability density function $\eta$ on $\mathbb{R}^{d}$ and the empirical measure of the state of $N$ -particles at time instant $n$ is $\mu$ , the concentration probability density $\eta^{+}$ at time $(n+1)$ is given by the relation

$$\eta^{+}(y)=\int_{\mathbb{R}^{d}}\eta(x)R_{\mu}^{\alpha}(x,y)\,l(dx), \tag{1.4}$$

where $l$ denotes the Lebesgue measure on $\mathbb{R}^{d},$ and $R^{\alpha}_{\mu}(x,y)$ is the Radon-Nikodym derivative of the transition probability kernel with respect to the Lebesgue measure $l(dy)$ on $\mathbb{R}^{d}.$ The kernel $R^{\alpha}_{\mu}$ is given as follows; we consider the same model as introduced in [3]. Let $P$ and $P^{\prime}$ be two transition probability kernels on $\mathbb{R}^{d}.$ For $\mu\in\mathcal{P}(\mathbb{R}^{d})$ and $\alpha\in(0,1)$ define the transition probability kernel $R_{\mu}^{\alpha}$ on $\mathbb{R}^{d}$ as

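$$R_{\mu}^{\alpha}(x,C):=(1-\alpha)P(x,C)+\alpha\,\mu P^{\prime}(C),\quad x\in\mathbb{R}^{d},\ C\in\mathcal{B}(\mathbb{R}^{d}).$$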

Here $P$ represents the background diffusion of the chemical concentration while $\delta_{x}P^{\prime}$ captures the contribution to the field by a particle with location $x$ . So the kernel $P^{\prime}$ gives a spike at origin which can be approximated by a smooth density function as $P(x,dy)=\frac{1}{\sqrt{2\pi}\lambda}e^{-\frac{(x-y)^{2}}{2\lambda^{2}}}dy$ with very small $\lambda>0$ . The parameter $\alpha$ gives a convenient way for combining the contribution from the background diffusion and the individual particles. For each $x\in\mathbb{R}^{d},$ both $P(x,\cdot)$ and $P^{\prime}(x,\cdot)$ are assumed to be absolutely continuous with respect to Lebesgue measure and throughout this article we will denote the corresponding Radon-Nikodym derivatives with the same notations $P(x,\cdot)$ and $P^{\prime}(x,\cdot)$ respectively. Additional properties of $P$ and $P^{\prime}$ will be specified shortly. The evolution equation for the chemical field is then given as

$$\eta_{n+1}^{N}(y)=\int_{\mathbb{R}^{d}}\eta_{n}^{N}(x)R_{\mu_{n}^{N}}^{\alpha}(x,y)\,l(dx). \tag{1.5}$$
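As a rough numerical illustration of the field update, note that with the mixture kernel $R_{\mu}^{\alpha}=(1-\alpha)P+\alpha\,\mu P^{\prime}$ and a density $\eta$ that integrates to one, the update reads $\eta^{+}(y)=(1-\alpha)\int\eta(x)P(x,y)\,dx+\alpha\frac{1}{N}\sum_{i}P^{\prime}(X^{i},y)$ when $\mu$ is the empirical measure of $X^{1},\ldots,X^{N}$. The sketch below evaluates this on a uniform one-dimensional grid with Riemann sums. The Gaussian kernels, their widths and the grid are assumptions made for the example only.

```python
import math


def gaussian_pdf(y, mean, std):
    """Density of N(mean, std^2) evaluated at y."""
    z = (y - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def field_step(eta, grid, particles, alpha=0.2, std_p=0.5, std_pp=0.3):
    """One update eta -> eta^+ on a uniform grid.

    Hypothetical kernels: P(x, .) = N(x, std_p^2) for background diffusion and
    P'(x, .) = N(x, std_pp^2) for the release of chemical by a particle at x.
    """
    dx = grid[1] - grid[0]
    eta_next = []
    for y in grid:
        # (1 - alpha) * integral of eta(x) P(x, y) dx, approximated by a Riemann sum
        diffusion = sum(e * gaussian_pdf(y, x, std_p) for e, x in zip(eta, grid)) * dx
        # alpha * (mu P')(y) for the empirical measure mu of the particles
        release = sum(gaussian_pdf(y, p, std_pp) for p in particles) / len(particles)
        eta_next.append((1.0 - alpha) * diffusion + alpha * release)
    return eta_next


grid = [-8.0 + 0.05 * i for i in range(321)]
eta0 = [gaussian_pdf(x, 0.0, 1.0) for x in grid]
eta1 = field_step(eta0, grid, particles=[-1.0, 0.5, 2.0])
```

The release term adds one mixture component per particle at every step, so an exact closed-form representation of the field grows with time; a grid (or a sampling scheme) avoids tracking the components individually.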

In contrast to the model studied in [5], the situation here is somewhat more involved. Note that $\{X_{n}(N)\}_{n\geq 0}:=(X_{n}^{1,N},X_{n}^{2,N},\ldots,X_{n}^{N,N})_{n\geq 0}$ is not a Markov process and in order to get a Markovian state descriptor one needs to consider $\{X_{n}(N),\eta_{n}^{N}\}_{n\geq 0}$ which is a discrete time Markov chain with values in $(\mathbb{R}^{d})^{N}\times\mathcal{P}(\mathbb{R}^{d})$ .

We will show that as $N\to\infty$, $(\mu^{N}_{n},\eta^{N}_{n})_{n\in\mathbb{N}_{0}}$ converges to a deterministic nonlinear dynamical system $(\mu_{n},\eta_{n})_{n\in\mathbb{N}_{0}}$, using the methods of [3]. We further establish sharp quantitative bounds (with techniques used in [10] and [5]) for the convergence of the weakly interacting particle system, jointly with the stochastic field potential, to the nonlinear system of interest. Both the polynomial and the exponential concentration bounds require further constraints on the tails of the transition kernels $P,P^{\prime}$ used in modeling the diffusive environment. One major motivation of the current article is to give a sharp uniform in time quantitative estimate for the convergence of the particle system $(\mu_{n}^{N},\eta_{n}^{N})$ to the non-linear system of interest $(\mu_{n},\eta_{n})$, so that any functional of the form $\big{<}\phi_{1},\mu_{n}\big{>}+\big{<}\phi_{2},\eta_{n}\big{>}$ can be approximated by $\frac{1}{N}\sum_{i=1}^{N}\phi_{1}(X^{i}_{n})+\big{<}\phi_{2},\eta_{n}^{N}\big{>}$ with the desired precision. Previous work on concentration bounds for similar particle systems in discrete time includes [8], but that involves a Dobrushin type stability condition which is not very effective if the particles are assumed to come from a non-compact domain. A very recent work [4] addresses several quantitative bounds for chemotaxis models motivated by Patlak-Keller-Segel type non-linear equations.

The following notations will be used in this article. $\mathbb{R}^{d}$ will denote the $d$ dimensional Euclidean space with the usual Euclidean norm $|\cdot|$ . The set of natural numbers (resp. whole numbers) is denoted by $\mathbb{N}$ (resp. $\mathbb{N}_{0}$ ). Cardinality of a finite set $S$ is denoted by $|S|$ . For $x\in\mathbb{R}^{d}$ , $\delta_{x}$ is the Dirac delta measure on $\mathbb{R}^{d}$ that puts a unit mass at location $x$ . The supremum norm of a function $f:S\to\mathbb{R}$ is $\|f\|_{\infty}=\sup_{x\in S}|f(x)|$ . When $S$ is a metric space, the Lipschitz seminorm of $f$ is defined by $\|f\|_{1}=\sup_{x\not=y}\frac{|f(x)-f(y)|}{d(x,y)}$ where $d$ is the metric on the space $S$ . For a bounded Lipschitz function $f$ on $S$ we define $\|f\|_{BL}:=\|f\|_{1}+\|f\|_{\infty}$ . $\mbox{Lip}_{1}(S)$ (resp. $BL_{1}(S)$ ) denotes the class of Lipschitz (resp. bounded Lipschitz) functions $f:S\to\mathbb{R}$ with $\|f\|_{1}$ (resp. $\|f\|_{BL}$ ) bounded by 1. Occasionally we will suppress $S$ from the notation and write $\mbox{Lip}_{1}$ and $BL_{1}$ when clear from the context. For a Polish space $S$ , $\mathcal{P}(S)$ is equipped with the topology of weak convergence. A convenient metric metrizing this topology on $\mathcal{P}(S)$ is given as $\beta(\mu,\gamma)=\sup\{|\int fd\mu-\int fd\gamma|:\|f\|_{BL_{1}}\leq 1\}$ for $\mu,\gamma\in\mathcal{P}(S)$ . For a signed measure $\gamma$ on $\mathbb{R}^{d}$ , we define $\langle f,\gamma\rangle:=\int fd\gamma$ whenever the integral makes sense. The space $\mathcal{P}_{1}(\mathbb{R}^{d})$ will be equipped with the Wasserstein-1 distance that is defined as follows:

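$$\mathcal{W}_{1}(\mu_{0},\gamma_{0}):=\inf_{X,Y}E|X-Y|,\quad\mu_{0},\gamma_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d}),$$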

where the infimum is taken over all $\mathbb{R}^{d}$ valued random variables $X,Y$ defined on a common probability space and where the marginals of $X,Y$ are $\mu_{0}$ and $\gamma_{0}$ respectively. From Kantorovich-Rubinstein duality (cf. [17]) one sees that the Wasserstein-1 distance has the following characterization

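$$\mathcal{W}_{1}(\mu_{0},\gamma_{0})=\sup_{f\in\mbox{Lip}_{1}(\mathbb{R}^{d})}|\langle f,\mu_{0}-\gamma_{0}\rangle|,\quad\mu_{0},\gamma_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d}).$$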
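For intuition, the Wasserstein-1 distance is easy to compute in one special case: for two empirical measures on $\mathbb{R}$ with the same number of atoms, the optimal coupling pairs the sorted samples, so the distance is the average absolute difference of the order statistics. The helper below is a small sketch of that special case only (the function name is ours); it does not cover $d>1$ or samples of unequal size.

```python
def wasserstein1_empirical(xs, ys):
    """W_1 between the empirical measures of xs and ys on the real line.

    Assumes len(xs) == len(ys); in that case the optimal coupling matches
    the i-th smallest point of xs with the i-th smallest point of ys.
    """
    if len(xs) != len(ys):
        raise ValueError("samples must have the same size")
    if not xs:
        raise ValueError("samples must be non-empty")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)
```

For example, `wasserstein1_empirical([0.0, 1.0], [1.0, 2.0])` shifts every atom by one unit, and the order of the samples within each list does not matter.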

For a signed measure $\mu$ on $(S,\mathcal{B}(S))$ , the total variation norm of $\mu$ is defined as $|\mu|_{TV}:=\sup_{||f||_{\infty}\leq 1}\langle f,\mu\rangle$ . Probability distribution of a $S$ valued random variable $X$ will be denoted as $\mathcal{L}(X)$ . Convergence in distribution of a $S$ valued sequence $\{X_{n}\}_{n\geq 1}$ to a $S$ valued random variable $X$ will be written as $X_{n}\Rightarrow X$ .

A finite collection $\{Y_{1},Y_{2},\ldots,Y_{N}\}$ of $S$ valued random variables is called exchangeable if

$$\mathcal{L}(Y_{1},Y_{2},\ldots,Y_{N})=\mathcal{L}(Y_{\pi(1)},Y_{\pi(2)},\ldots,Y_{\pi(N)})$$

for every permutation $\pi$ on the $N$ symbols $\{1,2,\ldots,N\}$ . Let $\{Y_{i}^{N},i=1,\ldots,N\}_{N\geq 1}$ be a collection of $S$ valued random variables, such that for every $N$ , $\{Y_{1}^{N},Y_{2}^{N},\ldots,Y_{N}^{N}\}$ is exchangeable. Let $\nu_{N}=\mathcal{L}(Y_{1}^{N},Y_{2}^{N},\ldots,Y_{N}^{N})$ . The sequence $\{\nu_{N}\}_{N\geq 1}$ is called $\nu$ -chaotic (cf. [16]) for a $\nu\in\mathcal{P}(\mathcal{S})$ , if for any $k\geq 1$ , $f_{1},f_{2},\ldots,f_{k}\in\mathcal{C}_{b}(\mathcal{S}),$ one has

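$$\lim_{N\to\infty}\langle f_{1}\otimes f_{2}\otimes\cdots\otimes f_{k}\otimes 1\cdots\otimes 1,\nu_{N}\rangle=\prod_{i=1}^{k}\langle f_{i},\nu\rangle. \tag{1.7}$$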

Denoting the marginal distribution on first $k$ coordinates of $\nu_{N}$ by $\nu_{N}^{k}$ , equation (1.7) says that, for every $k\geq 1,$ $\nu_{N}^{k}\rightarrow\nu^{\otimes k}$ . The gradient of a real differentiable function $f$ on $\mathbb{R}^{d}$ denoted by $\nabla f$ is defined as the $d$ dimensional vector field $\nabla f:=(\frac{\partial f}{\partial x_{1}},\frac{\partial f}{\partial x_{2}},\ldots,\frac{\partial f}{\partial x_{d}})^{\prime}$ . For a function $f:\mathbb{R}^{d}\times\mathbb{R}^{m}\to\mathbb{R}$

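$$\nabla_{x}f(x,y):=\Big(\frac{\partial f}{\partial x_{1}},\frac{\partial f}{\partial x_{2}},\ldots,\frac{\partial f}{\partial x_{d}}\Big)^{\prime}.$$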

The function $\nabla_{y}f(x,y)$ is defined similarly. Absolute continuity of a measure $\mu$ with respect to a measure $\nu$ will be denoted by $\mu\ll\nu.$ We will denote the Radon-Nikodym derivative of $\mu$ with respect to $\nu$ by $\frac{d\mu}{d\nu}$. For $f\in BM(\mathcal{S})$ and a transition probability kernel $P$ on $S$, define $Pf\in BM(\mathcal{S})$ as $Pf(\cdot)=\int_{S}f(y)P(\cdot,dy)$. For any closed subset $B\subset S$ and $\mu\in\mathcal{P}(B),$ define $\mu P\in\mathcal{P}(S)$ as $\mu P(A)=\int_{B}P(x,A)\mu(dx)$. For a matrix $B$ the usual operator norm is denoted by $\|B\|$.

2 Description of the nonlinear system:

We now describe the nonlinear dynamical system obtained on taking the limit $N\to\infty$ of $(\mu_{n}^{N},\eta_{n}^{N})$ . Given a $C^{1}$ density function $\rho$ on $\mathbb{R}^{d}$ and $\mu\in\mathcal{P}(\mathbb{R}^{d})$ , define a transition probability kernel $Q^{\rho,\mu}$ on $\mathbb{R}^{d}$ as

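$$Q^{\rho,\mu}(x,C)=\int_{\mathbb{R}^{m}}1_{\{Ax+\delta f(\nabla\rho(x),\mu,x,z)+B(z)\in C\}}\,\theta(dz),\quad(x,C)\in\mathbb{R}^{d}\times\mathcal{B}(\mathbb{R}^{d}).$$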

With an abuse of notation we will also denote by $Q^{\rho,\mu}$ the map from $BM(\mathbb{R}^{d})$ to itself, defined as

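$$Q^{\rho,\mu}\phi(x)=\int_{\mathbb{R}^{d}}\phi(y)\,Q^{\rho,\mu}(x,dy),\quad\phi\in BM(\mathbb{R}^{d}),\ x\in\mathbb{R}^{d}.$$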

For $\mu,\mu_{1}\in\mathcal{P}(\mathbb{R}^{d})$ , let $\mu Q^{\rho,\mu_{1}}\in\mathcal{P}(\mathbb{R}^{d})$ be defined as

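$$\mu Q^{\rho,\mu_{1}}(C)=\int_{\mathbb{R}^{d}}Q^{\rho,\mu_{1}}(x,C)\,\mu(dx),\quad C\in\mathcal{B}(\mathbb{R}^{d}).$$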

Note that $\mu Q^{\rho,\mu_{1}}=\mathcal{L}\big{(}AX+\delta f(\nabla\rho(X),\mu_{1},X,\epsilon)+B(\epsilon)\big{)}$ where $\mathcal{L}(X,\epsilon)=\mu\otimes\theta$ .

Define $\mathcal{P}_{1}^{*}(\mathbb{R}^{d}):=\{\mu\in\mathcal{P}_{1}(\mathbb{R}^{d}):\mu\ll l,\frac{d\mu}{dl}$ is continuously differentiable and $\|\nabla\frac{d\mu}{dl}\|_{1}<\infty\}.$ For notational simplicity we will identify an element in $\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ with its density and denote both by the same symbol. Define the map $\Psi:\mathcal{P}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\to\mathcal{P}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})$ as

$$\Psi(\mu,\eta)=(\mu Q^{\eta,\mu},\eta R_{\mu}^{\alpha}). \tag{2.2}$$

Under suitable assumptions (which will be introduced in Section 3) it will follow that for every $(\mu,\eta)\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d}),$ $\eta^{+}$ defined by (1.4) is in $\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ and $\mu Q^{\eta,\mu}$ defined by (2.1) is in $\mathcal{P}_{1}(\mathbb{R}^{d})$. Thus (under those assumptions) $\Psi$ is a map from $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ to itself. Using the above notation we see that $\{(X_{n}^{1},...,X_{n}^{N}),\mu_{n}^{N},\eta_{n}^{N}\}_{n\geq 0}$ is an $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ valued discrete time Markov chain defined recursively as follows. Let $X_{k}(N)\equiv(X_{k}^{1},X_{k}^{2},...,X_{k}^{N})$, and let $\eta_{0}^{N}$ be the initial chemical field, which is a random element of $\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$. Let $\mathcal{F}_{0}=\sigma\{X_{0}(N),\eta_{0}^{N}\}.$ Then, for $k\geq 1$

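$$\begin{cases} P(X_{k}(N)\in C\mid\mathcal{F}_{k-1}^{N})=\bigotimes_{i=1}^{N}\big(\delta_{X_{k-1}^{i}}Q^{\eta_{k-1}^{N},\mu_{k-1}^{N}}\big)(C)\quad\forall C\in\mathcal{B}(\mathbb{R}^{dN}), \\ \mu_{k}^{N}=\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{k}^{i}}, \\ \eta_{k}^{N}=\eta_{k-1}^{N}R_{\mu_{k-1}^{N}}^{\alpha}, \\ \mathcal{F}_{k}^{N}=\sigma\{\eta_{k}^{N},X_{k}(N)\}\vee\mathcal{F}_{k-1}^{N}. \end{cases} \tag{2.3}$$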

We will call this particle system $\mathbb{IPS}_{1}$. We next describe a nonlinear dynamical system which is the formal Vlasov-McKean limit of the above system as $N\to\infty$. Given $(\mu_{0},\eta_{0})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ define a sequence $\{(\mu_{n},\eta_{n})\}_{n\geq 0}$ in $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ as

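$$\mu_{n+1}=\mu_{n}Q^{\eta_{n},\mu_{n}},\quad\eta_{n+1}=\eta_{n}R_{\mu_{n}}^{\alpha},\quad n\geq 0. \tag{2.4}$$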

Using (2.2) the above evolution can be represented as

$$(\mu_{n+1},\eta_{n+1})=\Psi(\mu_{n},\eta_{n}),\quad n\in\mathbb{N}_{0}. \tag{2.5}$$

As in [5], the starting point of our investigation of the long time asymptotics of the above interacting particle system will be to study the stability properties of (2.4). We identify $\eta,\eta^{\prime}\in\mathcal{P}(\mathbb{R}^{d})$ that are equal a.e. under the Lebesgue measure on $\mathbb{R}^{d}$. From a computational point of view we are approximating $(\mu_{n},\eta_{n})$ by $(\mu_{n}^{N},\eta_{n}^{N})$ uniformly in the time parameter $n$, with explicit uniform concentration bounds. Such results are particularly important for developing sampling methods for approximating the steady state distribution of mean field models such as (2.4).

The third equation in (2.3) makes the simulation of $\mathbb{IPS}_{1}$ numerically challenging. In Section 3 we will mention another particle system (based on the second particle system in [3]), referred to as $\mathbb{IPS}_{2}$, which also gives an asymptotically consistent approximation of (2.4) and is computationally more tractable. We show in Theorem 3.2 that under conditions that include a Lipschitz property of $f$ (Assumptions 1 and 2) and smoothness assumptions on the transition kernels of the background diffusion of the chemical medium (Assumption 4), the Wasserstein-1 ($\mathcal{W}_{1}$) distance between the occupation measure of the particles along with the chemical medium $(\mu_{n}^{N},\eta_{n}^{N})$ and $(\mu_{n},\eta_{n})$ converges to [math], for every time instant $n.$ Under an additional condition on the contractivity of $A$, and with $\delta,\alpha$ sufficiently small, we show that the nonlinear system (2.5) has a unique fixed point and that, starting from an arbitrary initial condition, convergence to the fixed point occurs at a geometric rate. Using these results we next argue in Theorem 1 that under some integrability conditions (Assumptions 7-8), as $N\to\infty$, the empirical occupation measure of the $N$ particles and the density of the chemical medium at time instant $n$, namely $(\mu_{n}^{N},\eta_{n}^{N})$, converge to $(\mu_{n},\eta_{n})$ in the $\mathcal{W}_{1}$ distance, in $L^{1}$, uniformly in $n$. This result in particular shows that the $\mathcal{W}_{1}$ distance between $(\mu_{n}^{N},\eta_{n}^{N})$ and the unique fixed point $(\mu_{\infty},\eta_{\infty})$ of (2.5) converges to zero as $n\to\infty$ and $N\to\infty$ in any order.
We next show that for each $N$ there is a unique invariant measure $\Theta^{N}_{\infty}$ of the $N$-particle dynamics with integrable first moment, and that this sequence of measures is $\mu_{\infty}$-chaotic; namely, as $N\to\infty$, the projection of $\Theta^{N}_{\infty}$ on the first $k$ coordinates converges to $\mu_{\infty}^{\otimes k}$ for every $k\geq 1$. This propagation of chaos property all the way to $n=\infty$ crucially relies on the uniform in time convergence of $(\mu_{n}^{N},\eta_{n}^{N})$ to $(\mu_{\infty},\eta_{\infty})$. Such a result is important since it says that the steady state of an $N$-dimensional fully coupled Markovian system has a simple approximate description in terms of a product measure when $N$ is large. This result is key in developing particle based numerical schemes for approximating the fixed point of the evolution equation (2.5). Next we present some uniform in time concentration bounds for $\mathcal{W}_{1}(\mu^{N}_{n},\mu_{n})+\mathcal{W}_{1}(\eta_{n}^{N},\eta_{n})$. The proof is very similar to that of Theorem 3.8 of [5], so we only provide a sketch after stating the necessary conditions.

3 Main Results:

We now introduce our main assumptions on the problem data. Recall that $\{X_{0}^{i},i=1,\ldots N\}$ is assumed to be exchangeable with common distribution $\mu_{0}.$ We assume further $(\mu_{0},\eta_{0})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}^{*}_{1}(\mathbb{R}^{d}).$ For a $d\times d$ matrix B we denote its norm by $\|B\|,$ i.e. $\|B\|=\sup_{x\in\mathbb{R}^{d}\setminus\{0\}}\frac{|Bx|}{|x|}$ .

Assumption 1

The error distribution $\theta$ is such that $\int A_{1}(z)\theta(dz):=\sigma\in(0,\infty)$ where

[TABLE]

It follows that $\forall x,y\in\mathbb{R}^{d},\mu\in\mathcal{P}_{1}(\mathbb{R}^{d}),$

$$|f(y,\mu,x,\epsilon)|\leq(|y|+\|\mu\|_{1}+|x|)A_{1}(\epsilon)+A_{2}(\epsilon)$$

where $A_{2}(\epsilon):=f(0,0,\epsilon)$ .

Recall the function $B:\mathbb{R}^{m}\to\mathbb{R}^{d}$ introduced in (1.2).

Assumption 2

The error distribution $\theta$ is such that

$$\int_{\mathbb{R}^{m}}\Big(A_{2}(z)+|B(z)|\Big)\theta(dz)<\infty.$$

Assumption 3

$\eta_{0}^{N}$ (the density function) is a Lipschitz function on $\mathbb{R}^{d}$ and $\eta_{0}^{N}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$.

Assumptions 4 and 5 on the kernels $P$ and $P^{\prime}$ hold quite generally. In particular, they are satisfied for Gaussian kernels.

Assumption 4

There exist $l_{P}^{\nabla}\in(0,1]$ and $l_{P^{\prime}}^{\nabla}\in(0,\infty)$ such that for all $x,y,x^{\prime},y^{\prime}\in\mathbb{R}^{d}$

[TABLE]

Furthermore

$$\sup_{x\in\mathbb{R}^{d}}\{|\nabla_{y}P(x,0)|\vee|\nabla_{y}P^{\prime}(x,0)|\}<\infty.$$

Using the Lipschitz property in (3.3) and the growth condition (3.6) one has the linear growth property for some $M_{P}^{\nabla}\in(0,\infty)$

$$\sup_{x\in\mathbb{R}^{d}}|\nabla_{y}P(x,y)|\leq M_{P}^{\nabla}(1+|y|).$$

A similar inequality holds for $P^{\prime}$ from (3.4) with $M_{P^{\prime}}^{\nabla}\in(0,\infty)$ .

Denote $(1-\alpha)l_{P}^{\nabla}+\alpha l_{P^{\prime}}^{\nabla}$ by $l_{PP^{\prime}}^{\nabla,\alpha}$ .

Assumption 5

For every $f\in\mbox{Lip}_{1}(\mathbb{R}^{d}),$ $Pf$ and $P^{\prime}f$ are also Lipschitz and

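$$\sup_{f\in\mbox{Lip}_{1}(\mathbb{R}^{d})}\ \sup_{x\neq y\in\mathbb{R}^{d}}\frac{Pf(x)-Pf(y)}{|x-y|}:=l(P)<\infty.$$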

Also $l(P^{\prime})$ defined as above for $P^{\prime}$ is finite.

Assumption 6

Both $P(x,\cdot)$ and $P^{\prime}(x,\cdot)$ are such that for any compact set $K\subset\mathbb{R}^{d},$ the families of probability measures $\{P(x,\cdot):x\in K\}$ and $\{P^{\prime}(x,\cdot):x\in K\}$ are both uniformly integrable.

Let $\max\{l(P),l(P^{\prime})\}=l_{PP^{\prime}}$ .

Remark 3.1

Assumption 5 is satisfied if $P,P^{\prime}$ are given as follows. For any $f\in\mathcal{C}_{b}(\mathbb{R}^{d}),$ let

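$$Pf(\cdot):=Ef(g_{1}(\cdot,\varepsilon_{1})),\quad P^{\prime}f(\cdot):=Ef(g_{2}(\cdot,\varepsilon_{2})),$$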

where ${\varepsilon}_{1},{\varepsilon}_{2}$ are $\mathbb{R}^{m}$ valued random variables and $g_{1},g_{2}:\mathbb{R}^{d}\times\mathbb{R}^{m}\to\mathbb{R}^{d}$ are maps with the following properties:

$$E(G_{1}(\varepsilon_{1}))\leq l(P)\quad\text{and}\quad E(G_{2}(\varepsilon_{2}))\leq l(P^{\prime}),$$

where

[TABLE]

Simulation of the system is numerically intractable due to the step that involves the updating of $\eta_{n-1}^{N}$ to $\eta_{n}^{N}.$ This requires computing the integral in (1.4) which, since $R_{\mu}^{\alpha}$ is a mixture of two transition kernels, over time leads to an explosion of terms in the mixture that need to be updated. An approach (proposed in [3]) that addresses this difficulty is, without directly updating $\eta_{n-1}^{N}$ , to use the empirical distribution of the observations drawn independently from $\eta_{n-1}^{N}.$

Denote by $\bar{X}_{0}(N)=(\bar{X}_{0}^{1,N},\ldots,\bar{X}_{0}^{N,N})$ a sample of size $N$ from $\mu_{0}.$ Let $M\in\mathbb{N}$ . The new particle scheme is described as a family $(\bar{X}_{k}(N),\bar{\mu}^{N}_{k},\bar{\eta}^{M}_{k})_{k\in\mathbb{N}_{0}}$ of $(\mathbb{R}^{d})^{N}\times\mathcal{P}(\mathbb{R}^{d})\times\mathcal{P}^{*}(\mathbb{R}^{d})$ valued random elements on some probability space, defined recursively as follows. Set $\bar{X}_{0}(N)=(\bar{X}_{0}^{1,N},\ldots,\bar{X}_{0}^{N,N}),\bar{\eta}^{M}_{0}=\eta_{0},\bar{\mathcal{F}}^{M,N}_{0}=\sigma(\bar{X}^{N}(0))$ . For $k\geq 1$

[TABLE]

where $S^{M}(\bar{\eta}_{k-1}^{M})$ is the random measure defined as $\frac{1}{M}\sum_{i=1}^{M}\delta_{Y^{i,M}_{k-1}}$ and $\{Y^{i,M}_{k-1}\}_{i=1,\ldots,M},$ conditionally on $\bar{\mathcal{F}}_{k-1}^{M,N},$ are $M$ i.i.d. random variables distributed according to $\bar{\eta}_{k-1}^{M}.$ We call this particle system $\mathbb{IPS}_{2}$ . We remark that our notation is not fully accurate, since both quantities $\bar{\mu}_{k}^{N},\bar{\eta}_{k}^{M}$ depend on $M$ and $N.$ The superscripts only indicate the number of particles/samples used to form each of them. Note that, unlike for $\mathbb{IPS}_{1},$ here $(\bar{X}_{k}(N),\bar{\eta}_{k}^{M})_{k\geq 0}$ is no longer a Markov chain on $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ . Rather, $(\bar{X}^{N}(k),\bar{\eta}_{k}^{M},S^{M}(\bar{\eta}_{k}^{M}))_{k\geq 0}$ is a discrete time Markov chain on $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}_{1}(\mathbb{R}^{d})$ .
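The resampling step can be sketched in code. The block below is a minimal illustration under stated assumptions, not the scheme (3.10) itself: it assumes the medium is updated through the mixture $(1-\alpha)S^{M}(\bar{\eta}_{k-1}^{M})P+\alpha\bar{\mu}_{k-1}^{N}P^{\prime}$, works in dimension one, and takes $P,P^{\prime}$ to be Gaussian kernels with hypothetical scales `lam_p` and `lam_pp`. The point is that the mixture always has $M+N$ components, so drawing the next $M$ samples never requires tracking a growing number of terms.

```python
import random

def sample_next_eta(Y, X, alpha, lam_p, lam_pp, M, rng):
    """Draw M i.i.d. points from (1 - alpha) * S^M(eta) P + alpha * mu^N P'.

    Y: atoms of S^M(eta_{k-1}); X: particle locations (atoms of mu^N_{k-1}).
    P and P' are assumed Gaussian: N(x, lam_p^2) and N(x, lam_pp^2).
    """
    out = []
    for _ in range(M):
        if rng.random() < alpha:
            center, scale = rng.choice(X), lam_pp   # branch mu^N P'
        else:
            center, scale = rng.choice(Y), lam_p    # branch S^M(eta) P
        out.append(center + scale * rng.gauss(0.0, 1.0))
    return out

# Usage with made-up inputs: 5 medium samples, 7 particles, 50 new draws.
rng = random.Random(0)
new_samples = sample_next_eta([0.0] * 5, [10.0] * 7, 0.5, 0.1, 0.2, 50, rng)
```

Each draw first selects the mixture branch and then an atom uniformly within it, which is an exact way to sample from a finite mixture of kernels started at point masses.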

For any random variable $Z$ we denote $E\big{[}Z\big{|}\mathcal{F}_{k}^{M,N}\big{]}$ by $E_{k}^{M,N}\big{[}Z\big{]}$ . The following result shows that the particle systems in (2.3) and (3.10) approximate the dynamical system in (2.4) as $N$ (respectively $\min\{M,N\}$ for $\mathbb{IPS}_{2}$ ) becomes large for a fixed time instant.

Proposition 3.2

Suppose Assumptions 1,2,4 and 5 hold.

(a)

Consider the particle system $\mathbb{IPS}_{1}$ in (1.3,1.5). Suppose the initial sample $X_{0}(N)\equiv(X_{0}^{1},X_{0}^{2},\ldots,X_{0}^{N})$ is exchangeable and $\{\mathcal{L}(X_{0}(N))\}_{N\in\mathbb{N}}$ is $\mu_{0}$-chaotic. Suppose $E\mathcal{W}_{1}(\eta_{0}^{N},\eta_{0})\to 0$ as $N\to\infty$ . Then, as $N\to\infty$

[TABLE]

for all $n\geq 0$ where $\mu_{n},\eta_{n}$ are as in (2.4).

(b)

Consider the second particle system $\mathbb{IPS}_{2}.$ Suppose that in addition Assumption 6 holds. Suppose the initial sample $\bar{X}_{0}(N)\equiv(\bar{X}_{0}^{1},\bar{X}_{0}^{2},\ldots,\bar{X}_{0}^{N})$ is exchangeable and $\{\mathcal{L}(\bar{X}_{0}(N))\}_{N\in\mathbb{N}}$ is $\mu_{0}$-chaotic. Then as $\min\{N,M\}\to\infty$

[TABLE]

for all $n\geq 0$ .
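The convergence in Proposition 3.2 is stated in the distance $\mathcal{W}_{1}$. In dimension one, and for two empirical measures with the same number of atoms, this distance has a simple closed form: it is the average absolute difference between the sorted samples. The sketch below implements only that special case (the function name is ours) and is meant as a way to monitor the convergence numerically in toy experiments, not as part of the proof.

```python
def w1_empirical_1d(xs, ys):
    """W1 distance between two empirical measures on the real line
    that have the same number of atoms (order-statistics formula)."""
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equally many atoms")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)

# Shifting every atom by 1 moves the measure by exactly 1 in W1.
shift_distance = w1_empirical_1d([0.0, 1.0], [1.0, 2.0])
```

For $d>1$ no such formula is available and one would need an optimal transport solver.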

As a consequence of Proposition 3.2, we have a finite time propagation of chaos result of the following form. Let $\nu_{n}^{N}=\mathcal{L}(X_{n}^{1,N},X_{n}^{2,N},\ldots,X_{n}^{N,N}).$

Corollary 3.3

Under the assumptions of Proposition 3.2, the family $\{\nu_{n}^{N}\}_{N\geq 1}$ is $\mu_{n}$-chaotic for every $n\geq 1$ .

As noted in the introduction, the primary goal is to study long time properties of (1.3) and the non-linear dynamical system (2.4). The following proposition identifies the range of values of the modeling parameters that leads to stability of the system.

Proposition 3.4

Suppose Assumptions (1)-(5) hold. Then there exist $\omega_{0},\alpha_{0},\delta_{0}\in(0,1)$ such that for all $\|A\|<\omega_{0},\alpha\in(0,\alpha_{0})$ , and $\delta\in(0,\delta_{0})$ , the map $\Psi$ defined in (2.2) has a unique fixed point $(\mu_{\infty},\eta_{\infty})$ in $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d}).$

We now give more stringent conditions under which one can establish a non-asymptotic bound on the convergence rate of the particle system to the deterministic nonlinear dynamics, together with its consequences for the steady state behavior.

Assumption 7

For some $\tau>0,$

[TABLE]

We need to impose the following condition on $P,P^{\prime}$ for uniform in time convergence.

Assumption 8

Suppose $\left<|x|^{1+\tau},\eta_{0}\right><\infty$ for some $\tau>0.$ There exist $m_{\tau}(P)$ and $m_{\tau}(P^{\prime})$ in $\mathbb{R}^{+}$ such that the following holds for all $x\in\mathbb{R}^{d}$

[TABLE]

Now we state a generalization of Proposition 3.2, which gives the convergence rate of

[TABLE]

uniformly over all $n\geq 0$ in a nonasymptotic manner.

Recall $l_{P}^{\nabla},l_{P^{\prime}}^{\nabla}$ introduced in Assumption 4. For $\alpha\in(0,1),$ let $l_{PP^{\prime}}^{\nabla,\alpha}=(1-\alpha)l_{P}^{\nabla}+\alpha l_{P^{\prime}}^{\nabla}$ . With the notation of Assumption 1 we define

[TABLE]

For $(\mu_{n},\eta_{n}),(\mu^{\prime}_{n},\eta^{\prime}_{n})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ define the following distance on $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$

[TABLE]

Theorem 1

Consider the particle system $\mathbb{IPS}_{2}$ . Suppose Assumptions (1)-(5) and Assumptions (7),(8) hold for some $\tau>0$ . Let $N_{1}:=\min\{M,N\}.$ Also assume $\delta\in(0,a_{0}),\quad(1-\alpha)m_{\tau}(P)<1$ and

[TABLE]

Then there exist $\theta<1$ and $a\in(0,\infty)$ such that for each $n\geq 0,$ the upper bound $b(N_{1},\tau,d)$ of

[TABLE]

can be expressed as

[TABLE]

where the value of the constant $C$ will vary for each of the cases.

Remark 3.5

For the first particle system (1.3-1.5) similar results hold, with the explicit bounds given in terms of the number of particles $N$ instead of $N_{1}.$ For $\mathbb{IPS}_{2}$ , if the initial sampling scheme of $\bar{X}_{0}(N)\equiv(\bar{X}_{0}^{1},\bar{X}_{0}^{2},...,\bar{X}_{0}^{N})$ is $\mu_{0}$ -chaotic then, using the fact that $E\mathcal{W}_{1}(\bar{\mu}_{0}^{N},\mu_{0})\to 0$ as $N\to\infty,$ it follows from the conclusion of Theorem 1 that

[TABLE]

as $\min{\{N,M\}}\to\infty.$ For the first particle system in (1.3-1.5), if $E\mathcal{W}_{1}(\eta_{0}^{N},\eta_{0})\to 0$ as $N\to\infty,$ and $X_{0}(N)\equiv(X_{0}^{1},X_{0}^{2},...,X_{0}^{N})$ is $\mu_{0}$ -chaotic, then the following

[TABLE]

holds as $N\to\infty$ .

One consequence of the above theorem and Proposition 3.4 is the following interchange of limits result, which is analogous to Corollary 3.5 of [5].

Corollary 3.6

Under the conditions of Theorem 1

[TABLE]

Suppose the assumptions of Theorem 1 hold and let $(\mu_{\infty},\eta_{\infty})$ be the fixed point of the map $\Psi$ of (2.5). We are interested in establishing a propagation of chaos result for $n=\infty.$ Recall that for $\mathbb{IPS}_{2},$ $S^{M}(\bar{\eta}_{n}^{M})$ is the random measure defined as $\frac{1}{M}\sum_{i=1}^{M}\delta_{Y^{i,M}_{n}}$ where $\{Y^{i,M}_{n}\}_{i=1,\ldots,M},$ conditionally on $\mathcal{F}_{n}^{M,N},$ are $M$ i.i.d. $\mathbb{R}^{d}$ valued random variables distributed according to $\bar{\eta}_{n}^{M}.$ Denote $Y_{n}(M):=(Y^{1,M}_{n},\ldots,Y^{M,M}_{n})$ .

Theorem 2

Consider the second particle system $\mathbb{IPS}_{2}$ . Suppose Assumptions 1,2,4,5 hold with conditions

[TABLE]

Then for every $N,M\geq 1,$ the Markov process $\big{(}\bar{X}^{N}(n),\bar{\eta}_{n}^{M},S^{M}(\bar{\eta}_{n}^{M})\big{)}_{n\geq 0}$ on $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})$ has a unique invariant measure $\Theta_{\infty}^{N,M}$ if the following holds

[TABLE]

Let $\Theta_{\infty}^{1,N,M}$ be the marginal distribution on $(\mathbb{R}^{d})^{N}$ of the first co-ordinate of $\Theta_{\infty}^{N,M}$ . Suppose additionally that Assumptions 3, 4, 7 and 8 hold, with the further condition that for some $\tau>0$

[TABLE]

Then $\Theta_{\infty}^{1,N,M}$ is $\mu_{\infty}$ - chaotic, where $\mu_{\infty}$ is defined in Proposition 3.4.

Remark 3.7

For the first particle system $(\mathbb{IPS}_{1})$ a similar steady state result holds for the discrete time Markov chain $\big{(}\bar{X}^{N}(n),\bar{\eta}_{n}^{N}\big{)}_{n\geq 0}$ on $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d}).$

3.1 Concentration Bounds:

In order to obtain uniform in time concentration bounds for $\mathcal{W}_{1}\big{(}(\mu_{n}^{N},\eta_{n}^{N}),(\mu_{n},\eta_{n})\big{)}$ we proceed as in Theorem 3.7 and Theorem 3.8 of [5], respectively. We establish two different types of concentration bounds. The first allows initial samples that are not i.i.d. (they are only assumed to be $\mu_{0}$-chaotic); the second assumes i.i.d. initial samples.

Assumption 9

(i) For some $K\in(1,\infty)$ , $A_{1}(x)\leq K$ for $\theta$ a.e. $x\in\mathbb{R}^{m}$ .

(ii) There exists $\alpha\in(0,\infty)$ such that $\int e^{\alpha|x|}\mu_{0}(dx)<\infty$ and there exists $\alpha(\delta)\in(0,\alpha)$ such that

[TABLE]

Assumption 10

Suppose there exist functions $h_{1}(\cdot)$ , $h_{2}(\cdot)$ , $h^{\prime}_{1}(\cdot)$ , $h^{\prime}_{2}(\cdot),h_{3}(\cdot),h^{\prime}_{3}(\cdot)$ ( $h_{2},h^{\prime}_{2},h_{3},h^{\prime}_{3}$ are nondecreasing with $h_{2}(0)=0$ , $h^{\prime}_{2}(0)=0$ ), and constants $l_{h_{1}}\in(0,1],\,\,l_{h^{\prime}_{1}}\in(0,\infty)$ such that $h_{1}(x)$ and $h^{\prime}_{1}(x)$ are respectively $l_{h_{1}}$ and $l_{h^{\prime}_{1}}$ Lipschitz. There exists $\alpha\in(0,\infty)$ such that the following hold for all $\alpha_{1}\in(0,\alpha)$

[TABLE]

Remark 3.8

(a)

For the Gaussian transition kernel $P(x,dy)=\frac{1}{\sqrt{2\pi}\lambda}e^{-\frac{(x-y)^{2}}{2\lambda^{2}}}dy,$ one has

[TABLE]

where $\Phi(\cdot)$ is the cumulative distribution function of the normal distribution. So (3.15) holds with $h_{1}(x)=x,\quad h_{3}(\cdot)=0,\quad h_{2}(\alpha_{1})=\frac{\lambda^{2}\alpha_{1}^{2}}{2}.$

(b)

For the bi-exponential kernel $P(x,dy)=\frac{1}{2\lambda}e^{-\frac{|x-y|}{\lambda}}dy$ one has

[TABLE]

So (3.15) holds under the condition $\alpha_{1}<\frac{1}{\lambda}$ with $h_{1}(x)=x,\quad h_{3}(\cdot)=0,\quad h_{2}(\alpha_{1})=\log\Big{[}\frac{1}{1-\alpha_{1}^{2}\lambda^{2}}\Big{]}.$ Note that any kernel with tails lighter than exponential (like the Gaussian) satisfies (3.15) for all $\alpha_{1},$ whereas kernels with exponential-like tails impose a specific restriction on $\alpha_{1}.$

(c)

We work here with $l_{h_{1}}=1$ as the upper bound on $l_{h_{1}}$ . This only influences the choice of $\alpha_{1}$ for which

[TABLE]

For $l_{h_{1}}=1$ one has a definite upper bound on $\alpha_{1}.$ More precisely, denote $\alpha_{1}h_{1}(0)\sum_{j=0}^{i}l^{j}_{h_{1}}+\sum_{j=0}^{i}h_{2}(\alpha_{1}l^{j}_{h_{1}})$ by $g(i)$ . If $g(i)$ is linear in $i$ (which happens only for $l_{h_{1}}=1$ ) then there exists $\alpha^{*}$ such that (3.16) holds for $\alpha_{1}<\alpha^{*}$ . On the other hand, if $g(\cdot)$ is bounded, then $\sup_{n\geq 0}\sup_{M,N\geq 1}E\left<e^{\alpha_{1}|x|},\bar{\eta}^{M}_{n}\right>$ remains finite for all $\alpha_{1}>0$ . If $g(i)$ is exponential in $i$ (when $l_{h_{1}}>1$ ) then the upper bound of $\sup_{n\geq 0}\sup_{M,N\geq 1}E\left<e^{\alpha_{1}|x|},\bar{\eta}^{M}_{n}\right>$ diverges.
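The two expressions for $h_{2}$ in parts (a) and (b) coincide with the logarithms of the moment generating functions of the centered noise $z=y-x$ under the Gaussian and bi-exponential kernels. The sketch below checks this numerically by midpoint quadrature in dimension one. It verifies only these two closed forms; the bound (3.15) itself, which concerns $e^{\alpha_{1}|y|}$, is not reproduced here, and the function names, truncation ranges and step counts are our own choices.

```python
import math

def midpoint_integral(fn, lo, hi, steps):
    """Midpoint rule for the integral of fn over [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(fn(lo + (i + 0.5) * h) for i in range(steps))

def gaussian_mgf(alpha1, lam, steps=100_000):
    """E exp(alpha1 * Z) for Z ~ N(0, lam^2), by quadrature."""
    def integrand(z):
        dens = math.exp(-z * z / (2 * lam * lam)) / (math.sqrt(2 * math.pi) * lam)
        return math.exp(alpha1 * z) * dens
    return midpoint_integral(integrand, -40 * lam, 40 * lam, steps)

def laplace_mgf(alpha1, lam, steps=200_000):
    """E exp(alpha1 * Z) for Z with density exp(-|z|/lam) / (2 lam)."""
    if alpha1 * lam >= 1:
        raise ValueError("the integral is finite only for alpha1 < 1/lam")
    def integrand(z):
        return math.exp(alpha1 * z) * math.exp(-abs(z) / lam) / (2 * lam)
    return midpoint_integral(integrand, -80 * lam, 80 * lam, steps)

alpha1, lam = 0.5, 1.0
h2_gauss = math.log(gaussian_mgf(alpha1, lam))      # compare with lam^2 alpha1^2 / 2
h2_laplace = math.log(laplace_mgf(alpha1, lam))     # compare with log(1 / (1 - alpha1^2 lam^2))
```

The restriction $\alpha_{1}<1/\lambda$ in part (b) appears here as the condition under which the bi-exponential integral converges.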

With $\tau,\sigma_{1}(\tau)$ defined above in Assumption 7 let

[TABLE]

Theorem 3

(a)

(Polynomial Concentration) Let $N_{1}=\min\{M,N\}.$ Suppose Assumptions (1-5) and Assumptions (7),(8) hold for some $\tau>0$ . Suppose that $\delta\in(0,a(\tau)^{\frac{1}{1+\tau}}),(1-\alpha)l_{\tau}(P)<1$ and

[TABLE]

Then there exist $\nu>1,\gamma\in(0,1)$ , $N_{0}\in\mathbb{N}_{0}$ and $C_{1}\in(0,\infty)$ such that for all $\varepsilon>0$ and for all $n\geq 0,$

[TABLE]

for all $N_{1}>N_{0}\left(\max\left\{1,\log^{+}\varepsilon\right\}\right)^{\frac{d+2}{d}}$ .

(b)

(Exponential Concentration) Let $N_{1}=\min\{M,N\}.$ Suppose that Assumptions 9 and 10 hold with (3.18). Suppose $\delta\in\Big{[}0,\frac{1-\|A\|}{(2+l^{\nabla,\alpha}_{PP^{\prime}})K}\Big{)}$ and $\alpha_{1}\in\left[0,\min\{\alpha^{*},\frac{\alpha(\delta)}{\delta}\}\right)$ where

[TABLE]

Then there exists $N_{0}\in\mathbb{N},\nu>1,\gamma\in(0,1)$ and $C_{2}\in(0,\infty)$ such that for all ${\varepsilon}>0$

[TABLE]

for all $n\geq 0$ , $N_{1}\geq N_{0}\max\{(\frac{1}{{\varepsilon}}\log^{+}\frac{1}{{\varepsilon}})^{d+2},{\varepsilon}^{(d+2)/(d-1)}\}$ , if $d>1$ ; and

[TABLE]

for all $n\geq 0$ , $N_{1}\geq N_{0}\max\{(\frac{1}{{\varepsilon}}\log^{+}\frac{1}{{\varepsilon}})^{d+2},1\}$ , if $d=1$ .
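The sample size thresholds in Theorem 3 are explicit functions of $\varepsilon$ and $d$ once $N_{0}$ is fixed. The helper below evaluates them, which makes the dimension dependence easy to inspect. The constant $N_{0}$ is not specified by the theorem, so any value passed as `n0` is a placeholder, and the function names are ours.

```python
import math

def log_plus(x):
    """log^+ x = max(log x, 0) for x > 0."""
    return max(math.log(x), 0.0)

def polynomial_threshold(eps, d, n0):
    """N_0 * (max{1, log^+ eps})^((d+2)/d): lower limit on N_1 in part (a)."""
    return n0 * max(1.0, log_plus(eps)) ** ((d + 2) / d)

def exponential_threshold(eps, d, n0):
    """Lower limit on N_1 in part (b), with separate cases d > 1 and d = 1."""
    first = (log_plus(1.0 / eps) / eps) ** (d + 2)
    second = eps ** ((d + 2) / (d - 1)) if d > 1 else 1.0
    return n0 * max(first, second)

# Illustration with a placeholder N_0 = 1: the threshold for eps = 0.1
# grows quickly with the dimension d.
thresholds = [exponential_threshold(0.1, d, 1) for d in (1, 2, 3)]
```

For small $\varepsilon$ the first term dominates and grows like $(\varepsilon^{-1}\log\varepsilon^{-1})^{d+2}$, which is the dimension dependence noted after Remark 3.9.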

Remark 3.9

(a)

Similar concentration bounds hold for the first particle system $\mathbb{IPS}_{1}.$

(b)

Here the nonlinearity in the kernel of the nonlinear Markov process has a linear structure (a linear combination of $P$ and $\mu P^{\prime}$ ), which is handled through the $\mathcal{W}_{1}$ distance. This can be generalized further: for a nonlinear Markov process where the nonlinearity in the kernel depends on higher order moments (of $p$ th order) of the law of the chain, working with the $\mathcal{W}_{p}$ distance would yield similar results.

Note that the bounds in Theorem 3 are not dimension independent, while the initial sampling assumptions are not restrictive. It would be interesting to see whether one can get sharper bounds under conditions stronger than those of the above theorems. The following result shows that such bounds can be obtained when the initial locations of the $N$ particles are i.i.d. and under a more stringent condition on the other parameters.

Theorem 4

Consider the first particle system $\mathbb{IPS}_{1}$ with initial condition $\eta_{0}^{N}\equiv\eta_{0}$ . Suppose that $\{X_{0}^{i,N}\}_{i=1,\ldots,N}$ are i.i.d. with common distribution $\mu_{0}$ for each $N$ . Let

[TABLE]

Suppose that Assumptions 1,4,5 and 9 hold with conditions $\chi_{1}\in(0,1)$ , $\delta\in\Big{[}0,\frac{1-\|A\|}{(2+l^{\nabla,\frac{\alpha}{\delta}}_{PP^{\prime}})K}\Big{)}$ and $\alpha_{1}<\frac{\alpha(\delta)}{\delta}$ . Then there exist $a_{1},a_{2},a^{\prime}_{1},a^{\prime}_{2},a^{\prime\prime}_{1},a^{\prime\prime}_{2}\in(0,\infty)$ and $N_{0},N_{1},N_{2}$ such that for all ${\varepsilon}>0$

[TABLE]

Remark 3.10

(a)

If Assumption 9 is strengthened to $\int e^{\alpha(\delta)\left(A_{1}^{2}(z)+\frac{|B(z)|^{2}}{\delta^{2}}\right)}\theta(dz)<\infty$ for some $\alpha(\delta)>0$ then one can strengthen the conclusion of Theorem 4 as follows: For $\delta,\alpha$ sufficiently small there exist $N_{0},a_{1},a_{2}\in(0,\infty)$ and a nonincreasing function $\varsigma_{2}:(0,\infty)\to(0,\infty)$ such that $\varsigma_{2}(t)\downarrow 0$ as $t\uparrow\infty$ and for all ${\varepsilon}>0$ and $N\geq N_{0}\varsigma_{2}({\varepsilon})$

[TABLE]

(b)

Here the stability condition (3.18), which is a crucial assumption for Lemma 5.4, is not used. Such is the power of the coupling used in Theorem 4.

4 Discussion and Conclusion

This article describes a modified version of the discrete time particle approximation scheme of [3], which incorporates the evolution of particles in a non-compact domain. A similar form of stability condition is obtained under which the nonlinear system has a unique fixed point. Our contribution is to compute quantitative nonasymptotic bounds for these approximation schemes and to show how they relate to the conditions on the tails and smoothness of the transition kernels $P,P^{\prime}$ that were used to model the diffusive environment. As an additional result we obtained a propagation of chaos result for the particle scheme at time $n=\infty.$ A few questions and remarks remain to be addressed in the future.

(a)

Theorem 4 is developed exclusively for $\mathbb{IPS}_{1}$ . For $\mathbb{IPS}_{2}$ we would have an extra term $\mathcal{W}_{1}\left(S^{M}(\bar{\eta}^{M}_{n-1}),\bar{\eta}_{n-1}^{M}\right)$ in the expression of $\mathcal{W}_{1}(\mu_{n}^{N},\mu_{n})$ . The problem then arises in computing a sharper bound (than (5.109)) for

[TABLE]

A concentration bound for the conditional probability can be given in terms of the random quantity $\big{<}e^{\alpha_{1}|x|},\bar{\eta}_{n-1}^{M}\big{>}$ , but an explicit relationship between the bound and the conditional exponential moment is unavailable. After taking expectations it is therefore not possible to conclude whether the upper bound still holds. For illustration, if the conditional concentration bound of $P\big{[}\mathcal{W}_{1}\left(S^{M}(\bar{\eta}^{M}_{n-1}),\bar{\eta}_{n-1}^{M}\right)>\varepsilon\big{|}\bar{\mathcal{F}}_{n-1}^{M,N}\big{]}$ were a concave function of $\big{<}e^{\alpha_{1}|x|},\bar{\eta}_{n-1}^{M}\big{>}$ , then by Jensen’s inequality a reasonable conclusion would hold, but to our knowledge no such explicit relationship is present in the literature.

(b)

The concentration bounds established in [10] for the $\mathcal{W}_{1}$ distance between the empirical distribution of i.i.d. observations and the true distribution are sharp. However, their method can be applied here only for $\mathbb{IPS}_{1}$ , as done in Theorem 4, using the well known coupling construction that works for all Vlasov-McKean type systems. Without using that coupling, we attempted to use the grid based methods of [10] to find sharper bounds for $P[\mathcal{W}_{1}\big{(}(\bar{\mu}_{n}^{N},\bar{\eta}_{n}^{M}),\Psi(\bar{\mu}_{n-1}^{N},\bar{\eta}_{n-1}^{M})\big{)}>\varepsilon]$ along the lines of Theorem 3. We faced a problem similar to that of the previous remark: one can derive a bound for $P[\mathcal{W}_{1}\big{(}(\bar{\mu}_{n}^{N},\bar{\eta}_{n}^{M}),\Psi(\bar{\mu}_{n-1}^{N},\bar{\eta}_{n-1}^{M})\big{)}>\varepsilon\big{|}\bar{\mathcal{F}}_{n-1}^{M,N}]$ keeping $\big{<}e^{\alpha_{1}|x|},\bar{\eta}_{n-1}^{M}\big{>},\big{<}e^{\alpha_{1}|x|},\bar{\mu}_{n-1}^{N}\big{>}$ as constants, but we do not know explicitly how these bounds depend functionally on $\big{<}e^{\alpha_{1}|x|},\bar{\eta}_{n-1}^{M}\big{>},\big{<}e^{\alpha_{1}|x|},\bar{\mu}_{n-1}^{N}\big{>},$ so we cannot conclude anything useful unconditionally. These issues will be addressed in future work.

5 Proofs

The following two elementary lemmas give a basic moment bound that will be used in the proofs. We denote the function $f(\cdot,\cdot,\cdot,x)+\frac{B(x)}{\delta}$ by $f_{\delta}(\cdot,\cdot,\cdot,x).$

Lemma 5.1

For the interacting particle system described in (1.3) and (1.5),

(a)

Suppose Assumptions 1, 2 and 4 hold. Then, for every $n\geq 1,\quad M_{n}=\sup_{N\geq 1}E|X_{n}^{i}|<\infty.$ Moreover, if Assumption 1 holds and $\delta\in(0,a_{0}),$ then $\quad\sup_{n\geq 1}M_{n}<\infty.$

(b)

With the assumptions in part(a) suppose additionally Assumption 7 holds for some $\tau>0$ and suppose $\delta\in(0,a(\tau)^{\frac{1}{1+\tau}}).$ Then

[TABLE]

where in limit $a(\tau)^{\frac{1}{1+\tau}}\to a_{0}$ as $\tau\to 0^{+}$ .

Remark 5.1

Note that the same bounds for $\sup_{n}\sup_{N,M\geq 1}E|\bar{X}_{n+1}^{i}|$ and $\sup_{n}\sup_{N,M\geq 1}E|\bar{X}_{n+1}^{i}|^{1+\tau}$ also hold for $\mathbb{IPS}_{2}$ under the same condition on $\delta$ .

5.0.1 Proof of Lemma 5.1

(a)

We prove the second statement. Proof of the first statement is similar. For each $n\geq 1$ and $i=1,\ldots,N,$ applying Assumption 1 on particle system in (1.3) with definitions of $A_{1}(\cdot)$ and $A_{2}(\cdot)$

[TABLE]

Now by Assumption 4 using DCT one has

[TABLE]

for every $y$ since from Assumption 4 $\sup_{x\in\mathbb{R}^{d}}|\nabla_{y}R^{\alpha}_{\mu_{n}}(x,y)|\leq l^{\nabla,\alpha}_{PP^{\prime}}\,|y|+\sup_{x\in\mathbb{R}^{d}}\big{(}(1-\alpha)|\nabla_{y}P(x,0)|+\alpha|\nabla_{y}P^{\prime}(x,0)|\big{)}.$ Applying the same condition followed by the inequality $|\nabla\eta_{n+1}(y)|\leq\int_{\mathbb{R}^{d}}\eta_{n}(x)|\nabla_{y}R^{\alpha}_{\mu_{n}}(x,y)|dx,$ one has

[TABLE]

Also note by exchangeability $E\|\mu_{n}^{N}\|_{1}=E\int|x|\mu_{n}^{N}(dx)=E|X_{n}^{i}|$ . Taking expectation in (5.1) and using (5.3) and independence between $\epsilon_{n+1}^{i}$ and $\{X_{n}^{j}\}_{j=1}^{N},$ one has

[TABLE]

The assumption on $\delta$ implies that $\gamma:=\|A\|+\delta\sigma\left(2+l^{\nabla,\alpha}_{PP^{\prime}}\right)\in(0,1).$ A recursion on (5.4) will give $M_{n}\leq\gamma^{n}E|X_{0}^{i}|+\frac{\delta[\sigma c^{\nabla,\alpha}_{PP^{\prime}}+\sigma_{2}]}{1-\gamma},$ from which the result follows.

(b)

By Hölder’s inequality, for any four nonnegative real numbers $a_{1},a_{2},a_{3},a_{4}$

[TABLE]

Starting with (5.1) and applying (5.5) and Assumption 1, we have

[TABLE]

For any convex function $\phi(\cdot)$ on $[0,\infty),$ Jensen’s inequality gives $\phi(\|\mu_{n}^{N}\|_{1})\leq\int\phi(|x|)\mu_{n}^{N}(dx)$ $=\frac{1}{N}\sum_{i=1}^{N}\phi(|X_{n}^{i}|).$ Using $\phi(x)=x^{1+\tau},$ after taking expectation one gets the following recursive inequality for $E|X_{n+1}^{i}|^{1+\tau}$ ,

[TABLE]

Note that for our condition on $\delta,\quad\quad\kappa_{1}:=4^{\tau}\bigg{[}\|A\|^{(1+\tau)}+\delta^{1+\tau}\sigma_{1}(\tau)\big{[}(1+l^{\nabla,\alpha}_{PP^{\prime}})^{1+\tau}+1\big{]}\bigg{]}<1.$ Thus

[TABLE]

$\square$
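The two contraction constants appearing in this proof, $\gamma$ in part (a) and $\kappa_{1}$ in part (b), and the way a one-step bound turns into a uniform-in-time bound can be checked numerically. The sketch below assumes that the recursions have the affine form $m_{n+1}\leq\rho\,m_{n}+c$ with $\rho\in(0,1)$ (this matches the closed form stated in part (a); the displays themselves are not reproduced here). All numerical inputs are hypothetical.

```python
def gamma(norm_a, delta, sigma, l_grad):
    """gamma = ||A|| + delta * sigma * (2 + l), as in part (a)."""
    return norm_a + delta * sigma * (2.0 + l_grad)

def kappa1(norm_a, delta, sigma1_tau, l_grad, tau):
    """kappa_1 = 4^tau [ ||A||^(1+tau)
                         + delta^(1+tau) * sigma_1(tau) * ((1+l)^(1+tau) + 1) ]."""
    p = 1.0 + tau
    inner = norm_a ** p + delta ** p * sigma1_tau * ((1.0 + l_grad) ** p + 1.0)
    return 4.0 ** tau * inner

def iterate_bound(m0, rate, const, n):
    """Iterate m_{k+1} = rate * m_k + const (the extremal case) for n steps."""
    m = m0
    for _ in range(n):
        m = rate * m + const
    return m

def closed_form_bound(m0, rate, const, n):
    """rate^n * m0 + const / (1 - rate), an upper bound when 0 <= rate < 1."""
    return rate ** n * m0 + const / (1.0 - rate)

# Hypothetical parameter values: ||A|| = 0.3, delta = 0.1, sigma = 1, l = 0.5.
g = gamma(0.3, 0.1, 1.0, 0.5)
k0 = kappa1(0.3, 0.1, 1.0, 0.5, 0.0)
```

At $\tau=0$ the expression for $\kappa_{1}$ reduces to the form of $\gamma$ (with $\sigma_{1}(0)$ in place of $\sigma$), consistent with the limit $a(\tau)^{\frac{1}{1+\tau}}\to a_{0}$ stated in Lemma 5.1.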

Lemma 5.2

Suppose Assumptions 1,2,4 and 5 hold.

(a)

Consider the interacting particle system described in (1.3) and (1.5). Then, for every $n\geq 1,$

[TABLE]

Moreover if Assumption 1 holds, then under conditions

[TABLE]

one has $\sup_{n\geq 1}\left<|x|,\eta_{n}\right><\infty.$

Additionally assuming $\sup_{N\geq 1}E\left<|x|,\eta_{0}^{N}\right><\infty$ one gets

[TABLE]

(b)

With the assumptions in part (a), suppose additionally that Assumptions 7 and 8 hold for some $\tau>0$ and suppose $\delta\in(0,a(\tau)^{\frac{1}{1+\tau}}).$ Then with condition $(1-\alpha)m_{\tau}(P)<1$ one has $\sup_{n\geq 1}\left<|x|^{1+\tau},\eta_{n}\right><\infty.$ Additionally assuming $\sup_{N\geq 1}E\left<|x|^{1+\tau},\eta_{0}^{N}\right><\infty$ one gets $\sup_{n\geq 1}\sup_{N\geq 1}E\left<|x|^{1+\tau},\eta^{N}_{n}\right><\infty,$ where $a(\tau)^{\frac{1}{1+\tau}}\to a_{0}$ as $\tau\to 0^{+}$ .

Remark 5.2

The second condition in (5.8) is very general. It doesn’t impose any condition on $\alpha\in(0,1).$ The condition holds for all transition kernels $P(x,\cdot),P^{\prime}(x,\cdot)$ with finite first moment. The only thing one needs to check is

[TABLE]

where $g(i)$ is some polynomial in $i$ (For Gaussian it’s linear). If $g(\cdot)$ is an exponential function then it will impose a further lower bound condition on $\alpha$ .

Corollary 5.3

For $\mathbb{IPS}_{2}$ the same conclusions hold for $\bar{\eta}^{M}_{n}$ as for $\eta_{n}^{N}$ in the first particle system in Lemma 5.2, under the same set of conditions on $\delta,\alpha$ . Note that $\bar{\eta}_{0}^{M}=\eta_{0},$ so we don’t need to assume anything about the initial sampling scheme such as $\sup_{M\geq 1}E\left<|x|,\bar{\eta}_{0}^{M}\right><\infty$ (or $\sup_{M\geq 1}E\left<|x|^{1+\tau},\bar{\eta}_{0}^{M}\right><\infty$ ), since these hold automatically for $\eta_{0}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ (or $\eta_{0}\in\mathcal{P}_{1+\tau}^{*}(\mathbb{R}^{d})$ ), respectively.

5.0.2 Proof of Lemma 5.2

We start with the second part of part (a) of the lemma; the first part follows similarly. We will show that if $\eta_{0}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ then $\eta_{n}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ for all $n\geq 1.$ Note that

[TABLE]

From Assumption 5, it is obvious that $P^{\prime}P^{i}f$ is $l(P^{\prime})l(P)^{i}$ Lipschitz if $f$ is a $1$ -Lipschitz function. It implies $|P^{\prime}P^{i}f(x)-P^{\prime}P^{i}f(0)|\leq l(P^{\prime})l(P)^{i}|x|$ for any $f\in\mbox{Lip}_{1}(\mathbb{R}^{d}).$ Since $|x|$ is $1$ -Lipschitz, one has

[TABLE]

Using this inequality one has from (5.9)

[TABLE]

By Assumption 5, $l(P)\leq 1,$ which implies $(1-\alpha)l(P)<1$ . By a derivation similar to that in Lemma 5.1, one has $\sup_{n\in\mathbb{N}}\left<|x|,\mu_{n}\right><\infty$ if $\delta\in(0,a_{0}).$ The result follows using all the conditions

[TABLE]

For $E\left<|x|,\eta^{N}_{k}\right>$ note that for any function $f,$

[TABLE]

From Lemma 5.1, $\sup_{n\geq 0}\sup_{N\geq 1}E\left<|x|,\mu^{N}_{n}\right><\infty$ for $\delta\in(0,a_{0}).$ Putting $f(x)=|x|$ and expanding $\left<|x|,\eta^{N}_{n}\right>$ as in (5.10), after taking expectation one gets a similar bound, and finiteness of $\sup_{n}\sup_{N\geq 1}E\left<|x|,\eta^{N}_{n}\right>$ follows.

$\square$
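The expansion that drives the argument for part (a) can be sketched as follows, assuming that the medium is updated by $\eta_{n}=(1-\alpha)\eta_{n-1}P+\alpha\mu_{n-1}P^{\prime}$ ; the indexing below is our own and may differ from that of (5.9). Iterating the update $n$ times gives

$$\eta_{n}=(1-\alpha)^{n}\eta_{0}P^{n}+\alpha\sum_{i=0}^{n-1}(1-\alpha)^{i}\mu_{n-1-i}P^{\prime}P^{i}.$$

Since $P^{\prime}P^{i}f$ is $l(P^{\prime})l(P)^{i}$ Lipschitz for $f\in\mbox{Lip}_{1}(\mathbb{R}^{d})$ , taking $f(x)=|x|$ yields, for any $\mu\in\mathcal{P}_{1}(\mathbb{R}^{d})$ ,

$$\left<\mu P^{\prime}P^{i},|x|\right>\leq P^{\prime}P^{i}|\cdot|(0)+l(P^{\prime})l(P)^{i}\left<|x|,\mu\right>,$$

and similarly $\left<\eta_{0}P^{n},|x|\right>\leq P^{n}|\cdot|(0)+l(P)^{n}\left<|x|,\eta_{0}\right>$ . Combining the two displays,

$$\left<|x|,\eta_{n}\right>\leq(1-\alpha)^{n}\Big{[}P^{n}|\cdot|(0)+l(P)^{n}\left<|x|,\eta_{0}\right>\Big{]}+\alpha\sum_{i=0}^{n-1}(1-\alpha)^{i}\Big{[}P^{\prime}P^{i}|\cdot|(0)+l(P^{\prime})l(P)^{i}\left<|x|,\mu_{n-1-i}\right>\Big{]}.$$

The terms involving $l(P)^{i}$ are summable because $(1-\alpha)l(P)<1$ and $\sup_{n}\left<|x|,\mu_{n}\right><\infty$ , so the remaining requirement is a growth condition on $i\mapsto P^{\prime}P^{i}|\cdot|(0)$ relative to $(1-\alpha)^{i}$ , which is the type of condition discussed in Remark 5.2.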

Proof of Lemma 5.2(b): From (5.9),

[TABLE]

From Assumption 8 we get the following recursion for $a_{i}:=\left<\mu P^{\prime}P^{i},|x|^{1+\tau}\right>$ for any measure $\mu\in\mathcal{P}_{1+\tau}(\mathbb{R}^{d})$

[TABLE]

since $P|x|^{1+\tau}\leq m_{\tau}(P)(1+|x|^{1+\tau})$ from Assumption 8. Using the fact $a_{0}:=\left<\mu,P^{\prime}|x|^{1+\tau}\right>\leq m_{\tau}(P^{\prime})(1+\left<\mu,|x|^{1+\tau}\right>),$ we finally have

[TABLE]

Under condition $\delta\in(0,a(\tau)^{\frac{1}{1+\tau}})$ and $(1-\alpha)m_{\tau}(P)<1$ one gets $\sup_{n}\left<\eta_{n},|x|^{1+\tau}\right><\infty.$ Similarly the same bound can be derived for $\sup_{n}\sup_{N\geq 1}E\left<|x|^{1+\tau},\eta_{n}^{N}\right>$ under the same set of conditions.

$\square$

5.0.3 Proof of Corollary 5.3

To prove the Corollary about $\bar{\eta}_{n}^{M},$ define the random operator $S^{M}\circ P$ acting on the probability measure $\mu$ on $\mathbb{R}^{d}:\quad\mu(S^{M}\circ P)=(S^{M}(\mu))P.$ Note the following recursive form of $\bar{\eta}_{n}^{M}$ :

[TABLE]

Note that for any function $f$ one has

[TABLE]

Now by expanding $\mu(S^{M}\circ P)^{k}$ one gets,

[TABLE]

Taking expectation one has

[TABLE]

Continuing this calculation $k-1$ times one has $E\left<\mu(S^{M}\circ P)^{k},f\right>=\left<\mu P^{k},f\right>$ which leads to the following expression

[TABLE]

The corollary is proved by observing (5.16). The same bound holds for both $E\left<\bar{\eta}^{M}_{n},f\right>$ , $E\left<\eta^{N}_{n},f\right>$ because of the similarity of bounds of $E\left<f,\mu_{n}^{N}\right>$ , and $E\left<f,\bar{\mu}_{n}^{N}\right>$ for $f(x)=|x|,|x|^{1+\tau},e^{\alpha|x|^{p}}$ which follows from Remark 5.1.

$\square$
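The key identity in this proof is that resampling does not change expectations: $E\left<S^{M}(\mu)P,f\right>=\left<\mu P,f\right>$ , because $S^{M}(\mu)$ is an average of point masses at i.i.d. draws from $\mu$ . The sketch below checks one step of this identity by Monte Carlo on a three-point state space. The measure, kernel and test function are made up for illustration; only the unbiasedness mechanism is taken from the argument above.

```python
import random

def apply_kernel(P, f):
    """(Pf)(x) = sum_y P[x][y] * f[y] on a finite state space."""
    return [sum(p * v for p, v in zip(row, f)) for row in P]

def resampled_integral(mu, P, f, M, rng):
    """One realisation of <S^M(mu) P, f> = (1/M) * sum_i (Pf)(Y_i),
    where Y_1, ..., Y_M are i.i.d. draws from mu."""
    Pf = apply_kernel(P, f)
    draws = rng.choices(list(range(len(mu))), weights=mu, k=M)
    return sum(Pf[y] for y in draws) / M

def exact_integral(mu, P, f):
    """<mu P, f> computed exactly."""
    return sum(m * v for m, v in zip(mu, apply_kernel(P, f)))

# Made-up example data on the state space {0, 1, 2}.
mu = [0.2, 0.5, 0.3]
P = [[0.5, 0.5, 0.0], [0.1, 0.8, 0.1], [0.0, 0.3, 0.7]]
f = [0.0, 1.0, 4.0]

rng = random.Random(1)
reps = 20_000
monte_carlo = sum(resampled_integral(mu, P, f, 4, rng) for _ in range(reps)) / reps
exact = exact_integral(mu, P, f)
```

The Monte Carlo average agrees with the exact value up to sampling error even for a small number of samples $M$ per step, since the identity holds for every $M\geq 1$.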

5.1 Proof of Proposition 3.2

We will prove part (b) of the proposition. Part (a) follows similarly. We start with the following lemma.

Lemma 5.3

(a)

Under Assumptions 1,2,4, for every $\epsilon>0$ and $n\geq 1$ , there exists a compact set $K_{\epsilon,n}\in\mathcal{B}(\mathbb{R}^{d})$ such that

[TABLE]

(b)

Suppose Assumptions 1,2,4,5,6 hold. Then for every $\epsilon>0$ and $k\geq 1$ , there exists a compact set $K_{\epsilon,k}\in\mathcal{B}(\mathbb{R}^{d})$ such that

[TABLE]

This part of the lemma is exclusively for part (b) of the Proposition 3.2.

Proof: Note that for any non-negative $\phi:\mathbb{R}^{d}\to\mathbb{R}$ ,

[TABLE]

To get the desired result from above equalities it suffices to show that

[TABLE]

We will prove (5.19) by induction on $n$ . Once more we suppress $N$ from the super-script. Clearly by our assumptions $\{X_{0}^{i},i=1,...,N;N\geq 1\}$ is uniformly integrable. Now suppose that the Statement (5.19) holds for some $n$ . Note that from (5.1) and (5.3)

[TABLE]

From Assumptions 1 and 2 the families $\{A_{1}(\epsilon_{n+1}^{i});i\geq 1\}$ , $\{A_{2}(\epsilon_{n+1}^{i});i\geq 1\}$ and $\{B_{2}(\epsilon_{n+1}^{i});i\geq 1\}$ are uniformly integrable. Now by exchangeability, $\frac{1}{N}\sum_{i=1}^{N}|X_{n}^{i}|=E\Big{[}|X_{n}^{i}|\Big{|}\sigma\Big{(}\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{n}^{i}}\Big{)}\Big{]}.$ If $\{X_{\alpha}:\alpha\in\Gamma_{1}\}$ is uniformly integrable, and $\{\sigma_{\beta},\beta\in\Gamma_{2}\}$ is a collection of $\sigma$-fields where $\Gamma_{1},\Gamma_{2}$ are arbitrary index sets, then $\{E(X_{\alpha}|\sigma_{\beta}),(\alpha,\beta)\in\Gamma_{1}\times\Gamma_{2}\}$ is also a uniformly integrable family. It follows from the induction hypothesis that $\{\frac{1}{N}\sum_{i=1}^{N}|X_{n}^{i}|,N\geq 1\}$ is a uniformly integrable family. Using (5.19) again along with independence between $\{\epsilon_{n+1}^{i},i=1,\ldots,N\}$ and $\{X_{n}^{i}:i=1,\ldots,N;N\geq 1\}$ yields that the family $\{|X_{n+1}^{i}|:i=1,\ldots,N;N\geq 1\}$ is uniformly integrable. The result follows. $\quad\square$

Proof of Lemma 5.3(b): Note that $S^{M}(\bar{\eta}_{k}^{M})=\frac{1}{M}\sum_{i=1}^{M}\delta_{Y^{i,M}_{k}}$ where $\{Y^{i,M}_{k}\}_{i=1}^{M}\bigg{|}\mathcal{F}^{M,N}_{k}$ are i.i.d from $\bar{\eta}_{k}^{M}.$ So for any non-negative function $\phi$ we have

[TABLE]

The result follows if we can show that the family

[TABLE]

We will prove (5.21) through induction on $k$ . For $k=0,$ the result follows trivially since $\{Y^{i,M}_{0},i=1,\ldots,M;M\geq 1\}$ are i.i.d from $\eta_{0}.$ Suppose it holds for $k=n.$ We will show that both,

[TABLE]

Then from the structure $\bar{\eta}_{n+1}^{M}=(1-\alpha)S^{M}(\bar{\eta}_{n}^{M})P+\alpha\bar{\mu}_{n}^{N}P^{\prime},$ it is evident that $\{\bar{\eta}_{n+1}^{M}:M,N\geq 1\}$ is uniformly integrable, which equivalently implies that $\{Y_{n+1}^{i,M}:i=1,\ldots,M;M,N\geq 1\}$ is uniformly integrable too. To prove the first assertion in (5.22), note that due to the exchangeability of $\{Y_{n}^{i,M}:i=1,\ldots,M\},$ one has

[TABLE]

We know that if $\{Z_{\alpha},\alpha\in\Gamma_{1}\}$ is a uniformly integrable family and $\{\mathcal{H}_{\beta},\beta\in\Gamma_{2}\}$ is a collection of $\sigma$ -fields where $\Gamma_{1},\Gamma_{2}$ are arbitrary index sets, then $\{E(Z_{\alpha}\mid\mathcal{H}_{\beta}),(\alpha,\beta)\in\Gamma_{1}\times\Gamma_{2}\}$ is a uniformly integrable family. So from (5.23) it suffices to prove that $\{\delta_{Y_{n}^{i,M}}P:i=1,\ldots,M;M,N\geq 1\}$ is uniformly integrable. Define a function $f_{k}(.)$ such that $f_{k}(x)=0$ if $|x|\in[0,\frac{k}{2}]$ , $f_{k}(x)=|x|$ if $|x|\geq k$ , and $f_{k}$ is linear in $|x|$ in between. Then by construction $f_{k}(.)$ is Lipschitz with coefficient 2 and $|x|1_{\{|x|>k\}}\leq f_{k}(x)$ for all $x\in\mathbb{R}^{d}.$ By Assumption 6 we have that $\{P(z,.):z\in K\}$ is uniformly integrable. So taking the compact set $K=\{|x|\leq k\}$ and assuming $Y_{n}^{i,M}$ has unconditional law $m^{n}_{i}$ for all $i=1,\ldots,M,$ the quantity

[TABLE]

The display in (5.24) follows from Assumption 5 and the Lipschitz property of $f_{k}.$ After taking the supremum over the set $\{i=1,\ldots,M;M,N\geq 1\}$ on both sides of (5.25), the second part of the R.H.S goes to $0$ as $L\to\infty$ by the induction hypothesis. For the first part, $Pf_{k}(0)$ goes to $0$ as $k\to\infty$ by D.C.T (since $\int|y|P(0,dy)<\infty$ ), and $\int_{|z|>L}m_{i}^{n}(dz)$ converges to $0$ (as $L$ goes to $\infty$ ) due to the tightness of $\{m_{i}^{n}:i=1,\ldots,M;M,N\geq 1\}$ , which also follows from the induction hypothesis. The second assertion, that $\{\bar{\mu}_{n}^{N}P^{\prime}:N\geq 1\}$ is uniformly integrable, follows similarly through induction.

$\square$
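The truncation function $f_{k}$ used in the proof of Lemma 5.3(b) is easy to write down explicitly: on the interpolation region $\frac{k}{2}<|x|<k$ it equals $2|x|-k$ , which is where the Lipschitz coefficient $2$ comes from. The sketch below implements it for points of $\mathbb{R}^{d}$ given as tuples (a representation chosen here for illustration) so that the two properties used in the proof can be checked on examples.

```python
import math

def f_k(x, k):
    """Truncation function: 0 for |x| <= k/2, |x| for |x| >= k,
    and linear in |x| (namely 2|x| - k) in between."""
    r = math.sqrt(sum(c * c for c in x))  # Euclidean norm of x
    if r <= k / 2:
        return 0.0
    if r >= k:
        return r
    return 2.0 * r - k

def tail_part(x, k):
    """|x| * 1_{|x| > k}, the quantity that f_k dominates."""
    r = math.sqrt(sum(c * c for c in x))
    return r if r > k else 0.0

# Values at three points in dimension one, with k = 1.
values = [f_k((0.4,), 1.0), f_k((0.75,), 1.0), f_k((3.0,), 1.0)]
```

Because $f_{k}$ is continuous and Lipschitz, Assumption 5 can be applied to $Pf_{k}$ , which is not possible for the discontinuous function $|x|1_{\{|x|>k\}}$ itself.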

We will proceed to the main proof via induction on $n\in\mathbb{N}$ for the quantity $E\left[\mathcal{W}_{1}(\bar{\mu}_{n}^{N},\mu_{n})+\mathcal{W}_{1}(\bar{\eta}_{n}^{N},\eta_{n})\right]$ . For $n=0$ , we will first show that $E\mathcal{W}_{1}(\bar{\mu}_{0}^{N},\mu_{0})\to 0$ as $N\to\infty.$ From [16] we have

[TABLE]

From Lemma 5.3 one can construct a compact ball $K_{0,\epsilon}$ containing $0,$ such that $E\left<|x|.1_{K^{c}_{0,\epsilon}},\bar{\mu}_{0}^{N}\right><\frac{\epsilon}{2}$ and $\left<|x|.1_{K^{c}_{0,\epsilon}},\mu_{0}\right><\frac{\epsilon}{2}$ hold. So, using the fact that $|f(x)|\leq|x|$ for any $f\in\mbox{Lip}_{1}(\mathbb{R}^{d})$ with $f(0)=0,$ one has

[TABLE]

In the last display we used the fact that $\sup_{x\in K_{0,\epsilon}}|f(x)|\leq\text{diam}(K_{0,\epsilon})$ . Note that $\beta(\bar{\mu}_{0}^{N},\mu_{0})$ is bounded by $2$ (hence uniformly integrable), so $\beta(\bar{\mu}_{0}^{N},\mu_{0})\overset{p}{\to}0$ implies $E\beta(\bar{\mu}_{0}^{N},\mu_{0})\to 0$ as $N\to\infty$ , proving the assertion (3.12) for $n=0$ . Suppose it holds for $n\leq k.$ We start with the following triangle inequality

[TABLE]

Consider the third term of (5.27). From the general calculations followed by (5.45)-(5.47), we have the following estimate,

[TABLE]

Now we consider the first term of the right hand side of (5.27). We will use Lemma 5.3(a). Fix $\epsilon>0$ and let $K_{\epsilon}$ be a compact set in $\mathbb{R}^{d}$ such that

[TABLE]

Let $\mbox{Lip}_{1}^{0}(\mathbb{R}^{d}):=\{f\in\mbox{Lip}_{1}(\mathbb{R}^{d}):f(0)=0\}$ . Then,

[TABLE]

We will now apply Lemma A.1 in the Appendix. Note that for any $\phi\in\mbox{Lip}_{1}^{0}(\mathbb{R}^{d})$ , $\sup_{x\in K_{\epsilon}}|\phi(x)|\leq\text{diam}(K_{\epsilon}):=m_{\epsilon}.$

Thus with notation as in Lemma A.1

[TABLE]

where we have denoted the restrictions of $\bar{\mu}_{k+1}^{N}$ and $\bar{\mu}_{k}^{N}Q^{\bar{\eta}_{k}^{N}}$ to $K_{\epsilon}$ by the same symbols. Using the above inequality in (5.29), we obtain

[TABLE]

Using Lemma A.2 we see that the first term on the right hand side can be bounded by $\frac{2m_{\epsilon}|\mathcal{F}^{\epsilon}_{m_{\epsilon,1}}(K_{\epsilon})|}{\sqrt{N}}$ .

Consider the second term of R.H.S of (5.27). From Assumption 4 applying DCT one has

[TABLE]

Suppose $\bar{X}_{k}$ is a random variable which, conditioned on $\mathcal{F}_{k}^{M,N}$, is distributed with law $\bar{\mu}_{k}^{N}$ . Then almost surely $\mathcal{W}_{1}(\bar{\mu}_{k}^{N}Q^{\bar{\eta}_{k}^{N},\bar{\mu}_{k}^{N}},\bar{\mu}_{k}^{N}Q^{\eta_{k},\bar{\mu}_{k}^{N}})$ is

[TABLE]

(5.34) follows by using Assumption 4. About the first term in (5.34) note that from triangular inequality,

[TABLE]

The first term in (5.35) can be written as

[TABLE]

By Lemma 5.3(b), for a specified $\epsilon>0,$ one can construct a compact set $K_{k,\epsilon}$ containing $0$ such that,

[TABLE]

Denote $m_{k,\epsilon}=\text{diam}(K_{k,\epsilon})$ . Using Lemma A.1 we have the L.H.S of (5.36)

[TABLE]

where (5.36) follows from arguments similar to those used in (5.31). Note that Lemma 5.3 also shows that the compact set $K_{k,\epsilon}$ is non-random and depends only on $k$ and $\epsilon$. So from the display above we have

[TABLE]

Using Lemma A.2 we get the final bound of the first term in RHS of (5.37) as $\frac{2m_{k,\epsilon}|\mathcal{F}^{\epsilon}_{m_{k,\epsilon,1}}(K_{k,\epsilon})|}{\sqrt{M}}$ . Combining this estimate with (5.28),(5.31) and (5.34) we now have

[TABLE]

For the term $E\mathcal{W}_{1}(\bar{\eta}_{k+1}^{M},\eta_{k+1}),$ we start with the following recursive form

[TABLE]

which leads to the following inequality

[TABLE]

Using earlier estimates one has the final estimate for

[TABLE]

Adding (5.38) and (5.41), using induction hypothesis and sending $M,N\to\infty$ we have

[TABLE]

Since $\epsilon>0$ arbitrary, the result follows.

Part (a) can be proved similarly. The change will come from the structural difference of $\bar{\eta}^{N}_{k}$ and $\eta^{N}_{k}$ because of the change in the updating kernel. So the term coming from the quantity $S^{M}(\bar{\eta}_{k}^{M})-\bar{\eta}_{k}^{M}$ won’t appear here. Hence we get the following final estimate

[TABLE]

from which the result follows by induction.

$\square$
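As a concrete reminder of the quantity controlled in the proof above, the following is a minimal sketch of the $\mathcal{W}_{1}$ distance between two empirical measures in the special case $d=1$ with equally many atoms, where the optimal coupling pairs the order statistics. The function name `w1_empirical_1d` and the Gaussian test data are illustrative assumptions and do not come from the paper.

```python
import random


def w1_empirical_1d(xs, ys):
    """W1 distance between two empirical measures on R with equally many atoms.

    In one dimension the optimal coupling matches sorted samples, so the
    distance is the average gap between corresponding order statistics.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample sizes")
    xs_sorted = sorted(xs)
    ys_sorted = sorted(ys)
    return sum(abs(x - y) for x, y in zip(xs_sorted, ys_sorted)) / len(xs)


# Illustrative data (assumption): a Gaussian sample and a shifted copy of it.
rng = random.Random(0)
sample = [rng.gauss(0.0, 1.0) for _ in range(1000)]
shifted = [x + 3.0 for x in sample]

print(w1_empirical_1d(sample, sample))   # distance of a measure to itself
print(w1_empirical_1d(sample, shifted))  # distance to a translate by 3
```

Translating every atom by the same amount moves the measure by exactly that amount in $\mathcal{W}_{1}$, which gives a simple sanity check for the routine.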

5.2 Proof of Proposition 3.4

The technique we use is very similar to the contraction-based method used in [3]. We will start with the following lemma and then use it to prove Proposition 3.4. Define the following distance on $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ for $(\mu_{n},\eta_{n}),(\mu^{\prime}_{n},\eta^{\prime}_{n})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$

[TABLE]

Note that it is a complete separable metric on the space $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d}).$

Lemma 5.4

Let $\mu_{0},\mu^{\prime}_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ and $\eta_{0},\eta^{\prime}_{0}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ . Suppose Assumptions 1, 2, 4 and 5 hold. Then the transformation $\Psi:\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\to\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ is well defined if the following hold

[TABLE]

Moreover, if Assumptions 3, 4 and 5 hold together with the following condition:

[TABLE]

Then there exist a $\theta\in(0,1)$ and a constant $a_{1}\in(0,\infty)$ such that for any $n\in\mathbb{N},$

[TABLE]

Remark 5.4

The condition (5.43) implies the first condition of (5.42) while the second one is very general.

5.2.1 Proof of Lemma 5.4

For fixed $\mu_{0},\mu^{\prime}_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ and $\eta_{0},\eta^{\prime}_{0}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ define the following quantities for $n\geq 1$

[TABLE]

First we will show that under the transformation $\Psi$ one has $(\mu_{n},\nu_{n})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ whenever $(\mu_{0},\nu_{0})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}^{*}_{1}(\mathbb{R}^{d}),$ so that the quantity $\mathcal{W}_{1}(\mu_{n},\mu^{\prime}_{n})+\mathcal{W}_{1}(\nu_{n},\nu^{\prime}_{n})$ is well defined. Note that if $\delta\in(0,a_{0}),$ then $\gamma=\|A\|+\delta\sigma\left(2+l^{\nabla,\alpha}_{PP^{\prime}}\right)\in(0,1),$ implying

[TABLE]

which follows similarly from the proof of Lemma 5.1(a). It means if $\delta\in(0,a_{0})$ and $\left<|x|,\mu_{0}\right><\infty$ hold, then $\mu_{n}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ for all $n\geq 1$ . Under conditions in (5.42) one also has $\sup_{n>0}\left<|x|,\eta_{n}\right><\infty$ for all $n\in\mathbb{N}.$ One has $\nabla\eta_{n+1}(y)=\int_{\mathbb{R}^{d}}\eta_{n}(x)[\nabla_{y}R^{\alpha}_{\mu_{n}}(x,y)]dx$ by Assumption 4 using DCT. From that condition it follows that for any $n\geq 1$ , $\|\nabla\eta_{n}(\cdot)\|_{1}<(1-\alpha)l_{P}^{\nabla}+\alpha l^{\nabla}_{P^{\prime}}=l^{\nabla,\alpha}_{PP^{\prime}}<\infty$ showing $\eta_{n}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ for all $n>0$ if $\eta_{0}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ .

Now we will go back to the proof of the second part of the lemma regarding the contraction part. Assume $n\geq 2$ . The first term of $\mathcal{W}_{1}((\mu_{n},\eta_{n}),(\mu^{\prime}_{n},\eta^{\prime}_{n}))$ can be expressed as

[TABLE]

The last inequality (5.45) follows from Assumption 1. As a consequence of Assumption 4 from (5.2) it follows that

[TABLE]

With that estimate, taking infimum at R.H.S of (5.45) with all possible couplings of $(X,Y)$ with marginals respectively $\mu_{n-1}$ and $\mu^{\prime}_{n-1}$ , one gets

[TABLE]

Let $X$ be a $\mathbb{R}^{d}$ valued random variable with law $\mu^{\prime}_{n-1}$ . Now about the term $T_{2}$ ,

[TABLE]

Note that

[TABLE]

Since from Assumption 4 $\nabla_{y}P^{\prime}(\tilde{x},x)$ is a Lipschitz function with coefficient $l^{\nabla}_{P^{\prime}}$ , the first integrand in (5.49) will be bounded by $l^{\nabla}_{P^{\prime}}.\mathcal{W}_{1}(\mu_{n-2},\mu^{\prime}_{n-2})$ which gives

[TABLE]

Now using Assumption 3 the second term $T^{(2)}_{2}$ gives similarly

[TABLE]

Using the Assumption 5 we have

[TABLE]

Combining (5.50),(5.51) and (5.52) we have the following recursion for $n\geq 2,$

[TABLE]

Define a sequence $a_{n}:=\mathcal{W}_{1}(\mu_{n},\mu^{\prime}_{n})+\mathcal{W}_{1}(\eta_{n},\eta^{\prime}_{n})$ for $n\geq 2,$ and set the first two terms to be

[TABLE]

which are well defined for $\mu_{0},\mu^{\prime}_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ and $\eta_{0},\eta^{\prime}_{0}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d}).$ Then from (5.53), denoting $c_{1}:=\max\left\{\left(\big{(}\|A\|+\delta\sigma(2+l_{PP^{\prime}}^{\nabla,\alpha})\big{)}+\alpha l(P^{\prime})\right),(1-\alpha)l(P)\right\}$ , $c_{2}:=\delta\sigma\max\big{\{}\alpha l_{P^{\prime}}^{\nabla},(1-\alpha)l_{P}^{\nabla}\big{\}}$, the following holds

[TABLE]

for $n\geq 2.$ Given $(\omega,\delta,\alpha)$ if there exists a $\theta\in(0,1)$ for which the following inequality holds

[TABLE]

then denoting $\lambda=\frac{c_{2}}{\theta},$ we have

[TABLE]

Existence of a solution $\theta\in(0,1)$ satisfying (5.55) is valid under $c_{1}+c_{2}<1$ which is equivalent to the condition

[TABLE]

in (5.43) satisfied by $(\delta,\alpha,\|A\|)$ . From (5.57) it follows

[TABLE]

for $n\geq 2$ . Since

[TABLE]

where $X\sim\mu^{\prime}_{0}.$ Final estimate for $a_{n}$ is

[TABLE]

Since $X\sim\mu^{\prime}_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ and $\nabla\eta_{0},\nabla\eta^{\prime}_{0}$ have linear growth (since $\eta_{0},\eta^{\prime}_{0}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ ), the second term inside the bracket is finite. A general formula can be observed for $a_{n}$

[TABLE]

where

[TABLE]

Observe that the quantity inside the bracket of RHS of (5.58) is finite for $\mu_{0},\mu^{\prime}_{0}\in\mathcal{P}_{1}(\mathbb{R}^{d})$ and $\eta_{0},\eta^{\prime}_{0}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ . This proves the lemma.

$\square$
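The contraction step in the proof of Lemma 5.4 can be illustrated numerically. The sketch below assumes that the recursion has the two-step form $a_{n}\leq c_{1}a_{n-1}+c_{2}a_{n-2}$ and that (5.55) reads $c_{1}+c_{2}/\theta\leq\theta$; these forms are inferred from the surrounding text (in particular from $\lambda=c_{2}/\theta$ and the condition $c_{1}+c_{2}<1$), since the displays themselves are not reproduced here. The function names and the numerical values of $c_{1},c_{2}$ are hypothetical.

```python
import math


def contraction_rate(c1, c2):
    """Smallest theta > 0 with c1 + c2/theta <= theta.

    It is the positive root of theta^2 = c1*theta + c2, and it lies in (0, 1)
    exactly when c1 + c2 < 1 (assumed form of condition (5.55)).
    """
    if c1 < 0 or c2 <= 0 or c1 + c2 >= 1:
        raise ValueError("need c1 >= 0, c2 > 0 and c1 + c2 < 1")
    return 0.5 * (c1 + math.sqrt(c1 * c1 + 4.0 * c2))


def worst_case_sequence(c1, c2, a0, a1, n_steps):
    """Iterate a_n = c1*a_{n-1} + c2*a_{n-2}, the extremal case of the recursion."""
    a = [a0, a1]
    for _ in range(2, n_steps + 1):
        a.append(c1 * a[-1] + c2 * a[-2])
    return a


# Hypothetical constants chosen only so that c1 + c2 < 1.
c1, c2 = 0.5, 0.2
theta = contraction_rate(c1, c2)
lam = c2 / theta  # the weight lambda = c2 / theta from the proof

a = worst_case_sequence(c1, c2, a0=1.0, a1=1.0, n_steps=30)

# b_n := a_n + lam * a_{n-1} contracts by the factor theta at every step.
b1 = a[1] + lam * a[0]
for n in range(1, len(a)):
    assert a[n] + lam * a[n - 1] <= theta ** (n - 1) * b1 + 1e-12

print(theta, a[-1])
```

With this choice of $\theta$ the weighted quantity $a_{n}+\lambda a_{n-1}$ decays geometrically, which in turn forces $a_{n}$ itself to decay at rate $\theta^{n}$ up to a constant.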

We now complete the proof of the proposition. Given $l(PP^{\prime})<1$ from Assumption (5), one can always find $(\omega_{0},\alpha_{0},\delta_{0})\in(0,1)\times(0,1)\times(0,1)$ for which (5.57) holds under

[TABLE]

For existence we need to show that under $\mathcal{W}_{1}\left((\cdot,\cdot),(\cdot,\cdot)\right)$ distance $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})$ is complete. From Lemma 5.4 one can choose $(\omega,\alpha,\delta)$ such that (5.43) holds. It follows that, using the $\theta$ from that lemma, the sequence $\{\Psi^{n}(\mu_{0},\eta_{0})\}_{n\geq 1}^{\infty}$ is a Cauchy sequence in $\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}(\mathbb{R}^{d})$, which is a complete metric space under $\mathcal{W}_{1}\left((\cdot,\cdot),(\cdot,\cdot)\right).$ So there exists a $(\mu_{\infty},\eta_{\infty})\in\mathcal{P}_{1}(\mathbb{R}^{d})\times\mathcal{P}_{1}(\mathbb{R}^{d})$ such that $\Psi^{n}(\mu_{0},\eta_{0})\to(\mu_{\infty},\eta_{\infty})$ as $n\to\infty$ . Our assertion for existence will be proved if we prove $\eta_{\infty}\in\mathcal{P}_{1}^{*}(\mathbb{R}^{d}).$ Given the initial condition $\|\nabla\eta_{0}(x)\|_{1}<\infty,$ we will always have from (5.2) $\|\nabla\eta_{k}(x)\|_{1}<\infty\quad\forall\quad k>1.$ Note that for $\eta_{0}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ , one has $\eta_{k}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d})$ for all $k.$ This implies $\eta_{\infty}\in\mathcal{P}^{*}_{1}(\mathbb{R}^{d}).$ So

[TABLE]

Observe further for $\theta\in(0,1)$ in (5.58) of Lemma 5.4

[TABLE]

Uniqueness of fixed points follows immediately from (5.59).

$\square$

5.3 Proof of Theorem 1

We will prove part (b) of the theorem. Part (a) will follow similarly. We need to prove the following Lemma first.

Lemma 5.5

Consider the second particle system $\mathbb{IPS}_{2}.$ Suppose that Assumptions 7,8 hold. Denote $N_{1}=\min{\{N,M\}}.$ Then there exists a constant $C\in(0,\infty)$ such that the quantity $\sup_{k\geq 1}E\mathcal{W}_{1}\big{(}(\bar{\mu}_{k}^{N},\bar{\eta}_{k}^{M}),\Psi(\bar{\mu}_{k-1}^{N},\bar{\eta}_{k-1}^{M})\big{)}$ is bounded above by $b(N_{1},\tau,d)$ as defined in Theorem 1. The constant $C$ will vary for different cases.

5.3.1 Proof of Lemma 5.5

We start with the fact that

[TABLE]

In order to bound both terms in (5.60) we borrow the following formulation from [10] about the convergence rate of empirical distribution of iid random variables to its common distribution, where the key idea of bounding Wasserstein distance came from the constructive quantization context [9]. A similar idea was also developed in [1]. We will maintain the same notation used in [10]. Let $\mathcal{P}_{l}$ be the natural partition of $(-1,1]^{d}$ into $2^{dl}$ translations of $(-2^{-l},2^{-l}]^{d}.$ Define a sequence of sets $\{B_{n}\}_{n\geq 0}$ such that $B_{0}:=(-1,1]^{d}$ and, for $n\geq 1$ , $B_{n}:=(-2^{n},2^{n}]^{d}\setminus(-2^{n-1},2^{n-1}]^{d}.$ For a set $F\subset\mathbb{R}^{d}$ denote the set $2^{n}F$ as $\{2^{n}x:x\in F\}.$ For any two probability measures $\mu$ and $\nu$ , combining Lemma $5$ and $6$ of [10] one has the following inequality for the Wasserstein- $1$ distance,

[TABLE]

where $C$ is a constant depending only on $d.$ We denote $a_{k}^{i,M,N}:=\delta_{\bar{X}_{k}^{i}}-\delta_{\bar{X}_{k-1}^{i}}Q^{\bar{\eta}_{k-1}^{M},\bar{\mu}_{k-1}^{N}}.$ It follows that $\bar{\mu}_{k}^{N}-\bar{\mu}_{k-1}^{N}Q^{\bar{\eta}_{k-1}^{M},\bar{\mu}_{k-1}^{N}}=\frac{1}{N}\sum_{i=1}^{N}a_{k}^{i,M,N}.$ Note that, conditioned on $\mathcal{F}_{k-1}^{M,N},$ the family of signed measures $\{a_{k}^{i,M,N}\}_{i=1,\ldots,N}$ is independent, while unconditionally its members are only identically distributed. Using the fact that for any set $A\in\mathcal{B}(\mathbb{R}^{d}),\quad\delta_{\bar{X}_{k}^{i}}(A)\bigg{|}\mathcal{F}_{k-1}^{M,N}\sim\text{Bernoulli}(\delta_{\bar{X}_{k-1}^{i}}Q^{\bar{\eta}_{k-1}^{M},\bar{\mu}_{k-1}^{N}}(A)),$ we have

[TABLE]

which implies the unconditional expectation $E\big{[}\big{(}a_{k}^{i,M,N}(A)\big{)}^{2}\big{]}\leq P\big{[}\bar{X}^{i}_{k-1}+\delta f_{\delta}(\nabla\bar{\eta}^{M}_{k-1},\bar{\mu}_{k-1}^{N},\bar{X}^{i}_{k-1},\epsilon^{N}_{k})\in A\big{]}.$ Using all these we have

[TABLE]

Using these with Cauchy-Schwarz inequality one gets following bound

[TABLE]

where second term inside the bracket of RHS of (5.63) follows trivially. Denoting the whole constant in R.H.S of (5.61) as $C_{d},$ we have

[TABLE]

Note that $\#\mathcal{P}_{l}=2^{dl}$ . Using Cauchy-Schwarz inequality with (5.63) and Jensen’s inequality $E\sqrt{X}\leq\sqrt{EX}$ for non-negative random variable $X$ , the last sum $E\sum_{F\in\mathcal{P}_{l}}\big{[}\bar{\mu}_{k}^{N}(2^{n}F\cap B_{n})-\bar{\mu}_{k-1}^{N}Q^{\bar{\eta}_{k-1}^{M},\bar{\mu}_{k-1}^{N}}(2^{n}F\cap B_{n})\big{]}$ in the R.H.S of (5.64) can be bounded by

[TABLE]

Now using Remark 5.1 along with Lemma 5.1, if $\delta\in(0,a(\tau))$ the quantity $\sup_{n\geq 0}\sup_{M,N\geq 1}E|\bar{X}_{n}^{i}|^{1+\tau}:=b(\tau)<\infty,$ one has by Chebyshev inequality for $n\geq 1,$

[TABLE]

Note that $a(\tau)^{\frac{1}{1+\tau}}\to a_{0}$ as $\tau\to 0$ and $\delta\in(0,a_{0}),$ we can find $\tau_{0}\in(0,a(\tau))$ such that $\delta\in(0,a(\tau_{0})^{\frac{1}{1+\tau_{0}}}).$ So the bound in (5.64) can be restated as

[TABLE]

where $b(\tau)$ is just a constant and the last display is obtained by accumulating upper bounds of all the constants to $C^{\prime}_{d}$ . Now proceeding exactly like step 1 to step 4 of the proof of Theorem 1 (for $p=1,q=1+\tau$ ) in [10] one gets the following bounds

[TABLE]

Now we will fill the gaps for each of the three special cases $\tau=1,\tau=1$ and $\tau=\frac{1}{d-1}$ of the three regimes $d=1,d=2$ and $d>2$ respectively. We note that one can generalize the choice of $l_{N,\varepsilon}$ made in step 1 of Theorem 1 of [10], where $l_{N,\varepsilon}$ could be taken as $\frac{\frac{1}{2}\log(\varepsilon N)}{d\log 2}\vee 0$ instead of $\frac{\log(2+\varepsilon N)}{d\log 2}$, though it doesn’t change the conclusion of the main theorem. After step $1$ with $p=1,q=1+\tau,\varepsilon=2^{-(1+\tau)n}$ one will get

[TABLE]

where the constant $C$ will vary from case to case. Suppose $d=1.$ From (5.66) for general $\tau>0$ one has

[TABLE]

Note that for $n\geq n_{N,\tau}:=\frac{\log N}{(1+\tau)\log 2},$ one has $2^{-(1+\tau)n}\leq\left(\frac{2^{-(1+\tau)n}}{N}\right)^{\frac{1}{2}}.$ So for $\tau=1,$

[TABLE]

For $d=2,$ from (5.66) for general $\tau>0$ one has

[TABLE]

For $\tau=1,$ $\varepsilon=2^{-2n}.$ Note that if $n<n^{(2)}_{N}:=\log_{4}N-\log_{2}\left(\log N\right),$ then one has

[TABLE]

By proceeding similarly, for all non-regular cases we obtain the following results (the constant $C$ will vary from case to case):

[TABLE]

Now consider the second term of (5.60). Using (5.61), the upper bound of $E\mathcal{W}_{1}(S^{M}(\bar{\eta}_{k-1}^{M}),\bar{\eta}_{k-1}^{M})$ is

[TABLE]

By Cauchy Schwarz inequality and using Jensen inequality $E\sqrt{X}\leq\sqrt{EX}$ for a nonnegative random variable $X,$ one gets the upperbound of

[TABLE]

Using similar argument used in (5.62) the R.H.S of (5.71) will be less than

[TABLE]

Finally using Jensen inequality $E\sqrt{X}\leq\sqrt{EX},$ and from Corollary 5.3 followed by Lemma 5.2(b) denoting $c(\tau):=\sup_{k\geq 1}\sup_{M\geq 1}E\left<|x|^{1+\tau},\bar{\eta}_{k-1}^{M}\right>$ one gets

[TABLE]

Hence the conclusion about the upper bound of $E\mathcal{W}_{1}(S^{M}(\bar{\eta}_{k-1}^{M}),\bar{\eta}_{k-1}^{M})$ will be similar to the first term of (5.60). It will be a function of the sample size of the concentration gradient $M$ in place of $N$ in the bound of $E\mathcal{W}_{1}(\bar{\mu}_{k}^{N},\bar{\mu}_{k-1}^{N}Q^{\bar{\eta}_{k-1}^{M},\bar{\mu}_{k-1}^{N}})$ . Combining this with the conclusion about the first term of (5.60) we can state the bound in terms of $N_{1}=\min\{M,N\}$ and the result of Lemma 5.5 will follow.

$\square$
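The dyadic construction borrowed from [10] in the proof of Lemma 5.5 can be made concrete with a short sketch. It locates the shell $B_{n}$ containing a point of $\mathbb{R}^{d}$ and the cell of the partition $\mathcal{P}_{l}$ containing a point of $(-1,1]^{d}$, using the half-open conventions stated above. The function names (`in_cube`, `shell_index`, `cell_index`) are illustrative assumptions, not notation from the paper or from [10].

```python
import math


def in_cube(x, r):
    """True if x lies in the half-open cube (-r, r]^d."""
    return all(-r < xi <= r for xi in x)


def shell_index(x):
    """Index n such that x lies in B_n.

    B_0 = (-1, 1]^d and B_n = (-2^n, 2^n]^d minus (-2^(n-1), 2^(n-1)]^d for n >= 1.
    Because the cubes are nested, the first cube containing x identifies the shell.
    """
    n = 0
    while not in_cube(x, 2.0 ** n):
        n += 1
    return n


def cell_index(x, l):
    """Per-coordinate index of the cell of P_l containing x in (-1, 1]^d.

    P_l consists of 2^(d*l) translates of (-2^-l, 2^-l]^d, so each axis is cut
    into 2^l half-open intervals of length 2^(1-l).
    """
    if not in_cube(x, 1.0):
        raise ValueError("x must lie in (-1, 1]^d")
    cells_per_axis = 2 ** l
    return tuple(math.ceil((xi + 1.0) * cells_per_axis / 2.0) - 1 for xi in x)


print(shell_index((0.5, -0.3)))   # inside (-1, 1]^2
print(shell_index((3.0, 0.0)))    # inside (-4, 4]^2 but outside (-2, 2]^2
print(cell_index((0.0, 0.5), 1))  # level-1 cell in d = 2
```

Counting the distinct values of `cell_index` over a fine grid recovers $\#\mathcal{P}_{l}=2^{dl}$, the cardinality used in the Cauchy-Schwarz step.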

Now we will complete the proof of the theorem. Observe the following identity

[TABLE]

Using the triangle inequality and Lemma 5.4, the following holds

[TABLE]

where (5.74) follows from (5.58) with specified constants $a$ and $b$ and $\bar{\mu}^{(i-1)}_{M,N}:=\bar{\mu}^{N}_{i-1}Q^{\bar{\eta}_{i-1}^{M},\bar{\mu}^{N}_{i-1}}$ . Let $X_{i}^{M,N}$ be a random variable, conditioned on $\mathcal{F}_{i-1}^{M,N},$ sampled from $\bar{\mu}^{(i-1)}_{M,N}.$ We have

[TABLE]

Last display follows from Assumption 4. Since $\bar{\eta}_{0}^{M}=\eta_{0},$ one has

[TABLE]

Combining the results (5.75),(5.76), with (5.74) we get for each $n,$

[TABLE]

Using Lemma 5.5 the result follows.

$\square$

5.4 Proof of Corollary 3.6:

Using triangular inequality and from (5.58) one gets

[TABLE]

Combining this with (5.77) we get

[TABLE]

The result is obvious after using Lemma 5.5.

$\square$

5.5 Proof of Theorem 2:

Fix $N$ and $M$ . Define $\Theta_{n}^{N,M}\in\mathcal{P}((\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d}))$ as

[TABLE]

for $N\geq 1,M\geq 1$ and $n\in\mathbb{N}_{0}$ where $\{(\bar{X}_{j}(N),\bar{\eta}_{j}^{M},S^{M}(\bar{\eta}_{j}^{M})),\,\,j\in\mathbb{N}_{0},i=1,\ldots,N\}$ are as defined in the context of $\mathbb{IPS}_{2}$ . Note that $(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})$ is a complete separable metric space with metric $d((x,\mu_{1},\mu_{3}),(y,\mu_{2},\mu_{4})):=\|x-y\|+\frac{1}{2}\mathcal{W}_{1}(\mu_{1},\mu_{2})+\frac{1}{2}\mathcal{W}_{1}\big{(}\mu_{3},\mu_{4}\big{)}$ where $\|x\|:=\frac{1}{N}\sum_{i=1}^{N}|x_{i}|$ for $x=(x_{1},\ldots,x_{N})\in(\mathbb{R}^{d})^{N}$ . From Lemma 5.1 and 5.2 it follows that, for each $N,M\geq 1,$ the sequence $\{\Theta_{n}^{N,M},n\geq 1\}$ is relatively compact (By Prohorov’s Theorem) and using Assumption 1 it is easy to see that any limit point $\Theta_{\infty}^{N,M}$ of $\Theta_{n}^{N,M}$ (as $n\to\infty$ ) is an invariant measure of the Markov chain $\{X_{n}(N),\bar{\eta}_{n}^{M},S^{M}(\bar{\eta}_{n}^{M})\}_{n\geq 0}$ and from Lemma 5.1 it satisfies $\int_{(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})}|x|\;\Theta_{\infty}^{N,M}(dx)<\infty$ (Taking the norm of the product space as $|(x,y,z)|=\|x\|+\frac{1}{2}\|y\|_{1}+\frac{1}{2}\|z\|_{1}$ where $(x,y,z)\in(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})$ ). Uniqueness of invariant measure can be proved by the following simple coupling argument (see for example [5]): Suppose $\Theta_{\infty}^{N,M}$ , $\tilde{\Theta}_{\infty}^{N,M}$ are two invariant measures that satisfy $\int_{(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})}|x|\;\Theta_{\infty}^{N,M}(dx)<\infty$ , $\int_{(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})}|x|\tilde{\Theta}_{\infty}^{N,M}(dx)<\infty$ .

Let $\big{(}X_{0}(N),\eta_{0}^{M},S^{M}(\eta_{0}^{M})\big{)}$ and $\big{(}\tilde{X}_{0}(N),\tilde{\eta}_{0}^{M},S^{M}(\tilde{\eta}_{0}^{M})\big{)}$ with probability laws $\Theta_{\infty}^{N,M}$ and $\tilde{\Theta}_{\infty}^{N,M}$ respectively be given on a common probability space and driven by the same noise sequence (i.e. a space on which an i.i.d. array of $\mathbb{R}^{m}$ valued random variables $\{\epsilon^{i}_{n},i=1,\ldots,N,n\geq 1\}$ with common probability law $\theta$ is defined, independent of $(X_{0}(N),\eta_{0}^{M},\tilde{X}_{0}(N),\tilde{\eta}_{0}^{M})$ ), with the following evolution equations.

[TABLE]

where recall $f_{\delta}(\cdot,\cdot,\cdot,x)=f(\cdot,\cdot,\cdot,x)+\frac{B(x)}{\delta}.$ Note that

[TABLE]

for any two arrays $\{X_{i}\}_{i=1}^{N}$ and $\{Y_{i}\}_{i=1}^{N}$ . Using the independence of the noise sequence along with (5.80) and Assumption 1 we have

[TABLE]

Now applying Assumption 4 (with calculations similar to those in (5.48),(5.50),(5.51)) the following inequality holds

[TABLE]

Note that (5.80) implies

[TABLE]

from which, using (5.82), the following holds

[TABLE]

We also have

[TABLE]

and after taking expectation

[TABLE]

Letting $A^{(M,N)}_{n+1}:=\frac{1}{N}\sum_{i=1}^{N}|X_{n+1}^{i}-\tilde{X}_{n+1}^{i}|+\mathcal{W}_{1}\big{(}\eta^{M}_{n+1},\tilde{\eta}^{M}_{n+1}\big{)}$ , we have the following recursion relation combining (5.81),(5.84) and (5.86)

[TABLE]

which is the same recursion as in (5.54). Now for the chosen $\delta,\alpha$ satisfying (5.57) there exists a $\theta\in(0,1)$ such that

[TABLE]

Also, since $\Theta_{\infty}^{N,M}$ and $\tilde{\Theta}_{\infty}^{N,M}$ are invariant distributions, for every $n\in\mathbb{N}_{0}$ , $\big{(}X_{n+1}(N),\eta_{n+1}^{M},S^{M}(\eta_{n+1}^{M})\big{)}$ is distributed as $\Theta_{\infty}^{N,M}$ and $\big{(}\tilde{X}_{n+1}(N),\tilde{\eta}_{n+1}^{M},S^{M}(\tilde{\eta}_{n+1}^{M})\big{)}$ is distributed as $\tilde{\Theta}_{\infty}^{N,M}$ . Thus

$(X_{n+1}(N),\eta_{n+1}^{M},S^{M}(\eta_{n+1}^{M}))$ and $\big{(}\tilde{X}_{n+1}(N),\tilde{\eta}_{n+1}^{M},S^{M}(\tilde{\eta}_{n+1}^{M})\big{)}$ define a coupling of random variables with laws $\Theta_{\infty}^{N,M}$ and $\tilde{\Theta}_{\infty}^{N,M}$ respectively. From (5.88) we then have

[TABLE]

as $n\to\infty.$ So there exists a unique invariant measure $\Theta_{\infty}^{N,M}\in\mathcal{P}_{1}\big{(}(\mathbb{R}^{d})^{N}\times\mathcal{P}_{1}^{*}(\mathbb{R}^{d})\times\mathcal{P}(\mathbb{R}^{d})\big{)}$ for this Markov chain and, as $n\to\infty$ ,

[TABLE]

This proves the first part of the theorem. Denote $\Theta^{N,M}_{\infty}\left(\cdot,\mathcal{P}^{*}_{1}(\mathbb{R}^{d}),\mathcal{P}(\mathbb{R}^{d})\right)$ by $\Theta^{1,N,M}_{\infty}$ and

$\Theta^{N,M}_{n}\left(\cdot,\mathcal{P}^{*}_{1}(\mathbb{R}^{d}),\mathcal{P}(\mathbb{R}^{d})\right)$ by $\Theta^{1,N,M}_{n}$ .

Define $r_{N}:(\mathbb{R}^{d})^{N}\to\mathcal{P}(\mathbb{R}^{d})$ as

[TABLE]

Let $\nu_{n}^{N,M}=\Theta_{n}^{1,N,M}\circ r_{N}^{-1}$ and $\nu_{\infty}^{N,M}=\Theta_{\infty}^{1,N,M}\circ r_{N}^{-1}$ . In order to prove that $\Theta_{\infty}^{1,N,M}$ is $\mu_{\infty}$ -chaotic, it suffices to argue that (cf. [16])

[TABLE]

We first argue that as $n\to\infty$

[TABLE]

It suffices to show that $\langle F,\nu_{n}^{N,M}\rangle\to\langle F,\nu_{\infty}^{N,M}\rangle$ for any continuous and bounded function $F:\mathcal{P}(\mathbb{R}^{d})\to\mathbb{R}$ . But this is immediate on observing that

[TABLE]

the continuity of the map $r_{N}$ and the weak convergence of $\Theta_{n}^{N,M}$ to $\Theta_{\infty}^{N,M}$ . Next, for any $f\in BL_{1}(\mathcal{P}(\mathbb{R}^{d}))$

[TABLE]

Fix $\epsilon>0$ . For every $N,M\in\mathbb{N}$ there exists $n_{0}(N,M)\in\mathbb{N}$ such that for all $n\geq n_{0}(N,M)$

[TABLE]

Thus for all $n,N,M\in\mathbb{N}$

[TABLE]

Finally

[TABLE]

where the first equality is from (5.91), the second uses (5.92) and the third is a consequence of Corollary 3.6. Since $\epsilon>0$ is arbitrary, we have (5.90) and the result follows.

$\square$
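The uniqueness part of the proof above rests on a synchronous coupling: two copies of the chain are started from different initial laws but driven by the same noise, so the noise cancels in the difference and only the contraction remains. The sketch below shows this mechanism on a toy scalar recursion $x\mapsto ax+\epsilon$ with $|a|<1$. This recursion, the function names and the parameter values are hypothetical stand-ins chosen for illustration; they are not the dynamics of $\mathbb{IPS}_{2}$.

```python
import random


def coupled_step(x, x_tilde, a, noise):
    """Advance both copies one step using the SAME noise realisation."""
    return a * x + noise, a * x_tilde + noise


def run_coupling(x0, x0_tilde, a, n_steps, seed=0):
    """Return the gaps |X_n - X~_n| along a synchronously coupled trajectory."""
    rng = random.Random(seed)
    x, x_tilde = x0, x0_tilde
    gaps = [abs(x - x_tilde)]
    for _ in range(n_steps):
        eps = rng.gauss(0.0, 1.0)  # shared noise for both copies
        x, x_tilde = coupled_step(x, x_tilde, a, eps)
        gaps.append(abs(x - x_tilde))
    return gaps


a = 0.5  # toy contraction factor (assumption)
gaps = run_coupling(x0=5.0, x0_tilde=-3.0, a=a, n_steps=40)
print(gaps[0], gaps[10], gaps[40])
```

In this toy case the gap after $n$ steps equals $a^{n}$ times the initial gap, up to rounding, regardless of the noise path. In the proof the same cancellation is combined with the two-step recursion shared with (5.54) rather than a one-step contraction.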

5.6 Proof of Concentration bounds:

5.6.1 Proof of Theorem 3 (a):

We start with the following lemma where we establish a concentration bound for $\mathcal{W}_{1}\big{(}(\bar{\mu}_{n}^{N},\bar{\eta}_{n}^{M}),\Psi(\bar{\mu}_{n-1}^{N},\bar{\eta}_{n-1}^{M})\big{)}$ for each fixed time $n\in\mathbb{N}$ and then combine it with the estimate in (5.74) in order to get the desired result.

Lemma 5.6

Let $N_{1}=\min\{M,N\}.$ Suppose Assumptions (1-4) and Assumptions (7),(8) hold for some $\tau>0$ . Suppose that $\delta\in(0,a(\tau)^{\frac{1}{1+\tau}})$ , and $(1-\alpha)l_{\tau}(P)<1.$ Then there exist $a_{1},a_{2},a_{3},a^{\prime}_{1},a^{\prime}_{2},a^{\prime}_{3}\in(0,\infty)$ such that for all $\epsilon,R>0,n\in\mathbb{N}$ , and $N_{1}\geq\max\{1,a_{1}(\frac{R}{\epsilon})^{d+2}\},$

[TABLE]

5.6.2 Proof of Lemma 5.6

The second concentration bound follows by proceeding as in Lemma 4.5 of [5]. The proof relies on the idea of restricting measures to a compact set and on estimates of metric entropy [2] (see also [17]). The basic idea is to first obtain a concentration bound for the $\mathcal{W}_{1}$ distance between the truncated law and its corresponding empirical law in a compact ball of radius $R$, and then to get a tail estimate from Lemma 5.2 and Corollary 5.3 after conditioning on $\mathcal{F}_{n-1}^{M,N}$ . With the notation introduced in Lemma 4.5 of [5] (for example, $\mu_{R}$ is the measure $\mu$ truncated to the ball $B_{R}(0)$ of radius $R$ ) we sketch the proof of the second bound. With that notation the truncated version of $\bar{\eta}_{n-1}^{M}$ is denoted by $\bar{\eta}_{n-1,R}^{M}$ . Suppose $\{Y_{n-1}^{i,M}:i=1,\ldots,M\}$ are iid from $\bar{\eta}_{n-1}^{M}$ conditioned on $\mathcal{F}_{n-1}^{M,N}$ and $\{Z_{i}^{M,R}:i=1,\ldots,M\}$ are iid from $\bar{\eta}_{n-1,R}^{M}$ conditioned on $\mathcal{F}_{n-1}^{M,N}.$ Define

[TABLE]

Note that $P(X_{n-1}^{i,M}\in A\mid\mathcal{F}_{n-1}^{M,N})=P(Z_{n-1}^{i,M}\in A\mid\mathcal{F}_{n-1}^{M,N}).$ Denote $S^{M}(\bar{\eta}^{M}_{n-1,R}):=\frac{1}{M}\sum_{i=1}^{M}\delta_{X_{n-1}^{i,M}}.$ Now denoting $a(1+\tau):=\sup_{n\geq 0}\sup_{M,N}E\left<|x|^{1+\tau},\bar{\eta}^{M}_{n}\right>$ , from (5.80) we have

[TABLE]

Now using Azuma Hoeffding inequality as done in display (4.35) of Lemma 4.5 in [5] one has

[TABLE]

From the definition of $\bar{\eta}_{n-1,R}^{M}$

[TABLE]

Using triangular inequality

[TABLE]

Combining (5.95),(5.96) and (5.97) the result (5.109) will follow.

The first one (5.108) follows by noting that

[TABLE]

Proceeding like Lemma 4.5 of [5] the bound for the first term in RHS of (5.98) can be established.

$\square$
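The truncation device in the proof of Lemma 5.6 can be sketched in one dimension. The code assumes that the truncated sample keeps $Y$ when $|Y|\leq R$ and replaces it by an independent draw $Z$ from the law restricted to the ball otherwise; this is consistent with the stated identity of the conditional laws of $X_{n-1}^{i,M}$ and $Z_{n-1}^{i,M}$, but the exact definition sits in a display not reproduced here. The Gaussian law, the helper names and the values of $R$ and $\tau$ are assumptions made only for illustration.

```python
import random


def sample_in_ball(rng, radius, sigma=2.0):
    """Draw from N(0, sigma^2) conditioned on |z| <= radius by rejection."""
    while True:
        z = rng.gauss(0.0, sigma)
        if abs(z) <= radius:
            return z


def truncate_samples(ys, zs, radius):
    """Keep y when it lies in the ball of radius R, otherwise substitute z."""
    return [y if abs(y) <= radius else z for y, z in zip(ys, zs)]


def empirical_tail(ys, radius):
    """Fraction of samples outside the ball of radius R."""
    return sum(1 for y in ys if abs(y) > radius) / len(ys)


def markov_bound(ys, radius, tau):
    """Markov bound: (empirical (1+tau)-th moment) / R^(1+tau)."""
    moment = sum(abs(y) ** (1.0 + tau) for y in ys) / len(ys)
    return moment / radius ** (1.0 + tau)


rng = random.Random(1)
radius, tau = 3.0, 1.0
ys = [rng.gauss(0.0, 2.0) for _ in range(2000)]
zs = [sample_in_ball(rng, radius) for _ in ys]
xs = truncate_samples(ys, zs, radius)

# Coupling cost between the truncated and the original empirical measures.
coupling_cost = sum(abs(x - y) for x, y in zip(xs, ys)) / len(ys)
tail_cost = sum((abs(y) + radius) for y in ys if abs(y) > radius) / len(ys)
print(empirical_tail(ys, radius), markov_bound(ys, radius, tau))
print(coupling_cost, tail_cost)
```

The pairing $(X_{i},Y_{i})$ is one admissible coupling of the two empirical measures, so `coupling_cost` dominates their $\mathcal{W}_{1}$ distance, and it is itself controlled by the mass outside the ball, which a $(1+\tau)$-th moment bound keeps small.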

5.6.3 Proof of Theorem 3(a)

Combining (5.74),(5.75) and (5.76) it follows that

[TABLE]

Denoting $c_{1}:=\max\left\{\left(\big{(}\|A\|+\delta\sigma(2+l_{PP^{\prime}}^{\nabla,\alpha})\big{)}+\alpha l(P^{\prime})\right),(1-\alpha)l(P)\right\}$ , $c_{2}:=\delta\sigma\max\big{\{}\alpha l_{P^{\prime}}^{\nabla},(1-\alpha)l_{P}^{\nabla}\big{\}}$ define the function $g_{0}(\cdot)$ as

[TABLE]

Since $g_{0}(0)=c_{2}+c_{1}-1<0$ (from the assumption), $g_{0}(1)=c_{2}>0$ and $g_{0}(\cdot)$ is continuous, there exists a $\gamma>0$ such that $g_{0}(\gamma)<0$ or equivalently

[TABLE]

So there exists a $\theta\in(0,1-\gamma)$ such that statement of Lemma 5.4 holds. Now using that $\gamma$ from (5.99) one has

[TABLE]

Let $\beta_{1}=\frac{\gamma\varepsilon}{2a},\,\,\beta_{2}=\frac{\gamma\varepsilon}{2bl_{P}^{\nabla}(1-\alpha)},\,\,\beta_{3}=\gamma\varepsilon.$ Note that $\nu:=\big{(}\frac{1-\gamma}{\theta}\big{)}>1$ by our choice of $\gamma$ . Therefore, denoting $\beta:=\min\{\beta_{1},\beta_{2}\},$ the condition $N_{1}\geq a_{1}\Big{(}\frac{R}{\beta}\Big{)}^{d+2}\vee 1$ implies $N_{1}\geq a_{1}\Big{(}\frac{R}{\beta\nu^{n}}\Big{)}^{d+2}\vee 1$ for all $n\in\mathbb{N}_{0}$ , and Lemma 5.6 gives

[TABLE]

Now proceeding as in the proof of Theorem 3.7 of [5] and optimizing the value of $R$, the conclusion will follow.

5.6.4 Proof of Theorem 3(b)

Second part regarding the exponential concentration bound will follow similarly (like Theorem 3.8 of [5]) under the following lemmas on uniform exponential integrability.

Lemma 5.7

Suppose Assumptions 9 and 10 hold. Suppose there exists $\alpha^{*}>0$ such that

[TABLE]

Then for all $\alpha_{1}\in[0,\min\big{\{}\alpha^{*},\frac{\alpha(\delta)}{\delta}\big{\}})$ and $\delta\in\Big{[}0,\frac{1-\|A\|}{(2+l^{\nabla,\alpha}_{PP^{\prime}})K}\Big{)}$ ,

[TABLE]

Proof. We will start by proving the second inequality. Note that from Corollary 5.3 the conditions for $\sup_{n\geq 0}\sup_{M,N\geq 1}E\left<e^{\alpha_{1}|x|},\bar{\eta}^{M}_{n}\right><\infty$ are the same as the conditions for $\sup_{n\geq 0}\sup_{N\geq 1}E\left<e^{\alpha_{1}|x|},\eta^{N}_{n}\right><\infty$ in $\mathbb{IPS}_{1}$, and from Lemma 5.2 they are again the same as the conditions for finiteness of $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\eta_{n}\right>.$ Note that

[TABLE]

Now from Assumption 10, using the Lipschitz property $|h_{1}(x)|\leq l_{h_{1}}|x|+|h_{1}(0)|$ one has $\left<\mu_{k}P^{\prime}P^{i},e^{\alpha_{1}|x|}\right>\leq e^{\alpha_{1}|h_{1}(0)|+h_{2}(\alpha)}\left<\mu_{k}P^{\prime}P^{i-1},e^{\alpha_{1}l_{h_{1}}|x|}\right>+e^{h_{3}(\alpha_{1})+h_{2}(\alpha_{1})}.$ So we have the following upper bound for $\big{<}\mu P^{\prime}P^{i},e^{\alpha_{1}|x|}\big{>}$:

[TABLE]

The last inequality follows since $h_{2}(\cdot),h_{3}(\cdot)$ are non-decreasing and $l_{h_{1}}\leq 1.$ Using (5.102) under the condition $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\mu_{n}\right><\infty$ (which we prove shortly) we conclude that $\sup_{k\geq 0}\left<\eta_{k+1},e^{\alpha_{1}|x|}\right><\infty$ or equivalently $\sum_{i=0}^{\infty}(1-\alpha)^{i}e^{i\big{[}h_{2}(\alpha)+\alpha_{1}|h_{1}(0)|\big{]}}<\infty$ if there exists an $\alpha_{1}$ such that $\alpha_{1}|h_{1}(0)|+h_{2}(\alpha_{1})+\log(1-\alpha)<0.$ Since $g(\alpha_{1}):=\alpha_{1}|h_{1}(0)|+h_{2}(\alpha_{1})$ is an increasing function of $\alpha_{1}$ with $g(0)=0$, from the definition of $\alpha^{*}$ we can always find $0<\alpha_{1}<\alpha^{*}$ such that $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\eta_{n}\right><\infty.$

Now we prove $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\mu_{n}\right><\infty$ or equivalently the first term in (5.101). Note that from (5.1) for $n\geq 1$

[TABLE]

Now from the choice $\alpha_{1}\leq\frac{\alpha(\delta)}{\delta},$ exponentiating and taking expectation gives

[TABLE]

where $\mathcal{E}_{1}(\alpha_{1})=e^{\alpha_{1}\delta Kc_{PP^{\prime}}^{\alpha}}\int e^{\alpha_{1}\delta\big{(}A_{2}(z)+\frac{|B(z)|}{\delta}\big{)}}\theta(dz).$ We note that from Assumption 10 there always exist $\alpha^{**}<\frac{\alpha(\delta)}{\delta},\quad c_{3}$ such that for all $\alpha_{1}\in(0,\alpha^{**})$

[TABLE]

Using conditioning argument we have

[TABLE]

where (5.105) follows from exchangeability of $\{X_{n}^{i,N}\}_{i=1,\ldots,N}$ . Observing $\|\mu_{n}^{N}\|_{1}=\int|x|\mu_{n}^{N}(dx)$ and using Jensen’s inequality applied to the function $x\to e^{\alpha_{1}\delta Kx},$ we have after taking expectation

[TABLE]

Since $f_{1}(x):=e^{\alpha_{1}\delta Kx}$ and $f_{2}(x):=e^{\alpha_{1}x\big{[}\|A\|+\delta K\big{(}1+l^{\nabla,\alpha}_{PP^{\prime}}\big{)}\big{]}}$ are both non-decreasing, putting $\mu=\mu_{n}^{N}$ almost surely in the inequality $\int f_{1}(x)f_{2}(x)\mu(dx)\geq\int f_{1}(x)\mu(dx)\int f_{2}(y)\mu(dy)$ and taking expectation we have

[TABLE]

From our choice of $\delta,$ $\kappa:=\|A\|+\delta K\big{(}2+l^{\nabla,\alpha}_{PP^{\prime}}\big{)}\in(0,1).$ Denoting $F_{n+1}(\alpha_{1}):=Ee^{\alpha_{1}|X_{n+1}^{i}|}$ from (5.103) we have the following recursive inequality:

[TABLE]

Iterating the above inequality we have for all $n\geq 1$

[TABLE]

where the second inequality is a consequence of (5.104).

Note further for the system in (2.4) let $\{X_{n}\}_{n\in\mathbb{N}_{0}}$ be defined as the random variables with laws $\mathcal{L}(X_{n}):=\mu_{n}$ for $n\in\mathbb{N}_{0}.$ Then starting similarly from

[TABLE]

using the inequality $\int f_{1}(x)f_{2}(x)\mu(dx)\geq\int f_{1}(x)\mu(dx)\int f_{2}(y)\mu(dy)$ (similar to Lemma 4.11 of [5]) one can prove

[TABLE]

under same conditions on $\delta,\alpha_{1}.$ This is needed for proving $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\eta_{n}\right><\infty.$ The result follows.

$\square$
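The proof of Lemma 5.7 uses the inequality $\int f_{1}f_{2}\,d\mu\geq\int f_{1}\,d\mu\int f_{2}\,d\mu$ for two non-decreasing functions on the real line (Chebyshev's association inequality). The sketch below checks it on an empirical measure with two increasing exponentials. The exponents `0.3` and `0.5` and the half-normal sample are hypothetical stand-ins for $\alpha_{1}\delta K$, $\alpha_{1}\big[\|A\|+\delta K(1+l^{\nabla,\alpha}_{PP^{\prime}})\big]$ and $\|\mu_{n}^{N}\|_{1}$; they are not values taken from the paper.

```python
import math
import random


def mean(values):
    """Arithmetic mean, i.e. the integral against the empirical measure."""
    return sum(values) / len(values)


def association_gap(samples, f1, f2):
    """Return int f1*f2 dmu - (int f1 dmu)(int f2 dmu) for the empirical measure."""
    joint = mean([f1(x) * f2(x) for x in samples])
    product = mean([f1(x) for x in samples]) * mean([f2(x) for x in samples])
    return joint - product


def f1(x):
    return math.exp(0.3 * x)  # non-decreasing in x


def f2(x):
    return math.exp(0.5 * x)  # non-decreasing in x


rng = random.Random(2)
samples = [abs(rng.gauss(0.0, 1.0)) for _ in range(500)]

gap = association_gap(samples, f1, f2)
print(gap)  # nonnegative for comonotone f1, f2
```

For the empirical measure the gap is the sample covariance of $f_{1}(X)$ and $f_{2}(X)$, which is nonnegative because both functions order the sample points in the same way. It vanishes when the measure is a point mass.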

Lemma 5.8

Then there exist $a_{1},a_{2},a_{3},a^{\prime}_{1},a^{\prime}_{2},a^{\prime}_{3}\in(0,\infty)$ such that for all $\epsilon,R>0$ and $n\in\mathbb{N}$ , and $N_{1}\geq\max\{1,\tilde{a}_{1}(\frac{R}{\epsilon})^{d+2}\}$

[TABLE]

5.6.5 Proof of Lemma 5.8:

Follows from similar decompositions given in Lemma 5.6 and Lemma 4.7 of [5]. $\square$

5.6.6 Proof of Theorem 3(b):

Starting from (5.99), the conclusion will follow by applying Lemma 5.8 in (5.6.3).

$\square$

5.7 Proof of Theorem 4

We will start by introducing a coupling. Consider a system of $\mathbb{R}^{d}$ valued auxiliary random variables $\{Y_{n}^{i,N},i=1,\ldots,N\}_{n\geq 0}$ defined as follows.

[TABLE]

Now for each $n\in\mathbb{N},$ $\{Y_{n}^{i,N},i=1,\ldots,N\}$ is a set of $\mathbb{R}^{d}$ valued iid random variables under the initial assumption $\mathcal{L}(\{X^{i,N}_{0}\}_{i=1,\ldots,N})=\mu^{\otimes N}_{0}.$ Let $\zeta_{n}^{N}:=\frac{1}{N}\sum_{i=1}^{N}\delta_{Y^{i,N}_{n}}$ . The following lemma connects $\zeta_{n}^{N}$ and $\mu_{n}^{N}$ .

Lemma 5.9

(Coupling with the auxiliary system) Suppose Assumptions 1, 4, 5 and 9 hold. Then for every $n\geq 0$ and $N\geq 1,$ with $C_{1}$ and $\chi_{1}$ as defined in (3.19) and (3.20),

[TABLE]

Proof. By Assumption 1 and $A_{1}(\epsilon)\leq K$ , we have for each $j=1,\ldots,N$

[TABLE]

Using the calculations in (5.46),(5.48),(5.49) and (5.51)

[TABLE]

Thus

[TABLE]

Using (5.112) as the recursion on $a^{j}_{n+1}:=|X_{n+1}^{j,N}-Y_{n+1}^{j,N}|$ with $a^{j}_{0}=0,$ we get

[TABLE]

Denote $\|A\|+\delta K(1+l^{\nabla,\alpha}_{PP^{\prime}})$ by $\chi$ . Observe that

[TABLE]

Denote the quantity in the third bracket of RHS of (5.113) by $b_{k}.$ Using (5.114) and $\eta_{0}^{N}=\eta_{0}$ we have

[TABLE]

where $c_{4}:=\max\{1,(1-\alpha)l^{\nabla}_{P}\alpha l(P^{\prime})\}$ and $c_{5}:=\max\{\alpha l^{\nabla}_{P^{\prime}},(1-\alpha)l(P)\}.$ Thus from (5.113) we have

[TABLE]

Now applying Lemma A.3 we have

[TABLE]

where $\chi_{2}:=\max\{\chi,c_{5}\}$ and $c_{7}:=\frac{c_{4}}{|\chi-c_{5}|}.$ Note that from (5.80) we have for all $n\geq 0,$

[TABLE]

Combining the result above and using triangle inequality in (5.117)

[TABLE]

Applying Lemma A.3 with

[TABLE]

we have

[TABLE]

Simplifying (5.118) one gets

[TABLE]

Note that $\delta Kc_{7}\chi_{2}=C_{1}$ and $\chi_{2}+C_{1}=\chi_{1}$ as defined in (3.19) (3.20) respectively. Thus we have

[TABLE]

The result now follows by an application of triangle inequality. $\square$

5.7.1 Proof of Theorem 4

Since $\chi_{1}<1,$ we can find $\gamma>0$ such that $\chi_{1}<1-\gamma.$ With this $\gamma,$ we have $\nu_{1}:=\frac{1-\gamma}{\chi_{1}}>1$ . For any ${\varepsilon}>0$ , from Lemma 4

[TABLE]

where $i_{\varepsilon}:=\max\{i\geq 0:\frac{\gamma\chi_{1}\varepsilon}{C_{1}}\nu^{i}<1\}.$ Note that for $\delta\in\Big{[}0,\frac{1-\|A\|}{(2+l^{\nabla,\alpha}_{PP^{\prime}})K}\Big{)}$ and $\alpha_{1}\in(0,\frac{\alpha(\delta)}{\delta}),$ from (5.107) we have $\sup_{n\geq 0}\left<e^{\alpha_{1}|x|},\mu_{n}\right><\infty.$ By Theorem 2 of [10], this implies that for all $N>0,$

[TABLE]

where $a(N,\varepsilon)=e^{-cN\varepsilon^{2}}1_{\{d=1\}}+e^{-cN\big{(}\frac{\varepsilon}{\log(2+\frac{1}{\varepsilon})}\big{)}^{2}}1_{\{d=2\}}+e^{-cN\varepsilon^{d}}1_{\{d>2\}}$ and $b(N,\varepsilon)=e^{-cN\varepsilon}.$ To prove (3.21) we treat only the case $d>2;$ the remaining cases follow similarly. There exist $C^{\prime}_{1},C^{\prime}_{2},C^{\prime}_{3}$ such that

[TABLE]
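The rate functions $a(N,\varepsilon)$ and $b(N,\varepsilon)$ defined above can be evaluated numerically to see how the three dimension regimes compare. The following is a minimal sketch. The function names `rate_a` and `rate_b` are hypothetical, and the constant `c` is left as a free parameter because its value is not specified in the text.

```python
import math


def rate_a(N, eps, d, c):
    """Evaluate a(N, eps) for dimension d; c > 0 is an unspecified constant (assumption)."""
    if d == 1:
        return math.exp(-c * N * eps ** 2)
    if d == 2:
        # Logarithmic correction that appears only in dimension two.
        return math.exp(-c * N * (eps / math.log(2.0 + 1.0 / eps)) ** 2)
    # Case d > 2.
    return math.exp(-c * N * eps ** d)


def rate_b(N, eps, c):
    """Evaluate b(N, eps) = exp(-c * N * eps)."""
    return math.exp(-c * N * eps)
```

For $\varepsilon<1$ and $d>2$ one has $\varepsilon^{d}<\varepsilon$, so `rate_a` decays more slowly in $N$ than `rate_b`; this is consistent with the appearance of $\max\{\frac{1}{\varepsilon},\frac{1}{\varepsilon^{d}}\}$ in the final threshold on $N$.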

Let $k_{0}$ be such that $\nu^{i}\geq k_{0}i$ for all $i\geq 1.$ Combining (5.120), (5.121) and (5.122) we have, for all $N>1$ and with $a^{\prime\prime}_{2}=k_{0}\min\{C^{\prime}_{1},C^{\prime}_{2},C^{\prime}_{3}\},$

[TABLE]

Now set $N_{3}:=-\frac{1}{a^{\prime\prime}_{2}}\log(1-\frac{1}{a^{\prime\prime}_{1}}).$ Then for $N\geq N_{3}\max\{\frac{1}{{\varepsilon}},\frac{1}{{\varepsilon}^{d}}\}$ we have

[TABLE]

$\square$
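The cutoff index $i_{\varepsilon}$ used in the proof above can be computed directly from its definition. The sketch below is illustrative only: the function name `i_epsilon` is hypothetical, and it assumes that $\nu$ in the definition of $i_{\varepsilon}$ denotes $\nu_{1}=\frac{1-\gamma}{\chi_{1}}$.

```python
def i_epsilon(gamma, chi1, c1, eps):
    """Largest i >= 0 with (gamma * chi1 * eps / c1) * nu**i < 1.

    Assumes nu = (1 - gamma) / chi1 > 1 (an assumption about the notation).
    Returns None when no such i exists, i.e. when the prefactor is already >= 1.
    """
    if not (0 < chi1 < 1 - gamma < 1):
        raise ValueError("need 0 < chi1 < 1 - gamma < 1")
    nu = (1.0 - gamma) / chi1
    base = gamma * chi1 * eps / c1
    if base >= 1.0:
        return None
    i = 0
    # nu > 1, so base * nu**i is increasing in i and the loop terminates.
    while base * nu ** (i + 1) < 1.0:
        i += 1
    return i
```

Since $\nu>1$, the index grows like $\log(1/\varepsilon)/\log\nu$ as $\varepsilon\to 0$, so smaller tolerances correspond to more terms in the sum over $i$.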

6 Acknowledgements

Part of this article was part of the author’s PhD thesis. The author is thankful to Prof. Amarjit Budhiraja for his comments on an earlier version of the manuscript.

Appendix

The first part of the following lemma is an immediate consequence of the Ascoli-Arzela theorem, whereas the second follows from Lemma 5 in [7].

Lemma A.1

(a) For a compact set $K$ in $\mathbb{R}^{d}$ let $\mathcal{F}_{a,b}(K)$ be the space of functions $f:K\to\mathbb{R}$ such that $\sup_{x\in K}|f(x)|\leq a$ and $|f(x)-f(y)|\leq b|x-y|$ for all $x,y\in K$ . Then for any $\epsilon>0$ there is a finite subset $\mathcal{F}_{a,b}^{\epsilon}(K)$ of $\mathcal{F}_{a,b}(K)$ such that for any signed measure $\mu$

[TABLE]

The next lemma is straightforward.

Lemma A.2

Let $P:\mathbb{R}^{d}\times\mathcal{B}(\mathbb{R}^{d})\to[0,1]$ be a transition probability kernel. Fix $N\geq 1$ and let $y_{1},y_{2},...,y_{N}\in\mathbb{R}^{d}$ . Let $X_{1},X_{2},...,X_{N}$ be independent random variables such that $\mathcal{L}(X_{i})=\delta_{y_{i}}P.$ Let $f\in BM(\mathbb{R}^{d})$ and let $m_{0}^{N}=\frac{1}{N}\sum_{i=1}^{N}\delta_{y_{i}}$ , $m_{1}^{N}=\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{i}}$ . Then

[TABLE]

The following is a discrete version of Gronwall’s lemma.

Lemma A.3

(a)

Let $\{a_{i}\}_{i=0}^{\infty},\{b_{i}\}_{i=0}^{\infty},\{p_{i}\}_{i=0}^{\infty}$ be non-negative sequences. Suppose that

[TABLE]

Then

[TABLE]

(b)

Let $a,b>0$ and let $\{C_{i}\}_{i\geq 0}$ be a non-negative sequence. Then for all $n\geq 0$

[TABLE]
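As a numerical illustration of a discrete Gronwall estimate, the sketch below uses one standard form of the inequality, which may differ in detail from the statement of Lemma A.3(a): if $a_{n}\leq b_{n}+\sum_{k<n}p_{k}a_{k}$ for non-negative sequences, then $a_{n}\leq b_{n}+\sum_{k<n}p_{k}b_{k}\prod_{k<j<n}(1+p_{j})$. The function names are hypothetical.

```python
def gronwall_bound(b, p):
    """Return the bounds b[n] + sum_{k<n} p[k] * b[k] * prod_{k<j<n} (1 + p[j])."""
    bounds = []
    s = 0.0  # running value of the sum term for the current n
    for n in range(len(b)):
        bounds.append(b[n] + s)
        # Move from n to n + 1: existing terms gain a factor (1 + p[n]),
        # and a new term p[n] * b[n] with an empty product is added.
        s = (1.0 + p[n]) * s + p[n] * b[n]
    return bounds


def extremal_sequence(b, p):
    """Return the sequence with equality a[n] = b[n] + sum_{k<n} p[k] * a[k]."""
    a = []
    acc = 0.0  # running value of sum_{k<n} p[k] * a[k]
    for n in range(len(b)):
        a.append(b[n] + acc)
        acc += p[n] * a[n]
    return a
```

The extremal sequence attains the bound exactly, which shows that this form of the estimate cannot be improved without further assumptions on the sequences.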

Bibliography

[1] Emmanuel Boissard, Thibaut Le Gouic, et al. On the mean speed of convergence of empirical and occupation measures in Wasserstein distance. Annales de l’Institut Henri Poincaré, Probabilités et Statistiques, 50(2):539–563, 2014.
[2] François Bolley, Arnaud Guillin, and Cédric Villani. Quantitative concentration inequalities for empirical measures on non-compact spaces. Probability Theory and Related Fields, 137(3-4):541–593, 2007.
[3] Amarjit Budhiraja, Pierre Del Moral, Sylvain Rubenthaler, et al. Discrete time Markovian agents interacting through a potential. ESAIM: Probability and Statistics, 2011.
[4] Amarjit Budhiraja and Wai-Tong Louis Fan. Uniform in time interacting particle approximations for nonlinear equations of Patlak-Keller-Segel type. arXiv preprint arXiv:1604.08668, 2016.
[5] Amarjit Budhiraja and Abhishek Pal Majumder. Long time results for a weakly interacting particle system in discrete time. Stochastic Analysis and Applications, 33(3):429–463, 2015.
[6] François Caron, Pierre Del Moral, Arnaud Doucet, Michele Pace, et al. Particle approximations of a class of branching distribution flows arising in multi-target tracking. SIAM Journal on Control and Optimization, 49(4):1766–1792, 2011.
[7] Giacomo Como and Fabio Fagnani. Scaling limits for continuous opinion dynamics systems. volume 21, pages 1537–1567. Institute of Mathematical Statistics, 2011.
[8] Pierre Del Moral and Emmanuel Rio. Concentration inequalities for mean field particle models. The Annals of Applied Probability, 21(3):1017–1052, 2011.