Solution to Zero-Sum Differential Game with Fractional Dynamics via   Approximations

Mikhail Gomoyunov

arXiv:1902.02951·math.OC·August 6, 2019

Solution to Zero-Sum Differential Game with Fractional Dynamics via Approximations

Mikhail Gomoyunov

PDF

TL;DR

This paper proves that a zero-sum differential game with fractional dynamics has a well-defined value by approximating it with classical differential games and demonstrates the convergence of their values.

Contribution

It introduces a method to establish the value of a fractional differential game through approximation by retarded-type differential games and constructs optimal feedback controls.

Findings

01

The game has a well-defined value.

02

Approximate game values converge to the original game value.

03

Optimal feedback controls are constructed based on the approximations.

Abstract

The paper deals with a zero-sum differential game in which the dynamical system is described by a fractional differential equation with the Caputo derivative of an order $α \in (0, 1) .$ The goal of the first (second) player is to minimize (maximize) the value of a given quality index. The main contribution of the paper is the proof of the fact that this differential game has the value, i.e., the lower and upper game values coincide. The proof is based on the appropriate approximation of the game by a zero-sum differential game in which the dynamical system is described by a first order functional differential equation of a retarded type. It is shown that the values of the approximating differential games have a limit, and this limit is the value of the original game. Moreover, the optimal players' feedback control procedures are proposed that use the optimally controlled…

Equations182

∥ x (\cdot) ∥_{\infty} = t \in [t_{0}, ϑ] ess sup ∥ x (t) ∥.

∥ x (\cdot) ∥_{\infty} = t \in [t_{0}, ϑ] ess sup ∥ x (t) ∥.

\begin{array}[]{rcl}(I^{\alpha}x)(t)&=&\displaystyle\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{t}\frac{x(\tau)}{(t-\tau)^{1-\alpha}}d\tau,\\[11.99998pt] (D^{\alpha}x)(t)&=&\displaystyle\frac{d}{dt}(I^{1-\alpha}x)(t)=\frac{1}{\Gamma(1-\alpha)}\frac{d}{dt}\int_{t_{0}}^{t}\frac{x(\tau)}{(t-\tau)^{\alpha}}d\tau,\quad t\in[t_{0},\vartheta],\end{array}

\begin{array}[]{rcl}(I^{\alpha}x)(t)&=&\displaystyle\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{t}\frac{x(\tau)}{(t-\tau)^{1-\alpha}}d\tau,\\[11.99998pt] (D^{\alpha}x)(t)&=&\displaystyle\frac{d}{dt}(I^{1-\alpha}x)(t)=\frac{1}{\Gamma(1-\alpha)}\frac{d}{dt}\int_{t_{0}}^{t}\frac{x(\tau)}{(t-\tau)^{\alpha}}d\tau,\quad t\in[t_{0},\vartheta],\end{array}

∥ x (t) - x (t^{'}) ∥ \leq H ∥ (D^{α} x) (\cdot) ∥_{\infty} ∣ t - t^{'} ∣^{α}, t, t^{'} \in [t_{0}, ϑ] .

∥ x (t) - x (t^{'}) ∥ \leq H ∥ (D^{α} x) (\cdot) ∥_{\infty} ∣ t - t^{'} ∣^{α}, t, t^{'} \in [t_{0}, ϑ] .

x (t) = (D^{1 - α} y) (t) = \frac{1}{Γ ( α )} \int_{t_{0}}^{t} \frac{y ˙ ( τ )}{( t - τ ) ^{1 - α}} d τ, t \in [t_{0}, ϑ] .

x (t) = (D^{1 - α} y) (t) = \frac{1}{Γ ( α )} \int_{t_{0}}^{t} \frac{y ˙ ( τ )}{( t - τ ) ^{1 - α}} d τ, t \in [t_{0}, ϑ] .

({}^{C}D^{\alpha}x)(t)=\big{(}D^{\alpha}(x(\cdot)-x(t_{0}))\big{)}(t),\quad t\in[t_{0},\vartheta].

({}^{C}D^{\alpha}x)(t)=\big{(}D^{\alpha}(x(\cdot)-x(t_{0}))\big{)}(t),\quad t\in[t_{0},\vartheta].

\begin{array}[]{c}(^{C}D^{\alpha}x)(t)=f(t,x(t),u(t),v(t)),\quad t\in[t_{0},\vartheta],\\[5.0pt] x(t)\in\mathbb{R}^{n},\quad u(t)\in\mathbb{U},\quad v(t)\in\mathbb{V}.\end{array}

\begin{array}[]{c}(^{C}D^{\alpha}x)(t)=f(t,x(t),u(t),v(t)),\quad t\in[t_{0},\vartheta],\\[5.0pt] x(t)\in\mathbb{R}^{n},\quad u(t)\in\mathbb{U},\quad v(t)\in\mathbb{V}.\end{array}

∥ f (t, x, u, v) - f (t, x^{'}, u, v) ∥ \leq λ ∥ x - x^{'} ∥

∥ f (t, x, u, v) - f (t, x^{'}, u, v) ∥ \leq λ ∥ x - x^{'} ∥

∥ f (t, x, u, v) ∥ \leq (1 + ∥ x ∥) c

∥ f (t, x, u, v) ∥ \leq (1 + ∥ x ∥) c

u \in U min v \in V max ⟨ s, f (t, x, u, v)⟩ = v \in V max u \in U min ⟨ s, f (t, x, u, v)⟩

u \in U min v \in V max ⟨ s, f (t, x, u, v)⟩ = v \in V max u \in U min ⟨ s, f (t, x, u, v)⟩

\begin{array}[]{l}w(t_{0})\in B(R_{0}),\\[5.0pt] w(\cdot)\in\{w(t_{0})\}+I^{\alpha}(L^{\infty}([t_{0},t],\mathbb{R}^{n})),\\[5.0pt] \|(^{C}D^{\alpha}w)(\tau)\|\leq(1+\|w(\tau)\|)c\text{ for a.e. }\tau\in[t_{0},t],\end{array}

\begin{array}[]{l}w(t_{0})\in B(R_{0}),\\[5.0pt] w(\cdot)\in\{w(t_{0})\}+I^{\alpha}(L^{\infty}([t_{0},t],\mathbb{R}^{n})),\\[5.0pt] \|(^{C}D^{\alpha}w)(\tau)\|\leq(1+\|w(\tau)\|)c\text{ for a.e. }\tau\in[t_{0},t],\end{array}

\begin{array}[]{l}\|w(\tau)\|\leq R_{1},\quad\tau\in[t_{0},t],\\[5.0pt] \|(^{C}D^{\alpha}w)(\tau)\|\leq M_{1}\text{ for a.e. }\tau\in[t_{0},t],\\[5.0pt] \|w(\tau)-w(\tau^{\prime})\|\leq H_{1}|\tau-\tau^{\prime}|^{\alpha},\quad\tau,\tau^{\prime}\in[t_{0},t].\end{array}

\begin{array}[]{l}\|w(\tau)\|\leq R_{1},\quad\tau\in[t_{0},t],\\[5.0pt] \|(^{C}D^{\alpha}w)(\tau)\|\leq M_{1}\text{ for a.e. }\tau\in[t_{0},t],\\[5.0pt] \|w(\tau)-w(\tau^{\prime})\|\leq H_{1}|\tau-\tau^{\prime}|^{\alpha},\quad\tau,\tau^{\prime}\in[t_{0},t].\end{array}

R_{1} = (1 + R_{0}) E_{α} ((ϑ - t_{0})^{α} c) - 1, M_{1} = (1 + R_{1}) c, H_{1} = H M_{1},

R_{1} = (1 + R_{0}) E_{α} ((ϑ - t_{0})^{α} c) - 1, M_{1} = (1 + R_{1}) c, H_{1} = H M_{1},

\|w(\tau)-w(t_{0})\|=\bigg{\|}\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{\tau}\frac{(^{C}D^{\alpha}w)(\xi)}{(\tau-\xi)^{1-\alpha}}d\xi\bigg{\|}\leq\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{\tau}\frac{(1+\|w(\xi)\|)c}{(\tau-\xi)^{1-\alpha}}d\xi

\|w(\tau)-w(t_{0})\|=\bigg{\|}\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{\tau}\frac{(^{C}D^{\alpha}w)(\xi)}{(\tau-\xi)^{1-\alpha}}d\xi\bigg{\|}\leq\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{\tau}\frac{(1+\|w(\xi)\|)c}{(\tau-\xi)^{1-\alpha}}d\xi

∥ w (τ) ∥ \leq R_{0} + \frac{c}{Γ ( α )} \int_{t_{0}}^{τ} \frac{1 + ∥ w ( ξ ) ∥}{( τ - ξ ) ^{1 - α}} d ξ, τ \in [t_{0}, t] .

∥ w (τ) ∥ \leq R_{0} + \frac{c}{Γ ( α )} \int_{t_{0}}^{τ} \frac{1 + ∥ w ( ξ ) ∥}{( τ - ξ ) ^{1 - α}} d ξ, τ \in [t_{0}, t] .

∥ (^{C} D^{α} w) (τ) ∥ \leq (1 + ∥ w (τ) ∥) c \leq (1 + R_{1}) c = M_{1} for a.e. τ \in [t_{0}, t] .

∥ (^{C} D^{α} w) (τ) ∥ \leq (1 + ∥ w (τ) ∥) c \leq (1 + R_{1}) c = M_{1} for a.e. τ \in [t_{0}, t] .

∥ w (τ) - w (τ^{'}) ∥ \leq H M_{1} ∣ τ - τ^{'} ∣^{α} = H_{1} ∣ τ - τ^{'} ∣^{α}, τ, τ^{'} \in [t_{0}, t] .

∥ w (τ) - w (τ^{'}) ∥ \leq H M_{1} ∣ τ - τ^{'} ∣^{α} = H_{1} ∣ τ - τ^{'} ∣^{α}, τ, τ^{'} \in [t_{0}, t] .

x (t) = w_{*} (t), t \in [t_{0}, t_{*}],

x (t) = w_{*} (t), t \in [t_{0}, t_{*}],

x_{t} (τ) = x (τ), τ \in [t_{0}, t] .

x_{t} (τ) = x (τ), τ \in [t_{0}, t] .

\begin{array}[]{l}\displaystyle x(t)=w_{\ast}(t_{0})+\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{t_{\ast}}\frac{(^{C}D^{\alpha}w_{\ast})(\tau)}{(t-\tau)^{1-\alpha}}d\tau\\[11.99998pt] \displaystyle+\frac{1}{\Gamma(\alpha)}\int_{t_{\ast}}^{t}\frac{f(\tau,x(\tau),u(\tau),v(\tau))}{(t-\tau)^{1-\alpha}}d\tau,\quad t\in[t_{\ast},t^{\ast}].\end{array}

\begin{array}[]{l}\displaystyle x(t)=w_{\ast}(t_{0})+\frac{1}{\Gamma(\alpha)}\int_{t_{0}}^{t_{\ast}}\frac{(^{C}D^{\alpha}w_{\ast})(\tau)}{(t-\tau)^{1-\alpha}}d\tau\\[11.99998pt] \displaystyle+\frac{1}{\Gamma(\alpha)}\int_{t_{\ast}}^{t}\frac{f(\tau,x(\tau),u(\tau),v(\tau))}{(t-\tau)^{1-\alpha}}d\tau,\quad t\in[t_{\ast},t^{\ast}].\end{array}

∥ x (t) ∥ \leq R_{1}, ∥ x (t) - x (t^{'}) ∥ \leq H_{1} ∣ t - t^{'} ∣^{α}, t, t^{'} \in [t_{0}, t^{*}],

∥ x (t) ∥ \leq R_{1}, ∥ x (t) - x (t^{'}) ∥ \leq H_{1} ∣ t - t^{'} ∣^{α}, t, t^{'} \in [t_{0}, t^{*}],

u^{**} (t) = {u (t), u^{*} (t), t \in [t_{*}, t^{*}), t \in [t^{*}, t^{**}), v^{**} (t) = {v (t), v^{*} (t), t \in [t_{*}, t^{*}), t \in [t^{*}, t^{**}) .

u^{**} (t) = {u (t), u^{*} (t), t \in [t_{*}, t^{*}), t \in [t^{*}, t^{**}), v^{**} (t) = {v (t), v^{*} (t), t \in [t_{*}, t^{*}), t \in [t^{*}, t^{**}) .

γ = σ (x (\cdot)) .

γ = σ (x (\cdot)) .

ρ^{(u)} (t_{*}, w_{*} (\cdot)) = α in f v (\cdot) \in V (t_{*}, ϑ) sup γ,

ρ^{(u)} (t_{*}, w_{*} (\cdot)) = α in f v (\cdot) \in V (t_{*}, ϑ) sup γ,

ρ^{(v)} (t_{*}, w_{*} (\cdot)) = β sup u (\cdot) \in U (t_{*}, ϑ) in f γ .

ρ^{(v)} (t_{*}, w_{*} (\cdot)) = β sup u (\cdot) \in U (t_{*}, ϑ) in f γ .

ρ (t_{*}, w_{*} (\cdot)) = ρ^{(u)} (t_{*}, w_{*} (\cdot)) = ρ^{(v)} (t_{*}, w_{*} (\cdot)), (t_{*}, w_{*} (\cdot)) \in G_{*} .

ρ (t_{*}, w_{*} (\cdot)) = ρ^{(u)} (t_{*}, w_{*} (\cdot)) = ρ^{(v)} (t_{*}, w_{*} (\cdot)), (t_{*}, w_{*} (\cdot)) \in G_{*} .

y(t)=\big{(}I^{1-\alpha}(x(\cdot)-w_{\ast}(t_{0}))\big{)}(t),\quad t\in[t_{0},\vartheta].

y(t)=\big{(}I^{1-\alpha}(x(\cdot)-w_{\ast}(t_{0}))\big{)}(t),\quad t\in[t_{0},\vartheta].

\begin{array}[]{l}y(\cdot)\in{\operatorname{Lip}}^{0}([t_{0},\vartheta],\mathbb{R}^{n}),\\[5.0pt] \dot{y}(t)=(^{C}D^{\alpha}x)(t)\text{ for a.e. }t\in[t_{0},\vartheta],\\[5.0pt] x(t)=w_{\ast}(t_{0})+(D^{1-\alpha}y)(t),\quad t\in[t_{0},\vartheta].\end{array}

\begin{array}[]{l}y(\cdot)\in{\operatorname{Lip}}^{0}([t_{0},\vartheta],\mathbb{R}^{n}),\\[5.0pt] \dot{y}(t)=(^{C}D^{\alpha}x)(t)\text{ for a.e. }t\in[t_{0},\vartheta],\\[5.0pt] x(t)=w_{\ast}(t_{0})+(D^{1-\alpha}y)(t),\quad t\in[t_{0},\vartheta].\end{array}

\dot{y}(t)=f\big{(}t,w_{\ast}(t_{0})+(D^{1-\alpha}y)(t),u(t),v(t)\big{)},\quad t\in[t_{\ast},\vartheta],

\dot{y}(t)=f\big{(}t,w_{\ast}(t_{0})+(D^{1-\alpha}y)(t),u(t),v(t)\big{)},\quad t\in[t_{\ast},\vartheta],

y(t)=\big{(}I^{1-\alpha}(w_{\ast}(\cdot)-w_{\ast}(t_{0}))\big{)}(t),\quad t\in[t_{0},t_{\ast}],

y(t)=\big{(}I^{1-\alpha}(w_{\ast}(\cdot)-w_{\ast}(t_{0}))\big{)}(t),\quad t\in[t_{0},t_{\ast}],

\gamma=\sigma\big{(}w_{\ast}(t_{0})+(D^{1-\alpha}y)(\cdot)\big{)}.

\gamma=\sigma\big{(}w_{\ast}(t_{0})+(D^{1-\alpha}y)(\cdot)\big{)}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

∎

11institutetext: M. Gomoyunov 22institutetext: Krasovskii Institute of Mathematics and Mechanics of the Ural Branch of the Russian Academy of Sciences, S. Kovalevskaya Str., 16, Ekaterinburg, Russia

Ural Federal University, Mira Str., 32, Ekaterinburg, Russia

22email: [email protected]

Solution to Zero-Sum Differential Game with Fractional Dynamics via Approximations

Mikhail Gomoyunov

(Received: date / Accepted: date)

Abstract

The paper deals with a zero-sum differential game in which the dynamical system is described by a fractional differential equation with the Caputo derivative of an order $\alpha\in(0,1).$ The goal of the first (second) player is to minimize (maximize) the value of a given quality index. The main contribution of the paper is the proof of the fact that this differential game has the value, i.e., the lower and upper game values coincide. The proof is based on the appropriate approximation of the game by a zero-sum differential game in which the dynamical system is described by a first order functional differential equation of a retarded type. It is shown that the values of the approximating differential games have a limit, and this limit is the value of the original game. Moreover, the optimal players’ feedback control procedures are proposed that use the optimally controlled approximating system as a guide.

Keywords:

Differential game Value of the game Optimal strategies Fractional derivative Fractional differential equation Approximation Control with a guide

1 Introduction

The paper is devoted to the development of the theory of zero-sum differential games (see, e.g., Bardi_Capuzzo-Dolcetta_1997 ; Basar_Olsder_1999 ; Cardaliaguet_Quincampoix_Saint-Pierre_2007 ; Fleming_Soner_2006 ; Friedman_1971 ; Isaacs_1965 ; Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Pontryagin_1981 and the references therein) to the case when a motion of a dynamical system is described a fractional differential equation. For the basics of fractional calculus, theory of fractional differential equations and their applications, the reader is referred to Diethelm_2010 ; Kilbas_Srivastava_Trujillo_2006 ; Miller_Ross_1993 ; Podlubny_1999 ; Samko_Kilbas_Marichev_1993 .

Despite the fact that a great number of various control problems in fractional order systems are intensively studied nowadays, only a few works deal with differential games in such systems (see Bannikov_2017 ; Chikrii_Matychyn_2011 ; Mamatov_Alimov_2018 ; Petrov_2018 and the references therein). Furthermore, in these works, only some special classes of linear pursuit-evasion differential games are investigated.

In the paper, we follow the game-theoretical approach Krasovskii_Krasovskii_1995 ; Krasovskii_1985 ; Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Osipov_1971 ; Subbotin_1995 ; Subbotin_Chentsov_1981 and consider a quite general formulation of a zero-sum differential game in a fractional order system. We suppose that a motion of the system is described by a non-linear fractional differential equation with the Caputo derivative of an order $\alpha\in(0,1).$ The game is considered on a finite time interval. The goal of the first (second) player is to minimize (maximize) the value of a given quality index evaluating the system’s motion. The main contribution of the paper is the proof of the fact that the considered differential game has the value, i.e., the lower and upper values of the game coincide.

Due to non-local structure of fractional order derivatives, fractional differential equations are used for describing dynamical systems with the memory effects of a special kind. It makes these equations close to functional differential equations (see, e.g., Bellman_Cooke_1963 ; Hale_Lunel_1993 ; Kolamnovskii_Myshkis_1992 ). In particular, the Riemann-Liouville fractional integral of the order $(1-\alpha)$ of the solution to the considered fractional differential equation is, by the definition, the solution to the corresponding first order functional differential equation of a neutral type. It allows us to introduce a differential game in this neutral type system and study it instead of the original game. However, to the best of our knowledge, there are no results that can be applied for investigating the obtained differential game. Namely, in Baranovskaya_2015 ; Gomoyunov_Lukoyanov_2018 ; Gomoyunov_Lukoyanov_Plaksin_2017 ; Lukoyanov_Gomoyunov_Plaksin_2017 ; Lukoyanov_Plaksin_2015 ; Maksimov_1991 ; Nikol'skii_1972 , only some special classes of neutral type systems are considered, and, in Vasil'ev_1972 , the game is considered in the classes of players’ programm (open-loop) strategies.

Nevertheless, following Gomoyunov_2018 , based on the finite-difference Grünwald-Letnikov formulas for calculation of fractional derivatives (see, e.g., (Samko_Kilbas_Marichev_1993, , p. 386)), one can approximate the obtained differential game in the first order neutral type system by a differential game in a first order retarded type system. Let us note that differential games in dynamical systems described by functional differential equations of a retarded type are quite well studied (see, e.g., Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Osipov_1971 and the references therein), especially in comparison with differential games in neutral type systems. Thus, applying the results of Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 , we derive that the approximating differential game has the value, and, moreover, this value is achieved in the appropriate classes of players’ positional (closed-loop) strategies.

Further, based on the ideas from Krasovskii_Kotelnikova_2012 (see also Lukoyanov_Plaksin_2015 ), to establish a connection between the original and approximating differential games, we consider the players’ feedback control procedures that use the optimally controlled approximating system as a guide (see, e.g., (Krasovskii_Subbotin_1988, , § 8.2)). It allows us to prove that the values of the approximating games have a limit, and this limit coincides with the value of the original game. The key point here is the mutual aiming procedure between the original and approximating systems Gomoyunov_2018 that provides the desired proximity between the systems’ motions. Moreover, in particular, we obtain that the proposed players’ control procedures with a guide guarantee the game value with a given accuracy, and, in this sense, they can be called optimal.

Let us note also that differential games give a natural formalization of control problems under conditions of unknown disturbances (see, e.g., Krasovskii_Krasovskii_1995 ; Krasovskii_1985 ; Krasovskii_Subbotin_1988 ; Subbotin_Chentsov_1981 ). In some other frameworks, such control problems in fractional order systems are studied, e.g., in Jajarmi_Hajipour_Mohammadzadeh_Baleanu_2018 ; Shen_Lam_2014 .

The rest of the paper is organized as follows. In Sect. 2, we introduce the notations, recall the definitions of fractional order integrals and derivatives, and give some of their properties. In Sect. 3, the considered differential game in a fractional order system is described, and, in particular, the notion of the game value is defined. The corresponding differential game in a first order neutral type system is discussed in Sect. 4. In Sect. 5, we propose an approximation of this game by a differential game in a first order retarded type system. In Sect. 6, the mutual aiming procedure between the original and approximating systems and the optimal players’ control procedures with a guide are described, the limit of the values of the approximating differential game is introduced. In Sect. 7, we prove that the original differential game has the value. Concluding remarks are given in Sect. 8.

2 Notations and Definitions

Let $t_{0},\vartheta\in\mathbb{R},$ $t_{0}<\vartheta,$ and $n\in\mathbb{N}$ be fixed. Let $\mathbb{R}^{n}$ be the $n$ -dimensional Euclidian space with the scalar product $\langle\cdot,\cdot\rangle$ and the norm $\|\cdot\|.$ By $L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n}),$ we denote the space of essentially bounded (Lebesgue) measurable functions $x:[t_{0},\vartheta]\rightarrow\mathbb{R}^{n}$ with the norm

[TABLE]

Let $C([t_{0},\vartheta],\mathbb{R}^{n})$ be the space of continuous functions $x:[t_{0},\vartheta]\rightarrow\mathbb{R}^{n}$ with the uniform norm, which is also denoted by $\|\cdot\|_{\infty}.$ Let ${\operatorname{Lip}}^{0}([t_{0},\vartheta],\mathbb{R}^{n})$ be the set of functions $x(\cdot)\in C([t_{0},\vartheta],\mathbb{R}^{n})$ that are Lipschitz continuous and satisfy the equality $x(t_{0})=0.$ For $L\geq 0,$ we denote by ${\operatorname{Lip}}_{L}^{0}([t_{0},\vartheta],\mathbb{R}^{n})$ the set of functions $x(\cdot)\in{\operatorname{Lip}}^{0}([t_{0},\vartheta],\mathbb{R}^{n})$ that satisfy the Lipschitz condition with this constant $L.$

Let $\alpha\in(0,1)$ be fixed. For a function $x:[t_{0},\vartheta]\rightarrow\mathbb{R}^{n},$ the Riemann-Liouville (R.-L.) fractional integral of the order $\alpha$ and the R.-L. fractional derivative of the order $\alpha$ are respectively defined by

[TABLE]

where $\Gamma$ is the gamma function. For the properties of the fractional order integrals and derivatives, the reader is referred to Diethelm_2010 ; Kilbas_Srivastava_Trujillo_2006 ; Miller_Ross_1993 ; Podlubny_1999 ; Samko_Kilbas_Marichev_1993 . In this section, we shortly describe those properties that are used in the paper. The details can also be found in Gomoyunov_2017 ; Gomoyunov_2018 .

Let $I^{\alpha}(L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n}))$ be the set of functions $x:[t_{0},\vartheta]\rightarrow\mathbb{R}^{n}$ that can be represented by the R.-L. fractional integral of the order $\alpha$ of a function $\varphi(\cdot)\in L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n}),$ i.e., $x(t)=(I^{\alpha}\varphi)(t),$ $t\in[t_{0},\vartheta].$

Let $x(\cdot)\in I^{\alpha}(L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n})).$ Then the derivative $(D^{\alpha}x)(t)$ exists for almost every $t\in[t_{0},\vartheta];$ the inclusion $(D^{\alpha}x)(\cdot)\in L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n})$ is valid; and $x(t)=(I^{\alpha}(D^{\alpha}x))(t),$ $t\in[t_{0},\vartheta].$ Moreover, there exists $H>0$ such that, for any $x(\cdot)\in I^{\alpha}(L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n})),$ the following inequality holds:

[TABLE]

Further, let us consider the function $y(t)=(I^{1-\alpha}x)(t),$ $t\in[t_{0},\vartheta].$ Then, according to (1), we have $\dot{y}(t)=(D^{\alpha}x)(t)$ for almost every $t\in[t_{0},\vartheta],$ where we denote $\dot{y}(t)=dy(t)/dt;$ the inclusion $y(\cdot)\in{\operatorname{Lip}}^{0}_{L}([t_{0},\vartheta],\mathbb{R}^{n})$ is valid with the constant $L=\|(D^{\alpha}x)(\cdot)\|_{\infty};$ and the following representation formula holds:

[TABLE]

Finally, for a function $x:[t_{0},\vartheta]\rightarrow\mathbb{R}^{n},$ the Caputo (C.) fractional derivative of the order $\alpha$ is defined by

[TABLE]

In particular, if $x(t_{0})=0,$ then the R.-L. and C. fractional derivatives coincide.

3 Differential Game with Fractional Dynamics

3.1 Fractional Order System

We consider a dynamical system which motion is described by the following fractional differential equation with the C. derivative of the order $\alpha:$

[TABLE]

Here $t$ is the time; $x(t)$ is the value of the state vector at the time $t;$ $u(t)$ and $v(t)$ are respectively the values of the control vectors of the first and second players at the time $t;$ $t_{0}$ and $\vartheta$ are called the initial and terminal times; the sets $\mathbb{U}\subset\mathbb{R}^{r}$ and $\mathbb{V}\subset\mathbb{R}^{s}$ are compact, $r,s\in\mathbb{N}.$ We suppose that the function $f:[t_{0},\vartheta]\times\mathbb{R}^{n}\times\mathbb{U}\times\mathbb{V}\rightarrow\mathbb{R}^{n}$ satisfies the following conditions:

( $A.1$ )

The function $f$ is continuous.

( $A.2$ )

For any $R\geq 0,$ there exists $\lambda>0$ such that

[TABLE]

for any $t\in[t_{0},\vartheta],$ $x,x^{\prime}\in B(R)=\{y\in\mathbb{R}^{n}:\,\|y\|\leq R\},$ $u\in\mathbb{U},$ and $v\in\mathbb{V}.$

( $A.3$ )

There exists $c>0$ such that

[TABLE]

for any $t\in[t_{0},\vartheta],$ $x\in\mathbb{R}^{n},$ $u\in\mathbb{U},$ and $v\in\mathbb{V}.$

( $A.4$ )

The saddle point condition in a small game (Krasovskii_Subbotin_1988, , p. 8) or, in another terminology, the Isaacs’ condition (Isaacs_1965, , p. 35), holds, i.e.,

[TABLE]

for any $t\in[t_{0},\vartheta]$ and $x,s\in\mathbb{R}^{n}.$

Note that these conditions are quite typical for the differential games theory with first order dynamics (see, e.g., (Krasovskii_Subbotin_1988, , p. 7)).

3.2 Admissible Positions of the System

By a position of system (5), we mean a pair $(t,w(\cdot))$ consisting of a time $t\in[t_{0},\vartheta]$ and a function $w(\cdot)\in C([t_{0},t],\mathbb{R}^{n}),$ which is treated as a motion history on the interval $[t_{0},t].$ The set of the positions $(t,w(\cdot))$ is denoted by $G.$ A position $(t,w(\cdot))\in G$ is called admissible if the relations below are valid:

[TABLE]

where $R_{0}>0$ is a fixed constant, $c$ is the constant from condition $(A.3).$ According to the definition given in Sect. 2, the second inclusion in (6) means that there exists a function $\varphi(\cdot)\in L^{\infty}([t_{0},t],\mathbb{R}^{n})$ such that $w(\tau)=w(t_{0})+(I^{\alpha}\varphi)(\tau),$ $\tau\in[t_{0},t].$ The set of the admissible positions is denoted by ${G_{\ast}}.$

Proposition 1

The set ${G_{\ast}}$ is not empty, and there exist $R_{1}>0,$ $M_{1}>0,$ and $H_{1}>0$ such that, for any $(t,w(\cdot))\in{G_{\ast}},$ the inequalities below are valid:

[TABLE]

Proof

Let $t\in[t_{0},\vartheta]$ and $w_{0}\in B(R_{0}).$ Let us consider the function $w(\tau)=w_{0},$ $\tau\in[t_{0},t].$ According to (1) and (4), we have $(^{C}D^{\alpha}w)(\tau)=0,$ $\tau\in[t_{0},t].$ Hence, the inclusion $(t,w(\cdot))\in{G_{\ast}}$ is valid, and, therefore, the set ${G_{\ast}}$ is not empty.

Further, let us define

[TABLE]

where $c$ is the constant from $(A.3),$ $E_{\alpha}$ is the Mittag-Leffler function (see, e.g., (Samko_Kilbas_Marichev_1993, , (1.90))), and $H$ is the constant from (2). Let $(t,w(\cdot))\in{G_{\ast}}.$ Then, due to (6) and the results given in Sect. 2, we have

[TABLE]

for any $\tau\in[t_{0},t],$ and, therefore,

[TABLE]

From this inequality, applying the fractional version of Bellman-Gronwall lemma (see, e.g., (Diethelm_2010, , Lemma 6.19) and also (Gomoyunov_2017, , Lemma 1.1)), we conclude $\|w(\tau)\|\leq R_{1},$ $\tau\in[t_{0},t].$ Thus, according to (6), we have

[TABLE]

Finally, by the choice of $H,$ we derive

[TABLE]

The proposition is proved. $\square$

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ and $t^{\ast}\in[t_{\ast},\vartheta].$ By admissible control realizations (controls) of the first and second players on the interval $[t_{\ast},t^{\ast}),$ we mean measurable functions $u:[t_{\ast},t^{\ast})\rightarrow\mathbb{U}$ and $v:[t_{\ast},t^{\ast})\rightarrow\mathbb{V},$ respectively. The sets of the admissible control realizations of the players are denoted by $\mathcal{U}(t_{\ast},t^{\ast})$ and $\mathcal{V}(t_{\ast},t^{\ast}).$ Following Idczak_Kamocki_2011 (see also Gomoyunov_2017 ), by a motion of system (5) generated from the initial position $(t_{\ast},w_{\ast}(\cdot))$ by players’ control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast})$ and $v(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast}),$ we mean a function $x(\cdot)\in\{w_{\ast}(t_{0})\}+I^{\alpha}(L^{\infty}([t_{0},t^{\ast}],\mathbb{R}^{n}))$ that satisfies the initial condition

[TABLE]

and, together with $u(\cdot)$ and $v(\cdot),$ satisfies Eq. (5) for almost every $t\in[t_{\ast},t^{\ast}].$ For such a motion $x(\cdot)$ and a time $t\in[t_{0},t^{\ast}],$ we denote by $(t,x_{t}(\cdot))$ the corresponding position of system (5), i.e.,

[TABLE]

Proposition 2

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ and $t^{\ast}\in[t_{\ast},\vartheta].$ Then any players’ control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast})$ and $v(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast})$ generate from the initial position $(t_{\ast},w_{\ast}(\cdot))$ a unique motion $x(\cdot)$ of system $(\ref{system}).$ Moreover, for any $t\in[t_{0},t^{\ast}],$ the inclusion $(t,x_{t}(\cdot))\in{G_{\ast}}$ is valid.

Proof

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $t^{\ast}\in[t_{\ast},\vartheta],$ $u(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast}),$ and $v(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast}).$ The existence and uniqueness of the corresponding motion $x(\cdot)$ of system (5) can be proved by the standard scheme (see, e.g., (Diethelm_2010, , Theorem 6.1), (Wang_Zhou_2011, , Theorem 3.1), and also (Gomoyunov_2017, , Theorem 2.1)), if we note that $x(\cdot)$ is the motion of system (5) if and only if $x(\cdot)$ satisfies the inclusion $x(\cdot)\in C([t_{0},t^{\ast}],\mathbb{R}^{n}),$ initial condition (7), and the integral equation

[TABLE]

Further, for $t\in[t_{0},t_{\ast}],$ the inclusion $(t,x_{t}(\cdot))\in{G_{\ast}}$ follows from initial condition (7) and the inclusion $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}.$ For $t\in(t_{\ast},t^{\ast}],$ the inclusion $(t,x_{t}(\cdot))\in{G_{\ast}}$ is valid due to $(A.3).$ The proposition is proved. $\square$

From Propositions 1 and 2 we derive the following result.

Corollary 1

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ and $t^{\ast}\in[t_{\ast},\vartheta].$ Let $x(\cdot)$ be the motion of system $(\ref{system})$ generated from the initial position $(t_{\ast},w_{\ast}(\cdot))$ by players’ control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast})$ and $v(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast}).$ Then the following inequalities hold:

[TABLE]

where the constants $R_{1}$ and $H_{1}$ are taken from Proposition $\ref{prop_G_properties}.$

Let us note also the following property of motions of system (5), which follows directly from Proposition 2. Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $t^{\ast}\in[t_{\ast},\vartheta],$ and let $x(\cdot)$ be the motion generated from $(t_{\ast},w_{\ast}(\cdot))$ by $u(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast})$ and $v(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast}).$ Further, let $t^{\ast\ast}\in[t^{\ast},\vartheta],$ and let $x^{\ast}(\cdot)$ be the motion generated from $(t^{\ast},x_{t^{\ast}}(\cdot))$ by $u^{\ast}(\cdot)\in\mathcal{U}(t^{\ast},t^{\ast\ast})$ and $v^{\ast}(\cdot)\in\mathcal{V}(t^{\ast},t^{\ast\ast}).$ Then $x^{\ast}(\cdot)$ can be considered as the motion generated from $(t_{\ast},w_{\ast}(\cdot))$ by the realizations

[TABLE]

In particular, this property allows us to consider step-by-step feedback control procedures for constructing players’ control realizations (see Sect. 6).

3.3 Quality Index

Let $x(\cdot)$ be the motion of system (5) generated from an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ by players’ control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta).$ Let quality of this motion be evaluated by the index

[TABLE]

We suppose that the function $\sigma:C([t_{0},\vartheta],\mathbb{R}^{n})\rightarrow\mathbb{R}$ satisfies the following condition:

( $A.5$ )

The function $\sigma$ is continuous.

For dynamical system (5) and quality index (9), we consider a zero-sum differential game in which the first player aims to minimize the value of the quality index, and the second player aims to maximize it.

3.4 Non-anticipative Strategies and the Game Value

To define the value of the differential game (5), (9), we consider non-anticipative strategies of the players (see, e.g., (Bardi_Capuzzo-Dolcetta_1997, , Ch. VIII) and the references therein) and introduce the lower and upper values of the game. Note that, in another terminology, such strategies are called quasi-strategies (see, e.g., (Subbotin_Chentsov_1981, , p. 24)) or progressive strategies (see, e.g., (Fleming_Soner_2006, , § XI.4)).

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ be an initial position. By a non-anticipative strategy of the first player, we mean a function $\alpha:\mathcal{V}(t_{\ast},\vartheta)\rightarrow\mathcal{U}(t_{\ast},\vartheta)$ with the following property. For any $t^{\ast}\in[t_{\ast},\vartheta]$ and any second player’s control realizations $v(\cdot),v^{\prime}(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ if the equality $v(t)=v^{\prime}(t)$ is valid for almost every $t\in[t_{\ast},t^{\ast}],$ then the corresponding images $u(\cdot)=\alpha(v(\cdot))$ and $u^{\prime}(\cdot)=\alpha(v^{\prime}(\cdot))$ satisfy the equality $u(t)=u^{\prime}(t)$ for almost every $t\in[t_{\ast},t^{\ast}].$ The lower value of the differential game (5), (9) is defined by

[TABLE]

where $\gamma$ is the value of quality index (9) that corresponds to the motion $x(\cdot)$ generated from $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ by the second player’s control realization $v(\cdot)$ and the first player’s control realization $u(\cdot)=\alpha(v(\cdot)).$

Similarly, a function $\beta:\mathcal{U}(t_{\ast},\vartheta)\rightarrow\mathcal{V}(t_{\ast},\vartheta)$ is a non-anticipative strategy of the second player if, for any $t^{\ast}\in[t_{\ast},\vartheta]$ and any $u(\cdot),u^{\prime}(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ such that $u(t)=u^{\prime}(t)$ for almost every $t\in[t_{\ast},t^{\ast}],$ we have $v(t)=v^{\prime}(t)$ for almost every $t\in[t_{\ast},t^{\ast}],$ where $v(\cdot)=\beta(u(\cdot))$ and $v^{\prime}(\cdot)=\beta(u^{\prime}(\cdot)).$ The upper value of the game is defined by

[TABLE]

If the lower and upper game values coincide for any initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ then we say that the game has the value

[TABLE]

The goal of the paper is to prove that the differential game (5), (9) has the value, and, for any initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ construct the players’ feedback control procedures that guarantee the game value $\rho(t_{\ast},w_{\ast}(\cdot))$ with a given accuracy $\zeta>0.$ These results are formulated in Theorem 7.1 (see Sect. 7). The proof of this theorem follows the scheme from (Lukoyanov_Plaksin_2015, , Theorem 2) and is based on the appropriate approximation of the differential game (5), (9). Before describing this approximation, in the next section, we rewrite the considered differential game in another form.

4 Differential Game in a Neutral Type System

Let $x(\cdot)$ be the motion of system (5) generated from an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ by players’ control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta).$ Let us consider the function

[TABLE]

Since $x(\cdot)\in\{w_{\ast}(t_{0})\}+I^{\alpha}(L^{\infty}([t_{0},\vartheta],\mathbb{R}^{n})),$ then, according to the results given in Sect. 2, we have

[TABLE]

Substituting these equalities into Eq. (5), we obtain that, instead of the original differential game (5), (9), one can consider the differential game for the dynamical system

[TABLE]

under the initial condition

[TABLE]

and the quality index

[TABLE]

Furthermore, due to (3), one can rewrite Eq. (13) as follows:

[TABLE]

Note that the right-hand side of Eq. (16) depends explicitly on the history of the derivative $\dot{y}(\tau)$ for $\tau\in[t_{0},t].$ Therefore, in the terminology of the theory of functional differential equations (see, e.g., Bellman_Cooke_1963 ; Hale_Lunel_1993 ; Kolamnovskii_Myshkis_1992 ), Eq. (16) is a functional differential equation of a neutral type. To the best of our knowledge, in the theory of differential games in neutral type systems (see the references in Introduction), there are no results that can be directly applied for studying the game (13), (15), and, therefore, the original game (5), (9) too. However, as it is shown in the next section, the game (13), (15) can be approximated by a differential game in a retarded type system.

5 Approximating Differential Game

Following (Gomoyunov_2018, , Sect. 6), let us approximate in relations (13), (15) the fractional derivative $(D^{1-\alpha}y)(t)$ by the divided fractional difference $h^{\alpha-1}(\Delta_{h}^{1-\alpha}y)(t)$ with a step size $h>0,$ where (see, e.g., (Samko_Kilbas_Marichev_1993, , p. 385))

[TABLE]

the symbol $[\tau]$ means the integer part of $\tau\geq 0,$ and $\binom{1-\alpha}{i}$ are the binomial coefficients. In this section, we study the differential game obtained after this approximation.

5.1 Approximating Dynamical System and Quality Index

Let us fix a vector $w_{0}\in B(R_{0})$ and a sufficiently small value of the parameter $h>0.$ Note that, in what follows, the vector $w_{0}$ corresponds to an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ of system (5) such that $w_{0}=w_{\ast}(t_{0}).$ Taking into account the above, we consider the following zero-sum differential game, determined by these two parameters $w_{0}$ and $h.$ We introduce the approximating dynamical system which motion is described by the differential equation

[TABLE]

and the approximating quality index

[TABLE]

Here $y(t)$ is the value of the state vector; $p(t)$ and $q(t)$ are respectively the values of the control vectors of the first and second players. The first player minimizes the value of quality index (19), the second player maximizes it.

Note that, according to (17), at a time $t\in[t_{0},\vartheta],$ the right-hand side of Eq. (18) depends on the values $y(t-ih)$ for $i\in\overline{0,[(t-t_{0})/h]}$ and, in contrast to (16), does not depend explicitly on the history of the derivative $\dot{y}(\tau),$ $\tau\in[t_{0},t].$ Thus, Eq. (18) is a functional differential equation of a retarded type. In what follows, dealing with the game (18), (19), we mainly use the constructions and results from Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 .

Remark 1

Let us note that, even in a simple case when original quality index (9) is terminal, i.e., $\gamma=\mu(x(\vartheta))$ for a function $\mu:\mathbb{R}^{n}\rightarrow\mathbb{R},$ the corresponding approximating quality index ${\gamma_{w_{0},h}}=\mu(w_{0}+h^{\alpha-1}(\Delta_{h}^{1-\alpha}y)(\vartheta))$ is still non-terminal, since, according to (17), it depends on the values $y(\vartheta-ih)$ for $i\in\overline{0,[(\vartheta-t_{0})/h]}.$

Taking into account (11) and (12), by a position of approximating system (18), we mean a pair $(t,r(\cdot))\in G$ such that $r(t_{0})=0.$ The set of such positions is denoted by $G^{0}.$ This set is considered with the metric (see, e.g., Lukoyanov_2003 and also (Lukoyanov_2011, , p. 25))

[TABLE]

where $(t,r(\cdot)),(t^{\prime},r^{\prime}(\cdot))\in G^{0},$ and

[TABLE]

By the right-hand side of Eqs. (18), (19), let us define the functions

[TABLE]

where $(t,r(\cdot))\in G^{0},$ $p\in\mathbb{U},$ $q\in\mathbb{V},$ and $(\vartheta,y(\cdot))\in G^{0}.$

Directly from properties ( $A.1$ )–( $A.5$ ) of the functions $f$ and $\sigma$ it follows that these functions $f_{w_{0},h}$ and ${\sigma_{w_{0},h}}$ satisfy the following conditions:

$\bullet$

( $B.1$ ) For any $h>0,$ the functions $f_{w_{0},h}$ and ${\sigma_{w_{0},h}}$ are continuous uniformly in $w_{0}\in B(R_{0}).$

$\bullet$

( $B.2$ ) For any $h>0$ and any $R\geq 0,$ there exists $\lambda_{h}>0$ such that, for any $w_{0}\in B(R_{0}),$ the inequality

[TABLE]

is valid for any $(t,r(\cdot)),$ $(t,r^{\prime}(\cdot))\in G^{0}$ satisfying $\|r(\cdot)\|_{\infty}\leq R,$ $\|r^{\prime}(\cdot)\|_{\infty}\leq R$ and any $p\in\mathbb{U},$ $q\in\mathbb{V}.$

$\bullet$

( $B.3$ ) For any $h>0,$ there exists $c_{h}>0$ such that, for any $w_{0}\in B(R_{0}),$ the estimate

[TABLE]

holds for any $(t,r(\cdot))\in G^{0},$ $p\in\mathbb{U},$ and $q\in\mathbb{V}.$

$\bullet$

( $B.4$ ) For any $w_{0}\in B(R_{0})$ and any $h>0,$ the function $f_{w_{0},h}$ satisfies the saddle point condition in a small game, i.e.,

[TABLE]

for any $(t,r(\cdot))\in G^{0}$ and $s\in\mathbb{R}^{n}.$

According to (14), if an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ of original system (5) is given, we define the corresponding initial position $(t_{\ast},r_{\ast}(\cdot))\in G^{0}$ of approximating system (18) as follows:

[TABLE]

Due to Proposition 1 and the results given in Sect. 2, the function $r_{\ast}(\cdot)$ satisfies the inclusion $r_{\ast}(\cdot)\in{\operatorname{Lip}}_{M_{1}}^{0}([t_{0},t_{\ast}],\mathbb{R}^{n}).$ Taking this into account, we call a position $(t,r(\cdot))\in G^{0}$ of approximating system (18) admissible if

[TABLE]

where $\widetilde{c}_{h}=\max\{M_{1},c_{h}\},$ and $c_{h}$ is the constant from condition ( $B.3$ ). The set of such admissible positions is denoted by ${G_{h}^{0}}.$ Note that this set is independent on the parameter $w_{0}.$

Let $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}}$ and $t^{\ast}\in[t_{\ast},\vartheta].$ As in Sect. 3.2, by admissible control realizations of the players in the approximating game (18), (19), we mean functions $p(\cdot)\in\mathcal{U}(t_{\ast},t^{\ast})$ and $q(\cdot)\in\mathcal{V}(t_{\ast},t^{\ast}).$ Due to properties ( $B.1$ )–( $B.3$ ), from the initial position $(t_{\ast},r_{\ast}(\cdot)),$ such control realizations $p(\cdot)$ and $q(\cdot)$ uniquely generate the motion of approximating system (18) that is the function $y(\cdot)\in{\operatorname{Lip}}^{0}([t_{0},t^{\ast}],\mathbb{R}^{n})$ satisfying the initial condition $y(t)=r_{\ast}(t),$ $t\in[t_{0},t_{\ast}],$ and, together with $p(\cdot)$ and $q(\cdot),$ satisfying Eq. (18) for almost every $t\in[t_{\ast},t^{\ast}].$

Let us note the following properties of the set ${G_{h}^{0}}.$ Firstly, for any $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ the inclusion $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}}$ is valid for the function $r_{\ast}(\cdot)$ defined by (20). Secondly, for the motion $y(\cdot)$ of approximating system (18) generated from $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}}$ by $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ the inclusion $(t,y_{t}(\cdot))\in{G_{h}^{0}}$ holds for any $t\in[t_{\ast},\vartheta],$ where, according to (8), we denote $y_{t}(\tau)=y(\tau),$ $\tau\in[t_{0},t].$ Finally, the set ${G_{h}^{0}}$ is a compact subset of $G^{0}.$

Following the the scheme from (Gomoyunov_2018, , Lemma 2) and taking into account that the constant $R_{1}$ in Proposition 1 does not depend on an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ one can prove the result below.

Proposition 3

There exists $L_{1}>0$ such that the following statement holds. Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ be an initial position of original system $(\ref{system}).$ Let us consider approximating system $(\ref{system_y})$ for $w_{0}=w_{\ast}(t_{0}),$ any $h>0,$ and under the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by $(\ref{r_ast}).$ Then the inclusion $y(\cdot)\in{\operatorname{Lip}}_{L_{1}}^{0}([t_{0},\vartheta],\mathbb{R}^{n})$ is valid for any motion $y(\cdot)$ of the approximating system generated from $(t_{\ast},r_{\ast}(\cdot))$ by $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta).$

5.2 The Value of the Approximating Game

Let $w_{0}\in B(R_{0})$ and $h>0$ be fixed. Similarly to Sect. 3.4, in the approximating differential game (18), (19), one can consider non-anticipative strategies of the players and introduce the lower and upper game values, denoted respectively by $\rho^{(p)}_{w_{0},h}(t_{\ast},r_{\ast}(\cdot))$ and $\rho^{(q)}_{w_{0},h}(t_{\ast},r_{\ast}(\cdot)),$ $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}}.$ From the results of Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 (see also Gomoyunov_Lukoyanov_Plaksin_2017 ) it follows that, under conditions $(B.1)$ – $(B.4),$ the approximating game has the value

[TABLE]

and, furthermore, this value can be guaranteed by the players if they use the positional strategies, described in the next section.

5.3 Optimal Positional Strategies

Let $w_{0}\in B(R_{0})$ and $h>0$ be fixed. In the approximating differential game (5), (9), by the positional strategies $P_{w_{0},h}$ and $Q_{w_{0},h}$ of the players, we mean arbitrary functions

[TABLE]

where $\varepsilon$ is the accuracy parameter.

Let $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}},$ $\varepsilon>0,$ and let

[TABLE]

be a partition of the interval $[t_{\ast},\vartheta].$ The triple $\{P_{w_{0},h},\varepsilon,\Delta\}$ is called a control law of the first player. This law forms in the approximating system a piecewise constant control realization $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ by the following step-by-step feedback rule:

[TABLE]

where $y_{\tau_{1}}(\cdot)=r_{\ast}(\cdot).$ Thus, from the initial position $(t_{\ast},r_{\ast}(\cdot)),$ the control law of the first player $\{P_{w_{0},h},\varepsilon,\Delta\}$ together with a control realization of the second player $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ uniquely generate the motion $y(\cdot)$ of the approximating system and, therefore, determine the value ${\gamma_{w_{0},h}}$ of approximating quality index (19).

Similarly, we consider the control law of the second player $\{Q_{w_{0},h},\varepsilon,\Delta\},$ which forms a piecewise constant control realization $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ as follows:

[TABLE]

From the initial position $(t_{\ast},r_{\ast}(\cdot)),$ the control law $\{Q_{w_{0},h},\varepsilon,\Delta\}$ together with $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ uniquely generate the motion $y(\cdot)$ of the approximating system and determine the value ${\gamma_{w_{0},h}}$ of approximating quality index (19).

By the scheme from (Lukoyanov_2000, , Theorem 1) (see also (Lukoyanov_2011, , Theorem 17.1)), one can prove the following lemma (see Gomoyunov_Lukoyanov_Plaksin_2017 for a related technique).

Lemma 1

For any $w_{0}\in B(R_{0})$ and any $h>0,$ in the approximating differential game $(\ref{system_y}),$ $(\ref{quality_index_y}),$ there exist the players’ optimal positional strategies $P^{0}_{w_{0},h}$ and $Q^{0}_{w_{0},h}$ that are optimal uniformly in $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}}$ and $w_{0}\in B(R_{0}).$ Namely, for any $h>0$ and any $\zeta>0,$ one can choose $\varepsilon^{(1)}=\varepsilon^{(1)}(h,\zeta)>0$ and $\delta^{(1)}(\varepsilon)=\delta^{(1)}(\varepsilon,h,\zeta)>0,$ $\varepsilon\in(0,\varepsilon^{(1)}],$ such that the following statement holds. Let $w_{0}\in B(R_{0}),$ $(t_{\ast},r_{\ast}(\cdot))\in{G_{h}^{0}},$ $\varepsilon\in(0,\varepsilon^{(1)}],$ and let $\Delta$ be a partition $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)=\max_{j\in\overline{1,k}}(\tau_{j+1}-\tau_{j})\leq\delta^{(1)}(\varepsilon).$ Then the control law $\{P^{0}_{w_{0},h},\varepsilon,\Delta\}$ of the first player guarantees for the value ${\gamma_{w_{0},h}}$ of approximating quality index $(\ref{quality_index_y})$ the inequality

[TABLE]

for any control realization of the second player $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta);$ and the control law $\{Q^{0}_{w_{0},h},\varepsilon,\Delta\}$ of the second player guarantees for the value ${\gamma_{w_{0},h}}$ of approximating quality index $(\ref{quality_index_y})$ the inequality

[TABLE]

for any control realization of the first player $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta).$

Note that the uniformness in the parameter $w_{0}\in B(R_{0})$ is provided by the corresponding uniformness in properties ( $B.1$ )–( $B.3$ ).

Let us describe shortly one of the ways of constructing such optimal strategies $P^{0}_{w_{0},h}$ and $Q^{0}_{w_{0},h}.$ We apply the method of extremal shift to accompanying points (see, e.g., Krasovskii_Krasovskii_1995 ; Krasovskii_1985 and also Lukoyanov_2000 ; Lukoyanov_2011 ). For simplicity of the notation below, it is convenient to consider the so-called pre-strategies of the players in the approximating game (18), (19). Namely, by pre-strategies $\mathbf{p}_{w_{0},h}$ and $\mathbf{q}_{w_{0},h}$ of the first and second players, we mean functions

[TABLE]

that, for any $(t,r(\cdot))\in{G_{h}^{0}}$ and any $s\in\mathbb{R}^{n},$ satisfy the inclusions

[TABLE]

Let $(t,r(\cdot))\in{G_{h}^{0}}$ and $\varepsilon>0.$ For the first and second players, we choose the accompanying points $r^{(p)}_{\varepsilon}(\cdot)$ and $r^{(q)}_{\varepsilon}(\cdot)$ from the conditions

[TABLE]

where the minimum and maximum are calculated over the functions $r_{\varepsilon}(\cdot)$ such that

[TABLE]

and the constant $\lambda_{h}$ is chosen by the set $G_{h}^{0}$ in accordance with property ( $B.2$ ). Note that the minimum and maximum are attained due to continuity of the value function ${G_{h}^{0}}\ni(t_{\ast},r_{\ast}(\cdot))\mapsto\rho_{w_{0},h}(t_{\ast},r_{\ast}(\cdot))\in\mathbb{R}.$ After that, we define

[TABLE]

Remark 2

There are another methods for constructing the optimal positional strategies $P^{0}_{w_{0},h}$ and $Q^{0}_{w_{0},h}$ (see, e.g., Krasovskii_Subbotin_1988 ; Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 ). For example, if the value function $\rho_{w_{0},h}$ is coinvariantly smooth, then the method of extremal shift in the direction of the coinvariant gradient of $\rho_{w_{0},h}$ can be applied. In the general non-smooth case, such strategies can be constructed by the extremal shift in direction of the coinvariant gradient of a suitable coinvariantly smooth auxiliary function. Also, one can use the methods based on the notions of maximal $u$ - and $v$ -stable bridges. Furthermore, there are some specific methods for constructing the optimal strategies in the linear case (see, e.g., Gomoyunov_Lukoyanov_2012 ; Lukoyanov_Reshetova_1998 ).

6 Players’ Control Procedures with a Guide

In this section, we propose the players’ feedback control procedures that use the optimally controlled approximating system (18) as a guide. It allows us to show that the values of the approximating differential games (18), (19) have the limit when $h\downarrow 0.$ This fact constitutes the basis of the proof of the main result of the paper formulated in Theorem 7.1 (see Sect. 7).

6.1 Mutual Aiming Procedures between the Systems

According to (Gomoyunov_2018, , Sect. 7), let us consider the following mutual aiming procedure between original (5) and approximating (18) systems. First of all, let us introduce pre-strategies of the players in the original game (5), (9). By pre-strategies $\mathbf{u}$ and $\mathbf{v}$ of the first and second players, we mean functions

[TABLE]

that, for any $t\in[t_{0},\vartheta]$ and any $x,s\in\mathbb{R}^{n},$ satisfy the inclusions

[TABLE]

Further, for $(t,w(\cdot))\in{G_{\ast}}$ and $(t,r(\cdot))\in{G_{h}^{0}},$ let us denote

[TABLE]

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ be an initial position of original system (5). Let us fix $h>0,$ put $w_{0}=w_{\ast}(t_{0}),$ and consider the corresponding approximating system (18) under the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by (20). Let us fix also a partition $\Delta$ (21). Let a first player’s control realization $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ in the original system and a second player’s control realization $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ in the approximating system be formed simultaneously according to the following step-by-step feedback rule:

[TABLE]

where

[TABLE]

and $\mathbf{q}_{w_{0},h}$ is a pre-strategy of the second player in the approximating game.

Lemma 2

For any $\xi>0,$ there exist $h^{(2)}=h^{(2)}(\xi)>0$ and $\delta^{(2)}=\delta^{(2)}(\xi)>0$ such that, for any initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ of original system $(\ref{system})$ and any partition $\Delta$ $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta^{(2)},$ the following statement is valid. Let us consider approximating system $(\ref{system_y})$ for $w_{0}=w_{\ast}(t_{0})$ and $h\in(0,h^{(2)}]$ under the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by $(\ref{r_ast}).$ Then, for any control realizations $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ and $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta),$ if control realizations $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ are formed according to the mutual aiming procedure $(\ref{procedure_1}),$ $(\ref{s_j_1}),$ then the corresponding motions $x(\cdot)$ and $y(\cdot)$ of the original and approximating systems satisfy the inequality

[TABLE]

The lemma is proved by the scheme from (Gomoyunov_2018, , Theorem 3), if we take into account that the constants $R_{1}$ and $H_{1}$ in Corollary 1 and the constant $L_{1}$ in Proposition 3 do not depend on an initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}.$

Similarly, one can consider another mutual aiming procedure between the original and approximating systems. Namely, let $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ and $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ be formed on the basis of the partition $\Delta$ as follows:

[TABLE]

where

[TABLE]

By analogy with Lemma 2, we obtain the following result.

Lemma 3

For any $\xi>0,$ there exist $h^{(3)}=h^{(3)}(\xi)>0$ and $\delta^{(3)}=\delta^{(3)}(\xi)>0$ such that, for any initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}$ of original system $(\ref{system})$ and any partition $\Delta$ $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta^{(3)},$ the following statement is valid. Let us consider approximating system $(\ref{system_y})$ for $w_{0}=w_{\ast}(t_{0})$ and $h\in(0,h^{(3)}]$ under the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by $(\ref{r_ast}).$ Then, for any realizations $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ if realizations $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ and $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ are formed according to the mutual aiming procedure $(\ref{procedure_2}),$ $(\ref{s_j_2}),$ then the corresponding motions $x(\cdot)$ and $y(\cdot)$ of the original and approximating systems satisfy inequality $(\ref{lem_procedure_1_main}).$

6.2 First Player’s Control Procedure with a Guide

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h>0,$ $\varepsilon>0,$ and a partition $\Delta$ (21) be fixed. We propose the following control procedure of the first player in the original differential game (5), (9). Let us consider the approximating differential game (18), (19) for $w_{0}=w_{\ast}(t_{0}),$ the fixed $h,$ and with the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by (20). By the steps of the partition $\Delta,$ the first player forms a control realization $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ in the original system and, at the same time, control realizations $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ in the approximating system as follows: $u(\cdot)$ and $q(\cdot)$ are formed according to the mutual aiming procedure (24), (25), and $p(\cdot)$ is formed by the control law $\{P^{0}_{w_{0},h},\varepsilon,\Delta\}$ (see (22)) on the basis of the optimal strategy $P^{0}_{w_{0},h}$ taken from Lemma 1. Note that, from the initial position $(t_{\ast},w_{\ast}(\cdot)),$ the described control procedure together with $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ uniquely generate the motion $x(\cdot)$ of the original system and, therefore, determine the value $\gamma$ of quality index (9). Moreover, during this control procedure, the first player generates the auxiliary motion $y(\cdot)$ of the approximating system, which can be considered as a guide (see, e.g., (Krasovskii_Subbotin_1988, , § 8.2)). For convenience, in what follows, the described control procedure is referred as $U(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta).$

For any $h>0,$ let us introduce the function

[TABLE]

where $r_{\ast}(\cdot)$ is defined according to (20), and $\rho_{w_{0},h}(t_{\ast},r_{\ast}(\cdot))$ is the value of the approximating differential game (18), (19) for $w_{0}=w_{\ast}(t_{0})$ and the fixed $h.$

Lemma 4

For any $\zeta>0,$ there exist

[TABLE]

such that, for any $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h\in(0,h^{(4)}],$ $\varepsilon\in(0,\varepsilon^{(4)}(h)],$ and any partition $\Delta$ $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta^{(4)}(\varepsilon,h),$ the first player’s control procedure with a guide $U(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta)$ guarantees for the value $\gamma$ of quality index $(\ref{quality_index})$ the inequality

[TABLE]

for any control realization of the second player $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta).$

Proof

Applying (Gomoyunov_2018, , Proposition 7), by the constant $L_{1}$ from Proposition 3, one can choose $R_{2}>0$ and $H_{2}>0$ such that the inequalities

[TABLE]

are valid for any $h>0$ and any $y(\cdot)\in{\operatorname{Lip}}^{0}_{L_{1}}([t_{0},\vartheta],\mathbb{R}^{n}).$ Taking the constants $R_{1}$ and $H_{1}$ from Corollary 1, we define $R_{3}=\max\{R_{1},R_{0}+R_{2}\},$ $H_{3}=\max\{H_{1},H_{2}\},$ and consider the compact set $D\subset C([t_{0},\vartheta],\mathbb{R}^{n})$ consisting of the functions $x(\cdot)$ such that

[TABLE]

Let $\zeta>0$ be fixed. Due to ( $A.5$ ), there exists $\xi=\xi(\zeta)>0$ such that, for any $x(\cdot),x^{\prime}(\cdot)\in D,$ from the inequality $\|x(\cdot)-x^{\prime}(\cdot)\|_{\infty}\leq\xi$ it follows that $|\sigma(x(\cdot))-\sigma(x^{\prime}(\cdot))|\leq\zeta/2.$ Let us choose $h^{(2)}(\xi)>0$ and $\delta^{(2)}(\xi)>0$ by Lemma 2, and put $h^{(4)}=h^{(2)}(\xi).$ Finally, for any $h\in(0,h^{(4)}],$ we take $\varepsilon^{(1)}(h,\zeta/2)>0$ and $\delta^{(1)}(\varepsilon,h,\zeta/2)>0,$ $\varepsilon\in(0,\varepsilon^{(1)}(h,\zeta/2)],$ from Lemma 1, and define

[TABLE]

Let us show that the statement of the lemma is valid for the chosen parameters.

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h\in(0,h^{(4)}],$ $\varepsilon\in(0,\varepsilon^{(4)}(h)],$ and let $\Delta$ be a partition (21) with the diameter $\operatorname{diam}(\Delta)\leq\delta^{(4)}(\varepsilon,h).$ Let us consider the motion $x(\cdot)$ of system (5) generated from the initial position $(t_{\ast},w_{\ast}(\cdot))$ by the first player’s control procedure with a guide $U=U(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta)$ and a second player’s control realization $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta).$ Let us consider the corresponding first player’s control realization $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ in the original system and players’ control realizations $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ in the approximating system (18) for $w_{0}=w_{\ast}(t_{0}),$ the fixed $h,$ and with the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by (20). Let $y(\cdot)$ be the corresponding motion of the approximating system. By the definition of $U,$ the motion $y(\cdot)$ is generated by the control law $\{P^{0}_{w_{0},h},\varepsilon,\Delta\}$ on the basis of the first player’s optimal positional strategy $P^{0}_{w_{0},h}.$ Hence, for the auxiliary function $x^{\prime}(t)=w_{0}+h^{\alpha-1}(\Delta^{1-\alpha}_{h}y)(t),$ $t\in[t_{0},\vartheta],$ due to the choice of $\varepsilon$ and $\Delta,$ we obtain

[TABLE]

Moreover, the control realizations $u(\cdot)$ and $q(\cdot)$ are formed according to the mutual aiming procedure (24), (25). Therefore, according to the choice of $h$ and $\Delta,$ we derive $\|x(\cdot)-x^{\prime}(\cdot)\|_{\infty}\leq\xi.$ Thus, taking into account the inclusions $x(\cdot),x^{\prime}(\cdot)\in D,$ by the choice of $\xi,$ we have

[TABLE]

The lemma is proved. $\square$

6.3 Second Player’s Control Procedure with a Guide

Similarly to Sect. 6.2, we propose the following second player’s control procedure with a guide in the original differential game (5), (9). Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h>0,$ $\varepsilon>0,$ and a partition $\Delta$ (21) be fixed. Let us consider the approximating differential game (18), (19) for $w_{0}=w_{\ast}(t_{0}),$ the fixed $h,$ and with the initial position $(t_{\ast},r_{\ast}(\cdot))$ defined by (20). By the steps of the partition $\Delta,$ the second player forms a control realization $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ in the original system and, at the same time, control realizations $p(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $q(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ in the approximating system as follows: $v(\cdot)$ and $p(\cdot)$ are formed according to the mutual aiming procedure (27), (28), and $q(\cdot)$ is formed by the control law $\{Q^{0}_{w_{0},h},\varepsilon,\Delta\}$ (see (23)) on the basis of the optimal strategy $Q^{0}_{w_{0},h}$ taken from Lemma 1. From the initial position $(t_{\ast},w_{\ast}(\cdot)),$ the described control procedure together with $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ uniquely generate the motion $x(\cdot)$ of the original system and determine the value $\gamma$ of quality index (9). In what follows, this control procedure with a guide is referred as $V(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta).$

By analogy with Lemma 4, on the basis of Lemma 3, the following result can be proved.

Lemma 5

For any $\zeta>0,$ there exist

[TABLE]

such that, for any $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h\in(0,h^{(5)}],$ $\varepsilon\in(0,\varepsilon^{(5)}(h)],$ and any partition $\Delta$ $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta^{(5)}(\varepsilon,h),$ the second player’s control procedure with a guide $V(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta)$ guarantees for the value $\gamma$ of quality index $(\ref{quality_index})$ the inequality

[TABLE]

for any control realization of the first player $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta).$

6.4 Limit of the Values of the Approximating Games

Considering in the original differential game (5), (9) the case when the both players use the described in Sect. 6.2 and 6.3 control procedures with a guide, we obtain the result below.

Lemma 6

For any initial position $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ the following limit exists:

[TABLE]

where $\widehat{\rho}_{h}(t_{\ast},w_{\ast}(\cdot))$ is defined by $(\ref{valh}).$ Moreover, the convergence is uniform in $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}.$

Proof

By the Cauchy criterion, to prove the lemma, it is sufficient to show that, for any $\zeta>0,$ there exists $h=h(\zeta)>0$ such that, for any $h_{1},h_{2}\in(0,h]$ and any $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ the inequality below is valid:

[TABLE]

Let $\zeta>0$ be fixed. By Lemmas 4 and 5, for $i\in\{4,5\},$ let us choose

[TABLE]

and put $h=\min\{h^{(4)},h^{(5)}\}.$ Let $h_{1},h_{2}\in(0,h].$ We define

[TABLE]

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ and $\Delta$ be a partition (21) with the diameter $\operatorname{diam}(\Delta)\leq\delta.$ Let us consider the motion $x(\cdot)$ of system (5) generated by the players’ control procedures with a guide $U(t_{\ast},w_{\ast}(\cdot),h_{1},\varepsilon,\Delta)$ and $V(t_{\ast},w_{\ast}(\cdot),h_{2},\varepsilon,\Delta).$ Then, for the realized value $\gamma=\sigma(x(\cdot))$ of quality index (9), due to the choice of $h_{1},$ $h_{2},$ $\varepsilon$ and $\Delta,$ we have

[TABLE]

wherefrom we derive (31). The lemma is proved. $\square$

7 Value of the Game

The main result of the paper is the following.

Theorem 7.1

Let conditions $(A.1)$ – $(A.5)$ be satisfied. Then:

The differential game $(\ref{system}),$ $(\ref{initial_condition})$ has the value $\rho(t_{\ast},w_{\ast}(\cdot)),$ $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}}.$ 2. 2.

*This value coincides with the limit $\widehat{\rho}\,(t_{\ast},w_{\ast}(\cdot))$ * $($ see $(\ref{lem_limit_main}))$ of the values of the approximating differential games $(\ref{system_y}),$ $(\ref{quality_index_y}).$ 3. 3.

For any $\zeta>0,$ there exist

[TABLE]

such that the following statement holds. Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h\in(0,h_{\ast}],$ $\varepsilon\in(0,\varepsilon_{\ast}(h)],$ and let $\Delta$ be a partition $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta_{\ast}(\varepsilon,h).$ Then the control procedure with a guide of the first player $U(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta)$ guarantees for the value $\gamma$ of quality index $(\ref{quality_index})$ the inequality

[TABLE]

for any control realization of the second player $u(\cdot)\in\mathcal{V}(t_{\ast},\vartheta);$ and the control procedure with a guide of the second player $V(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta)$ guarantees for the value $\gamma$ of quality index $(\ref{quality_index})$ the inequality

[TABLE]

for any control realization of the first player $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta).$

Proof

Let $\zeta>0$ be fixed. Let us define

[TABLE]

where $h^{(i)}>0,$ $\varepsilon^{(i)}(h)>0$ and $\delta^{(i)}(\varepsilon,h)>0$ for $i\in\{4,5\}$ are chosen as in (32), and $h^{(6)}=h^{(6)}(\zeta/2)>0$ is chosen according to Lemma 6 such that, for any $h\in(0,h^{(6)}]$ and any $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ the inequality below is valid:

[TABLE]

Let $(t_{\ast},w_{\ast}(\cdot))\in{G_{\ast}},$ $h\in(0,h_{\ast}],$ $\varepsilon\in(0,\varepsilon_{\ast}(h)],$ and let $\Delta$ be a partition $(\ref{Delta})$ with the diameter $\operatorname{diam}(\Delta)\leq\delta_{\ast}(\varepsilon,h).$ Let us consider the first player’s control procedure with a guide $U=U(t_{\ast},w_{\ast}(\cdot),h,\varepsilon,\Delta).$ On the basis of this procedure, we define the first player’s non-anticipative strategy $\alpha$ (see Sect. 3.4) as follows. For any $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ we consider the unique motion $x(\cdot)$ of system (5) and the control realization $u(\cdot)$ that are formed by $U$ and $v(\cdot),$ and put $\alpha(v(\cdot))=u(\cdot).$ Further, since, by the choice of $h,$ $\varepsilon$ and $\Delta,$ for any $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ the corresponding value $\gamma=\sigma(x(\cdot))$ of quality index (9) satisfies the inequality

[TABLE]

then, by definition (10) of the lower game value $\rho^{(u)}(t_{\ast},w_{\ast}(\cdot)),$ we obtain

[TABLE]

Taking into account that this inequality is valid for any $\zeta>0,$ we derive

[TABLE]

Now, arguing by contradiction, let us suppose that

[TABLE]

for a number $\zeta^{\ast}>0.$ Let $\alpha^{\ast}$ be a first player’s non-anticipative strategy such that, for any $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta),$ the motion $x(\cdot)$ of system (5) generated from $(t_{\ast},w_{\ast}(\cdot))$ by $v(\cdot)$ and $u(\cdot)=\alpha^{\ast}(v(\cdot))$ satisfies the inequality

[TABLE]

Similarly to above, based on Lemmas 5 and 6, by the number $\zeta^{\ast}/3,$ one can choose $h^{\ast}>0,$ $\varepsilon^{\ast}>0,$ and a partition $\Delta^{\ast}$ (21) such that the motion $x(\cdot)$ of system (5) generated from $(t_{\ast},w_{\ast}(\cdot))$ by the second player’s control procedure with a guide $V=V(t_{\ast},w_{\ast}(\cdot),h^{\ast},\varepsilon^{\ast},\Delta^{\ast})$ and a first player’s control realization $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ satisfies the inequality

[TABLE]

According to the definition (see Sect 6.3), the control procedure $V$ forms $v(\cdot)$ by the steps of the partition $\Delta^{\ast}$ on the basis of the information about the realized values of the state vectors of the original and approximating systems. Therefore, since $\alpha^{\ast}$ is non-anticipative, one can consider the motion $x^{\ast}(\cdot)$ generated by $u(\cdot)\in\mathcal{U}(t_{\ast},\vartheta)$ and $v(\cdot)\in\mathcal{V}(t_{\ast},\vartheta)$ such that $u(\cdot)=\alpha^{\ast}(v(\cdot)),$ and, at the same time, $v(\cdot)$ is formed by $V.$ For this motion $x^{\ast}(\cdot),$ we have

[TABLE]

wherefrom we obtain

[TABLE]

The obtained inequality contradicts (36) since $\zeta^{\ast}>0.$ Hence, we derive

[TABLE]

The validity of the equality $\rho^{(v)}(t_{\ast},w_{\ast}(\cdot))=\widehat{\rho}\,(t_{\ast},w_{\ast}(\cdot))$ can be established in a similar way. Thus, the first and second parts of the theorem are proved. Inequality (33) in the third part of the theorem follows directly from (35) and (37). The validity of inequality (34) can be shown similarly. The theorem is proved. $\square$

Remark 3

Let us note that, following (Krasovskii_Subbotin_1988, , § 8.2) (see also Lukoyanov_Plaksin_2015 for details), one can consider another formalization of the differential game (5), (9). Namely, one can formally describe a sufficiently wide classes of players’ strategies with a guide and introduce the corresponding values of the players’ optimal guaranteed results. One can show that from Theorem 7.1 it follows that these optimal guaranteed results coincide, i.e., the differential game has the value in the classes of strategies with a guide, and this value is equal to $\rho(t_{\ast},w_{\ast}(\cdot)).$ Moreover, the players’ strategies with a guide that guarantee inequalities (33) and (34) can be constructed on the basis of the proposed in Sects. 6.2 and 6.3 control procedures. In this sense, these control procedures with a guide can be called optimal.

Remark 4

In addition to Remark 2, another possible way of solving the approximating differential game (18), (19) is to approximate functional differential equation of a retarded type (18) by a high-dimensional system of ordinary differential equations (see, e.g., Lukoyanov_Plaksin_2015_2 and the references therein). Note that this approach can also be used for proving the existence of the game value and constructing the players’ optimal control procedures with a guide in the original differential game (5), (9).

8 Conclusion

In the paper, we have considered a zero-sum differential game in a dynamical system which motion is described by a fractional differential equation. We have proved that the lower and upper game values coincide, i.e., the differential game has the value. The proof is based on the appropriate approximation of the game by a differential game in a dynamical system which motion is described by a first order functional differential equation of a retarded type. This approach has also allowed us to propose the optimal players’ feedback control procedures with a guide, which can be effectively applied if the optimal in the approximating game players’ positional strategies are found.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) Bannikov AS (2017) Evasion from pursuers in a problem of group pursuit with fractional derivatives and phase constraints. Vestn Udmurtsk Univ Mat Mekh Komp Nauki 27(3):309–314 (in Russian)
2(2) Baranovskaya LV (2015) A method of resolving functions for one class of pursuit problems. East-Eur J Enterp Technol 74(4):4–8 (in Russian)
3(3) Bardi M, Capuzzo-Dolcetta I (1997) Optimal control and viscosity solutions of Hamilton-Jacobi-Bellman equations. Birkhäuser, Basel
4(4) Başar T, Olsder GJ (1999) Dynamic noncooperative game theory. Second edition, Classics in Applied Mathematics, SIAM, Philadelphia
5(5) Bellman R, Cooke KL (1963) Differential-difference equations. Academic Press, London
6(6) Cardaliaguet P, Quincampoix M, Saint-Pierre P (2007) Differential games through viability theory: old and recent results. In: Advances in dynamic game theory: numerical methods, algorithms, and applications to ecology and economics, Birkhäuser, Boston, pp. 3–35
7(7) Chikrii A, Matychyn I (2011) Riemann–Liouville, Caputo, and sequential fractional derivatives in differential games. In: Advances in dynamic games: theory, applications, and numerical methods for differential and stochastic games, Birkhäuser, Boston, pp. 61–81
8(8) Diethelm K (2010) The analysis of fractional differential equations. Springer, Berlin