Solution to Zero-Sum Differential Game with Fractional Dynamics via Approximations
Mikhail Gomoyunov

TL;DR
This paper proves that a zero-sum differential game with fractional dynamics has a well-defined value by approximating it with classical differential games and demonstrates the convergence of their values.
Contribution
It introduces a method to establish the value of a fractional differential game through approximation by retarded-type differential games and constructs optimal feedback controls.
Findings
The game has a well-defined value.
Approximate game values converge to the original game value.
Optimal feedback controls are constructed based on the approximations.
Abstract
The paper deals with a zero-sum differential game in which the dynamical system is described by a fractional differential equation with the Caputo derivative of an order The goal of the first (second) player is to minimize (maximize) the value of a given quality index. The main contribution of the paper is the proof of the fact that this differential game has the value, i.e., the lower and upper game values coincide. The proof is based on the appropriate approximation of the game by a zero-sum differential game in which the dynamical system is described by a first order functional differential equation of a retarded type. It is shown that the values of the approximating differential games have a limit, and this limit is the value of the original game. Moreover, the optimal players' feedback control procedures are proposed that use the optimally controlled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: M. Gomoyunov 22institutetext: Krasovskii Institute of Mathematics and Mechanics of the Ural Branch of the Russian Academy of Sciences, S. Kovalevskaya Str., 16, Ekaterinburg, Russia
Ural Federal University, Mira Str., 32, Ekaterinburg, Russia
22email: [email protected]
Solution to Zero-Sum Differential Game with Fractional Dynamics via Approximations
Mikhail Gomoyunov
(Received: date / Accepted: date)
Abstract
The paper deals with a zero-sum differential game in which the dynamical system is described by a fractional differential equation with the Caputo derivative of an order The goal of the first (second) player is to minimize (maximize) the value of a given quality index. The main contribution of the paper is the proof of the fact that this differential game has the value, i.e., the lower and upper game values coincide. The proof is based on the appropriate approximation of the game by a zero-sum differential game in which the dynamical system is described by a first order functional differential equation of a retarded type. It is shown that the values of the approximating differential games have a limit, and this limit is the value of the original game. Moreover, the optimal players’ feedback control procedures are proposed that use the optimally controlled approximating system as a guide.
Keywords:
Differential game Value of the game Optimal strategies Fractional derivative Fractional differential equation Approximation Control with a guide
1 Introduction
The paper is devoted to the development of the theory of zero-sum differential games (see, e.g., Bardi_Capuzzo-Dolcetta_1997 ; Basar_Olsder_1999 ; Cardaliaguet_Quincampoix_Saint-Pierre_2007 ; Fleming_Soner_2006 ; Friedman_1971 ; Isaacs_1965 ; Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Pontryagin_1981 and the references therein) to the case when a motion of a dynamical system is described a fractional differential equation. For the basics of fractional calculus, theory of fractional differential equations and their applications, the reader is referred to Diethelm_2010 ; Kilbas_Srivastava_Trujillo_2006 ; Miller_Ross_1993 ; Podlubny_1999 ; Samko_Kilbas_Marichev_1993 .
Despite the fact that a great number of various control problems in fractional order systems are intensively studied nowadays, only a few works deal with differential games in such systems (see Bannikov_2017 ; Chikrii_Matychyn_2011 ; Mamatov_Alimov_2018 ; Petrov_2018 and the references therein). Furthermore, in these works, only some special classes of linear pursuit-evasion differential games are investigated.
In the paper, we follow the game-theoretical approach Krasovskii_Krasovskii_1995 ; Krasovskii_1985 ; Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Osipov_1971 ; Subbotin_1995 ; Subbotin_Chentsov_1981 and consider a quite general formulation of a zero-sum differential game in a fractional order system. We suppose that a motion of the system is described by a non-linear fractional differential equation with the Caputo derivative of an order The game is considered on a finite time interval. The goal of the first (second) player is to minimize (maximize) the value of a given quality index evaluating the system’s motion. The main contribution of the paper is the proof of the fact that the considered differential game has the value, i.e., the lower and upper values of the game coincide.
Due to non-local structure of fractional order derivatives, fractional differential equations are used for describing dynamical systems with the memory effects of a special kind. It makes these equations close to functional differential equations (see, e.g., Bellman_Cooke_1963 ; Hale_Lunel_1993 ; Kolamnovskii_Myshkis_1992 ). In particular, the Riemann-Liouville fractional integral of the order of the solution to the considered fractional differential equation is, by the definition, the solution to the corresponding first order functional differential equation of a neutral type. It allows us to introduce a differential game in this neutral type system and study it instead of the original game. However, to the best of our knowledge, there are no results that can be applied for investigating the obtained differential game. Namely, in Baranovskaya_2015 ; Gomoyunov_Lukoyanov_2018 ; Gomoyunov_Lukoyanov_Plaksin_2017 ; Lukoyanov_Gomoyunov_Plaksin_2017 ; Lukoyanov_Plaksin_2015 ; Maksimov_1991 ; Nikol'skii_1972 , only some special classes of neutral type systems are considered, and, in Vasil'ev_1972 , the game is considered in the classes of players’ programm (open-loop) strategies.
Nevertheless, following Gomoyunov_2018 , based on the finite-difference Grünwald-Letnikov formulas for calculation of fractional derivatives (see, e.g., (Samko_Kilbas_Marichev_1993, , p. 386)), one can approximate the obtained differential game in the first order neutral type system by a differential game in a first order retarded type system. Let us note that differential games in dynamical systems described by functional differential equations of a retarded type are quite well studied (see, e.g., Krasovskii_Subbotin_1988 ; Lukoyanov_2011 ; Osipov_1971 and the references therein), especially in comparison with differential games in neutral type systems. Thus, applying the results of Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 , we derive that the approximating differential game has the value, and, moreover, this value is achieved in the appropriate classes of players’ positional (closed-loop) strategies.
Further, based on the ideas from Krasovskii_Kotelnikova_2012 (see also Lukoyanov_Plaksin_2015 ), to establish a connection between the original and approximating differential games, we consider the players’ feedback control procedures that use the optimally controlled approximating system as a guide (see, e.g., (Krasovskii_Subbotin_1988, , § 8.2)). It allows us to prove that the values of the approximating games have a limit, and this limit coincides with the value of the original game. The key point here is the mutual aiming procedure between the original and approximating systems Gomoyunov_2018 that provides the desired proximity between the systems’ motions. Moreover, in particular, we obtain that the proposed players’ control procedures with a guide guarantee the game value with a given accuracy, and, in this sense, they can be called optimal.
Let us note also that differential games give a natural formalization of control problems under conditions of unknown disturbances (see, e.g., Krasovskii_Krasovskii_1995 ; Krasovskii_1985 ; Krasovskii_Subbotin_1988 ; Subbotin_Chentsov_1981 ). In some other frameworks, such control problems in fractional order systems are studied, e.g., in Jajarmi_Hajipour_Mohammadzadeh_Baleanu_2018 ; Shen_Lam_2014 .
The rest of the paper is organized as follows. In Sect. 2, we introduce the notations, recall the definitions of fractional order integrals and derivatives, and give some of their properties. In Sect. 3, the considered differential game in a fractional order system is described, and, in particular, the notion of the game value is defined. The corresponding differential game in a first order neutral type system is discussed in Sect. 4. In Sect. 5, we propose an approximation of this game by a differential game in a first order retarded type system. In Sect. 6, the mutual aiming procedure between the original and approximating systems and the optimal players’ control procedures with a guide are described, the limit of the values of the approximating differential game is introduced. In Sect. 7, we prove that the original differential game has the value. Concluding remarks are given in Sect. 8.
2 Notations and Definitions
Let and be fixed. Let be the -dimensional Euclidian space with the scalar product and the norm By we denote the space of essentially bounded (Lebesgue) measurable functions with the norm
[TABLE]
Let be the space of continuous functions with the uniform norm, which is also denoted by Let be the set of functions that are Lipschitz continuous and satisfy the equality For we denote by the set of functions that satisfy the Lipschitz condition with this constant
Let be fixed. For a function the Riemann-Liouville (R.-L.) fractional integral of the order and the R.-L. fractional derivative of the order are respectively defined by
[TABLE]
where is the gamma function. For the properties of the fractional order integrals and derivatives, the reader is referred to Diethelm_2010 ; Kilbas_Srivastava_Trujillo_2006 ; Miller_Ross_1993 ; Podlubny_1999 ; Samko_Kilbas_Marichev_1993 . In this section, we shortly describe those properties that are used in the paper. The details can also be found in Gomoyunov_2017 ; Gomoyunov_2018 .
Let be the set of functions that can be represented by the R.-L. fractional integral of the order of a function i.e.,
Let Then the derivative exists for almost every the inclusion is valid; and Moreover, there exists such that, for any the following inequality holds:
[TABLE]
Further, let us consider the function Then, according to (1), we have for almost every where we denote the inclusion is valid with the constant and the following representation formula holds:
[TABLE]
Finally, for a function the Caputo (C.) fractional derivative of the order is defined by
[TABLE]
In particular, if then the R.-L. and C. fractional derivatives coincide.
3 Differential Game with Fractional Dynamics
3.1 Fractional Order System
We consider a dynamical system which motion is described by the following fractional differential equation with the C. derivative of the order
[TABLE]
Here is the time; is the value of the state vector at the time and are respectively the values of the control vectors of the first and second players at the time and are called the initial and terminal times; the sets and are compact, We suppose that the function satisfies the following conditions:
()
The function is continuous.
()
For any there exists such that
[TABLE]
for any and
()
There exists such that
[TABLE]
for any and
()
The saddle point condition in a small game (Krasovskii_Subbotin_1988, , p. 8) or, in another terminology, the Isaacs’ condition (Isaacs_1965, , p. 35), holds, i.e.,
[TABLE]
for any and
Note that these conditions are quite typical for the differential games theory with first order dynamics (see, e.g., (Krasovskii_Subbotin_1988, , p. 7)).
3.2 Admissible Positions of the System
By a position of system (5), we mean a pair consisting of a time and a function which is treated as a motion history on the interval The set of the positions is denoted by A position is called admissible if the relations below are valid:
[TABLE]
where is a fixed constant, is the constant from condition According to the definition given in Sect. 2, the second inclusion in (6) means that there exists a function such that The set of the admissible positions is denoted by
Proposition 1
The set is not empty, and there exist and such that, for any the inequalities below are valid:
[TABLE]
Proof
Let and Let us consider the function According to (1) and (4), we have Hence, the inclusion is valid, and, therefore, the set is not empty.
Further, let us define
[TABLE]
where is the constant from is the Mittag-Leffler function (see, e.g., (Samko_Kilbas_Marichev_1993, , (1.90))), and is the constant from (2). Let Then, due to (6) and the results given in Sect. 2, we have
[TABLE]
for any and, therefore,
[TABLE]
From this inequality, applying the fractional version of Bellman-Gronwall lemma (see, e.g., (Diethelm_2010, , Lemma 6.19) and also (Gomoyunov_2017, , Lemma 1.1)), we conclude Thus, according to (6), we have
[TABLE]
Finally, by the choice of we derive
[TABLE]
The proposition is proved.
Let and By admissible control realizations (controls) of the first and second players on the interval we mean measurable functions and respectively. The sets of the admissible control realizations of the players are denoted by and Following Idczak_Kamocki_2011 (see also Gomoyunov_2017 ), by a motion of system (5) generated from the initial position by players’ control realizations and we mean a function that satisfies the initial condition
[TABLE]
and, together with and satisfies Eq. (5) for almost every For such a motion and a time we denote by the corresponding position of system (5), i.e.,
[TABLE]
Proposition 2
Let and Then any players’ control realizations and generate from the initial position a unique motion of system Moreover, for any the inclusion is valid.
Proof
Let and The existence and uniqueness of the corresponding motion of system (5) can be proved by the standard scheme (see, e.g., (Diethelm_2010, , Theorem 6.1), (Wang_Zhou_2011, , Theorem 3.1), and also (Gomoyunov_2017, , Theorem 2.1)), if we note that is the motion of system (5) if and only if satisfies the inclusion initial condition (7), and the integral equation
[TABLE]
Further, for the inclusion follows from initial condition (7) and the inclusion For the inclusion is valid due to The proposition is proved.
From Propositions 1 and 2 we derive the following result.
Corollary 1
Let and Let be the motion of system generated from the initial position by players’ control realizations and Then the following inequalities hold:
[TABLE]
where the constants and are taken from Proposition \ref{prop_G_properties}.
Let us note also the following property of motions of system (5), which follows directly from Proposition 2. Let and let be the motion generated from by and Further, let and let be the motion generated from by and Then can be considered as the motion generated from by the realizations
[TABLE]
In particular, this property allows us to consider step-by-step feedback control procedures for constructing players’ control realizations (see Sect. 6).
3.3 Quality Index
Let be the motion of system (5) generated from an initial position by players’ control realizations and Let quality of this motion be evaluated by the index
[TABLE]
We suppose that the function satisfies the following condition:
()
The function is continuous.
For dynamical system (5) and quality index (9), we consider a zero-sum differential game in which the first player aims to minimize the value of the quality index, and the second player aims to maximize it.
3.4 Non-anticipative Strategies and the Game Value
To define the value of the differential game (5), (9), we consider non-anticipative strategies of the players (see, e.g., (Bardi_Capuzzo-Dolcetta_1997, , Ch. VIII) and the references therein) and introduce the lower and upper values of the game. Note that, in another terminology, such strategies are called quasi-strategies (see, e.g., (Subbotin_Chentsov_1981, , p. 24)) or progressive strategies (see, e.g., (Fleming_Soner_2006, , § XI.4)).
Let be an initial position. By a non-anticipative strategy of the first player, we mean a function with the following property. For any and any second player’s control realizations if the equality is valid for almost every then the corresponding images and satisfy the equality for almost every The lower value of the differential game (5), (9) is defined by
[TABLE]
where is the value of quality index (9) that corresponds to the motion generated from by the second player’s control realization and the first player’s control realization
Similarly, a function is a non-anticipative strategy of the second player if, for any and any such that for almost every we have for almost every where and The upper value of the game is defined by
[TABLE]
If the lower and upper game values coincide for any initial position then we say that the game has the value
[TABLE]
The goal of the paper is to prove that the differential game (5), (9) has the value, and, for any initial position construct the players’ feedback control procedures that guarantee the game value with a given accuracy These results are formulated in Theorem 7.1 (see Sect. 7). The proof of this theorem follows the scheme from (Lukoyanov_Plaksin_2015, , Theorem 2) and is based on the appropriate approximation of the differential game (5), (9). Before describing this approximation, in the next section, we rewrite the considered differential game in another form.
4 Differential Game in a Neutral Type System
Let be the motion of system (5) generated from an initial position by players’ control realizations and Let us consider the function
[TABLE]
Since then, according to the results given in Sect. 2, we have
[TABLE]
Substituting these equalities into Eq. (5), we obtain that, instead of the original differential game (5), (9), one can consider the differential game for the dynamical system
[TABLE]
under the initial condition
[TABLE]
and the quality index
[TABLE]
Furthermore, due to (3), one can rewrite Eq. (13) as follows:
[TABLE]
Note that the right-hand side of Eq. (16) depends explicitly on the history of the derivative for Therefore, in the terminology of the theory of functional differential equations (see, e.g., Bellman_Cooke_1963 ; Hale_Lunel_1993 ; Kolamnovskii_Myshkis_1992 ), Eq. (16) is a functional differential equation of a neutral type. To the best of our knowledge, in the theory of differential games in neutral type systems (see the references in Introduction), there are no results that can be directly applied for studying the game (13), (15), and, therefore, the original game (5), (9) too. However, as it is shown in the next section, the game (13), (15) can be approximated by a differential game in a retarded type system.
5 Approximating Differential Game
Following (Gomoyunov_2018, , Sect. 6), let us approximate in relations (13), (15) the fractional derivative by the divided fractional difference with a step size where (see, e.g., (Samko_Kilbas_Marichev_1993, , p. 385))
[TABLE]
the symbol means the integer part of and are the binomial coefficients. In this section, we study the differential game obtained after this approximation.
5.1 Approximating Dynamical System and Quality Index
Let us fix a vector and a sufficiently small value of the parameter Note that, in what follows, the vector corresponds to an initial position of system (5) such that Taking into account the above, we consider the following zero-sum differential game, determined by these two parameters and We introduce the approximating dynamical system which motion is described by the differential equation
[TABLE]
and the approximating quality index
[TABLE]
Here is the value of the state vector; and are respectively the values of the control vectors of the first and second players. The first player minimizes the value of quality index (19), the second player maximizes it.
Note that, according to (17), at a time the right-hand side of Eq. (18) depends on the values for and, in contrast to (16), does not depend explicitly on the history of the derivative Thus, Eq. (18) is a functional differential equation of a retarded type. In what follows, dealing with the game (18), (19), we mainly use the constructions and results from Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 .
Remark 1
Let us note that, even in a simple case when original quality index (9) is terminal, i.e., for a function the corresponding approximating quality index is still non-terminal, since, according to (17), it depends on the values for
Taking into account (11) and (12), by a position of approximating system (18), we mean a pair such that The set of such positions is denoted by This set is considered with the metric (see, e.g., Lukoyanov_2003 and also (Lukoyanov_2011, , p. 25))
[TABLE]
where and
[TABLE]
By the right-hand side of Eqs. (18), (19), let us define the functions
[TABLE]
where and
Directly from properties ()–() of the functions and it follows that these functions and satisfy the following conditions:
() For any the functions and are continuous uniformly in
() For any and any there exists such that, for any the inequality
[TABLE]
is valid for any satisfying and any
() For any there exists such that, for any the estimate
[TABLE]
holds for any and
() For any and any the function satisfies the saddle point condition in a small game, i.e.,
[TABLE]
for any and
According to (14), if an initial position of original system (5) is given, we define the corresponding initial position of approximating system (18) as follows:
[TABLE]
Due to Proposition 1 and the results given in Sect. 2, the function satisfies the inclusion Taking this into account, we call a position of approximating system (18) admissible if
[TABLE]
where and is the constant from condition (). The set of such admissible positions is denoted by Note that this set is independent on the parameter
Let and As in Sect. 3.2, by admissible control realizations of the players in the approximating game (18), (19), we mean functions and Due to properties ()–(), from the initial position such control realizations and uniquely generate the motion of approximating system (18) that is the function satisfying the initial condition and, together with and satisfying Eq. (18) for almost every
Let us note the following properties of the set Firstly, for any the inclusion is valid for the function defined by (20). Secondly, for the motion of approximating system (18) generated from by and the inclusion holds for any where, according to (8), we denote Finally, the set is a compact subset of
Following the the scheme from (Gomoyunov_2018, , Lemma 2) and taking into account that the constant in Proposition 1 does not depend on an initial position one can prove the result below.
Proposition 3
There exists such that the following statement holds. Let be an initial position of original system Let us consider approximating system for any and under the initial position defined by Then the inclusion is valid for any motion of the approximating system generated from by and
5.2 The Value of the Approximating Game
Let and be fixed. Similarly to Sect. 3.4, in the approximating differential game (18), (19), one can consider non-anticipative strategies of the players and introduce the lower and upper game values, denoted respectively by and From the results of Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 (see also Gomoyunov_Lukoyanov_Plaksin_2017 ) it follows that, under conditions – the approximating game has the value
[TABLE]
and, furthermore, this value can be guaranteed by the players if they use the positional strategies, described in the next section.
5.3 Optimal Positional Strategies
Let and be fixed. In the approximating differential game (5), (9), by the positional strategies and of the players, we mean arbitrary functions
[TABLE]
where is the accuracy parameter.
Let and let
[TABLE]
be a partition of the interval The triple is called a control law of the first player. This law forms in the approximating system a piecewise constant control realization by the following step-by-step feedback rule:
[TABLE]
where Thus, from the initial position the control law of the first player together with a control realization of the second player uniquely generate the motion of the approximating system and, therefore, determine the value of approximating quality index (19).
Similarly, we consider the control law of the second player which forms a piecewise constant control realization as follows:
[TABLE]
From the initial position the control law together with uniquely generate the motion of the approximating system and determine the value of approximating quality index (19).
By the scheme from (Lukoyanov_2000, , Theorem 1) (see also (Lukoyanov_2011, , Theorem 17.1)), one can prove the following lemma (see Gomoyunov_Lukoyanov_Plaksin_2017 for a related technique).
Lemma 1
For any and any in the approximating differential game there exist the players’ optimal positional strategies and that are optimal uniformly in and Namely, for any and any one can choose and such that the following statement holds. Let and let be a partition with the diameter Then the control law of the first player guarantees for the value of approximating quality index the inequality
[TABLE]
for any control realization of the second player and the control law of the second player guarantees for the value of approximating quality index the inequality
[TABLE]
for any control realization of the first player
Note that the uniformness in the parameter is provided by the corresponding uniformness in properties ()–().
Let us describe shortly one of the ways of constructing such optimal strategies and We apply the method of extremal shift to accompanying points (see, e.g., Krasovskii_Krasovskii_1995 ; Krasovskii_1985 and also Lukoyanov_2000 ; Lukoyanov_2011 ). For simplicity of the notation below, it is convenient to consider the so-called pre-strategies of the players in the approximating game (18), (19). Namely, by pre-strategies and of the first and second players, we mean functions
[TABLE]
that, for any and any satisfy the inclusions
[TABLE]
Let and For the first and second players, we choose the accompanying points and from the conditions
[TABLE]
where the minimum and maximum are calculated over the functions such that
[TABLE]
and the constant is chosen by the set in accordance with property (). Note that the minimum and maximum are attained due to continuity of the value function After that, we define
[TABLE]
Remark 2
There are another methods for constructing the optimal positional strategies and (see, e.g., Krasovskii_Subbotin_1988 ; Lukoyanov_2000 ; Lukoyanov_2003 ; Lukoyanov_2011 ). For example, if the value function is coinvariantly smooth, then the method of extremal shift in the direction of the coinvariant gradient of can be applied. In the general non-smooth case, such strategies can be constructed by the extremal shift in direction of the coinvariant gradient of a suitable coinvariantly smooth auxiliary function. Also, one can use the methods based on the notions of maximal - and -stable bridges. Furthermore, there are some specific methods for constructing the optimal strategies in the linear case (see, e.g., Gomoyunov_Lukoyanov_2012 ; Lukoyanov_Reshetova_1998 ).
6 Players’ Control Procedures with a Guide
In this section, we propose the players’ feedback control procedures that use the optimally controlled approximating system (18) as a guide. It allows us to show that the values of the approximating differential games (18), (19) have the limit when This fact constitutes the basis of the proof of the main result of the paper formulated in Theorem 7.1 (see Sect. 7).
6.1 Mutual Aiming Procedures between the Systems
According to (Gomoyunov_2018, , Sect. 7), let us consider the following mutual aiming procedure between original (5) and approximating (18) systems. First of all, let us introduce pre-strategies of the players in the original game (5), (9). By pre-strategies and of the first and second players, we mean functions
[TABLE]
that, for any and any satisfy the inclusions
[TABLE]
Further, for and let us denote
[TABLE]
Let be an initial position of original system (5). Let us fix put and consider the corresponding approximating system (18) under the initial position defined by (20). Let us fix also a partition (21). Let a first player’s control realization in the original system and a second player’s control realization in the approximating system be formed simultaneously according to the following step-by-step feedback rule:
[TABLE]
where
[TABLE]
and is a pre-strategy of the second player in the approximating game.
Lemma 2
For any there exist and such that, for any initial position of original system and any partition with the diameter the following statement is valid. Let us consider approximating system for and under the initial position defined by Then, for any control realizations and if control realizations and are formed according to the mutual aiming procedure (\ref{s_j_1}), then the corresponding motions and of the original and approximating systems satisfy the inequality
[TABLE]
The lemma is proved by the scheme from (Gomoyunov_2018, , Theorem 3), if we take into account that the constants and in Corollary 1 and the constant in Proposition 3 do not depend on an initial position
Similarly, one can consider another mutual aiming procedure between the original and approximating systems. Namely, let and be formed on the basis of the partition as follows:
[TABLE]
where
[TABLE]
By analogy with Lemma 2, we obtain the following result.
Lemma 3
For any there exist and such that, for any initial position of original system and any partition with the diameter the following statement is valid. Let us consider approximating system for and under the initial position defined by Then, for any realizations and if realizations and are formed according to the mutual aiming procedure (\ref{s_j_2}), then the corresponding motions and of the original and approximating systems satisfy inequality (\ref{lem_procedure_1_main}).
6.2 First Player’s Control Procedure with a Guide
Let and a partition (21) be fixed. We propose the following control procedure of the first player in the original differential game (5), (9). Let us consider the approximating differential game (18), (19) for the fixed and with the initial position defined by (20). By the steps of the partition the first player forms a control realization in the original system and, at the same time, control realizations and in the approximating system as follows: and are formed according to the mutual aiming procedure (24), (25), and is formed by the control law (see (22)) on the basis of the optimal strategy taken from Lemma 1. Note that, from the initial position the described control procedure together with uniquely generate the motion of the original system and, therefore, determine the value of quality index (9). Moreover, during this control procedure, the first player generates the auxiliary motion of the approximating system, which can be considered as a guide (see, e.g., (Krasovskii_Subbotin_1988, , § 8.2)). For convenience, in what follows, the described control procedure is referred as
For any let us introduce the function
[TABLE]
where is defined according to (20), and is the value of the approximating differential game (18), (19) for and the fixed
Lemma 4
For any there exist
[TABLE]
such that, for any and any partition with the diameter the first player’s control procedure with a guide guarantees for the value of quality index the inequality
[TABLE]
for any control realization of the second player
Proof
Applying (Gomoyunov_2018, , Proposition 7), by the constant from Proposition 3, one can choose and such that the inequalities
[TABLE]
are valid for any and any Taking the constants and from Corollary 1, we define and consider the compact set consisting of the functions such that
[TABLE]
Let be fixed. Due to (), there exists such that, for any from the inequality it follows that Let us choose and by Lemma 2, and put Finally, for any we take and from Lemma 1, and define
[TABLE]
Let us show that the statement of the lemma is valid for the chosen parameters.
Let and let be a partition (21) with the diameter Let us consider the motion of system (5) generated from the initial position by the first player’s control procedure with a guide and a second player’s control realization Let us consider the corresponding first player’s control realization in the original system and players’ control realizations and in the approximating system (18) for the fixed and with the initial position defined by (20). Let be the corresponding motion of the approximating system. By the definition of the motion is generated by the control law on the basis of the first player’s optimal positional strategy Hence, for the auxiliary function due to the choice of and we obtain
[TABLE]
Moreover, the control realizations and are formed according to the mutual aiming procedure (24), (25). Therefore, according to the choice of and we derive Thus, taking into account the inclusions by the choice of we have
[TABLE]
The lemma is proved.
6.3 Second Player’s Control Procedure with a Guide
Similarly to Sect. 6.2, we propose the following second player’s control procedure with a guide in the original differential game (5), (9). Let and a partition (21) be fixed. Let us consider the approximating differential game (18), (19) for the fixed and with the initial position defined by (20). By the steps of the partition the second player forms a control realization in the original system and, at the same time, control realizations and in the approximating system as follows: and are formed according to the mutual aiming procedure (27), (28), and is formed by the control law (see (23)) on the basis of the optimal strategy taken from Lemma 1. From the initial position the described control procedure together with uniquely generate the motion of the original system and determine the value of quality index (9). In what follows, this control procedure with a guide is referred as
By analogy with Lemma 4, on the basis of Lemma 3, the following result can be proved.
Lemma 5
For any there exist
[TABLE]
such that, for any and any partition with the diameter the second player’s control procedure with a guide guarantees for the value of quality index the inequality
[TABLE]
for any control realization of the first player
6.4 Limit of the Values of the Approximating Games
Considering in the original differential game (5), (9) the case when the both players use the described in Sect. 6.2 and 6.3 control procedures with a guide, we obtain the result below.
Lemma 6
For any initial position the following limit exists:
[TABLE]
where is defined by Moreover, the convergence is uniform in
Proof
By the Cauchy criterion, to prove the lemma, it is sufficient to show that, for any there exists such that, for any and any the inequality below is valid:
[TABLE]
Let be fixed. By Lemmas 4 and 5, for let us choose
[TABLE]
and put Let We define
[TABLE]
Let and be a partition (21) with the diameter Let us consider the motion of system (5) generated by the players’ control procedures with a guide and Then, for the realized value of quality index (9), due to the choice of and we have
[TABLE]
wherefrom we derive (31). The lemma is proved.
7 Value of the Game
The main result of the paper is the following.
Theorem 7.1
Let conditions – be satisfied. Then:
The differential game has the value 2. 2.
*This value coincides with the limit *see of the values of the approximating differential games 3. 3.
For any there exist
[TABLE]
such that the following statement holds. Let and let be a partition with the diameter Then the control procedure with a guide of the first player guarantees for the value of quality index the inequality
[TABLE]
for any control realization of the second player and the control procedure with a guide of the second player guarantees for the value of quality index the inequality
[TABLE]
for any control realization of the first player
Proof
Let be fixed. Let us define
[TABLE]
where and for are chosen as in (32), and is chosen according to Lemma 6 such that, for any and any the inequality below is valid:
[TABLE]
Let and let be a partition with the diameter Let us consider the first player’s control procedure with a guide On the basis of this procedure, we define the first player’s non-anticipative strategy (see Sect. 3.4) as follows. For any we consider the unique motion of system (5) and the control realization that are formed by and and put Further, since, by the choice of and for any the corresponding value of quality index (9) satisfies the inequality
[TABLE]
then, by definition (10) of the lower game value we obtain
[TABLE]
Taking into account that this inequality is valid for any we derive
[TABLE]
Now, arguing by contradiction, let us suppose that
[TABLE]
for a number Let be a first player’s non-anticipative strategy such that, for any the motion of system (5) generated from by and satisfies the inequality
[TABLE]
Similarly to above, based on Lemmas 5 and 6, by the number one can choose and a partition (21) such that the motion of system (5) generated from by the second player’s control procedure with a guide and a first player’s control realization satisfies the inequality
[TABLE]
According to the definition (see Sect 6.3), the control procedure forms by the steps of the partition on the basis of the information about the realized values of the state vectors of the original and approximating systems. Therefore, since is non-anticipative, one can consider the motion generated by and such that and, at the same time, is formed by For this motion we have
[TABLE]
wherefrom we obtain
[TABLE]
The obtained inequality contradicts (36) since Hence, we derive
[TABLE]
The validity of the equality can be established in a similar way. Thus, the first and second parts of the theorem are proved. Inequality (33) in the third part of the theorem follows directly from (35) and (37). The validity of inequality (34) can be shown similarly. The theorem is proved.
Remark 3
Let us note that, following (Krasovskii_Subbotin_1988, , § 8.2) (see also Lukoyanov_Plaksin_2015 for details), one can consider another formalization of the differential game (5), (9). Namely, one can formally describe a sufficiently wide classes of players’ strategies with a guide and introduce the corresponding values of the players’ optimal guaranteed results. One can show that from Theorem 7.1 it follows that these optimal guaranteed results coincide, i.e., the differential game has the value in the classes of strategies with a guide, and this value is equal to Moreover, the players’ strategies with a guide that guarantee inequalities (33) and (34) can be constructed on the basis of the proposed in Sects. 6.2 and 6.3 control procedures. In this sense, these control procedures with a guide can be called optimal.
Remark 4
In addition to Remark 2, another possible way of solving the approximating differential game (18), (19) is to approximate functional differential equation of a retarded type (18) by a high-dimensional system of ordinary differential equations (see, e.g., Lukoyanov_Plaksin_2015_2 and the references therein). Note that this approach can also be used for proving the existence of the game value and constructing the players’ optimal control procedures with a guide in the original differential game (5), (9).
8 Conclusion
In the paper, we have considered a zero-sum differential game in a dynamical system which motion is described by a fractional differential equation. We have proved that the lower and upper game values coincide, i.e., the differential game has the value. The proof is based on the appropriate approximation of the game by a differential game in a dynamical system which motion is described by a first order functional differential equation of a retarded type. This approach has also allowed us to propose the optimal players’ feedback control procedures with a guide, which can be effectively applied if the optimal in the approximating game players’ positional strategies are found.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Bannikov AS (2017) Evasion from pursuers in a problem of group pursuit with fractional derivatives and phase constraints. Vestn Udmurtsk Univ Mat Mekh Komp Nauki 27(3):309–314 (in Russian)
- 2(2) Baranovskaya LV (2015) A method of resolving functions for one class of pursuit problems. East-Eur J Enterp Technol 74(4):4–8 (in Russian)
- 3(3) Bardi M, Capuzzo-Dolcetta I (1997) Optimal control and viscosity solutions of Hamilton-Jacobi-Bellman equations. Birkhäuser, Basel
- 4(4) Başar T, Olsder GJ (1999) Dynamic noncooperative game theory. Second edition, Classics in Applied Mathematics, SIAM, Philadelphia
- 5(5) Bellman R, Cooke KL (1963) Differential-difference equations. Academic Press, London
- 6(6) Cardaliaguet P, Quincampoix M, Saint-Pierre P (2007) Differential games through viability theory: old and recent results. In: Advances in dynamic game theory: numerical methods, algorithms, and applications to ecology and economics, Birkhäuser, Boston, pp. 3–35
- 7(7) Chikrii A, Matychyn I (2011) Riemann–Liouville, Caputo, and sequential fractional derivatives in differential games. In: Advances in dynamic games: theory, applications, and numerical methods for differential and stochastic games, Birkhäuser, Boston, pp. 61–81
- 8(8) Diethelm K (2010) The analysis of fractional differential equations. Springer, Berlin
