A stochastic Pontryagin maximum principle on the Sierpinski gasket
Xuan Liu

TL;DR
This paper develops a stochastic Pontryagin maximum principle tailored for control problems on the Sierpinski gasket, addressing unique challenges posed by fractal geometry and measure singularity.
Contribution
It introduces a novel stochastic maximum principle for fractal spaces, incorporating two necessity equations due to measure singularity, expanding control theory on complex geometries.
Findings
Derived an order comparison lemma using heat kernel estimates.
Established a Pontryagin maximum principle with two necessity equations.
Analyzed linear regulator problems on the gasket.
Abstract
In this paper, we consider stochastic control problems on the Sierpinski gasket. An order comparison lemma is derived using heat kernel estimate for Brownian motion on the gasket. Using the order comparison lemma and techniques of BSDEs, we establish a Pontryagin stochastic maximum principle for these control problems. It turns out that the stochastic maximum principle on the Sierpinski gasket involves two necessity equations in contrast to its counterpart on Euclidean spaces. This effect is due to singularity between the Hausdorff measure and the energy dominant measure on the gasket, which is a common feature shared by many fractal spaces. The linear regulator problems on the gasket is also considered as an example.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Dynamics and Fractals · Complex Systems and Time Series Analysis · Mathematical and Theoretical Analysis
A stochastic Pontryagin maximum principle on the Sierpinski gasket
Xuan Liu Nomura International, 30/FL Two International Finance Centre, Hong Kong. Email: [email protected].
This research was carried out when the author was reading DPhil in Mathematics at the University of Oxford.
Abstract
In this paper, we consider stochastic control problems on the Sierpinski gasket. An order comparison lemma is derived using heat kernel estimate for Brownian motion on the gasket. Using the order comparison lemma and techniques of BSDEs, we establish a Pontryagin stochastic maximum principle for these control problems. It turns out that the stochastic maximum principle on the Sierpinski gasket involves two necessity equations in contrast to its counterpart on Euclidean spaces. This effect is due to singularity between the Hausdorff measure and the energy dominant measure on the gasket, which is a common feature shared by many fractal spaces. The linear regulator problems on the gasket is also considered as an example.
1 Introduction
Recently, to study non-linear analysis on the Sierpinski gasket, [7] developed a theory of backward stochastic differential equations (BSDEs) on the Sierpinski gasket. BSDEs and related stochastic analysis on fractals, though initially considered as efficient tools to treat quasi-linear parabolic PDEs on fractals, also have interests on their own from a mathematical finance point of view. Several interesting mathematical finance problems are formulated as stochastic control problems on Euclidean spaces, which are based upon the assumption that uncertainties in financial models are sourced from Brownian filtration on Euclidean spaces. However, it had been widely observed from the real data that many financial time series exhibit fractal behaviours (see, for example, [1, 8, 2] and etc.), which suggests the possibility that uncertainties in the markets might come from filtrations exhibiting fractal structures. Therefore, it is of significance to consider stochastic control problems for controlled systems with noise coming from filtrations determined by the diffusions on fractals.
The motivation of this paper is to establish a stochastic Pontryagin maximum principle for stochastic control problems on the Sierpinski gasket, with uncertainties in the controlled dynamic systems generated by the diffusion on the gasket. It turns out that, in contrast to its counterpart on Euclidean spaces, the stochastic maximum principle on the gasket consists of two necessity equations rather than a single one (see [9] and [10, Section 3.2]). As we shall see, this is due to the singularity between two measures which are both necessary for analysis on fractals.
This paper is organized as follows. In Section 2, we introduce notations which will be enforced throughout this paper, and review some related results in literature. The main results of this paper is formulated and collected in Section 3. Section 4 is devoted to the proof of the stochastic maximum principle on the Sierpinski gasket. The linear regulator problem on the gasket is considered in Section 5 as an example. Though results of this paper are established for two-dimensional Sierpinski gasket, we however believe that our results also hold for higher-dimensional cases, where argument in this paper should remain valid.
2 Notations and related results
In this section, we introduce notations which will be enforced throughout this paper. We also review several results in literature needed in the following sections.
Let with , and be the contraction mappings given by Define inductively by , and . The (two-dimensional) Sierpinski gasket is defined to be the closure of in .
For a given set , we denote by the space of all real-valued functions on . The standard Dirichlet form on the Sierpinski gasket is defined by
[TABLE]
where the forms are given by
[TABLE]
Let be the Hausdorff measure on with weight , that is, is the unique Borel probability measure on such that for each and each . Then the form is a regular Dirichlet form on , and is the corresponding Dirichlet space.
The Kusuoka measure on is defined by , where is the energy measures of the harmonic function with boundary value , which is the unique minimizer of .
According to the general theory of Dirichlet forms and Markov processes (see [4, Chapter 7]), associated to the form there exists a standard Hunt process \mathbf{M}=\big{(}\Omega,\mathcal{F},\{X_{t}\}_{t\in[0,\infty]},\allowbreak\{\mathbb{P}_{x}\}_{x\in\mathbb{S}\cup\{\Delta\}}\big{)} with state space , where is the “cemetery” of . The process is called *Brownian motion *on . The semigroup of will be denoted by .
Let be the family of all Borel probability measures on . For each , the probability measure on is defined by . The expectation with respect to will be denoted by . Let , the -completion of in , and the minimal completed admissible filtration (cf. [4, p. 385]) of , that is, .
We end this section with a review on the representing martingale on the Sierpinski gasket. The following result was first shown in [6, Theorem (5.4)] (see also [7, Theorem 2.6]).
Theorem 2.1**.**
There exists a martingale additive functional satisfying the following:
(i) has as its energy measure;
(ii) For any , there exists a unique such that
[TABLE]
where is the martingale part of .
The martingale additive functional given by (2.1) is called the Brownian martingale on . The following result on the singularity between the Lebesgue-Stieltjes measure induced by and the Lebesgue measure on was proved in [7, Lemma 4.10].
Lemma 2.2**.**
The Lebesgue-Stieltjes measure is singular to the Lebesgue measure on .
The following lemma, which is shown in [7, Lemma 4.11], gives the exponential integrability of .
Lemma 2.3**.**
For each and ,
[TABLE]
where is a universal constant.
3 Formulation of the main result
Let satisfy . Let the decision space be a separable metric space. Let be Borel measurable functions. For any valued progressively measurable process , we introduce the *cost functional *
[TABLE]
for the controlled system of which the dynamics is given by the following SDE on \big{(}\Omega,\mathcal{F},\{\mathcal{F}_{t}^{\lambda}\}_{t\geq 0},\mathbb{P}_{\lambda}\big{)}:
[TABLE]
where are Borel measurable functions, and .111The existence and uniqueness of solutions to (3.2) can be easily shown by an a priori estimate similar to [7, eqn. (3.8), p. 8].
Definition 3.1**.**
Denote by the family of all valued processes such that
[TABLE]
where is the controlled process given by (3.2). Any is called an admissible control, and is called an admissible pair.
We consider the following optimization problem
[TABLE]
subject to the controlled dynamics (3.2). To formulate our result, we shall need the following definition.
Definition 3.2**.**
We define the measure on to be
[TABLE]
and the measure to be the unique measure on the optional field222That is, the field on generated by the family of all right continuous left limit processes. on such that
[TABLE]
for any stopping times with , where .
Remark 3.3*.*
By and Lemma 2.2, the measures and are mutually singular.
Theorem 3.4**.**
*Let be absolutely continuous with respect to . Assume that:
(A.1)*
[TABLE]
*for , and
(A.2)*
[TABLE]
for , where is a constant.
Suppose that is a solution to (P). Let and be the solutions of the adjoint equations
[TABLE]
and
[TABLE]
and let be the Hamiltonians defined by
[TABLE]
[TABLE]
Then
[TABLE]
Remark 3.5*.*
(i) Notice that the assumptions (A.1) and (A.2) imply that are uniformly bounded for . Indeed, the assumption (A.1) implies the uniform boundedness of . The boundedness of for can also be deduced from (A.1) with , which together with (A.2) implies the uniform boundedness of . Similarly, is also uniformly bounded.
(ii) The adjoint equations (3.6) and (3.7) are introduced in order to reduce the general case with a non-trivial in the cost functional to the one without an term. In other words, it transforms the cost at terminal time into a cumulative cost over the interval . This can be seen more clearly from the proof of Theorem 3.4.
4 Proof of the stochastic maximum principle
In this section, we prove Theorem 3.4 for the optimization problem (P) on the Sierpinski gasket. Our argument is based on the idea of approximation and duality used in the paper [9] and the monograph [10] for classical Euclidean setting, while overcoming some difficulties concerning the driver martingale on the Sierpinski gasket. More specifically, as we shall see, a crucial ingredient of our argument is an order comparison lemma (Lemma 4.2), which is needed for stochastic Taylor expansions. Another technical lemma crucial to the proof of Theorem 3.4 is Lemma 4.6, which gives the orders of approximation errors.
Definition 4.1**.**
Let and be a progressively measurable set. For each , we denote
[TABLE]
Clearly, the map is a Borel measure on , where is the one-dimensional Lebesgue measure. We denote by the completion of with respect to the measure .
Lemma 4.2**.**
Let , and be a progressively measurable set. Let be a family of -measurable subsets of such that . Then, for some universal constant ,
[TABLE]
In particular,
[TABLE]
for all and .
Proof.
Let . Then, for each , is a bounded progressively measurable process. Clearly, we have the following iterated integral representation
[TABLE]
Since is progressively measurable, we have . Therefore, by (4.3) and the tower property,
[TABLE]
Recall that . By [7, Lemma 4.17], we have
[TABLE]
where, for any Borel measure on ,
[TABLE]
with being the transition kernel of , which is jointly continuous on . By [5, Theorem 5.3.1], there exists a universal constant such that
[TABLE]
where is the spectral dimension of . Therefore,
[TABLE]
For with , by (4.5),
[TABLE]
Hence, by (4.4),
[TABLE]
By (4.3) again, we conclude that
[TABLE]
which is (4.1).
When is an integer, the asymptotic (4.2) is a direct corollary of (4.1). For real-valued , the conclusion follows easily from interpolation
[TABLE]
∎
Remark 4.3*.*
The order estimate (4.1) implies that , which is quite sharp. In fact, since the heat kernel estimate (4.6) is two-sided, by [7, Lemma 4.17], we have that
[TABLE]
Therefore, by (4.1),
[TABLE]
Notice that the reverse of the above inequality is a direct consequence of Hölder’s inequality. Therefore, we see that, up to a multiplicative constant,
[TABLE]
We shall also need the following estimate for solutions of linear SDEs driven by the Brownian martingale .
Lemma 4.4**.**
Let be progressively measurable processes, and be a predictable process. Let be the solution to the SDE
[TABLE]
Suppose that
[TABLE]
where is a constant. Then, for each and each ,
[TABLE]
where is a constant depending only on ,
[TABLE]
for any and any progressively measurable process , and
[TABLE]
for a sufficiently large constant depending only on (e.g. will suffice). Therefore,
[TABLE]
Remark 4.5*.*
From now on, for the ease of notation, we shall use the same notation to denote with possibly different constants depending only on and the norms of coefficients of SDEs.
Proof.
To simplify notation, we shall denote for any . By Itô’s formula,
[TABLE]
Denote . Then
[TABLE]
By the Burkholder–Davis–Gundy inequality,
[TABLE]
where is a universal constant. Choosing and sufficiently small gives
[TABLE]
where denotes a constant depending only on . Since , (4.10) follows easily from the above and (4.11) and Young’s inequality. Notice that, in the above inequality, we have used the fact that (or alternatively an localization argument together with a.s.), which can be shown by an iteration argument similar to the proof of [7, Theorem 3.10]. ∎
We now turn to the derivation of the stochastic maximum principle. Suppose that is a minimizer of (P), and is the corresponding controlled process. Let be an arbitrary family of -measurable subsets of such that .
Let be disjoint optional sets such that is supported on and on . An example of such is , , where
[TABLE]
are the Radon–Nikodym derivatives with respect to the optional -field on . For arbitrary , let
[TABLE]
Let
[TABLE]
Then is progressively measurable. Notice that if , then for all .
We denote by the controlled process corresponding to , and let
[TABLE]
Define the first-order approximating process by
[TABLE]
and the second-order approximating process by
[TABLE]
where, for any function , we denote
[TABLE]
Clearly, . We shall need the following estimates.
Lemma 4.6**.**
Let be the progressively measurable set defined by (4.12). Then, for each , as ,
[TABLE]
Proof.
We only present the proof of (4.16) and (4.19), since the proof of (4.17) is similar to that of (4.16), while the proof of (4.18) and (4.20) are similar to that of (4.19). The difference between the proof of (4.16) and (4.19) is that the SDE for involves as bias terms in Lemma 4.4 (see (4.22) below), which requires further estimate. This is also the case for and , and hence their estimates are similar to that of .
For any function , denote
[TABLE]
By (3.2),
[TABLE]
Let . Then . By Lemma 4.4,
[TABLE]
where denotes a constant depending only on , but might be different at various appearances. We should point out that we explicitly include the term in the last equation to reflect the fact that \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}|\delta b_{1}(t)|dt\big{)}^{2k}\big{]}=0 whenever . The appearances of in other estimates are out of the same purpose. The above estimate completes the proof of (4.16). The proof of (4.17) is similar.
We now turn to the proof of (4.19). By the definition of , we have
[TABLE]
Let , and
[TABLE]
Then, by (4.21) and the fact that for , we have
[TABLE]
In order to apply Lemma 4.4, since the desired estimates involving follow directly from definition, we need to estimate \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)\,dt\big{)}^{2k}\big{]}, \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)\,d\langle W\rangle_{t}\big{)}^{2k}\big{]} and \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)^{2}\,d\langle W\rangle_{t}\big{)}^{k}\big{]}, where
[TABLE]
We first estimate \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)\,dt\big{)}^{2k}\big{]}. Notice that, by Lemma 2.3 and (4.16),
[TABLE]
Moreover, for any ,
[TABLE]
which implies that
[TABLE]
Therefore,
[TABLE]
Next, we estimate \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)\,d\langle W\rangle_{t}\big{)}^{2k}\big{]}. For any , by (4.16),
[TABLE]
which, in view of the fact that , implies that
[TABLE]
Moreover, by Lemma 2.3 again,
[TABLE]
which, by (4.16), implies that
[TABLE]
Therefore,
[TABLE]
We now estimate \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{1}(t)^{2}\,d\langle W\rangle_{t}\big{)}^{k}\big{]}. Similarly to the above, for any ,
[TABLE]
which implies that
[TABLE]
Moreover, for any , by Young’s inequality,
[TABLE]
which, together with (4.16), implies that
[TABLE]
Hence,
[TABLE]
With the estimates (4.24)–(4.27), we are now in a position to apply Lemma 4.4 and deduce (4.19). The proof of (4.18) is similar to that of (4.19), except that in the derivation, we need to use both (4.16), (4.17), and (4.19).
The proof of (4.20) is also similar in essence to that of (4.19). By a second order Taylor expansion, it is not difficult to see that, for ,
[TABLE]
Therefore,
[TABLE]
for , and
[TABLE]
Let , and
[TABLE]
Then, by substituting (4.28) and (4.29) into the SDE of , we have
[TABLE]
In view of (4.24) and (4.25), in order to apply Lemma 4.4, it suffices to estimate \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{2}(t)\,dt\big{)}^{2k}\big{]}, \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{2}(t)\,d\langle W\rangle_{t}\big{)}^{2k}\big{]} and \mathbb{E}_{\lambda}\big{[}\big{(}\int_{0}^{T}\chi_{2}(t)^{2}\,d\langle W\rangle_{t}\big{)}^{k}\big{]}, which can be done similarly to those of in the above using the established estimates (4.16), (4.17), and (4.19). ∎
Proof of Theorem 3.4.
Let be the progressively measurable set defined by (4.12). By definition of , we have
[TABLE]
Notice that we have the following approximations
[TABLE]
The approximation (4.30) follows directly from (4.20). The approximation (4.31) follows from together with (4.16), (4.17), and (4.19) in Lemma 4.6. For (4.32), in view of and the boundedness of , we have
[TABLE]
By and Lemma 4.6, it is easily seen that
[TABLE]
which yields the approximation (4.32). Therefore,
[TABLE]
Next, we transform the cost into a cumulative one. By (3.6),
[TABLE]
Notice that the last integral term \mathbb{E}_{\lambda}\big{(}\int_{0}^{T}\delta(\partial_{x}\sigma)(t)q(t)y^{\epsilon}(t)d\langle W\rangle_{t}\big{)} in the above is also of order \mathfrak{M}_{1}(E)\,\mathrm{o}(|I_{\epsilon}|)+\mathrm{o}\big{(}m_{1,\lambda}(I_{\epsilon};E)\big{)}. To see this, by [7, Theorem 3.5], \mathbb{E}_{\lambda}\big{(}\int_{0}^{T}q(t)^{2}\mathrm{e}_{t}d\langle W\rangle_{t}\big{)} is bounded. Therefore, for any , in view of and Lemma 2.3,
[TABLE]
Hence, the equality (4.34) can be further written as
[TABLE]
Also, we transform \mathbb{E}_{\lambda}[\partial_{x}^{2}h\big{(}\bar{x}(T)\big{)}y^{\epsilon}(T)^{2}] into a cumulative cost. By (3.7),
[TABLE]
Similar to before, it can be shown that the term \mathbb{E}_{\lambda}\big{(}\int_{0}^{T}[2\partial_{x}\sigma(t)P(t)+Q(t)]\delta\sigma(t)y^{\epsilon}(t)\,d\langle W\rangle_{t}\big{)} is of order \mathfrak{M}_{1}(E)\,\mathrm{o}(|I_{\epsilon}|)+\mathrm{o}\big{(}m_{1,\lambda}(I_{\epsilon};E)\big{)}. Therefore,
[TABLE]
Combining (4.33), (4.35), and (4.36), we arrive at
[TABLE]
We now show that the optimality of and (4.37) implies that
[TABLE]
By separability of and the continuity of in , there exist progressively measurable processes and such that
[TABLE]
We first set . Then , and therefore . Moreover, (4.37) reduces to
[TABLE]
which clearly implies the first inequality in (4.38).
We now turn to the proof of the second inequality in (4.38). For any , let
[TABLE]
Set . Then and . Therefore, (4.37) reduces to
[TABLE]
By the definition of and , we have
[TABLE]
Therefore,
[TABLE]
which clearly implies
[TABLE]
Therefore, \mathfrak{M}_{2}(E_{a})=\mathbb{E}_{\lambda}\big{(}\int_{0}^{T}1_{E_{a}}(t,\omega)d\langle W\rangle_{t}\big{)}=0 in view of the arbitrariness of . This completes the proof. ∎
5 An example: linear regulator problem
Let with and , and take as the decision space . We consider the following linear regulator problem, which has wide applications in mathematical finance and engineering (see [3, p. 23] and references therein):
[TABLE]
with
[TABLE]
Suppose that is an optimal pair of the problem (5.1). The adjoint equations are
[TABLE]
[TABLE]
Clearly, is the solution to (5.4).
The Hamiltonians are
[TABLE]
Let be the measures on given by (3.4) and (3.5), and . By Theorem 3.4,
[TABLE]
and
[TABLE]
which implies that
[TABLE]
It follows from the above and (5.3) that
[TABLE]
Therefore, is given by the system
[TABLE]
where is given by (5.5). Note that, compared to BSDEs, the system (5.6) takes the random variable as a part of its solution so that the additional condition is satisfied. Therefore, (5.6) is not a simple SDE or BSDE but a forward–backward type SDE.
We now look for a solution to the form , where is a process of the form
[TABLE]
By Itô’s formula,
[TABLE]
Comparing the above with (5.6) gives that
[TABLE]
[TABLE]
Therefore, and Furthermore, is given by the BSDE
[TABLE]
of which a unique solution exists (cf. [7, Theorem 3.10]).
For the moment, let us assume that . Then by , we have that and
[TABLE]
The optimal pair is given by
[TABLE]
and
[TABLE]
where and are given by (5.7) and (5.8).
It remains to show that . Let
[TABLE]
By Itô’s formula,
[TABLE]
Therefore,
[TABLE]
which implies that is a martingale. Therefore,
[TABLE]
which gives that
[TABLE]
This, together with the fact that , shows that .
Acknowledgements
The author would like to thank the anonymous referees for providing many useful comments and suggestions, which helps improve the quality of the current paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ausloos et al., [2002] Ausloos, M., Ivanova, K., and Vandewalle, N. (2002). Crashes: Symptoms, diagnoses and remedies. In Empirical Science of Financial Fluctuations , pages 62–76. Springer.
- 2Dong and Wang, [2014] Dong, Y. and Wang, J. (2014). Complex system analysis of market return percolation model on Sierpinski carpet lattice fractal. Journal of Systems Science and Complexity , 27(4):743–759.
- 3Fleming and Rishel, [2012] Fleming, W. H. and Rishel, R. W. (2012). Deterministic and Stochastic Optimal Control , volume 1. Springer Science & Business Media.
- 4Fukushima et al., [2010] Fukushima, M., Oshima, Y., and Takeda, M. (2010). Dirichlet Forms and Symmetric Markov Processes , volume 19 of De Gruyter Studies in Mathematics . De Gruyter, 2nd edition.
- 5Kigami, [2001] Kigami, J. (2001). Analysis on Fractals , volume 143 of Cambridge Tracts in Mathematics . Cambridge University Press.
- 6Kusuoka, [1989] Kusuoka, S. (1989). Dirichlet forms on fractals and products of random matrices. Publications of the Research Institute for Mathematical Sciences , 25(4):659–680.
- 7Liu and Qian, [2017] Liu, X. and Qian, Z. (2017). Backward problems for stochastic differential equations on the Sierpinski gasket. Stochastic Processes and Their Applications .
- 8Mandelbrot, [2013] Mandelbrot, B. B. (2013). Fractals and Scaling in Finance: Discontinuity, Concentration, Risk. Selecta Volume E . Springer Science & Business Media.
