Duality and upper bounds in optimal stochastic control governed by partial differential equations
Shinji Tanimoto

TL;DR
This paper introduces a dual control framework for stochastic PDE-governed systems, establishing duality theorems that help bound and identify the optimal control values.
Contribution
It develops a dual control problem for stochastic PDE systems and proves duality theorems linking the original and dual problems, aiding in optimal value estimation.
Findings
Duality theorems relate original and dual problem values.
Dual problem provides upper bounds for the original control problem.
The approach helps in estimating and achieving optimal control values.
Abstract
A dual control problem is presented for the optimal stochastic control of a system governed by partial differential equations. Relationships between the optimal values of the original and the dual problems are investigated and two duality theorems are proved. The dual problem serves to provide upper bounds for the optimal and maximum value of the original one or even to give the optimal value.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications
**Duality and upper bounds in optimal stochastic control
governed by partial differential equations
** **Shinji Tanimoto
** Department of Mathematics, University of Kochi,
Kochi 780-8515, Japan***Former affiliation.
Abstract
A dual control problem is presented for the optimal stochastic control of a system governed by partial differential equations. Relationships between the optimal values of the original and the dual problems are investigated and two duality theorems are proved. The dual problem serves to provide upper bounds for the optimal and maximum value of the original one or even to give the optimal value.
1. Introduction
The original problem (or primal problem) considered is the optimal control of a system governed by a stochastic heat equation that is described in [4], which is a maximization problem. In this paper, to the problem we associate another, called its dual problem, which is in turn a minimization problem. We prove two types of duality theorem.
First we show that solutions of the dual problem provide upper bounds for the maximum of the primal problem. We call this assertion a weak duality theorem. Next, under some conditions related to the maximum principle of control theory, the maximum can be attained by solving the dual problem. Such a property is called a strong duality theorem.
Let and be a bounded and open domain in with boundary . On we consider the following stochastically controlled system. The one-dimensional Brownian motion is defined on a filtered probability space . The state of the system is denoted by , which is controlled by for and at . The control process satisfies , where is a bounded set of , and it is -measurable for all . The state is described by a stochastic heat equation of the form
[TABLE]
The boundary value functions on , and on are real-valued and deterministic. is a second order partial differential operator acting on smooth functions of :
[TABLE]
where is a symmetric nonnegative definite matrix with entries and for . A control process is called admissible if the corresponding solution of Eqs.(1)-(3) is unique and belongs to , where is the Lebesgue measure on . The set of all admissible controls is denoted by ;
[TABLE]
The functions and in (1) are, respectively, and . The expected performance (or payoff) is given by, for each ,
[TABLE]
where . Throughout this paper we impose the following;
(Assumption) is a function that is concave with respect to .
is a bounded continuous function, and denotes the expectation with respect to the probability measure . The aim of the primal problem is to find a maximizing control and such that
[TABLE]
Thus the primal problem is formulated as
[TABLE]
In the next section a dual problem to (5) is proposed. Similar dual control problems were constructed for max-min control problems in [5], for non-well-posed distributed systems in [6] and for optimal stochastic control in [7]. When a primal problem is a minimization problem, its dual problem serves to provide lower bounds for the minimum value of the primal one. Here the primal problem is a maximization problem, its dual problem provides upper bounds for the maximum value. Under some conditions related to the maximum principle of control theory it is also able to attain the maximum.
2. Dual Problem
The adjoint of the differential operator is defined by
[TABLE]
In order to present the dual problem, for each real number , we define a function
[TABLE]
Note that the control variable of the primal problem disappears at this stage.
The dual control problem is the system with performance functional that is to be minimized:
[TABLE]
over all variables and that satisfy:
[TABLE]
The variable plays a role of control process of the dual problem that is a continuous process belonging to . As indicated by the strong duality theorem (Section 4), may be a solution of Eqs.(1)–(3), which indeed becomes a continuous process. Or it can be even a deterministic and continuous variable. Hence the dual problem is more manageable than the primal one. The variable , in turn, represents the state of the dual problem. We denote by the set of all pairs that satisfy Eqs.(8)–(10). So the dual problem is formulated as
[TABLE]
3. Weak Duality Theorem
In this section we show that solutions of the dual problem provide upper bounds for the maximum of problem (5). We call this property a weak duality theorem.
Theorem 1. Under the concavity of the function it follows that
[TABLE]
Proof. Let be an admissible control and let us fix it for the moment. Let be the solution of Eqs.(1)-(3) for and put (see Eq.(4));
[TABLE]
On the other hand, for the same we consider the following expectation, using an arbitrary :
[TABLE]
where is a solution of Eqs.(8)-(10):
[TABLE]
Making use of these fixed and , the difference between and is
[TABLE]
By the concavity of with respect to we have
[TABLE]
from which we have the inequality
[TABLE]
We show that the right-hand side of (13) is equal to zero. From Eq.(8) it follows that
[TABLE]
and that
[TABLE]
On the other hand, since satisfies
[TABLE]
we get by integration of parts ([2])
[TABLE]
where we used and . Since (see Eqs.(3) and (10)) and on , the surface of , the first Green formula ([8, p.258]) implies
[TABLE]
From this equality it follows that
[TABLE]
Upon substituting Eqs.(15), (16) into (14), we see that the right-hand side of Eq.(14) (and (13)) is equal to zero. Hence we can conclude that for each it follows that
[TABLE]
Since is arbitrary, we have
[TABLE]
The optimal value for the primal problem is and it satisfies
[TABLE]
By a well-known inequality of game theory [3], we have
[TABLE]
In view of (6) we see that for each fixed the value is identical to Eq.(7) of the dual problem, that is, , which is to be minimized. Therefore, we obtain
[TABLE]
This proves the weak duality theorem.
The last inequality shows that each provides an upper bound for the primal problem.
4. Strong Duality Theorem
In this sction we assume that a control process satisfies a sort of the maximum principle of optimality such as in [4, Theorem 2.1]. Under the concavity of the function in Eq.(4), it entails the strong duality theorem. More precisely, the corresponding solution of Eqs.(1)-(3) provides an optimal control for the dual problem and there is no duality gap; both extreme values (5) and (11) are exactly equal.
Theorem 2. Suppose is a solution of Eqs.(1)-(3)* for an admissible control , and that , together with this , is a solution of Eqs.(8)-(10). If satisfies*
[TABLE]
the function being defined by (6), then is an optimal control of the primal problem and is that of the dual one. Moreover, there is no duality gap;
[TABLE]
Proof. The proof is similar to that of Theorem 1. Let us put
[TABLE]
On the other hand, using (7) and (18), we have
[TABLE]
We evaluate the difference
[TABLE]
Now it is easy to prove that the difference is equal to zero, using a similar calculation to the right-hand side of (13); . Using Theorem 1 (weak duality), it follows that is an optimal control for the primal problem and that is an optimal pair for the dual one. This completes the proof.
Although our system is simpler than that of [4] and the approach is different from it, Eq.(18) turns out a sufficient optimality condition for the primal problem.
5. Partial Observation Control
In partially observable systems as in [1], it is necessary to consider controls that do not depend on the space variable . We denote the subset of such controls by ;
[TABLE]
The primal problem is to maximize the functional
[TABLE]
over together with satisfying
[TABLE]
The dual system is governed by Eqs.(8)-(10) as before. In order to formulate the dual problem, let us put
[TABLE]
for functions that are solutions of Eqs.(8)-(10). The dual problem is to minimize the functional
[TABLE]
over all and satisfying Eqs.(8)-(10). Note that this type of dual problem takes a more similar form to the one dealt with in [7].
We prove two duality theorems. To do this, let us take an arbitrarily chosen control , and introduce the corresponding functional similar to Eq.(12), while satisfy Eqs.(8)-(10), i.e., . Then we can derive the inequality analogous to (17);
[TABLE]
for all . Among the terms of , those relevant to are and . Hence we divide into two parts: one is
[TABLE]
and the other is
[TABLE]
[TABLE]
Using a measurable selection theorem, Fubini’s theorem and Eq.(19), we see that the expectation (20) can be written as
[TABLE]
This together with (21) yields the functional for which the weak duality theorem holds;
[TABLE]
Next suppose that is a solution of Eqs.(1)-(3) for an admissible control , and that satisfies (averaged maximum condition in [4])
[TABLE]
Then we obtain the equality and hence the strong duality theorem as in Section 4, implying no duality gap
[TABLE]
Moreover, from the weak duality theorem it follows that provides an optimal control for the primal problem, and so does the pair for the dual problem.
References
- [1]
A. Bensoussan, Stochastic Control of Partially Observable Systems, Cambridge Univ. Press, 1992.
- [2]
K. L. Chung and R. J. Williams, Introduction to Stochastic Integration, Second Edition, Birkhäuser, 1990.
- [3]
S. Karlin, Mathematical Methods and Theory in Games, Programming and Economics, Vol. I, Addison-Wesley, 1959.
- [4]
B. Øksendal, Optimal control of stochastic partial differential equations, Stochastic Analysis and Applications, 23, No. 1, 165–179, 2005.
- [5]
S. Tanimoto, A duality theorem for max-min control problems, IEEE Transactions on Automatic Control, AC-27, No. 5, 1129–1131, 1982.
- [6]
S. Tanimoto, Duality in the optimal control of non-well-posed distributed systems, Journal of Mathematical Analysis and Applications, 171, 277–287, 1992.
- [7]
S. Tanimoto, Duality and lower bounds in optimal stochastic control, International Journal of Systems Science, 25, 1365–1372, 1994.
- [8]
J. Wloka, Partial Differential Equations, Cambridge Univ. Press, 1987.
